Chengcheng Wan (UChicago) – ALERT: Accurate Anytime Learning for Energy and Timeliness

Date & Time:

September 27, 2019 2:00 pm – 3:00 pm

Location:

Crerar 298, 5730 S. Ellis Ave., Chicago, IL

ALERT: Accurate Anytime Learning for Energy and Timeliness

An increasing number of software applications incorporate runtime Deep Neural Network (DNN) inference because of its high accuracy in many problem domains. While much prior work has separately tackled the problems of improving DNN-inference accuracy and improving DNN-inference efficiency, an important problem remains under-explored: disciplined methods for dynamically managing application-specific latency, accuracy, and energy tradeoffs and constraints at run time. To address this need, we propose ALERT, a co-designed combination of a runtime system and a DNN nesting technique. The runtime takes latency, accuracy, and energy constraints, and uses dynamic feedback to predict the best DNN model and system power-limit setting. The DNN nesting creates a type of flexible network that efficiently delivers a series of results with increasing accuracy as time goes on. These two parts complement each other well: the runtime is aware of the tradeoffs of different DNN settings, and the nested DNNs' flexibility allows the runtime prediction to satisfy application requirements even in unpredictable, changing environments. On real systems for both image and speech tasks, ALERT achieves close-to-optimal results. Compared with the optimal static DNN-model and power-limit setting, which is impractical to predict, ALERT achieves a harmonic-mean energy saving of 33% while satisfying accuracy constraints, and reduces image-classification error rate by 58% and sentence-prediction perplexity by 52% while satisfying energy constraints.
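To make the runtime's selection problem concrete, here is a minimal Python sketch of choosing a (DNN model, power limit) pair under latency and accuracy constraints while minimizing energy. It is an illustration under stated assumptions, not ALERT's actual algorithm: the `Config` profile table, the power-cap-times-latency energy estimate, and the moving-average `update_slowdown` feedback rule are all hypothetical, and the sketch does not model the nested (anytime) DNN.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass(frozen=True)
class Config:
    """One candidate setting: a DNN model paired with a power limit.

    All fields are hypothetical profiled values, not measurements from ALERT.
    """
    model: str
    power_limit_w: float  # system power cap, in watts
    latency_s: float      # profiled inference latency under this cap, in seconds
    accuracy: float       # profiled accuracy of the model, in [0, 1]


def pick_min_energy(configs: Sequence[Config], deadline_s: float,
                    min_accuracy: float, slowdown: float = 1.0) -> Optional[Config]:
    """Return the feasible config with the lowest estimated energy, or None.

    `slowdown` scales profiled latency to reflect the current environment.
    Energy is approximated as power cap * predicted latency (an assumption).
    """
    best, best_energy = None, float("inf")
    for c in configs:
        predicted_latency = c.latency_s * slowdown
        if predicted_latency > deadline_s or c.accuracy < min_accuracy:
            continue  # violates the latency or accuracy constraint
        energy = c.power_limit_w * predicted_latency
        if energy < best_energy:
            best, best_energy = c, energy
    return best


def update_slowdown(previous: float, observed_s: float, profiled_s: float,
                    alpha: float = 0.5) -> float:
    """Blend the latest observed/profiled latency ratio into the running estimate."""
    return (1 - alpha) * previous + alpha * (observed_s / profiled_s)


# Hypothetical profile table for two models at two power caps.
CONFIGS = [
    Config("small", 20.0, 0.040, 0.88),
    Config("small", 40.0, 0.025, 0.88),
    Config("large", 20.0, 0.120, 0.94),
    Config("large", 40.0, 0.070, 0.94),
]

if __name__ == "__main__":
    slowdown = 1.0
    choice = pick_min_energy(CONFIGS, deadline_s=0.1, min_accuracy=0.90, slowdown=slowdown)
    print(choice)
    # Suppose the chosen inference ran slower than profiled; feed that back.
    slowdown = update_slowdown(slowdown, observed_s=0.105, profiled_s=0.070)
    print(pick_min_energy(CONFIGS, deadline_s=0.1, min_accuracy=0.90, slowdown=slowdown))
```

When no configuration meets both constraints, the function returns `None`; a nested DNN of the kind the abstract describes could instead fall back to an earlier, less accurate result at that point.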

Chengcheng Wan

M.S. Candidate, University of Chicago

Chengcheng's advisor is Prof. Shan Lu
