Date & Time:
March 31, 2022 2:00 pm – 3:00 pm
Crerar 390, 5730 S. Ellis Ave., Chicago, IL,
03/31/2022 02:00 PM 03/31/2022 03:00 PM America/Chicago Udit Gupta (Harvard) – Faster, Smarter, and Greener Systems for Data-Center Scale AI Crerar 390, 5730 S. Ellis Ave., Chicago, IL,

Watch Via Live Stream

The modern Internet is driven by AI-centric services that determine how we interact with technology and society on a daily basis. The exponential rise in AI is largely fueled by the design, development, and deployment of domain-specific software and hardware that have yielded orders of magnitude improvements for deep learning. Despite these efforts, this talk focuses on an important, yet under-studied area: systems for deep learning-based personalized recommendation. Personalized recommendations form the backbone of our interaction with the Internet including search, e-commerce, streaming, and social media. Systems play a crucial role in enabling accurate, efficient, and sustainable recommendation engines.

In this talk I show how modern deep learning-based personalized recommendation engines not only consume the majority of AI training and inference cycles in production data centers, but also introduce unique system design challenges to efficient execution. To tackle these challenges, I design solutions across the software and hardware stack to optimize inference efficiency by jointly considering application-level characteristics, unique neural network model architectures, data-center scale implications, and the underlying hardware. Given the rapidly growing infrastructure demands posed by AI and recommendation engines, my work highlights that systems must go beyond performance, power, and energy efficiency to consider environmental footprint as a first order design target to enable sustainable computing. Finally, I chart paths to designing future systems that enable emerging AI-driven applications by balancing performance, efficiency, sustainability, and privacy.


Udit Gupta

PhD Student, Harvard University

Udit Gupta is a PhD student at Harvard University and visiting research scientist at Facebook AI Research. His research interests focus on enabling next-generation responsible AI platforms by designing novel computer systems and hardware. His recent work focuses on the optimization of data center-scale deep learning-based personalized recommendation engines (HPCA 2020, ISCA 2020, MICRO 2021, ASPLOS 2021) and enabling sustainable computing by considering the environmental impact of end-to-end hardware life cycles (HPCA 2021, MLSys 2022). Udit’s work has been evaluated at-scale in production data centers and incorporated into standardized benchmarks and infrastructure used by the research community. His research has been recognized as an IEEE MICRO Top Picks honorable mention in 2020 and received an IEEE MICRO Top Picks award in 2021, as well as nominated for best paper at PACT 2019 and DAC 2018. In addition to research, Udit is passionate about building interdisciplinary communities. He has co-founded the PeRSonAl (personalized recommendation systems and algorithms) workshop and CLEAR (computing landscapes with environmental accountability and responsibility) workshops co-located at systems and machine learning conferences like ASPLOS, ISCA, and MLSys. He is also the co-chair of the Computer Architecture Student Association.

Related News & Events

UChicago CS News

NeurIPS 2023 Award-winning paper by DSI Faculty Bo Li, DecodingTrust, provides a comprehensive framework for assessing trustworthiness of GPT models

Feb 01, 2024

“Machine Learning Foundations Accelerate Innovation and Promote Trustworthiness” by Rebecca Willett

Jan 26, 2024

Nightshade: Data Poisoning to Fight Generative AI with Ben Zhao

Jan 23, 2024
UChicago CS News

UChicago Undergrad Analyzes Machine Learning Models Used By CPD, Uncovers Lack of Transparency About Data Usage

Oct 31, 2023
UChicago CS News

Five UChicago CS students named to Siebel Scholars Class of 2024

Oct 02, 2023
In the News

In The News: U.N. Officials Urge Regulation of Artificial Intelligence

"Security Council members said they feared that a new technology might prove a major threat to world peace."
Jul 27, 2023
UChicago CS News

UChicago Computer Scientists Bring in Generative Neural Networks to Stop Real-Time Video From Lagging

Jun 29, 2023
UChicago CS News

UChicago Team Wins The NIH Long COVID Computational Challenge

Jun 28, 2023
UChicago CS News

UChicago Assistant Professor Raul Castro Fernandez Receives 2023 ACM SIGMOD Test-of-Time Award

Jun 27, 2023
Michael Franklin
UChicago CS News

Mike Franklin, Dan Nicolae Receive 2023 Arthur L. Kelly Faculty Prize

Jun 02, 2023
UChicago CS News

PhD Student Kevin Bryson Receives NSF Graduate Research Fellowship to Create Equitable Algorithmic Data Tools

Apr 14, 2023
UChicago CS News

Computer Science Displays Catch Attention at MSI’s Annual Robot Block Party

Apr 07, 2023
arrow-down-largearrow-left-largearrow-right-large-greyarrow-right-large-yellowarrow-right-largearrow-right-smallbutton-arrowclosedocumentfacebookfacet-arrow-down-whitefacet-arrow-downPage 1CheckedCheckedicon-apple-t5backgroundLayer 1icon-google-t5icon-office365-t5icon-outlook-t5backgroundLayer 1icon-outlookcom-t5backgroundLayer 1icon-yahoo-t5backgroundLayer 1internal-yellowinternalintranetlinkedinlinkoutpauseplaypresentationsearch-bluesearchshareslider-arrow-nextslider-arrow-prevtwittervideoyoutube