Muhammad Shahbaz (Stanford) – Fast and Flexible Next-Generation Data Centers

Date & Time:

February 18, 2020 10:30 am – 11:30 am

Location:

Crerar 298, 5730 S. Ellis Ave., Chicago, IL,

02/18/2020 10:30 AM 02/18/2020 11:30 AM America/Chicago Muhammad Shahbaz (Stanford) – Fast and Flexible Next-Generation Data Centers Crerar 298, 5730 S. Ellis Ave., Chicago, IL,

Fast and Flexible Next-Generation Data Centers

Modern and future data centers must be fast and flexible: fast (i.e., high throughput and low latency), to meet users' quality of service expectations; and flexible (i.e., customizing compute and switching behavior with ease), to let operators manage and run their data centers effectively and efficiently. However, recent trends (i.e., the decline of Moore's Law and the rise of Big Data) are forcing us to rethink how we build these data centers. CPUs—hosting systems—must now repurpose resources (like caches and out-of-order execution) needed for general-purpose processing, to logic tailored to a particular application domain (e.g., machine learning). And, switches—running networks—must now take on more responsibilities (e.g., congestion control and packet scheduling) to further offload CPUs; they can no longer be fixed-function, line-rate devices for packet forwarding only. The concern, however, is that such efforts will cause systems to sacrifice flexibility for higher performance, and networks to sacrifice performance for more flexibility.

In this talk, I will address the latter concern and show how exposing the right domain-specific abstractions (Pisces) and hardware primitives (Elmo) for networks can scale datacenter services, like group communication (i.e., multicast), more flexibly with negligible (or zero) loss in performance. With Pisces, we let network operators modify the behavior of end-host software switches using a high-level language (called P4) without compromising performance; we can use it to offload state (e.g., multicast) from hardware switches and efficiently encode it inside packets. With Elmo, we introduce new primitives for hardware switches that can read and act on the encoded state from within packets and forward them in the network. Together, they allow network operators to scale datacenter services, like multicast to millions of groups, with ease while operating at line-rate.

I will conclude by discussing ongoing and future work on abstractions and architectures for serverless systems and self-driving networks, and how such architectures can interoperate to materialize fast and flexible next-generation data centers.

Host: Nick Feamster

Muhammad Shahbaz

Postdoctoral Research Scholar, Stanford University

Muhammad Shahbaz is a Postdoctoral Research Scholar in the Department of Electrical Engineering at Stanford University. His research focuses on the design and development of domain-specific languages, compilers, and architectures and their applications to emerging workloads (including machine learning). Shahbaz received his Ph.D. and M.A. in Computer Science from Princeton University and B.E. in Computer Engineering from the National University of Sciences and Technology (NUST). Before joining Princeton University, Shahbaz worked as a Research Assistant at the University of Cambridge, Computer Laboratory, and was a Senior Design Engineer at the Center for Advanced Research in Engineering (CARE). In the past, Shahbaz has built open-source systems like PISCES, SDX, and NetFPGA-10G that are widely used in industry and academia. He has earned (and shared) awards, including Internet2 Innovation Award, ACM SOSR Systems Award, Facebook Research Award, APNet Best Paper Award, and Outstanding Graduate Teaching Assistant Award.

Resources

Community

Non-Unital Noise Adds a New Wrinkle to the Quantum Supremacy Debate

The Science of Computer Security: An Interview with Grant Ho, Assistant Professor in Computer Science

Four Students Receive Honorable Mention in CRA Undergraduate Research Awards

Moon Duchin (Tufts University) – Design for Democracy

“Machine Learning Foundations Accelerate Innovation and Promote Trustworthiness” by Rebecca Willett

Nightshade: Data Poisoning to Fight Generative AI with Ben Zhao

Ian Foster – Better Information Faster: Programming the Continuum

Fast and Flexible Next-Generation Data Centers

Muhammad Shahbaz

“Machine Learning Foundations Accelerate Innovation and Promote Trustworthiness” by Rebecca Willett

Nightshade: Data Poisoning to Fight Generative AI with Ben Zhao

Five UChicago CS students named to Siebel Scholars Class of 2024

In The News: U.N. Officials Urge Regulation of Artificial Intelligence

UChicago Computer Scientists Bring in Generative Neural Networks to Stop Real-Time Video From Lagging

UChicago Team Wins The NIH Long COVID Computational Challenge

UChicago Assistant Professor Raul Castro Fernandez Receives 2023 ACM SIGMOD Test-of-Time Award

Computer Science Displays Catch Attention at MSI’s Annual Robot Block Party

UChicago, Stanford Researchers Explore How Robots and Computers Can Help Strangers Have Meaningful In-Person Conversations

Postdoc Alum John Paparrizos Named ICDE Rising Star

New EAGER Grant to Asst. Prof. Eric Jonas Will Explore ML for Quantum Spectrometry

Assistant Professor Chenhao Tan Receives Sloan Research Fellowship