Ohad Shamir (Weizmann) – Training Neural Networks: The Bigger the Better?

Date & Time:

November 1, 2019 10:30 am – 11:30 am

Location:

Crerar 390, 5730 S. Ellis Ave., Chicago, IL

CS / Toyota Technological Institute of Chicago Machine Learning Seminar Series

Training Neural Networks: The Bigger the Better?

Artificial neural networks are nowadays routinely trained to solve challenging learning tasks, but our theoretical understanding of this phenomenon remains quite limited. One increasingly popular approach, which is aligned with practice, is to study how making the network sufficiently large (a.k.a. “over-parameterized”) makes the associated training problem easier. In this talk, I'll describe some of the possibilities and challenges in understanding neural networks using this approach. Based on joint works with Itay Safran and Gilad Yehudai.

Ohad Shamir

Faculty Member, Department of Computer Science and Applied Mathematics, Weizmann Institute

Ohad Shamir is a faculty member at the Department of Computer Science and Applied Mathematics at the Weizmann Institute. He received his PhD in 2010 from the Hebrew University, and was a researcher at Microsoft Research in Boston from 2010 to 2013 and again from 2017 to 2018. His research focuses on theoretical machine learning, in areas such as the theory of deep learning, learning with information and communication constraints, and topics at the intersection of machine learning and optimization. He has received several awards and has served as program co-chair of COLT as well as a member of its steering committee.
