Large language models are leading to many exciting breakthroughs, but this comes at a significant cost in terms of both computation and data labeling. Training state-of-the-art models requires access to high-end GPUs for pre-training and inference, in addition to labeled data for fine-tuning. In this talk I will examine the tradeoff between these costs, with the goal of supporting better decisions. Conventional wisdom holds that annotating data is expensive, so computational methods that use unlabeled data to improve performance can present an economical alternative. I will examine this assumption in the context of pretraining-based adaptation, which requires significant computation for each new domain. As a second example where the tradeoff between computation and annotation arises, I will show that training and then distilling large models can be an economical strategy for improving performance. Finally, I will discuss applications to chemical synthesis protocols, and show a demo of a system that can help chemists more efficiently find experimental conditions described in the literature. I will also present a new approach to extracting data from tables in scientific articles in which the only supervision provided to the model is a database schema, eliminating the need for labeled data or custom data extraction pipelines.
Alan Ritter is an associate professor in the College of Computing at Georgia Tech. His research on natural language processing aims to solve technical challenges that help machines read the web and engage in safe and helpful dialogue with people. In a recent project, covered by WIRED (https://www.wired.com/story/machine-learning-tweets-critical-security-flaws/), Alan’s group built a system that reads millions of online messages for mentions of new software vulnerabilities. He completed his Ph.D. at the University of Washington and was a postdoctoral fellow in the Machine Learning Department at Carnegie Mellon. Alan is the recipient of an NSF CAREER award and an Amazon Research Award.