Date & Time:
February 25, 2025 2:00 pm – 3:00 pm
Location:
JCL 390
Jiachen Wang (Princeton): Fueling Responsible AI Advancement with Data Attribution

Abstract: As artificial intelligence (AI) systems expand across society, understanding how training data shapes model behavior has become fundamental to building trustworthy AI. Data attribution techniques quantify the influence of individual training samples on machine learning models, enabling us to address pressing challenges around data quality, training efficiency, copyright disputes, and interpretability.
In this talk, I will present our advances in developing theoretically rigorous yet practical data attribution methods. First, I will introduce Data Banzhaf, a data value notion derived from cooperative game theory that provides provably robust data influence estimation for any learning algorithm. While Data Banzhaf offers a general framework, we also develop specialized techniques to analyze how data influence evolves during deep learning optimization. Through this lens, we uncover that examples from early and late training stages have an outsized impact on foundation model pretraining. These insights enable strategic data selection that reduces computational overhead while maintaining model performance.
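For readers unfamiliar with the game-theoretic notion behind Data Banzhaf, the following is a minimal sketch of the classical Banzhaf value: a data point's average marginal contribution to a utility function, taken over all subsets of the remaining points. It is an illustration only, not the speaker's implementation. The function names and the toy utility are hypothetical, and the brute-force enumeration is only feasible for a handful of points; practical methods rely on sampling-based estimates.

```python
from itertools import combinations


def banzhaf_values(n, utility):
    """Exact Banzhaf value for each of n data points.

    `utility` maps a frozenset of point indices to a score (for example,
    validation accuracy of a model trained on that subset). The cost is
    exponential in n, so this is only usable on toy problems.
    """
    values = []
    for i in range(n):
        others = [j for j in range(n) if j != i]
        total = 0.0
        # Enumerate every subset S of the other points.
        for k in range(len(others) + 1):
            for subset in combinations(others, k):
                s = frozenset(subset)
                # Marginal contribution of point i when added to S.
                total += utility(s | {i}) - utility(s)
        # Uniform average over the 2^(n-1) subsets.
        values.append(total / 2 ** (n - 1))
    return values


# Hypothetical toy utility: points 0 and 1 are "clean" and each adds 1.0;
# point 2 is "noisy" and subtracts 0.5. Real utilities come from training.
def toy_utility(subset):
    gains = {0: 1.0, 1: 1.0, 2: -0.5}
    return sum(gains[i] for i in subset)


scores = banzhaf_values(3, toy_utility)
print(scores)
```

Because the toy utility is additive, each point's Banzhaf value equals its individual gain, so the noisy point receives a negative score and would be a candidate for removal.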

Speakers

Jiachen Wang

PhD Candidate, Princeton University

Jiachen (“Tianhao”) is a Ph.D. student at Princeton University, advised by Prof. Prateek Mittal. His research focuses on developing theoretical foundations and practical tools for trustworthy machine learning from a data-centric perspective. Most recently, he has been developing scalable, theoretically grounded data attribution and curation techniques for foundation models. His contributions have been recognized through multiple fellowships and oral/spotlight presentations at top AI/ML venues. He was selected as a Rising Star in Data Science in 2024.
