Date & Time:
July 24, 2025 2:00 pm – 3:00 pm
Location:
Searle 236
Chi Han (UIUC) - Internal Workings of Foundation Models: Diagnosing and Adapting Internal Representations

Abstract: While foundation models (FMs) continue to revolutionize natural language processing and AI applications, tracing, locating, and precisely addressing their limitations remains a major challenge. The development of future language models would greatly benefit from a structural understanding of FMs. This presentation brings together several recent papers that systematically explain the internal representations of FMs from both theoretical and empirical perspectives. These works offer preliminary characterizations of the roles and adaptation of internal components: (1) how positional representations hold clues for resolving the context length limitations of FMs, (2) how word representations can be used as steers for generation control, and (3) how cross-modal representations can be best aligned for scientific discovery. Together, they provide insights into addressing inherent limitations of FMs in a principled and efficient way, and point to a promising future of developing a modular “anatomy” for foundation models.

Speakers

Chi Han

PhD Student, University of Illinois Urbana-Champaign

Chi Han is a final-year Computer Science Ph.D. student in the NLP group at the University of Illinois Urbana-Champaign (UIUC), advised by Prof. Heng Ji. Before joining UIUC, he was an undergraduate in the Yao Class program at Tsinghua University, China, and he visited the CoCoSci Lab at the Massachusetts Institute of Technology (MIT) during his undergraduate studies. He has first-authored papers at top conferences, including NeurIPS, ICLR, ACL, and NAACL, and received outstanding paper awards as first author at NAACL 2024 and ACL 2024. He has also received an IBM PhD Fellowship and an Amazon AICE PhD Fellowship. His research centers on a theoretical understanding of representations in foundation models (FMs), with the aim of providing insights and tools for efficient, controllable, and interpretable foundation models.
