Deqing Fu (USC)- Algorithmic Perspectives on Understanding Transformers

Date & Time:

May 8, 2025 1:00 pm – 2:00 pm

Location:

Crerar 298, 5730 S. Ellis Ave., Chicago, IL,

05/08/2025 01:00 PM 05/08/2025 02:00 PM America/Chicago Deqing Fu (USC)- Algorithmic Perspectives on Understanding Transformers Crerar 298, 5730 S. Ellis Ave., Chicago, IL,

Abstract: In this talk, we explore how algorithmic insights and tools from optimization theory and Fourier transforms can shed light on the mechanisms underlying Transformers’ ability to solve fundamental computational tasks, including linear regression and addition. We will examine the interplay between architectural design and pre-training data in enabling Transformers to learn these mechanisms effectively. Lastly, we will discuss recent advancements in directly mapping numbers to their Fourier representations, eliminating the tokenization step entirely for numbers to improve efficiency and accuracy.

Speakers

Deqing Fu

PhD Student

Deqing Fu is a third-year Ph.D. student in Computer Science at the University of Southern California (USC). His research focuses on deep learning theory, natural language processing, and the interpretability of AI systems. He is co-advised by Prof. Vatsal Sharan in the USC Theory Group and Prof. Robin Jia in the USC NLP Group. Prior to his Ph.D., Deqing earned his undergraduate and master’s degrees in mathematics and statistics from the University of Chicago.

Resources

Community

Innovation at the Forefront: UChicago CS Researchers Make Significant Contributions to CHI 2025

The University of Chicago Hosts the First Great Lakes Graphics Workshop

Quantum Materials, Built By AI Robot