ICML 2024


Workshop

High-dimensional Learning Dynamics Workshop: The Emergence of Structure and Reasoning

Atish Agarwala · Courtney Paquette · Andrea Montanari · Cengiz Pehlevan · Sungyoon Lee · Murat Erdogdu · Naomi Saphra · Gowthami Somepalli · Swabha Swayamdipta · Tom Goldstein · Boaz Barak · Leshem Choshen · Shikhar Murty · Mengzhou Xia · Depen Morwani · Rosie Zhao

Straus 2
Workshop Website
Fri 26 Jul, midnight PDT

Modeling learning dynamics has long been a goal of the empirical science and theory communities in deep learning. These communities have grown rapidly in recent years, as our newly expanded understanding of the latent structures and capabilities of large models permits researchers to study these phenomena through the lens of the training process. Recent progress in understanding fully trained models can therefore enable understanding of their development and lead to insights that improve optimizer and architecture design, provide model interpretations, inform evaluation, and generally enhance the science of neural networks and their priors. We aim to foster discussion, discovery, and dissemination of state-of-the-art research in high-dimensional learning dynamics relevant to ML.

We invite participation in the 2nd Workshop on High-dimensional Learning Dynamics (HiLD), to be held as part of the ICML 2024 conference. This year’s theme focuses on understanding how reasoning capabilities and internal structures develop over the course of neural network training; we encourage submissions related to our theme as well as other topics around the theoretical and empirical understanding of learning in high-dimensional spaces. We will accept high-quality submissions as poster presentations during the workshop, especially work-in-progress and state-of-the-art ideas.

We welcome any topics in pursuit of understanding how model behaviors evolve or emerge. Example topics include but are not limited to:

The emergence of interpretable behaviors (e.g., circuit mechanisms) and capabilities (e.g., compositionality and reasoning)
Work that adapts tools from stochastic differential equations, high-dimensional probability, random matrix theory, and other theoretical frameworks to understand learning dynamics and phase transitions
Scaling laws related to internal structures and functional differences
Competition and dependencies among structures and heuristics, e.g., simplicity bias or learning staircase functions
Relating optimizer design and loss landscape geometry to implicit regularization, inductive bias, and generalization
