Workshop

Workshop on Mechanistic Interpretability

Fazl Barez · Lawrence Chan · Mor Geva · Kayo Yin · Neel Nanda · Max Tegmark

Project Page

Abstract

We are holding a one-day workshop on mechanistic interpretability -- the study of reverse-engineering trained neural networks to understand their learned algorithms and internal structure. Check out our website for more details, including our proposed topics for discussion, speakers and panellists!

Video

Chat is not available.

Schedule

Timezone: America/Los_Angeles

12:00 AM

Welcome + Talk 1: David Bau

Video

12:30 AM

Oral Presentation

Video

1:30 AM

Spotlights 1

Video

2:00 AM

Poster Session 1

3:00 AM

Panel Discussion

Video

5:00 AM

Spotlights 2

Video

5:30 AM

Poster Session 2

7:00 AM

Talk 2: Asma Ghandeharioun

Video

7:30 AM

Talk 3: Chris Olah (remote)

Video