Workshop
WORKSHOP ON MECHANISTIC INTERPRETABILITY
Fazl Barez · Lawrence Chan · Mor Geva · Kayo Yin · Neel Nanda · Max Tegmark
Lehar 1
We propose a one-day workshop on mechanistic interpretability -- reverse-engineering algorithms from the internals of neural networks.
Live content is unavailable. Log in and register to view live content