Knowledge and Certainty
Neel discusses how models can recognize patterns, like associating Tim's scarf with him, and explores the nuances of knowledge and certainty in neural networks. He distinguishes between false facts and genuine ignorance, emphasizing that while models may have fewer false beliefs over time, some level of uncertainty will likely persist. The conversation also touches on innovative methods, such as semantic entropy, to assess a model's confidence and the intriguing concept of an entity detection circuit.In this clip
From this podcast

Machine Learning Street Talk (MLST)
Neel Nanda - Mechanistic Interpretability (Sparse Autoencoders)
Related Questions