Transformer Insights
The discussion delves into the nuances of transformer behavior, highlighting that while 78% of its predictions align with template matching, this doesn't imply superiority. Timothy emphasizes the distinction between describing outputs and explaining the underlying mechanisms, focusing on the statistical nature of transformer predictions without delving into its internal workings.In this clip
From this podcast

Machine Learning Street Talk (MLST)
Is ChatGPT an N-gram model on steroids?
Related Questions