Language Model Performance

Sameer discusses why abstract reasoning puzzles have limited relevance to real-world applications. Yasaman explores how the performance gap in language models relates to how frequently terms appear in the pre-training corpus, and why it is hard to disentangle measured accuracy from pre-training effects.

From this podcast
Machine Learning Street Talk (MLST)
#73 - YASAMAN RAZEGHI & Prof. SAMEER SINGH - NLP benchmarks
Related Questions
- Are there related cognitive benefits to memorization?
- Is doing math a type of cognitive training?