GPU Limitations
Stella discusses the limits of fitting large language models on GPUs, emphasizing the challenges individuals face with high-parameter-count models. Lukas raises questions about model comparisons, highlighting the lack of public information on training strategies and data sources.
From this podcast

Gradient Dissent - A Machine Learning Podcast
How EleutherAI Trains and Releases LLMs: Interview with Stella Biderman