Published Jan 9, 2024

747: Technical Intro to Transformers and LLMs — with Kirill Eremenko

Kirill Eremenko and Jon Krohn delve into the intricate workings of transformers and large language models (LLMs), examining their training and inference processes, parallelization capabilities, and transformative impact on AI and career opportunities, highlighting the attention mechanism and architectural innovations that drive their efficiency.
Episode Highlights
Super Data Science: ML & AI Podcast with Jon Krohn logo

Popular Clips

Episode Highlights

Related Episodes