Published Jan 9, 2024
747: Technical Intro to Transformers and LLMs — with Kirill Eremenko
Kirill Eremenko and Jon Krohn delve into the intricate workings of transformers and large language models (LLMs), examining their training and inference processes, parallelization capabilities, and transformative impact on AI and career opportunities, highlighting the attention mechanism and architectural innovations that drive their efficiency.

Topics covered
Popular Clips
Episode Highlights
Related Episodes


759: Full Encoder-Decoder Transformers Fully Explained — with Kirill Eremenko
Answers 383 questions

695: NLP with Transformers — with Hugging Face's Lewis Tunstall
Answers 383 questions

649: Introduction to Machine Learning — with Kirill Eremenko and Hadelin de Ponteves
Answers 383 questions

SDS 513: Transformers for Natural Language Processing — with Denis Rothman
Answers 383 questions

721: Quantum Machine Learning — with Dr. Amira Abbas
Answers 383 questions

767: Open-Source LLM Libraries and Techniques — with Dr. Sebastian Raschka
Answers 383 questions

671: Cloud Machine Learning — with Kirill Eremenko and Hadelin de Ponteves
Answers 383 questions

787: MLOps: The Job and The Key Tools — with Demetrios Brinkmann
Answers 383 questions
772: In Case You Missed It in March 2024 — with Jon Krohn (@JonKrohnLearns)
Answers 383 questions

771: Gradient Boosting: XGBoost, LightGBM and CatBoost — with Kirill Eremenko
Answers 383 questions

661: Designing Machine Learning Systems — with Chip Huyen
Answers 383 questions

853: Generative AI for Business — with Kirill Eremenko and Hadelin de Ponteves
Answers 383 questions

847: AI Engineering 101 — with Ed Donner
Answers 383 questions














