Dexa/Super Data Science: ML & AI Podcast with Jon Krohn

Published Jan 9, 2024

747: Technical Intro to Transformers and LLMs — with Kirill Eremenko

Kirill Eremenko and Jon Krohn delve into the intricate workings of transformers and large language models (LLMs), examining their training and inference processes, parallelization capabilities, and transformative impact on AI and career opportunities, highlighting the attention mechanism and architectural innovations that drive their efficiency.

Episode Highlights

Topics covered

Episode Highlights

Related Episodes

759: Full Encoder-Decoder Transformers Fully Explained — with Kirill Eremenko
Answers 383 questions
695: NLP with Transformers — with Hugging Face's Lewis Tunstall
Answers 383 questions
649: Introduction to Machine Learning — with Kirill Eremenko and Hadelin de Ponteves
Answers 383 questions
SDS 513: Transformers for Natural Language Processing — with Denis Rothman
Answers 383 questions
721: Quantum Machine Learning — with Dr. Amira Abbas
Answers 383 questions
767: Open-Source LLM Libraries and Techniques — with Dr. Sebastian Raschka
Answers 383 questions
671: Cloud Machine Learning — with Kirill Eremenko and Hadelin de Ponteves
Answers 383 questions
787: MLOps: The Job and The Key Tools — with Demetrios Brinkmann
Answers 383 questions
801: Merged LLMs Are Smaller And More Capable — with Arcee AI's Mark McQuade and Charles Goddard
Answers 383 questions
772: In Case You Missed It in March 2024 — with Jon Krohn (@JonKrohnLearns)
Answers 383 questions
771: Gradient Boosting: XGBoost, LightGBM and CatBoost — with Kirill Eremenko
Answers 383 questions
661: Designing Machine Learning Systems — with Chip Huyen
Answers 383 questions
853: Generative AI for Business — with Kirill Eremenko and Hadelin de Ponteves
Answers 383 questions
704: Jon’s “Generative A.I. with LLMs” Hands-on Training — with Jon Krohn (@JonKrohnLearns)
Answers 383 questions
847: AI Engineering 101 — with Ed Donner
Answers 383 questions