Published Oct 12, 2021

SDS 513: Transformers for Natural Language Processing — with Denis Rothman

Delve into the fascinating world of transformers in natural language processing with AI expert Denis Rothman, as he unpacks the intricate workings of explainable AI, shares his prolific writing journey, and discusses the innovations and ethical considerations surrounding AI-driven language models.

Episode Highlights

Topics covered

Episode Highlights

Transformer Basics

provides an insightful overview of transformer models, emphasizing their transformative impact on natural language processing (NLP). He highlights the evolution from traditional models like recurrent neural networks to transformers, which offer unprecedented accuracy in NLP tasks 1. Rothman shares his experience with training models like GPT-2 and BERT, illustrating their ability to handle vast amounts of data and perform complex tasks such as question answering and text summarization 2.

The goal here is to play around with it. I mean, if you. You're not. You have to have a lot of fun. Otherwise, you'll never understand transformers.

---

He stresses the importance of experimentation and fun in understanding these models, encouraging users to engage with them creatively 3.

Applications & Advancements

The practical applications and advancements in transformer technology are vast, as explains. He describes the industrial model evolution, where layers are standardized and scaled to handle massive data efficiently, leading to innovations like OpenAI's GPT-3 with its 175 billion parameters 4. This scalability allows transformers to perform a wide range of tasks, from language translation to sentiment analysis, with remarkable speed and accuracy 5.

So instead of having, like, a convolutional neural network where you have layers, but none of these layers are the same size, none of these layers do the same thing. That's like a 1930 car.

---

Rothman also discusses the complexity of neural networks, likening the challenge of building representations in transformers to the intricate processes of the human brain 6.

Related Episodes

695: NLP with Transformers — with Hugging Face's Lewis Tunstall
Answers 383 questions
SDS 583: The State of Natural Language Processing — with Rongyao Huang
Answers 383 questions
747: Technical Intro to Transformers and LLMs — with Kirill Eremenko
Answers 383 questions
759: Full Encoder-Decoder Transformers Fully Explained — with Kirill Eremenko
Answers 383 questions
SDS 559: GPT-3 for Natural Language Processing — with Melanie Subbiah
Answers 383 questions
SDS 549: Engineering Natural Language Models — with Lauren Zhu
Answers 383 questions
SDS 564: Clem Delangue on Hugging Face and Transformers
Answers 383 questions
SDS 539: Interpretable Machine Learning — with Serg Masís
Answers 383 questions
687: Generative Deep Learning — with David Foster
Answers 383 questions
SDS 445: Conversational A.I. — with Sinan Ozdemir
Answers 383 questions
SDS 464: A.I. vs Machine Learning vs Deep Learning — with Jon Krohn
Answers 383 questions
SDS 433: Data Science Trends for 2021 — with Ben Taylor
Answers 383 questions
SDS 589: Narrative A.I. — with Hilary Mason
Answers 383 questions
SDS 587: Data Engineering for Data Scientists — with Mark Freeman
Answers 383 questions
SDS 558: @JonKrohnLearns's Answers to Questions on Machine Learning
Answers 383 questions

SDS 513: Transformers for Natural Language Processing — with Denis Rothman

Topics covered

Popular Clips

Episode Highlights

Explainable AIDenis Rothman explores model-agnostic methods for explainable AI, emphasizing tools like SHAP and LIME. He also discusses the ethical challenges of implementing these methods in real-world applications.

Explainable AI

AI Writing and PublishingDenis Rothman discusses his efficient book writing process and the themes of his works on natural language processing and explainable AI. He shares insights into his rapid writing style and the content of his influential books.

AI Writing and Publishing

Transformer ModelsDenis Rothman, a pioneer in artificial intelligence, explores the revolutionary role of transformers in natural language processing. He shares insights into their development, applications, and the future of AI-driven language models.

Transformer Models

Transformer Basics

Applications & Advancements

Related Episodes