Published Jul 11, 2023

695: NLP with Transformers — with Hugging Face's Lewis Tunstall

Join AI expert Jon Krohn and Hugging Face's Lewis Tunstall as they dive into the transformative world of NLP with transformers, exploring how Hugging Face's open-source efforts are democratizing AI, the integration of reinforcement learning for improved NLP performance, and the innovative power of transformers in revolutionizing human-computer interactions.

Episode Highlights

Topics covered

Episode Highlights

RL Basics

Reinforcement learning (RL) is pivotal in training NLP models, as it allows for the creation of systems that align more closely with human preferences. explains that RL involves a feedback loop where models are trained to optimize outputs based on human evaluations 1. This process is exemplified by OpenAI's approach to summarization, where human feedback is used to refine model outputs beyond traditional metrics like Rouge scores 1.

Instead of trying to use some metric like Rouge, which always has some, you know, limitations, the thing we really care about is people reading summaries.

---

This method has shown that models trained with RL are often preferred by humans, highlighting the importance of integrating human feedback into AI development 2.

Human Feedback

Human feedback plays a crucial role in refining AI models through reinforcement learning. describes how human evaluations are used to train models to produce more desirable outputs, as seen in systems like ChatGPT 1. This feedback loop involves humans rating model outputs, which then informs the model's learning process to better align with human expectations 3.

The summary point is that it allows the model to have outputs that are more aligned with the kind of thing that you would like to see.

---

This approach has led to significant improvements in model performance, making AI outputs more relevant and useful to users 3.

Related Episodes

SDS 564: Clem Delangue on Hugging Face and Transformers
Answers 383 questions
659: Open-Source Tools for Natural Language Processing — with Vincent Warmerdam
Answers 383 questions
SDS 513: Transformers for Natural Language Processing — with Denis Rothman
Answers 383 questions
747: Technical Intro to Transformers and LLMs — with Kirill Eremenko
Answers 383 questions
767: Open-Source LLM Libraries and Techniques — with Dr. Sebastian Raschka
Answers 383 questions
661: Designing Machine Learning Systems — with Chip Huyen
Answers 383 questions
687: Generative Deep Learning — with David Foster
Answers 383 questions
SDS 559: GPT-3 for Natural Language Processing — with Melanie Subbiah
Answers 383 questions
SDS 583: The State of Natural Language Processing — with Rongyao Huang
Answers 383 questions
759: Full Encoder-Decoder Transformers Fully Explained — with Kirill Eremenko
Answers 383 questions
791: Reinforcement Learning from Human Feedback (RLHF) — with Dr. Nathan Lambert
Answers 383 questions
679: The A.I. and Machine Learning Landscape — with investor George Mathew
Answers 383 questions
807: Superintelligence and the Six Singularities — with Dr. Daniel Hulme
Answers 383 questions
853: Generative AI for Business — with Kirill Eremenko and Hadelin de Ponteves
Answers 383 questions
SDS 549: Engineering Natural Language Models — with Lauren Zhu
Answers 383 questions

Dexa/Super Data Science: ML & AI Podcast with Jon Krohn

695: NLP with Transformers — with Hugging Face's Lewis Tunstall

Topics covered

Popular Clips

Curious Conversations

Democratizing ML Models

Hugging Face Ecosystem

Next Word Prediction

Model Efficiency Techniques

Open Models Debate

Understanding Transformers

Transforming NLP Approaches

Open Source Breakthroughs

Reinforcement Learning from Humans

Feature Extraction Revolution

Contributing to Open Source

Transforming NLP Models

Understanding Transformers

Episode Highlights

Hugging Face Contributions

Reinforcement Learning in NLP

RL Basics

Human Feedback

Understanding Transformers

Related Episodes

SDS 564: Clem Delangue on Hugging Face and Transformers

659: Open-Source Tools for Natural Language Processing — with Vincent Warmerdam

SDS 513: Transformers for Natural Language Processing — with Denis Rothman

747: Technical Intro to Transformers and LLMs — with Kirill Eremenko

767: Open-Source LLM Libraries and Techniques — with Dr. Sebastian Raschka

661: Designing Machine Learning Systems — with Chip Huyen

687: Generative Deep Learning — with David Foster

SDS 559: GPT-3 for Natural Language Processing — with Melanie Subbiah

SDS 583: The State of Natural Language Processing — with Rongyao Huang

759: Full Encoder-Decoder Transformers Fully Explained — with Kirill Eremenko

791: Reinforcement Learning from Human Feedback (RLHF) — with Dr. Nathan Lambert

679: The A.I. and Machine Learning Landscape — with investor George Mathew

807: Superintelligence and the Six Singularities — with Dr. Daniel Hulme

853: Generative AI for Business — with Kirill Eremenko and Hadelin de Ponteves

SDS 549: Engineering Natural Language Models — with Lauren Zhu

695: NLP with Transformers — with Hugging Face's Lewis Tunstall

Topics covered

Popular Clips

Episode Highlights

Hugging Face ContributionsHugging Face's open-source initiatives are revolutionizing AI by making advanced models accessible to a broader audience. Lewis Tunstall and Jon Krohn discuss the impact of these contributions on the AI community and the growing importance of community engagement.

Hugging Face Contributions

Reinforcement Learning in NLPReinforcement learning is essential for enhancing NLP models by aligning them with human preferences. Lewis Tunstall discusses how human feedback is integrated into this process, improving model outputs and performance.

Reinforcement Learning in NLP

RL Basics

Human Feedback

Understanding Transformers

Related Episodes