Published Nov 3, 2023

728: Use Contrastive Search to get Human-Quality LLM Outputs — with Jon Krohn (@JonKrohnLearns)

Jon Krohn delves into the transformative power of contrastive search technology in enhancing large language models' text generation, spotlighting its superiority over traditional methods like greedy and beam search, and analyzing advanced sampling techniques such as top k and nuclear sampling for more coherent outputs.

Episode Highlights

Topics covered

Popular Clips

Contrastive Search Breakthrough
Play Clip

Episode Highlights

Search Methods

explains the limitations of greedy search in generating AI text. This method selects the highest probability word, but often misses high probability words hidden behind low probability ones. Beam search improves upon this by looking ahead several words to find better sequences. However, it increases computational complexity and can still produce repetitive outputs 1.

Beam Search

Beam search, while an improvement over greedy search, has its own drawbacks. It tends to generate repetitive sequences, which can limit the diversity of AI-generated text. suggests sampling as an alternative to overcome this repetitiveness 1.

Decoding Methods

The choice of decoding method is crucial for achieving high-quality outputs from LLMs. emphasizes that model parameters alone are insufficient for human-like text generation. Decoding methods like beam search and sampling play a critical role in enhancing the quality of AI-generated content 1.

Related Episodes

704: Jon’s “Generative A.I. with LLMs” Hands-on Training — with Jon Krohn (@JonKrohnLearns)
Answers 383 questions
772: In Case You Missed It in March 2024 — with Jon Krohn (@JonKrohnLearns)
Answers 383 questions
SDS 464: A.I. vs Machine Learning vs Deep Learning — with Jon Krohn
Answers 383 questions
788: Multi-Agent Systems: How Teams of LLMs Excel at Complex Tasks — with @JonKrohnLearns
Answers 383 questions
670: LLaMA: GPT-3 performance, 10x smaller — with Jon Krohn (@JonKrohnLearns)
Answers 383 questions
720: OpenAI’s DALL-E 3, Image Chat and Web Search — with Jon Krohn (@JonKrohnLearns)
Answers 383 questions
684: Get More Language Context out of your LLM — with Jon Krohn (@JonKrohnLearns)
Answers 383 questions
808: In Case You Missed It in July 2024 — with Jon Krohn (@JonKrohnLearns)
Answers 383 questions
666: GPT-4 — with Jon Krohn (@JonKrohnLearns)
Answers 383 questions
750: How AI is Transforming Science — with Jon Krohn (@JonKrohnLearns)
Answers 383 questions
664: MIT Study: ChatGPT Dramatically Increases Productivity — with Jon Krohn (@JonKrohnLearns)
Answers 383 questions
SDS 568: PaLM: Google's Breakthrough Natural Language Model — with Jon Krohn
Answers 383 questions
740: Q*: OpenAI's Rumored AGI Breakthrough — with @JonKrohnLearns
Answers 383 questions
812: The AI Scientist: Towards Fully Automated, Open-Ended Scientific Discovery — with Jon Krohn
Answers 383 questions
694: CatBoost: Powerful, efficient ML for large tabular datasets — with Jon Krohn (@JonKrohnLearns)
Answers 383 questions

Dexa/Super Data Science: ML & AI Podcast with Jon Krohn

728: Use Contrastive Search to get Human-Quality LLM Outputs — with Jon Krohn (@JonKrohnLearns)

Topics covered

Popular Clips

Contrastive Search Breakthrough

Episode Highlights

Contrastive Search Innovation

Decoding Methods

Search Methods

Beam Search

Decoding Methods

Advanced Sampling Techniques

Related Episodes

704: Jon’s “Generative A.I. with LLMs” Hands-on Training — with Jon Krohn (@JonKrohnLearns)

772: In Case You Missed It in March 2024 — with Jon Krohn (@JonKrohnLearns)

SDS 464: A.I. vs Machine Learning vs Deep Learning — with Jon Krohn

788: Multi-Agent Systems: How Teams of LLMs Excel at Complex Tasks — with @JonKrohnLearns

670: LLaMA: GPT-3 performance, 10x smaller — with Jon Krohn (@JonKrohnLearns)

720: OpenAI’s DALL-E 3, Image Chat and Web Search — with Jon Krohn (@JonKrohnLearns)

684: Get More Language Context out of your LLM — with Jon Krohn (@JonKrohnLearns)

808: In Case You Missed It in July 2024 — with Jon Krohn (@JonKrohnLearns)

666: GPT-4 — with Jon Krohn (@JonKrohnLearns)

750: How AI is Transforming Science — with Jon Krohn (@JonKrohnLearns)

664: MIT Study: ChatGPT Dramatically Increases Productivity — with Jon Krohn (@JonKrohnLearns)

SDS 568: PaLM: Google's Breakthrough Natural Language Model — with Jon Krohn

740: Q*: OpenAI's Rumored AGI Breakthrough — with @JonKrohnLearns

812: The AI Scientist: Towards Fully Automated, Open-Ended Scientific Discovery — with Jon Krohn

694: CatBoost: Powerful, efficient ML for large tabular datasets — with Jon Krohn (@JonKrohnLearns)

728: Use Contrastive Search to get Human-Quality LLM Outputs — with Jon Krohn (@JonKrohnLearns)

Topics covered

Popular Clips

Episode Highlights

Contrastive Search InnovationJon Krohn explores the advantages of contrastive search over traditional decoding methods in generating human-like outputs from large language models. He discusses its development and growing adoption in the AI industry.

Contrastive Search Innovation

Decoding MethodsJon Krohn explores the challenges of generating human-like text with LLMs, focusing on the limitations of greedy and beam search methods. He highlights the importance of choosing the right decoding method to improve AI text generation quality.

Decoding Methods

Search Methods

Beam Search

Decoding Methods

Advanced Sampling TechniquesJon Krohn explores the use of sampling methods like top k and nuclear sampling to achieve human-like outputs from large language models (LLMs). He contrasts these with the innovative contrastive search, which offers even more coherent and natural text generation.

Advanced Sampling Techniques

Related Episodes