Published Nov 3, 2023

728: Use Contrastive Search to get Human-Quality LLM Outputs — with Jon Krohn (@JonKrohnLearns)

Jon Krohn delves into the transformative power of contrastive search technology in enhancing large language models' text generation, spotlighting its superiority over traditional methods like greedy and beam search, and analyzing advanced sampling techniques such as top k and nuclear sampling for more coherent outputs.

Episode Highlights

Topics covered

Popular Clips

Contrastive Search Breakthrough
Play Clip

Episode Highlights

Sampling Methods

Sampling methods in LLMs, such as top k and nuclear sampling, are pivotal for generating coherent and human-like text outputs. explains that these methods rely on probability distributions to select words, resulting in varied outputs each time a model is run, unlike deterministic methods like greedy or beam search 1. This randomness is why tools like ChatGPT provide unique responses to the same query.

In this paradigm, the highest probability word will be selected most frequently, but not always more specifically, using the technical terminology of probability theory, we sample words in this sampling paradigm according to the probability distribution that we extract from the large language model that we're using.

---

Despite their effectiveness, Krohn notes that contrastive search offers superior results, making it a preferred choice for production environments.

Advanced Techniques

Top k and nuclear sampling are specific techniques within the broader sampling paradigm that enhance the fluency of LLM outputs. highlights that these methods support the generation of human-like text by leveraging the probability distribution of words 1. However, he emphasizes that contrastive search, introduced at the NeuRIps conference, surpasses these methods in producing the most human-like outputs.

Two specific and popular decoding approaches that leverage sampling are top k sampling and nuclear sampling.

---

Krohn encourages the use of contrastive search in production, as it consistently yields superior results.

Related Episodes

704: Jon’s “Generative A.I. with LLMs” Hands-on Training — with Jon Krohn (@JonKrohnLearns)
Answers 383 questions
772: In Case You Missed It in March 2024 — with Jon Krohn (@JonKrohnLearns)
Answers 383 questions
SDS 464: A.I. vs Machine Learning vs Deep Learning — with Jon Krohn
Answers 383 questions
788: Multi-Agent Systems: How Teams of LLMs Excel at Complex Tasks — with @JonKrohnLearns
Answers 383 questions
670: LLaMA: GPT-3 performance, 10x smaller — with Jon Krohn (@JonKrohnLearns)
Answers 383 questions
720: OpenAI’s DALL-E 3, Image Chat and Web Search — with Jon Krohn (@JonKrohnLearns)
Answers 383 questions
684: Get More Language Context out of your LLM — with Jon Krohn (@JonKrohnLearns)
Answers 383 questions
808: In Case You Missed It in July 2024 — with Jon Krohn (@JonKrohnLearns)
Answers 383 questions
666: GPT-4 — with Jon Krohn (@JonKrohnLearns)
Answers 383 questions
750: How AI is Transforming Science — with Jon Krohn (@JonKrohnLearns)
Answers 383 questions
664: MIT Study: ChatGPT Dramatically Increases Productivity — with Jon Krohn (@JonKrohnLearns)
Answers 383 questions
SDS 568: PaLM: Google's Breakthrough Natural Language Model — with Jon Krohn
Answers 383 questions
740: Q*: OpenAI's Rumored AGI Breakthrough — with @JonKrohnLearns
Answers 383 questions
812: The AI Scientist: Towards Fully Automated, Open-Ended Scientific Discovery — with Jon Krohn
Answers 383 questions
694: CatBoost: Powerful, efficient ML for large tabular datasets — with Jon Krohn (@JonKrohnLearns)
Answers 383 questions

Dexa/Super Data Science: ML & AI Podcast with Jon Krohn

728: Use Contrastive Search to get Human-Quality LLM Outputs — with Jon Krohn (@JonKrohnLearns)

Topics covered

Popular Clips

Contrastive Search Breakthrough

Episode Highlights

Contrastive Search Innovation

Decoding Methods

Advanced Sampling Techniques

Sampling Methods

Advanced Techniques

Related Episodes

704: Jon’s “Generative A.I. with LLMs” Hands-on Training — with Jon Krohn (@JonKrohnLearns)

772: In Case You Missed It in March 2024 — with Jon Krohn (@JonKrohnLearns)

SDS 464: A.I. vs Machine Learning vs Deep Learning — with Jon Krohn

788: Multi-Agent Systems: How Teams of LLMs Excel at Complex Tasks — with @JonKrohnLearns

670: LLaMA: GPT-3 performance, 10x smaller — with Jon Krohn (@JonKrohnLearns)

720: OpenAI’s DALL-E 3, Image Chat and Web Search — with Jon Krohn (@JonKrohnLearns)

684: Get More Language Context out of your LLM — with Jon Krohn (@JonKrohnLearns)

808: In Case You Missed It in July 2024 — with Jon Krohn (@JonKrohnLearns)

666: GPT-4 — with Jon Krohn (@JonKrohnLearns)

750: How AI is Transforming Science — with Jon Krohn (@JonKrohnLearns)

664: MIT Study: ChatGPT Dramatically Increases Productivity — with Jon Krohn (@JonKrohnLearns)

SDS 568: PaLM: Google's Breakthrough Natural Language Model — with Jon Krohn

740: Q*: OpenAI's Rumored AGI Breakthrough — with @JonKrohnLearns

812: The AI Scientist: Towards Fully Automated, Open-Ended Scientific Discovery — with Jon Krohn

694: CatBoost: Powerful, efficient ML for large tabular datasets — with Jon Krohn (@JonKrohnLearns)

728: Use Contrastive Search to get Human-Quality LLM Outputs — with Jon Krohn (@JonKrohnLearns)

Topics covered

Popular Clips

Episode Highlights

Contrastive Search InnovationJon Krohn explores the advantages of contrastive search over traditional decoding methods in generating human-like outputs from large language models. He discusses its development and growing adoption in the AI industry.

Contrastive Search Innovation

Decoding MethodsJon Krohn explores the challenges of generating human-like text with LLMs, focusing on the limitations of greedy and beam search methods. He highlights the importance of choosing the right decoding method to improve AI text generation quality.

Decoding Methods

Advanced Sampling TechniquesJon Krohn explores the use of sampling methods like top k and nuclear sampling to achieve human-like outputs from large language models (LLMs). He contrasts these with the innovative contrastive search, which offers even more coherent and natural text generation.

Advanced Sampling Techniques

Sampling Methods

Advanced Techniques

Related Episodes