Published Nov 3, 2023

728: Use Contrastive Search to get Human-Quality LLM Outputs — with Jon Krohn (@JonKrohnLearns)

Jon Krohn delves into the transformative power of contrastive search technology in enhancing large language models' text generation, spotlighting its superiority over traditional methods like greedy and beam search, and analyzing advanced sampling techniques such as top k and nuclear sampling for more coherent outputs.
Episode Highlights
Super Data Science: ML & AI Podcast with Jon Krohn logo

Popular Clips

Episode Highlights

  • Search Methods

    explains the limitations of greedy search in generating AI text. This method selects the highest probability word, but often misses high probability words hidden behind low probability ones. Beam search improves upon this by looking ahead several words to find better sequences. However, it increases computational complexity and can still produce repetitive outputs 1.

       

    Beam Search

    Beam search, while an improvement over greedy search, has its own drawbacks. It tends to generate repetitive sequences, which can limit the diversity of AI-generated text. suggests sampling as an alternative to overcome this repetitiveness 1.

       

    Decoding Methods

    The choice of decoding method is crucial for achieving high-quality outputs from LLMs. emphasizes that model parameters alone are insufficient for human-like text generation. Decoding methods like beam search and sampling play a critical role in enhancing the quality of AI-generated content 1.

Related Episodes