Published Jul 29, 2020

Chip Huyen of Claypot AI— ML Research and Production Pipelines

Chip Huyen, co-founder of Claypot AI, delves into the dynamics of echo chambers in social media, her innovative path in teaching machine learning, and the realities of startup life, while offering insights into the challenges of deploying ML models and the value of maintaining robust evaluation practices.
Episode Highlights
Gradient Dissent - A Machine Learning Podcast logo

Popular Clips

Episode Highlights

  • Deployment Issues

    Transitioning machine learning models from research to production is fraught with challenges. highlights the importance of monitoring deployed systems to detect when retraining is necessary due to data distribution shifts 1. She notes that inference time for large models like GPT-2 can be prohibitively slow and costly, impacting a company's financial viability 1. also points out that many companies mistakenly chase buzzwords, opting for complex techniques when simpler algorithms would suffice 2.

    A lot of the problems can be solved by traditional classical algorithms.

    ---

    Additionally, she emphasizes the gap between clean research datasets and the messy, dynamic data encountered in real-world applications 2.

       

    Role of Baselines

    Baselines play a crucial role in evaluating machine learning models, yet they are often overlooked. argues that metrics like accuracy are meaningless without a baseline for comparison 3. She explains that baselines serve as landmarks, helping to contextualize model performance and set realistic goals 3.

    Baselines are landmarks to help you localize where the model performance is and where you want to get you.

    ---

    further discusses the concept of human baselines, which can serve as a best-case scenario for model performance 3.

Related Episodes