Published Sep 21, 2021

SDS 507: Bayesian Statistics — with Rob Trangucci

Rob Trangucci delves into Bayesian statistics, contrasting its complexity and practical applications with traditional methods, while offering insights from his PhD journey and the integration of Bayesian approaches in machine learning.
Episode Highlights
Super Data Science: ML & AI Podcast with Jon Krohn logo

Popular Clips

Episode Highlights

  • Bayesian Methods

    Bayesian methods offer a unique approach to machine learning, emphasizing the importance of priors and posteriors in model training. explains that while uninformative priors are rare, they can guide the learning process by providing soft constraints on parameter values. This approach contrasts with typical machine learning models, which often start with random initial values sampled from distributions.

    You want to have like a soft constraint. You want to, you know, your prior should put, I don't know, 99% of the sort of probability mass between two points, but, you know, there are tails where you can be wrong.

    ---

    The challenge of multimodal optimization in deep learning is also highlighted, where multiple parameter sets can yield similar results, complicating the search for a global optimum 1 2.

       

    Bayesian vs ML

    The comparison between Bayesian statistics and machine learning reveals distinct philosophical and practical differences. points out that Bayesian inference focuses on distributions, offering a richer understanding of parameter spaces compared to the point estimates typical in machine learning. This distinction is crucial, as Bayesian methods provide a full distribution of parameter values, enhancing the depth of analysis.

    There is this big difference between using a single, what you call the point value, a single number, as both the initialized weight in the machine learning model as well as the weight that we have come out of it that we, that we've learned and that we can use in a production model.

    ---

    Frequentist approaches, on the other hand, focus on hypothetical data sets and null models, which can limit their applicability in certain contexts 3 4.

Related Episodes