SDS 585: PyMC for Bayesian Statistics in Python — with Thomas Wiecki

Topics covered
Popular Clips
Episode Highlights
Development
The PyMC library has undergone significant evolution, reflecting advancements in computational technology. shares how PyMC 4.0, the latest version, integrates JAX and GPU support, enhancing its capabilities for probabilistic programming 1. This development is part of a broader trend where open-source projects like PyMC benefit from community contributions and industry support, as notes, "We now make the library better for that particular customer, optimizing it for their use case, but also, of course, for everyone else, the whole community" 2. The legacy of Theano, a discontinued project, also plays a role in PyMC's evolution, with highlighting its influence on current computational frameworks 3.
Innovations
Technological innovations have been pivotal in PyMC's growth, particularly with the integration of new computational backends. explains that PyMC 4.0 can now run models on different backends, such as JAX, leading to significant speed improvements 4. This flexibility allows for complex models to be estimated quickly, even with large datasets. Additionally, mentions the revolutionary potential of Pyscript, which enables Python to run in web browsers, expanding accessibility 5. He describes this as "magic," emphasizing its impact on the Python ecosystem.
Comparisons
PyMC distinguishes itself from other Bayesian statistical libraries like Stan through its unique features and interfaces. acknowledges the influence of Stan on PyMC but highlights key differences, such as PyMC's integration with Python and its dynamic graph approach 6. Theano's legacy also contributes to PyMC's distinctiveness, with noting, "Theano is such an amazing system," despite its discontinuation 3. This historical context underscores PyMC's adaptability and innovation in the field of probabilistic programming.
Related Episodes


793: Bayesian Methods and Applications — with Alexandre Andorra
Answers 383 questions

SDS 507: Bayesian Statistics — with Rob Trangucci
Answers 383 questions

SDS 617: Causal Modeling and Sequence Data — with Sean Taylor
Answers 383 questions

SDS 523: Open-Source Analytical Computing (pandas, Apache Arrow) — with Wes McKinney
Answers 383 questions

SDS 599: MLOps: Machine Learning Operations — with @Miki_ML
Answers 383 questions

675: Pandas for Data Analysis and Visualization — with Stefanie Molin
Answers 383 questions

SDS 571: Collaborative, No-Code Machine Learning — with Tim Kraska
Answers 383 questions

SDS 587: Data Engineering for Data Scientists — with Mark Freeman
Answers 383 questions

765: NumPy, SciPy and the Economics of Open-Source — with Dr. Travis Oliphant
Answers 383 questions

SDS 563: How to Rock at Data Science — with @TinaHuang1
Answers 383 questions

SDS 435: Scaling Up Machine Learning — with Erica Greene
Answers 383 questions

SDS 539: Interpretable Machine Learning — with Serg Masís
Answers 383 questions













