Published Apr 16, 2023
#114 - Secrets of Deep Reinforcement Learning (Minqi Jiang)
Dive into the secrets of deep reinforcement learning with Minqi Jiang as he unravels the complexities of defining intelligence, the strategic use of minimax regret, and the dynamic balance of creativity and reliability in language models through Reinforcement Learning from Human Feedback.

Topics covered
Popular Clips
Episode Highlights
Related Episodes


Can we build a generalist agent? Dr. Minqi Jiang and Dr. Marc Rigter
Answers 383 questions

#045 Microsoft's Platform for Reinforcement Learning (Bonsai)
Answers 383 questions

#49 - Meta-Gradients in RL - Dr. Tom Zahavy (DeepMind)
Answers 383 questions

Understanding Deep Learning - Prof. SIMON PRINCE [STAFF FAVOURITE]
Answers 383 questions

#046 The Great ML Stagnation (Mark Saroufim and Dr. Mathew Salvaris)
Answers 383 questions

#036 - Max Welling: Quantum, Manifolds & Symmetries in ML
Answers 383 questions

#53 Quantum Natural Language Processing - Prof. Bob Coecke (Oxford)
Answers 383 questions

Dr. Paul Lessard - Categorical/Structured Deep Learning
Answers 383 questions
#65 Prof. PEDRO DOMINGOS [Unplugged]
Answers 383 questions

#107 - Dr. RAPHAËL MILLIÈRE - Linguistics, Theory of Mind, Grounding
Answers 383 questions

#60 Geometric Deep Learning Blueprint (Special Edition)
Answers 383 questions

CURL: Contrastive Unsupervised Representations for Reinforcement Learning
Answers 383 questions

Prof. Chris Bishop's NEW Deep Learning Textbook!
Answers 383 questions

Robert Lange on NN Pruning and Collective Intelligence
Answers 383 questions
