Dexa/Super Data Science: ML & AI Podcast with Jon Krohn

Published Jun 11, 2024

791: Reinforcement Learning from Human Feedback (RLHF) — with Dr. Nathan Lambert

Dr. Nathan Lambert delves into the transformative role of AI in robotics and the evolution of Reinforcement Learning from Human Feedback (RLHF), while addressing the challenges of aligning AI systems with human preferences and exploring innovative solutions like Constitutional AI to enhance safety and model behavior.

Episode Highlights

Topics covered

Episode Highlights

Related Episodes

774: RFM-1 Gives Robots Human-like Reasoning and Conversation Abilities — with @JonKrohnLearns
Answers 383 questions
SDS 503: Deep Reinforcement Learning for Robotics — with Pieter Abbeel
Answers 383 questions
767: Open-Source LLM Libraries and Techniques — with Dr. Sebastian Raschka
Answers 383 questions
695: NLP with Transformers — with Hugging Face's Lewis Tunstall
Answers 383 questions
SDS 551: Deep Reinforcement Learning — with Wah Loon Keng
Answers 383 questions
679: The A.I. and Machine Learning Landscape — with investor George Mathew
Answers 383 questions
847: AI Engineering 101 — with Ed Donner
Answers 383 questions
687: Generative Deep Learning — with David Foster
Answers 383 questions
775: What will humans do when machines are vastly more intelligent? — with Aleksa Gordić
Answers 383 questions
823: Virtual Humans and AI Clones — with Natalie Monbiot
Answers 383 questions
769: Generative AI for Medicine — with Prof. Zack Lipton
Answers 383 questions
SDS 583: The State of Natural Language Processing — with Rongyao Huang
Answers 383 questions
656: A.I. Talent and the Red-Hot A.I. Skills — with Jaclyn Rice Nelson
Answers 383 questions
SDS 611: Open-Ended A.I.: Practical Applications for Humans and Machines — with Kenneth Stanley
Answers 383 questions
841: AI Vision, Agents and Business Value — with Andrew Ng
Answers 383 questions