Historical Foundations of RLHF

The discussion delves into the historical roots of reinforcement learning from ancient philosophy to modern economics, exploring how concepts like the von Neumann-Morgenstern utility theorem shape current methodologies. Questions arise about the nature of preferences in AI models, particularly whether they are more influenced by internet-sourced data or curated datasets. This exploration highlights the importance of integrating insights from social sciences to better understand human preferences in AI development.

In this clip
From this podcast
Super Data Science: ML & AI Podcast with Jon Krohn
791: Reinforcement Learning from Human Feedback (RLHF) — with Dr. Nathan Lambert
Related Questions
- What is this clip about?
- What is the main topic of this clip?

Historical Foundations of RLHF

In this clip

From this podcast

Super Data Science: ML & AI Podcast with Jon Krohn

791: Reinforcement Learning from Human Feedback (RLHF) — with Dr. Nathan Lambert

Related Questions

What is this clip about?

What is the main topic of this clip?