Historical Foundations of RLHF

Nathan explores the deep-rooted connections between reinforcement learning and ancient philosophy, highlighting how the von Neumann Morgenstern utility theorem shapes our understanding of preferences. He raises critical questions about the influence of internet-sourced data versus curated datasets on AI models, emphasizing the need for insights from social sciences to better comprehend human preferences in modern AI.