AI Reward Challenges
Tim and Connor discuss reward hacking in AI systems, highlighting how hard it is to align utility functions with human preferences and how limited our understanding of neural network policies remains. They consider the open uncertainties and possible solutions in AI reward learning.
From this podcast

Machine Learning Street Talk (MLST)
#112 AVOIDING AGI APOCALYPSE - CONNOR LEAHY