Machine Learning Street Talk (MLST) avatar

Dexa/Machine Learning Street Talk (MLST)

Learn more

AI Reward Challenges

Tim and Connor discuss the challenges of reward hacking in AI systems, highlighting the complexities of aligning utility functions with human preferences and the limitations of understanding neural network policies. They ponder the uncertainties and potential solutions in navigating the evolving landscape of AI reward learning.
  • In this clip

  • From this podcast

    Machine Learning Street Talk (MLST) avatar

    Machine Learning Street Talk (MLST)

    #112 AVOIDING AGI APOCALYPSE - CONNOR LEAHY

  • Related Questions

    • Is reinforcement learning a turning point for large language models (LLMs) and artificial intelligence (AI)?

    • Is reinforcement learning a turning point for large language models (LLMs) and artificial intelligence (AI) as discussed in the episodes The Future of Machine Learning, Deep Learning and Computer Vision with Thomas Dietterich and Automating Scientific Discovery, and in the episode Exploring Open-Ended Algorithms: POET and the clip Evolving Learning Frameworks?

    • Is reinforcement learning a turning point for large language models (LLMs) and artificial intelligence (AI) as discussed in the episode Pieter Abbeel: Deep Reinforcement Learning | Lex Fridman Podcast #10 and the clip Hierarchical Learning Insights?

Built by
Charlie AI
© 2024 Machine Learning Street Talk (MLST)TermsPrivacySupport