Dexa/Machine Learning Street Talk (MLST)

Intelligent Alignment Theories

Connor discusses the concept of corrigibility in building agents that strive for alignment. The discussion delves into the potential behavior of highly intelligent agents and the implications of their utility functions. The chapter transitions into a deep dive into decision theory.

In this clip
From this podcast
Machine Learning Street Talk (MLST)
AI Alignment & AGI Fire Alarm - Connor Leahy
Related Questions
- Can you explain more about the Orthogonality Thesis as discussed in the episode George Hotz vs Eliezer Yudkowsky AI Safety Debate and the clip Intelligence and Morality that links to the idea of AI alignment?
- What does Jake Heller say about how to build AI agents?