Reinforcement Learning Discussion
Edward discusses the benefits and challenges of Reinforcement Learning compared to preference-based methods. Tim shares insights on when RLHF might be more suitable, emphasizing the importance of stable signals and scalability in data acquisition.In this clip
From this podcast

Machine Learning Street Talk (MLST)
#103 - Prof. Edward Grefenstette - Language, Semantics, Philosophy
Related Questions