Unlikely Trajectories
Heinrich discusses manipulating RNG in games to create unlikely trajectories, paralleling it to reinforcement learning algorithms. Lukas and Tim explore training basic reinforcement algorithms on Nethack and the challenges of optimizing for winning in such environments.In this clip
From this podcast

Gradient Dissent - A Machine Learning Podcast
Tim & Heinrich — Democraticizing Reinforcement Learning Research
Related Questions