Reporting Uncertainty

Emphasizing the importance of reporting uncertainties, Rishabh discusses how confidence intervals complement distributions in reinforcement learning experiments. He introduces a novel metric that calculates the probability of one algorithm outperforming another, highlighting that this probability itself is a random variable subject to uncertainty. This approach encourages a deeper understanding of the variability in algorithm performance.