Kirill explains the concept of random forests through a relatable analogy, emphasizing how averaging multiple guesses can yield accurate predictions. He highlights the randomness in both the data subsets and feature selection for each decision tree, which helps combat overfitting and enhances model performance. This discussion reveals the intricacies of ensemble methods and their effectiveness in machine learning.