Evaluating Machine Learning

Lukas and Harrison discuss the challenges of evaluating machine learning models, touching on using intuition ("vibes") for testing and the potential for automating evaluation processes using language models.