AI Benchmarking Challenges

Lucas discusses the complexities of benchmarking in safety-critical environments, highlighting existing AI verification benchmarks and the contributions from both academia and industry. He introduces the MLLEAP project as a detailed framework for AI verification processes, emphasizing the importance of explainability in applications like runway sign classification for pilots.