LLM Evaluation Insights

Katarina shares her journey from studying psychology to becoming a principal data consultant, highlighting her growing passion for data design and analysis. She discusses the importance of leaderboards for evaluating both open source and commercial large language models, as well as the challenges and benefits of LLM evaluation benchmarks. The conversation also touches on the rich history of AI research at the University of Edinburgh, a key player in the field.

In this clip
From this podcast
Super Data Science: ML & AI Podcast with Jon Krohn
706: Large Language Model Leaderboards and Benchmarks — with Caterina Constantinescu
Related Questions
- What is this clip about?
- What is the main topic of this clip?

LLM Evaluation Insights

In this clip

From this podcast

Super Data Science: ML & AI Podcast with Jon Krohn

706: Large Language Model Leaderboards and Benchmarks — with Caterina Constantinescu

Related Questions

What is this clip about?

What is the main topic of this clip?