LLM Evaluation Insights

Katarina shares her journey from studying psychology to becoming a principal data consultant, highlighting her growing passion for data design and analysis. She discusses the importance of leaderboards for evaluating both open-source and commercial large language models (LLMs), as well as the challenges and benefits of LLM evaluation benchmarks. The conversation also touches on the rich history of AI research at the University of Edinburgh, a key player in the field.