Uniting Multimodality

Letitia shares her journey of merging computer vision with linguistic text, driven by her passion for both fields. She discusses the challenges of comparing evolving models and the importance of personal interest in driving research forward.