797: Deep Learning Classics and Trends — with Dr. Rosanne Liu

Topics covered
Popular Clips
Episode Highlights
Intrinsic Dimensions
Dr. discusses the concept of intrinsic dimensions in deep learning, a foundational idea that has influenced modern techniques like LoRA. She explains how intrinsic dimensions help measure the difficulty of a task when combined with a network, offering a scientific approach to understanding task complexity 1. This concept has been pivotal in developing parameter-efficient fine-tuning methods, allowing large language models to be trained more effectively without overfitting 2. reflects on the impact of her work, noting that it has inspired significant advancements in the field 3.
The dimension of training, usually people think of it as the number of parameters, because however many parameters you have, you can move alongside each parameter.
---
Her insights into intrinsic dimensions continue to shape the landscape of machine learning research.
Training Dynamics
In exploring training dynamics, emphasizes the complexity of understanding neural networks' internal representations. She argues that as models become more powerful, they become harder to interpret, akin to understanding complex human emotions compared to simpler animal behaviors 4. This complexity necessitates a balance between curiosity-driven and goal-driven research, allowing for both innovative exploration and structured advancements 5. also highlights the importance of fostering research growth through practice and community support, particularly for those without access to traditional labs 6.
Understanding is like, you push forward, you can always understand more, because there's nothing in this world we can confidently say that we understand 100%.
---
Her work underscores the ongoing challenge of interpreting and improving AI models.
LLM Evaluations
Evaluating large language models (LLMs) presents unique challenges, as explains through her involvement with the Big Bench project. This initiative aims to crowdsource benchmark tests, allowing the community to contribute tasks that assess model capabilities comprehensively 7. However, notes the difficulties in maintaining neutrality and preventing models from overfitting to specific evaluations 8. The dynamic nature of evaluation metrics requires constant innovation to ensure they remain relevant and unbiased 9.
It's possible to organize over 100 institutions and 400 authors.
---
Her insights highlight the importance of transparent and adaptive evaluation processes in AI research.
Related Episodes


SDS 549: Engineering Natural Language Models — with Lauren Zhu
Answers 383 questions

661: Designing Machine Learning Systems — with Chip Huyen
Answers 383 questions

706: Large Language Model Leaderboards and Benchmarks — with Caterina Constantinescu
Answers 383 questions

767: Open-Source LLM Libraries and Techniques — with Dr. Sebastian Raschka
Answers 383 questions

SDS 439: Deep Learning for Machine Vision — with Deblina Bhattacharjee
Answers 383 questions
SDS 551: Deep Reinforcement Learning — with Wah Loon Keng
Answers 383 questions

SDS 503: Deep Reinforcement Learning for Robotics — with Pieter Abbeel
Answers 383 questions
SDS 558: @JonKrohnLearns's Answers to Questions on Machine Learning
Answers 383 questions

679: The A.I. and Machine Learning Landscape — with investor George Mathew
Answers 383 questions

695: NLP with Transformers — with Hugging Face's Lewis Tunstall
Answers 383 questions
SDS 464: A.I. vs Machine Learning vs Deep Learning — with Jon Krohn
Answers 383 questions

SDS 583: The State of Natural Language Processing — with Rongyao Huang
Answers 383 questions

817: The Positron IDE, Tidy NLP and MLOps — with Dr. Julia Silge
Answers 383 questions

627: AutoML: Automated Machine Learning — with Erin LeDell
Answers 383 questions

649: Introduction to Machine Learning — with Kirill Eremenko and Hadelin de Ponteves
Answers 383 questions













