Automating Model Optimization

Gennady discusses the complexities of optimizing models for different hardware configurations, emphasizing the trade-offs between memory and compute. He highlights the importance of automation in this process, sharing how his team quickly adapted to new models like Llama 2 and Llama 3, showcasing their ability to streamline optimization in less than a day. This efficiency not only saves time but also allows for more flexible and rapid deployment of advanced models.

In this clip
From this podcast
Machine Learning Street Talk (MLST)
Why Your GPUs Only Run at 10%! - CentML CEO Explains
Related Questions
- Have you seen any good techniques for automatically optimizing few-shot examples for large language models (LLMs)?

Automating Model Optimization

In this clip

From this podcast

Machine Learning Street Talk (MLST)

Why Your GPUs Only Run at 10%! - CentML CEO Explains

Related Questions

Have you seen any good techniques for automatically optimizing few-shot examples for large language models (LLMs)?