Published Jan 13, 2025

Evolving MLOps Platforms for Generative AI and Agents with Abhijit Bose - 714

Abhijit Bose of Capital One delves into the evolution of their Generative AI platform, spotlighting a platform-centric approach that marries centralized governance with cutting-edge AI tools and Kubernetes integration, enhancing flexibility and resource optimization for enterprise operations and customer service.

Episode Highlights

Topics covered

Questions from this episode

- What role can AI play in various fields?
  Asked by 91 people
- What is the future of AI?
  Asked by 69 people
- What can we expect in the future of AI?
  Asked by 67 people
- What's the future of generative AI as discussed in the episode 888: Marc Andreessen | Exploring the Power, Peril, and Potential of AI and the clip The Future of AI Entertainment?
  Asked by 63 people
- How can we integrate AI into our business processes?
  Asked by 55 people
- What's the potential impact of AI in the future?
  Asked by 54 people
- What opportunities does AI open up?
  Asked by 45 people
- What is the current news around Generative AI?
  Asked by 41 people
- What's the future of generative AI?
  Asked by 40 people
- How will AI change software?
  Asked by 38 people
- What are some hot takes on web-based AI agents from the episode Tektonic AI Secures $10M for GenAI Business Automation and the clip AI Automation Insights?
  Asked by 36 people
- What's next in large language models (LLMs)?
  Asked by 33 people
- How is machine learning evolving?
  Asked by 31 people
- How are generative AI use cases evolving for code and developer productivity in the episode Supercharging Developer Productivity with ChatGPT and Claude with Simon Willison - 701 and the clip Daily AI Tools?
  Asked by 30 people
- What are some applications of language models?
  Asked by 27 people

Episode Highlights

Kubernetes

The integration of Kubernetes has significantly enhanced AI platform development at Capital One. explains that their robust platform control plane, based on Kubernetes, allows for flexibility in incorporating various tools and services, including those from AWS and open-source communities 1. This flexibility has been crucial in extending their machine learning platform to support generative AI use cases, enabling rapid adaptation and innovation. highlights the complexity of data annotation in generative AI compared to traditional machine learning, emphasizing the need for refined capabilities and tools 2.

Observability

Enhancing observability tools is vital for managing the complexities of generative AI applications. notes that while traditional machine learning requires solid monitoring for model drift and input features, generative AI introduces new challenges like LLM hallucinations, necessitating advanced guardrails and logging systems 3. These enhancements ensure proper governance and execution of agentic workflows, making observability not just important but complex. emphasizes leveraging existing anomaly detection algorithms and extending them to handle new data types, ensuring comprehensive monitoring across platforms 4.

Inference Optimization

Optimizing inference efficiency is a critical focus at Capital One, with efforts to reduce costs and latency from the outset. shares that maintaining low cost per token and latency are key performance indicators, requiring continuous optimization of GPU utilization and other techniques 5. This involves leveraging both proprietary and open-source tools to enhance inference workflows, ensuring effective deployment of fine-tuned models. highlights the collaboration between science and engineering teams to integrate advanced techniques like quantization and speculative decoding into their inference systems 6.

Related Episodes

Feature Platforms for Data-Centric AI with Mike Del Balso - #577
Answers 383 questions
Machine Learning Platforms at Uber with Mike Del Balso - #115
Answers 383 questions
The Evolution of the NLP Landscape with Oren Etzioni - #598
Answers 383 questions
Deploying Edge and Embedded AI Systems with Heather Gorr - 655
Answers 383 questions
Generative AI on the Edge with Vinesh Sukumar - 623
Answers 383 questions
Compositional ML and the Future of Software Development with Dillon Erb - #520
Answers 383 questions
Jupyter and the Evolution of ML Tooling with Brian Granger - #544
Answers 383 questions
Interactive Machine Learning Systems with Alekh Agarwal - #17
Answers 383 questions
Transforming Oil & Gas with AI with Adi Bhashyam and Daniel Jeavons - TWIML Talk #279
Answers 383 questions
AutoML for Natural Language Processing with Abhishek Thakur - #475
Answers 383 questions
Building Foundational ML Platforms with Kubernetes and Kubeflow with Ali Rodell - #595
Answers 383 questions
Evolving AI Systems Gracefully with Stefano Soatto - #502
Answers 383 questions
Scaling AI for the Enterprise with Mazin Gilbert - #78
Answers 383 questions
Live from TWIMLcon! Operationalizing ML at Scale with Hussein Mehanna - #306
Answers 383 questions
Scaling Deep Learning: Systems Challenges & More with Shubho Sengupta - #14
Answers 383 questions

Evolving MLOps Platforms for Generative AI and Agents with Abhijit Bose - 714

Topics covered

Popular Clips

Questions from this episode

Episode Highlights

Platform-Centric AI Strategy

MLOps Platform EvolutionCapital One's AI platforms have evolved to incorporate Kubernetes, enhancing flexibility and innovation in generative AI applications. Abhijit Bose discusses the importance of observability and inference optimization in managing these complex systems.

MLOps Platform Evolution

Kubernetes

Observability

Inference Optimization

GenAI Workflows and ApplicationsAgentic workflows are transforming enterprise operations by automating tasks and enhancing efficiency. Abhijit Bose discusses how Capital One is leveraging AI to improve customer service and adapt to the rapidly evolving AI landscape.

GenAI Workflows and Applications

Related Episodes