Data Centric AI

A groundbreaking multilingual spoken word corpus has emerged, marking a significant advancement in open-source datasets for 46 languages. The focus is shifting towards data-centric AI, emphasizing the importance of dataset quality and manipulation in enhancing model accuracy. Looking ahead, there’s a desire to witness innovative applications of these datasets in various fields, from audio denoising to traditional speech recognition.

In this clip
From this podcast
The AI Podcast
MLCommons’ David Kanter, NVIDIA’s Daniel Galvez on Publicly Accessible Datasets - Ep. 167
Related Questions
- What is this clip about?
- What is the main topic of this clip?

Data Centric AI

In this clip

From this podcast

The AI Podcast

MLCommons’ David Kanter, NVIDIA’s Daniel Galvez on Publicly Accessible Datasets - Ep. 167

Related Questions

What is this clip about?

What is the main topic of this clip?