Data Centric AI
A groundbreaking multilingual spoken word corpus has emerged, marking a significant advancement in open-source datasets for 46 languages. The focus is shifting towards data-centric AI, emphasizing the importance of dataset quality and manipulation in enhancing model accuracy. Looking ahead, there’s a desire to witness innovative applications of these datasets in various fields, from audio denoising to traditional speech recognition.In this clip
From this podcast

The AI Podcast
MLCommons’ David Kanter, NVIDIA’s Daniel Galvez on Publicly Accessible Datasets - Ep. 167
Related Questions