Speech Recognition Scale
Discover the scale of data needed for functional speech recognition models and how it varies based on language and application. Josh explains how even low resource languages can utilize existing models, while larger vocabulary tasks may require around 2000 hours of audio data. Remy discusses the availability of data for Kenya Rwanda, a well-documented language with abundant resources.In this clip
From this podcast

Practical AI
Speech tech and Common Voice at Mozilla
Related Questions