Speech Recognition Scale

Discover how much data a functional speech recognition model needs and how that varies by language and application. Josh explains how even low-resource languages can build on existing models, while larger-vocabulary tasks may require around 2,000 hours of audio data. Remy discusses the availability of data for Kinyarwanda, a well-documented language with abundant resources.