Meta's MusicGen was trained on over 20,000 hours of licensed audio, allowing it to replicate various styles without infringing on copyrights. By sourcing data from its own music initiative and prominent stock libraries, Meta ensures compliance while focusing on research rather than commercial use. The removal of vocals from training data further protects artists' identities, highlighting a careful approach to AI development in the music industry.