Published Jan 27, 2023

648: VALL-E: Uncannily Realistic Voice Imitation from a 3-Second Clip — with Jon Krohn

Jon Krohn delves into the revolutionary text-to-speech model VALL-E by Microsoft, which can flawlessly imitate voices from a mere three-second audio clip, exploring its technological breakthroughs alongside potential ethical implications and security challenges.
Episode Highlights
Super Data Science: ML & AI Podcast with Jon Krohn logo

Popular Clips

Episode Highlights

Related Episodes