Published Jan 27, 2023
648: VALL-E: Uncannily Realistic Voice Imitation from a 3-Second Clip — with Jon Krohn
Jon Krohn delves into the revolutionary text-to-speech model VALL-E by Microsoft, which can flawlessly imitate voices from a mere three-second audio clip, exploring its technological breakthroughs alongside potential ethical implications and security challenges.

Related Episodes

652: A.I. Speech for the Speechless — with Jon Krohn (@JonKrohnLearns)
720: OpenAI’s DALL-E 3, Image Chat and Web Search — with Jon Krohn (@JonKrohnLearns)
SDS 624: Imagen Video: Incredible Text-to-Video Generation — with @JonKrohnLearns
840: Delicate Viticultural Robotics — with Jon Krohn (@JonKrohnLearns)
832: The Anthropic CEO’s Techno-Utopia — with Jon Krohn (@JonKrohnLearns)
750: How AI is Transforming Science — with Jon Krohn (@JonKrohnLearns)

823: Virtual Humans and AI Clones — with Natalie Monbiot

808: In Case You Missed It in July 2024 — with Jon Krohn (@JonKrohnLearns)

852: In Case You Missed It in December 2024 — with Jon Krohn (@JonKrohnLearns)
670: LLaMA: GPT-3 performance, 10x smaller — with Jon Krohn (@JonKrohnLearns)
SDS 570: DALL-E 2: Stunning Photorealism from Any Text Prompt — with Jon Krohn
SDS 464: A.I. vs Machine Learning vs Deep Learning — with Jon Krohn

