Video generation with realistic motion

Topics covered
Popular Clips
Questions from this episode
- Asked by 124 people
- Asked by 114 people
- Asked by 96 people
- Asked by 90 people
- Asked by 87 people
- Asked by 84 people
- Asked by 74 people
- Asked by 72 people
- Asked by 67 people
- Asked by 65 people
- Asked by 63 people
- Asked by 49 people
- Asked by 43 people
Episode Highlights
Data Challenges
Video generation models face significant data and computational challenges, primarily due to the immense volume of video data compared to images or text. explains that training these models requires massive datasets, often in the petabytes, which poses a barrier for new entrants without specialized expertise 1. He highlights the importance of sourcing high-quality motion data, as most online videos lack dynamic movement, which is crucial for teaching models about physics and realism 2.
The goal with video models is to learn physics and realism and the laws that govern our world.
---
The challenge extends to curating datasets that can effectively teach these base rules, making it a non-trivial task for developers.
  Â
Realistic Motion
Achieving realistic motion in video generation is a complex technical challenge that requires significant computational resources. notes that training a video model is akin to managing a million-token context window in language models, demanding extensive GPU power 3. He emphasizes the need for models to understand the laws of reality, such as ensuring a character drinks water correctly from a glass, which is a test of the model's grasp of physics 4.
It's a Jedi mind trick. Like you just cannot. You should not be able to do that.
---
These challenges highlight the importance of balancing model size and capability to ensure accessibility without compromising on realism.
Related Episodes


AI-driven automation in manufacturing
Answers 383 questions

Creating tested, reliable AI applications
Answers 383 questions

Generative models: exploration to deployment
Answers 383 questions

AI in healthcare, synthesizing dance moves, hardware acceleration
Answers 383 questions

Generating the future of art & entertainment
Answers 383 questions

Generating product imagery at Shopify
Answers 383 questions

Making GANs practical
Answers 383 questions

Data science for intuitive user experiences
Answers 383 questions
AI is more than GenAI
Answers 383 questions

Photonic computing for AI acceleration
Answers 383 questions

From ML to AI to Generative AI
Answers 383 questions

Productionizing AI at LinkedIn
Answers 383 questions

The perplexities of information retrieval
Answers 383 questions

Visualizing and understanding RNNs
Answers 383 questions

2019's AI top 5
Answers 383 questions
