Published Nov 20, 2023

Visual Generative AI Ecosystem Challenges with Richard Zhang - 656

Richard Zhang, a senior research scientist at Adobe Research, delves into the multifaceted challenges of the visual generative AI ecosystem, covering deepfake detection, data attribution, model customization, and aligning AI outputs with human perception. He emphasizes the need for innovative tools and metrics to enhance the adaptability and effectiveness of generative AI technologies.

Episode Highlights

Topics covered

Episode Highlights

User Customization

Richard Zhang discusses the importance of enhancing user interactions with AI systems, emphasizing the need for dynamic and personalized customization. He highlights the limitations of current text-based interfaces and suggests a spectrum of interaction methods, such as style transfer and personal object integration, to provide users with more control over AI-generated content 1. Zhang explains that while text-to-image models like DALL-E 2 offer foundational capabilities, they lack the detailed control creators need 2.

We want to have some sort of permanent state that allows you to iterate with it kind of meaningfully.

---

He envisions a future where creators can iteratively refine AI outputs, making the process more intuitive and efficient.

Model Tools

Zhang introduces model customization tools like custom diffusion, which allow users to modify AI models by integrating personal content or removing specific concepts. This method, described as a form of network surgery, enables targeted changes without compromising the model's overall integrity 3. He also discusses the challenges of generalizing detection tools across different generative models, emphasizing the need for adaptable solutions to keep pace with evolving AI technologies 4.

We want to do the removal, but we also want to be careful. Like, we don't want to blow out, like, all the other types of painting styles that are in the model.

---

These advancements aim to empower users with greater control and flexibility in their interactions with AI systems.

Related Episodes

Generative AI on the Edge with Vinesh Sukumar - 623
Answers 383 questions
Gen AI at the Edge: Qualcomm AI Research at CVPR 2024 with Fatih Porikli - 688
Answers 383 questions
Deep Neural Nets for Visual Recognition with Matt Zeiler - #22
Answers 383 questions
Generating Ground-Level Images From Overhead Imagery Using GANs with Yi Zhu - TWiML Talk #172
Answers 383 questions
AI Sentience, Agency and Catastrophic Risk with Yoshua Bengio - 654
Answers 383 questions
Retinal Image Generation for Disease Discovery with Stephen Odaibo - TWIML Talk #284
Answers 383 questions
Accelerating Intelligence with AI-Generating Algorithms with Jeff Clune - 602
Answers 383 questions
Evolving MLOps Platforms for Generative AI and Agents with Abhijit Bose - 714
Answers 383 questions
Deep Learning for 3D Sensors and Cameras in Lighthouse with Alex Teichman - #103
Answers 383 questions
Scaling Deep Learning: Systems Challenges & More with Shubho Sengupta - #14
Answers 383 questions
Video as a Universal Interface for AI Reasoning with Sherry Yang - 676
Answers 383 questions
Trends in Machine Learning & Deep Learning with Zack Lipton - #334
Answers 383 questions
Runway Gen-2: Generative AI for Video Creation with Anastasis Germanidis - 622
Answers 383 questions
Modeling Human Behavior with Generative Agents with Joon Sung Park - 632
Answers 383 questions
Global AI Trends with Ben Lorica - #26
Answers 383 questions

Visual Generative AI Ecosystem Challenges with Richard Zhang - 656

Topics covered

Popular Clips

Episode Highlights

Detection and ForensicsRichard Zhang discusses the challenges of detecting deepfakes and the importance of data-driven approaches in improving AI forensics. He highlights the need for adaptable detection tools that can generalize across evolving generative methods.

Detection and Forensics

Contributor and Data ManagementRichard Zhang addresses the challenges of data attribution and concept removal in generative AI, focusing on the complexities of tracing training data influences and removing specific elements from models.

Contributor and Data Management

Control in Generative AIRichard Zhang explores the challenges and innovations in customizing interactions with visual generative AI systems. He highlights the need for dynamic user interfaces and model customization tools to enhance control and adaptability.

Control in Generative AI

User Customization

Model Tools

Perceptual Metrics

Related Episodes