Published Sep 20, 2024

820: OpenAI's o1 "Strawberry" Models — with Jon Krohn (@JonKrohnLearns)

Jon Krohn delves into the groundbreaking capabilities of OpenAI's o1 "Strawberry" models, discussing their superior problem-solving skills, safety advancements, and real-world applications in AI development.

Episode Highlights

Safety Measures

OpenAI has implemented significant safety measures in its new o1 models. Jon highlights the advances in resisting jailbreaking attempts: the o1 model scored 84 out of 100, compared to GPT-4o's 22 out of 100 [1]. This improvement shifts successful jailbreaks from usually possible to relatively rare.

OpenAI claims to have developed a new training approach that leverages the model's reasoning capabilities to better adhere to safety and alignment guidelines.

These measures are crucial as the capabilities of AI models continue to grow, making safety a top priority for developers [1].

Alignment Strategies

Alignment strategies are essential to ensure AI models adhere to ethical guidelines. Jon explains that OpenAI's new training approach leverages the model's reasoning capabilities to enhance safety and alignment [1]. This approach allows the model to review its own intermediate steps for errors, improving its ability to resist misuse.

I really do believe that this is the state of the art now available today, given these terrific capabilities.

Such strategies are vital because they help mitigate the risks associated with advanced AI models, ensuring they operate within intended ethical boundaries [1].

Related Episodes

812: The AI Scientist: Towards Fully Automated, Open-Ended Scientific Discovery — with Jon Krohn
740: Q*: OpenAI's Rumored AGI Breakthrough — with @JonKrohnLearns
SDS 438: Artificial General Intelligence — with Jon Krohn
750: How AI is Transforming Science — with Jon Krohn (@JonKrohnLearns)
SDS 620: OpenAI Whisper: General-Purpose Speech Recognition — with @JonKrohnLearns
720: OpenAI’s DALL-E 3, Image Chat and Web Search — with Jon Krohn (@JonKrohnLearns)
832: The Anthropic CEO’s Techno-Utopia — with Jon Krohn (@JonKrohnLearns)
818: In Case You Missed It in August 2024 — with Jon Krohn (@JonKrohnLearns)
794: Exciting (and Frightening!) Trends in Open-Source AI — with Jon Krohn (@JonKrohnLearns)
852: In Case You Missed It in December 2024 — with Jon Krohn (@JonKrohnLearns)
774: RFM-1 Gives Robots Human-like Reasoning and Conversation Abilities — with @JonKrohnLearns
SDS 464: A.I. vs Machine Learning vs Deep Learning — with Jon Krohn
808: In Case You Missed It in July 2024 — with Jon Krohn (@JonKrohnLearns)
SDS 584: OpenAI Codex — with Jon Krohn
802: In Case You Missed It in June 2024 — with Jon Krohn (@JonKrohnLearns)

Dexa/Super Data Science: ML & AI Podcast with Jon Krohn

Popular Clips

Hot Dogs and Derivatives

AI Model Breakthrough

AI Model Advancements

AI Safety Advances

AI Model Advancements

Impressive AI Problem Solving

Thinking Time Revolution

Interactive Neural Network

Model Capabilities

Jon Krohn explores the advanced reasoning capabilities of OpenAI's o1-preview model, showcasing its potential to revolutionize generative AI. He highlights its performance in complex problem-solving, mathematical excellence, and comparisons to PhD-level benchmarks.

Safety and Alignment

Jon Krohn discusses the safety measures and alignment strategies implemented in OpenAI's new o1 models. He highlights the advancements in resisting jailbreaking attempts and the importance of adhering to ethical guidelines.

User Access and Applications

Jon Krohn explores the access points and practical applications of OpenAI's o1 models, highlighting their advanced capabilities and potential impact on AI development.