820: OpenAI's o1 "Strawberry" Models — with Jon Krohn (@JonKrohnLearns)

Topics covered
Popular Clips
Episode Highlights
Safety Measures
OpenAI has implemented significant safety measures in their new o1 models. highlights the advancements in resisting jailbreaking attempts, with the o1 model scoring 84 out of 100 compared to GBT-4's 22 out of 100 1. This improvement flips the probability of successful jailbreaks from usually possible to relatively rare.
OpenAI claims to have developed a new training approach that leverages the model's reasoning capabilities to better adhere to safety and alignment guidelines.
---
These measures are crucial as the capabilities of AI models continue to grow, making safety a top priority for developers 1.
Alignment Strategies
Alignment strategies are essential to ensure AI models adhere to ethical guidelines. Jon explains that OpenAI's new training approach leverages the model's reasoning capabilities to enhance safety and alignment 1. This approach allows the model to review its own intermediate steps for errors, improving its ability to resist misuse.
I really do believe that this is the state of the art now available today, given these terrific capabilities.
---
Such strategies are vital as they help mitigate risks associated with advanced AI models, ensuring they operate within intended ethical boundaries 1.
Related Episodes

740: Q*: OpenAI's Rumored AGI Breakthrough — with @JonKrohnLearns
Answers 383 questions
SDS 438: Artificial General Intelligence — with Jon Krohn
Answers 383 questions
750: How AI is Transforming Science — with Jon Krohn (@JonKrohnLearns)
Answers 383 questions
SDS 620: OpenAI Whisper: General-Purpose Speech Recognition — with @JonKrohnLearns
Answers 383 questions
720: OpenAI’s DALL-E 3, Image Chat and Web Search — with Jon Krohn (@JonKrohnLearns)
Answers 383 questions
832: The Anthropic CEO’s Techno-Utopia — with Jon Krohn (@JonKrohnLearns)
Answers 383 questions
818: In Case You Missed It in August 2024 — with Jon Krohn (@JonKrohnLearns)
Answers 383 questions

852: In Case You Missed It in December 2024 — with Jon Krohn (@JonKrohnLearns)
Answers 383 questions
SDS 464: A.I. vs Machine Learning vs Deep Learning — with Jon Krohn
Answers 383 questions

808: In Case You Missed It in July 2024 — with Jon Krohn (@JonKrohnLearns)
Answers 383 questions
SDS 584: OpenAI Codex — with Jon Krohn
Answers 383 questions

802: In Case You Missed It in June 2024 — with Jon Krohn (@JonKrohnLearns)
Answers 383 questions







