Published Feb 12, 2025

Daniel Franzen & Jan Disselhoff - ARC Prize 2024 winners

Explore the pioneering achievements of ARC Prize 2024 winners Daniel Franzen and Jan Disselhoff, as they delve into their breakthrough use of large language models, achieving unprecedented accuracy with advanced token selection and novel validation techniques, while balancing model efficiency and size.

Episode Highlights

Topics covered

Solution Strategy
01
LLM Performance
02

Questions from this episode

- Will large language models scale all the way to artificial general intelligence (AGI)?
  Asked by 143 people
- Tell me something unique about large language models in the context of the episode "Stephen Wolfram: ChatGPT and the Nature of Truth, Reality & Computation | Lex Fridman Podcast #376" and the clip "Language Model Limitations."
  Asked by 33 people
- What's next in large language models (LLMs)?
  Asked by 33 people
- Is reinforcement learning a turning point for large language models (LLMs) and artificial intelligence (AI)?
  Asked by 29 people
- What are the most recent developments and trends in AI, large language models (LLMs), retrieval-augmented generation (RAG), etc., as discussed in the episode #159 - Inflection-2.5, Devin, OpenAI board update, SIMA, EU AI Act passed and the clip Cohere's Command R?
  Asked by 28 people
- Tell me something unique about large language models in the context of the episode Stephen Wolfram: ChatGPT and the Nature of Truth, Reality & Computation | Lex Fridman Podcast #376 and the clip Language Model Limitations
  Asked by 24 people
- What techniques are used with large language models (LLMs)?
  Asked by 23 people
- What are the next advancements in large language models (LLMs)?
  Asked by 21 people
- How do you leverage different models in machine learning?
  Asked by 18 people
- Tell me something unique about large language models in the context of the episode 972: Mustafa Suleyman | Navigating The 21st Century's Greatest Dilemma and the clip Language Models' Predictive Power
  Asked by 15 people
- What is the best insight on prompt engineering and engaging large language models (LLMs)?
  Asked by 11 people
- What are the pros of language models?
  Asked by 10 people
- What do you think about the potential for Large Language Models (LLMs) to scale to Artificial General Intelligence (AGI) as discussed in the episode Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI | Lex Fridman Podcast #333 and the clip Path to AGI?
  Asked by 8 people
- Will large language models scale all the way to artificial general intelligence (AGI) as discussed in the episode MEGATHREAT: The Dangers Of AI Are WEIRDER Than You Think! | Yoshua Bengio and the clip Neural Networks Explained?
  Asked by 8 people
- What do you think about the potential for Large Language Models (LLMs) to scale to Artificial General Intelligence (AGI) as discussed in the episode EP 434: Will OpenAI run away in the LLM race in 2025? and the clip Future of AI Models?
  Asked by 6 people

Episode Highlights

Model Adaptations

In the ARC Prize 2024, and explored model adaptations to enhance performance. They discovered that larger models, like a 32 billion parameter model, didn't necessarily outperform smaller ones due to the increased complexity and computational demands 1. Jan noted that models often struggled with tasks like counting or size estimation, but excelled in structured problems, indicating a nuanced understanding of problem-solving 2.

Transformers are just very, very good at learning facts. I think it's frankly incredible what they can store.

---

This adaptability suggests that continuous fine-tuning and learning from user interactions can significantly improve model efficiency over time 1.

Tokenization & Sampling

The team employed innovative tokenization and sampling methods, notably using Depth First Search (DFS) to enhance solution accuracy. explained that DFS helped mitigate errors by generating solutions above a certain probability threshold, thus ensuring the most likely outcomes were considered 3. highlighted the challenge of predicting computational budgets for DFS, noting that iterative deepening could be a viable strategy to manage resources effectively 4.

The space of solutions the LLM generates is much, much smaller than the theoretical large space because a lot of the pixels are trivial.

---

This approach underscores the importance of balancing computational efficiency with solution accuracy, especially in complex problem-solving scenarios 5.

Related Episodes

Francois Chollet - ARC reflections - NeurIPS 2024
Answers 383 questions
New 50% ARC result and current winners interviewed
Answers 383 questions
How Do AI Models Actually Think? - Laura Ruis
Answers 383 questions
It's Not About Scale, It's About Abstraction - Francois Chollet
Answers 383 questions
Ryan Greenblatt - Solving ARC with GPT4o
Answers 383 questions
#114 - Secrets of Deep Reinforcement Learning (Minqi Jiang)
Answers 383 questions
Robert Lange on NN Pruning and Collective Intelligence
Answers 383 questions
Pattern Recognition vs True Intelligence - Francois Chollet
Answers 383 questions
Subbarao Kambhampati - Do o1 models search?
Answers 383 questions
Decompiling Dreams: A New Approach to ARC? - Alessandro Palmarini
Answers 383 questions
Nicholas Carlini (Google DeepMind)
Answers 383 questions
Dr. Paul Lessard - Categorical/Structured Deep Learning
Answers 383 questions
Jürgen Schmidhuber - Neural and Non-Neural AI, Reasoning, Transformers, and LSTMs
Answers 383 questions
Sepp Hochreiter - LSTM: The Comeback Story?
Answers 383 questions
#106 - Prof. KARL FRISTON 3.0 - Collective Intelligence [Special Edition]
Answers 383 questions

Daniel Franzen & Jan Disselhoff - ARC Prize 2024 winners

Topics covered

Popular Clips

Questions from this episode

Episode Highlights

Solution StrategyDaniel Franzen and Jan Disselhoff, winners of the ARC Prize 2024, reveal their groundbreaking use of large language models to achieve a 53.5% accuracy. Their innovative techniques include depth-first search for token selection and a novel augmentation-based validation system.

Solution Strategy

LLM PerformanceDaniel Franzen and Jan Disselhoff discuss their innovative model adaptations and tokenization strategies that led to their ARC Prize 2024 victory. Their work highlights the balance between model size, computational efficiency, and solution accuracy.

LLM Performance

Model Adaptations

Tokenization & Sampling

Related Episodes