Daniel Franzen & Jan Disselhoff - ARC Prize 2024 winners

Topics covered
Popular Clips
Questions from this episode
- Asked by 33 people
- Asked by 23 people
- Asked by 21 people
- Asked by 18 people
- Asked by 10 people
Episode Highlights
Model Adaptations
In the ARC Prize 2024, and explored model adaptations to enhance performance. They discovered that larger models, like a 32 billion parameter model, didn't necessarily outperform smaller ones due to the increased complexity and computational demands 1. Jan noted that models often struggled with tasks like counting or size estimation, but excelled in structured problems, indicating a nuanced understanding of problem-solving 2.
Transformers are just very, very good at learning facts. I think it's frankly incredible what they can store.
---
This adaptability suggests that continuous fine-tuning and learning from user interactions can significantly improve model efficiency over time 1.
Tokenization & Sampling
The team employed innovative tokenization and sampling methods, notably using Depth First Search (DFS) to enhance solution accuracy. explained that DFS helped mitigate errors by generating solutions above a certain probability threshold, thus ensuring the most likely outcomes were considered 3. highlighted the challenge of predicting computational budgets for DFS, noting that iterative deepening could be a viable strategy to manage resources effectively 4.
The space of solutions the LLM generates is much, much smaller than the theoretical large space because a lot of the pixels are trivial.
---
This approach underscores the importance of balancing computational efficiency with solution accuracy, especially in complex problem-solving scenarios 5.
Related Episodes


Francois Chollet - ARC reflections - NeurIPS 2024
Answers 383 questions

New 50% ARC result and current winners interviewed
Answers 383 questions

How Do AI Models Actually Think? - Laura Ruis
Answers 383 questions
It's Not About Scale, It's About Abstraction - Francois Chollet
Answers 383 questions

Ryan Greenblatt - Solving ARC with GPT4o
Answers 383 questions

#114 - Secrets of Deep Reinforcement Learning (Minqi Jiang)
Answers 383 questions

Robert Lange on NN Pruning and Collective Intelligence
Answers 383 questions

Pattern Recognition vs True Intelligence - Francois Chollet
Answers 383 questions

Subbarao Kambhampati - Do o1 models search?
Answers 383 questions

Decompiling Dreams: A New Approach to ARC? - Alessandro Palmarini
Answers 383 questions

Nicholas Carlini (Google DeepMind)
Answers 383 questions

Dr. Paul Lessard - Categorical/Structured Deep Learning
Answers 383 questions

Jürgen Schmidhuber - Neural and Non-Neural AI, Reasoning, Transformers, and LSTMs
Answers 383 questions

Sepp Hochreiter - LSTM: The Comeback Story?
Answers 383 questions

#106 - Prof. KARL FRISTON 3.0 - Collective Intelligence [Special Edition]
Answers 383 questions
