How Do AI Models Actually Think? - Laura Ruis

Topics covered
Popular Clips
Questions from this episode
- Asked by 139 people
- Asked by 109 people
- Asked by 108 people
- Asked by 84 people
- Asked by 80 people
- Asked by 72 people
- Asked by 69 people
- Asked by 63 people
- Asked by 62 people
- Asked by 61 people
- Asked by 61 people
- Asked by 47 people
- Asked by 42 people
Episode Highlights
Factual vs. Reasoning
The distinction between factual retrieval and reasoning tasks in language models is crucial for understanding their capabilities. explains that factual retrieval relies on specific documents, whereas reasoning tasks synthesize knowledge from multiple sources, demonstrating procedural knowledge 1. This synthesis allows models to perform complex tasks like arithmetic and linear equations, which require abstract reasoning 2.
The important point is that it is seemingly taking knowledge from many different documents and applying it to the same task.
---
Influence functions help analyze how pre-training data affects these reasoning steps, revealing that reasoning involves a more diffused approach compared to the focused retrieval of factual information 3.
  Â
Formal Reasoning
Exploring the potential for formal reasoning in language models, suggests that connectionist models can learn systematic rules and achieve high accuracy on novel problems 4. This capability indicates that models might handle symbolic computation, although challenges remain in dealing with entirely new tokens.
We have shown that they can do a form of systematicity or symbolic computation, although it's still limited.
---
The debate continues on whether scaling current approaches will yield better results or if new methods are needed to enhance data efficiency and adaptability 5.
  Â
Role of Code
The inclusion of code in training data significantly influences language models' reasoning abilities. notes that code provides a robust framework for models to learn step-by-step reasoning, enhancing their ability to generalize across tasks 6. This abstraction allows models to handle diverse expressions of the same problem, making them more adaptable.
It seems like the model can learn to do these step-by-step reasoning traces to output them from descriptions of procedures in code.
---
Interestingly, code influences reasoning both positively and negatively, highlighting the complexity of its role in model training 7.
Related Episodes


DOES AI HAVE AGENCY? With Professor. Karl Friston and Riddhi J. Pitliya
Answers 383 questions

Pattern Recognition vs True Intelligence - Francois Chollet
Answers 383 questions

#107 - Dr. RAPHAËL MILLIÈRE - Linguistics, Theory of Mind, Grounding
Answers 383 questions

Yoshua Bengio - Designing out Agency for Safe AI
Answers 383 questions
It's Not About Scale, It's About Abstraction - Francois Chollet
Answers 383 questions

Francois Chollet - ARC reflections - NeurIPS 2024
Answers 383 questions

Eliezer Yudkowsky and Stephen Wolfram on AI X-risk
Answers 383 questions

Prof. Murray Shanahan - Machines Don't Think Like Us
Answers 383 questions

#64 Prof. Gary Marcus 3.0
Answers 383 questions

How AI Could Be A Mathematician's Co-Pilot by 2026 (Prof. Swarat Chaudhuri)
Answers 383 questions

Jurgen Schmidhuber on Humans co-existing with AIs
Answers 383 questions

Daniel Franzen & Jan Disselhoff - ARC Prize 2024 winners
Answers 383 questions

Jürgen Schmidhuber - Neural and Non-Neural AI, Reasoning, Transformers, and LSTMs
Answers 383 questions

#57 - Prof. Melanie Mitchell - Why AI is harder than we think
Answers 383 questions
