Published Jan 20, 2025

How Do AI Models Actually Think? - Laura Ruis

Researcher Laura Ruis discusses AI agency, reasoning, and the challenges of scaling. She covers control, ethical implications, and how symbolic computation within connectionist frameworks could strengthen procedural knowledge and formal reasoning in language models.

Episode Highlights

Factual vs. Reasoning

The distinction between factual retrieval and reasoning tasks in language models is crucial for understanding their capabilities. Laura Ruis explains that factual retrieval relies on specific documents, whereas reasoning tasks synthesize knowledge from multiple sources, demonstrating procedural knowledge. This synthesis allows models to perform tasks such as arithmetic and solving linear equations, which require abstract reasoning.

The important point is that it is seemingly taking knowledge from many different documents and applying it to the same task.

---

Influence functions help analyze how pre-training data affects these reasoning steps, revealing that reasoning draws on a broad, diffuse set of documents, whereas factual retrieval depends on a few specific ones.
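The focused-versus-diffuse contrast can be illustrated with a toy sketch. This is not the method used in the research discussed in the episode: classical influence functions involve an inverse-Hessian term, and the code below swaps in a much simpler first-order proxy, the dot product between a training example's loss gradient and the query's loss gradient. The linear model, the data, and every function name here are hypothetical and chosen only for illustration.

```python
# Toy sketch: gradient-similarity influence for a linear model with squared loss.
# All names and data are hypothetical; this is a simplified proxy, not the
# influence-function method discussed in the episode.

def loss_gradient(weights, features, target):
    """Gradient of 0.5 * (w . x - y)^2 with respect to w."""
    error = sum(w * x for w, x in zip(weights, features)) - target
    return [error * x for x in features]


def influence_scores(weights, train_examples, query_example):
    """Score each training example by the dot product of its loss gradient
    with the query's loss gradient (a first-order proxy for influence)."""
    query_grad = loss_gradient(weights, *query_example)
    scores = []
    for features, target in train_examples:
        train_grad = loss_gradient(weights, features, target)
        scores.append(sum(a * b for a, b in zip(train_grad, query_grad)))
    return scores


def top_k_share(scores, k=1):
    """Fraction of total absolute influence held by the k largest scores."""
    magnitudes = sorted((abs(s) for s in scores), reverse=True)
    total = sum(magnitudes)
    return sum(magnitudes[:k]) / total if total else 0.0


weights = [0.0, 0.0, 0.0]
train = [([1.0, 0.0, 0.0], 1.0), ([0.0, 1.0, 0.0], 1.0), ([0.0, 0.0, 1.0], 1.0)]

# "Retrieval-like" query: only one training example shares its feature.
focused = influence_scores(weights, train, ([1.0, 0.0, 0.0], 1.0))

# "Reasoning-like" query: it overlaps with every training example.
diffuse = influence_scores(weights, train, ([1.0, 1.0, 1.0], 3.0))

print(top_k_share(focused))
print(top_k_share(diffuse))
```

In this toy setup the top example accounts for all of the influence on the retrieval-like query (a share of 1.0) but only about a third of it on the reasoning-like query, where every training example contributes equally.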

Formal Reasoning

Exploring the potential for formal reasoning in language models, Laura Ruis suggests that connectionist models can learn systematic rules and achieve high accuracy on novel problems. This capability indicates that models might handle symbolic computation, although challenges remain in dealing with entirely new tokens.

We have shown that they can do a form of systematicity or symbolic computation, although it's still limited.

---

The debate continues on whether scaling current approaches will yield better results or whether new methods are needed to improve data efficiency and adaptability.

Role of Code

The inclusion of code in training data significantly influences language models' reasoning abilities. Laura Ruis notes that code provides a robust framework for models to learn step-by-step reasoning, enhancing their ability to generalize across tasks. This abstraction allows models to handle diverse expressions of the same problem, making them more adaptable.

It seems like the model can learn to do these step-by-step reasoning traces to output them from descriptions of procedures in code.

---

Interestingly, code influences reasoning both positively and negatively, highlighting the complexity of its role in model training.

Related Episodes

- #84 LAURA RUIS - Large language models are not zero-shot communicators [NEURIPS UNPLUGGED]
- DOES AI HAVE AGENCY? With Professor. Karl Friston and Riddhi J. Pitliya
- Pattern Recognition vs True Intelligence - Francois Chollet
- #107 - Dr. RAPHAËL MILLIÈRE - Linguistics, Theory of Mind, Grounding
- Yoshua Bengio - Designing out Agency for Safe AI
- It's Not About Scale, It's About Abstraction - Francois Chollet
- Francois Chollet - ARC reflections - NeurIPS 2024
- Eliezer Yudkowsky and Stephen Wolfram on AI X-risk
- Prof. Murray Shanahan - Machines Don't Think Like Us
- #64 Prof. Gary Marcus 3.0
- How AI Could Be A Mathematician's Co-Pilot by 2026 (Prof. Swarat Chaudhuri)
- Jurgen Schmidhuber on Humans co-existing with AIs
- Daniel Franzen & Jan Disselhoff - ARC Prize 2024 winners
- Jürgen Schmidhuber - Neural and Non-Neural AI, Reasoning, Transformers, and LSTMs
- #57 - Prof. Melanie Mitchell - Why AI is harder than we think

Topics covered

AI Agency and Ethics

Laura Ruis, a researcher at Cohere, discusses the intricate concept of agency in AI, emphasizing its implications for understanding and controlling AI systems. She explores how agency might emerge in AI models and the societal and ethical challenges it presents.

LLM Reasoning Capabilities

Laura Ruis explores the distinction between factual retrieval and reasoning in language models, highlighting their reliance on procedural knowledge. She examines the potential for formal reasoning and the significant role of code in enhancing model capabilities.
