Name: AI Safety Thursdays: Reasoning Models Don't Always Say What They Think
Start: 2025-06-12T18:00:00-04:00
End: 2025-06-12T21:00:00-04:00
Location: 30 Adelaide East, Industrious Office 12th Floor Common Area

​​**Description**
"Reasoning Models" have become among the most prominent state-of-the-art tools in the AI world. Can we trust the way they reason, and does it matter if they come up with the right answer but with incorrect reasoning?
At today's event, Giles Edkins will guide us through these questions as explored in [Anthropic's paper from last month](https://assets.anthropic.com/m/71876fabef0f0ed4/original/reasoning_models_paper.pdf)
​​**Event Schedule**
6:00 to 6:45 - Networking and refreshments
6:45 to 8:00 - Main Presentation
8:00 to 9:00 - Breakout Discussions

Juliana Eberschlag

Giles

Mario Gibney

Toronto AI Safety

Technology

Risk Management

New Technology

Safety

Critical Thinking

Artificial Intelligence Applications

AI and Society

Mathematics

Artificial Intelligence Machine Learning Robotics

Artificial Intelligence

Machine Learning

Software Engineering

Machine Learning Interpretability

Deep Learning

Surendrapalsingh Jhiout

Archana Arakkal

Gurpreet Singh

Vincent Lu

AI Safety Thursdays: Reasoning Models Don't Always Say What They Think

30 Adelaide East, Industrious Office 12th Floor Common Area

Share this event

AI Safety Thursdays: Reasoning Models Don't Always Say What They Think

Details