Name: AI Safety Thursday: Chain-of-Thought Monitoring for AI Control
Start: 2025-10-30T18:00:00-04:00
End: 2025-10-30T21:00:00-04:00
Location: 30 Adelaide East, Industrious Office 12th Floor Common Area

​**Registration Instructions**
This is a paid event ($5 general admission, free for students & job seekers) with limited tickets - you must [RSVP on Luma](https://luma.com/gwlal4v9) to secure your spot.
​​If you can't make it in person, feel free to join the live stream starting at 6:30 pm, via [this link](https://www.youtube.com/@Trajectory-Labs/live).

**Description**
Modern reasoning models do a lot of thinking in natural language before producing their outputs. Can we catch misbehaviours by our LLMs and interpret their motivations simply by reading these chains of thought?
​In this talk, [Rauno Arike](https://www.linkedin.com/in/rauno-arike/) and [Rohan Subramani](https://rohansubramani.github.io/personal-website/) will give an overview of research areas in chain-of-thought monitorability and AI control, and discuss their recent research on the usefulness of chain-of-thought monitoring for ensuring that LLM agents only pursue objectives that their developers intended them to follow.

​**Event Schedule**
6:00 to 6:30 - Food and introductions
6:30 to 7:30 - Presentation and Q&A
7:30 to 9:00 - Open Discussions

Georgia Berg

Mario Gibney

Toronto AI Safety

Technology

Risk Management

New Technology

Safety

Critical Thinking

Artificial Intelligence Applications

AI and Society

Mathematics

Artificial Intelligence Machine Learning Robotics

Artificial Intelligence

Machine Learning

Software Engineering

Machine Learning Interpretability

Deep Learning

AI Safety Thursday: Chain-of-Thought Monitoring for AI Control

30 Adelaide East, Industrious Office 12th Floor Common Area

Share

Toronto AI Safety

AI Safety Thursday: Chain-of-Thought Monitoring for AI Control

Toronto AI Safety

Details

Members are also interested in