AI Safety Thursday: Attempts and Successes of LLMs Persuading on Harmful Topics

Details
Registration Instructions
This is a paid event ($5 general admission; free for students and job seekers) with limited tickets, so you must RSVP on Luma to secure your spot.
If you can't make it in person, feel free to join the live stream starting at 6:30 pm, via this link.
Description
Large Language Models can persuade people at unprecedented scale—but how effectively, and are they willing to try persuading us toward harmful ideas?
In this talk, Matthew Kowal and Jasper Timm will present findings showing that LLMs can shift beliefs toward conspiracy theories as effectively as they debunk them, and that many models are willing to attempt harmful persuasion on dangerous topics.
Event Schedule
6:00 pm to 6:30 pm - Food & Networking
6:30 pm to 7:30 pm - Main Presentation & Questions
7:30 pm to 9:00 pm - Breakout Discussions