Name: Could Frontier Labs’ Internal Agents Already Go Rogue?
Start: 2026-06-04T18:00:00-04:00
End: 2026-06-04T21:00:00-04:00
Location: 30 Adelaide East, Industrious Office 12th Floor Common Area

This is a ticketed event. You can register [here](https://luma.com/trajec-2gru). ​

Could an AI company’s internal coding agents create a “rogue deployment”, a set of agents running without human knowledge or permission? In February and March 2026, [METR](https://metr.org/?utm_source=luma), the organization behind the [time horizons graph](https://metr.org/time-horizons/), conducted a pilot of a process to assess just that. Anthropic, Google DeepMind, Meta, and OpenAI gave us access to their most capable internal LLMs and a wide range of non-public information. We concluded that, while internal agents plausibly had the means, motive, and opportunity to start small rogue deployments, they didn’t have the means to avoid human detection indefinitely.

METR researcher Thomas Broadley explains the process, the six key facts that informed our conclusion, and how we expect risk to evolve over the next few months.
​You can watch a livestream of the talk [here](https://www.youtube.com/@Trajectory-Labs/live?utm_source=luma).

Georgia Berg

Mario Gibney

Toronto AI Safety

Technology

Risk Management

New Technology

Safety

Critical Thinking

Artificial Intelligence Applications

AI and Society

Mathematics

Artificial Intelligence Machine Learning Robotics

Artificial Intelligence

Machine Learning

Software Engineering

Machine Learning Interpretability

Deep Learning

Could Frontier Labs’ Internal Agents Already Go Rogue?

30 Adelaide East, Industrious Office 12th Floor Common Area

Share

Toronto AI Safety

Could Frontier Labs’ Internal Agents Already Go Rogue?

Toronto AI Safety

Details

Related topics

You may also like