Model Behavior Study Group

Name: Model Behavior Study Group
Start: 2026-02-27T22:00:00+09:00
End: 2026-02-27T23:00:00+09:00

Hosted by Suzana

Machine Learning Tokyo

Details

Study Group Topic:
Claude's Constitution (Anthropic, 2026)
Anthropic's transparent explanation of the principles guiding Claude's behavior. 🔗 https://www.anthropic.com/news/claude-new-constitution

📚 Full reading list and How-To: https://github.com/suzana-ilic/study_model_behavior

Join us for our monthly reading group where we dive into the research and specs that shape how AI systems like ChatGPT and Claude actually behave. We read together for 30 minutes, then discuss for 30 minutes. Pre-reading is recommended, but not required.

Who is this for?
Anyone curious about how AI systems work—researchers, builders, policy folks, or just thoughtful people who use these tools and want to understand them better. No technical background needed. We start with accessible industry standards and papers and build from there.

What will we read?
We're working through resources in six areas:

Industry Specs — How leading AI companies define model behavior
Constitutional AI — Training models with written principles and AI-generated feedback in place of much of the human feedback
Safety Methods — RLHF and alignment techniques
Behavioral Science — How researchers study what AI actually does
Interpretability — Understanding what's happening inside the models
Critical Perspectives — Challenges to current approaches

Format

30 minutes: Read together (with discussion questions)
30 minutes: Talk through key insights and implications
Monthly sessions

Location: Online
💻 RSVP for Zoom Link
⚙️ Discord https://discord.gg/CT7nBdYCsY
📬 Updates: https://mltaicommunities.substack.com/
