Details

Study Group Topic:
Claude's Constitution (Anthropic, 2026)
Anthropic's transparent explanation of the principles guiding Claude's behavior. 🔗 https://www.anthropic.com/news/claude-new-constitution

📚 Full reading list and How-To: https://github.com/suzana-ilic/study_model_behavior

Join us for our monthly reading group where we dive into the research and specs that shape how AI systems like ChatGPT and Claude actually behave. We read together for 30 minutes, then discuss for 30 minutes. Pre-reading is recommended, but not required.

Who is this for?
Anyone curious about how AI systems work: researchers, builders, policy folks, or just thoughtful people who use these tools and want to understand them better. No technical background needed. We start with accessible industry standards and papers and build from there.

What will we read?
We're working through resources in six areas:

  1. Industry Specs: How leading AI companies define model behavior
  2. Constitutional AI: Training models with written principles and AI feedback in place of much of the human feedback
  3. Safety Methods: RLHF and alignment techniques
  4. Behavioral Science: How researchers study what AI actually does
  5. Interpretability: Understanding what's happening inside the models
  6. Critical Perspectives: Challenges to current approaches

Format

  • 30 minutes: Read together (with discussion questions)
  • 30 minutes: Talk through key insights and implications
  • Monthly sessions

Location: [Online]
💻 RSVP for Zoom Link
⚙️ Discord: https://discord.gg/CT7nBdYCsY
📬 Updates: https://mltaicommunities.substack.com/

