What we're about
Upcoming events (1)
1. Meeting opens at 5:45pm
2. Introductions (Starts at 6pm)
3. Speaker Event
4. Networking Event (30 minutes)
Topic: Multi-Agent Reinforcement Learning: Dodging Tragedy of the Commons with Simple Mechanisms
Speaker: Quinn Dougherty
Some problems can be described in terms of states, actions, and rewards. A computer program that maximizes rewards in such an environment by selecting actions is called an agent, and the study of these agents is called reinforcement learning. You can select actions with deep learning, leading the research community to advances in playing Go and autonomous vehicles. Naturally, problems and environments arise that are best thought of as a confluence of two or more such agents, the study of which is called multi-agent reinforcement learning. Meanwhile, over in economics, common pool resources are studied as an approximate prisoner's dilemma: if the collective harvests too much, everyone loses, yet if any individual unilaterally implements a sustainable policy others are incentivized not to follow suit. In the literature this is called tragedy of the commons, but economist Elinor Ostrom took an empirical approach  and found emergent mechanisms all over the world that caused communities to dodge this outcome. You're asking a natural question: do we want to simulate these environments with multi-agent reinforcement learning, simulate a mechanism suggested by Ostrom, and observe if our agents can dodge tragedy of the commons? In this talk, we will discuss my team's journey through this research question and observe the surprisingly easy interface to Ray's RLlib library  for training agents to play multi-player games in python. There will be a follow-along repo, if not a notebook.
Quinn Dougherty is a logician at platonic.systems, working on auditing a new decentralized finance project for the Cardano ecosystem and on formal verification. Previously he was a research intern at the Stanford Existential Risks Initiative profiling how the AGI Safety and Alignment research community should prioritize multi-stakeholder and/or multi-agent scenarios. In 2020 he worked on the hospital traffic forecasting app CHIME and did some python and cloud security work for a startup. Quinn is also a coorganizer at Effective Altruism Philadelphia. More information including socials and contact at quinnd.net.
Join us after the talk for a chance to chat with Quinn and network with the other members of the DataPhilly community.