Name: Reinforcement Learning: Policy Self Play and Monte Carlo Tree Search
Start: 2026-05-25T17:30:00-07:00
End: 2026-05-25T19:00:00-07:00

This meeting will begin to cover [Multi-Agent Reinforcement Learning: Foundations and Modern Approaches](https://www.marl-book.com/) section 9.8 which covers Policy Self-Play in Zero-Sum games. We will introduce a test environment for zero-sum games and discuss a subset of these in which agents take turns selecting actions. In such games, the environment on any given step is equivalent to an MDP and techniques such as MCTS can be used to search for optimal play. We will also discuss MDP solutions that optimize play against a fixed opponent and compare that solution to self play techniques from MARL.

As usual you can find below links to the textbook, previous chapter notes, slides, and recordings of some of the previous meetings.

Meetup Links:
[Recordings of Previous RL Meetings](https://youtube.com/playlist?list=PLYqXmZaxvwmy2CNaK-DLailou1VIU1UZn&si=n6uQm863MCcHuKT7)
[Recordings of Previous MARL Meetings](https://youtube.com/playlist?list=PLYqXmZaxvwmzikjw-cNyZfI051ms05czB&si=A7-AeX0dcRW67PDB)
[Short RL Tutorials](https://youtube.com/playlist?list=PLYqXmZaxvwmyLEXMpk-n4RFr59tpJjNXt&si=RHy_FAnOJnPa4p1N)
[My exercise solutions and chapter notes for Sutton-Barto](https://github.com/jekyllstein/Reinforcement-Learning-Sutton-Barto-Exercise-Solutions)
[My MARL repository](https://github.com/jekyllstein/MARL_course/tree/main)
[Kickoff Slides which contain other links](https://docs.google.com/presentation/d/1QD3iw5BgIpPpl_K_ApAlDr1NRseR1WmXme1dKQGqTOg/edit?usp=sharing)
[MARL Kickoff Slides](https://docs.google.com/presentation/d/1FHXGVWkzjKsnNxzVN-29dx5vdkffAx5Vji5nWrDvg1Y/edit?usp=sharing)

MARL Links:
[Multi-Agent Reinforcement Learning: Foundations and Modern Approaches](https://www.marl-book.com/)
[MARL Summer Course Videos](https://youtube.com/playlist?list=PLkoCa1tf0XjCU6GkAfRCkChOOSH6-JC_2&si=lEljXo65s3fMUsRC)
[MARL Slides](https://github.com/marl-book/slides)

Sutton and Barto Links:
[Reinforcement Learning: An Introduction by Richard S. Sutton and Andrew G. Barto](http://incompleteideas.net/book/the-book.html)
[Video lectures from a similar course](https://youtube.com/playlist?list=PLqYmG7hTraZDVH599EItlEWsUOsJbAodm)

Jason Eckstein

Silicon Valley Generative AI ~ The AI Collective Network

Technology

Artificial Intelligence

Media

New Technology

Machine Learning

Knowledge Sharing

Artificial Intelligence Applications

AI and Society

Education & Technology

Artificial Intelligence Machine Learning Robotics

New Media

AI Algorithms

Data Science

Deep Learning

Neural Networks

Every 2 weeks on Monday until March 15, 2027

Reinforcement Learning: Policy Self Play and Monte Carlo Tree Search

Online event

Share

Silicon Valley Generative AI ~ The AI Collective Network

Reinforcement Learning: Policy Self Play and Monte Carlo Tree Search

Silicon Valley Generative AI ~ The AI Collective Network

Details

Related topics

You may also like