Skip to content

Reinforcement Learning: Chapter 3 Finite Markov Decision Processes

Photo of Jason Eckstein
Hosted By
Jason E.
Reinforcement Learning: Chapter 3 Finite Markov Decision Processes

Details

Chapter 3 introduces the mathematical formalism for defining the full reinforcement learning problem in the book. We will cover the definition of probability transition functions, reward signals, and the discounted return. If there is time we will continue with the discussion of policies and value functions as explained with the gridworld example.

As usual you can find below links to the textbook, previous chapter notes, slides, and recordings of some of the previous meetings.
Useful Links:
Reinforcement Learning: An Introduction by Richard S. Sutton and Andrew G. Barto
Recordings of Previous Meetings
Short RL Tutorials
My exercise solutions and chapter notes
Kickoff Slides which contain other links
Video lectures from a similar course

Photo of Silicon Valley Generative AI – A GenAI Collective Member group
Silicon Valley Generative AI – A GenAI Collective Member
See more events

Every 2 weeks on Monday until March 7, 2026

Online event
Link visible for attendees
FREE