Pierre de Lacaze on Deep Reinforcement Learning


Details
Abstract
This talk provides an introduction to Deep Reinforcement Learning. First we present Reinforcement Learning. Both Q-Learning and Temporal Difference Learning are discussed. We provide an accelerated introduction to Artificial Neural Nets and specifically the Backpropagation algorithm. We then cover Deep Learning topics and describe Convolutional Neural Nets and specifically the concepts of local receptive fields, feature maps and max-pooling. Next we explain how Reinforcement Learning has been successfully combined with Deep Learning. Finally, time permitting we will describe how Google DeepMind used Deep Reinforcement Learning to play and master Atari games and how the AlphaGo program beat world champion Go player Lee Se-dol earlier this year. Code examples will be provided in Clojure.
References
• Reinforcement Learning
–http://personal.disco.unimib.it/Vanneschi/McGrawHill_-Machine_Learning-Tom_Mitchell.pdf
–https://webdocs.cs.ualberta.ca/~sutton/book/ebook/the-book.html
• Neural Nets & Deep Learning
–http://personal.disco.unimib.it/Vanneschi/McGrawHill_-Machine_Learning-Tom_Mitchell.pdf ;
–http://neuralnetworksanddeeplearning.com/chap2. (http://neuralnetworksanddeeplearning.com/chap2.html)html (http://neuralnetworksanddeeplearning.com/chap2.html)
• Convolutional Neural Networks
–http://cs231n.github.io/convolutional-networks (http://cs231n.github.io/convolutional-networks/)/ (http://cs231n.github.io/convolutional-networks/)
• Deep Reinforcement Learning –http://www0.cs.ucl.ac.uk/staff/d.silver/web/Resources_files/deep_rl.pdf ;
•AlphaGo
– https://storage.googleapis.com/deepmind-media/alphago/AlphaGoNaturePaper.pdf
– http://deeplearningskysthelimit.blogspot.com/
Bio
Pierre de Lacaze has over 20 years experience with Lisp and AI based technologies. He holds a Bachelor of Science in Applied Mathematics and Computer Science and a Master’s Degree in Computer Science. He is the President of LispNYC (http://www.lispnyc.org) and Director of Machine Intelligence at Shareablee (http://www.shareablee.com).

Pierre de Lacaze on Deep Reinforcement Learning