Deep Reinforcement Learning
Deep reinforcement learning is Reinforcement Learning using deep Neural Networks. It is a field popularized by (Mnih et al. 2015), but using Connectionist Networks in RL has been persued much earlier (Tesauro 1994), (Bakker 2002), (Rummery and Niranjan 1994), (Williams 1992), (Liu and Zou 2017) and many others.
References
Bakker, Bram. 2002. “Reinforcement Learning with Long Short-Term Memory.” In Advances in Neural Information Processing Systems 14, edited by T. G. Dietterich, S. Becker, and Z. Ghahramani. MIT Press.
Liu, Ruishan, and James Zou. 2017. “The Effects of Memory Replay in Reinforcement Learning.” arXiv:1710.06574 [Cs, Stat]. https://arxiv.org/abs/1710.06574.
Mnih, Volodymyr, Koray Kavukcuoglu, David Silver, Andrei A. Rusu, Joel Veness, Marc G. Bellemare, Alex Graves, et al. 2015. “Human-Level Control through Deep Reinforcement Learning.” Nature.
Rummery, Gavin A, and Mahesan Niranjan. 1994. On-Line Q-learning Using Connectionist Systems. University of Cambridge, Department of Engineering Cambridge, UK.
Links to this note:
- Reinforcement Learning
- henderson2018deep: Deep Reinforcement Learning That Matters
- Atari
- jaderberg2017reinforcement: Reinforcement Learning with Unsupervised Auxiliary Tasks
- machado2018revisiting: Revisiting the Arcade Learning Environment: Evaluation Protocols and Open Problems for General Agents
- veeriah2019discovery: Discovery of Useful Questions as Auxiliary Tasks
- Auxiliary Tasks
- mnih2016asynchronous: Asynchronous Methods for Deep Reinforcement Learning
- Experience Replay
- DeepMind Lab