Matthew Schlegel

Matthew Schlegel

Lover of Espresso; Focused on RL and ML to improve the world; Research Scientist with a penchant for good software and alliteration.

  • Edmonton, Alberta
  • LinkedIn
  • Twitter
  • Github
  • Tags
  • Collections
Home About CV Publications Code BrainDump

Temporal Difference Learning

Feb 21, 2025
tags
Reinforcement Learning

This is a method for learning Value Functions and was first described by (Sutton 1988).

References

Sutton, Richard. 1988. “Learning to Predict by the Methods of Temporal Differences.” Machine Learning 3 (1): 9–44. doi:10.1007/BF00115009.

Links to this note:

  • sutton1988learning: Learning to predict by the methods of temporal differences

Copyright © 2020 Matthew Schlegel. All Rights Reserved. Powered by Hugo and Minimal Academic.