Matthew Schlegel


Temporal Difference Learning

Feb 21, 2025

tags: Reinforcement Learning

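Temporal difference (TD) learning is a method for learning value functions from experience, first described by Sutton (1988). Rather than waiting for the final outcome, it moves each prediction toward a bootstrapped target built from the observed reward and the prediction at the next time step.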
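As a rough illustration, here is a minimal sketch of the tabular one-step variant usually called TD(0). The function names `td0_update` and `evaluate_chain`, the step size `alpha`, the discount `gamma`, and the small deterministic chain environment are all assumptions made up for this example; none of them come from the note or from any particular library.

```python
def td0_update(values, state, reward, next_state, alpha=0.1, gamma=0.9, terminal=False):
    """Apply one tabular TD(0) update in place and return the TD error."""
    # Terminal transitions bootstrap from zero instead of a next-state estimate.
    bootstrap = 0.0 if terminal else values[next_state]
    td_error = reward + gamma * bootstrap - values[state]
    values[state] += alpha * td_error
    return td_error


def evaluate_chain(n_states=3, episodes=200, alpha=0.1, gamma=1.0):
    """Estimate values on a hypothetical deterministic chain.

    States run 0 -> 1 -> ... -> terminal, with reward 1.0 on the final
    transition only (an illustrative environment, assumed for this sketch).
    """
    values = [0.0] * n_states
    for _ in range(episodes):
        for state in range(n_states):
            last = state == n_states - 1
            reward = 1.0 if last else 0.0
            next_state = state if last else state + 1
            td0_update(values, state, reward, next_state, alpha, gamma, terminal=last)
    return values


if __name__ == "__main__":
    print(evaluate_chain())
```

With `gamma=1.0` every state in this chain has a true value of 1.0, so the printed estimates should approach that number as the reward information propagates backward one state at a time.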

References

Sutton, Richard. 1988. “Learning to Predict by the Methods of Temporal Differences.” Machine Learning 3 (1): 9–44. doi:10.1007/BF00115009.

