Temporal Difference Learning
Concept
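A class of model-free reinforcement learning methods that learn by bootstrapping: each value estimate is updated toward a target built from the observed reward and the current estimate of the next state's value, rather than waiting for the final outcome. The resulting prediction-error signal has been proposed as a model of dopamine activity in the brain.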
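To make the bootstrapping idea concrete, here is a minimal sketch of tabular TD(0), the simplest method in this class. The names `td0_update` and `random_walk_td0`, the five-state random-walk environment, and the step size, discount, and episode count are all illustrative assumptions, not taken from the source.

```python
import random


def td0_update(values, state, reward, next_state,
               alpha=0.1, gamma=1.0, terminal=False):
    """Apply one TD(0) update in place and return the TD error.

    Hypothetical helper: `values` is a list of state-value estimates.
    """
    # Bootstrapped target: observed reward plus the *current estimate*
    # of the next state's value (zero if the episode has ended).
    bootstrap = 0.0 if terminal else values[next_state]
    td_error = reward + gamma * bootstrap - values[state]
    # Move the estimate a fraction alpha toward the target.
    values[state] += alpha * td_error
    return td_error


def random_walk_td0(episodes=5000, alpha=0.05, seed=0):
    """Estimate state values for an assumed 5-state random walk.

    The agent starts in the middle state and steps left or right with
    equal probability. Leaving on the right pays reward 1, leaving on
    the left pays 0, and every other step pays 0.
    """
    rng = random.Random(seed)
    n_states = 5
    values = [0.5] * n_states  # arbitrary initial guess
    for _ in range(episodes):
        state = n_states // 2
        while True:
            next_state = state + rng.choice((-1, 1))
            if next_state < 0:
                # Fell off the left edge: terminal, reward 0.
                td0_update(values, state, 0.0, None, alpha, terminal=True)
                break
            if next_state >= n_states:
                # Fell off the right edge: terminal, reward 1.
                td0_update(values, state, 1.0, None, alpha, terminal=True)
                break
            td0_update(values, state, 0.0, next_state, alpha)
            state = next_state
    return values


if __name__ == "__main__":
    for index, value in enumerate(random_walk_td0()):
        print(f"state {index}: estimated value {value:.2f}")
```

In this assumed environment the true state values are 1/6, 2/6, ..., 5/6, so the printed estimates should settle near those numbers. The TD error returned by `td0_update` is the prediction-error quantity that the dopamine comparison refers to: it is positive when the outcome is better than predicted and negative when it is worse.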
Mentioned in 1 video
