Temporal Difference Learning

Concept

A class of model-free reinforcement learning methods which learn by bootstrapping from the current estimate of the value function, with proposed similarities to dopamine processes in the brain.

Mentioned in 1 video