Temporal Difference (TD) learning

Concept

A reinforcement learning method for learning predictions over time. After each step, it updates the current state's value estimate using the observed reward and the value estimate of the next state, rather than waiting for the final outcome.
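
The update described above can be sketched as a one-step TD update, often called TD(0). This is a minimal illustration rather than a definitive implementation: the function name `td0_update`, the step size `alpha`, the discount factor `gamma`, and the two-state chain are all assumptions introduced for the example.

```python
def td0_update(values, state, reward, next_state,
               alpha=0.1, gamma=0.9, terminal=False):
    """Apply one TD(0) update to values[state] in place; return the TD error."""
    # A terminal transition has no future value to bootstrap from.
    next_value = 0.0 if terminal else values[next_state]
    # TD error: (observed reward + discounted next-state estimate) - current estimate.
    td_error = reward + gamma * next_value - values[state]
    # Move the current estimate a fraction alpha toward the bootstrapped target.
    values[state] += alpha * td_error
    return td_error


# Hypothetical chain A -> B -> end, with reward 1 only on the final step.
values = {"A": 0.0, "B": 0.0}
for _ in range(200):
    td0_update(values, "A", 0.0, "B")
    td0_update(values, "B", 1.0, None, terminal=True)
```

With these assumed settings, repeated passes pull the estimate for B toward 1.0 and the estimate for A toward 0.9, since A's target is the discounted estimate of B.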

Mentioned in 1 video