Temporal Difference (TD) learning
Concept
A reinforcement learning method for learning predictions over time: it updates the current state's value estimate from the observed reward and the value estimates of subsequent states, rather than waiting for the final outcome.
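To make the update concrete, here is a minimal sketch of the simplest variant, one-step TD, often written TD(0). It assumes a tabular value estimate stored in a Python list; the function names, the step size alpha, the discount gamma, and the small random-walk environment are all illustrative choices, not something taken from the entry above.

```python
import random


def td0_update(values, state, reward, next_state,
               alpha=0.1, gamma=1.0, terminal=False):
    """Apply one TD(0) update to values[state] in place; return the TD error.

    The target is reward + gamma * V(next_state). When the transition ends
    the episode, the bootstrap term is taken to be 0.
    """
    bootstrap = 0.0 if terminal else values[next_state]
    td_error = reward + gamma * bootstrap - values[state]
    values[state] += alpha * td_error
    return td_error


def run_random_walk(episodes=1000, n_states=5, alpha=0.05, seed=0):
    """Estimate state values for a hypothetical 1-D random walk.

    The walker starts in the middle state and moves left or right with equal
    probability. Stepping off the right end pays reward 1, stepping off the
    left end pays 0, and every other step pays 0.
    """
    rng = random.Random(seed)
    values = [0.5] * n_states  # arbitrary initial guess
    for _ in range(episodes):
        state = n_states // 2
        while True:
            next_state = state + rng.choice((-1, 1))
            if next_state < 0:  # fell off the left end
                td0_update(values, state, 0.0, state, alpha, terminal=True)
                break
            if next_state >= n_states:  # fell off the right end
                td0_update(values, state, 1.0, state, alpha, terminal=True)
                break
            td0_update(values, state, 0.0, next_state, alpha)
            state = next_state
    return values


if __name__ == "__main__":
    print([round(v, 2) for v in run_random_walk()])
```

Each call moves the current estimate a fraction alpha of the way toward the one-step target, so learning happens after every transition instead of at the end of an episode. In this sketch the estimates should drift toward the probability of exiting on the right from each state, though with a fixed step size they keep fluctuating around it.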