Regret

Concept

In reinforcement learning, regret is the difference between the performance of an optimal policy and the performance of the agent's current policy on a given environment. It indicates the potential for learning.

Mentioned in 1 video