Optimal Policy

Concept

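In reinforcement learning, the policy that maximizes the expected cumulative reward (the return, usually discounted over time) for a given task. An optimal policy does at least as well as any other policy from every starting state.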
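As a rough illustration, the sketch below recovers an optimal policy for a tiny, made-up task using value iteration, one standard way to compute it when the transitions and rewards are known. Everything here is an assumption for the example and does not come from the original entry: the two-state task, the `NEXT_STATE` and `REWARD` tables, the `value_iteration` function name, and the discount factor `gamma = 0.9`. Transitions are deterministic to keep the code short.

```python
# Hypothetical toy task (not from the source): 2 states, 2 actions.
# Action 0 = "stay", action 1 = "move". Transitions are deterministic.
NEXT_STATE = [[0, 1],   # from state 0: stay -> 0, move -> 1
              [1, 0]]   # from state 1: stay -> 1, move -> 0
REWARD = [[0.0, 1.0],   # rewards for (state 0, stay) and (state 0, move)
          [2.0, 0.0]]   # rewards for (state 1, stay) and (state 1, move)


def value_iteration(next_state, reward, gamma=0.9, tol=1e-10):
    """Return (optimal state values, greedy policy) for a deterministic task."""
    n_states = len(next_state)
    values = [0.0] * n_states
    while True:
        # Bellman optimality backup: best one-step reward plus discounted value.
        new_values = [
            max(reward[s][a] + gamma * values[next_state[s][a]]
                for a in range(len(next_state[s])))
            for s in range(n_states)
        ]
        delta = max(abs(n - o) for n, o in zip(new_values, values))
        values = new_values
        if delta < tol:
            break
    # The optimal policy picks, in each state, the action with the best backup.
    policy = [
        max(range(len(next_state[s])),
            key=lambda a: reward[s][a] + gamma * values[next_state[s][a]])
        for s in range(n_states)
    ]
    return values, policy


values, policy = value_iteration(NEXT_STATE, REWARD)
print(policy)  # action index chosen in each state
```

In this toy task the best long-run plan is to move from state 0 to state 1 and then stay there, because staying in state 1 pays the largest reward on every step. A policy that only looked at the immediate reward would reach the same answer here, but in general the optimal policy can accept a smaller reward now in exchange for a larger return later.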
