O

Optimal Policy

ConceptMentioned in 1 video

The policy that yields the highest expected reward for a given task in reinforcement learning.