pi hat
Software / App
The learned policy in imitation learning, which aims to achieve cumulative reward close to that of the expert policy (pi star).
Mentioned in 1 video
The learned policy in imitation learning, which aims to achieve cumulative reward close to that of the expert policy (pi star).