pi hat

Software / App

The learned policy in imitation learning, written π̂ ("pi hat"). It is trained from expert demonstrations and aims to achieve cumulative reward close to that of the expert policy π* ("pi star").
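
One way to make "close to the expert" precise is sketched below. The finite horizon T, the per-step reward r, the return J, and the tolerance ε are assumed notation for illustration and are not taken from this entry.

```latex
% Assumed notation: horizon T, reward r(s_t, a_t), tolerance \varepsilon.
% J(\pi) is the expected cumulative reward when actions are chosen by \pi.
J(\pi) = \mathbb{E}_{\pi}\left[ \sum_{t=0}^{T-1} r(s_t, a_t) \right],
\qquad
J(\pi^\star) - J(\hat{\pi}) \le \varepsilon
```

Under this reading, the learned policy is considered good when the gap ε between the expert's return and its own return is small.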

Mentioned in 1 video