A reinforcement learning policy style used for controlling the lower body mobility of the robot.
Stanford Online