PPO
Software / App
Proximal Policy Optimization, a reinforcement learning algorithm developed by John Schulman and colleagues at OpenAI. OpenAI scaled it up significantly for its Dota 2 project (OpenAI Five), where emergent behaviors appeared at larger scales.
Mentioned in 1 video