PPO

Software / App

Proximal Policy Optimization, a reinforcement learning algorithm developed by John Schulman, which OpenAI scaled up significantly for the Dota project, revealing emergent behaviors at larger scales.

Mentioned in 2 videos

Save the 2 videos on PPO to your own pod.

Sign up free to keep building your knowledge base on PPO as more episodes are added.

Get Started Free