MPO

Software / App

A multi-agent reinforcement learning algorithm mentioned as a baseline that struggled with complex coordination tasks, contrasted with methods that use coaching.

Mentioned in 1 video