QMIX
Software / App
A state-of-the-art multi-agent reinforcement learning algorithm used as a baseline; it achieved lower reward than LLM-critic-based methods.
Mentioned in 1 video