QMIX

Software / App

A state-of-the-art multi-agent reinforcement learning algorithm used as a baseline, showing lower reward performance compared to LLM-critic based methods.

Mentioned in 1 video

Videos Mentioning QMIX

Stanford Robotics Seminar ENGR319 | Spring 2026 | Interactive Autonomy

Stanford Online

A state-of-the-art multi-agent reinforcement learning algorithm used as a baseline, showing lower reward performance compared to LLM-critic based methods.