Mixtral 8x7B

Software / App

A sparse mixture-of-experts model that significantly outperforms Llama 2 70B on math, code generation, and multilingual benchmarks.
