Mixtral 8x7B
Software / App
A sparse mixture-of-experts model that significantly outperforms Llama 2 70B on math, code generation, and multilingual benchmarks.
Mentioned in 1 video