Quen 2.5

Software / App

A model that uses scaling experiments to tune hyperparameters like batch size and learning rate, following a similar approach to DeepSeek.

Mentioned in 1 video