SmolLM 2

Software / App

A model mentioned as an example for its training data and MMLU performance, specifically its random performance on QA format before a certain token count.

Mentioned in 1 video