S

SmolLM 2

Software / AppMentioned in 1 video

A model mentioned as an example for its training data and MMLU performance, specifically its random performance on QA format before a certain token count.