Features
Discover
Use Cases
Pricing
Blog
Login
Get Started
Toggle theme
Discover
Entities
Concepts
strong to weak distillation
strong to weak distillation
Concept
A technique used by Qwen's developers for training smaller models from larger ones.
Mentioned in
1 video
Videos Mentioning strong to weak distillation
OpenAI vs. Deepseek vs. Qwen: Comparing Open Source LLM Architectures
Y Combinator
A technique used by Qwen's developers for training smaller models from larger ones.