DeepSeek OSS

Software / App

Large open-weight models that operate with approximately 5% active parameters, showcasing advanced sparsity.

Mentioned in 1 video