DeepSeek OSS

Software / App
Mentioned in 1 video

Large open-weight models in which only about 5% of the parameters are active for any given token, an example of highly sparse model design.
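
To make the "about 5% active" figure concrete, here is a minimal sketch of the arithmetic. The helper name `active_fraction` is hypothetical, and the parameter counts are an assumption brought in for illustration (the publicly reported DeepSeek-V3 figures of 671B total and 37B activated per token); the entry itself does not say which model or numbers it refers to.

```python
def active_fraction(active_params: float, total_params: float) -> float:
    """Return the share of a model's parameters used for a single token."""
    if total_params <= 0:
        raise ValueError("total_params must be positive")
    return active_params / total_params


# Assumed illustrative figures, not taken from the entry:
# 671B total parameters, 37B activated per token.
fraction = active_fraction(37e9, 671e9)
print(f"{fraction:.1%}")  # prints "5.5%"
```

Under these assumed numbers, roughly one parameter in eighteen takes part in processing each token, which is consistent with the "approximately 5%" described above.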