T
Transformer architecture
ConceptMentioned in 1 video
The foundational neural network architecture used in most modern language models, which DeepSeek is reportedly working to replace.
The foundational neural network architecture used in most modern language models, which DeepSeek is reportedly working to replace.