Transformer architecture

Concept · Mentioned in 1 video

The foundational neural network architecture behind most modern language models. DeepSeek is reportedly working to replace it.