Transformer architectures

Concept

The dominant architecture in neural networks, built around the concept of attention, enabling soft tree structures.

Mentioned in 1 video