Attention Is All You Need
Study / Research
The original 2017 Transformer paper by Vaswani et al., referenced across these videos to explain positional encodings and the encoder/decoder distinction.
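For reference, the sinusoidal positional encodings introduced in the paper can be sketched in a few lines of NumPy (the function name is illustrative; `d_model` is assumed even):

```python
import numpy as np

def sinusoidal_positional_encoding(seq_len: int, d_model: int) -> np.ndarray:
    """Positional encodings from 'Attention Is All You Need':
    PE(pos, 2i)   = sin(pos / 10000**(2i / d_model))
    PE(pos, 2i+1) = cos(pos / 10000**(2i / d_model))
    """
    positions = np.arange(seq_len)[:, None]                    # shape (seq_len, 1)
    div_terms = 10000 ** (np.arange(0, d_model, 2) / d_model)  # shape (d_model/2,)
    pe = np.zeros((seq_len, d_model))
    pe[:, 0::2] = np.sin(positions / div_terms)  # even dimensions use sine
    pe[:, 1::2] = np.cos(positions / div_terms)  # odd dimensions use cosine
    return pe

pe = sinusoidal_positional_encoding(seq_len=8, d_model=16)
# Row 0 alternates sin(0)=0 and cos(0)=1; all values lie in [-1, 1].
```

Because each dimension is a sinusoid of a different wavelength, the encoding for any position is a fixed, deterministic vector, which is why these embeddings are comparable to the Fourier embeddings discussed in the diffusion-model video below.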
Mentioned in 5 videos
Videos Mentioning Attention Is All You Need

Let's reproduce GPT-2 (124M)
Andrej Karpathy
Original Transformer paper referenced to explain positional encodings and encoder/decoder distinctions.

Breaking down the OG GPT Paper by Alec Radford
Latent Space
The paper that introduced the Transformer architecture, which was used by GPT.
[Paper Club] Intro to Diffusion Models and OpenAI sCM: Simple, Stable, Scalable Consistency Models
Latent Space
Cited as the source of sinusoidal positional embeddings, which are comparable to the Fourier embeddings used in consistency models.
[Paper Club] Molmo + Pixmo + Whisper 3 Turbo - with Vibhu Sapra, Nathan Lambert, Amgadoz
Latent Space
A seminal paper in machine learning that introduced the Transformer architecture, which is the basis for Whisper's encoder-decoder model.

Information Theory for Language Models: Jack Morris
Latent Space
The paper that introduced the Transformer architecture underlying modern language models.