The Stack
ConceptMentioned in 2 videos
A common training corpus for AI models, used in combination with The Pile.
Videos Mentioning The Stack

The "Normsky" architecture for AI coding agents — with Beyang Liu + Steve Yegge of SourceGraph
Latent Space
A common training corpus for AI models, used in combination with The Pile.

FlashAttention-2: Making Transformers 800% faster AND exact
Latent Space
A large code dataset dataset, mentioned as an example of impactful open data.