SlimPajama

Dataset

A diverse dataset used for the initial continual pre-training phase of Gradient's context extension.

Mentioned in 1 video