OpenWebText

Software / App
Mentioned in 1 video

A community effort to reproduce WebText, the unreleased web-scraped dataset that OpenAI used to train GPT-2 (referenced when discussing GPT-2 training data).