DataComp

Study / Research

An open effort led by Ludwig Schmidt and students to curate Common Crawl data. It serves as a benchmark for data quality, comparing data-curation and filtering methods.
