DataPro
Software / App
A Hugging Face library for data filtering and tokenization at scale, considered underrated and used for FineWeb and FinePDF.
Mentioned in 1 video
A Hugging Face library for data filtering and tokenization at scale, considered underrated and used for FineWeb and FinePDF.