FineWeb-Edu

Software / App | Mentioned in 1 video

A dataset created by filtering FineWeb for highly educational content: Llama 3 was used to annotate samples for educational quality, and a classifier trained on those annotations was used to filter the full dataset.
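
The filtering step can be pictured with a minimal sketch, which is not the actual FineWeb-Edu pipeline. The names `filter_educational` and `toy_score`, the keyword list, and the threshold of 3.0 are illustrative assumptions; in practice a trained classifier would take the place of `toy_score`.

```python
from typing import Callable, Iterable, Iterator


def filter_educational(
    docs: Iterable[str],
    score_fn: Callable[[str], float],
    threshold: float = 3.0,  # assumed cutoff, for illustration only
) -> Iterator[str]:
    """Yield only the documents whose score meets the threshold."""
    for doc in docs:
        if score_fn(doc) >= threshold:
            yield doc


def toy_score(text: str) -> float:
    """Hypothetical stand-in for a trained classifier: counts a few keywords."""
    keywords = ("theorem", "explain", "example", "definition", "lesson")
    hits = sum(word in text.lower() for word in keywords)
    return min(5.0, hits * 2.0)


docs = [
    "This lesson will explain the theorem with an example.",
    "Buy cheap watches now!",
]
kept = list(filter_educational(docs, toy_score))
```

Here `kept` holds only the first document, since the second scores below the assumed threshold.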