Open Math Text

Study / Research

A 2023 paper focused on creating a large corpus of mathematical text, employing rules, generative models (KenLM), and classifiers to filter for mathematical content.

Mentioned in 1 video