OpenHermes
Software / App
An instruction-tuning dataset used to train the quality classifier for the DCLM dataset.
Mentioned in 2 videos
Videos Mentioning OpenHermes
Best of 2024: Synthetic Data / Smol Models, Loubna Ben Allal, HuggingFace [LS Live! @ NeurIPS 2024]
Latent Space
An instruction-tuning dataset used to train the quality classifier for the DCLM dataset.

Stanford CS336 Language Modeling from Scratch | Spring 2026 | Lecture 13: Data (Sources, Datasets)
Stanford Online
Instruction data generated by GPT-4, used as positive examples for training the quality classifier in DCLM.