Hellaswag

Study / Research

A common NLP benchmark mentioned as one of the public evaluations that IMB has reviewed and cleaned for ambiguity and data contamination.

Mentioned in 2 videos

Save the 2 videos on Hellaswag to your own pod.

Sign up free to keep building your knowledge base on Hellaswag as more episodes are added.

Get Started Free