Fiction Lifespan

Organization

A benchmark that tests how well AI models analyze long texts and piece together information scattered across them, such as identifying names in a story from promises and caveats that appear in different chapters.

Mentioned in 1 video