AA omniscience hallucination rate benchmark

Study / Research

A specific benchmark used to measure AI hallucination rates, where Anthropic claims Claude Mythos achieves the best net rating.

Mentioned in 1 video