G
GDP‑Val benchmark
Study / ResearchA broad benchmark comparing LLM performance to domain experts across many white‑collar tasks (used as an AGI/benchmark reference).
Mentioned in 1 video
A broad benchmark comparing LLM performance to domain experts across many white‑collar tasks (used as an AGI/benchmark reference).
Mentioned in 1 video