G

GDP‑Val benchmark

Study / Research

A broad benchmark comparing LLM performance to domain experts across many white‑collar tasks (used as an AGI/benchmark reference).

Mentioned in 1 video