G

GDP-X Eval

Software / AppMentioned in 1 video

An evaluation developed by OpenAI's Human Data and Frontier Evals teams to measure real-world white-collar work by AI agents.