GDP-X Eval
Software / App
Mentioned in 1 video
An evaluation developed by OpenAI's Human Data and Frontier Evals teams to measure how well AI agents perform real-world white-collar work.