Agentic computer use

Software / App

A benchmark evaluating an AI model's capability in utilizing computational resources and tasks.

Mentioned in 1 video