Terminal Bench

Software / App

A benchmark for evaluating AI agents' ability to perform tasks in a terminal environment.

Mentioned in 2 videos

Save the 2 videos on Terminal Bench to your own pod.

Sign up free to keep building your knowledge base on Terminal Bench as more episodes are added.

Get Started Free