agentic terminal coding

Software / App

A benchmark that measures how well an AI model can operate and write code inside a terminal environment.

Mentioned in 1 video