Aider Code Editing Benchmark

Study / Research

A benchmark for evaluating agents on single-file code-editing tasks.

Mentioned in 1 video