Aider Code Editing Benchmark

Study / Research | Mentioned in 1 video

A benchmark for evaluating agents on single-file code editing tasks.