Benchmark for software engineering tasks; cited in OS world/Swebench comparisons.
Mentioned in 1 video
AI Explained