SWE Verified

Software / App

Benchmark area where Claude 4.5 Sonnet is close to Gemini 3 Pro; a point of competition in coding benchmarks.

Mentioned in 1 video