Open RCA
Study / Research
Root cause analysis benchmark with 335 software failure cases; Opus 4.6 performed around one-third correct.
Mentioned in 1 video
Root cause analysis benchmark with 335 software failure cases; Opus 4.6 performed around one-third correct.