A
Arc AGI 2
Study / ResearchPattern-recognition benchmark used to test models outside training data; GPT-5.2 shows strong results.
Mentioned in 1 video
Pattern-recognition benchmark used to test models outside training data; GPT-5.2 shows strong results.
Mentioned in 1 video