Amy benchmark

Concept

A benchmark for AI models, especially for reasoning and coding tasks, indicating the shift in focus for evaluating advanced AI capabilities.

Mentioned in 1 video