Claude 3.7
Software / App
Mentioned as a version of Claude, evaluated for code generation quality and security.
Mentioned in 2 videos
Videos Mentioning Claude 3.7

AI Dev 25 x NYC | Manish Kapur: Assessing the Quality of AI Generated Code
DeepLearningAI
Mentioned as a version of Claude, evaluated for code generation quality and security.

Fullstack-Bench: The Eval for Coding Agents — with Sujay Jayakar, Chief Scientist, Convex
Latent Space
An AI model that performed worse than Claude 3.5 on Convex evals, suggesting that model improvements don't always translate directly to gains on specific benchmarks.