GPT-4.5
Tool / Product
Mentioned in 2 videos
Mentioned as a frontier model tested for contamination in SWE-Bench Verified.
Videos Mentioning GPT-4.5

The End of SWE-Bench Verified — Mia Glaese & Olivia Watkins, OpenAI Frontier Evals
Latent Space
Mentioned as a frontier model tested for contamination in SWE-Bench Verified.

Scaling Test Time Compute to Multi-Agent Civilizations — Noam Brown, OpenAI
Latent Space
Mentioned in a discussion of games such as tic-tac-toe, where the model plays reasonably well but still makes mistakes, suggesting that perfect play requires System 2 thinking.