GPT-4o
large multimodal model from OpenAI
Videos Mentioning GPT-4o

What the Freakiness of 2025 in AI Tells Us About 2026
AI Explained
OpenAI model referenced for extreme prompts and user behavior; an example of frontier-model incentives.

Answer.ai & AI Magic with Jeremy Howard
Latent Space
A recent model from OpenAI, influencing the development of tools to be compatible with OpenAI's offerings.

The Winds of AI Winter (Q2 Four Wars of the AI Stack Recap)
Latent Space
OpenAI's latest model, with its voice capabilities and multimodal features discussed.

Training Llama 2, 3 & 4: The Path to Open Source AGI — with Thomas Scialom of Meta AI
Latent Space
A version of GPT-4 that may outperform Llama 3 on some benchmarks.

Beating OpenAI and Anthropic by Looking At Data: the new #1 on SWE-Bench w/ W&B CTO Shawn Lewis
Latent Space
An OpenAI model used by Shawn Lewis's agent for both logic and programming, achieving a 64% success rate on SWE-Bench.

OpenAI o1 isn’t a chat model (and that’s the point)
Latent Space
A model mentioned alongside GPT-3.5 as one of the LLMs commonly used for coding; o1 is presented as superior for complex, multi-file implementations.

Best of 2024 in Agents (from #1 on SWE-Bench Full, Prof. Graham Neubig of OpenHands/AllHands)
Latent Space
A language model that is considered good but lacks strong error recovery, leading to loops, and performs moderately in coding agent evaluations.

2024 Year in Review: The Big Scaling Debate, the Four Wars of AI, Top Themes and the Rise of Agents
Latent Space
Released in May, an 'omni-model' with native vision and voice capabilities, highly impactful for its efficiency and multimodal demos, despite the 'Sky Voice' controversy.
![Best of 2024 in Vision [LS Live @ NeurIPS]](https://i.ytimg.com/vi/76EL7YVAwVo/maxresdefault.jpg)
Best of 2024 in Vision [LS Live @ NeurIPS]
Latent Space
Mentioned as a multimodal model that went mainstream during 2024, highlighting the trend of LLMs incorporating vision capabilities.

0 to over $8M ARR in 2 months as a Claude Wrapper (Bolt.new, Qodo)
Latent Space
An AI model from OpenAI mentioned in the context of its performance on Olympiad-level problems, and how AlphaCodium improves upon it by breaking down tasks.

In the Arena: How LMSys changed LLM Benchmarking Forever
Latent Space
A recent model that showed significant improvements and challenged the idea of benchmark saturation, noted for its slower interface latency.
![[Paper Club] Molmo + Pixmo + Whisper 3 Turbo - with Vibhu Sapra, Nathan Lambert, Amgadoz](https://i.ytimg.com/vi/8BN9CdIYaqc/maxresdefault.jpg)
[Paper Club] Molmo + Pixmo + Whisper 3 Turbo - with Vibhu Sapra, Nathan Lambert, Amgadoz
Latent Space
A multimodal model from OpenAI. It ranked highest in human preference evaluations for vision-language models, followed by Molmo-72B.

Production AI Engineering starts with Evals
Latent Space
OpenAI's multimodal flagship model, whose capabilities are leading to a shift where complex reasoning and agentic logic will be integrated directly into the model, rather than requiring external frameworks.

Building AGI in Real Time (OpenAI Dev Day 2024)
Latent Space
A model that Co-op Labs successfully fine-tuned to achieve higher scores than o1 on SWE-bench, demonstrating that older models can be enhanced through custom reasoning.

AI CEO: ‘Stock Crash Could Stop AI Progress’, Llama 4 Anti-climax + ‘Superintelligence in 2027’ ...
AI Explained

NBA Gambling Scandal, Billionaire Tax, Tesla's Future, Amazon Robots, AWS Outage, Dangerous AI Bias
All-In Podcast

Scaling Test Time Compute to Multi-Agent Civilizations — Noam Brown, OpenAI
Latent Space
A model that is discussed as passing the Turing test, with its capabilities improving since 2022. It's also mentioned in the context of agentic systems and conversational AI.

GPT 4.1: The New OpenAI Workhorse
Latent Space
The previous generation model line that GPT-4.1 significantly improves upon, especially in instruction following and coding.

New DeepSeek Research - The Future Is Here!
Two Minute Papers
Previous large model; the DeepSeek small model reportedly beats it by up to ~6x on competition-style math questions.

Fullstack-Bench: The Eval for Coding Agents — with Sujay Jayakar, Chief Scientist, Convex
Latent Space
An AI model mentioned as performing better than GPT-4 on Convex evals, though not a 'slam dunk' improvement.

GPT-4o launches, Glue demo, Ohalo breakthrough, Druck's Argentina bet, did Google kill Perplexity?
All-In Podcast
OpenAI's new multimodal AI model, 'Omni,' which processes audio, text, images, and video simultaneously, offering faster, cheaper, and more conversational interactions with improved performance over previous versions.

Jason Boehmig, CEO of Ironclad on Balancing Risk, Innovation, and AI Opportunity in the Legal Field
AssemblyAI
The latest advanced model from OpenAI, offering benefits over GPT-3 and considered superior to forked models, especially when combined with techniques like RAG for specific applications like legal data extraction.

The Utility of Interpretability — Emmanuel Amiesen
Latent Space
An OpenAI model mentioned in the context of interpretability research and the question of whether more investment in interpretability could have prevented unexpected behaviors.

GPT-4.5 = Big Model Energy | YC Decoded
Y Combinator
An earlier OpenAI model, used for benchmark comparisons against GPT-4.5, showing lower accuracy and higher hallucination rates.