GPT-4
OpenAI multimodal large language model
Common Themes
Videos Mentioning GPT-4

Godfather of AI: I Tried to Warn Them, But We’ve Already Lost Control! Geoffrey Hinton
The Diary Of A CEO
Advanced language model cited as already knowing far more than a typical human in many domains.

E174: Inflation stays hot, AI disclosure bill, Drone warfare, defense startups & more
All-In Podcast
A large language model developed by OpenAI, trained on a vast amount of data, including potentially copyrighted material.

The Agent Reasoning Interface: Claude, ChatGPT Canvas, Tasks, Operator — with Karina Nguyen, OpenAI
Latent Space
Mentioned in the context of comparing model card numbers and evaluation settings, highlighting the difficulty of apples-to-apples comparisons across different model versions.
![[LLM Paper Club] Llama 3.1 Paper: The Llama Family of Models](https://i.ytimg.com/vi/TgLSYIBoX5U/maxresdefault.jpg)
[LLM Paper Club] Llama 3.1 Paper: The Llama Family of Models
Latent Space
A large language model that is used as a benchmark for comparison with Llama 3.1, particularly in coding and reasoning tasks.

Training Llama 2, 3 & 4: The Path to Open Source AGI — with Thomas Scialom of Meta AI
Latent Space
A leading large language model against which Llama models are often compared; Llama 3 aims to close the gap with GPT-4.

Beating OpenAI and Anthropic by Looking At Data: the new #1 on SWE-Bench w/ W&B CTO Shawn Lewis
Latent Space
A previous model from OpenAI, discussed as being more adept at agentic tasks over longer sequences compared to GPT-4o.

E170: Tech's Vibe Shift, TikTok ban debate, Vertical AI boom, Florida bans lab-grown meat & more
All-In Podcast
OpenAI's advanced large language model, speculated to be the foundation for Devon, the AI software engineer.

The 10,000x Yolo Researcher Metagame — with Yi Tay of Reka
Latent Space
A model whose architecture is speculated to have pluggable experts, particularly for vision, suggesting a modular approach to multimodal capabilities.

E143: Nvidia smashes earnings, Arm walks the plank, M&A market, Vivek dominates GOP debate & more
All-In Podcast
OpenAI's most advanced language model, mentioned as the benchmark against which fine-tuned GPT-3.5 turbo can now compete on narrow tasks.

E167: Google's Woke AI disaster, Nvidia smashes earnings (again), Groq's LPU breakthrough & more
All-In Podcast
Mentioned as a benchmark for comparison with Google's Gemini Ultra model.

Beating Google at Search with Neural PageRank and $5M of H200s — with Will Bryk of Exa.ai
Latent Space
A language model that Exa uses for certain tasks like labeling queries, showcasing its ability to understand complex meaning beyond keywords.

This World Does Not Exist — Joscha Bach, Karan Malhotra, Rob Haisfield (WorldSim, WebSim, Liquid AI)
Latent Space
Mentioned as part of the OpenAI series of models, characterized as difficult to steer due to heavy RHF (Reinforcement Learning from Human Feedback).

Can We Contain Artificial Intelligence?: A Conversation with Mustafa Suleyman (Episode #332)
Sam Harris
A frontier model expected to become open-source in the coming years, similar to GPT-3.5 and Inflection's models.

Supervise the Process of AI Research — with Jungwon Byun and Andreas Stuhlmüller of Elicit
Latent Space
The fourth generation of OpenAI's GPT models, which enabled new features for Elicit, particularly in processing tabular data.
![[Paper Club] BERT: Bidirectional Encoder Representations from Transformers](https://i.ytimg.com/vi/V64q3p7DNjc/maxresdefault.jpg)
[Paper Club] BERT: Bidirectional Encoder Representations from Transformers
Latent Space
A large language model from OpenAI. Mentioned as a potential initial service before full BERT deployment.

Agents @ Work: Dust.tt — with Stanislas Polu
Latent Space
A major model from OpenAI, its capabilities were recognized by Stanislas Polu as creating significant value.

Why Compound AI + Open Source will beat Closed AI — with Lin Qiao, CEO of Fireworks AI
Latent Space
A powerful language model from OpenAI, discussed as a benchmark or point of comparison for model quality.

Agents @ Work: Lindy.ai (with live demo!)
Latent Space
Mentioned as being overhyped and not ideal for agentic behavior compared to GPT-3.5.

Jared Kushner: Israel-Hamas War, paths forward, macro picture, AI
All-In Podcast
OpenAI's advanced language model, used as a benchmark for Grok's performance.

In the Arena: How LMSys changed LLM Benchmarking Forever
Latent Space
A proprietary large language model from OpenAI that was a benchmark for open-source models like Vicuna.

Personal AI Meetup - Bee, BasedHardware, LangChain LangFriend, Deepgram EmilyAI
Latent Space
An LLM model noted for being slower and having potential latency fluctuations in hosted APIs, but offering better performance when deployed on Azure.

Building the Silicon Brain - Drew Houston of Dropbox
Latent Space
Early access to GPT-4 was a key indicator for Drew Houston that the AI era was truly underway.

Building AGI in Real Time (OpenAI Dev Day 2024)
Latent Space
A previous OpenAI language model, compared to GPT-01 which surpasses it in advanced math and complex coding, but GPT-4 is still suitable for tasks like screenplay writing.
![[Paper Club] Who Validates the Validators? Aligning LLM-Judges with Humans (w/ Eugene Yan)](https://i.ytimg.com/vi/4o_ic83U1Kw/maxresdefault.jpg)
[Paper Club] Who Validates the Validators? Aligning LLM-Judges with Humans (w/ Eugene Yan)
Latent Space
Mentioned as a tool for generating criteria within the EvalGen design.