Latent Space Podcast
46K subscribers200 summaries· AI & Tech

DeepSeek V3, SGLang, and the state of Open Model Inference in 2025 (Quantization, MoEs, Pricing)
Jan 19, 2025

OpenAI o1 isn’t a chat model (and that’s the point)
Jan 17, 2025

Beating Google at Search with Neural PageRank and $5M of H200s — with Will Bryk of Exa.ai
Jan 10, 2025

AI Engineering for Art - with comfyanonymous
Jan 4, 2025

2024 Year in Review: The Big Scaling Debate, the Four Wars of AI, Top Themes and the Rise of Agents
Jan 1, 2025

Best of 2024 in Agents (from #1 on SWE-Bench Full, Prof. Graham Neubig of OpenHands/AllHands)
Dec 25, 2024
![2024 in Post-Transformer Architectures: State Space Models, RWKV [Latent Space LIVE! @ NeurIPS 2024]](https://i.ytimg.com/vi/LPe6iC73lrc/maxresdefault.jpg)
2024 in Post-Transformer Architectures: State Space Models, RWKV [Latent Space LIVE! @ NeurIPS 2024]
Dec 24, 2024
![Best of 2024: Synthetic Data / Smol Models, Loubna Ben Allal, HuggingFace [LS Live! @ NeurIPS 2024]](https://i.ytimg.com/vi/AjmdDy7Rzx0/maxresdefault.jpg)
Best of 2024: Synthetic Data / Smol Models, Loubna Ben Allal, HuggingFace [LS Live! @ NeurIPS 2024]
Dec 24, 2024
![Best of 2024: Open Models [LS LIVE! at NeurIPS 2024]](https://i.ytimg.com/vi/jX1nuoTs2WU/maxresdefault.jpg)
Best of 2024: Open Models [LS LIVE! at NeurIPS 2024]
Dec 23, 2024
![Best of 2024 in Vision [LS Live @ NeurIPS]](https://i.ytimg.com/vi/76EL7YVAwVo/maxresdefault.jpg)
Best of 2024 in Vision [LS Live @ NeurIPS]
Dec 22, 2024
![The State of AI Startups in 2024 [LS Live @ NeurIPS]](https://i.ytimg.com/vi/HM1d7kMebEI/maxresdefault.jpg)
The State of AI Startups in 2024 [LS Live @ NeurIPS]
Dec 21, 2024

Windsurf: The Enterprise AI IDE
Dec 13, 2024
![[Paper Club] Weight Streaming on Wafer-Scale Clusters (w/ Sarah Chieng of Cerebras)](https://i.ytimg.com/vi/eNKe04apEaE/maxresdefault.jpg)
[Paper Club] Weight Streaming on Wafer-Scale Clusters (w/ Sarah Chieng of Cerebras)
Dec 7, 2024

0 to over $8M ARR in 2 months as a Claude Wrapper (Bolt.new, Qodo)
Dec 2, 2024
![[Paper Club] Embeddings in 2024: OpenAI, Nomic Embed, Jina Embed, cde-small-v1 - with swyx](https://i.ytimg.com/vi/VIqXNRsRRQo/maxresdefault.jpg)
[Paper Club] Embeddings in 2024: OpenAI, Nomic Embed, Jina Embed, cde-small-v1 - with swyx
Dec 1, 2024
![[Paper Club] DocETL: Agentic Query Rewriting + Eval for Complex Document Processing w Shreya Shankar](https://i.ytimg.com/vi/G8d3txDwLZc/maxresdefault.jpg)
[Paper Club] DocETL: Agentic Query Rewriting + Eval for Complex Document Processing w Shreya Shankar
Nov 29, 2024

The new Claude 3.5 Sonnet, Computer Use, and Building SOTA Agents — with Erik Schluntz, Anthropic
Nov 28, 2024
![[Paper Club] BERT: Bidirectional Encoder Representations from Transformers](https://i.ytimg.com/vi/V64q3p7DNjc/maxresdefault.jpg)
[Paper Club] BERT: Bidirectional Encoder Representations from Transformers
Nov 27, 2024

Why Compound AI + Open Source will beat Closed AI — with Lin Qiao, CEO of Fireworks AI
Nov 25, 2024

Agents @ Work: Lindy.ai (with live demo!)
Nov 15, 2024

Agents @ Work: Dust.tt — with Stanislas Polu
Nov 11, 2024
![[Paper Club] Intro to Diffusion Models and OpenAI sCM: Simple, Stable, Scalable Consistency Models](https://i.ytimg.com/vi/epwgOz8mZMw/maxresdefault.jpg)
[Paper Club] Intro to Diffusion Models and OpenAI sCM: Simple, Stable, Scalable Consistency Models
Nov 2, 2024

In the Arena: How LMSys changed LLM Benchmarking Forever
Nov 1, 2024
![[Paper Club] Upcycling Large Language Models into Mixture of Experts](https://i.ytimg.com/vi/e_mkhFkKPEk/maxresdefault.jpg)
[Paper Club] Upcycling Large Language Models into Mixture of Experts
Oct 29, 2024