Model Architecture
5 video summaries
Videos About Model Architecture

Build and Train an LLM with JAX
DeepLearningAI

The Unreasonable Effectiveness of Reasoning Distillation: using DeepSeek R1 to beat OpenAI o1
Latent Space
![[Paper Club] Molmo + Pixmo + Whisper 3 Turbo - with Vibhu Sapra, Nathan Lambert, Amgadoz](https://i.ytimg.com/vi/8BN9CdIYaqc/maxresdefault.jpg)
[Paper Club] Molmo + Pixmo + Whisper 3 Turbo - with Vibhu Sapra, Nathan Lambert, Amgadoz
Latent Space

The 10,000x Yolo Researcher Metagame — with Yi Tay of Reka
Latent Space

How Scaling Laws Will Determine AI's Future | YC Decoded
Y Combinator