Flamingo
Software / App
A DeepMind model that adds vision capabilities to language, built by freezing Chinchilla's weights and adding new visual components, enabling dialogue about images.
Mentioned in 2 videos
Save the 2 videos on Flamingo to your own pod.
Sign up free to keep building your knowledge base on Flamingo as more episodes are added.
Videos Mentioning Flamingo

Oriol Vinyals: Deep Learning and Artificial General Intelligence | Lex Fridman Podcast #306
Lex Fridman
A DeepMind model that adds vision capabilities to language, built by freezing Chinchilla's weights and adding new visual components, enabling dialogue about images.

Stanford CME296 Diffusion & Large Vision Models | Spring 2026 | Lecture 7 - Evaluation
Stanford Online
A multimodal LLM developed by Google that uses cross-attention where images are given as keys and values, allowing text tokens to interact with encoded images.