Reinforcement Learning

30 video summaries

Build a research pod on Reinforcement Learning.

30 videos curated. Save them to your own pod, ask any question across the body of expert opinion, and connect it to Claude or ChatGPT.

Get Started Free

Videos About Reinforcement Learning

How Dopamine & Serotonin Shape Decisions, Motivation & Learning | Dr. Read Montague

How Dopamine & Serotonin Shape Decisions, Motivation & Learning | Dr. Read Montague

Andrew Huberman

⚡️Factorio Learning Environment: the ultimate Game Agent Eval — Jack Hopkins

⚡️Factorio Learning Environment: the ultimate Game Agent Eval — Jack Hopkins

Latent Space

⚡️Multi-Turn RL for Multi-Hour Agents — with Will Brown, Prime Intellect

⚡️Multi-Turn RL for Multi-Hour Agents — with Will Brown, Prime Intellect

Latent Space

The #1 SWE-Bench Verified Agent

The #1 SWE-Bench Verified Agent

Latent Space

How Claude Plays Pokémon was made

How Claude Plays Pokémon was made

Latent Space

Can We Contain Artificial Intelligence?: A Conversation with Mustafa Suleyman (Episode #332)

Can We Contain Artificial Intelligence?: A Conversation with Mustafa Suleyman (Episode #332)

Sam Harris

Stanford Robotics Seminar ENGR319 | Spring 2026 | Interactive Autonomy

Stanford Robotics Seminar ENGR319 | Spring 2026 | Interactive Autonomy

Stanford Online

5 Papers That Show Where AI Research Is Heading Right Now

5 Papers That Show Where AI Research Is Heading Right Now

Y Combinator

⚡️Every product of the future will be a living system — Ronak Malde, Trajectory.ai

⚡️Every product of the future will be a living system — Ronak Malde, Trajectory.ai

Latent Space

Stanford MS&E435 Economics of the AI Supercycle | Spring 2026 | Enterprise Internal Knowledge

Stanford MS&E435 Economics of the AI Supercycle | Spring 2026 | Enterprise Internal Knowledge

Stanford Online

Cooking with OpenAI’s Research Chief: AGI, o1, Evals, and Scaling Laws — Mark Chen

Cooking with OpenAI’s Research Chief: AGI, o1, Evals, and Scaling Laws — Mark Chen

Latent Space

Stanford CS336 Language Modeling from Scratch | Spring 2026 | Lecture 16: Post-Training - RLVR

Stanford CS336 Language Modeling from Scratch | Spring 2026 | Lecture 16: Post-Training - RLVR

Stanford Online

Building Dota Bots That Beat Pros - OpenAI's Greg Brockman, Szymon Sidor, and Sam Altman

Building Dota Bots That Beat Pros - OpenAI's Greg Brockman, Szymon Sidor, and Sam Altman

Y Combinator

An AI Primer with Wojciech Zaremba

An AI Primer with Wojciech Zaremba

Y Combinator

The Engineering Unlocks Behind DeepSeek | YC Decoded

The Engineering Unlocks Behind DeepSeek | YC Decoded

Y Combinator

Stanford CS25: Transformers United V6 I From Next-Token Prediction to Next-Generation Intelligence

Stanford CS25: Transformers United V6 I From Next-Token Prediction to Next-Generation Intelligence

Stanford Online

Sergey Levine: Robotics and Machine Learning | Lex Fridman Podcast #108

Sergey Levine: Robotics and Machine Learning | Lex Fridman Podcast #108

Lex Fridman

Matt Botvinick: Neuroscience, Psychology, and AI at DeepMind | Lex Fridman Podcast #106

Matt Botvinick: Neuroscience, Psychology, and AI at DeepMind | Lex Fridman Podcast #106

Lex Fridman

Ilya Sutskever: Deep Learning | Lex Fridman Podcast #94

Ilya Sutskever: Deep Learning | Lex Fridman Podcast #94

Lex Fridman

Deep Learning State of the Art (2020)

Deep Learning State of the Art (2020)

Lex Fridman

Yann LeCun: Deep Learning, ConvNets, and Self-Supervised Learning | Lex Fridman Podcast #36

Yann LeCun: Deep Learning, ConvNets, and Self-Supervised Learning | Lex Fridman Podcast #36

Lex Fridman

George Hotz: Comma.ai, OpenPilot, and Autonomous Vehicles | Lex Fridman Podcast #31

George Hotz: Comma.ai, OpenPilot, and Autonomous Vehicles | Lex Fridman Podcast #31

Lex Fridman

MIT 6.S093: Introduction to Human-Centered Artificial Intelligence (AI)

MIT 6.S093: Introduction to Human-Centered Artificial Intelligence (AI)

Lex Fridman

Greg Brockman: OpenAI and AGI | Lex Fridman Podcast #17

Greg Brockman: OpenAI and AGI | Lex Fridman Podcast #17

Lex Fridman

Page 1 of 2Next