Distributional Temporal Difference Learning

Concept

A reinforcement learning method that represents future reward as a distribution over possible outcomes rather than as a single expected value, which can lead to richer learned representations and faster learning.
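To make the contrast concrete, here is a minimal sketch, not taken from the video, that compares a classical TD value estimate with a quantile-based distributional one. It assumes a hypothetical two-state chain (A leads to B with reward 0, B ends the episode with reward 0 or 10 at equal odds) and uses a tabular quantile-regression update as one possible way to learn the distribution. The function names, state labels, and parameter values are illustrative assumptions.

```python
import random


def quantile_td_update(theta, targets, taus, alpha):
    """Nudge each quantile estimate toward the target samples.

    This is a plain quantile-regression step: an estimate moves up by
    alpha * tau for each target above it and down by alpha * (1 - tau)
    for each target at or below it, averaged over the targets.
    """
    for i, tau in enumerate(taus):
        step = 0.0
        for t in targets:
            step += tau if t > theta[i] else -(1.0 - tau)
        theta[i] += alpha * step / len(targets)


def run(episodes=20000, n_quantiles=4, alpha=0.05, gamma=1.0, seed=0):
    rng = random.Random(seed)
    # Quantile midpoints, e.g. 0.125, 0.375, 0.625, 0.875 for four quantiles.
    taus = [(i + 0.5) / n_quantiles for i in range(n_quantiles)]
    theta = {"A": [0.0] * n_quantiles, "B": [0.0] * n_quantiles}  # distributional
    v = {"A": 0.0, "B": 0.0}                                      # classical TD

    for _ in range(episodes):
        # Hypothetical environment: B terminates with reward 0 or 10.
        r = rng.choice([0.0, 10.0])

        # Transition B -> terminal: the target is just the sampled reward.
        v["B"] += alpha * (r - v["B"])
        quantile_td_update(theta["B"], [r], taus, alpha)

        # Transition A -> B with reward 0: bootstrap from B's estimates.
        v["A"] += alpha * (0.0 + gamma * v["B"] - v["A"])
        targets = [0.0 + gamma * q for q in theta["B"]]
        quantile_td_update(theta["A"], targets, taus, alpha)

    return v, theta


if __name__ == "__main__":
    v, theta = run()
    print("expected value at A:", round(v["A"], 2))
    print("quantiles at A:     ", [round(q, 2) for q in theta["A"]])
```

Under these assumptions the classical estimate settles around the average of the two outcomes, a value the agent never actually receives, while the quantile estimates separate into a low group and a high group that reflect both outcomes.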

Mentioned in 1 video

Videos Mentioning Distributional Temporal Difference Learning

Matt Botvinick: Neuroscience, Psychology, and AI at DeepMind | Lex Fridman Podcast #106


Lex Fridman
