Satinder Singh

Person

Influential reinforcement learning researcher at DeepMind and former student of Andy Barto, who was particularly impressed with AlphaGo Zero's ability to learn purely from self-play.

Mentioned in 1 video