Dagger

Concept

An algorithm from RL that addresses the train-test mismatch by having the learner query the teacher for feedback on its own generated data samples.

Mentioned in 1 video