DAgger (Dataset Aggregation)

Concept (mentioned in 1 video)

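An imitation learning algorithm from the RL literature that addresses the train-test mismatch: a learner trained only on the teacher's demonstrations ends up in states the teacher never visited once it acts on its own. DAgger closes this gap by having the learner run its own policy, query the teacher for the correct action on the states it generated, add those labeled samples to the training set, and retrain.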
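The loop can be sketched in a few lines of Python. This is only an illustrative toy, not a reference implementation: the 1-D environment, the `expert_action` teacher, the `TablePolicy` learner, and every parameter name are assumptions made up for the example. It also simplifies the usual formulation by letting the learner act entirely on its own after the first iteration, rather than mixing teacher and learner actions.

```python
import random


def expert_action(state):
    """Hypothetical teacher: always step toward the origin."""
    if state > 0:
        return -1
    if state < 0:
        return 1
    return 0


def step(state, action, rng):
    """Toy dynamics: apply the action plus occasional noise, clipped to [-10, 10]."""
    noise = rng.choice([-1, 0, 0, 1])
    return max(-10, min(10, state + action + noise))


class TablePolicy:
    """Hypothetical learner: a lookup table with a nearest-state fallback."""

    def __init__(self):
        self.table = {}

    def fit(self, dataset):
        # The teacher is deterministic, so each state has a single label.
        self.table = {state: action for state, action in dataset}

    def act(self, state):
        if not self.table:
            return 0
        if state in self.table:
            return self.table[state]
        # Unseen state: copy the action of the closest labeled state.
        nearest = min(self.table, key=lambda s: abs(s - state))
        return self.table[nearest]


def rollout(policy_fn, start, horizon, rng):
    """Run a policy and return the states it visited."""
    states, state = [], start
    for _ in range(horizon):
        states.append(state)
        state = step(state, policy_fn(state), rng)
    return states


def dagger(iterations=5, horizon=20, seed=0):
    rng = random.Random(seed)
    dataset = []
    policy = TablePolicy()

    # Iteration 0: plain behavior cloning on the teacher's own trajectory.
    for s in rollout(expert_action, rng.randint(-10, 10), horizon, rng):
        dataset.append((s, expert_action(s)))
    policy.fit(dataset)

    for _ in range(iterations):
        # 1. The learner generates data with its own policy.
        visited = rollout(policy.act, rng.randint(-10, 10), horizon, rng)
        # 2. The teacher labels the states the learner actually reached.
        dataset.extend((s, expert_action(s)) for s in visited)
        # 3. Retrain on the aggregated dataset, old and new samples together.
        policy.fit(dataset)

    return policy, dataset


if __name__ == "__main__":
    policy, dataset = dagger()
    print(len(dataset), "labeled samples collected")
```

The key design point is step 2: labels come from the teacher, but the states come from the learner, so the training distribution gradually matches what the learner sees at test time.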