Dagger
Concept
An algorithm from RL that addresses the train-test mismatch by having the learner query the teacher for feedback on its own generated data samples.
Mentioned in 1 video
An algorithm from RL that addresses the train-test mismatch by having the learner query the teacher for feedback on its own generated data samples.