Dagger
ConceptMentioned in 1 video
An algorithm from RL that addresses the train-test mismatch by having the learner query the teacher for feedback on its own generated data samples.
An algorithm from RL that addresses the train-test mismatch by having the learner query the teacher for feedback on its own generated data samples.