Process Supervision
Concept
A training method that labels or evaluates each intermediate reasoning step, rather than only the final answer, so that the model learns to produce better reasoning traces.
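To make the idea of step-level labels concrete, here is a minimal sketch that contrasts a single outcome signal with per-step process signals for one reasoning trace. Everything in it is an assumption made for illustration: the names Step, outcome_signal, process_signal, and first_error are hypothetical, the example trace and its labels are made up, and real systems typically obtain step labels from human annotators or a learned reward model rather than hard-coding them.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Step:
    """One intermediate reasoning step plus a hypothetical correctness label."""
    text: str
    label: int  # assumed convention: 1 = step is correct, 0 = step is incorrect


def outcome_signal(final_answer: str, reference: str) -> float:
    """Outcome supervision: one signal for the whole trace, based only on the final answer."""
    return 1.0 if final_answer == reference else 0.0


def process_signal(steps: List[Step]) -> List[float]:
    """Process supervision: one signal per intermediate step."""
    return [float(step.label) for step in steps]


def first_error(steps: List[Step]) -> Optional[int]:
    """Index of the first step labeled incorrect, or None if every step is labeled correct."""
    for index, step in enumerate(steps):
        if step.label == 0:
            return index
    return None


# Made-up trace for the question "What is 12 * 4 + 10?" (reference answer: 58).
trace = [
    Step("12 * 4 = 48", 1),
    Step("48 + 10 = 59", 0),  # arithmetic slip
    Step("Answer: 59", 0),    # inherits the earlier error
]

print(outcome_signal("59", "58"))  # a single number for the whole trace
print(process_signal(trace))       # one number per step
print(first_error(trace))          # where the reasoning first went wrong
```

The outcome signal only says that the trace failed, while the step-level signals also show where it failed, which is the extra information process supervision gives a model during training.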