Process Supervision

Concept

A method that involves labeling or evaluating intermediate reasoning steps to train models to produce better reasoning traces.

Mentioned in 1 video