Star technique

Concept

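A technique discussed by Peter Liu in which a model is fine-tuned on its own outputs. Only the better outputs are kept as training data, particularly those that lead to correct answers, so that the fine-tuned model performs better than the one that generated them.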
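The selection step described above can be sketched as follows. This is a minimal illustration, not the method as presented in the video: the names `collect_self_training_data` and `toy_generate`, the exact-match correctness check, and the toy addition problems are all assumptions made for the example, and the fine-tuning step itself is left out.

```python
import random
from typing import Callable, List, Tuple

Example = Tuple[str, str]  # (question, reference answer)
Sample = Tuple[str, str]   # (reasoning text, final answer)


def collect_self_training_data(
    generate: Callable[[str], Sample],
    problems: List[Example],
    samples_per_problem: int = 4,
) -> List[Tuple[str, str]]:
    """Keep only the model's own outputs whose final answer matches the reference.

    Hypothetical helper: `generate` stands in for sampling from the model.
    """
    kept = []
    for question, reference in problems:
        for _ in range(samples_per_problem):
            reasoning, answer = generate(question)
            # Assumed filter: exact match on the final answer.
            if answer.strip() == reference.strip():
                # The training target is the model's own output, not a human-written one.
                kept.append((question, f"{reasoning}\nAnswer: {answer}"))
                break  # one correct sample per problem is enough for this sketch
    return kept


_rng = random.Random(0)


def toy_generate(question: str) -> Sample:
    """Toy stand-in for a model: adds two numbers, sometimes off by one."""
    a, b = (int(t) for t in question.split("+"))
    guess = a + b + _rng.choice([0, 0, 1])
    return f"{a} plus {b} is {guess}", str(guess)


problems = [("2+3", "5"), ("10+7", "17"), ("4+4", "8")]
dataset = collect_self_training_data(toy_generate, problems)
# `dataset` would then be passed to an ordinary supervised fine-tuning run.
```

In practice the generate, filter, and fine-tune steps would be repeated, with each round sampling from the most recently fine-tuned model.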

Mentioned in 1 video