S

Star technique

ConceptMentioned in 1 video

A technique discussed by Peter Liu that fine-tunes a model on its own better outputs, particularly those that lead to correct answers, to improve performance.