s

speculative decoding

ConceptMentioned in 1 video

A technique used to make language model generation faster by having a smaller model predict draft tokens that a larger model then verifies. Cursor uses 'speculative edits' as a variant.