Constrained Decoding
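A technique supported by SGLang for controlling model output with a finite state machine. The state machine is derived from a schema (for example, a JSON schema) by a tool such as XGrammar, and at each decoding step only the tokens it permits can be generated, so the output always matches the schema. Decoding is also faster: wherever the state machine allows exactly one continuation, such as fixed JSON keys and punctuation, those tokens can be appended directly instead of being generated one model step at a time.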
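The idea can be illustrated with a minimal, character-level sketch. This is not SGLang's or XGrammar's API: `build_fsm`, `constrained_decode`, and `toy_model` are hypothetical names, the state machine is hand-built rather than compiled from a schema, and the "model" is a stand-in function. It only shows the two behaviors described above: restricting choices to what the state machine allows, and skipping the model when a single continuation is forced.

```python
from typing import Callable, Dict, List, Tuple

# Toy character-level FSM: state -> {allowed character: next state}.
# Real systems work on tokenizer tokens; characters keep the sketch small.
Fsm = Dict[int, Dict[str, int]]


def build_fsm() -> Tuple[Fsm, int]:
    """Hand-build an FSM accepting strings like {"age": 42} (one or more digits)."""
    fsm: Fsm = {}
    state = 0
    for ch in '{"age": ':               # fixed prefix: one legal character per state
        fsm[state] = {ch: state + 1}
        state += 1
    digits = "0123456789"
    first, more, accept = state, state + 1, state + 2
    fsm[first] = {d: more for d in digits}   # the first digit is mandatory
    fsm[more] = {d: more for d in digits}    # further digits are optional
    fsm[more]["}"] = accept                  # closing brace ends the object
    fsm[accept] = {}
    return fsm, accept


def constrained_decode(
    fsm: Fsm, accept: int, pick: Callable[[List[str], str], str]
) -> Tuple[str, int]:
    """Decode under the FSM; return the text and how often the model was consulted."""
    out: List[str] = []
    state, model_calls = 0, 0
    while state != accept:
        allowed = fsm[state]
        if len(allowed) == 1:
            # Forced transition: append it without consulting the model.
            ch = next(iter(allowed))
        else:
            # Several legal continuations: let the model choose among them only.
            ch = pick(sorted(allowed), "".join(out))
            model_calls += 1
            if ch not in allowed:
                raise ValueError(f"model chose illegal character {ch!r}")
        out.append(ch)
        state = allowed[ch]
    return "".join(out), model_calls


def toy_model(allowed: List[str], prefix: str) -> str:
    """Stand-in for a language model (an assumption): wants to write 42, then close."""
    digits_so_far = sum(c.isdigit() for c in prefix)
    wanted = "42}"[min(digits_so_far, 2)]
    return wanted if wanted in allowed else allowed[0]


fsm, accept = build_fsm()
text, calls = constrained_decode(fsm, accept, toy_model)
print(text, calls)
```

In this toy run the stand-in model is consulted only for the two digits and the closing brace; the eight characters of the fixed prefix are appended directly, which is the source of the speedup.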