SentencePiece

Software / App

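Google's tokenizer library, used both to train tokenizers and to run them at inference time (used by LLaMA and others). The video explains how its design differs: it runs BPE over Unicode code points rather than raw bytes, falls back to UTF-8 bytes for code points missing from the vocabulary, and exposes many configuration options.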
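To make the byte-fallback idea concrete, here is a minimal toy sketch in plain Python. It is not SentencePiece's implementation and skips BPE merges entirely: it only shows code points that are in a vocabulary being kept as-is, while unknown code points are split into UTF-8 byte tokens. The function names and the toy vocabulary are hypothetical; the `<0xNN>` token spelling is assumed to mirror the byte pieces found in SentencePiece vocabularies.

```python
def encode_with_byte_fallback(text, vocab):
    """Tokenize per code point; unknown code points become UTF-8 byte tokens."""
    tokens = []
    for ch in text:
        if ch in vocab:
            tokens.append(ch)
        else:
            # Fallback: one token per UTF-8 byte, spelled like <0xEC>.
            tokens.extend(f"<0x{b:02X}>" for b in ch.encode("utf-8"))
    return tokens


def decode(tokens):
    """Invert the encoding by reassembling bytes and decoding as UTF-8."""
    out = bytearray()
    for tok in tokens:
        if len(tok) == 6 and tok.startswith("<0x") and tok.endswith(">"):
            out.append(int(tok[3:5], 16))
        else:
            out.extend(tok.encode("utf-8"))
    return out.decode("utf-8")


# Hypothetical toy vocabulary: only a few Latin characters and the space.
vocab = set("helo ")
tokens = encode_with_byte_fallback("hello 안", vocab)
print(tokens)
print(decode(tokens))
```

The Korean character is not in the toy vocabulary, so it is emitted as three byte tokens instead of an unknown token, and decoding still recovers the original string.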

Mentioned in 1 video