B

Byte Pair Encoding (BPE)

Tool / ProductMentioned in 1 video

Tokenization algorithm (merging common byte/byte-pair sequences) used to build large vocabularies (~100k tokens).