encoder.json & vocab.bpe (OpenAI tokenizer artifacts)

Software / App

Files used by OpenAI to store the trained tokenizer: encoder (ID→string) and vocab.bpe (merge list). Described as the two items needed to represent a trained tokenizer.

Mentioned in 1 video