G
GPT-4 tokenizer whitespace improvements (design choice)
Tool / ProductMentioned in 1 video
Design choice in CL100k/GPT‑4 tokenizer to group repeated spaces (improves code density), explicitly shown with Python indentation examples.
Design choice in CL100k/GPT‑4 tokenizer to group repeated spaces (improves code density), explicitly shown with Python indentation examples.