Andrej Karpathy deep dives into tokenization for LLMs, building a BPE tokenizer from scratch and explaining why tokenization matters.