A narrative tour of a single token — Pi — from raw text through tokenization, special-token wrapping, the forward pass, scratchpad reasoning, and output stripping. With the Life of Pi framing.
Embeddings, projection, Q/K/V, dot product, softmax, weighted sum, positional encoding, matrix shapes, and the intuition behind attention — built up from first principles.