LLM

Articles on large language models — attention, training, inference, evaluation, and the systems around them. Standalone pieces rather than a strict curriculum.

About This Series

Each article tackles a specific question or mental model around LLMs — what attention is really doing, how training loops actually run, why inference behaves the way it does, and the practical engineering that surrounds production deployments. Read in any order.

Articles

Attention resolves through DNS

Transformer attention from first principles using the DNS mnemonic — Dot, Normalize, Sum — with interactive visualizations.

May 31, 2026• 26 min read

Read

Life of

A narrative tour of one token from tokenization through scratchpad reasoning to output stripping. The hidden middle is where the interesting stuff lives.

June 3, 2026• 16 min read

Read

More on the way

New articles are added as topics come up. Follow the blog or check back for the next piece.

LLM

About This Series

Articles

Attention resolves through DNS

Life of TokenπToken

More on the way

Life of