The AI Explainer, Part 2: Inside the LLM

A large language model is not a brain, a search engine, or a database. It's a stack of mathematical layers shaped in three distinct phases to predict the next token well enough that the result looks like thought. Each phase leaves a fingerprint on what the model can and cannot do, and most of the public confusion about LLMs traces back to collapsing those phases into a single fuzzy idea called "AI."

Part 2 of the trilogy opens the object. Tokens, not words. Pretraining as fluency-without-knowledge. Why prompts work at all. The post-training pass that turns a base model into an assistant. Why these things hallucinate, forget, and sound confident anyway. And why retrieval and memory are software features wrapped around the model, not properties of it. Six articles on the interior.

In This Series

Why LLMs Can't Count the R's in Strawberry

Pretraining Builds Fluency, Not Knowledge

Random Labels Work Almost As Well

Why a 1.3B Model Beat GPT-3

Plausible, Not True

Your Chatbot Doesn't Remember You
