topicfoundation

Tokenization

Token = the atomic unit the model sees. Token count = cost + context consumption.

2 hours2 hours3 resources1 prereqs

Models don't see sentences, they see tokens. For GPT/Claude, 1 token ≈ 4 English characters but only ≈ 1.5-2 Turkish characters. Turkish costs more because BPE was trained heavily on English.

Practical:

Same content in EN may use 30-50% fewer tokens
Stripping JSON whitespace saves money
Long-context calls are expensive; send only what's needed

What you'll gain

You'll estimate token counts at a glance and budget cost intentionally.

Prerequisites

How LLMs Work

Transformer architecture, attention, decoding — understand what the model is doing and why.

→

Resources(3)

TTool(2)

OpenAI Tokenizer

OpenAI· en

freeofficial

Anthropic Token Counter

Anthropic· en

freeofficial

VVideo(1)

Karpathy — Let's build the GPT Tokenizer

Andrej Karpathy· 2h 13m· en

free

How LLMs Work

Context Window

Open the full interactive roadmap