What is a context window?
LLM Engineer Interview: Tokenization, BPE, SentencePiece, and Token Counting in Production
Audio flashcard · 0:21 · Nortren
The context window is the maximum number of tokens an LLM can attend to at once, counting both the input prompt and the generated output. As of 2026, frontier models support context windows from 128 thousand up to several million tokens. Larger contexts enable longer conversations and longer-document processing, but they cost more in compute and memory: with standard self-attention, compute grows roughly quadratically with sequence length, and the key-value cache grows linearly with it.
docs.anthropic.com
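Because prompt and output share one window, a common production check is to budget the output against what the prompt leaves free. The sketch below illustrates that arithmetic only. The function names, the `reserve` parameter, and the 128,000-token window are hypothetical choices for illustration, and the prompt token count is assumed to come from whichever tokenizer matches the model.

```python
def remaining_output_budget(context_window: int, prompt_tokens: int, reserve: int = 0) -> int:
    """Return how many tokens are left for generation (hypothetical helper).

    context_window: total tokens the model accepts (prompt + output).
    prompt_tokens:  tokens already used by the input, counted with the model's tokenizer.
    reserve:        optional safety margin, e.g. for system or formatting tokens.
    """
    if context_window <= 0:
        raise ValueError("context_window must be positive")
    if prompt_tokens < 0 or reserve < 0:
        raise ValueError("prompt_tokens and reserve must be non-negative")
    # Clamp at zero: an oversized prompt leaves no room for output.
    return max(context_window - prompt_tokens - reserve, 0)


def fits_in_window(context_window: int, prompt_tokens: int, requested_output_tokens: int) -> bool:
    """True if the prompt plus the requested output stays within the window."""
    return prompt_tokens + requested_output_tokens <= context_window


# Illustrative numbers only: a 128,000-token window with a 100,000-token prompt.
budget = remaining_output_budget(128_000, 100_000)
print(budget)
print(fits_in_window(128_000, 100_000, 30_000))
```

With these illustrative numbers the prompt leaves 28,000 tokens for generation, so a request for 30,000 output tokens would not fit and the caller would need to shorten the prompt or lower the output limit.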