Context window
Unit: tokens. The maximum number of tokens the model can hold in context, input and output combined.
What is a context window?
The context window is the maximum number of tokens an LLM can process in a single call, input and output combined. Think of it as the model's working memory: like a desk, it holds everything the model can see at once. A larger desk fits more material, and a larger context window fits more context, making more complex tasks possible in one pass.
Why it matters
Context window size determines what you can accomplish in a single API call. A small context window forces you to chunk documents, summarise earlier turns, or use retrieval systems. A large context window lets you process entire codebases or full contracts in one pass. For developers building applications, this is a hard engineering constraint that shapes architecture decisions every day.
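To make that constraint concrete, here is a minimal chunking sketch. It assumes the rough 4-characters-per-token heuristic discussed later on this page; the `estimate_tokens` and `chunk_text` helpers are illustrative, not part of any provider SDK, and real pipelines would use the provider's tokeniser plus overlap between chunks.

```python
def estimate_tokens(text: str) -> int:
    # Rough heuristic: ~4 characters per token in English text.
    return max(1, len(text) // 4)

def chunk_text(text: str, max_tokens: int) -> list[str]:
    """Split text into pieces that each fit a token budget.

    Splits on paragraph boundaries; a sketch only (no overlap,
    no sentence-aware splitting, oversized paragraphs pass through).
    """
    chunks, current = [], ""
    for para in text.split("\n\n"):
        candidate = (current + "\n\n" + para) if current else para
        if estimate_tokens(candidate) <= max_tokens:
            current = candidate
        else:
            if current:
                chunks.append(current)
            current = para
    if current:
        chunks.append(current)
    return chunks
```

With a small context window you would call `chunk_text(document, budget)` and summarise or process each piece separately; with a large window the whole document goes in one call.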
Where models stand
- 2,000,000 tokens
- 1,000,000 tokens
- 1,000,000 tokens
- #4 o1: 200,000 tokens
- 200,000 tokens
Data available for 30 of 30 tracked models. Last updated 2026-03-24.
How sourc.dev tracks this
sourc.dev tracks context window through its automated monitoring pipeline. Data is collected on a regular schedule, compared against previous values, and any changes are recorded in the history table with full provenance — source URL, effective date, and verification timestamp. Nothing is overwritten. The pipeline ensures this attribute stays current without manual intervention.
Frequently asked questions
What happens when you exceed the context window?
The API will either return an error or silently truncate the oldest tokens. The exact behaviour depends on the provider. OpenAI returns a 400 error. Anthropic returns a similar error. If you are building an application, manage token counts proactively.
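One common way to manage token counts proactively is to trim the oldest conversation turns before each call. The sketch below assumes a chat-style message list and a caller-supplied `count_tokens` function (standing in for a provider tokeniser); none of the names are from a specific SDK.

```python
def trim_history(messages, max_tokens, count_tokens):
    """Keep the system prompt plus the newest messages that fit.

    messages: list of {"role": ..., "content": ...} dicts.
    count_tokens: token counter for a string (provider-specific).
    """
    system = [m for m in messages if m["role"] == "system"][:1]
    rest = [m for m in messages if m["role"] != "system"]
    budget = max_tokens - sum(count_tokens(m["content"]) for m in system)
    kept = []
    # Walk from newest to oldest, keeping messages while they fit.
    for m in reversed(rest):
        cost = count_tokens(m["content"])
        if cost > budget:
            break
        kept.append(m)
        budget -= cost
    return system + list(reversed(kept))
```

Dropping whole messages from the front is the simplest policy; production systems often summarise the dropped turns instead of discarding them outright.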
Does a larger context window mean better quality?
Not necessarily. Research has shown a "lost in the middle" effect: models attend well to the beginning and end of long inputs but less to content in the middle. A model with a 200K context window may not use all of those tokens with equal quality.
How are context windows measured?
In tokens. A token is roughly 3-4 characters in English, or about 0.75 words. Different models use different tokenisers, so the same text may produce different token counts. Most providers offer tokeniser libraries for pre-counting.
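The two heuristics above can be turned into quick estimators. These are back-of-envelope figures for English text only; for exact counts you need the model's own tokeniser library.

```python
def tokens_from_chars(text: str) -> int:
    # ~4 characters per token in typical English text.
    return round(len(text) / 4)

def tokens_from_words(text: str) -> int:
    # ~0.75 words per token, i.e. ~1.33 tokens per word.
    return round(len(text.split()) / 0.75)
```

The two estimates will usually land close to each other but rarely agree exactly, which is itself a reminder that only the model's tokeniser gives an authoritative count.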