Context Window #186494
Replies: 4 comments 2 replies
Claude Opus 4.6 "supporting 1M context" describes the model's maximum architectural capability; in the actual product, practical limits are set for speed, cost, and reliability. At 1M tokens, compute and latency grow dramatically, and even if the model can technically read that much, its effective recall and consistency can become less stable. That's why the product caps it at 128K input and 64K output: to keep performance predictable, inference fast enough, and pricing manageable.
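A client that wants to stay under a product cap like this can estimate token usage and trim older conversation turns before sending a request. Here is a minimal sketch in Python, assuming a rough heuristic of ~4 characters per token; real tokenizers count differently, so treat the numbers as approximate:

```python
# Rough token-budget trimming: keep the most recent messages that fit
# within an input cap (e.g. 128K tokens). The ~4 chars/token ratio is a
# heuristic assumption, not the model's actual tokenizer.

CHARS_PER_TOKEN = 4  # assumption; varies by language and content

def estimate_tokens(text: str) -> int:
    """Approximate token count from character length."""
    return max(1, len(text) // CHARS_PER_TOKEN)

def trim_to_budget(messages: list[str], max_input_tokens: int = 128_000) -> list[str]:
    """Drop the oldest messages until the estimated total fits the cap."""
    kept: list[str] = []
    total = 0
    for msg in reversed(messages):  # walk newest-first
        cost = estimate_tokens(msg)
        if total + cost > max_input_tokens:
            break
        kept.append(msg)
        total += cost
    return list(reversed(kept))  # restore chronological order

history = ["old " * 200_000, "recent question?"]  # first message is ~200K tokens
print(len(trim_to_budget(history)))  # → 1 (the oversized old message is dropped)
```

In practice you would count tokens with the provider's own tokenizer or token-counting endpoint rather than a character heuristic, but the budgeting logic is the same.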
Claude Opus 4.6 can support up to ~1M tokens at the model level, but Copilot Chat does not expose the model's maximum context. Copilot applies its own caps, which is why you see 128K input / 64K output.

The important distinction: model capability ≠ product limit. Vendors frequently gate large contexts, and Copilot currently uses a constrained configuration of the Claude models.

Why Copilot does this: GitHub optimizes for interactive developer workflows, not massive document ingestion. If you truly need more than 128K, use Claude directly through Anthropic's API or a provider that exposes the 1M context; Copilot is not the right surface today.
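For the "use the API directly" route, here is a hedged sketch of the request shape only, with no network call. The beta header value shown (`context-1m-2025-08-07`) is the flag Anthropic documented for its earlier 1M-context beta and may differ for Opus 4.6, and the model ID is hypothetical, so verify both against current Anthropic documentation:

```python
# Sketch of an Anthropic Messages API request with an extended-context
# beta header. Header value and model ID are assumptions; check the
# current API docs before use.

def build_long_context_request(model: str, prompt: str) -> tuple[dict, dict]:
    """Return (headers, payload) for a hypothetical long-context call."""
    headers = {
        "x-api-key": "YOUR_API_KEY",               # placeholder credential
        "anthropic-version": "2023-06-01",          # Messages API version date
        "anthropic-beta": "context-1m-2025-08-07",  # assumed 1M-context flag
    }
    payload = {
        "model": model,
        "max_tokens": 64_000,  # output cap; model-dependent
        "messages": [{"role": "user", "content": prompt}],
    }
    return headers, payload

headers, payload = build_long_context_request(
    "claude-opus-4-6",  # hypothetical model ID for illustration
    "Summarize this very long document...",
)
```

The same payload works through the official `anthropic` SDK; the dict form here just makes the extra beta header explicit.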
The 1M-token context window for both Claude Opus and Sonnet 4.6 just moved from beta to GA. Copilot should make use of it as well.
Topic Area: Question
Copilot Feature Area: VS Code
Body:
Why does the Claude Opus 4.6 context window in Copilot only allow 128K input and 64K output tokens when the model can handle up to 1M?
