Having a large context window is very different from being able to effectively use a lot of context.
To get great results, it's still very important to manage context well. It doesn't matter that the model allows a very large context window; you can't just throw in the kitchen sink and expect good results.
Even with large contexts there are diminishing returns. Just having the ability to stuff more tokens into context doesn't mean the model can effectively use them. As far as I can tell, they always reach a point at which more information makes things worse.
The real question is its tendency toward context rot, not the size of its context :)
LLMs are supposedly able to load 3 Bibles into their context, but they forget what they were about to do after loading 600 LoC of locale files.
The website clearly lays them out as 400k input and 128k output [1]. I just updated my AI apps to support the new models. I routinely fill the entire context on large code calls. Input is not a "shared" context.
I found 100k was barely enough for a single project without spillover, so 4x allows for linking more adjacent codebases for large-scale analysis.
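To make the "fill the entire input" point concrete, here's a rough sketch of budgeting a prompt against a fixed input cap before a call. This is my own illustration, not anything from a specific API: the `len(text) // 4` token estimate is a common rule of thumb rather than a real tokenizer, and the 400k figure is just the input cap quoted above.

```python
# Sketch: keep only the newest chunks that fit an assumed input budget.
# approx_tokens uses the rough "~4 chars per token" heuristic, NOT a
# real tokenizer; swap in an actual tokenizer for production use.

def approx_tokens(text: str) -> int:
    return max(1, len(text) // 4)

def trim_to_budget(chunks: list[str], budget: int = 400_000) -> list[str]:
    """Keep the most recent chunks whose combined estimate fits the budget."""
    kept, total = [], 0
    for chunk in reversed(chunks):       # walk newest-first
        cost = approx_tokens(chunk)
        if total + cost > budget:
            break                        # oldest chunks get dropped
        kept.append(chunk)
        total += cost
    return list(reversed(kept))          # restore original order
```

Note the budget here applies only to the input side; the output cap (128k above) is a separate limit, which is why input isn't a "shared" context.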