Skip to content
Better HN
Top
Best
Ask
Show
New
Jobs
Search
⌘K
0 points
jryio
3mo ago
0 comments
Save
Share
1 million tokens is great until you notice the long context scores fall off a cliff past 256K and the rest is basically vibes and auto compacting.
0 comments
3 comments · 2 top-level
top
newest
oldest
olliepro
3mo ago
· 1 in thread
I bet they lack good long context training data and need to start a flywheel of collecting it via their api (from willing customers)
jbergqvist
3mo ago
This would be my guess too. It can probably be generated synthetically or via agentic rollouts, but high quality long context examples where outputs meaningfully depend on long-range interactions probably remain scarce
rrr_oh_man
3mo ago
It's the same now with Gemini as well. Unfortunately. :(
j
/
k
navigate · click thread line to collapse