These things are so tricky because everyone has a seemingly conflicting experience. Part of the fun I guess!
Range of 48-73.5 (peak 53.1+% higher than trough) with a single day shift of ~30%.
You suggest people are usually influenced more by narrative than data, but provide a narrative-heavy, data-light comment, e.g. "always" "know" "mostly steady" (hazy terms for data) "cannot beat" "evil companies" "meme strong".
A followup defining "mostly" and "steady" more clearly, and your purpose in writing in a narrative-shaping style would be helpful.
Opus 4.7 + Rust is a killer combo.
I use Codex when Claude Code is down, and I only began using Claude when ChatGPT was down
yes codex is very fast, I go back to Claude for now
Heck I prefer DeepSeek to both of those.
I was running deepseek through claude's code agent harness. Maybe it works better through a different tool?
Harness also matters, and also provider. I was using openrouter and switched to the Deepseek api and suddenly all the tool call issues I was having resolved themselves. Flash is so damn fast at doing stuff like generating boilerplate I can’t go back to the bigger slower models.
Dario and co seem to be on some elevated pedestal - us mere mortals are beneath them - and they have this scattershot devrel where each engineer has their own X way of communicating to the public often at odds with each other.
I loved Sonnet and Opus fwiw but not anymore.
My systems are hitting exponential delay retries, so this might not get better because retries overload things again.
> {'type': 'error', 'error': {'details': None, 'type': 'overloaded_error', 'message': 'Overloaded'}, 'request_id': 'req_ ...
I can see a weird spike in my cache hit-rate a few minutes before, so this might actually be some extra caching they have thrown in.