The downside (not noted in the article, but noted by others here) is cost. It burns tokens at an insane rate, the tokens are expensive, and the subagent flows you need to get it to tackle large problems with high accuracy cost even more. It is also much "slower" for large-scale problems because of context limitations -- it has to constantly rediscover context for each part of the problem, and to keep it accurate you either need to wipe its context before moving on to the next small part or launch even more agents. For mathematical proofs like these, where the context needed to understand the problem and proof (beyond what's already in its training data) is small and the problems are considered "important" enough, this might not be an issue, but for many of the tasks I would like to use it for (ensuring correctness of code that touches large codebases, or validating subtle assumptions) it definitely is.
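To make the context-wiping point concrete, here's a rough sketch of what that kind of orchestration loop tends to look like. The names and the run_agent helper are hypothetical placeholders, not any particular vendor's API; the point is just that every fresh-context call has to re-read the shared background.

    # Hypothetical orchestration loop: split a big task into small parts and
    # give each part a fresh context, paying the "rediscovery" cost every time.

    def run_agent(prompt: str) -> str:
        """Placeholder for a call to the model with a brand-new context window."""
        raise NotImplementedError  # swap in your provider's API call here

    def solve_large_task(task: str, parts: list[str]) -> list[str]:
        results: list[str] = []
        for part in parts:
            # Each call starts from scratch, so the shared background (the
            # overall task, prior results) must be re-sent and re-read, which
            # is where the extra tokens and the "slowness" come from.
            prompt = (
                f"Overall task: {task}\n"
                f"Previously completed parts: {results}\n"
                f"Now do only this part: {part}"
            )
            results.append(run_agent(prompt))
        return results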
So I think it will be a while before the impressive capabilities of these models really percolate into our lives as programmers, unless you're one of the lucky ones given unlimited access to 5.5 Pro.