undefined | Better HN

0 pointsbelter3y ago0 comments

Leetcode (hard) from 0/45 (GPT-3.5) to 3/45 (GPT-4).

The lack of progress here, says a lot more about is NOT happening as an AI paradigm change. Still a glorified pattern matching and pattern creation engine, even if a very impressive one.

0 comments

nextworddev3y ago

Hmm, can the average developer get even 1 out of 45 right, without practice? (zero shot)

mtc0101703y ago

Idk about that. The jump from 0 to 1 may be a whole lot harder than 1 to 45.

bitshiftfaced3y ago

It would be interesting to know how this compares with human 0-shot, single attempt coding tasks.

zamadatix3y ago

The difference I've noticed is the first shot is generally cleaner but the ceiling of what it can correct is limited. If it is given more independent or simple things to correct and it hears about it then you're usually golden but if that thing it has to correct interacts with other constraints then when it shifts approach to fix the issue it is told about it often forgets other things and can break them. Typically this happens on the more complex (as in how interrelated) problems, for complex (as in just a lot of stuff needs to be done) it does fine.

nextworddev3y ago

You can have GPT4 inspect its own errors and make corrections- I'm sure self-reflection works better this time than GPT3.5

zamadatix3y ago

You can but as I said the ceiling on what it can correct seems limited, particularly in the described situations. GPT 4 doesn't seem to have really broken that barrier much more than GPT 3.5 in my use so far. I posted about some examples of this experience over here https://news.ycombinator.com/item?id=35158149

j / k navigate · click thread line to collapse

0 comments

nextworddev3y ago

Hmm, can the average developer get even 1 out of 45 right, without practice? (zero shot)

mtc0101703y ago

Idk about that. The jump from 0 to 1 may be a whole lot harder than 1 to 45.

bitshiftfaced3y ago

It would be interesting to know how this compares with human 0-shot, single attempt coding tasks.

zamadatix3y ago

nextworddev3y ago

You can have GPT4 inspect its own errors and make corrections- I'm sure self-reflection works better this time than GPT3.5

zamadatix3y ago

j / k navigate · click thread line to collapse