undefined | Better HN

0 pointsnobodywillobsrv1y ago0 comments

Can't we all just go test the responses with old chats?

0 comments

1 comments · 1 top-level

afpx1y ago

I've tested old chats with the latest 4 and 4o models, and what had been zero-shot now sometimes can't even be done (or at least not without carefully guiding it to the answer).

My old chats say they have been migrated to 4o. But, I swear (can't confirm) that they perform better than a new 4o session. I haven't had time yet, but I wanted to side-by-side compare the responses from those old chats with the current 4o model.

j / k navigate · click thread line to collapse