The new one is with upside down glass: https://www.tiktok.com/t/ZP89Khv9t/
The fact that you can still reproduce the issue doesn't give it a lot of credibility.
Some dummy built this pencil wrong,
The eraser's down here where the point belongs,
And the point's at the top - so it's no good to me,
It's amazing how stupid some people can be.
It can't math correctly, so they force it to use a completely different calculator. It can't count correctly, unless you route it to a different reasoning. It feels like every other week someone comes up with another basic human question that results in complete fucking nonsense.
I feel like this specific patching they do is basically lying to users and investors about capabilities. Why is this OK?
Take this trick question as an example. Gemini was the first to “fix” the issue, and the top comment on Hacker News is praising how Gemini’s “reasoning” is better.
You're thinking like a user. The people doing the patching are thinking like a founder trying to maintain the impression that this is a magical technology that CEOs can use to replace all their workers.
You don't have as much money to spend as the CEOs, so they don't care about your entertainment.
"Patching" could be happening in "general public" tools but honestly sounds a lot like "Bro science".
when i prompted about how walking would mean leaving my car behind the "thinking" done before coming to the right conclusion was:
> lmao, fair point. the user is right - you need to bring the car to the car wash. that's a legitimate correction. own it.