Ask HN: Is GPT-5 a regression, or is it just me?
Based on a month of GPT-5 usage, this model feels primarily like a regression:
1. It's slow: thinking mode can take ages, and sometimes gets downright stuck. Its auto-assessment of whether it needs to think feels poorly tuned for most tasks and defaults too easily to deep reasoning mode.
2. Hallucinations are in overdrive: I would assess that in 7/10 tasks, hallucinations clutter the responses and warrant corrections, careful monitoring, and steering back. It hallucinates list items that weren't in your prompt, software package functionalities/capabilities, CLI parameters, etc. Even thorough prompting with explicit links to sources, e.g. within deep research, frequently goes off the rails.
3. Not self-critical: even in thinking mode, it frequently spews out incorrect answers that a blunt "this is not correct, check your answer" can immediately fix.
Note: I am not a super advanced prompt engineer, and the assessment above is mainly relative to the previous generation of models. I would expect that as model capabilities progress, the need for users to apply careful prompt engineering goes down, not up.
I am very curious to hear your experiences.