The models were mostly GPT-5 and Claude Sonnet 4. The study was too early to catch the 5.x Codex or Claude 4.5 models (bar one mention of Sonnet 4.5).
This is notable because a lot of academic papers take 6-12 months to come out, by which time the LLM space has often moved on by an entire model generation.
This is a recurring argument which I don't understand. Doesn't it simply mean that whatever conclusion they drew was valid then? The research process is about approximating a better description of a phenomenon in order to understand it, not about providing a definitive answer. Being "an entire model generation" behind would matter if a fundamental problem had been solved, e.g. hallucinations eliminated, but if the changes are incremental then the conclusions most likely remain correct. Which fundamental change (I don't think labeling newer models as "better" is sufficient) do you believe invalidates their conclusions in this specific context?
Just the jump from Sonnet 3.5 to 3.7 to 4.5, and to Opus 4.5, has been pretty massive in terms of holistic reasoning, deep knowledge, and better procedural and architectural adherence.
GPT-5 Pro convinced me to pay $200/mo for an OpenAI subscription. The regular 5.2 models, and 5.2 Codex, are leagues better than GPT-4 at solving problems procedurally, using tools, and discussing scientific, mathematical, philosophical, and engineering problems in depth.
Models have increasingly long context, especially some Google models. OpenAI has released very good image models, and great editing-focused image models have been released in general. Predictably better multimodal inference is unlocking many cool near-term possibilities.
Additionally, we have seen some incredible open-source and open-weight models released this year, some fully commercially viable without restriction. And more and more smaller TTS/STT projects are in active development, with a few notable releases this year.
Honestly, the landscape at the end of the year is impressive. There has been great work all over the place, almost too much to keep up with. I'm very interested in the Genie models and a few others.
To give an idea:
At the beginning of the year, I was mildly successful at getting coding models to make changes in some of my codebases, but the more esoteric problems were out of reach. Progress in general was deliberate and required a lot of manual intervention.
By comparison, in the last week I've prototyped six applications at a level that would have taken me days to weeks each on my own, often developing several at the same time, monitoring agentic workflows and intervening only when necessary. I rely on long preproduction phases with architectural discussions and the development of documentation, requirements, SDDs... and on detailed code review and refactoring processes to ensure adherence to constraints. I'm morphing from a very busy solo developer into a very busy product manager.
A paper comes out that says "we did a study of developers and found that AI assistance had no impact on their productivity (using the state-of-the-art models available in September 2024)," and a lot of people will point to that as incontestable evidence that "AI doesn't work".
Certainly some scientists are just absurdly efficient, and maybe so are all 28 involved teams, but that's still a lot.
Personally speaking, this gives me second thoughts about their dedication to truly accurately measuring something as notoriously tricky as corporate SWE performance. Any number of cut corners in a novel & empirical study like this would be hard to notice from the final product, especially for casual readers…TBH, the clickbait title doesn’t help either!
I don’t have a specific critique on why 4 months is definitely too short to do it right tho. Just vibe-reviewing, I guess ;)
It takes about 6 months to figure out how to get LaTeX to position figures where you want them, and then another 6 months to fight with reviewers
Results are getting worse and less accurate; hell, I even had Claude drop some Chinese into a response out of the blue one day.
Off your intuition, do you think the same study with Codex 5.2 and Opus 4.5 would see even better results?
I've seen people unable to work at average speed on small features suddenly reach above-average output through an LLM CLI, and I could sense the pride in them. Which is at odds with my experience of work... I love to dig down, know a lot, model and find abstractions on my own. There, an LLM will 1) not understand how my brain works and 2) produce something workable but that requires me to stretch mentally... and most of the time I leave numb. In the last month I've seen many people expressing similar views.
ps: thanks everybody for the answers, interesting to read your pov
And if you let the AI too loose, as when you try to vibe-code an entirely new program, you end up in the situation where in one day you have a good prototype, and then you can easily spend five times as long sorting out the many issues and refactoring so it can scale to the next features.
Long iteration cycles are taxing
But it does feel less fulfilling I suppose.
Strongly suspect this is simply less efficient than doing it yourself if you have enough expertise.
> Number of Survey Respondents
> Building apps 53
> Testing 1
I think this sums up everybody's complaints about AI-generated code: don't ask me to be the one to review work you didn't even check.
Most of the work brought to me gets done before I even think about sitting down to type.
And it's interesting to see the divide here between "pure coder" and "coder + more". A lot of people seem to be in the job just to do what the PM, designer, and business people ask. A lot of the work is pushing back against some of those requests. In conversations here on HN about "essential complexity" I even see commenters arguing that the spec brought to you is entirely essential. It's not.
We're in the midst of another abstraction level becoming the working layer - and that's not a small layer jump but a jump to a completely different plane. And I think once again we'll benefit from getting tools that help us specify the high-level concepts we intend, and ways to enforce that the generated code is correct - not necessarily fast or efficient, but at least correct - same as compilers do. And this lift is happening on a much more accelerated timeline.
The problem of ensuring correctness of the generated code across all the layers we're now skipping is going to be the crux of how we manage to leverage LLM/agentic coding.
Maybe Cursor is Turbo Pascal.
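To make "enforce that the generated code is correct" concrete, here's a minimal sketch of one such gate, assuming pytest plus the hypothesis library; generated_sort and reference are illustrative names, not anything from the study or the thread. The idea is to pin generated code to a trusted spec with property-based tests:

```python
# Minimal sketch: enforcing behavioral correctness of generated code by
# checking it against a trusted reference implementation.
# (generated_sort and reference are illustrative names.)
from hypothesis import given, strategies as st


def reference(xs):
    # Trusted spec: Python's built-in sort.
    return sorted(xs)


def generated_sort(xs):
    # Stand-in for LLM-generated code under review (here, insertion sort).
    out = list(xs)
    for i in range(1, len(out)):
        j = i
        while j > 0 and out[j - 1] > out[j]:
            out[j - 1], out[j] = out[j], out[j - 1]
            j -= 1
    return out


@given(st.lists(st.integers()))
def test_generated_matches_spec(xs):
    # The "compiler-like" gate: the generated code must agree with the
    # spec on every input the property test can find.
    assert generated_sort(xs) == reference(xs)
```

The gate, not the sort, is the point: the generated implementation is disposable, while the spec-level check stays fixed, which is roughly the compiler-like guarantee described above.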
Should we be trying to put the genie back in the bottle? If not, what exactly are you suggesting?
Even if we all agreed to stop using AI tools today, what about the rest of the world? Will everybody agree to stop using it? Do you think that's even a remote possibility?
Software Devs not so much.
There is a huge difference between the two and they are not interchangeable.
Your take is this meme https://knowyourmeme.com/memes/dig-the-fucking-hole.
I. Don't. Care.
I don't even care about those debates outside. Debates about whether LLMs work and will replace programmers? Say they do. OK, so what?
I simply have too much fun programming. I am just a mere full-stack business-line programmer, a generic, random, replaceable dude; you can find us a dime a dozen.
I do use LLMs as a Stack Overflow/docs replacement, but I always write all my code by hand.
If you want to replace me, replace me. I'll go to companies that need me. If there are no companies that need my skill, fine, then I'll just do this as a hobby, and probably flip burgers outside to make a living.
I don't care about your LLM, I don't care about your agent, and I probably don't even care about the job prospects if it means being forced to use tools I don't like and workflows I don't like. You can go ahead and find others who are willing to do it for you.
As for me, I simply have too much fun programming. Now if you'll excuse me, I need to go have fun.
(1) already have enough money to survive without working, or
(2) don't realize how hard of a life it would be to "flip burgers" to make a living in 2026.
We live very good lives as software developers. Don't be a fool and think you could just "flip burgers" and be fine.
I also did dry cleaning, cleaning service, deli, delivery guy, etc.
Yup, I now have enough money to survive without working.
But I am also very low-maintenance, thanks to being raised in harsh conditions early in life.
I am not scared to go back flipping burgers again.
People need to make money to survive, now more than ever. It seems incredibly selfish to wish for that to disappear just so you can "purify" the profession.
I'd at least be more likely to get a boost in impact and ability to affect decision making, maybe.
or something like that
Like I said, I am just a generic replaceable dime a dozen programmer dude.
As with every new tech there's a hell of a lot of noise (plugins, skills, hooks, MCP, LSP - to quote Karpathy), but most of it can just be disregarded. No one is "behind" - it's all very easy to use.
It's like saying all you need is Notepad to develop. It's not wrong, but... you know.
"I’m on disability, but agents let me code again and be more productive than ever (in a 25+ year career). - S22"
Once the Social Security Administration learns this, there goes the disability benefit...
I'm in the back-and-forth camp. I expect a lot of interesting UX to develop here. I built https://github.com/backnotprop/plannotator over the weekend to give me a better way to review and collaborate around plans, all while natively integrated into the coding-agent harness.
Do it in the way that makes you feel happy, or conforms to organizational standards.
Well
There are many contexts in which programming a computer well is not important.
I've seen this with code generation tools - developers who treat AI suggestions as magic often struggle when the output doesn't work or introduces subtle bugs. The professionals who succeed are those who understand what the AI is doing, validate the output rigorously, and maintain clear mental models of their system.
This becomes especially important for code quality and technical debt. If you're just accepting AI-generated code without understanding architectural implications, you're building a maintenance nightmare. Control means being able to reason about tradeoffs, not just getting something that "works" in the moment.
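As one concrete form of "validate the output rigorously": a minimal sketch of a gate you could run on any AI-generated change before accepting it. mypy and pytest are assumptions about the project's tooling here; the script is illustrative, not a prescribed workflow.

```python
# Sketch of a "validate before you trust" gate for AI-generated changes:
# run the type checker and the test suite, and refuse the change if
# either fails. (mypy/pytest are assumed to be the project's tools.)
import subprocess
import sys


def gate(paths):
    checks = [
        ["mypy", *paths],   # static check: types still line up
        ["pytest", "-q"],   # behavioral check: tests still pass
    ]
    for cmd in checks:
        result = subprocess.run(cmd)
        if result.returncode != 0:
            print(f"rejected: {' '.join(cmd)} failed")
            return False
    print("accepted: all checks passed")
    return True


if __name__ == "__main__":
    sys.exit(0 if gate(sys.argv[1:]) else 1)
```

Anything that fails the gate goes back for rework; nothing lands on the strength of the model's say-so alone.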
Out of curiosity, if I wanted to set up cscope for a bunch of small projects, say dozens of prototypes each in their own directory, would it be useful? Too broad?
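One hedged way to keep it useful rather than too broad: build a separate cscope database per prototype instead of one giant index. A minimal sketch, assuming a layout like prototypes/<name>/ with C sources; the paths and extensions are illustrative.

```python
# Hedged sketch: build one cscope database per prototype directory, so
# lookups stay scoped instead of spanning one giant index.
# (ROOT and EXTS are assumptions; adjust for your tree.)
import os
import subprocess

ROOT = "prototypes"   # parent dir holding the small projects
EXTS = (".c", ".h")   # extend for C++/other sources

for project in sorted(os.listdir(ROOT)):
    pdir = os.path.join(ROOT, project)
    if not os.path.isdir(pdir):
        continue
    files = [
        os.path.join(dirpath, f)
        for dirpath, _, names in os.walk(pdir)
        for f in names
        if f.endswith(EXTS)
    ]
    if not files:
        continue
    listing = os.path.join(pdir, "cscope.files")
    with open(listing, "w") as fh:
        fh.write("\n".join(files) + "\n")
    # -b: build only, -q: faster inverted index, -k: skip /usr/include
    subprocess.run(
        ["cscope", "-b", "-q", "-k",
         "-i", listing,
         "-f", os.path.join(pdir, "cscope.out")],
        check=True,
    )
```

Per-project databases keep queries scoped to the prototype you're working in; if you later want cross-project lookups, the same walk can emit a single combined cscope.files at the top level instead.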
So essentially what this means is a declarative programming system for overall system behavior.
Not a statistically significant sample size.
> This is a qualitative methods paper, so statistical significance is not relevant.
I have never heard of a "qualitative methods paper" and it sounds like something a researcher would do to push a narrative with "qualitative data" rather than data that could be measured.
Tell me why I am wrong.