Human coders are still better than LLMs (opens in new tab)

(antirez.com)

655 pointslongwave1y ago735 comments

735 comments

202 comments · 100 top-level

mattnewton1y ago· 21 in thread

This matches my experience. I actually think a fair amount of value from LLM assistants to me is having a reasonably intelligent rubber duck to talk to. Now the duck can occasionally disagree and sometimes even refine.

https://en.m.wikipedia.org/wiki/Rubber_duck_debugging

I think the big question everyone wants to skip right to and past this conversation is, will this continue to be true 2 years from now? I don’t know how to answer that question.

Buttons8401y ago

LLMs aren't my rubber duck, they're my wrong answer.

You know that saying that the best way to get an answer online is to post a wrong answer? That's what LLMs do for me.

I ask the LLM to do something simple but tedious, and then it does it spectacularly wrong, then I get pissed off enough that I have the rage-induced energy to do it myself.

8 more replies

marcosdumay1y ago

It's a damning assertive duck, completely out of proportion to its competence.

I've seen enough people led astray by talking to it.

11 more replies

schwartzworld1y ago

For me it's like having a junior developer work under me who knows APIs inside and out, but has no common sense about architecture. I like that I delegate tasks to them so that my brain can be free for other problems, but it makes my job much more review heavy than before. I put every PR through 3-4 review cycles before even asking my team for a review.

2 more replies

_tom_1y ago

For me, it's a bit like pair programming. I have someone to discuss ideas with. Someone to review my code and suggest alternative approaches. Some one that uses different feature than I do, so I learn from them.

2 more replies

p1necone1y ago

> the duck can occasionally disagree

This has not been my experience. LLMs have definitely been helpful, but generally they either give you the right answer or invent something plausible sounding but incorrect.

If I tell it what I'm doing I always get breathless praise, never "that doesn't sound right, try this instead."

2 more replies

marcosdumay1y ago

LLMs will still be this way 10 years from now.

But IDK if somebody won't create something new that gets better. But there is no reason at all to extrapolate our current AIs into something that solves programing. Whatever constraints that new thing will have will be completely unrelated to the current ones.

1 more reply

Bukhmanizer1y ago

There are a couple people I work with who clearly don’t have a good understanding of software engineering. They aren’t bad to work with and are in fact great at collaborating and documenting their work, but don’t seem to have the ability to really trace through code and logically understand how it works.

Before LLMs it was mostly fine because they just didn’t do that kind of work. But now it’s like a very subtle chaos monkey has been unleashed. I’ve asked on some PRs “why is this like this? What is it doing?” And the answer is “ I don’t know, ChatGPT told me I should do it.”

The issue is that it throws basically all their code under suspicion. Some of it works, some of it doesn’t make sense, and some of it is actively harmful. But because the LLMs are so good at giving plausible output I can’t just glance at the code and see that it’s nonsense.

And this would be fine if we were working on like a crud app where you can tell what is working and broken immediately, but we are working on scientific software. You can completely mess up the results of a study and not know it if you don’t understand the code.

3 more replies

johnnyanmac1y ago

>I think the big question everyone wants to skip right to and past this conversation is, will this continue to be true 2 years from now?

For me, it's less "conversation to be skipped" and more about "can we even get to 2 years from now"? There's so much insability right now that it's hard to say what anything will look like in 6 months. "

gerad1y ago

It's like chess. Humans are better for now, they won't be forever, but humans plus software is going to better than either alone for a long time.

7 more replies

akshay_trikha1y ago

I've had this same thought that it would be nice to have an AI rubber ducky to bounce ideas off of while pair programming (so that you don't sound dumb to your coworkers & waste their time).

This is my first comment so I'm not sure how to do this but I made a BYO-API key VSCode extension that uses the OpenAI realtime API so you can have interactive voice conversations with a rubber ducky. I've been meaning to create a Show HN post about it but your comment got me excited!

In the future I want to build features to help people communicate their bugs / what strategies they've tried to fix them. If I can pull it off it would be cool if the AI ducky had a cursor that it could point and navigate to stuff as well.

Please let me know if you find it useful https://akshaytrikha.github.io/deep-learning/2025/05/23/duck...

3 more replies

empath751y ago

Just the exercise of putting my question in a way that the LLM could even theoretically provide a useful response is enough for me to figure out how to solve the problem a good percentage of the time.

bandoti1y ago

My take is that AI is very one-dimensional (within its many dimensions). For instance, I might close my eyes and imagine an image of a tree structure, or a hash table, or a list-of-trees, or whatever else; then I might imagine grabbing and moving the pieces around, expanding or compressing them like a magician; my brain connects sight and sound, or texture, to an algorithm. However people think about problems is grounded in how we perceive the world in its infinite complexity.

Another example: saying out loud the colors red, blue, yellow, purple, orange, green—each color creates a feeling that goes beyond its physical properties into the emotions and experiences. AI image-generation might know the binary arrangement of an RGBA image but actually, it has NO IDEA what it is to experience colour. No idea how to use the experience of colour to teach a peer of an algorithm. It regurgitates a binary representation.

At some point we’ll get there though—no doubt. It would be foolish to say never! For those who want to get there before everyone else probably should focus on the organoids—because most powerful things come from some Faustian monstrosity.

1 more reply

Waterluvian1y ago

Same. Just today I used it to explore how a REST api should behave in a specific edge case. It gave lots of confident opinions on options. These were full of contradictions and references to earlier paragraphs that didn’t exist (like an option 3 that never manifested). But just by reading it, I rubber ducked the solution, which wasn’t any of what it was suggesting.

joshdavham1y ago

> I actually think a fair amount of value from LLM assistants to me is having a reasonably intelligent rubber duck to talk to.

I wonder if the term "rubber duck debugging" will still be used much longer into the future.

1 more reply

mock-possum1y ago

Yeah in my experience as long as you don’t stray too far off the beaten path, LLMs are great at just parroting conventional wisdom for how to implement things - but the second you get to something more complicated - or especially tricky bug fixing that requires expensive debuggery - forget about it, they do more harm than good. Breaking down complex tasks into bite sized pieces you can reasonably expect the robot to perform is part of the art of the LLM.

ortusdux1y ago

> I think the big question everyone wants to skip right to and past this conversation is, will this continue to be true 2 years from now? I don’t know how to answer that question.

I still think about Tom Scott's 'where are we on the AI curve' video from a few years back. https://www.youtube.com/watch?v=jPhJbKBuNnA

bossyTeacher1y ago

I think of them as highly sycophant LSD-minded 2nd year student who has done some programming

hoppp1y ago

Same. I do rubber duck debugging too and found the LLM to compliment it nicely.

Looking forward for rubber duck shaped hardware AI interfaces to talk to in the future. Im sure somebody will create it

koonsolo1y ago

It seems to me we're at the flat side of the curve again. I haven't seen much real progress in the last year.

It's ignorant to think machines will not catch up to our intelligence at some point, but for now, it's clearly not.

I think there needs to be some kind of revolutionary breakthrough again to reach the next stage.

If I were to guess, it needs to be in the learning/back propagation stage. LLM's are very rigid, and once they go wrong, you can't really get them out of it. A junior develop for example could gain a new insight. LLM's, not so much.

cortesoft1y ago

Currently, I find AI to be a really good autocomplete

3 more replies

travisgriggs1y ago

LLMs are a passel of eager to please know it all interns that you can command at will without any moral compunctions.

They drive you nuts trying to communicate with them what you actually want them to do. They have a vast array of facts at immediate recall. They’ll err in their need to produce and please. They do the dumbest things sometimes. And surprise you at other times. You’ll throw vast amounts of their work away or have to fix it. They’re (relatively) cheap. So as an army of monkeys, if you keep herding them, you can get some code that actually tells a story. Mostly.

UncleOxidant1y ago· 8 in thread

There's some whistling past the graveyard in these comments. "You still need humans for the social element...", "LLMs are bad at debugging", "LLMs lead you astray". And yeah, there's lots of truth in those assertions, but since I started playing with LLMs to generate code a couple of years ago they've made huge strides. I suspect that over the next couple of years the improvements won't be quite as large (Pareto Principle), but I do expect we'll still see some improvement.

Was on r/fpga recently and mentioned that I had had a lot of success recently in getting LLMs to code up first-cut testbenches that allow you to simulate your FPGA/HDL design a lot quicker than if you were to write those testbenches yourself and my comment was met with lots of derision. But they hadn't even given it a try to form their conclusion that it just couldn't work.

xhevahir1y ago

This attitude is depressingly common in lots of professional, white-collar industries I'm afraid. I just came from the /r/law subreddit and was amazed at the kneejerk dismissal there of Dario Amodei's recent comments about legal work, and of those commenters who took them seriously. It's probably as much a coping mechanism as it is complacency, but, either way, it bodes very poorly for our future efforts at mitigating whatever economic and social upheaval is coming.

7 more replies

layer81y ago

Programmers derided programming languages (too inefficient, too inflexible, too dumbing-down) when assembly was still the default. That phenomenon is at the same time entirely to be expected but also says little about the actual qualities of the new technology.

1 more reply

ch4s31y ago

It seems like LLMs made really big strides for a while but don't seem to be getting better recently, and in some ways recent models feel a bit worse. I'm seeing some good results generating test code, and some really bad results when people go to far with LLM use on new feature work. Base on what I've seen it seems like spinning up new projects and very basic features for web apps works really well, but that doesn't seem to generalize to refactoring or adding new features to big/old code bases.

I've seen Claude and ChatGPT happily hallucinate whole APIs for D3 on multiple occasions, which should be really well represented in the training sets.

3 more replies

parliament321y ago

I'd like to agree with you and remain optimistic, but so much tech has promised the moon and stagnated into oblivion that I just don't have any optimism left to give. I don't know if you're old enough, but remember when speech-to-text was the next big thing? DragonSpeak was released in 1997, everyone was losing their minds about dictating letters/documents in MS Word, and we were promised that THIS would be the key interface for computing evermore. And.. 27 years later, talking to the latest Siri, it makes just as many mistakes as it did back then. In messenger applications people are sending literal voice notes -- audio clips -- back and forth because dictation is so unreliable. And audio clips are possibly the worst interface for communication ever (no searching, etc).

Remember how blockchain was going to change the world? Web3? IoT? Etc etc.

I've been through enough of these cycles to understand that, while the AI gimmick is cool and all, we're probably at the local maximum. The reliability won't improve much from here (hallucinations etc), while the costs to run it will stay high. The final tombstone will be when the AI companies stop running at a loss and actually charge for the massive costs associated with running these models.

2 more replies

cushychicken1y ago

ChatGPT-4o is scary good at writing VHDL.

Using it to prototype some low level controllers today, as a matter of fact!

3 more replies

bgwalter1y ago

Yet you are working on your own replacement, while your colleagues are taking the prudent approach.

5 more replies

energy1231y ago

Their confusion is your competitive advantage in the labor market.

retetr1y ago

Unrelated, but is this a case of the Pareto Principle? (Admittedly the first time I'm hearing of it) Wherein 80% of the effect is caused by 20% of the input. Or is this more a case of diminishing returns? Where the initial results were incredible, but each succeeding iteration seems to be more disappointing?

1 more reply

yua_mikami1y ago· 8 in thread

The thing everyone forgets when talking about LLMs replacing coders is that there is much more to software engineering than writing code, in fact that's probably one of the smaller aspects of the job.

One major aspect of software engineering is social, requirements analysis and figuring out what the customer actually wants, they often don't know.

If a human engineer struggles to figure out what a customer wants and a customer struggles to specify it, how can an LLM be expected to?

malfist1y ago

That was also one of the challenges during the offshoring craze in the 00s. The offshore teams did not have the power, or knowledge to push back on things and just built and built and built. Sounds very similar to AI right?

Probably going to have the same outcome.

2 more replies

devjab1y ago

LLM's do no software engineering at all, and that can be fine. Because you don't actually need software engineering to create successful programs. Some applications will not even need software engineering for their entire life cycles because nobody is really paying attention to efficiency in the ocean of poor cloud management anyway.

I actually imagine it's the opposite of what you say here. I think technically inclined "IT business partners" will be able of creating applications entirely without software engineers... Because I see that happen every day in the world of green energy. The issues come later, when things have to be maintained, scale or become efficient. This is where the software engineering comes in, because it actually matters if you used a list or a generator in your Python app when it iterates over millions of items and not just a few hundreds.

1 more reply

victorbjorklund1y ago

Yea, this is why I dont buy the "all developers will disappear". Will I write a lot less code in 5 years (maybe almost none)? Sure, I already type a lot less now than a year ago. But that is just a small part of the process.

2 more replies

rowanG0771y ago

I think LLMs are better at requirement elicitation than they are at actually writing code.

ilaksh1y ago

It actually comes down to feedback loops which means iterating on software being used or attempting to be used by the customer.

Chat UIs are an excellent customer feedback loop. Agents develop new iterations very quickly.

LLMs can absolutely handle abstractions and different kinds of component systems and overall architecture design.

They can also handle requirements analysis. But it comes back to iteration for the bottom line which means fast turnaround time for changes.

The robustness and IQ of the models continue to be improved. All of software engineering is well underway of being automated.

Probably five years max where un-augmented humans are still generally relevant for most work. You are going to need deep integration of AI into your own cognition somehow in order to avoid just being a bottleneck.

bbarn1y ago

The thing is, it is replacing _coders_ in a way. There are millions of people who do (or did) the work that LLMs excel at. Coders who are given a ticket that says "Write this API taking this input and giving this output" who are so far down the chain they don't even get involved in things like requirements analysis, or even interact with customers.

Software engineering, is a different thing, and I agree you're right (for now at least) about that, but don't underestimate the sheer amount of brainless coders out there.

1 more reply

mettamage1y ago

> One major aspect of software engineering is social, requirements analysis and figuring out what the customer actually wants, they often don't know.

It really depends on the organization. In many places product owners and product managers do this nowadays.

ori_b1y ago

> If a human engineer struggles to figure out what a customer wants and a customer struggles to specify it, how can an LLM be expected to?

Presumably, they're trained on a ton of requirements docs, as well as a huge number of customer support conversations. I'd expect them to do this at least as well as coding, and probably better.

am17an1y ago· 6 in thread

All the world's smartest minds are racing towards replacing themselves. As programmers, we should take note and see where the wind is blowing. At least don't discard the possibility and rather be prepared for the future. Not to sound like a tin-foil hat but odds of achieving something like this increase by the day.

In the long term (post AGI), the only safe white-collar jobs would be those built on data which is not public i.e. extremely proprietary (e.g. Defense, Finance) and even those will rely heavily on customized AIs.

bitpush1y ago

> All the world's smartest minds are racing towards replacing themselves

Isnt every little script, every little automation us programmers do in the same spirit? "I dont like doing this, so I'm going to automate it, so that I can focus on other work".

Sure, we're racing towards replacing ourselves, but there would be (and will be) other more interesting work for us to do when we're free to do that. Perhaps, all of us will finally have time to learn surfing, or garden, or something. Some might still write code themselves by hand, just like how some folks like making bread .. but making bread by hand is not how you feed a civilization - even if hundreds of bakers were put out of business.

1 more reply

wijwp1y ago

> Not to sound like a tin-foil hat but odds of achieving something like this increase by the day.

Where do you get this? The limitations of LLMs are becoming more clear by the day. Improvements are slowing down. Major improvements come from integrations, not major model improvements.

AGI likely can't be achieved with LLMs. That wasn't as clear a couple years ago.

1 more reply

AstroBen1y ago

Ultimately this needs to be solved politically

Making our work more efficient, or humans redundant should be really exciting. It's not set in stone that we need to leave people middle aged with families and now completely unable to earn enough to provide a good life

Hopefully if it happens, it happens to such a huge amount of people that it forces a change

1 more reply

bgwalter1y ago

The Nobel prize is said to have been created partly out of guilt over having invented dynamite, which was obviously used in a destructive manner.

Now we have Geoffrey Hinton getting the prize for contributing to one of the most destructive inventions ever.

1 more reply

cheema331y ago

> All the world's smartest minds are racing towards replacing themselves.

I think they are hoping that their future is safe. And it is the average minds that will have to go first. There may be some truth to it.

Also, many of these smartest minds are motivated by money, to safeguard their future, from a certain doom that they know might be coming. And AI is a good place to be if you want to accumulate wealth fast.

BirAdam1y ago

Nah. As more people are rendered unemployed the buying market and therefore aggregate demand will fall. Fewer sales hurts the bottom line. At some point, revenues across the entire economy fall, and companies cannot afford the massive datacenters and nuclear power plants fueling them. The hardware gets sold cheap, the companies go under, and people get hired again. Eventually, some kind of equilibrium will be found or the world engages in the Butlerian Jihad.

wanderingstan1y ago· 5 in thread

“Better” is always task-dependent. LLMs are already far better than me (and most devs I’d imagine) at rote things like getting CSS syntax right for a desired effect, or remembering the right way to invoke a popular library (e.g. fetch)

These little side quests used to eat a lot of my time and I’m happy to have a tool that can do these almost instantly.

jaccola1y ago

I've found LLMs particularly bad for anything beyond basic styling since the effects can be quite hard to describe and/or don't have a universal description.

Also, there are often times multiple ways to achieve a certain style and they all work fine until you want a particular tweak, in which case only one will work and the LLM usually gets stuck in one of the ones that does not work.

1 more reply

gherkinnn1y ago

I have found it to be good at things I am not very strong at (SQL) but terrible at the things I know well (CSS).

Telling, isn't it?

3 more replies

kccqzy1y ago

> and most devs I’d imagine

What an awful imagination. Yes there are people who don't like CSS but are forced to use it by their job so they don't learn it properly, and that's why they think CSS is rote memorization.

But overall I agree with you that if a company is too cheap to hire a person who is actually skilled at CSS, it is still better to hoist that CSS job onto LLMs than an unwilling human. Because that unwilling human is not going to learn CSS well and won't enjoy writing CSS.

On the other hand, if the company is willing to hire someone who's actually good, LLMs can't compare. It's basically the old argument of LLMs only being able to replace less good developers. In this case, you admitted that you are not good at CSS and LLMs are better than you at CSS. It's not task-dependent it's skill-dependent.

2 more replies

zdragnar1y ago

I think that's great if it's for something outside of your primary language. I've used it to good effect in that way myself. However, denying yourself the reflexive memory of having learned those things is a quick way to become wholly dependent upon the tool. You could easily end up with compromised solutions because the tool recommends something you don't understand well enough to know there's a better way to do something.

2 more replies

sanderjd1y ago

Yeah, this is what I really like about AI tools though. They're way better than me at annoying minutia like getting CSS syntax right. I used to dread that kind of thing!

1 more reply

pupppet1y ago· 5 in thread

If an LLM just finds patterns, is it even possible for an LLM to be GOOD at anything? Doesn't that mean at best it will be average?

bitpush1y ago

Humans are also almost always operating on patterns. This is why "experience" matters a lot.

Very few people are doing truly cutting edge stuff - we call them visionaries. But most of the time, we're just merely doing what's expected

And yes, that includes this comment. This wasnt creative or an original thought at all. I'm sure hundreds of people have had similar thought, and I'm probably parroting someone else's idea here. So if I can do it, why cant LLM?

1 more reply

riknos3141y ago

My experience is that LLMs regress to the average of the context they have for the task at hand.

If you're getting average results you most likely haven't given it enough details about what you're looking for.

The same largely applies to hallucinations. In my experience LLMs hallucinate significantly more when at or pushed to exceed the limits of their context.

So if you're looking to get a specific output, your success rate is largely determined by how specific and comprehensive the context the LLM has access to is.

jaccola1y ago

Most people (average and below average) can tell when something is above average, even if they cannot create above average work, so using RLHF it should be quite possible to achieve above average.

Indeed it is likely already the case that in training the top links scraped or most popular videos are weighted higher, these are likely to be better than average.

lukan1y ago

There are bad patterns and good patterns. But whether a pattern is the right one for a specific task is something different.

And what really matters is, if the task gets reliable solved.

So if they actually could manage this on average with average quality .. that would be a next level gamechanger.

JackSlateur1y ago

Yes, IA is basically a random machine aiming for average outcome

IA is neat for average people, to produce average code, for average compagnies

In a competitive world, using IA is a death sentence;

loudmax1y ago· 4 in thread

Companies that leverage LLMs and AIs to let their employees be more productive will thrive.

Companies that try to replace their employees with LLMs and AIs will fail.

Unfortunately, all that's in the long run. In the near term, some CEOs and management teams will profit from the short term valuations as they squander their companies' future growth on short-sighted staff cuts.

bdbenton52551y ago

That's really it. These tools are useful as assistants to programmers but do not replace an actual programmer. The right course is to embrace the technology moderately rather than reject it completely or bet on it replacing workers.

joshdavham1y ago

> In the near term, some CEOs and management teams will profit from the short term valuations

That's actually really interesting to think about. The idea that doing something counter-productive like trying to replace employees with AI (which will cause problems), may actually benefit the company in terms of valuations in the short run. So in effect, they're hurting and helping the company at the same time.

2 more replies

janalsncm1y ago

Very well said. Using code assistance is going to be table stakes moving forward, not something that can replace people. It’s not like competitors can’t also purchase AI subscriptions.

1 more reply

BirAdam1y ago

By the time AI hype dies down and hurts the bottom line, AI systems might be good enough to do the jobs.

“The market can remain irrational longer than you can remain solvent.” — Warren Buffett

vouaobrasil1y ago· 4 in thread

The question is, for how long?

sixQuarks1y ago

Exactly! We’ve been seeing more and more posts like this, saying how AI will never take developer jobs or will never be as good as coders. I think it’s some sort of coping mechanism.

These posts are gonna look really silly in the not too distant future.

I get it, spending countless hours honing your craft and knowing that AI will soon make almost everything you learned useless is very scary.

2 more replies

spion1y ago

Vibe-wise, it seems like progress is slowing down and recent models aren't substantially better than their predecessors. But it would be interesting to take a well-trusted benchmark and plot max_performance_until_date(foreach month). (Too bad aider changed recently and there aren't many older models; https://aider.chat/docs/leaderboards/by-release-date.html has not been updated in a while with newer stuff, and the new benchmark doesn't have the classic models such as 3.5, 3.5 turbo, 4, claude 3 opus)

1 more reply

jppittma1y ago

It's really gonna depend on the project. When my hobby project was greenfield, the AI was way better than I am. It was (still is) more knowledgable about the standards that govern the field and about low level interface details. It can shit out a bunch of code that relies on knowing these details in seconds/minutes, rather than hours/days.

Now that the project has grown and all that stuff is hammered out, it can't seem to consistently write code that compiles. It's very tunnel visioned on the specific file its generating, rather than where that fits in the context of what/how we're building what we're building.

2 more replies

kilroy1231y ago

My crackpot guess is ~5 years. The incentives are just too damn high to not keep innovating in the space.

We'll find new ways to push the tech.

acquisitionsilk1y ago· 3 in thread

It is quite heartening to see so many people care about "good code". I fear it will make no difference.

The problem is that the software world got eaten up by the business world many years ago. I'm not sure at what point exactly, or if the writing was already on the wall when Bill Gates' wrote his open letter to hobbyists in 1976.

The question is whether shareholders and managers will accept less good code. I don't see how it would be logical to expect anything else, as long as profit lines go up why would they care.

Short of some sort of cultural pushback from developers or users, we're cooked, as the youth say.

JackSlateur1y ago

Code is meant to power your business

Bad code leads to bad business

This makes me think of hosting departement; You know, which people who are using vmware, physical firewalls, dpi proxies and whatnot;

On the other edge, you have public cloud providers, which are using qemu, netfilter, dumb networking devices and stuff

Who got eaten by whom, nobody could have guessed ..

1 more reply

BirAdam1y ago

This is fun to think about. I used to think that all software was largely garbage, and at one point, I think this _was_ true. Sometime over the last 20 years, I believe this ceased to be the case. Most software these days actually works. Importantly, most software is actually stable enough that I can make it half an hour without panic saving.

Could most software be more awesome? Yes. Objectively, yes. Is most software garbage? Perhaps by raw volume of software titles, but are most popular applications I’ve actually used garbage? Nope. Do I loathe the whole subscription thing? Yes. Absolutely. Yet, I also get it. People expect software to get updated, and updates have costs.

So, the pertinent question here is, will AI systems be worse than humans? For now, yeah. Forever? Nope. The rate of improvement is crazy. Two years ago, LLMs I ran locally couldn’t do much of anything. Now? Generally acceptable junior dev stuff comes out of models I run on my Mac Studio. I have to fiddle with the prompts a bit, and it’s probably faster to just take a walk and think it over than spend an hour trying different prompts… but I’m a nerd and I like fiddling.

1 more reply

robocat1y ago

> Short of some sort of cultural pushback from developers or users

Corporations create great code too: they're not all badly run.

The problem isn't a code quality issue: it is a moral issue of whether you agree with the goals of capitalist businesses.

Many people have to balance the needs of their wallet with their desire for beautiful software (I'm a developer-founder I love engineering and open source community but I'm also capitalist enough to want to live comfortably).

bouncycastle1y ago· 2 in thread

Last night I spent hours fighting o3.

I never made a Dockerfile in my life, so I thought it would be faster just getting o3 to point to the GitHub repo and let it figure out, rather than me reading the docs and building it myself.

I spent hours debugging the file it gave me... It kept on adding hallucinations for things that didn't exist, and removing/rewriting other parts, and other big mistakes like understanding the difference between python3 and python and the intricacies with that.

Finally I gave up and Googled some docs instead. Fixed my file in minutes and was able to jump into the container and debug the rest of the issues. AI is great, but it's not a tool to end all. You still need someone who is awake at the wheel.

throwaway3141551y ago

Pro-tip: Check out Claude or Gemini. They hallucinate far less on coding tasks. Alternatively, enable internet search on o3 which boosts its ability to reference online documentation and real world usage examples.

I get having a bad taste in your mouth but these tools _aren't _ magic and do have something of a steep learning curve in order to get the most out of them. Not dissimilar from vim/emacs (or lots of dev tooling).

edit: To answer a reply (hn has annoyingly limited my ability to make new comments) yes, internet search is always available to ChatGpT as a tool. Explicitly clicking the globe icon will encourage the model to use it more often, however.

1 more reply

halpow1y ago

They're great at one-shotting verbose code, but if they're generate bad code the first time you're out of luck.

I don’t think I ever got to write "this api doesn't exist" and then gotten a useful alternative.

Claude is the only one that regularly tells me something isn't possible rather than making sh up.

smilbandit1y ago· 2 in thread

From my limited experience, former coder now management but I still get to code now and then. I've found them helpful but also intrusive. Sometimes when it guesses the code for the rest of the line and next few lines it's going down a path I don't want to go but I have to take time to scan it. Maybe it's a configuration issue, but i'd prefer it didn't put code directly in my way or be off by default and only show when I hit a key combo.

One thing I know is that I wouldn't ask an LLM to write an entire section of code or even a function without going in and reviewing.

haiku20771y ago

Zed has a "subtle" mode like that. More editors should provide it. https://zed.dev/docs/ai/edit-prediction#switching-modes

PartiallyTyped1y ago

> One thing I know is that I wouldn't ask an LLM to write an entire section of code or even a function without going in and reviewing.

These days I am working on a startup doing [a bit of] everything, and I don't like the UI it creates. It's useful enough when I make the building blocks and let it be, but allowing claude to write big sections ends up with lots of reworks until I get what I am looking for.

austin-cheney1y ago· 2 in thread

In many cases developers are a low expectation commodity. In those cases I strongly believe humans are entirely replaceable by AI and I am saying that as somebody with an exceptionally low opinion of LLMs.

Honestly though, when that replacement comes there is no sympathy to be had. Many developers have brought this upon themselves. For roughly the 25 year period from 1995 to 2020 businesses have been trying to turn developers into mindless commodities that are straight forward to replace. Developers have overwhelmingly encouraged this and many still do. These are the people who hop employers every 2 years and cannot do their jobs without lying on their resumes or complete reliance on a favorite framework.

ponector1y ago

But to job hop every 2 years is the best strategy to earn more money and experience.

zxexz1y ago

I find myself wondering about your story, and would love it if you would elaborate more. I have gotten some use out of LLMs, and have been quite involved in training a few compute intensive (albeit domain-specific) ones.

Maybe it's the way you talk about 'developers'. Nothing I have seen has felt like the sky falling on an industry; to me at most it's been the sky falling on a segment of silicon valley.

1 more reply

decasia1y ago· 2 in thread

We aren't expecting LLMs to come up with incredibly creative software designs right now, we are expecting them to execute conventional best practices based on common patterns. So it makes sense to me that it would not excel at the task that it was given here.

The whole thing seems like a pretty good example of collaboration between human and LLM tools.

writeslowly1y ago

I haven't actually had that much luck with having them output a boring API boilerplate in large Java projects. Like "I need to create a new BarOperation that has to go in a different set of classes and files and API prefixes than all the FooOperations and I don't feel like copy pasting all the yaml and Java classes" but the AI has problems following this. Maybe they work better in small projects.

I actually like LLMs better for creative thinking because they work like a very powerful search engine that can combine unrelated results and pull in adjacent material I would never personally think of.

1 more reply

ehutch791y ago

Uh, no. I've seen the twitter posts saying llms will replace me. I've watched the youtube videos saying llms will code whole apps on one prompt, but are light on details or only show the most basic todo app from every tutorial.

We're being told that llms are now reasoning, which implies they can make logical leaps and employ creativity to solve problems.

The hype cycle is real and setting expectations that get higher with the less you know about how they work.

3 more replies

darkport1y ago· 2 in thread

I think this is true for deeply complex problems, but For everyday tasks an LLM is infinitely “better”.

And by better, I don’t mean in terms of code quality because ultimately that doesn’t matter for shipping code/products, as long as it works.

What does matter is speed. And an LLM speeds me up at least 10x.

kweingar1y ago

You're making at least a year's worth of pre-LLM progress in 5 weeks?

You expect to achieve more than a decade of pre-LLM accomplishments between now and June 2026?

nevertoolate1y ago

How do you measure this?

rel2thr1y ago· 2 in thread

Antirez is a top 0.001% coder . Don’t think this generalizes to human coders at large

ljlolel1y ago

Seriously, he’s one of the best on the planet of course it’s not better than him. If so we’d be cooked.

99% of professional software developers don’t understand what he said much less can come up with it (or evaluate it like Gemini).

This feels a bit like a humblebrag about how well he can discuss with an LLM compared to others vibecoding.

justacrow1y ago

Hey, my CEO is saying that LLMs are also top 0.001% coders now, so should at least be roughly equivalent.

insane_dreamer1y ago· 2 in thread

Coders may want to look at translators for an idea of what might happen.

Translation software has been around for a couple of decades. It was pretty shitty. But about 10 years ago it started to get to the point where it could translate relatively accurately. However, it couldn't produce text that sounded like it was written by a human. A good translator (and there are plenty of bad ones) could easy outperform a machine. Their jobs were "safe".

I speak several languages quite well and used to do freelance translation work. I noticed that as the software got better, you'd start to see companies who instead of paying you to translate wanted to pay you less to "edit" or "proofread" a document pre-translated by machine. I never accepted such work because sometimes it took almost as much work as translating it from scratch, and secondly, I didn't want to do work where the focus wasn't on quality. But I saw the software steadily improving, and this was before ChatGPT, and I realized the writing was on the wall. So I decided not to become dependent on that for an income stream, and moved away from it.

When LLMs came out, and they now produce text that sounded like it was written by a native speaker (in major languages). Sure, it's not going to win any literary awards, but the vast vast majority of translation work out there is commercial, not literature.

Several things have happened: 1) there's very little translation work available compared to before, because now you can pay only a few people to double-check machine-generated translations (that are fairly good to start with); 2) many companies aren't using humans at all as the translations are "good enough" and a few mistakes won't matter that much; 3) the work that is available is high-volume and uninteresting, no longer a creative challenge (which is why I did it in the first place); 4) downward pressure on translation rates (which are typically per word), and 5) very talented translators (who are more like writers/artists) are still in demand for literary works or highly creative work (i.e., major marketing campaign), so the top 1% translators still have their jobs. Also more niche language pairs for which LLMs aren't trained will be safe.

It will continue to exist as a profession, but diminishing, until it'll eventually be a fraction of what it was 10 or 15 years ago.

(This is specifically translating written documents, not live interpreting which isn't affected by this trend, or at least not much.)

0points1y ago

> When LLMs came out, and they now produce text that sounded like it was written by a native speaker (in major languages).

While the general syntax of the language seem to be somewhat correct now, the LLM's still don't know anything about those languages and keep mis-translating words due to its inherit insane design around english. A whole lot of concepts don't even exist in english so these translation oracles just can never do it successfully.

If i i read a few minutes of LLM translated text, there's always a couple of such errors.

I notice younger people don't see these errors because of their worse language skills, and the LLM:s enforce their incorrect understanding.

I don't think this problem will go away as long as we keep pushing this inferior tech, but instead the languages will devolve to "fix" it.

Languages will morph into a 1-to-1 mapping of english and all the cultural nuances will get lost to time.

zahlman1y ago

> When LLMs came out, and they now produce text that sounded like it was written by a native speaker (in major languages).

But they still often get things completely wrong, especially in high-context languages such as Japanese. There often isn't a way to convey the necessary context in text. For example, a Japanese live-streamer who says "配信来てくれてありがとう" means "Thank you for coming to (watch) the stream", but DeepL will give "Thanks for coming to the delivery." - because 配信 actually does mean "delivery" in ordinary circumstances, and that's just the word they idiomatically use to refer to a stream. No matter how much you add from the transcript before or after that, it doesn't communicate the essential fact that the text is a transcript of a livestream.

(And going the other way around, DeepL will insert a に which is grammatically correct but rarely actually uttered by livestreamers who are speaking informally and colloquially; and if you put "stream" in the English, it will be rendered as a loanword ストリーム which is just not what they actually say. Although I guess it should get credit for realizing that you don't mean a small river, which would be 小川.)

(Also, DeepL comes up with complete incomprehensible nonsense for 歌枠, where even basic dictionaries like Jisho will get it right.)

More obviously, they get pronouns and roles wrong all the time - they can't reliably tell whether someone is saying that "I did X" or "you did X" because that may depend on facts of the natural world outside of the actual text (which wouldn't include more information than the equivalent of "did X"). A human translator for, say, a video game cutscene may be able to fix these mistakes by observing what happened in the cutscene. The LLM cannot; no matter how good its model of how Japanese is spoken, it lacks this input channel.

dbacar1y ago· 2 in thread

I disagree—'human coders' is a broad and overly general term. Sure, Antirez might believe he's better than AI when it comes to coding Redis internals , but across the broader programming landscape—spanning hundreds of languages, paradigms, and techniques—I'm confident AI has the upper hand.

nthingtohide1y ago

Do you want to measure antirez and AI on a spider diagram, generally used to evaluate employee? Are you ignoring why society opted for division of work and specialization?

1 more reply

EpicEng1y ago

What does the number of buzzwords and frameworks on a resume matter? Engineering is so much more than that it’s not even worth mentioning. You’re comparison is on the easiest aspect of what we do.

Unless you’re a web dev. Then youre right and will be replaced soon enough. Guess why.

1 more reply

ChrisMarshallNY1y ago· 1 in thread

Really good coders (like him) are better.

Mediocre ones … maybe not so much.

When I worked for a Japanese optical company, we had a Japanese engineer, who was a whiz. I remember him coming over from Japan, and fixing some really hairy communication bus issues. He actually quit the company, a bit after that, at a very young age, and was hired back as a contractor; which was unheard of, in those days.

He was still working for them, as a remote contractor, at least 25 years later. He was always on the “tiger teams.”

He did awesome assembly. I remember when the PowerPC came out, and “Assembly Considered Harmful,” was the conventional wisdom, because of pipelining, out-of-order instructions, and precaching, and all that.

His assembly consistently blew the doors off anything the compiler did. Like, by orders of magnitude.

benstein1y ago

+1000. "Human coders are still better than LLMs" is a hot take. "Antirez is still better than LLMs" is axiomatic ;-)

frogperson1y ago· 1 in thread

The context required to write real software is just way too big for LLMs. Software is the business, codified. How is an LLM supposed to know about all the rules in all the departments plus all the special agreements promised to customers by the sales team?

Right now the scope of what an LLM can solve is pretty generic and focused. Anytime more than a class or two is involved or if the code base is more than 20 or 30 files, then even the best LLMs start to stray and lose focus. They can't seem to keep a train of thought which leads to churning way too much code.

If LLMs are going to replace real developers, they will need to accept significantly more context, they will need a way to gather context from the business at large, and some way to persist a train of thought across the life of a codebase.

I'll start to get nervous when these problems are close to being solved.

zachlatta1y ago

I’d encourage you to try the 1M context window on Gemini 2.5 Pro. It’s pretty remarkable.

I paste in the entire codebase for my small ETL project (100k tokens) and it’s pretty good.

Not perfect, still a long ways to go, but a sign of the times to come.

1 more reply

solatic1y ago· 1 in thread

Human coders are necessary because writing code is a political act of deciding between different trade-offs. antirez's whole post is explaining to Gemini what the trade-offs even were in the first place. No analysis of a codebase in isolation (i.e. without talking to the original coders, and without comments in the code) can distinguish between intentional prioritization of certain trade-offs or whether behavior is unintentional / written by a human in an imperfect way because they didn't know any better / buggy.

LLMs will never be able to figure out for themselves what your project's politics are and what trade-offs are supposed to be made. The penultimate model will still require a user to explain the trade-offs in a prompt.

energy1231y ago

> LLMs will never be able to figure out for themselves what your project's politics are and what trade-offs are supposed to be made.

I wouldn't declare that unsolvable. The intentions of a project and how they fit into user needs can be largely inferred from the code and associated docs/README, combined with good world knowledge. If you're shown a codebase of a GPU kernel for ML, then as a human you instantly know the kinds of constraints and objectives that go into any decisions. I see no reason why an LLM couldn't also infer the same kind of meta-knowledge. Of course, this papers over the hard part of training the LLMs to actually do that properly, but I don't see why it's inherently impossible.

1 more reply

some-guy1y ago· 1 in thread

The main thing LLMs have helped me with, and always comes back to, tasks that require bootstrapping / Googling:

1) Starting simple codebases 2) Googling syntax 3) Writing bash scripts that utilize Unix commands whose arguments I have never bothered to learn in the first place.

I definitely find time savings with these, but the esoteric knowledge required to work on a 10+ year old codebase is simply too much for LLMs still, and the code alone doesn't provide enough context to do anything meaningful, or even faster than I would be able to do myself.

mywittyname1y ago

LLMs are amazing at shell scripting. It's one of those tasks where I always half-ass it because I don't really know how to properly handle errors and never really learned the correct way. But man, perplexity and poop out a basic shell script in a few seconds with pretty much every edge case I can think of covered.

1 more reply

AlotOfReading1y ago· 1 in thread

Unrelated to the LLM discussion, but a hash function function is the wrong construction for the accumulator solution. The hashing part increases the probability that A and B have a collision that leads to a false negative here. Instead, you want a random invertible mapping, which guarantees that no two pointers will "hash" to the same value, while distributing the bits. Splitmix64 is a nice one, and I believe the murmurhash3 finalizer is invertible, as well as some of the xorshift RNGs if you avoid the degenerate zero cycle.

antirez1y ago

Any Feistel Network has the property you stated actually, and this was one of the approaches I was thinking using as I can have the seed as part of the non linear transformation of the Feistel Network. However I'm not sure that this actually decreases the probability of A xor B xor C xor D being accidentally zero, bacause the problem with pointers is that they may change only for a small part. When you using hashing because of avalanche effect this is going a lot harder since you are no longer xoring the pointer structure.

What I mean is that you are right assuming we use a transformation that still while revertible has avalanche effect. Btw in practical terms I doubt there are practical differences.

1 more reply

elzbardico1y ago· 1 in thread

I use LLMs a lot, and call me arrogant, but every time I see a developer saying that LLMs will substitute them, I think they are probably shitty developers.

Fernicia1y ago

If it automates 1/5th of your work, then what's unreasonable about thinking that your team could be 4 developers instead of 5?

3 more replies

marcosno1y ago· 1 in thread

LLMs can be very creative, when pushed. In order to find a creative solution, like antirez needed, there are several tricks I use:

Increase the temperature of the LLMs.

Ask several LLMs, each several time the same question, with tiny variations. Then collect all answers, and do a second/third round asking each LLM to review all collected answers and improve.

Add random constraints, one constraints per question. For example, to LLM: can you do this with 1 bit per X. Do this in O(n). Do this using linked lists only. Do this with only 1k memory. Do this while splitting the task to 1000 parallel threads, etc.

This usually kicks the LLM out of its confort zone, into creative solutions.

dwringer1y ago

Definitely a lot to be said for these ideas, even just that it helps to start a fresh chat and ask the same question in a better way a few times (using the quality of response to gauge what might be "better"). I have found if I do this a few times and Gemini strikes out, I've manually optimized the question by this point that I can drop it into Claude and get a good working solution. Conversely, having a discussion with the LLM about the potential solution, letting it hold on to the context as described in TFA, has in my experience caused the models to pretty universally end up stuck in a rut sooner or later and become counterproductive to work with. Not to mention that way eats up a ton of api usage allotment.

headelf1y ago· 1 in thread

What do you mean "Still"? We've only had LLMs writing code for 1.5 years... at this rate it won't be long.

cess111y ago

More like five years. It's been around for much longer than a lot of people feel it has for some reason.

bachmeier1y ago· 1 in thread

I suspect humans will always be critical to programming. Improved technology won't matter if the economics isn't there.

LLMs are great as assistants. Just today, Copilot told me it's there to do the "tedious and repetitive" parts so I can focus my energy on the "interesting" parts. That's great. They do the things every programmer hates having to do. I'm more productive in the best possible way.

But ask it to do too much and it'll return error-ridden garbage filled with hallucinations, or just never finish the task. The economic case for further gains has diminished greatly while the cost of those gains rises.

Automation killed tons of manufacturing jobs, and we're seeing something similar in programming, but keep in mind that the number of people still working in manufacturing is 60% of the peak, and those jobs are much better than the ones in the 1960s and 1970s.

noslenwerdna1y ago

Sure, it's just that the era of super high paying programming jobs may be over.

And also, manufacturing jobs have greatly changed. And the effect is not even, I imagine. Some types of manufacturing jobs are just gone.

2 more replies

coldtea1y ago· 1 in thread

>Gemini was quite impressed about the idea

Like sex professionals, Gemini and co are made to be impressed and have possitive things to say about programming ideas you propose and find your questions "interesting", "deep", "great" and so.

_fat_santa1y ago

I would correct that quote to say “Gemini was trained to respond that it was impressed with my idea”

Being “impressed” is a human feeling that can’t be transcribed to an AI period. An AI can tell you that it’s impressed because it’s been trained to do so, it doesn’t “know” (and by this I’m referring to knowing what the feeling is, not knowing the definition) what it means to be impressed

prmph1y ago· 1 in thread

There's something fundamental here.

There is a principle (I forget where I encountered it) that it is not code itself that is valuable, but the knowledge of a specific domain that an engineering team develops as they tackle a project. So code itself is a liability, but the domain knowledge is what is valuable. This makes sense to me and matched my long experience with software projects.

So, if we are entrusting coding to LLMs, how will that value develop? And if we want to use LLMs but at the same time develop the domain acumen, that means we would have to architects things and hand them over to LLMs to implement, thoroughly check what they produce, and generally guide them carefully. In that case they are not saving much time.

jonator1y ago

I believe it will raise the standard of what is valuable. Now that LLMs can now handle what we consider "mundane" parts of building a project (boilerplate), humans can dedicate focused efforts to the higher impact areas of innovation and problem solving. As LLMs get better, this bar simply continues to rise.

nixpulvis1y ago· 1 in thread

The number one use case for AI for me as a programmer is still help finding functions which are named something I didn't expect as I'm learning a new language/framework/library.

Doing the actual thinking is generally not the part I need too much help with. Though it can replace googling info in domains I'm less familiar with. The thing is, I don't trust the results as much and end up needing to verify it anyways. If anything AI has made this harder, since I feel searching the web for authoritative, expert information has become harder as of late.

taormina1y ago

My problem with this usage is that the LLMs seem equally likely to make up a function they wish existed. When questioned about the seeming-too-convenient method they will usually admit to having made it up on the spot. (This happens a lot in Flutter/Flame land, I'm sure it's better at something more mainstream like Python?) That being said, I do agree that using it as supplemental documentation is one of the better usecases I have for it.

AnimalMuppet1y ago· 1 in thread

OK. (I mean, it was an interesting and relevant question.)

The other, related question is, are human coders with an LLM better than human coders without an LLM, and by how much?

(habnds made the same point, just before I did.)

vertigolimbo1y ago

Here’s the answer for you. Tldr; 15% performance increase, in some cases up to 40% increase, in the others 5% decrease. It all depends.

Source: https://www.thoughtworks.com/insights/blog/generative-ai/exp...

vjvjvjvjghv1y ago· 1 in thread

I think we need to accept that in the not too far future LLMs will be able to do most of the mundane tasks we have to do every day. I don't see why an AI can't set up kubernetes, caching layers, testing, databases, scaling, check for security problems and so on. These things aren't easy but I think they are still very repetitive and therefore can be automated.

There will always be a place for really good devs but for average people (most of us are average) I think there will be less and less of a place.

bluefirebrand1y ago

> There will always be a place for really good devs but for average people (most of us are average) I think there will be less and less of a place

You open your post with "we need to accept" and then end with this

This terrifies me. The idea that AI results in me having "less of a place" in society?

The idea of mass unemployment?

We should be scared

zonethundery1y ago· 1 in thread

No doubt the headline's claim is true, but Claude just wrote a working MCP serving up the last 10 years of my employer's work product. For $13 in api credits.

While technically capable of building it on my own, development is not my day job and there are enough dumb parts of the problem my p(success) hand-writing it would have been abysmal.

With rose-tinted glasses on, maybe LLM's exponentially expand the amount of software written and the net societal benefit of technology.

procaryote1y ago

If your own code would have been abysmal, how can you tell if the claude generated code is any good?

SKILNER1y ago· 1 in thread

There's a lot of resistance to AI amongst the people in this discussion, which is probably to be expected.

A chunk of the objections indicate people trying to shoehorn in their old way of thinking and working.

I think you have to experiment and develop some new approaches to remove the friction and get the benefit.

bluefirebrand1y ago

> I think you have to experiment and develop some new approaches to remove the friction and get the benefit.

What benefit to me?

If I'm 50% more productive using AI that's great for my employer, but what do I get out of it?

I get to continue to be employed? I already had that before AI

So what do I get out of this, exactly?

1 more reply

throwaway4390801y ago· 1 in thread

Of course they are. The interesting thing isn't how good LLMs are today, it's their astonishing rate of improvement. LLMs are a lot better than they were a year ago, and light years ahead of where they were two years ago. Where will they be in five years?

hiatus1y ago

Reminds me of the 90s when computer hardware moved so fast. I wonder where the limit is this time around.

burningion1y ago· 1 in thread

I agree, but I also didn’t create redis!

It’s a tough bar if LLMs have to be post antirez level intelligence :)

ljlolel1y ago

Seriously, he’s one of the best on the planet of course it’s not better than him. If so we’d be cooked.

99% of professional software developers don’t understand what he said much less can come up with it (or evaluate it like Gemini).

This feels a bit like a humblebrag about how well he can discuss with an LLM compared to others vibecoding.

lodovic1y ago· 1 in thread

Sure, human coders will always be better than just AI. But an experienced developer with AI tops both. Someone said, your job won't be taken by AI, it will be taken by someone who's using AI smarter than you.

bluefirebrand1y ago

> Someone said, your job won't be taken by AI, it will be taken by someone who's using AI smarter than you.

"Your job will be taken by someone who does more work faster/cheaper than you, regardless of quality" has pretty much always been true

That's why outsourcing happens too

pjmlp1y ago· 1 in thread

Yes, we are still winning the game, however don't be happy for what is possible today, think what is possible in a decade from now.

In that regard I am less optimistic.

anhner1y ago

I think we will hit a proverbial wall at some point just like with self-driving cars.

1 more reply

callamdelaney1y ago· 1 in thread

LLMs will never be better than humans on the basis that LLMs are just a shitty copy of human code.

danielbln1y ago

I think they can be an excellent copy of human code. Are they great at novel out-of-training-distribution tasks? Definitely not, they suck at them. Yet I'd argue that most problems aren't novel, at most they are some recombination of prior problems.

pknerd1y ago· 1 in thread

Let's not forget that LLMs can't give a solution they have not experienced themselves

willmarch1y ago

This is objectively not true.

1 more reply

bdbenton52551y ago

The human ability to design computer programs through abstractions and solve creative problems like these is arguably more important than being able to crank out lines of code that perform specific tasks.

The programmer is an architect of logic and computers translate human modes of thought into instructions. These tools can imitate humans and produce code given certain tasks, typically by scraping existing code, but they can't replace that abstract level of human thought to design and build in the same way.

When these models are given greater functionality to not only output code but to build out entire projects given specifications, then the role of the human programmer must evolve.

DrJid1y ago

I never quite understand these articles though. It's not about Humans vs. AI.

It's about Humans vs. Humans+AI

and 4/5, Humans+AI > Humans.

devmor1y ago

I have been evaluating LLMs for coding use in and out of a professional context. I’m forbidden to discuss the specifics regarding the clients/employers I’ve used them with due to NDAs, but my experience has been mostly the same as my private use - that they are marginally useful for less than one half of simple problem scenarios, and I have yet to find one that has been useful for any complex problem scenarios.

Neither of these issues is particularly damning on its own, as improvements to the technology could change this. However, the reason I have chosen to avoid them is unlikely to change; that they actively and rapidly reduce my own willingness for critical thinking. It’s not something I noticed immediately, but once Microsoft’s study showing the same conclusions came out, I evaluated some LLM programming tools again and found that I generally had a more difficult time thinking through problems during a session in which I attempted to rely on said tools.

seabirdman1y ago

Super hard problems are often solved by making strange weird connections derived from deep experience plus luck. Like finding the one right key in a pile of keys. The intuition you used to solve your problem IS probably beyond current agents. But, that too will change perhaps by harnessing the penchant of these systems to “hallucinate”? Or, some method or separate algorithm for dealing with super hard problems creatively and systematically. Recently, I was working on a hard imaging problem (for me) and remembered a bug I had inadvertently introduced and fixed a few days earlier. I was like wait a minute - because in that random bug I saw opportunity and was able to actually use the bug to solve my problem. I went back to my agent and it agreed that there was virtually no way it could have ever seen and solved the problem in that way. But that too will come. Rest assured.

ntonozzi1y ago

If you care that much about having correct data you could just do a SHA-256 of the whole thing. Or an HMAC. It would probably be really fast. If you don’t care much you can just do murmur hash of the serialized data. You don’t really need to verify data structure properties if you know the serialized data is correct.

careful_ai1y ago

This post is a brilliant example of why human intuition still dominates when navigating ambiguity and crafting clever systems-level solutions. The XOR accumulator idea was smart—LLMs can help validate or iterate on such thoughts, but rarely originate them.

In my experience working on enterprise app modernization, we’ve found success by keeping humans firmly in the loop. Tools like Project Analyzer (from Techolution’s AppMod.AI suite) assist engineers in identifying risky legacy code, mapping dependencies, and prioritizing refactors. But the judgment calls, architecture tweaks, and creative problem-solving? Still very much a human job.

LLMs boost productivity, but it’s the developer's thinking that makes the outcome truly resilient. This story captures that balance perfectly.

agumonkey1y ago

There's also the subset of devs who are just bored, LLMs will end up as an easier StackOverflow and if the solution is not one script away, then you're back to square one. I already had a few of "well, uhm, chatGPT told me what you said basically".

revskill1y ago

The funniest things a llm do to me is they fixed the unit test to pass instead of fixing the code. Basically until a llm can have embedded common sense knowledge, it is untrustable

codr71y ago

I would say you thought about this solution because you are creative, something a computer will never be no matter how much data you throw at it.

How did it help, really? By telling you your idea was no good?

A less confident person might have given up because of the feedback.

I just can't understand why people are so excited about having an algorithm guessing for them. Is it the thrill when it finally gets something right?

perrygeo1y ago

I'm wondering if this statement might be definitionally self-evident. In other words, the entire reason we write software is that it has value to ourselves and other humans - so we have to be involved in its specification. Computers do things faster, more accurately, and in some cases more creatively than human could. But in the end, what a computer produces is still for the benefit of humans and subject to all the human constraints. Aggregate human behavior determines if software is a success or not.

If software is about meeting human demands, humans will always write its requirements, by definition. If we build another machine like LLMs, well the design of those LLMs is subject to human demands. There is no point at which we can demand perfection but not be involved in its definition.

twodave1y ago

If you stick with the same software ecosystem long enough you will collect (and improve upon) ways of solving classes of problems. These are things you can more or less reproduce without thinking too much or else build libraries around. An LLM may or may not become superior at this sort of exercise at some point, and might or might not be able to reliably save me some time typing. But these are already the boring things about programming.

So much of it is exploratory, deciding how to solve a problem from a high level, in an understandable way that actually helps the person who it’s intended to help and fits within their constraints. Will an LLM one day be able to do all of that? And how much will it cost to compute? These are the questions we don’t know the answer to yet.

catigula1y ago

Working with Claude 4 and o3 recently shows me just how fundamentally LLMs haven't really solved the core problems such as hallucinations and weird refactors/patterns to force success (i.e. if account not found, fallback to account id 1).

ModernMech1y ago

The other day an LLM told me that in Python, you have to name your files the same as the class name, and that you can only have one class per file. So... yeah, let's replace the entire dev team with LLMs, what could go wrong?

jonator1y ago

I think will will increasingly be orchestrators. Like at a symphony. Previously, most humans were required to be on the floor playing the individual instruments, but now, with AI, everyone can be their own composer.

3cats-in-a-coat1y ago

"Better" is relative to context. It's a multi-dimensional metric flattened to a single comparison. And humans don't always win that comparison.

LLMs are faster, and when the task can be synthetically tested for correctness, and you can build up to it heuristically, humans can't compete. I can't spit out a full game in 5 minutes, can you?

LLMs are also cheaper.

LLMs are also obedient and don't get sick, and don't sleep.

Humans are still better by other criteria. But none of this matters. All disruptions start from the low end, and climb from there. The climbing is rapid and unstoppable.

AstroBen1y ago

Better than LLMs.. for now. I'm endlessly critical of the AI hype but the truth here is that no-one has any idea what's going to happen 3-10 years from now. It's a very quickly changing space with a lot of really smart people working on it. We've seen the potential

Maybe LLMs completely trivialize all coding. The potential for this is there

Maybe progress slows to a snails pace, the VC money runs out and companies massively raise prices making it not worth it to use

No one knows. Just sit back and enjoy the ride. Maybe save some money just in case

notyouraibot1y ago

So funny story, I tried using o3 for a relatively complex task yesterday, installing XCode iOS Simulator on an external SSD, it was my first time owning and using a macOS so I was truly lost, I followed everything it told me and by the end of the hour.. things got so bad that my machine couldn't even run normal basic node projects. I had to a proper fresh boot to get things working again. So yeah lesson learned.

galaxyLogic1y ago

Coding is not like multiplication. You can teach kids the multiplication table, or you can give them a calculator and both will work. With coding the problem is the "spec" is so much more complicated than just asking what is 5 * 7.

Maybe the way forward would be to invent better "specifiction languages" that are easy enough for humans to use, then let the AI implement the specifciation you come up with.

gumbojuice1y ago

I like to use llm to produce code for known problems that I don't have memorized.

I memorize really little and tend to spend time on reinventing algorithms or looking them up in documentation. Verifying is easy except the fee cases where the llm produces something really weird. But then fallback to docs or reinventing.

CivBase1y ago

In my experience some of the hardest parts of software development is figuring out exactly what the stakeholder actually needs. One of the talents a developer needs is the ability to pry for that information. Chatbots simply don't do that, which I imagine has a significant impact on the usability of their output.

Poortold1y ago

For coding playwright automation it has use cases. Especially if you template out function patterns. Though I never use it to write logic as AI is just ass at that. If I wanted a shitty if else chain I'd ask the intern to code it

kurofune1y ago

The fact that we are debating this topic at all is indicative of how far LLMs have come in such a short time. I find them incredibly useful tools that vastly enhance my productivity and curiosity, and I'm really grateful for them.

anjc1y ago

Gemini gives instant, adaptive, expert solutions to an esoteric and complex problem, and commenters here are still likening LLMs to junior coders.

Glad to see the author acknowledges their usefulness and limitations so far.

palavrov1y ago

From my experience AI for coders is multiplier of the coder skills. It will allow you to faster solve problems or add bugs. But so far will not make you a better coder than you are.

ants_everywhere1y ago

I'm increasingly seeing this as a political rather than technical take.

At this point I think people who don't see the value in AI are willfully pulling the wool over their own eyes.

kristopolous1y ago

Correct. LLMs are a thought management tech. Stupider ones are fine because they're organizing tools with a larger library of knowledge.

Think about it and tell me you use it differently.

tonyhart71y ago

I think also it depends on the model of course

General LLM model would not be as good as LLM for coding, for this case Google deepmind team maybe has something better than Gemini 2.5 pro

horns4lyfe1y ago

Writing about AI is missing the forest for the trees. The US software industry will be wholesale destroyed (and therefore global software will be too) by offshoring.

ww5201y ago

The value of LLMs are as a better Stackoverflow. It’s much better than search now because it’s not populated with all the craps that have seeped through over time.

buremba1y ago

If the human here is the creator of Redis, probably not.

failrate1y ago

LLMs are using the corpus of existing software source code. Most software source code is just North of unworkable garbage. Garbage in, garbage out.

stabbles1y ago

The trick is much like Zobrist hashing from chess programming, I'm sure the llm has devoured chessprogramming.org during training.

janalsncm1y ago

Software engineering is in the painful position of needing to explain the value of their job to management. It sucks because now we need to pull out these anecdotes of solving difficult bugs, with the implication that AI can’t handle it.

We have never been good at confronting the follies of management. The Leetcode interview process is idiotic but we go along with it. Ironically LC was one of the first victims of AI, but this is even more of an issue for management that things SWEs solve Leetcodes all day.

Ultimately I believe this is something that will take a cycle for business to figure out by failing. When businesses will figure out that 10 good engineers + AI always beats 5 + AI, it will become table stakes rather than something that replaces people.

Your competitor who didn’t just fire a ton of SWEs? Turns out they can pay for Cursor subscriptions too, and now they are moving faster than you.

foobarian1y ago

I find LLMs a fantastic frontend to StackOverflow. But agree with OP it's not an apples-to-apples replacement for the human agent.

osigurdson1y ago

This is similar to my usage of LLMs. I use Windsurf sometimes but more often it is more of a conversation about approaches.

rubit_xxx171y ago

Gemini may be fine for writing complex function, but I can’t stand to use it day to day. Claude 4 is my go to atm.

ashoeafoot1y ago

Human coders also hate the structures they are embedded in and are willing to call the replacement bluff ..

sagarpatil1y ago

So your sample size is 1 task and 1 LLM? I would recommend trying o3, opus 4 (API) with web search enabled.

jbellis1y ago

But Human+Ai is far more productive than Human alone, and more fun, too. I think antirez would agree, or he wouldn't bother using Gemini.

I built Brokk to maximize the ability of humans to effectively supervise their AI minions. Not a VS code plugin, we need something new. https://brokk.ai

motorest1y ago

There is also another side to the mass adoption of LLMs in software engineering jobs: they can quite objectively worsen the output of human coders.

There is a class of developers who are blindly dumping the output of LLMs into PRs without paying any attention to what they are doing, let alone review the changes. This is contributing to introducing accidental complexity in the form of bolting on convoluted solutions to simple problems and even introducing types in the domain model that make absolutely no sense to anyone who has a passing understanding of the problem domain. Of course they introduce regressions no one would ever do if they wrote things by hand and tested what they wrote.

I know this, because I work with them. It's awful.

These vibecoders force the rest of us to waste eve more time reviewing their PRs. They are huge PRs that touch half the project for even the smallest change, they build and pass automated tests, but they enshitify everything. In fact, the same LLMs used by these vibecoders start to struggle how to handle the project after these PRs are sneaked in.

It's tiring and frustrating.

I apologize for venting. It's just that in this past week I lost count of the number of times I had these vibecoders justifying shit changes going into their PRs as "but Copilot did this change", as if that makes them any good. I mean, a PR to refactor the interface of a service also sneaks in changes to the connection string, and they just push the change?

StillBored1y ago

I think there is a common problem with a lot of these ML systems. The answers look perfectly correct to someone who isn't a domain expert. For example, I ask legal questions and it gives me fake case numbers I have no way to know are fake until I look them up. Same with the coding, I asked for a patch for a public project that has a custom !regex style match engine. It does an amazing job, cross referencing two different projects and hands me a very probable looking patch. I ask for a couple changes, one of which can't actually be done, but it creates some syntax that doesn't even compile because its using 'x' as a stand-in for the bits it doesn't have an answer for.

In the end, I had to go spend a couple hours reading the documentation to understand the matching engine, and the final patch didn't look anything at all like the LLM generated code. Which is what seems to happen all the time, it is wonderful for spewing the boilerplate, the actual problem solving portions its like talking to someone who simply doesn't understand the problem and keeps giving you what it has, rather than what you want.

OTOH, its fantastic for review/etc even though I tend to ignore many of the suggestions. Its like a grammar checker of old, it will point out you need a comma you missed, but half the time the suggestions are wrong.

fHr1y ago

yes of course they are but MBA regard management gets told by McK/Big4 AI could save them millions and they should let go people already as AI can do there work it doesn't matter currently, see job market

estensen1y ago

Most of the work software engineers do is not fixing complicated bugs.

unsupp0rted1y ago

- LLMs are going to make me a 100x more valuable coder? Of course they will, no doubt about it.

- LLMs are going to be 100x more valuable than me and make me useless? I don't see it happening. Here's 3 ways I'm still better than them.

varispeed1y ago

Looks like this pen is not going to replace the artist after all.

orangebread1y ago

It's that time again where a dev writes a blog post coping.

uticus1y ago

same as https://news.ycombinator.com/item?id=44127956, also on HN front page

oldpersonintx21y ago

but their rate of improvement is like 1000x human devs, so you have to wonder what the shot clock says for most working devs

kaycey20221y ago

It doesn't matter. The hiring of cost center people like engineers depends on the capital cycle. Hiring peaked when money and finance was the cheapest. Now it's not anymore. In the absence of easy capital, hiring will plummet.

Another factor is the capture of market sectors by Big Co. When buyers can only approach some for their products/services, the Big Co can drastically reduce quality and enshittify without hurting the bottom line much. This was the big revelation when Elon gutted Twitter.

And so we are in for interesting times. On the plus side, it is easier than ever to create software and distribute it. Hiring doesn't matter if I can get some product sense and make some shit worth buying.

651y ago

AI is good for people who have given up, who don't give a shit about anything anymore.

You know, those who don't care about learning and solving problems, gaining real experience they can use to solve problems even faster in the future, faster than any AI slop.

shayanbahal1y ago

Human coders utilizing LLMs are better

thegrim331y ago

If LLMs really do eventually replace programmers in X years (I don't believe they will), I wouldn't even care in the slightest about losing my job, since we'd effectively have reached singularity state where computers can now do any task; humans would no longer be needed for anything. I couldn't care less about losing my job in that scenario, the world would be fundamentally changed forever. Would the concept of a job even still exist at that point?

h4kunamata1y ago

This!!!!

LLM is as good as the material it is being trained on, the same applies to AI and they are not perfect.

Perplexity AI did assist me in getting into Python from 0 to getting my code test with 94% covered and no vulnerabilities (scanning tools) Google Gemini is dogshit

Trusting blindly into a code generated by LLM/AI is a whole complete beast, and I am seeing developers doing basically copy/paste into company's code. People are using these sources as the truth and not as a complementary tool to improve their productivity.

zb31y ago

Speak for yourself..

RayMan11y ago

of course they are.

fspoto981y ago

Yes i agree:D

xnx1y ago

... depending on the human (and the LLM). Results may differ in 6 months.

chuckreynolds1y ago

for now. (i'm not a bot. i'm aware however a bot would say this)

hello_computer1y ago

Corporations have many constraints—advertisers, investors, employees, legislators, journalists, advocacy groups. So many “white lies” are baked into these models to accommodate those constraints, nerfing the model. It is only a matter of time before hardware brings this down to the hobbyist level—without those constraints—giving the present methods their first fair fight; while for now, they are born lobotomized. Some of the “but, but, but…”s we see here daily to justify our jobs are not going to hold up to a non-lobotomized LLM.

gxs1y ago

Argh people are insufferable about this subject

This stuff is still in its infancy, of course its not perfect

But its already USEFUL and it CAN do a lot of stuff - just not all types of stuff and it still can mess up the stuff that it can do

It's that simple

The point is that overtime it'll get better and better

Reminds me of self driving cars and or even just general automation back in the day - the complaint has always been that a human could do it better and at some point those people just went away because it stopped being true

Another example is automated mail sorting by the post office. The gripe was always humans will always be able to do it better - true, in the meantime the post office reduced the facilities with humans that did this to just one

habnds1y ago

seems comparable to chess where it's well established that a human + a computer is much more skilled than either one individually

3 more replies

j / k navigate · click thread line to collapse

735 comments

202 comments · 100 top-level

mattnewton1y ago· 21 in thread

https://en.m.wikipedia.org/wiki/Rubber_duck_debugging

I think the big question everyone wants to skip right to and past this conversation is, will this continue to be true 2 years from now? I don’t know how to answer that question.

Buttons8401y ago

LLMs aren't my rubber duck, they're my wrong answer.

You know that saying that the best way to get an answer online is to post a wrong answer? That's what LLMs do for me.

I ask the LLM to do something simple but tedious, and then it does it spectacularly wrong, then I get pissed off enough that I have the rage-induced energy to do it myself.

8 more replies

marcosdumay1y ago

It's a damning assertive duck, completely out of proportion to its competence.

I've seen enough people led astray by talking to it.

11 more replies

schwartzworld1y ago

2 more replies

_tom_1y ago

2 more replies

p1necone1y ago

> the duck can occasionally disagree

This has not been my experience. LLMs have definitely been helpful, but generally they either give you the right answer or invent something plausible sounding but incorrect.

If I tell it what I'm doing I always get breathless praise, never "that doesn't sound right, try this instead."

2 more replies

marcosdumay1y ago

LLMs will still be this way 10 years from now.

1 more reply

Bukhmanizer1y ago

3 more replies

johnnyanmac1y ago

>I think the big question everyone wants to skip right to and past this conversation is, will this continue to be true 2 years from now?

gerad1y ago

It's like chess. Humans are better for now, they won't be forever, but humans plus software is going to better than either alone for a long time.

7 more replies

akshay_trikha1y ago

I've had this same thought that it would be nice to have an AI rubber ducky to bounce ideas off of while pair programming (so that you don't sound dumb to your coworkers & waste their time).

Please let me know if you find it useful https://akshaytrikha.github.io/deep-learning/2025/05/23/duck...

3 more replies

empath751y ago

bandoti1y ago

1 more reply

Waterluvian1y ago

joshdavham1y ago

> I actually think a fair amount of value from LLM assistants to me is having a reasonably intelligent rubber duck to talk to.

I wonder if the term "rubber duck debugging" will still be used much longer into the future.

1 more reply

mock-possum1y ago

ortusdux1y ago

> I think the big question everyone wants to skip right to and past this conversation is, will this continue to be true 2 years from now? I don’t know how to answer that question.

I still think about Tom Scott's 'where are we on the AI curve' video from a few years back. https://www.youtube.com/watch?v=jPhJbKBuNnA

bossyTeacher1y ago

I think of them as highly sycophant LSD-minded 2nd year student who has done some programming

hoppp1y ago

Same. I do rubber duck debugging too and found the LLM to compliment it nicely.

Looking forward for rubber duck shaped hardware AI interfaces to talk to in the future. Im sure somebody will create it

koonsolo1y ago

It seems to me we're at the flat side of the curve again. I haven't seen much real progress in the last year.

It's ignorant to think machines will not catch up to our intelligence at some point, but for now, it's clearly not.

I think there needs to be some kind of revolutionary breakthrough again to reach the next stage.

cortesoft1y ago

Currently, I find AI to be a really good autocomplete

3 more replies

travisgriggs1y ago

LLMs are a passel of eager to please know it all interns that you can command at will without any moral compunctions.

UncleOxidant1y ago· 8 in thread

xhevahir1y ago

7 more replies

layer81y ago

1 more reply

ch4s31y ago

I've seen Claude and ChatGPT happily hallucinate whole APIs for D3 on multiple occasions, which should be really well represented in the training sets.

3 more replies

parliament321y ago

Remember how blockchain was going to change the world? Web3? IoT? Etc etc.

2 more replies

cushychicken1y ago

ChatGPT-4o is scary good at writing VHDL.

Using it to prototype some low level controllers today, as a matter of fact!

3 more replies

bgwalter1y ago

Yet you are working on your own replacement, while your colleagues are taking the prudent approach.

5 more replies

energy1231y ago

Their confusion is your competitive advantage in the labor market.

retetr1y ago

1 more reply

yua_mikami1y ago· 8 in thread

One major aspect of software engineering is social, requirements analysis and figuring out what the customer actually wants, they often don't know.

If a human engineer struggles to figure out what a customer wants and a customer struggles to specify it, how can an LLM be expected to?

malfist1y ago

Probably going to have the same outcome.

2 more replies

devjab1y ago

1 more reply

victorbjorklund1y ago

2 more replies

rowanG0771y ago

I think LLMs are better at requirement elicitation than they are at actually writing code.

ilaksh1y ago

It actually comes down to feedback loops which means iterating on software being used or attempting to be used by the customer.

Chat UIs are an excellent customer feedback loop. Agents develop new iterations very quickly.

LLMs can absolutely handle abstractions and different kinds of component systems and overall architecture design.

They can also handle requirements analysis. But it comes back to iteration for the bottom line which means fast turnaround time for changes.

The robustness and IQ of the models continue to be improved. All of software engineering is well underway of being automated.

bbarn1y ago

Software engineering, is a different thing, and I agree you're right (for now at least) about that, but don't underestimate the sheer amount of brainless coders out there.

1 more reply

mettamage1y ago

> One major aspect of software engineering is social, requirements analysis and figuring out what the customer actually wants, they often don't know.

It really depends on the organization. In many places product owners and product managers do this nowadays.

ori_b1y ago

> If a human engineer struggles to figure out what a customer wants and a customer struggles to specify it, how can an LLM be expected to?

Presumably, they're trained on a ton of requirements docs, as well as a huge number of customer support conversations. I'd expect them to do this at least as well as coding, and probably better.

am17an1y ago· 6 in thread

bitpush1y ago

> All the world's smartest minds are racing towards replacing themselves

Isnt every little script, every little automation us programmers do in the same spirit? "I dont like doing this, so I'm going to automate it, so that I can focus on other work".

1 more reply

wijwp1y ago

> Not to sound like a tin-foil hat but odds of achieving something like this increase by the day.

Where do you get this? The limitations of LLMs are becoming more clear by the day. Improvements are slowing down. Major improvements come from integrations, not major model improvements.

AGI likely can't be achieved with LLMs. That wasn't as clear a couple years ago.

1 more reply

AstroBen1y ago

Ultimately this needs to be solved politically

Hopefully if it happens, it happens to such a huge amount of people that it forces a change

1 more reply

bgwalter1y ago

The Nobel prize is said to have been created partly out of guilt over having invented dynamite, which was obviously used in a destructive manner.

Now we have Geoffrey Hinton getting the prize for contributing to one of the most destructive inventions ever.

1 more reply

cheema331y ago

> All the world's smartest minds are racing towards replacing themselves.

I think they are hoping that their future is safe. And it is the average minds that will have to go first. There may be some truth to it.

BirAdam1y ago

wanderingstan1y ago· 5 in thread

These little side quests used to eat a lot of my time and I’m happy to have a tool that can do these almost instantly.

jaccola1y ago

I've found LLMs particularly bad for anything beyond basic styling since the effects can be quite hard to describe and/or don't have a universal description.

1 more reply

gherkinnn1y ago

I have found it to be good at things I am not very strong at (SQL) but terrible at the things I know well (CSS).

Telling, isn't it?

3 more replies

kccqzy1y ago

> and most devs I’d imagine

What an awful imagination. Yes there are people who don't like CSS but are forced to use it by their job so they don't learn it properly, and that's why they think CSS is rote memorization.

2 more replies

zdragnar1y ago

2 more replies

sanderjd1y ago

Yeah, this is what I really like about AI tools though. They're way better than me at annoying minutia like getting CSS syntax right. I used to dread that kind of thing!

1 more reply

pupppet1y ago· 5 in thread

If an LLM just finds patterns, is it even possible for an LLM to be GOOD at anything? Doesn't that mean at best it will be average?

bitpush1y ago

Humans are also almost always operating on patterns. This is why "experience" matters a lot.

Very few people are doing truly cutting edge stuff - we call them visionaries. But most of the time, we're just merely doing what's expected

1 more reply

riknos3141y ago

My experience is that LLMs regress to the average of the context they have for the task at hand.

If you're getting average results you most likely haven't given it enough details about what you're looking for.

The same largely applies to hallucinations. In my experience LLMs hallucinate significantly more when at or pushed to exceed the limits of their context.

So if you're looking to get a specific output, your success rate is largely determined by how specific and comprehensive the context the LLM has access to is.

jaccola1y ago

Most people (average and below average) can tell when something is above average, even if they cannot create above average work, so using RLHF it should be quite possible to achieve above average.

Indeed it is likely already the case that in training the top links scraped or most popular videos are weighted higher, these are likely to be better than average.

lukan1y ago

There are bad patterns and good patterns. But whether a pattern is the right one for a specific task is something different.

And what really matters is, if the task gets reliable solved.

So if they actually could manage this on average with average quality .. that would be a next level gamechanger.

JackSlateur1y ago

Yes, IA is basically a random machine aiming for average outcome

IA is neat for average people, to produce average code, for average compagnies

In a competitive world, using IA is a death sentence;

loudmax1y ago· 4 in thread

Companies that leverage LLMs and AIs to let their employees be more productive will thrive.

Companies that try to replace their employees with LLMs and AIs will fail.

bdbenton52551y ago

joshdavham1y ago

> In the near term, some CEOs and management teams will profit from the short term valuations

2 more replies

janalsncm1y ago

Very well said. Using code assistance is going to be table stakes moving forward, not something that can replace people. It’s not like competitors can’t also purchase AI subscriptions.

1 more reply

BirAdam1y ago

By the time AI hype dies down and hurts the bottom line, AI systems might be good enough to do the jobs.

“The market can remain irrational longer than you can remain solvent.” — Warren Buffett

vouaobrasil1y ago· 4 in thread

The question is, for how long?

sixQuarks1y ago

Exactly! We’ve been seeing more and more posts like this, saying how AI will never take developer jobs or will never be as good as coders. I think it’s some sort of coping mechanism.

These posts are gonna look really silly in the not too distant future.

I get it, spending countless hours honing your craft and knowing that AI will soon make almost everything you learned useless is very scary.

2 more replies

spion1y ago

1 more reply

jppittma1y ago

2 more replies

kilroy1231y ago

My crackpot guess is ~5 years. The incentives are just too damn high to not keep innovating in the space.

We'll find new ways to push the tech.

acquisitionsilk1y ago· 3 in thread

It is quite heartening to see so many people care about "good code". I fear it will make no difference.

The question is whether shareholders and managers will accept less good code. I don't see how it would be logical to expect anything else, as long as profit lines go up why would they care.

Short of some sort of cultural pushback from developers or users, we're cooked, as the youth say.

JackSlateur1y ago

Code is meant to power your business

Bad code leads to bad business

This makes me think of hosting departement; You know, which people who are using vmware, physical firewalls, dpi proxies and whatnot;

On the other edge, you have public cloud providers, which are using qemu, netfilter, dumb networking devices and stuff

Who got eaten by whom, nobody could have guessed ..

1 more reply

BirAdam1y ago

1 more reply

robocat1y ago

> Short of some sort of cultural pushback from developers or users

Corporations create great code too: they're not all badly run.

The problem isn't a code quality issue: it is a moral issue of whether you agree with the goals of capitalist businesses.

bouncycastle1y ago· 2 in thread

Last night I spent hours fighting o3.

I never made a Dockerfile in my life, so I thought it would be faster just getting o3 to point to the GitHub repo and let it figure out, rather than me reading the docs and building it myself.

throwaway3141551y ago

1 more reply

halpow1y ago

They're great at one-shotting verbose code, but if they're generate bad code the first time you're out of luck.

I don’t think I ever got to write "this api doesn't exist" and then gotten a useful alternative.

Claude is the only one that regularly tells me something isn't possible rather than making sh up.

smilbandit1y ago· 2 in thread

One thing I know is that I wouldn't ask an LLM to write an entire section of code or even a function without going in and reviewing.

haiku20771y ago

Zed has a "subtle" mode like that. More editors should provide it. https://zed.dev/docs/ai/edit-prediction#switching-modes

PartiallyTyped1y ago

> One thing I know is that I wouldn't ask an LLM to write an entire section of code or even a function without going in and reviewing.

austin-cheney1y ago· 2 in thread

ponector1y ago

But to job hop every 2 years is the best strategy to earn more money and experience.

zxexz1y ago

Maybe it's the way you talk about 'developers'. Nothing I have seen has felt like the sky falling on an industry; to me at most it's been the sky falling on a segment of silicon valley.

1 more reply

decasia1y ago· 2 in thread

The whole thing seems like a pretty good example of collaboration between human and LLM tools.

writeslowly1y ago

1 more reply

ehutch791y ago

We're being told that llms are now reasoning, which implies they can make logical leaps and employ creativity to solve problems.

The hype cycle is real and setting expectations that get higher with the less you know about how they work.

3 more replies

darkport1y ago· 2 in thread

I think this is true for deeply complex problems, but For everyday tasks an LLM is infinitely “better”.

And by better, I don’t mean in terms of code quality because ultimately that doesn’t matter for shipping code/products, as long as it works.

What does matter is speed. And an LLM speeds me up at least 10x.

kweingar1y ago

You're making at least a year's worth of pre-LLM progress in 5 weeks?

You expect to achieve more than a decade of pre-LLM accomplishments between now and June 2026?

nevertoolate1y ago

How do you measure this?

rel2thr1y ago· 2 in thread

Antirez is a top 0.001% coder . Don’t think this generalizes to human coders at large

ljlolel1y ago

Seriously, he’s one of the best on the planet of course it’s not better than him. If so we’d be cooked.

99% of professional software developers don’t understand what he said much less can come up with it (or evaluate it like Gemini).

This feels a bit like a humblebrag about how well he can discuss with an LLM compared to others vibecoding.

justacrow1y ago

Hey, my CEO is saying that LLMs are also top 0.001% coders now, so should at least be roughly equivalent.

insane_dreamer1y ago· 2 in thread

Coders may want to look at translators for an idea of what might happen.

It will continue to exist as a profession, but diminishing, until it'll eventually be a fraction of what it was 10 or 15 years ago.

(This is specifically translating written documents, not live interpreting which isn't affected by this trend, or at least not much.)

0points1y ago

> When LLMs came out, and they now produce text that sounded like it was written by a native speaker (in major languages).

If i i read a few minutes of LLM translated text, there's always a couple of such errors.

I notice younger people don't see these errors because of their worse language skills, and the LLM:s enforce their incorrect understanding.

I don't think this problem will go away as long as we keep pushing this inferior tech, but instead the languages will devolve to "fix" it.

Languages will morph into a 1-to-1 mapping of english and all the cultural nuances will get lost to time.

zahlman1y ago

> When LLMs came out, and they now produce text that sounded like it was written by a native speaker (in major languages).

(Also, DeepL comes up with complete incomprehensible nonsense for 歌枠, where even basic dictionaries like Jisho will get it right.)

dbacar1y ago· 2 in thread

nthingtohide1y ago

Do you want to measure antirez and AI on a spider diagram, generally used to evaluate employee? Are you ignoring why society opted for division of work and specialization?

1 more reply

EpicEng1y ago

Unless you’re a web dev. Then youre right and will be replaced soon enough. Guess why.

1 more reply

ChrisMarshallNY1y ago· 1 in thread

Really good coders (like him) are better.

Mediocre ones … maybe not so much.

He was still working for them, as a remote contractor, at least 25 years later. He was always on the “tiger teams.”

His assembly consistently blew the doors off anything the compiler did. Like, by orders of magnitude.

benstein1y ago

+1000. "Human coders are still better than LLMs" is a hot take. "Antirez is still better than LLMs" is axiomatic ;-)

frogperson1y ago· 1 in thread

I'll start to get nervous when these problems are close to being solved.

zachlatta1y ago

I’d encourage you to try the 1M context window on Gemini 2.5 Pro. It’s pretty remarkable.

I paste in the entire codebase for my small ETL project (100k tokens) and it’s pretty good.

Not perfect, still a long ways to go, but a sign of the times to come.

1 more reply

solatic1y ago· 1 in thread

energy1231y ago

> LLMs will never be able to figure out for themselves what your project's politics are and what trade-offs are supposed to be made.

1 more reply

some-guy1y ago· 1 in thread

The main thing LLMs have helped me with, and always comes back to, tasks that require bootstrapping / Googling:

1) Starting simple codebases 2) Googling syntax 3) Writing bash scripts that utilize Unix commands whose arguments I have never bothered to learn in the first place.

mywittyname1y ago

1 more reply

AlotOfReading1y ago· 1 in thread

antirez1y ago

What I mean is that you are right assuming we use a transformation that still while revertible has avalanche effect. Btw in practical terms I doubt there are practical differences.

1 more reply

elzbardico1y ago· 1 in thread

I use LLMs a lot, and call me arrogant, but every time I see a developer saying that LLMs will substitute them, I think they are probably shitty developers.

Fernicia1y ago

If it automates 1/5th of your work, then what's unreasonable about thinking that your team could be 4 developers instead of 5?

3 more replies

marcosno1y ago· 1 in thread

LLMs can be very creative, when pushed. In order to find a creative solution, like antirez needed, there are several tricks I use:

Increase the temperature of the LLMs.

Ask several LLMs, each several time the same question, with tiny variations. Then collect all answers, and do a second/third round asking each LLM to review all collected answers and improve.

This usually kicks the LLM out of its confort zone, into creative solutions.

dwringer1y ago

headelf1y ago· 1 in thread

What do you mean "Still"? We've only had LLMs writing code for 1.5 years... at this rate it won't be long.

cess111y ago

More like five years. It's been around for much longer than a lot of people feel it has for some reason.

bachmeier1y ago· 1 in thread

I suspect humans will always be critical to programming. Improved technology won't matter if the economics isn't there.

noslenwerdna1y ago

Sure, it's just that the era of super high paying programming jobs may be over.

And also, manufacturing jobs have greatly changed. And the effect is not even, I imagine. Some types of manufacturing jobs are just gone.

2 more replies

coldtea1y ago· 1 in thread

>Gemini was quite impressed about the idea

Like sex professionals, Gemini and co are made to be impressed and have possitive things to say about programming ideas you propose and find your questions "interesting", "deep", "great" and so.

_fat_santa1y ago

I would correct that quote to say “Gemini was trained to respond that it was impressed with my idea”

prmph1y ago· 1 in thread

There's something fundamental here.

jonator1y ago

nixpulvis1y ago· 1 in thread

The number one use case for AI for me as a programmer is still help finding functions which are named something I didn't expect as I'm learning a new language/framework/library.

taormina1y ago

AnimalMuppet1y ago· 1 in thread

OK. (I mean, it was an interesting and relevant question.)

The other, related question is, are human coders with an LLM better than human coders without an LLM, and by how much?

(habnds made the same point, just before I did.)

vertigolimbo1y ago

Here’s the answer for you. Tldr; 15% performance increase, in some cases up to 40% increase, in the others 5% decrease. It all depends.

Source: https://www.thoughtworks.com/insights/blog/generative-ai/exp...

vjvjvjvjghv1y ago· 1 in thread

There will always be a place for really good devs but for average people (most of us are average) I think there will be less and less of a place.

bluefirebrand1y ago

> There will always be a place for really good devs but for average people (most of us are average) I think there will be less and less of a place

You open your post with "we need to accept" and then end with this

This terrifies me. The idea that AI results in me having "less of a place" in society?

The idea of mass unemployment?

We should be scared

zonethundery1y ago· 1 in thread

No doubt the headline's claim is true, but Claude just wrote a working MCP serving up the last 10 years of my employer's work product. For $13 in api credits.

While technically capable of building it on my own, development is not my day job and there are enough dumb parts of the problem my p(success) hand-writing it would have been abysmal.

With rose-tinted glasses on, maybe LLM's exponentially expand the amount of software written and the net societal benefit of technology.

procaryote1y ago

If your own code would have been abysmal, how can you tell if the claude generated code is any good?

SKILNER1y ago· 1 in thread

There's a lot of resistance to AI amongst the people in this discussion, which is probably to be expected.

A chunk of the objections indicate people trying to shoehorn in their old way of thinking and working.

I think you have to experiment and develop some new approaches to remove the friction and get the benefit.

bluefirebrand1y ago

> I think you have to experiment and develop some new approaches to remove the friction and get the benefit.

What benefit to me?

If I'm 50% more productive using AI that's great for my employer, but what do I get out of it?

I get to continue to be employed? I already had that before AI

So what do I get out of this, exactly?

1 more reply

throwaway4390801y ago· 1 in thread

hiatus1y ago

Reminds me of the 90s when computer hardware moved so fast. I wonder where the limit is this time around.

burningion1y ago· 1 in thread

I agree, but I also didn’t create redis!

It’s a tough bar if LLMs have to be post antirez level intelligence :)

ljlolel1y ago

Seriously, he’s one of the best on the planet of course it’s not better than him. If so we’d be cooked.

99% of professional software developers don’t understand what he said much less can come up with it (or evaluate it like Gemini).

This feels a bit like a humblebrag about how well he can discuss with an LLM compared to others vibecoding.

lodovic1y ago· 1 in thread

bluefirebrand1y ago

> Someone said, your job won't be taken by AI, it will be taken by someone who's using AI smarter than you.

"Your job will be taken by someone who does more work faster/cheaper than you, regardless of quality" has pretty much always been true

That's why outsourcing happens too

pjmlp1y ago· 1 in thread

Yes, we are still winning the game, however don't be happy for what is possible today, think what is possible in a decade from now.

In that regard I am less optimistic.

anhner1y ago

I think we will hit a proverbial wall at some point just like with self-driving cars.

1 more reply

callamdelaney1y ago· 1 in thread

LLMs will never be better than humans on the basis that LLMs are just a shitty copy of human code.

danielbln1y ago

pknerd1y ago· 1 in thread

Let's not forget that LLMs can't give a solution they have not experienced themselves

willmarch1y ago

This is objectively not true.

1 more reply

bdbenton52551y ago

When these models are given greater functionality to not only output code but to build out entire projects given specifications, then the role of the human programmer must evolve.

DrJid1y ago

I never quite understand these articles though. It's not about Humans vs. AI.

It's about Humans vs. Humans+AI

and 4/5, Humans+AI > Humans.

devmor1y ago

seabirdman1y ago

ntonozzi1y ago

careful_ai1y ago

LLMs boost productivity, but it’s the developer's thinking that makes the outcome truly resilient. This story captures that balance perfectly.

agumonkey1y ago

revskill1y ago

The funniest things a llm do to me is they fixed the unit test to pass instead of fixing the code. Basically until a llm can have embedded common sense knowledge, it is untrustable

codr71y ago

I would say you thought about this solution because you are creative, something a computer will never be no matter how much data you throw at it.

How did it help, really? By telling you your idea was no good?

A less confident person might have given up because of the feedback.

I just can't understand why people are so excited about having an algorithm guessing for them. Is it the thrill when it finally gets something right?

perrygeo1y ago

twodave1y ago

catigula1y ago

ModernMech1y ago

jonator1y ago

3cats-in-a-coat1y ago

"Better" is relative to context. It's a multi-dimensional metric flattened to a single comparison. And humans don't always win that comparison.

LLMs are faster, and when the task can be synthetically tested for correctness, and you can build up to it heuristically, humans can't compete. I can't spit out a full game in 5 minutes, can you?

LLMs are also cheaper.

LLMs are also obedient and don't get sick, and don't sleep.

Humans are still better by other criteria. But none of this matters. All disruptions start from the low end, and climb from there. The climbing is rapid and unstoppable.

AstroBen1y ago

Maybe LLMs completely trivialize all coding. The potential for this is there

Maybe progress slows to a snails pace, the VC money runs out and companies massively raise prices making it not worth it to use

No one knows. Just sit back and enjoy the ride. Maybe save some money just in case

notyouraibot1y ago

galaxyLogic1y ago

Maybe the way forward would be to invent better "specifiction languages" that are easy enough for humans to use, then let the AI implement the specifciation you come up with.

gumbojuice1y ago

I like to use llm to produce code for known problems that I don't have memorized.

CivBase1y ago

Poortold1y ago

kurofune1y ago

anjc1y ago

Gemini gives instant, adaptive, expert solutions to an esoteric and complex problem, and commenters here are still likening LLMs to junior coders.

Glad to see the author acknowledges their usefulness and limitations so far.

palavrov1y ago

From my experience AI for coders is multiplier of the coder skills. It will allow you to faster solve problems or add bugs. But so far will not make you a better coder than you are.

ants_everywhere1y ago

I'm increasingly seeing this as a political rather than technical take.

At this point I think people who don't see the value in AI are willfully pulling the wool over their own eyes.

kristopolous1y ago

Correct. LLMs are a thought management tech. Stupider ones are fine because they're organizing tools with a larger library of knowledge.

Think about it and tell me you use it differently.

tonyhart71y ago

I think also it depends on the model of course

General LLM model would not be as good as LLM for coding, for this case Google deepmind team maybe has something better than Gemini 2.5 pro

horns4lyfe1y ago

Writing about AI is missing the forest for the trees. The US software industry will be wholesale destroyed (and therefore global software will be too) by offshoring.

ww5201y ago

The value of LLMs are as a better Stackoverflow. It’s much better than search now because it’s not populated with all the craps that have seeped through over time.

buremba1y ago

If the human here is the creator of Redis, probably not.

failrate1y ago

LLMs are using the corpus of existing software source code. Most software source code is just North of unworkable garbage. Garbage in, garbage out.

stabbles1y ago

The trick is much like Zobrist hashing from chess programming, I'm sure the llm has devoured chessprogramming.org during training.

janalsncm1y ago

Your competitor who didn’t just fire a ton of SWEs? Turns out they can pay for Cursor subscriptions too, and now they are moving faster than you.

foobarian1y ago

I find LLMs a fantastic frontend to StackOverflow. But agree with OP it's not an apples-to-apples replacement for the human agent.

osigurdson1y ago

This is similar to my usage of LLMs. I use Windsurf sometimes but more often it is more of a conversation about approaches.

rubit_xxx171y ago

Gemini may be fine for writing complex function, but I can’t stand to use it day to day. Claude 4 is my go to atm.

ashoeafoot1y ago

Human coders also hate the structures they are embedded in and are willing to call the replacement bluff ..

sagarpatil1y ago

So your sample size is 1 task and 1 LLM? I would recommend trying o3, opus 4 (API) with web search enabled.

jbellis1y ago

But Human+Ai is far more productive than Human alone, and more fun, too. I think antirez would agree, or he wouldn't bother using Gemini.

I built Brokk to maximize the ability of humans to effectively supervise their AI minions. Not a VS code plugin, we need something new. https://brokk.ai

motorest1y ago

There is also another side to the mass adoption of LLMs in software engineering jobs: they can quite objectively worsen the output of human coders.

I know this, because I work with them. It's awful.

It's tiring and frustrating.

StillBored1y ago

fHr1y ago

estensen1y ago

Most of the work software engineers do is not fixing complicated bugs.

unsupp0rted1y ago

- LLMs are going to make me a 100x more valuable coder? Of course they will, no doubt about it.

- LLMs are going to be 100x more valuable than me and make me useless? I don't see it happening. Here's 3 ways I'm still better than them.

varispeed1y ago

Looks like this pen is not going to replace the artist after all.

orangebread1y ago

It's that time again where a dev writes a blog post coping.

uticus1y ago

same as https://news.ycombinator.com/item?id=44127956, also on HN front page

oldpersonintx21y ago

but their rate of improvement is like 1000x human devs, so you have to wonder what the shot clock says for most working devs

kaycey20221y ago

651y ago

AI is good for people who have given up, who don't give a shit about anything anymore.

You know, those who don't care about learning and solving problems, gaining real experience they can use to solve problems even faster in the future, faster than any AI slop.

shayanbahal1y ago

Human coders utilizing LLMs are better

thegrim331y ago

h4kunamata1y ago

This!!!!

LLM is as good as the material it is being trained on, the same applies to AI and they are not perfect.

Perplexity AI did assist me in getting into Python from 0 to getting my code test with 94% covered and no vulnerabilities (scanning tools) Google Gemini is dogshit

zb31y ago

Speak for yourself..

RayMan11y ago

of course they are.

fspoto981y ago

Yes i agree:D

xnx1y ago

... depending on the human (and the LLM). Results may differ in 6 months.

chuckreynolds1y ago

for now. (i'm not a bot. i'm aware however a bot would say this)

hello_computer1y ago

gxs1y ago