undefined | Better HN

0 pointsbaxtr7d ago0 comments

Isn’t most standard software these days a permutation of things already done before?

0 comments

Author here: it's not even clear that agents can reliably permute their training data (I'm not saying that it's impossible or never happens but that it's not something we can take for granted as a reliable feature of agentic coding).

As I mentioned in one of the footnotes in the post:

> People often tell me "you would get better results if you generated code in a more mainstream language rather than Haskell" to which I reply: if the agent has difficulty generating Haskell code then that suggests agents aren't capable of reliably generalizing beyond their training data.

If an agent can't consistently apply concepts learned in one language to generate code in another language, then that calls into question how good they are at reliably permuting the training dataset in the way you just suggested.

mike_hearn7d ago

Your argument is far too dependent on observations made about the model's ability with Haskell, which is irrelevant. The concepts in Haskell are totally different to almost any other language - you can't easily "generalize" from an imperative strict language like basically everything people really use to a lazy pure FP language that uses monads for IO like Haskell. The underlying concepts themselves are different and Haskell has never been mainstream enough for models to get good at it.

Pick a good model, let it choose its own tools and then re-evaluate.

reedlaw6d ago

Why is Haskell irrelevant to the argument that LLMs can't reliably permute programming knowledge from one language to another? In fact, the purity of the language and dearth of training data seems like the perfect test case to see whether concepts found in more mainstream languages are actually understood.

1 more reply

rytis7d ago

> if the agent has difficulty generating Haskell code then that suggests agents aren't capable of reliably generalizing beyond their training data.

doesn't that apply to flesh-and-bone developers? ask someone who's only working in python to implement their current project in haskell and I'm not so sure you'll get very satisfying results.

Frieren7d ago

> doesn't that apply to flesh-and-bone developers?

No, it does not. If you have a developer that knows C++, Java, Haskell, etc. and you ask that developer to re-implement something from one language to another the result will be good. That is because a developer knows how to generalize from one language (e.g. C++) and then write something concrete in the other (e.g. Haskell).

1 more reply

cassianoleal7d ago

Your argument fails where it equates someone who only codes in one language to an LLM who is usually trained in many languages.

In my experience, a software engineer knows how to program and has experience in multiple languages. Someone with that level of experience tends to pick up new languages very quickly because they can apply the same abstract concepts and algorithms.

If an LLM that has a similar (or broader) data set of languages cannot generalise to an unknown language, then it stands to reason that it is indeed only capable of reproducing what’s already in its training data.

ozlikethewizard7d ago

The hard bit of programming has never been knowing the symbols to tell the computer what to do. It is more difficult to use a completely unknown language, sure, but the paradigms and problem solving approaches are identical and thats the actual work, not writing the correct words.

1 more reply

debugnik7d ago

But the model has seen pretty much all the public Haskell code around, and possibly been trained to write it in different settings.

graemep7d ago

I am very sceptical mainstream languages will be better. I have seen plenty of bad Python from LLMs. Even with simple CRUD apps and when provided with detailed instructions.

lukan7d ago

"that suggests agents aren't capable of reliably generalizing beyond their training data."

Yes? If they could, we would have a strong general intelligence by now and only few people are claiming this.

ChrisGreenHeur7d ago

It can also mean that the other programming language is above the cognitive abilities of the LLM

loveparade7d ago

But what's the point of re-building "standard software" if it is so standard that it already exists 100 times in the training data with slight variations?

lynx977d ago

I read this attitude very often on HN. "If someone else has already built it before, your effort is a waste of time." To me, it has this "Someone else already makes money from it, go somewhere else where you dont have competition." Well, I get the drift... But... Not everyone is into getting rich. You know, some of us just have fun building things and learning while doing so. It really doesn't matter if the path has been walked before. Not everything has to be plain novelty to count.

loveparade7d ago

If you do it for fun then why do you care whether an LLM can do it well or not, which was the original argument? Shouldn't matter to you in that case.

1 more reply

baxtrOP7d ago

See here:

https://news.ycombinator.com/item?id=47435808

ChrisGreenHeur7d ago

The point is the small variations

roarcher7d ago

I'd say that's pretty much the definition of standard, yeah. And it's why you can't make a profit selling a simple ToDo app. If you expect people to pay for what you build, you have to build something that doesn't have a thousand free clones on the app store.

baxtrOP7d ago

I politely disagree.

I think you’re conflating software and product.

A product can be a recombination of standard software components and yet be something completely new.

layer87d ago

That isn’t saying much. Every software is a permutation of zeros and ones. The novelty or ingenuity, or just quality and fitness for purpose, can lie in the permutation you come up with. And an LLM is limited by its training in the permutations it is likely to come up with, unless you give it heaps of specific guidance on what to do.

mfabbri777d ago

In my experience, the further you move away from the user and toward the hardware and fundamental theoretical algorithms, the less true this becomes.

This is very true for an email client, but very untrue for an innovative 3D rendering engine technology (just an example).

layer87d ago

An email client is highly nontrivial, due to the complexities of the underlying standards, and how the real implementations you have to be compatible with don’t strictly follow them. Making an email client that doesn’t suck and is fully interoperable is quite an ambitious endeavor.

mfabbri777d ago

The point was to answer the question: "Can every piece of software be viewed as a permutation of software that has already been developed?" In my opinion, an email client is a more favorable example than a 3D engine. In fields where it is necessary to differentiate, improve, or innovate at the algorithmic level, where research and development play a fundamental role, it is not simply a matter of permuting software or leveraging existing software components by simply assembling them more effectively.

1 more reply

umanwizard7d ago

What complexities specifically? Implementing SMTP (from the client’s perspective) that other SMTP servers can understand is not very hard. I have done it. Does it follow every nuance of the standard? I don’t know, but it works for me. I haven’t implemented IMAP but I don’t see why it should be much harder. Is there a particular example you have in mind?

fmbb7d ago

I would be surprised if there are more working email clients out there than working 3D engines. The gaming market is huge, most people do not pay to use email, hobbyists love creating game engines.

umanwizard7d ago

Idk, a working basic email client is just not that hard to write though. SMTP and IMAP are simple protocols and the required graphical interface is a very straightforward combination of standard widgets.

1 more reply

nitwit0056d ago

There's this weird schizophrenia around this in the software world.

In the past I have had people here suggest they're just writing boilerplate CRUD software, and I've suggested that means they could just use low code tools instead. They then suggest it's too complex for that to work.

I think we tend to view ourselves as just hooking together basic operations, which might be technically true, but that becomes complex very quickly. A product can be built off of straight forward REST and database operations, but take you months of learning to get up to speed on.

j / k navigate · click thread line to collapse

0 comments

Gabriel4397d ago

As I mentioned in one of the footnotes in the post:

mike_hearn7d ago

Pick a good model, let it choose its own tools and then re-evaluate.

reedlaw6d ago

1 more reply

rytis7d ago

> if the agent has difficulty generating Haskell code then that suggests agents aren't capable of reliably generalizing beyond their training data.

doesn't that apply to flesh-and-bone developers? ask someone who's only working in python to implement their current project in haskell and I'm not so sure you'll get very satisfying results.

Frieren7d ago

> doesn't that apply to flesh-and-bone developers?

1 more reply

cassianoleal7d ago

Your argument fails where it equates someone who only codes in one language to an LLM who is usually trained in many languages.

ozlikethewizard7d ago

1 more reply

debugnik7d ago

But the model has seen pretty much all the public Haskell code around, and possibly been trained to write it in different settings.

graemep7d ago

I am very sceptical mainstream languages will be better. I have seen plenty of bad Python from LLMs. Even with simple CRUD apps and when provided with detailed instructions.

lukan7d ago

"that suggests agents aren't capable of reliably generalizing beyond their training data."

Yes? If they could, we would have a strong general intelligence by now and only few people are claiming this.

ChrisGreenHeur7d ago

It can also mean that the other programming language is above the cognitive abilities of the LLM

loveparade7d ago

But what's the point of re-building "standard software" if it is so standard that it already exists 100 times in the training data with slight variations?

lynx977d ago

loveparade7d ago

If you do it for fun then why do you care whether an LLM can do it well or not, which was the original argument? Shouldn't matter to you in that case.

1 more reply

baxtrOP7d ago

See here:

https://news.ycombinator.com/item?id=47435808

ChrisGreenHeur7d ago

The point is the small variations

roarcher7d ago

baxtrOP7d ago

I politely disagree.

I think you’re conflating software and product.

A product can be a recombination of standard software components and yet be something completely new.

layer87d ago

mfabbri777d ago

In my experience, the further you move away from the user and toward the hardware and fundamental theoretical algorithms, the less true this becomes.

This is very true for an email client, but very untrue for an innovative 3D rendering engine technology (just an example).

layer87d ago

mfabbri777d ago

1 more reply

umanwizard7d ago

fmbb7d ago

I would be surprised if there are more working email clients out there than working 3D engines. The gaming market is huge, most people do not pay to use email, hobbyists love creating game engines.

umanwizard7d ago

1 more reply

nitwit0056d ago

There's this weird schizophrenia around this in the software world.

j / k navigate · click thread line to collapse