I know nothing about AI, but when DALL-E was released, I was under the impression that the leap in tech was so big that no one was going to beat OpenAI at it. Now we have a bunch: Stable Diffusion, Midjourney, and lots of similar parallel projects.
Is it because OpenAI was sharing their secret sauce? Or is it that the sauce isn’t that special?
If it weren't for patents, you'd never get a moat from technology. Google, Facebook, Apple and the rest have a moat because of two-sided markets: advertisers go where the audience is, app makers go where the users are.
(There's another kind of "tech" company that is wrongly lumped in with the others: the overcapitalized company that looks like it has a moat simply because it is overcapitalized and can afford to lose money to win market share. This includes Amazon, Uber and Netflix.)
Most modern tech companies are software companies. To them, the means of production are a commodity server in a rack. It might be an expensive server, but that depends on scale; it might even be a personal computer on a desk, or a smartphone in a pocket. And while creating software is highly technical, duplicating it is probably the most trivial computing operation that exists. Not that distribution is trivial (although it certainly can be); just that if you have one copy of software or data, you have enough for 8 billion people.
Google's Transformer patent isn't relevant to GPT at all. https://patents.google.com/patent/US10452978B2/en
They patented the original Transformer encoder-decoder architecture. But most modern models are built out of encoders only (the BERT family) or decoders only (the GPT family).
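The distinction is easy to see in the attention patterns. Here's a minimal NumPy sketch (illustrative only, not any real model's code): encoder-style layers use full bidirectional attention, decoder-style layers use a causal mask, and the patented encoder-decoder adds a third piece, cross-attention from decoder to encoder.

```python
import numpy as np

def attention(q, k, v, mask=None):
    # Scaled dot-product attention; mask blocks disallowed positions.
    scores = q @ k.T / np.sqrt(q.shape[-1])
    if mask is not None:
        scores = np.where(mask, scores, -1e9)
    weights = np.exp(scores) / np.exp(scores).sum(-1, keepdims=True)
    return weights @ v

T, d = 4, 8                      # toy sequence length and embedding size
x = np.random.randn(T, d)

# Encoder-style (BERT family): every token attends to every other token.
encoder_out = attention(x, x, x)

# Decoder-style (GPT family): causal mask, token i sees only tokens <= i.
causal = np.tril(np.ones((T, T), dtype=bool))
decoder_out = attention(x, x, x, mask=causal)

# The patented encoder-decoder wiring adds cross-attention:
# decoder queries attend over encoder outputs.
cross_out = attention(x, encoder_out, encoder_out)
```

Drop either the encoder stack or the cross-attention and you're outside the architecture described in the patent, which is exactly what BERT and GPT did.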
Even if they wanted to enforce their patent, they couldn't. It's the classic problem with patents that every lawyer warns you about: what if someone makes a small change that circumvents your patent?
Once you know that OpenAI gets a certain set of results with roughly technology X, it's much easier to recreate that work than to do it in the first place.
This is true of most technology. Inventing the telephone is something, but if you had told a competent engineer the basic idea, they'd have been able to build it 50 years earlier, no problem.
Same with flight. There are some really tricky problems with counter-intuitive answers (like how stalls work and how turning should work, which still trip up new pilots today). The space of possible answers is huge, and even the questions themselves are far from obvious. It took the Wright brothers years of experiments to understand that they were stalling their wing. But now that the basic questions and their rough answers are known, any amateur can build a plane in their shed.
The sauce is special, but the recipe is already known. Most of what things like LLMs are built on comes from published research, so in principle coming up with an architecture that can do something very close is doable for anyone with the skills to understand the research material.
The problems start with a) taking the architecture to a finished, fine-tuned model, and b) running that model. Now we are talking about non-trivial amounts of compute, storage and bandwidth, so seemingly simple resources suddenly become a very real problem.
Right now the magical demo is being paraded around, exploiting the same "worse is better" that toppled previous ivory towers of computing. It's helpful while the real product development happens elsewhere, since it keeps investors hyped about something.
The new verticals seem smaller than all of AI/ML. One company dominating ML is about as likely as a single source owning the living room or the smartphones or the web. That's a platitude for companies to woo their shareholders and for regulators to point at while doing their job. ML dominating the living room or smartphones or the web or education or professional work is equally unrealistic.
Most likely this.
But the counter-example for a high moat would be the atomic bomb: the Soviets were able to build it for a fraction of what it cost the US because the hard parts were leaked to them.
GPT-3, AFAIK, is easier pickings because they used a bigger model than necessary; guidelines about model size vs. training data appeared afterwards (the Chinchilla scaling laws), so GPT-4 probably won't be as easy to trim down.
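The back-of-the-envelope version of that claim, using the rule of thumb from the 2022 "Training Compute-Optimal Large Language Models" (Chinchilla) paper of roughly 20 training tokens per parameter. GPT-3's published numbers are 175B parameters trained on about 300B tokens; everything else here is my own rough arithmetic, not anything from the paper:

```python
TOKENS_PER_PARAM = 20  # Chinchilla rule of thumb, ~20 tokens per parameter

gpt3_params = 175e9    # GPT-3 parameter count (published)
gpt3_tokens = 300e9    # tokens GPT-3 was trained on (published, approx.)

# Compute-optimal model size for the amount of data GPT-3 actually saw:
optimal_params = gpt3_tokens / TOKENS_PER_PARAM   # ~15B parameters

print(f"GPT-3 params:           {gpt3_params:.0e}")
print(f"Compute-optimal params: {optimal_params:.0e}")
print(f"Oversized by roughly:   {gpt3_params / optimal_params:.0f}x")
```

By that crude measure GPT-3 is an order of magnitude larger than its training data warrants, which is why a smaller model trained on more tokens can plausibly match it.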