We won’t have that until we come up with a better way to fund these things. """Open""" AI was founded on that idea and had the best chance of anyone of reaching it. Even going in with that intent, they failed: they switched to locking down the distribution of their models, and somehow ended up effectively bought by MS despite the original non-profit-like structure. You just won’t see what you’re asking for for as long as this field is dominated by the profit motive.
https://arstechnica.com/information-technology/2023/03/you-c...
I know nothing about AI, but when DALL-E was released I was under the impression that the leap in tech was so crazy that no one was going to beat OpenAI at it. Now we have a bunch of alternatives: Stable Diffusion, Midjourney, and lots of similar parallel projects.
Is it because OpenAI was sharing their secret sauce? Or is it that the sauce isn’t that special?
If it weren't for patents, you'd never get a moat from technology alone. Google, Facebook, Apple and the rest have a moat because of two-sided markets: advertisers go where the audience is, app makers go where the users are.
(There's another kind of "tech" company that is wrongly lumped in with the others, this is an overcapitalized company that looks like it has a moat because it is overcapitalized and able to lose money to win market share. This includes Amazon, Uber and Netflix.)
Once you know that OpenAI gets a certain set of results with roughly technology X, it's much easier to recreate that work than to do it in the first place.
This is true of most technology. Inventing the telephone is something, but if you told a competent engineer the basic idea, they'd be able to do it 50 years earlier no problem.
Same with flight. There are some really tricky problems with counter-intuitive answers (like how stalls work and how turning should work, both of which still trip up new pilots today). The space of possible answers is huge, and even the questions themselves are very unclear. It took the Wright brothers years of experiments to understand that they were stalling their wing. But once you have the basic questions and their rough answers, any amateur today can build a plane in their shed.
The sauce is special, but the recipe is already known. Most of what things like LLMs are based on comes from published research, so in principle, coming up with an architecture that can do something very close is doable by anyone with the skills to understand the research material.
The problems start with a) taking the architecture to a finished, fine-tuned model and b) running that model. Now we are talking about non-trivial amounts of compute, storage and bandwidth, so seemingly mundane resources suddenly become a very real problem.
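To put rough numbers on point b): even just holding the weights in memory for inference is a hardware problem at GPT-3 scale. A minimal back-of-envelope sketch (the parameter counts and byte widths below are illustrative assumptions, not anyone's published specs):

```python
# Rough estimate of memory needed to *hold* a model's weights for
# inference. Ignores activations and KV cache, so real needs are higher.

def inference_memory_gb(params_billions: float, bytes_per_param: float) -> float:
    """GiB required for the weights alone."""
    return params_billions * 1e9 * bytes_per_param / 2**30

# A 175B-parameter model (GPT-3 scale) at float16: ~326 GiB,
# i.e. several datacenter GPUs just to load it.
print(round(inference_memory_gb(175, 2)))

# The same model quantized to 4 bits: ~81 GiB, still multi-GPU territory.
print(round(inference_memory_gb(175, 0.5)))

# A 7B model at 4 bits: ~3.3 GiB, which fits on a laptop.
print(round(inference_memory_gb(7, 0.5), 1))
```

That last line is why the small open models run locally while GPT-3-class models stayed behind an API: the gap is mostly a resources gap, not a secrets gap.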
Right now the magical demo is being paraded around, exploiting the same "worse is better" that toppled previous ivory towers of computing. It's helpful while the real product development happens elsewhere, since it keeps investors hyped about something.
The new verticals seem smaller than all of AI/ML. One company dominating ML is about as likely as a single company owning the living room, or smartphones, or the web outright. That story is a platitude for companies to woo their shareholders and for regulators to point at while doing their job. ML dominating the living room or smartphones or the web or education or professional work is equally unrealistic.
Most likely this.
But the counter to the high-moat argument would be the atomic bomb: the Soviets were able to build it for a fraction of what it cost the US, because the hard parts were leaked to them.
GPT-3, AFAIK, is easier pickings because they used a bigger model than necessary; afterwards, guidelines about model size vs. training data appeared (the compute-optimal "Chinchilla" scaling results), so GPT-4 probably won't be as easy to trim down.
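The guideline in question boils down to a rule of thumb: train on roughly 20 tokens per parameter. A minimal sketch of why GPT-3 counts as "bigger than necessary" under that rule (the 20x constant and GPT-3's ~300B training tokens are figures from published research, not OpenAI's actual recipe):

```python
# Compute-optimal ("Chinchilla") rule of thumb: ~20 training tokens per
# parameter. Models trained on far fewer are undertrained for their size,
# meaning a smaller model trained longer can match them.

def chinchilla_optimal_tokens(params: float) -> float:
    """Approximate compute-optimal number of training tokens."""
    return 20 * params

gpt3_params = 175e9
gpt3_actual_tokens = 300e9  # reported training set size, approximate

optimal = chinchilla_optimal_tokens(gpt3_params)
print(optimal / 1e9)                      # ~3500B tokens would be "optimal"
print(gpt3_actual_tokens / optimal)       # GPT-3 saw under a tenth of that
```

That shortfall is exactly the slack that smaller, longer-trained models exploit to match GPT-3 at a fraction of the parameter count.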
Isn't this already happening with LLaMA and Dalai etc.? Already you can run Whisper yourself, and you can run a model almost as powerful as gpt-3.5-turbo. So I can't see why it's out of bounds that we'll eventually be able to host a model as powerful as GPT-4 on our own (highly specced) Mac Studio M3s, or whatever it may be.