Once those laws are in place, the moat is dug - and large corporations with access to capital vacuum up good training data in bulk. The whole time loudly proclaiming that they're 'empowering creators' and 'rewarding original thinkers.' Now the large corporations have the largest and best datasets tied up and neat, deep moat. Any startup wanting to challenge the incumbents could have significantly better models, but without access to the same quality and volume of training data, they'll be at a massive disadvantage.
There is little to no need for anyone to scrape the web themselves, CommonCrawl does it for free and provides access to all that data for free, they are a charity similar to The Internet Archive:
As for using that data to train a model, I've done it at my own company and it's not that expensive, certainly not the astronomically expensive figures people seem to think it is. Maybe a hobbyist can't do it for themselves, but my small company managed to get pretty good results training on our existing hardware. It's hard to give concrete figures because we manage our own hardware instead of using AWS, but all in we're talking on the order of 100k, as opposed to what I've heard people on HN think it costs, as if it's a $10 million+ enterprise.
The real issue for machine learning is that it lets you build systems that optimize for some metric. That's what corporations do. I've been pointing out lately that the ethical problems for AI in the near term look very much like those for corporations.
Remember... Bits have color and AI models are a rainbow.
Seems like we almost need pre-ai scraped datasets, almost like how it’s sometimes very useful to have metals that were created before the first atom bombs (I don’t know too much about this, just that there’s a definite land in the radiated sand around that timeframe).
I think the future you described is possible. I also think it's possible that corporations are overpaying for data that will become stale after several years and new fresh data is generated in such great quantity/quality that access to it is not capital intensive.
There seems to be a lot of different ideas about 'quality'. A lot of people want a better-than-search tool that knows vast amounts and can be assigned tasks. I'm personally much more interested in a reasoning machine, I don't need it to include the entire contents of the New York Public Library.
any law can be legislated, if it can be put down as text.
It may be considered unreasonable, but if the majority of people _do_ agree, then it could be done.
To operate in the real world like humans you need to have motivation. What is an AI motivated by?
The law is simple... You have to have a license to the data to train a model with that data. If you don't... Get ready to be sued.
Forgive my ignorance but is this not the whole point of civilized society? Social safety net are what my taxes are supposed to be for isn't it?
I'm from Europe living in USA and the discussion around "What if there were some way to pool our resources to help those in need" is always ... interesting.
The claim goes too far though. Automation, thus far, has been good for everyone. Mostly those with capital, but also those without. The zero sum outcomes implied by the headline don't hold up to what we've observed with increased living standards globally as automation has improved economic output.
Once they have mass produced Terminator-like invincible robot cops they'll be able to do whatever they wish with us. They could give us universal income, they could also round us up and push us into the sea.
The personal home computer, the early web etc were productivity boosts but moved power from incumbents to small players. AI as it currently stands almost achieves the exact opposite. A relatively straight forward heuristic to ask is, how much does an authoritarian power like the technology?
Even Bitcoin is not going to fix any of this.
AI does not improve productivity in the technical sense. Productivity improvements are the result of using better tools. I would classify better tools as force multipliers. AI is a force replacement as far as I'm concerned. If I'm using stable diffusion, I'm not painting, I'm writing prompts and hoping I get what I want. It's as far removed from the actual practice as it can be.
Modern DJ decks have an auto-mixing switch. DJs still exist and use that switch when they need to take a bathroom break, etc.
Prompt-writing is to interacting with AI as the command line is to using Photoshop - an early UI to a new tech.
Even if AGI itself was “good”, that does not mean it will be used or deployed fairly and equitably. In fact, I’d posit without major regulation it will just increase the wealth gap
this wasn't the case for other capital goods that was invented in the past, so why is AI going to be different?
In the past, other forms of automation brought on more wealth, which did enrich the rest of society (on top of bringing personal wealth to the founders).
The only difference is that _your_ current view has been normalized to see the products of that capital goods as normal!
It really seems like “all of human history” tells a different story. The standard of living for virtually everyone is much higher now than it was in the past. Obviously there are people who capture a large share of the value. That is true is in any competitive system. But technological advancements have improved the entire world on average.
In other words, why is this time different?
The problem isn't new, it's always existed as a pathology of American society. You already have been trading away the gains of technological improvement to an ever shrinking group of people in control of capital.
AI isn't creating a new problem, it's a reminder of an old one. Which should make it clear that even bothering to talk about AI specifically is talking about the wrong issue.
i dont think that's true. The average wealth has grown, and even if you didn't get wage growth, the availability of things you could buy with said wage has grown (which is in and of itself a form of wealth).
I think Stable Diffusion, being FOSS & usable on a decent range of consumer hardware, is a pretty clear counterexample to the article's claim - it doesn't matter if you're a capitalist or not, you can get great value from the technology. Funnily, the article totally avoids mentioning SD.
A1111 and the extensions system they've built is really amazing for putting FOSS into nontechnical consumers' hands.
I personally found that stable diffusion has been just as good at producing imagery. And i have seen plenty of fine tuning done for specific purposes (see https://civitai.com/models/5585/deliberate-for-invoke for example).
And the fact that SD is available locally, as opposed to midjourney, makes it more of a hotbed for new innovations.
I’m asking, I have no idea. I was under the impression that MJ was a easier to use toy. I didn’t know they were close let alone Mj superior.
But the other side of this is that large corporations will have the upper hand in AI only if the success of that AI is dependent on their other advantages (like having tons of data about all of us). The fact that model architectures are getting somewhat universalized seems to suggest that eventually there may be some highly reusable building blocks that will provide good outcomes ... when trained on enough data with enough diversity. And that data comes from everyone; if we insist on different data governance principles, we could support an ecosystem where anyone can create, train, and run their own AI services.
If dumping laundry detergent into a pile creates gold, but once the pile reaches 1 million lbs it causes the Earth to explode, you have to find a way that no one can gather such a pile. Not "democratize" the practice and letting anyone anywhere have as much detergent as they wish.
Being able to automate valuable work means you need a smaller coalition of power brokers to maintain the bulk of your GDP, meaning a government can remain in power with the support of an ever-shrinking class of influential people.
This is something people have been talking about for decades, with semi-serious proposals like a tax for robots being thrown around. The people in power have yet to take a hard look at any of it, though -- and why should they? It's a safer bet to curry favor with the ever-shrinking circle of influential people. The core problem in my opinion is how to create an incentive for politicians today to put into place measures to combat this consolidation of power. The longer we wait, the stronger the incentives against liberal democracy become, and the harder it will be to make a change.
They're somewhere between reactionary efforts to strangle the technology in the crib, and just completely misinformed naval-gazing by people with nothing better to do - proposing fictional scenarios to argue about rather then dealing with real data.
https://www.cbsnews.com/news/mcdonalds-automated-restaurant-...
And then even if you have the data and the engineering talent, it takes competent leadership to deploy a product and shepherd it to success. (Looking at Google’s graveyard here)
(I think, I have no idea what the copyright rules are and don’t care, not an artist, but I can make crazy good pictures now)
Barely 10 days holiday per year, no universal healthcare, car-infested cities, unsafe cities filled by homelessness, racial tension all over the country, drug addiction, and many more. The US is a living nightmare yet people like you are blinded to think you're still the best in the world. Pity.
No matter how you slice it, the rich gets richer, those in power centres in government gets more power, and the majority in the lower middle class loses the most.
https://issuu.com/stanfordchaparral/docs/parody_119_3-4/17
To summarize: Larry Page is bragging about how, in 2017, Alphabet created a "data REIT" which contained all of Google's data, and licensed it on FRAND terms to all comers. As a "REIT" it's required to pay out 95% of its profits as dividends, and everyone whose data they use is a shareholder in the "REIT."
Yes, I know that's not what REIT's are for. This would take legislation, as Larry hints in the interview.
Basically, the AI training data is nationalized, with compensation to the owners, i.e. Google, FB, etc. You could argue, and people would, about who deserves compensation. Congress would have to do its job, for once.In this "interview" Larry is bragging about how well this worked out for Google, since the clients of Google Data can make much better use of the data than Google itself can.
This misses the deeper reality, in my view. AI is predicated on and bootstrapped by the free labor of others. Even if it were “open source” and “owned” by end-users, AI fundamentally requires people to do free work. That’s the problem with it.
― Frank Herbert, Dune
Y = K^(1-x)*L^(x)
Here's ONE WEIRD TRICK that CAPITALISTS HATE. Increase X.
FTFY