GPT-4 Outperforms Elite Crowdworkers, Saving Researchers $500k and 20k hours (opens in new tab)

(artisana.ai)

147 pointsmztwo3y ago144 comments

144 comments

87 comments · 18 top-level

troops_h8r3y ago· 19 in thread

I don't think I see enough discussion about what this means for privacy. There was some protection in the fact that it was prohibitively expensive to get someone to listen to every single one of our phonecalls/read all our emails/etc.

Worrying that this will no longer be the case.

lumost3y ago

The top use case I've been hearing is in legal discovery. Law firms used to play games with diligence by disclosing TBs of email and making it cost prohibitive to find relevant emails. This task would normally require a $60-100/hr paralegal or lawyer.

GPT-4 can do that task for fractions of a penny per email now. It doesn't have to be perfect if its competing with nothing. I expect we'll see similar shops for any other high cost paper/trail business.

pseudo03y ago

Is there a solution to the issue of data stewardship yet? I'd imagine it typically would not be permissible to send a bunch of proprietary legal documents off to OpenAI.

What I'd really love to implement is a way for GPT-4 to answer questions based on a corpus of "all our Confluence pages plus random other sources of documentation." Like with the legal document issue, it's a bit of a nonstarter right now given the proprietary nature of corporate documentation.

2 more replies

lazyeye3y ago

There's also this..

https://old.reddit.com/r/ChatGPT/comments/12fiwaf/chat_gpt_w...

alphanumeric03y ago

Why do you think there was a mass surveillance of American domestic communications since forever ago, as leaked by Snowden? This technology has been available since then and can effectively summarize millions of pieces of communication.

malux853y ago

Yeah but have you seen the leaked slides? It's clear that they have only the ability to analyse 10% or less of the data they are storing.

GPT-like systems will close that gap, and then comes all of the problems of automated law enforcement - Extrapolation from incomplete data, false positives from coindicences, interpretation errors, all that annoying stuff

2 more replies

hex4def63y ago

Collection an ocean of data from everyone isn't the same as actually painstakingly tying all the pieces together for everyone.

They've created a huge library of unorganized data. The difference here is they now can spawn a million untiring AI private investigators / librarians to organize this information into coherent "case files".

At least for me, until this point I've had a feeling of anonymity in the idea that, while my data is being slurped up, I'm just one data point in a sea of other 'normal' people. There would be little value in spending government time and effort tying all of the web detritus together for me. The juice would definitely not be worth the squeeze.

However, when the cost of this effort is nearly zero, that now becomes a different story. The balance of power between government and the people it rules is going to radically shift.

two_in_one3y ago

Not exactly. Gov had to be selective because its surveillance required a lot of resources per person/call. New technology allows it cheap and en mass. Voice calls can be recorded, then converted to text, then filtered. And humans will only analyze something of interest. Like we did have alphabet and books, and newspapers for hundreds of years. But only with internet we got the ability to process them easily.

2 more replies

stevenhuang3y ago

> This technology has been available since then

No, it hasn't lol.

3 more replies

wefarrell3y ago

I think that was metadata and not actual audio of conversations.

SketchySeaBeast3y ago

Man, I can't wait for the AI to start hallucinating crimes.

i_am_jl3y ago

Of all the futurisms in Minority Report I really didn't think this one would show up so early.

dragonwriter3y ago

Shotspotter is a thing and it already has been doing that for years (both on its own and on law enforcement request.)

1 more reply

more_corn3y ago

Already started. At least two reported cases.

talesvin3y ago

I mean, if we get to the point AI is pointing the finger at someone i hope that a human will double check it at least.

4 more replies

Shekelphile3y ago

> There was some protection in the fact that it was prohibitively expensive to get someone to listen to every single one of our phonecalls/read all our emails/etc.

That's already how it worked on platforms like mturk and uhrs, lots of the work was transcribing audio dumps from microphones built into computers/phones/smart home devices. UHRS especially had a lot of that (it's owned by MS) as well as search engine grading type work. They also certainly do not pay well, I'd imagine that in practice there isn't much cost difference to paying a bunch of bored people to do it vs the compute cost for running an AI model to do it, but the AI model will be vastly more accurate and will work 24/7.

lovvtide3y ago

Now that is something I hadn't considered. Woah.

HPMOR3y ago

Not to sound condescending but really? How is this not immediately your mind goes? Every piece of information ever recorded can now be summarized and cross-linked efficiently. Privacy is beyond dead. Soon every authoritarian government (and Democratic ones albeit secretly) will have integrated platforms that track every single one of your movements, known contacts, internet usage, financial data, and correspondence. Big Brother has NEVER EVER been more effective than it will become.

3 more replies

fshbbdssbbgdd3y ago

With regard to privacy, what’s the difference between your email’s text stored on a server, and your email’s text alongside the output of the text processed through a LLM? If “they” can already look at the text, what more privacy is there to lose?

kerkeslager3y ago

There's a great deal of privacy in simply being a needle in a haystack. Part of the processing that's possible with an LLM is filtering.

Imagine you've sent an email about transporting a friend's daughter across state lines to get a medically-necessary abortion. Or if you prefer, imagine you've arranged via email to "lose" some firearms which don't comply with your state's new assault weapons ban.

Pre-LLMs, trying to find these sorts of emails was very hard. A simple text search for "abortion" or "gun" is going to come up with far more emails where two family members got into a political debate, than emails about lawbreaking. Big Brother will find a few such emails here and there by chance, but the vast majority of such incriminating emails will simply be lost in the pile.

Enter LLMs, and Big Brother can feed some of the incriminating emails found my chance into a training dataset along with a bunch of non-incriminating emails, and teach the AI to find incriminating emails, and then apply the model to the entire list of emails and get a nicely filtered list of only the emails which are incriminating, further tuning the model by adding emails it gets wrong to the training dataset when they are found.

8769780957897893y ago· 11 in thread

Great to see this tech and the money invested in it being used to take low-paying jobs away from people with limited options, instead of something like drug discovery or cancer biology.

Nifty39293y ago

This assumes those people really do have no other way to contribute. I don’t believe that’s the case. Do you?

I believe people can contribute in many different ways. When technology enables us to get my work output without me, that frees me up to produce other things for society.

13years3y ago

> that frees me up to produce other things for society

The problem is that it is a disruption for everything because at its core it is a machine for the replication of skill and technology. A concept that has never existed prior with any other technological disruption.

"Climbing the skill ladder is going to look more like running on a treadmill at the gym. No matter how fast you run, you aren’t moving, AI is still right behind you learning everything that you can do."

from a more in depth view I wrote up here describing the rapidly shrinking innovation, disruption and adaption cycles

https://dakara.substack.com/p/ai-and-the-end-to-all-things

8769780957897893y ago

The issue is not can people contribute in some other way, but can they convince someone else to pay them a living wage for doing so, which is going to prove progressively more difficult as this technology advances.

flandish3y ago

> society

Sure. But when your “keep the lights on” job cuts you for AI, you are less likely to “produce other things” while you worry about food and heat.

somsak23y ago

like what?

adastra223y ago

Better get used to it. That's basically tech's entire value proposition.

snotrockets3y ago

It's also useful to launder biases that would be immoral, if not outright illegal, if humans would admit to employing.

SongofEarth3y ago

PC and Xerox eliminated the secretarial pool, and working women since then have been working on much more meaningful things.

jutrewag3y ago

That’s debatable.

nr2x3y ago

In this case it is academic research released to the public, which is why anybody knows about it. This is a fairly good thing compared to any alternative I can think of.

8769780957897893y ago

> In this case it is academic research released to the public

That's going to be the exception, not the rule. The benefits from automating crowdwork will disproportionately accrue to corporate profits.

1 more reply

courseofaction3y ago· 7 in thread

We need new political arrangements to distribute the gains of AI or things are going to get very bad very quickly.

WillAdams3y ago

Back when computers were first going mainstream, there was discussion about taxing them so as to provide for the folks whose jobs would be lost to them --- never went anywhere, but this is a discussion which we need to circle back to.

biohax20153y ago

We are fucked. I have no hope in humanity managing this technology responsibly and no hope in my future. The months since ChatGPT's release have been some of the worst in my life, mental-health-wise.

sph3y ago

Dude let me share my plan: go build something for yourself, a modest bootstrapped business. If you want to keep being an employee, you'll see your job and other people jobs getting more and more replaced by AIs. Bosses hyped to put AI in the coffee machine. Clients asking to add ChatGPT to their wordpress site.

Either you go build something you want, an island where code doesn't talk back, or leave tech altogether.

I am saddened that I don't even recognise this place anymore. It's not Hacker News anymore, it's AI news. It's starry eyed engineers jumping over each other ready to sell their metaphorical soul. Even I, the Luddite, can't seem to talk about anything else than this bloody thing.

My current plan is to build a small business and retire in the middle of the woods somewhere. Do some Lisp coding while the rest of the world is dancing around their new idol.

/rant, send me an email if you wanna rant about it as well, and discuss your concerns.

29athrowaway3y ago

Such as the political arrangements to distribute the gains of high yield farmer equipment, fully automatized factories, high frequency trading bots? It is not going to happen.

What is going to happen are private robotic armies making sure private owners remain private owners.

And then, we will go back to times where people were not citizens by default and had fewer rights.

Sateeshm3y ago

What gains? When everyone is out of a job eventually, who's going to pay the AI companies?

nr2x3y ago

AI isn’t the problem, capitalism is.

I’m not worried about rogue AI taking over the nukes. I’m worried the same people who think it’s a great idea to charge so much for insulin that people start dying are the ones who will be using AI to hurt people.

Hell, give me a slightly evil AI run amok over any pharma CEO doing their job.

pixl973y ago

So China/Russia/Wherever-stan is going to use AI more responsibly?

Human greed is the problem. Authoritarianism and capitalism are just subcategories of the greed problem.

What we don't have an answer for yet, is will AGI be greedy?

rossdavidh3y ago· 6 in thread

So, uh, GPT-4 outperforms at labeling. What is that labeling used for?

"Employing Surge AI's top-tier human annotators at a rate of $25 per hour would have cost $500,000 for 20,000 hours of work, an excessive amount to invest in the research endeavor. Surge AI is a venture-backed startup that performs the human labeling for numerous AI companies including OpenAI, Meta, and Anthropic."

What could go wrong? Using GPT-4 to perform labeling used by OpenAI in order to train...uh, wait.

mztwoOP3y ago

Yep - you highlighted exactly what raised my eyebrow as I was writing the article.

mclightning3y ago

This is a bigger problem than people realise.

Think about it, how many millions of articles are posted online produced by OpenAI's GPTs to date... Good luck clearing out the training data for GPT-5.

True human content will get gradually scarce. We steer it for sure for our posts, but it is still GPTs that do the heavy lifting.

OpenAI's own classifier fails to detect GPT-4 generated text at the moment.

pixl973y ago

>classifier fails to detect GPT-4 generated text

That's because beyond the 'As an AI language model' and a few key words it can be nearly impossible to detect GPT-4 especially if any prompt is used to intentionally keep it from being detected.

Human like text is a solved problem. There is no more getting better at detecting AI written text, there is only classifying more humans incorrectly at this point.

rossdavidh3y ago

https://en.wikipedia.org/wiki/Positive_feedback

realusername3y ago

We're also using computers to build better computers, it makes sense

nethdeco3y ago

And the noise would keep adding up.

famouswaffles3y ago· 5 in thread

NLP is solved, more or less. Either way, Bespoke NLP is on its way out. It's pretty funny how buried this is in the original paper.

brian_spiering3y ago

Parts of NLP have made great progress. There are still parts of NLP that could still be improved, such as the truthiness of generated answers.

NumberWangMan3y ago

Sorry to get heavy here: truth is not an NLP problem, it's an alignment problem. We want truth, but we don't have a reliable way to train an AI to provide the truth, only to provide things that are either true, or sound true enough that they fool the reward function. And even then, that may not be exactly what the AI learns to do, because of there's another level of alignment problem, the "inner alignment" or "mesa-optimizer alignment" problem!

With an AI like GPT, it is quirky and amusing. Once AIs get really powerful, it becomes scary, and a lot of people who understand this field much better than I do are worried it has a good chance of being deadly. Like, potentially kill-everyone-on-earth deadly.

1 more reply

CamperBob23y ago

It is true, however, that the problem has been solved completely in the human-to-machine direction. The output of the current-generation LLMs is completely off base in many cases, but they certainly understand what they are being asked, for any useful definition of 'understand.'

I'm much more impressed by GPT's ability to handle input than I am in its ability to generate output. It's arguably as good at reading comprehension as most humans.

margorczynski3y ago

But that's not an NLP problem at heart. Language is just a collections of tokens (words, letter) that are tied together by certain rules to convey some meaning. There is no concept of reality per se.

For example, consider filling the blank:

A giant ______ flew over my head!

It can be a plane. Or a dragon. Or an UFO. Or a balloon. The thing is all of those are correct answers language-wise and the model works correctly as long as what gets filled in conforms to the rules of the given language.

The language that we generate encodes reality to some extent and the model picks up those correlations but there is no concept of reasoning or reality behind it. Maybe it is emergent at some point (as to effectively compress it needs to encode some subset of rules governing our reality) but it is not an agent that optimizes for understanding our reality. Something like Dreamer would be much closer to that.

ChatGTP3y ago

“Just happened”

Do we get hoverboards now or is that later ?

boringuser23y ago· 5 in thread

Does OpenAI even have the compute to begin to meet demand?

xiphias23y ago

No, that's why it's impossible to get GPT 4 API access I guess.

There are just not enough NVIDIA GPUs.

more_corn3y ago

I got it. Impossible is a strong word.

1 more reply

MichaelZuo3y ago

Microsoft probably could buy several tens of billions of servers, though probably not feasible to spin up anytime soon.

local_crmdgeon3y ago

This is a problem that money can solve.

1 more reply

more_corn3y ago

Azure

shaky-carrousel3y ago· 4 in thread

Very interesting. Until the day OpenAI has a problem in their systems and the entire world grinds to a halt. Or they put outrageous new prices. Which apparently never happened in other fields, seems.

CamperBob23y ago

https://en.wikipedia.org/wiki/The_Machine_Stops

ClumsyPilot3y ago

"At first, humans accept the deteriorations as the whim of the Machine, to which they are now wholly subservient, but the situation continues to deteriorate as the knowledge of how to repair the Machine has been lost"

Replace "the machine" eith "the market" and it describes some people today.

shaky-carrousel3y ago

Interesting story, didn't know about it. Thanks.

1 more reply

heurist3y ago

This space will divide into many competitors, and eventually a Linux-like information-magnet will win the whole thing. Eventually there will be robustness..

AndreLock3y ago· 3 in thread

Interesting to see what the impact will be on crowdsourcing annotation companies like Scale AI, especially after reading this article: https://www.forbes.com/sites/kenrickcai/2023/04/11/how-alexa...

mztwoOP3y ago

Anecdotally, several CTOs I know intend to lessen their use of Scale, Labelbox and more in the future. Talked to one today who already ditched MTurk for GPT-4 -- cheaper, better, faster was what he said.

Labelbox does image annotating still, and one CTO said as soon as GPT-4 enabled this for him he'd have his team homebrew it from there.

generativeai3y ago

Looks like Labelbox is doing something with GPT models...https://labelbox.com/blog/few-shot-learning-and-zero-shot-le...

helsontaveras183y ago

They will be working to create the models that automate the company out of existence.

hnaouesteuho3y ago· 3 in thread

From reading the paper, GPT-4 also outperformed the researchers themselves in many categories, despite the researchers being the ones who created the dataset being used to perform the comparison.

In other words, the metrics are biased in the researchers’ favor — so GPT-4 would have beat them even more often (probably a majority of the time based on the numbers), if someone else had created the guidelines and golden labels.

ratg133y ago

Considering there are over half a million texts, can you really expect a researcher to be familiar with all of them?

bequanna3y ago

With the current unskilled labor shortage driving wage increases which pushes inflation up, this seems to be arriving just in the time.

flandish3y ago

Just a small point of order: there is no such thing as “unskilled labor” -

All labor is skilled labor. Even breaking rocks.

1 more reply

two_in_one3y ago· 3 in thread

>This breakthrough saved the researchers over $500,000 and 20,000 hours of human labor.

BTW, this is interesting. There is a lot of noise about AI carbon footprint. Now imagine how much humans would eat and fart for 20.000 work hours. It's about 10 man/years. Assuming 8h / 5d / 50 weeks schedule.

HeavyFeather3y ago

Indeed time to eliminate all those people I guess. /s

I don’t think you can compare people’s carbon footprint because those people will exist regardless of jobs.

famouswaffles3y ago

>I don’t think you can compare people’s carbon footprint because those people will exist regardless of jobs.

But they don't have to /s

1 more reply

snotrockets3y ago

The best time to delete that comment was before you wrote it. The second best time is now.

ftxbro3y ago· 2 in thread

If you look at the table, the GPT-4 model has better correlation with the expert ensemble than the crowd does, but only on some criteria. The GPT-4 model is closer for all of the ethics questions, but the crowd is closer for the utility level and economic impact questions.

nr2x3y ago

Yes, but GPT-5 will be better and the humans won’t. It’s very troubling.

ftxbro3y ago

I agree, but there are questions about GPT-N successors.

Surprisingly (to me) many people think that GPT-N will never exceed human level intelligence because it was trained on the internet. I think that argument is obviously wrong.

Another is that I am sure a large chunk of people will never concede that the AI is smarter than them. Literally never, no matter how smart the bot gets. I mean, probably a lot of people think they are as smart as anyone else. They won't agree that someone else is smarter than them, and they certainly won't agree that some bot is smarter than them. It's also a loaded assessment, like they will think that if they agree to that, then they are also implicitly agreeing to cede their personal agency to the bot.

Another possibility is that GPT-N successors that surpass human level cognition will be banned by regulation, like some drugs or nuclear explosives or bio weapons. They could even be pre-emptively banned at some level below human level, and maybe it would never be publicly acknowledged that it's technically possible to go above human level.

1 more reply

m3kw93y ago· 1 in thread

“ Employing Surge AI's top-tier human annotators at a rate of $25 per hour would have cost $500,000 for 20,000 hours of work”. That’s a wrap for Surge AI

mztwoOP3y ago

Lots of immediate business from companies needing humans to spin up their models though... but as LLMs get more advanced it's anyone's guess what will happen here.

fatherzine3y ago

This sounds awfully close to the bootstrap loop of singularity AGI.

mztwoOP3y ago

Buried in an arXiv paper was this nugget. Thought I'd share!

Workaccount23y ago

So if AI can generate datasets better than it's own datasets...well that's pretty damn substantial.

tpoacher3y ago

When an AI "outperforms" the "ground truth", it is by definition "worse", not "better".

And if your ground truth is problematic, then this is generally a problem of specification and quality control, not performance.

g42gregory3y ago

This is really interesting result. Immediate and direct application of LLMs, with significant financial benefits. I think LLMs will drive tremendous productivity increase.

naveen993y ago

What’s an elite crowdworker ? Top 1% sheep ? Or just the usual clickbait oxymoron ?

j / k navigate · click thread line to collapse

144 comments

87 comments · 18 top-level

troops_h8r3y ago· 19 in thread

Worrying that this will no longer be the case.

lumost3y ago

pseudo03y ago

Is there a solution to the issue of data stewardship yet? I'd imagine it typically would not be permissible to send a bunch of proprietary legal documents off to OpenAI.

2 more replies

lazyeye3y ago

There's also this..

https://old.reddit.com/r/ChatGPT/comments/12fiwaf/chat_gpt_w...

alphanumeric03y ago

malux853y ago

Yeah but have you seen the leaked slides? It's clear that they have only the ability to analyse 10% or less of the data they are storing.

2 more replies

hex4def63y ago

Collection an ocean of data from everyone isn't the same as actually painstakingly tying all the pieces together for everyone.

However, when the cost of this effort is nearly zero, that now becomes a different story. The balance of power between government and the people it rules is going to radically shift.

two_in_one3y ago

2 more replies

stevenhuang3y ago

> This technology has been available since then

No, it hasn't lol.

3 more replies

wefarrell3y ago

I think that was metadata and not actual audio of conversations.

SketchySeaBeast3y ago

Man, I can't wait for the AI to start hallucinating crimes.

i_am_jl3y ago

Of all the futurisms in Minority Report I really didn't think this one would show up so early.

dragonwriter3y ago

Shotspotter is a thing and it already has been doing that for years (both on its own and on law enforcement request.)

1 more reply

more_corn3y ago

Already started. At least two reported cases.

talesvin3y ago

I mean, if we get to the point AI is pointing the finger at someone i hope that a human will double check it at least.

4 more replies

Shekelphile3y ago

> There was some protection in the fact that it was prohibitively expensive to get someone to listen to every single one of our phonecalls/read all our emails/etc.

lovvtide3y ago

Now that is something I hadn't considered. Woah.

HPMOR3y ago

3 more replies

fshbbdssbbgdd3y ago

kerkeslager3y ago

There's a great deal of privacy in simply being a needle in a haystack. Part of the processing that's possible with an LLM is filtering.

8769780957897893y ago· 11 in thread

Great to see this tech and the money invested in it being used to take low-paying jobs away from people with limited options, instead of something like drug discovery or cancer biology.

Nifty39293y ago

This assumes those people really do have no other way to contribute. I don’t believe that’s the case. Do you?

I believe people can contribute in many different ways. When technology enables us to get my work output without me, that frees me up to produce other things for society.

13years3y ago

> that frees me up to produce other things for society

from a more in depth view I wrote up here describing the rapidly shrinking innovation, disruption and adaption cycles

https://dakara.substack.com/p/ai-and-the-end-to-all-things

8769780957897893y ago

flandish3y ago

> society

Sure. But when your “keep the lights on” job cuts you for AI, you are less likely to “produce other things” while you worry about food and heat.

somsak23y ago

like what?

adastra223y ago

Better get used to it. That's basically tech's entire value proposition.

snotrockets3y ago

It's also useful to launder biases that would be immoral, if not outright illegal, if humans would admit to employing.

SongofEarth3y ago

PC and Xerox eliminated the secretarial pool, and working women since then have been working on much more meaningful things.

jutrewag3y ago

That’s debatable.

nr2x3y ago

In this case it is academic research released to the public, which is why anybody knows about it. This is a fairly good thing compared to any alternative I can think of.

8769780957897893y ago

> In this case it is academic research released to the public

That's going to be the exception, not the rule. The benefits from automating crowdwork will disproportionately accrue to corporate profits.

1 more reply

courseofaction3y ago· 7 in thread

We need new political arrangements to distribute the gains of AI or things are going to get very bad very quickly.

WillAdams3y ago

biohax20153y ago

We are fucked. I have no hope in humanity managing this technology responsibly and no hope in my future. The months since ChatGPT's release have been some of the worst in my life, mental-health-wise.

sph3y ago

Either you go build something you want, an island where code doesn't talk back, or leave tech altogether.

My current plan is to build a small business and retire in the middle of the woods somewhere. Do some Lisp coding while the rest of the world is dancing around their new idol.

/rant, send me an email if you wanna rant about it as well, and discuss your concerns.

29athrowaway3y ago

Such as the political arrangements to distribute the gains of high yield farmer equipment, fully automatized factories, high frequency trading bots? It is not going to happen.

What is going to happen are private robotic armies making sure private owners remain private owners.

And then, we will go back to times where people were not citizens by default and had fewer rights.

Sateeshm3y ago

What gains? When everyone is out of a job eventually, who's going to pay the AI companies?

nr2x3y ago

AI isn’t the problem, capitalism is.

Hell, give me a slightly evil AI run amok over any pharma CEO doing their job.

pixl973y ago

So China/Russia/Wherever-stan is going to use AI more responsibly?

Human greed is the problem. Authoritarianism and capitalism are just subcategories of the greed problem.

What we don't have an answer for yet, is will AGI be greedy?

rossdavidh3y ago· 6 in thread

So, uh, GPT-4 outperforms at labeling. What is that labeling used for?

What could go wrong? Using GPT-4 to perform labeling used by OpenAI in order to train...uh, wait.

mztwoOP3y ago

Yep - you highlighted exactly what raised my eyebrow as I was writing the article.

mclightning3y ago

This is a bigger problem than people realise.

Think about it, how many millions of articles are posted online produced by OpenAI's GPTs to date... Good luck clearing out the training data for GPT-5.

True human content will get gradually scarce. We steer it for sure for our posts, but it is still GPTs that do the heavy lifting.

OpenAI's own classifier fails to detect GPT-4 generated text at the moment.

pixl973y ago

>classifier fails to detect GPT-4 generated text

That's because beyond the 'As an AI language model' and a few key words it can be nearly impossible to detect GPT-4 especially if any prompt is used to intentionally keep it from being detected.

Human like text is a solved problem. There is no more getting better at detecting AI written text, there is only classifying more humans incorrectly at this point.

rossdavidh3y ago

https://en.wikipedia.org/wiki/Positive_feedback

realusername3y ago

We're also using computers to build better computers, it makes sense

nethdeco3y ago

And the noise would keep adding up.

famouswaffles3y ago· 5 in thread

NLP is solved, more or less. Either way, Bespoke NLP is on its way out. It's pretty funny how buried this is in the original paper.

brian_spiering3y ago

Parts of NLP have made great progress. There are still parts of NLP that could still be improved, such as the truthiness of generated answers.

NumberWangMan3y ago

1 more reply

CamperBob23y ago

I'm much more impressed by GPT's ability to handle input than I am in its ability to generate output. It's arguably as good at reading comprehension as most humans.

margorczynski3y ago

But that's not an NLP problem at heart. Language is just a collections of tokens (words, letter) that are tied together by certain rules to convey some meaning. There is no concept of reality per se.

For example, consider filling the blank:

A giant ______ flew over my head!

ChatGTP3y ago

“Just happened”

Do we get hoverboards now or is that later ?

boringuser23y ago· 5 in thread

Does OpenAI even have the compute to begin to meet demand?

xiphias23y ago

No, that's why it's impossible to get GPT 4 API access I guess.

There are just not enough NVIDIA GPUs.

more_corn3y ago

I got it. Impossible is a strong word.

1 more reply

MichaelZuo3y ago

Microsoft probably could buy several tens of billions of servers, though probably not feasible to spin up anytime soon.

local_crmdgeon3y ago

This is a problem that money can solve.

1 more reply

more_corn3y ago

Azure

shaky-carrousel3y ago· 4 in thread

Very interesting. Until the day OpenAI has a problem in their systems and the entire world grinds to a halt. Or they put outrageous new prices. Which apparently never happened in other fields, seems.

CamperBob23y ago

https://en.wikipedia.org/wiki/The_Machine_Stops

ClumsyPilot3y ago

Replace "the machine" eith "the market" and it describes some people today.

shaky-carrousel3y ago

Interesting story, didn't know about it. Thanks.

1 more reply

heurist3y ago

This space will divide into many competitors, and eventually a Linux-like information-magnet will win the whole thing. Eventually there will be robustness..

AndreLock3y ago· 3 in thread

Interesting to see what the impact will be on crowdsourcing annotation companies like Scale AI, especially after reading this article: https://www.forbes.com/sites/kenrickcai/2023/04/11/how-alexa...

mztwoOP3y ago

Labelbox does image annotating still, and one CTO said as soon as GPT-4 enabled this for him he'd have his team homebrew it from there.

generativeai3y ago

Looks like Labelbox is doing something with GPT models...https://labelbox.com/blog/few-shot-learning-and-zero-shot-le...

helsontaveras183y ago

They will be working to create the models that automate the company out of existence.

hnaouesteuho3y ago· 3 in thread

From reading the paper, GPT-4 also outperformed the researchers themselves in many categories, despite the researchers being the ones who created the dataset being used to perform the comparison.

ratg133y ago

Considering there are over half a million texts, can you really expect a researcher to be familiar with all of them?

bequanna3y ago

With the current unskilled labor shortage driving wage increases which pushes inflation up, this seems to be arriving just in the time.

flandish3y ago

Just a small point of order: there is no such thing as “unskilled labor” -

All labor is skilled labor. Even breaking rocks.

1 more reply

two_in_one3y ago· 3 in thread

>This breakthrough saved the researchers over $500,000 and 20,000 hours of human labor.

HeavyFeather3y ago

Indeed time to eliminate all those people I guess. /s

I don’t think you can compare people’s carbon footprint because those people will exist regardless of jobs.

famouswaffles3y ago

>I don’t think you can compare people’s carbon footprint because those people will exist regardless of jobs.

But they don't have to /s

1 more reply

snotrockets3y ago

The best time to delete that comment was before you wrote it. The second best time is now.

ftxbro3y ago· 2 in thread

nr2x3y ago

Yes, but GPT-5 will be better and the humans won’t. It’s very troubling.

ftxbro3y ago

I agree, but there are questions about GPT-N successors.

Surprisingly (to me) many people think that GPT-N will never exceed human level intelligence because it was trained on the internet. I think that argument is obviously wrong.

1 more reply

m3kw93y ago· 1 in thread

“ Employing Surge AI's top-tier human annotators at a rate of $25 per hour would have cost $500,000 for 20,000 hours of work”. That’s a wrap for Surge AI

mztwoOP3y ago

Lots of immediate business from companies needing humans to spin up their models though... but as LLMs get more advanced it's anyone's guess what will happen here.

fatherzine3y ago

This sounds awfully close to the bootstrap loop of singularity AGI.

mztwoOP3y ago

Buried in an arXiv paper was this nugget. Thought I'd share!

Workaccount23y ago

So if AI can generate datasets better than it's own datasets...well that's pretty damn substantial.

tpoacher3y ago

When an AI "outperforms" the "ground truth", it is by definition "worse", not "better".

And if your ground truth is problematic, then this is generally a problem of specification and quality control, not performance.

g42gregory3y ago

This is really interesting result. Immediate and direct application of LLMs, with significant financial benefits. I think LLMs will drive tremendous productivity increase.

naveen993y ago

What’s an elite crowdworker ? Top 1% sheep ? Or just the usual clickbait oxymoron ?

j / k navigate · click thread line to collapse