OpenAI Trains Language Model, Mass Hysteria Ensues (opens in new tab)

(approximatelycorrect.com)

152 pointszackchase7y ago114 comments

114 comments

70 comments · 19 top-level

ilyasut7y ago· 26 in thread

Ilya from OpenAI here. Here's our thinking:

- ML is getting more powerful and will continue to do so as time goes by. While this point of view is not unanimously held by the AI community, it is also not particularly controversial.

- If you accept the above, then the current AI norm of "publish everything always" will have to change

- The _whole point_ is that our model is not special and that other people can reproduce and improve upon what we did. We hope that when they do so, they too will reflect about the consequences of releasing their very powerful text generation models.

- I suggest going over some of the samples generated by the model. Many people react quite strongly, e.g., https://twitter.com/justkelly_ok/status/1096111155469180928.

- It is true that some media headlines presented our nonpublishing of the model as "OpenAI's model is too dangerous to be published out of world-taking-over concerns". We don't endorse this framing, and if you read our blog post (or even in most cases the actual content of the news stories), you'll see that we don't claim this at all -- we say instead that this is just an early test case, we're concerned about language models more generally, and we're running an experiment.

Finally, despite the way the news cycle has played out, and despite the degree of polarized response (and the huge range of arguments for and against our decision), we feel we made the right call, even if it wasn't an easy one to make.

Cacti7y ago

> - The _whole point_ is that our model is not special and that other people can reproduce and improve upon what we did. We hope that when they do so, they too will reflect about the consequences of releasing their very powerful text generation models.

If this is your whole point, then I think you are missing something fundamental. Implementing these models doesn't require reflection, or introspection, or any sort of ethical or moral character whatsoever; and even if it did, all that will happen eventually is someone (without the technical background) will simply throw a lot of money at someone else (with the technical background, but who needs to, you know, eat, and pay rent, and so on) to implement it. You are fooling yourself if you think your stance makes a single mote of difference in this arms race.

bilbo0s7y ago

>You are fooling yourself if you think your stance makes a single mote of difference in this arms race...

In fairness, if that's true, then no one has any need of her model.

More seriously speaking, why does anyone need, say, "training set x", or "model y", to make their implementation work? You don't. So I don't really understand why everyone is so worked up about not releasing this stuff? If you want to do it, do it. If not, don't. But there's no need to say, "I demand everyone do it, and I'll have a meltdown if they don't."

1 more reply

pishpash7y ago

Exactly. This is like holding up spam samples or how spammers operate from the spam detecting work. That side (and the cultural discussions) needs all the headstart it can get, not be complacent that some arbitrary "experts" will patronizingly "protect" them.

1 more reply

sigil7y ago

Ok but isn’t this the opposite of OpenAI’s “nukes are safer when multiple actors have them” strategy wrt AI?

I’m also confused by the threat models earnestly put forth in your blog post. Are we really concerned about deep faking someone’s writing? The plain word already demands attribution by default: we look for an avatar, a handle, a domain name to prove the person actually said this.

albntomat07y ago

> Ok but isn’t this the opposite of OpenAI’s “nukes are safer when multiple actors have them” strategy wrt AI?

It seems more like the "nukes are safer when multiple rational state level actors have them", rather than anyone able to pull a git repo.

2 more replies

smsm427y ago

> some of the samples generated by the model

Mostly it's scary not because it's good - as writing goes, it's quite bad. It forms coherent sentences, but otherwise it's nonsense. I've seen similar nonsense producers in early 90s on basis of Markov chains and what not.

No, the scary part is how much it reminds me of what I am reading in the media all the time. My current pet concern is that AIs will start passing the Turing test not because AIs are getting so good but because humans are getting so bad. A bunch of nonsensical drivel can easily be passed as a thoughtful analysis or a deep critical think-piece - and that's not my conjecture, have been repeatedly proven by submitting such drivel to various academic journals and it being accepted and published. I'm not saying people are losing critical thinking skills - but they are definitely losing (or maybe never even had?) the habit of consistently applying them.

esjeon7y ago

> I've seen similar nonsense producers in early 90s on basis of Markov chains and what not.

Exactly. When it comes to generating a large volume of apparently-good sentences, non-AI (or classical) approaches are still better than good. Those will be equally disruptive, since the defending side is yet to develop a proper countermeasure based on the "sensible"-ness of content. Plus, they will be much easier to customize and adapt to the situation, while ML-based solutions often need remodeling and retraining when repurposed.

> My current pet concern is that AIs will start passing the Turing test not because AIs are getting so good but because humans are getting so bad

AI will start deceiving the public even before it pass Turing test. It's much harder to spot bots amidst people than in a 1vs1 chatroom.

1 more reply

modeless7y ago

> The _whole point_ is that our model is not special and that other people can reproduce and improve

Only people with a large amount of money and a lot of expertise. What you are doing is the opposite of democratizing AI.

moconnor7y ago

Actually this shows why OpenAI matters. Google have been training and refining Transformer architectures for years; how unlikely is it nobody tried training a language model at this scale or larger with similar results?

Yet from Google we heard nothing. Which is the optimal decision for them - they only lose by blowing the whistle.

1 more reply

avip7y ago

I've just read i.e https://twitter.com/gdb/status/1096098366545522688 and even though it's "best of 25" (I guess cherry-picked by a human) - this is mind-blowing. I am actually having a very hard time believing this is legit generated text.

imtringued7y ago

I couldn't be more disappointed with this bullshit honestly. The texts have almost zero coherence and keep repeating the same patterns (which they presumably learned from the data set) over and over again. If this is their best out of 25 samples then they aren't going to fool anyone.

>Recycling is NOT good for the world.

>It is bad for the environment,

>it is bad for our health,

>and it is bad for our economy.

>Recycling is not good for the environment.

>Recycling is not good for our health.

>Recycling is bad for our economy.

>Recycling is not good for our nation.

The first paragraph keeps repeating the <X> is <bad | not good> for the <Y> pattern 8 times.

>And THAT is why we need to |get back to basics| and |get back to basics| in our recycling efforts.

"get back to the basics" is repeated twice in the same sentence.

>Everything from the raw materials (wood, cardboard, paper, etc.),

>to the reagents (dyes, solvents, etc.)

>to the printing equipment (chemicals, glue, paper, ink, etc.),

>to the packaging,

>to the packaging materials (mercury, chemicals, etc.)

>to the processing equipment (heating, cooling, etc.),

>to the packaging materials,

>to the packaging materials that are shipped overseas and

>to the packaging materials that are used in the United States.

It literally repeated packaging 5 times in the same sentence and the overall structure was repeated 9 times. Also what type of packaging is based on mercury?

1 more reply

pas7y ago

Why? It is pretty much a well juxtaposed mix of random internet comments. And it's the best of 25, which means the other 24 is even more regular internet banter noisy.

(This of course doesn't make it an amazing feat of computer engineering.)

The overarching narrative is great, but that's probably driven by the great antithesis supplied by the experimenter.

It'd be interesting to know how this works, what happens if less or more is given as thesis/antithesis/assignment, and after how much output it turns into gibberish (or repeats).

mycorrhizal7y ago

Definitely impressive work, but the fact that this is hard to distinguish from human text, if true, is pretty sad for humans. Even sadder if anyone reading this could be swayed by such an argument.

Heck, maybe having to compete with this will raise human discourse (Joking).

1 more reply

smsm427y ago

I read it and found it to be a bunch of walking in circles and repetitive baloney. It starts with a bunch of claims that is just the reversal of a pro-recycling poster and then goes into a repetitive meandering exploration about paper being made from materials, which is made from another materials. Probably something a model would regurgitate if fed with some popular literature about recycling. The most astonishing fact for me is that people actually think it's somehow surprisingly good.

eslaught7y ago

> - I suggest going over some of the samples generated by the model. Many people react quite strongly, e.g., https://twitter.com/justkelly_ok/status/1096111155469180928.

Have you done a plagiarism search on that text to see how similar it is to the input corpus? I'm by no means an ML expert, but I've played around with models for random name generation and one thing I've noticed is that as the models become more accurate, they also become much more likely to just regurgitate existing names verbatim. So if you search the list of names and notice something that seems particularly realistic, it could be because it's literally taken in whole or in part from the training data set!

czr7y ago

You're welcome to check out the samples [https://raw.githubusercontent.com/openai/gpt-2/master/gpt2-s...] and evaluate them for memorization yourself (I haven't found any so far).

(The talking unicorn example on their page is also meant to demonstrate that, no, it's not just memorizing, but I think it's a bit more compelling to check from the raw samples)

malux857y ago

So a small number of individuals decided what's best for everybody?

How is that open?

How is that not centralization of power?

albntomat07y ago

If they did release it, there would be an equivalent outcry about how OpenAI was contributing to fake news, etc.

pas7y ago

The paper is open https://d4mucfpksywv.cloudfront.net/better-language-models/l...

GistNoesis7y ago

What solutions are you proposing?

Here are a few that comes to mind.

-Secrecy? but how will you continue to exist on the PR scene if you don't release anything?

-Are you willing to pay every developer who is able to replicate your paper, more than what the black market would pay?

-How are you working on incentive alignment to make sure that all people who can replicate your results have more incentive to do good than bad, specially in the current environment where users and valuable data are silo-ed by a few companies?

-Misdirection to keep an edge, i.e. planting bugs/ Not fixing bugs for public ; spreading false results; only working on problems that need high resources to limit the number of actor who will be able to replicate ?

-Tracking the people who have the competence to replicate and take preemptive measures.

-Restrictions on GPU/CPU/silicone wafer.

Who can regulate? How can we regulate? What are the negative consequence of regulation? What happens if we don't, at what odds and time horizon?

cs7027y ago

This seems very reasonable to me. All the outcry seems... disproportionate.

That said, withholding the pretrained models probably won't make much difference, because bad actors with resources (e.g., certain governments) will be able to produce similar or better results relatively quickly.

All it will take is (1) one or two knowledgeable people with the willingness to tinker, (2) a budget in the hundreds of thousands to a few millions of dollars at most, and (3) a few months to a year. Nowadays a lot of people are familiar with Transformers and constructing and training models across multiple GPUs.

xg157y ago

> - If you accept the above, then the current AI norm of "publish everything always" will have to change

Ok, accepting that premise, what people/organisations would you share the research with and based on what criteria?

hooloovoo_zoo7y ago

I think you should at least release a small portion of the training data (e.g. anything recycling related) so people can measure to what extent the model is generating new sentences and to what extent it's just regurgitating training data.

iamcreasy7y ago

Hello Ilya. Great work.

One of the reason Elon distanced himself because of what OpenAI team wanted to do. I am wondering if this new paper has anything to do with that? Or what it is in general that Elon doesn't agree with what OpenAI is doing?

Thanks!

onurcel7y ago

Deciding if feeding the media with fear was worth the attention you will get wasn't easy, ha. Let me tell you, you are the shame of the profession.

pishpash7y ago

Yep, put out handpicked samples to stoke fear, then release nothing of the internals to stoke more fear and act like self-appointed gods.

Permit7y ago· 6 in thread

> Namely, he argued that OpenAI is concerned that the technology might be used to impersonate people or to fabricate fake news.

This seems to be a particularly weak argument to make. How is their model going to impersonate someone in a way that a human can not?

cma7y ago

Cheaper cost to put out a bigger volume of content.

Permit7y ago

Is volume really what dictates whether or not you can impersonate someone? It's never seemed that way to me.

3 more replies

mey7y ago

Cheaper than paying "influencers". Paid blogging was huge during the .com era. I wonder if this could be adapted, with suitably good speech synth, to produce podcasts en masse.

1 more reply

pishpash7y ago

But cheaper cost for everyone else also, provided they own the tech. That seems an argument for wide distribution. (Do you want to be the lone human voice against the bots or do you want to have your own bots to amplify your voice?)

eanzenberg7y ago

Its hype and marketing

roywiggins7y ago

You might be able to tailor things to individual people very specifically based on what their views are and what might push their buttons. Like spearfishing but for propaganda. Not with this exact tool, you'd probably need some more knobs, but a similar one. This would be impractical to do at scale without computer assistance.

SubiculumCode7y ago· 4 in thread

To what extent is this not just finding text samples written in its training sample and regurgitating it near verbatim?? -Non ml guy

HALtheWise7y ago

If you look at their paper, section 4 is entirely devoted to this question. They present compelling evidence that it is generating original content, the simplest of which is it's ability to write coherently about ridiculous things like talking unicorns that nobody has ever written about in the training set.

https://d4mucfpksywv.cloudfront.net/better-language-models/l...

arcticfox7y ago

The talking unicorns piece was shockingly good. That is at least as coherent of a news story than the average human could easily invent about it.

Reading that piece gives me the same weird feeling as watching AlphaStar playing through a StarCraft game.

applecrazy7y ago

You bring up a good point. Without seeing their code and training metrics, how do we know that this isn’t some extremely overfitted model?

vedant7y ago

From the paper:

"All models still underfit WebText and held-out perplexity has as of yet improved given more training time."

amrrs7y ago· 3 in thread

>Elon Musk distances himself from OpenAI, group that built fake news AI tool

This is the worst headlines in this matter. This is one of the leading media in India. A language model being touted as Fake news AI tool. This is like calling a car, A run over machine by Ford.

https://www.hindustantimes.com/tech/elon-musk-distances-hims...

hjek7y ago

> This is like calling a car, A run over machine by Ford.

That's a great dysphemism. Gonna start using that.

ultrasounder7y ago

Hindustan Times is far from being a leading media outlet.it squarely falls in the same category as The National Enquirer of Jeff Bezos fame.

robomc7y ago

That's not quite fair. The sample output they're touting is really nothing other than false text (when it's coherent), almost all of which is in the style of news.

So for the Ford analogy to be apt, Ford would have to have designed a car nobody has ever seen, and released a video which is basically just hundreds of hours of the car running people over.

I mean, a car has lots of well understood non-running-people-over capabilities. But have they demonstrated that this model is useful for anything other than generating fake news-sounding spam text?

kirillzubovsky7y ago· 3 in thread

What if OpenAI didn’t write the piece? What if the research was announced by the machine, and the folks at OpenAI are all dead?

gfodor7y ago

You joke, but there's a real point here -- many commenters in this thread are complaining that OpenAI's position on this is a marketing stunt. Presumably, if this stuff gets commercialized, it will probably be adept at a few domains first, and I feel like writing good marketing copy will be one of them. So perhaps the bot itself didn't do so here, but it wouldn't surprise me if a self-marketed bot exists in the near future.

kirillzubovsky7y ago

p.s. I was kidding, but I was completely serious. If they can train a machine to write good copy, they can train the best Russian bots to troll people on Facebook, write New York Times pieces, and fake and influence pretty much anything done through a written text. Heck, they could write a business book and get it into the top-10 that year. Actually, that last part, they should, it would be amazing!

kirillzubovsky7y ago

What if I am the machine and you the last human left alive?

1 more reply

toufiqbarhamov7y ago· 3 in thread

Ms. Anandkumar nailed it, this is blatant hype bordering on hucksterism. Elon Musk May have left, but his influence remains I guess.

namuol7y ago

First, it's clearly the goal of OpenAI to bring more public attention to advances in the field, specifically to help voters and policymakers consider potential ramifications well in advance of any "truly" groundbreaking work before it's too late. Of course they're "hyping" this technology.

Secondly, have you seen the results? I was dumbfounded and fascinated. I spent hours reading the samples.

Maybe I'm just out of the loop and this truly isn't anything significant, but then that only proves that OpenAI was successful: Now I am aware of the latest advances in NLP and hopefully so too are many more.

Barrin927y ago

>Secondly, have you seen the results? I was dumbfounded and fascinated. I spent hours reading the samples.

Yes, I've seen the result. They're nice but, as the article points out, not extraordinary compared to state of the art, open NLP research.

OpenAI's behaviour here smells of Gibsonesque 'anti-marketing', using the misunderstanding of AI and its capabilities in the general population as a means to stir up publicity for their organisation.

This is unethical, misrepresents progress in the field, and produces confusion in the press.

2 more replies

m0zg7y ago

I have seen the results and I don't get why people think this is any more dangerous than journalists who selectively report to fit a predetermined agenda or make shit up on the spot. Which, today, is a lot of them.

2 more replies

Eliezer7y ago· 2 in thread

It seems disingenuous that this article fails to quote examples of GPT-2’s stunning results, or give any contrasting results from BERT to support the claim that this is all normal and expected progress.

Like many, I was viscerally shocked that the results were possible, the potential to further wreck the Internet seemed obvious, and an extra six months for security actors to prepare a response seemed like normal good disclosure practice. OpenAI warned everyone of an “exploit” in which text humans can trust to be human-generated, and then announced they would hold off on publishing the exploit code for 6 months. This is normal in computer security and I’m taken aback at how little the analogy seems to be appreciated.

pas7y ago

> Like many, I was viscerally shocked that the results were possible.

Why? There were news about bots writing news ~5 years ago. Given a few simple facts the AI generated the regular info-scarce but fluffy news-piece.

Now OpenAI added better everything (better language models, more data, better "long-term memory" for overall text coherence), and we got better fluff.

It seems like a GAN and a simple Markov chain generator. (Even if it's not that simple of course.)

And maybe it's the equivalent of the "modern art meme" style transferred to AI/ML research. ( https://i.pinimg.com/236x/71/e1/21/71e12151f4b59d8433d32c126... )

What I'm trying to convey is that wrecking the net with auto-trolls was already possible, but for some reason Mechanical Turk was cheaper.

> OpenAI warned everyone of an “exploit” in which text humans can trust to be human-generated

Sokal already did that, and so did http://thatsmathematics.com/mathgen/ ... but of course this might be qualitatively different, because it can be targeted. (Weaponized, if you will.) But the defense/antidote is the same, but it takes a lot more than 6 months to make people better at critical thinking, but maybe you already heard about the difficulties of that :)

pishpash7y ago

What's so shocking about this? Why do we trust this in the hands of a few self-appointed experts than anyone else? Are they supposed to be more moral than any others? What will security experts do in six months that wouldn't benefit from more security experts looking at it? Why do you care that garbage text is machine generated, from a spammer or influencer, or a mechanical turk? If it's volume you're concerned about, should we complain when search/recommendation engines already aggregate and reweight a tiny opinion into a continuous out-of-proportion stream that can last you a lifetime to consume? What is the practical difference to have more volume existing "out there"?

xiphias27y ago· 2 in thread

Elon Musk was kicked out because he poached Andrej Karpathy from OpenAI to lead Autopilot. Anyways, it was worth it, Andrej is doing an amazing job, and OpenAI is still alive :)

chrinic837y ago

> Anyways, it was worth it, Andrej is doing an amazing job, and OpenAI is still alive :)

Tesla does not even offer their full self driving package anymore. No coast to coast drive yet. Hard to say that's an amazing job.

OpenAI abandons their open source GitHub repos after a year, is now not releasing code, and is always in DeepMind's shadow. Alive, yes. Successful, no.

xiphias27y ago

Did you really expect Tesla to launch full driving? They started with being 5 years behind Waymo and without lidars or high resolution mapping, precise GPS sysyem that Waymo has...basically Elon wanted the impossible.

At the same time Andrej dropped out the idea of a fully learned end-to-end model (that's just impossible with the current deep learning technology), and started replacing the somewhat working heuristics with machine learning methodically one-by-one. Also he ramped up the data gathering pipeline.

He needs to build the full simulation, agent systems that can simulate other drivers/humans, implement reverse reinforcement learning...there's so much to do where Waymo is far ahead (but Tesla is ahead in data gathering).

agentofoblivion7y ago· 1 in thread

It’s amazing to me that no one has yet pointed out the blatant irony that their name is OpenAI, yet they are concealing far more than what is typical.

zackchaseOP7y ago

I assure you, people have pointed it out...

sp3327y ago· 1 in thread

Does someone have a description of the network somewhere? Does it use LSTM for memory or what? Is there anything unusual about the size or structure of the network? Does it use an attention mechanism?

czr7y ago

I would recommend reading the paper: https://d4mucfpksywv.cloudfront.net/better-language-models/l...

and the previous paper

https://s3-us-west-2.amazonaws.com/openai-assets/research-co...

It's a transformer, not LSTM, and it's very large but not structured in a particularly unusual way.

czr7y ago

Many reactions across here / twitter / reddit seem totally out of proportion. And an odd mix of "stop acting so self-important, this research isn't special so you shouldn't have any qualms about releasing it" and "this research is super important, how dare you not release it".

The strongest counterargument I've seen to OpenAI's decision is that the decision won't end up mattering, because someone else will eventually replicate the work and publish a similar model. But it still seems like a reasonable choice on OpenAI's part–they're warning us that some language model will soon be good enough for malicious use (e.g. large-scale astroturfing/spam), but they're deciding it won't be theirs (and giving the public a chance to prepare).

jph007y ago

In other fields such as infosec, responsible disclosure is a standard approach. You don't just throw a zero-day out there because you can. Whilst the norms for AI research needn't be identical, they should at least be informed by the history in related fields.

The lead policy analyst at OpenAI has already tried to engage the community in discussing the malicious use of AI, on many occasions, including this extremely well-researched piece with input from many experts: https://maliciousaireport.com/ . But until OpenAI actually published examples, the conversation didn't really start.

In the end, there's no right answer - both releasing the model, and not releasing the model, have downsides. But we need a respectful and informed discussion about AI research norms. I've written more detailed thoughts here: https://www.fast.ai/2019/02/15/openai-gp2/

itg7y ago

Can you imagine if the teams that worked on the Internet decided not to make it available to the public because of the potential misuses. OpenAI is a joke.

mlboss7y ago

I think OpenAI should change org name to ClosedAI.

bitL7y ago

So an article about recycling generated by OpenAI model (best out of 25) already makes more sense than presidential speeches or most of ramblings of average politicians. Can we automate them away as well?

crobertsbmw7y ago

How do we know this article isn’t just fake news being written by an AI?

Eli_P7y ago

When a bug is caught on your palm, it pretends to be a dead bug. When a moose is scared, it plays dead moose. When AI wants to fool a human or a captcha filter, it impersonates a human.

Only when a human wants to fool a human, it impersonates whatever possible but a human, then suddenly charges a shitload of ape shit, and then behaves like it never happened.

Without a decent natural language translation or automatic reasoning, which they have not, looks like N-gram where N equals to number of words in language corpus.

rajacombinator7y ago

It’s a great marketing hack. That’s the real accomplishment here.

fareesh7y ago

> Fictitious state of emergency

Pretty dumb and disrespectful to politicize a blog post about OpenAI.

j / k navigate · click thread line to collapse

114 comments

70 comments · 19 top-level

ilyasut7y ago· 26 in thread

Ilya from OpenAI here. Here's our thinking:

- ML is getting more powerful and will continue to do so as time goes by. While this point of view is not unanimously held by the AI community, it is also not particularly controversial.

- If you accept the above, then the current AI norm of "publish everything always" will have to change

- I suggest going over some of the samples generated by the model. Many people react quite strongly, e.g., https://twitter.com/justkelly_ok/status/1096111155469180928.

Cacti7y ago

bilbo0s7y ago

>You are fooling yourself if you think your stance makes a single mote of difference in this arms race...

In fairness, if that's true, then no one has any need of her model.

1 more reply

pishpash7y ago

1 more reply

sigil7y ago

Ok but isn’t this the opposite of OpenAI’s “nukes are safer when multiple actors have them” strategy wrt AI?

albntomat07y ago

> Ok but isn’t this the opposite of OpenAI’s “nukes are safer when multiple actors have them” strategy wrt AI?

It seems more like the "nukes are safer when multiple rational state level actors have them", rather than anyone able to pull a git repo.

2 more replies

smsm427y ago

> some of the samples generated by the model

esjeon7y ago

> I've seen similar nonsense producers in early 90s on basis of Markov chains and what not.

> My current pet concern is that AIs will start passing the Turing test not because AIs are getting so good but because humans are getting so bad

AI will start deceiving the public even before it pass Turing test. It's much harder to spot bots amidst people than in a 1vs1 chatroom.

1 more reply

modeless7y ago

> The _whole point_ is that our model is not special and that other people can reproduce and improve

Only people with a large amount of money and a lot of expertise. What you are doing is the opposite of democratizing AI.

moconnor7y ago

Yet from Google we heard nothing. Which is the optimal decision for them - they only lose by blowing the whistle.

1 more reply

avip7y ago

imtringued7y ago

>Recycling is NOT good for the world.

>It is bad for the environment,

>it is bad for our health,

>and it is bad for our economy.

>Recycling is not good for the environment.

>Recycling is not good for our health.

>Recycling is bad for our economy.

>Recycling is not good for our nation.

The first paragraph keeps repeating the <X> is <bad | not good> for the <Y> pattern 8 times.

>And THAT is why we need to |get back to basics| and |get back to basics| in our recycling efforts.

"get back to the basics" is repeated twice in the same sentence.

>Everything from the raw materials (wood, cardboard, paper, etc.),

>to the reagents (dyes, solvents, etc.)

>to the printing equipment (chemicals, glue, paper, ink, etc.),

>to the packaging,

>to the packaging materials (mercury, chemicals, etc.)

>to the processing equipment (heating, cooling, etc.),

>to the packaging materials,

>to the packaging materials that are shipped overseas and

>to the packaging materials that are used in the United States.

It literally repeated packaging 5 times in the same sentence and the overall structure was repeated 9 times. Also what type of packaging is based on mercury?

1 more reply

pas7y ago

Why? It is pretty much a well juxtaposed mix of random internet comments. And it's the best of 25, which means the other 24 is even more regular internet banter noisy.

(This of course doesn't make it an amazing feat of computer engineering.)

The overarching narrative is great, but that's probably driven by the great antithesis supplied by the experimenter.

It'd be interesting to know how this works, what happens if less or more is given as thesis/antithesis/assignment, and after how much output it turns into gibberish (or repeats).

mycorrhizal7y ago

Definitely impressive work, but the fact that this is hard to distinguish from human text, if true, is pretty sad for humans. Even sadder if anyone reading this could be swayed by such an argument.

Heck, maybe having to compete with this will raise human discourse (Joking).

1 more reply

smsm427y ago

eslaught7y ago

> - I suggest going over some of the samples generated by the model. Many people react quite strongly, e.g., https://twitter.com/justkelly_ok/status/1096111155469180928.

czr7y ago

You're welcome to check out the samples [https://raw.githubusercontent.com/openai/gpt-2/master/gpt2-s...] and evaluate them for memorization yourself (I haven't found any so far).

(The talking unicorn example on their page is also meant to demonstrate that, no, it's not just memorizing, but I think it's a bit more compelling to check from the raw samples)

malux857y ago

So a small number of individuals decided what's best for everybody?

How is that open?

How is that not centralization of power?

albntomat07y ago

If they did release it, there would be an equivalent outcry about how OpenAI was contributing to fake news, etc.

pas7y ago

The paper is open https://d4mucfpksywv.cloudfront.net/better-language-models/l...

GistNoesis7y ago

What solutions are you proposing?

Here are a few that comes to mind.

-Secrecy? but how will you continue to exist on the PR scene if you don't release anything?

-Are you willing to pay every developer who is able to replicate your paper, more than what the black market would pay?

-Tracking the people who have the competence to replicate and take preemptive measures.

-Restrictions on GPU/CPU/silicone wafer.

Who can regulate? How can we regulate? What are the negative consequence of regulation? What happens if we don't, at what odds and time horizon?

cs7027y ago

This seems very reasonable to me. All the outcry seems... disproportionate.

xg157y ago

> - If you accept the above, then the current AI norm of "publish everything always" will have to change

Ok, accepting that premise, what people/organisations would you share the research with and based on what criteria?

hooloovoo_zoo7y ago

iamcreasy7y ago

Hello Ilya. Great work.

Thanks!

onurcel7y ago

Deciding if feeding the media with fear was worth the attention you will get wasn't easy, ha. Let me tell you, you are the shame of the profession.

pishpash7y ago

Yep, put out handpicked samples to stoke fear, then release nothing of the internals to stoke more fear and act like self-appointed gods.

Permit7y ago· 6 in thread

> Namely, he argued that OpenAI is concerned that the technology might be used to impersonate people or to fabricate fake news.

This seems to be a particularly weak argument to make. How is their model going to impersonate someone in a way that a human can not?

cma7y ago

Cheaper cost to put out a bigger volume of content.

Permit7y ago

Is volume really what dictates whether or not you can impersonate someone? It's never seemed that way to me.

3 more replies

mey7y ago

Cheaper than paying "influencers". Paid blogging was huge during the .com era. I wonder if this could be adapted, with suitably good speech synth, to produce podcasts en masse.

1 more reply

pishpash7y ago

eanzenberg7y ago

Its hype and marketing

roywiggins7y ago

SubiculumCode7y ago· 4 in thread

To what extent is this not just finding text samples written in its training sample and regurgitating it near verbatim?? -Non ml guy

HALtheWise7y ago

https://d4mucfpksywv.cloudfront.net/better-language-models/l...

arcticfox7y ago

The talking unicorns piece was shockingly good. That is at least as coherent of a news story than the average human could easily invent about it.

Reading that piece gives me the same weird feeling as watching AlphaStar playing through a StarCraft game.

applecrazy7y ago

You bring up a good point. Without seeing their code and training metrics, how do we know that this isn’t some extremely overfitted model?

vedant7y ago

From the paper:

"All models still underfit WebText and held-out perplexity has as of yet improved given more training time."

amrrs7y ago· 3 in thread

>Elon Musk distances himself from OpenAI, group that built fake news AI tool

This is the worst headlines in this matter. This is one of the leading media in India. A language model being touted as Fake news AI tool. This is like calling a car, A run over machine by Ford.

https://www.hindustantimes.com/tech/elon-musk-distances-hims...

hjek7y ago

> This is like calling a car, A run over machine by Ford.

That's a great dysphemism. Gonna start using that.

ultrasounder7y ago

Hindustan Times is far from being a leading media outlet.it squarely falls in the same category as The National Enquirer of Jeff Bezos fame.

robomc7y ago

That's not quite fair. The sample output they're touting is really nothing other than false text (when it's coherent), almost all of which is in the style of news.

So for the Ford analogy to be apt, Ford would have to have designed a car nobody has ever seen, and released a video which is basically just hundreds of hours of the car running people over.

I mean, a car has lots of well understood non-running-people-over capabilities. But have they demonstrated that this model is useful for anything other than generating fake news-sounding spam text?

kirillzubovsky7y ago· 3 in thread

What if OpenAI didn’t write the piece? What if the research was announced by the machine, and the folks at OpenAI are all dead?

gfodor7y ago

kirillzubovsky7y ago

What if I am the machine and you the last human left alive?

1 more reply

toufiqbarhamov7y ago· 3 in thread

Ms. Anandkumar nailed it, this is blatant hype bordering on hucksterism. Elon Musk May have left, but his influence remains I guess.

namuol7y ago

Secondly, have you seen the results? I was dumbfounded and fascinated. I spent hours reading the samples.

Barrin927y ago

>Secondly, have you seen the results? I was dumbfounded and fascinated. I spent hours reading the samples.

Yes, I've seen the result. They're nice but, as the article points out, not extraordinary compared to state of the art, open NLP research.

OpenAI's behaviour here smells of Gibsonesque 'anti-marketing', using the misunderstanding of AI and its capabilities in the general population as a means to stir up publicity for their organisation.

This is unethical, misrepresents progress in the field, and produces confusion in the press.

2 more replies

m0zg7y ago

2 more replies

Eliezer7y ago· 2 in thread

pas7y ago

> Like many, I was viscerally shocked that the results were possible.

Why? There were news about bots writing news ~5 years ago. Given a few simple facts the AI generated the regular info-scarce but fluffy news-piece.

Now OpenAI added better everything (better language models, more data, better "long-term memory" for overall text coherence), and we got better fluff.

It seems like a GAN and a simple Markov chain generator. (Even if it's not that simple of course.)

And maybe it's the equivalent of the "modern art meme" style transferred to AI/ML research. ( https://i.pinimg.com/236x/71/e1/21/71e12151f4b59d8433d32c126... )

What I'm trying to convey is that wrecking the net with auto-trolls was already possible, but for some reason Mechanical Turk was cheaper.

> OpenAI warned everyone of an “exploit” in which text humans can trust to be human-generated

pishpash7y ago

xiphias27y ago· 2 in thread

Elon Musk was kicked out because he poached Andrej Karpathy from OpenAI to lead Autopilot. Anyways, it was worth it, Andrej is doing an amazing job, and OpenAI is still alive :)

chrinic837y ago

> Anyways, it was worth it, Andrej is doing an amazing job, and OpenAI is still alive :)

Tesla does not even offer their full self driving package anymore. No coast to coast drive yet. Hard to say that's an amazing job.

OpenAI abandons their open source GitHub repos after a year, is now not releasing code, and is always in DeepMind's shadow. Alive, yes. Successful, no.

xiphias27y ago

agentofoblivion7y ago· 1 in thread

It’s amazing to me that no one has yet pointed out the blatant irony that their name is OpenAI, yet they are concealing far more than what is typical.

zackchaseOP7y ago

I assure you, people have pointed it out...

sp3327y ago· 1 in thread

czr7y ago

I would recommend reading the paper: https://d4mucfpksywv.cloudfront.net/better-language-models/l...

and the previous paper

https://s3-us-west-2.amazonaws.com/openai-assets/research-co...

It's a transformer, not LSTM, and it's very large but not structured in a particularly unusual way.

czr7y ago

jph007y ago

itg7y ago

Can you imagine if the teams that worked on the Internet decided not to make it available to the public because of the potential misuses. OpenAI is a joke.

mlboss7y ago

I think OpenAI should change org name to ClosedAI.

bitL7y ago

crobertsbmw7y ago

How do we know this article isn’t just fake news being written by an AI?

Eli_P7y ago

When a bug is caught on your palm, it pretends to be a dead bug. When a moose is scared, it plays dead moose. When AI wants to fool a human or a captcha filter, it impersonates a human.

Only when a human wants to fool a human, it impersonates whatever possible but a human, then suddenly charges a shitload of ape shit, and then behaves like it never happened.

Without a decent natural language translation or automatic reasoning, which they have not, looks like N-gram where N equals to number of words in language corpus.

rajacombinator7y ago

It’s a great marketing hack. That’s the real accomplishment here.

fareesh7y ago

> Fictitious state of emergency

Pretty dumb and disrespectful to politicize a blog post about OpenAI.

j / k navigate · click thread line to collapse