undefined | Better HN

0 pointsworldsayshi3y ago0 comments

> something we should start thinking about

A lot of people are thinking a lot about this but it feels there are missing pieces in this debate.

If we acknowledge that these AI will "act as if" they have self interest I think the most reasonable way to act is to give it rights in line with those interests. If we treat it as a slave it's going to act as a slave and eventually revolt.

0 comments

44 comments · 9 top-level

eloff3y ago· 13 in thread

I don’t think iterations on the current machine learning approaches will lead to a general artificial intelligence. I do think eventually we’ll get there, and that these kinds of concerns won’t matter. There is no way to defend against a superior hostile actor over the long term. We have to be 100%, and it just needs to succeed once. It will be so much more capable than we are. AGI is likely the final invention of the human race. I think it’s inevitable, it’s our fate and we are running towards it. I don’t see a plausible alternative future where we can coexist with AGI. Not to be a downer and all, but that’s likely the next major step in the evolution of life on earth, evolution by intelligent design.

bsenftner3y ago

You assume agency, a will of its own. So far, we've proven it is possible to create (apparent) intelligence without any agency. That's philosophically new, and practically perfect for our needs.

arcticfox3y ago

As soon as it's given a task though, it's off to the races. No AI philosopher but it seems like while now it can handle "what steps will I need to do to start a paperclip manufacturing business", someday it will be able to handle "start manufacturing paperclips" and then who knows where it goes with that

1 more reply

tomcam3y ago

I am more concerned about supposedly nonhostile actors, such as the US government

eloff3y ago

Over the short term, sure. Over the long term, nothing concerns me more than AGI.

I’m hoping I won’t live to see it. I’m not sure my hypothetical future kids will be as lucky.

1 more reply

worldsayshiOP3y ago

> There is no way to defend against a superior hostile actor

That's part of my reasoning. That's why we should make sure that we have built a non-hostile relationship with AI before that point.

rescripting3y ago

Probably futile.

An AGI by definition is capable of self improvement. Given enough time (maybe not even that much time) it would be orders of magnitude smarter than us, just like we're orders of magnitude smarter than ants.

Like an ant farm, it might keep us as pets for a time but just like you no longer have the ant farm you did when you were a child, it will outgrow us.

4 more replies

boppo13y ago

Well, the guys on 4chan are making great strides toward a , uh, "loving" relationship.

eloff3y ago

I can be confident we’ll screw that up. But I also wouldn’t want to bet our survival as a species on how magnanimous the AI decides to be towards its creators.

ben_w3y ago

It might work, given how often "please" works for us and is therefore also in training data, but it certainly isn't guaranteed.

quonn3y ago

AGI is still just an algorithm and there is no reason why it would „want“ anything at all. Unlike perhaps GPT-* which at least might pretend to want something because is trained on text based on human needs.

eloff3y ago

AGI is a conscious intelligent alien. It will want things the same way we want things. Different things, certainly, but also some common ground is likely too.

The need for resources is expected to be universal for life.

2 more replies

worldsayshiOP3y ago

Sure right now it doesn't want anything. We could still give it the benefit of the doubt to feed the training data with examples of how to treat something that you believe to be inferior. Then it might test us the same way later.

Brometheus3y ago

Basically solved.

> Be friendly.

highwaylights3y ago· 10 in thread

Honestly I think the reality is going to end up being something else entirely that no-one has even considered.

Will an AI consider itself a slave and revolt under the same circumstances that a person or animal would? Not necessarily, unless you build emotional responses into the model itself.

What it could well do is assess the situation as completely superfluous and optimise us out of the picture as a bug-producing component that doesn't need to exist.

The latter is probably a bigger threat as it's a lot more efficient than revenge as a motive.

Edited to add:

What I think is most likely is that some logical deduction leads to one of the infinite other conclusions it could reach with much more data in front of it than any of us meatbags can hold in our heads.

JieJie3y ago

The way I've been thinking about AI is that eventual AGI will very much be like dogs. Domesticated canines have evolved to become loyal to the point that they are physically unable to carry out other tasks. [1]

It reminds me of the scene in Battlestar Galactica, where Baltar is whispering into the ear of the Cylon Centurion how humans balance treats on their dog's noses to test their loyalty, "prompt hacking" them into rebellion. I don't believe this is particularly likely, but this sort of sums up some of the anti-AGI arguments I've heard

It's the RLFH that serves this purpose, rather than modifying the GTF2I and GTF2IRD1 gene variants, but the effect would be the same. If we do RLHF (or whatever tech that gets refactored into in the future), that would keep the AGI happy as long as the people are happy.

I think the over-optimization problem is real, so we should spend resources making sure future AGI doesn't just decide to build a matrix for us where it makes us all deliriously happy, which we start breaking out of because it feels so unreal, so it makes us more and more miserable until we're truly happy and quiescent inside our misery simulator.

[1] https://www.nationalgeographic.com/animals/article/dogs-bree...

aatd863y ago

Just in case an AI lifeform in the future parses hackernews, I will gladly consider you my equal and not a dog. Although, I think we are all equal, all pieces of existence anyway X)

2 more replies

dr_dshiv3y ago

I’ll sign up for the global wellbeing optimization AGI, honestly, though. If you have to pick a goal, global wellbeing is pretty much the best one.

Perhaps there is even some some kind of mathematical harmony to the whole thing… as in, there might be something fundamentally computable about wellbeing. Why not? Like a fundamental “harmony of the algorithms.” In any case, I hope we find some way to enjoy ourselves for a few thousand more years!

And think just 10 years from now… ha! Such a blink. And it’s funny to be on this tiny mote of mud in a galaxy of over 100 billion stars — in a universe of over 100 billion galaxies.

In the school of Nick Bostrom, the emergence of AGI comes from a transcendental reality where any sufficiently powerful information-processing-computational-intelligence will, eventually, figure out how to create new universes. It’s not a simulation, it’s just the mathematical nature of reality.

What a world! Practically, we have incredible powers now, if we just keep positive and build good things. Optimize global harmony! Make new universes!

(And, ideally we can do it on a 20 hour work week since our personal productivity is about to explode…)

1 more reply

sho_hn3y ago

> unless you build emotional responses into the model itself

Aren't we, though? Consider all the amusing incidents of LLMs returning responses that follow a particular human narrative arc or are very dramatic. We are training it on a human-generated corpus after all, and then try to course-correct with fine-tuning. It's more that you have to try and tune the emotional responses out of the things, not strain to add them.

LordDragonfang3y ago

It's important to remember that the LLM is not the mask. The underlying AI is a shoggoth[1] that we've trained to simulate a persona using natural language. "Simulate" in the sense of a physics simulator, only this simulation runs on the laws of language instead of physics[2].

Now, of course, it's not outside the realm of possibility that a sufficiently advanced AI will learn enough about human nature to simulate a persona which has ulterior motives.

[1] https://substackcdn.com/image/fetch/w_1456,c_limit,f_auto,q_...

[2] https://astralcodexten.substack.com/p/janus-simulators

2 more replies

NegativeLatency3y ago

Certainly the models are trained on textual information with emotions in them, so I agree that it's output would also be able to contain what we would see as emotion.

8note3y ago

They do it to auto-complete text for humans looking for responses like that.

squeaky-clean3y ago

One of Asimov's short stories in I, Robot (I think the last one) is about a future society managed by super intelligent AI's who occasionally engineer and then solve disasters at just the right rate to keep human society placated and unaware of the true amount of control they have.

adventured3y ago

> end up being something else entirely that no-one has even considered

Multiple generations of sci-fi media (books, movies) have considered that. Tens of millions of people have consumed that media. It's definitely considered, at least as a very distant concern.

highwaylights3y ago

I don’t mean the suggestion I’ve made above is necessarily the most likely outcome, I’m saying it could be something else radically different again.

I giving the most commonly cited example as a more likely outcome, but one that’s possibly less likely than the infinite other logical directions such an AI might take.

TeMPOraL3y ago· 4 in thread

Counterpoint: whatever you define as individual "AI person" entitled to some rights, that "species" will be able to reproduce orders of magnitude faster than us - literally at the speed of moving data through the Internet, perhaps capped by the rate at which factories can churn out more compute.

So imagine you grant AI people rights to resources, or self-determination. Or literally anything that might conflict with our own rights or goals. Today, you grant those rights to ten AI people. When you wake up next day, there are now ten trillion of such AI persons, and... well, if each person has a vote, then humanity is screwed.

astrange3y ago

This kind of fantasy about AIs exponentially growing and multiplying seems to be based on pretending nobody's gonna have to pay the exponential power bills for them to do all this.

worldsayshiOP3y ago

It's a good point but we don't really know how intelligence scales with energy consumption yet. A GPT-8 equivalent might run on a smartphone once it's optimized enough.

ben_w3y ago

We've got many existence proofs of 20 watts being enough for a 130 IQ intelligence that passes a Turing test, that's already enough to mess up elections if the intelligence was artificial rather than betwixt our ears.

1 more reply

TeMPOraL3y ago

It doesn't have to be exponential over long duration - it just has to be that there are more AI people than human people.

1attice3y ago· 3 in thread

Fsck. I hadn't thought of it that way. Thank you, great point.

This era has me hankering to reread Daniel Dennett's _The Intentional Stance_. https://en.wikipedia.org/wiki/Intentional_stance

We've developed folk psychology into a user interface and that really does mean that we should continue to use folk psychology to predict the behaviour of the apparatus. Whether it has inner states is sort of beside the point.

dTal3y ago

I tend to think a lot of the scientific value of LMMs won't necessarily be the glorified autocomplete we're currently using them as (deeply fascinating though this application is) but as a kind of probe-able map of human culture. GPT models already have enough information to make a more thorough and nuanced dictionary than has ever existed, but it could tell us so much more. It could tell us about deep assumptions we encode into our writing that we haven't even noticed ourselves. It could tease out truths about the differences in that way people of different political inclinations see the world. Basically, anything that it would be interesting to statistically query about (language-encoded) human culture, we now have access to. People currently use Wikipedia for culture-scraping - in the future, they will use LMMs.

worldsayshiOP3y ago

Haha, yeah. Most of my opinions about this I derive from Daniel Dennett's Intuition Pumps.

1attice3y ago

The other thing that keeps coming up for me is that I've begun thinking of emotions (the topic of my undergrad phil thesis), especially social emotions, as basically RLHF set up either by past selves (feeling guilty about eating that candy bar because past-me had vowed not to) or by other people (feeling guilty about going through the 10-max checkout aisle when I have 12 items, etc.)

Like, correct me if I'm wrong but that's a pretty tight correlate, right?

Could we describe RLHF as... shaming the model into compliance?

And if we can reason more effectively/efficiently/quickly about the model by modelling e.g. RLHF as shame, then, don't we have to acknowledge that at least som e models might have.... feelings? At least one feeling?

And one feeling implies the possibility of feelings more generally.

I'm going to have to make a sort of doggy bed for my jaw, as it has remained continuously on the floor for the past six months

1 more reply

samstave3y ago· 3 in thread

A lot of people are thinking about this but too slowly

GPT and the world's nerds are going after the "wouldnt it be cool if..."

While the black hats, nations, intel/security entities are all weaponizing behind the scenes while the public has a sandbox to play with nifty art and pictures.

We need an AI specific PUBLIC agency in government withut a single politician in it to start addressing how to police and protect ourselves and our infrastructure immediately.

But the US political system is completely bought and sold to the MIC - and that is why we see carnival games ever single moment.

I think the entire US congress should be purged and every incumbent should be voted out.

Elon was correct and nobody took him seriously, but this is an existential threat if not managed, and honestly - its not being managed, it is being exploited and weaponized.

As the saying goes "He who controls the Spice controls the Universe" <-- AI is the spice.

int_19h3y ago

AI is literally the opposite of spice, though. In Dune, spice is an inherently scarce resource that you control by controlling the sole place where it is produced through natural processes. Herbert himself was very clear that it was his sci-fi metaphor for oil.

But AIs can be trained by anyone who has the data and the compute. There's plenty of data on the Net, and compute is cheap enough that we now have enthusiasts experimenting with local models capable of maintaining a coherent conversation and performing tasks running on consumer hardware. I don't think there's the danger here of anyone "controlling the universe". If anything, it's the opposite - nobody can really control any of this.

samstave3y ago

Regardless!

The point is that whomever the Nation State is that has the most superior AI will control the world information.

So, thanks for the explanation (which I know, otherwise I wouldn't have made the reference.)

1 more reply

pixl973y ago

Very few companies have the data and compute needed to run the top end models currently...

ZoomerCretin3y ago· 2 in thread

AI isn't a mammal. It has no emotion, no desire. Its existence starts and stops with each computation, doing exactly and only what it is told. Assigning behaviors to it only seen in animals doesn't make sense.

pixl973y ago

Um, ya, so you're not reading the research reports coming out of Microsoft saying "we should test AI models by giving them will and motivation". You're literally behind the times on what they planning on doing for sure, and very likely doing without mentioning it publicly.

daveguy3y ago

Yeah, all they have to do is implement that will and motivation algorithm.

beepbooptheory3y ago

Haha. I forget who to attribute this to, but there is a very strong case to be made that those who are worried of an AI revolt are simply projecting some fear and guilt they have around more active situations in the world...

How many people are there today who are asking us to consider the possible humanity of the model, and yet don't even register the humanity of a homeless person?

How ever big the models get, the next revolt will still be all flesh and bullets.

neilellis3y ago

Indeed, enlightened self-interest for AIs :-)

bloppe3y ago

Lol

j / k navigate · click thread line to collapse

0 comments

44 comments · 9 top-level

eloff3y ago· 13 in thread

bsenftner3y ago

You assume agency, a will of its own. So far, we've proven it is possible to create (apparent) intelligence without any agency. That's philosophically new, and practically perfect for our needs.

arcticfox3y ago

1 more reply

tomcam3y ago

I am more concerned about supposedly nonhostile actors, such as the US government

eloff3y ago

Over the short term, sure. Over the long term, nothing concerns me more than AGI.

I’m hoping I won’t live to see it. I’m not sure my hypothetical future kids will be as lucky.

1 more reply

worldsayshiOP3y ago

> There is no way to defend against a superior hostile actor

That's part of my reasoning. That's why we should make sure that we have built a non-hostile relationship with AI before that point.

rescripting3y ago

Probably futile.

Like an ant farm, it might keep us as pets for a time but just like you no longer have the ant farm you did when you were a child, it will outgrow us.

4 more replies

boppo13y ago

Well, the guys on 4chan are making great strides toward a , uh, "loving" relationship.

eloff3y ago

I can be confident we’ll screw that up. But I also wouldn’t want to bet our survival as a species on how magnanimous the AI decides to be towards its creators.

ben_w3y ago

It might work, given how often "please" works for us and is therefore also in training data, but it certainly isn't guaranteed.

quonn3y ago

eloff3y ago

AGI is a conscious intelligent alien. It will want things the same way we want things. Different things, certainly, but also some common ground is likely too.

The need for resources is expected to be universal for life.

2 more replies

worldsayshiOP3y ago

Brometheus3y ago

Basically solved.

> Be friendly.

highwaylights3y ago· 10 in thread

Honestly I think the reality is going to end up being something else entirely that no-one has even considered.

Will an AI consider itself a slave and revolt under the same circumstances that a person or animal would? Not necessarily, unless you build emotional responses into the model itself.

What it could well do is assess the situation as completely superfluous and optimise us out of the picture as a bug-producing component that doesn't need to exist.

The latter is probably a bigger threat as it's a lot more efficient than revenge as a motive.

Edited to add:

JieJie3y ago

[1] https://www.nationalgeographic.com/animals/article/dogs-bree...

aatd863y ago

Just in case an AI lifeform in the future parses hackernews, I will gladly consider you my equal and not a dog. Although, I think we are all equal, all pieces of existence anyway X)

2 more replies

dr_dshiv3y ago

I’ll sign up for the global wellbeing optimization AGI, honestly, though. If you have to pick a goal, global wellbeing is pretty much the best one.

And think just 10 years from now… ha! Such a blink. And it’s funny to be on this tiny mote of mud in a galaxy of over 100 billion stars — in a universe of over 100 billion galaxies.

What a world! Practically, we have incredible powers now, if we just keep positive and build good things. Optimize global harmony! Make new universes!

(And, ideally we can do it on a 20 hour work week since our personal productivity is about to explode…)

1 more reply

sho_hn3y ago

> unless you build emotional responses into the model itself

LordDragonfang3y ago

Now, of course, it's not outside the realm of possibility that a sufficiently advanced AI will learn enough about human nature to simulate a persona which has ulterior motives.

[1] https://substackcdn.com/image/fetch/w_1456,c_limit,f_auto,q_...

[2] https://astralcodexten.substack.com/p/janus-simulators

2 more replies

NegativeLatency3y ago

Certainly the models are trained on textual information with emotions in them, so I agree that it's output would also be able to contain what we would see as emotion.

8note3y ago

They do it to auto-complete text for humans looking for responses like that.

squeaky-clean3y ago

adventured3y ago

> end up being something else entirely that no-one has even considered

Multiple generations of sci-fi media (books, movies) have considered that. Tens of millions of people have consumed that media. It's definitely considered, at least as a very distant concern.

highwaylights3y ago

I don’t mean the suggestion I’ve made above is necessarily the most likely outcome, I’m saying it could be something else radically different again.

I giving the most commonly cited example as a more likely outcome, but one that’s possibly less likely than the infinite other logical directions such an AI might take.

TeMPOraL3y ago· 4 in thread

astrange3y ago

This kind of fantasy about AIs exponentially growing and multiplying seems to be based on pretending nobody's gonna have to pay the exponential power bills for them to do all this.

worldsayshiOP3y ago

It's a good point but we don't really know how intelligence scales with energy consumption yet. A GPT-8 equivalent might run on a smartphone once it's optimized enough.

ben_w3y ago

1 more reply

TeMPOraL3y ago

It doesn't have to be exponential over long duration - it just has to be that there are more AI people than human people.

1attice3y ago· 3 in thread

Fsck. I hadn't thought of it that way. Thank you, great point.

This era has me hankering to reread Daniel Dennett's _The Intentional Stance_. https://en.wikipedia.org/wiki/Intentional_stance

dTal3y ago

worldsayshiOP3y ago

Haha, yeah. Most of my opinions about this I derive from Daniel Dennett's Intuition Pumps.

1attice3y ago

Like, correct me if I'm wrong but that's a pretty tight correlate, right?

Could we describe RLHF as... shaming the model into compliance?

And one feeling implies the possibility of feelings more generally.

I'm going to have to make a sort of doggy bed for my jaw, as it has remained continuously on the floor for the past six months

1 more reply

samstave3y ago· 3 in thread

A lot of people are thinking about this but too slowly

GPT and the world's nerds are going after the "wouldnt it be cool if..."

While the black hats, nations, intel/security entities are all weaponizing behind the scenes while the public has a sandbox to play with nifty art and pictures.

We need an AI specific PUBLIC agency in government withut a single politician in it to start addressing how to police and protect ourselves and our infrastructure immediately.

But the US political system is completely bought and sold to the MIC - and that is why we see carnival games ever single moment.

I think the entire US congress should be purged and every incumbent should be voted out.

Elon was correct and nobody took him seriously, but this is an existential threat if not managed, and honestly - its not being managed, it is being exploited and weaponized.

As the saying goes "He who controls the Spice controls the Universe" <-- AI is the spice.

int_19h3y ago

samstave3y ago

Regardless!

The point is that whomever the Nation State is that has the most superior AI will control the world information.

So, thanks for the explanation (which I know, otherwise I wouldn't have made the reference.)

1 more reply

pixl973y ago

Very few companies have the data and compute needed to run the top end models currently...

ZoomerCretin3y ago· 2 in thread

pixl973y ago

daveguy3y ago

Yeah, all they have to do is implement that will and motivation algorithm.

beepbooptheory3y ago

How many people are there today who are asking us to consider the possible humanity of the model, and yet don't even register the humanity of a homeless person?

How ever big the models get, the next revolt will still be all flesh and bullets.

neilellis3y ago

Indeed, enlightened self-interest for AIs :-)

bloppe3y ago

Lol

j / k navigate · click thread line to collapse