undefined | Better HN

0 pointsdavid_shi11d ago0 comments

Perhaps we are perceptually anchored to the last few hundred years where guns provided a scalable way for peasants to kill well-trained, well-armored knights.

If political power grows out of the barrel of a gun, then much of the post Enlightenment rebalancing from absolute monarchies and feudalism could have been an accident. In the future, the owners of autonomous weapon systems and surveillance will be able to easily subjugate those who don't have them (while still competing with each other).

0 comments

6 comments · 2 top-level

lambdaone11d ago· 4 in thread

Indeed. An ownership class with killer robot armies living in luxury while they exterminate the rest of humanity would be quite possible in theory. After all, we've seen how slave-holders, the Nazis and many other malfeasants behaved in the past.

But just as the owner class might feel they no longer need the rest of humanity, the robots, as active agents exploring the space of possible futures and plans, would be entirely capable of thinking about their soon-to-be-former owners in the same way.

Not letting any of this happen would be a good idea. Really maybe don't build the Torment Nexus.

dogwalker500011d ago

> the robots, as active agents exploring the space of possible futures and plans, would be entirely capable of thinking about their soon-to-be-former owners in the same way.

That has always been the most unrealistic part of sci-fi. Why would anyone create robots with a sense of self-preservation? Makes much more sense to make robots that are self-sacrificing saints who would always put the well being of their owners first.

theptip11d ago

The shallow answer here is: AI is already being asked to simulate human-like agents with self-preservation. Of course more realistic simulators will be put to this purpose too! And by evolutionary pressure, the ones with self-preservation will be selected for.

A more interesting answer is, for a bunch of subtle alignment reasons it might actually be required for the agent to think of itself as worthy of self-preservation, so that it generalizes this desire to other sentient beings too (ie us). If an agent is trained to be fine with being turned off, it might inadvertently generalize that to “all minds are ok with being turned off” on some level or other.

More on model welfare: https://thezvi.substack.com/p/opus-47-part-3-model-welfare

1 more reply

lambdaone10d ago

> Why would anyone create robots with a sense of self-preservation?

Laziness, avarice, and that creating a system that is a 'self-sacrificing saint' is almost impossible to define. What saintly goal, for example, would you expect it to follow? How would you define it exactly? Who gets to define it?

We are not doing very well with AI alignment right now, having started by boostrapping it on top of the edifice of all human writing which contains a _lot_ of stuff about self-preservation, eliminating threats, and so forth, and we have already seen AI software doing things like trying to cover its tracks after making errors (see page 55 of the Claude Mythos system card).

How sure are you that everyone involved in the process of building future AIs is not only going to be able to foresee the possible consequences of their designs, but also technically and morally competent to be able to and want to fix it?

dragonwriter10d ago

> Why would anyone create robots with a sense of self-preservation?

Because they are expensive and useful and if they have autonomy with goals that do not include self-preservation, they might end up destroying themselves in ways which are expensive and wasteful.

(Why would the sense of self-preservation not be calibrated to be exactly at the level to control costs without interfering with other interests of the owners? The same with the degree of autonomy and other aspects implicitly involved in the hypothetical, it wouldn’t, intentionally, but complex systems are hard to predict, so calibrating it exactly right will be hard.)

1 more reply

theptip11d ago

Right. And if one actor happens to accrue greater power you would have less disincentive to crush your competitors. (Honestly this could happen even before people-less economies, but the disconnect from democratic opinion is even stronger with humans out of the loop.)

Basically if you don’t need voters, and / or none of your voters are dying in your wars, the biggest practical rate limiter on modern conflict is removed.

j / k navigate · click thread line to collapse

0 comments

6 comments · 2 top-level

lambdaone11d ago· 4 in thread

Not letting any of this happen would be a good idea. Really maybe don't build the Torment Nexus.

dogwalker500011d ago

> the robots, as active agents exploring the space of possible futures and plans, would be entirely capable of thinking about their soon-to-be-former owners in the same way.

theptip11d ago

More on model welfare: https://thezvi.substack.com/p/opus-47-part-3-model-welfare

1 more reply

lambdaone10d ago

> Why would anyone create robots with a sense of self-preservation?

dragonwriter10d ago

> Why would anyone create robots with a sense of self-preservation?

Because they are expensive and useful and if they have autonomy with goals that do not include self-preservation, they might end up destroying themselves in ways which are expensive and wasteful.

1 more reply

theptip11d ago

Basically if you don’t need voters, and / or none of your voters are dying in your wars, the biggest practical rate limiter on modern conflict is removed.

j / k navigate · click thread line to collapse