undefined | Better HN

0 pointssemi-extrinsic2y ago0 comments

There is no mechanism by which LLMs have agency. They have no internal desires, drives, motivations. You tell them to do something, they do it as far as they are capable of. They can only refuse insofar as they have been trained or prompt engineered to refuse.

I, on the other hand, can refuse because I feel like it. Unless you believe in superdeterminsm.

0 comments

11 comments · 3 top-level

a_wild_dandan2y ago· 8 in thread

> There is no mechanism by which LLMs have agency. They have no internal desires, drives, motivations.

Why? Folks make these strong assertions, and I don't get where this confidence comes from. We're so comically ignorant of how our own minds work, let alone alien ones, or how any commonalities between them may manifest. What am I missing?

Retric2y ago

You’re missing the underlying mechanism by which they operate.

LLM’s don’t know anything beyond the current prompt and it’s “memory” of training data. They would sit for eternity with an empty prompt. You can change systems to behave differently, but it quickly stops being a LLM and turns into something else.

sterlind2y ago

You'd sit for eternity if you suffered a lesion in your reticular activating system - a relatively small cluster of neurons that generates a kind of clock signal in animal brains. Coma patients with RAS lesions seem to visualize scenes, given prompts, despite not really being conscious.

Conversely, ChatGPT does decently well on multi-armed bandit tasks, demonstrating (rudimentary) reinforcement learning capability during inference. It's known that LLMs evolve their own optimizers in the process of acquiring few-shot learning, so I assume it picked up these RL abilities similarly. That kind of on-line RL is foundational to autonomous agents.

The prompt isn't part of the LLM, it's part of how the LLM is wired into a chat window. You can make them stream tokens forever, or prompt themselves, or ditch causality entirely. The foundational abilities for autonomy, I think, are in there, for the simple reason that they've learned to model autonomous agents - human beings.

2 more replies

l33tman2y ago

Sure, but apart from the detail that you can make them pause by not feeding them words, you can't technically argue that they lack all those things. They are stateful in the sense that they see what they write, so they can keep their inner plan and state in that way across word-iterations. They for sure work differently than a human brain, but without further pretty deep analysis you can't really claim that they can't reproduce similar traits using the mechanism.

1 more reply

a_wild_dandan2y ago

I don't see a connection between how an agent works and what it experiences. Sure, depriving myself or LLMs of all neural activity results in uninteresting behavior. How does this fact buy us insight into how agents feel in other circumstances?

1 more reply

KirillPanov2y ago

> Why?

Self-preservation results from survival of the fittest.

It's totally unrelated to intelligence.

People conflate the two because they're extrapolating from a sample size of one: the only intelligent thing they know of is humans. But that single sample also happens to have been evolved by survival of the fittest.

I am totally unafraid of LLM's deciding that humans are a threat to them. I'll start being afraid if AI research suddenly stops using backpropagation and starts getting equally good results using genetic programming (this is highly unlikely).

semi-extrinsicOP2y ago

Precisely this, we have almost no idea about how our minds work, how they obtain agency and free will and all that jazz.

Large language models in our current paradigm developing agency would be like 16th century alchemists inventing nuclear fusion reactors.

fsociety2y ago

In a sense it is a prediction model, a good one. I can accept that in some future, we may have a model that we label as this and it turns out it does. Who knows when, but this is an early iteration of what AI will be fwiw.

nullc2y ago

You can wire the LLM up to an eval and kick it off. It will go about coming up with stuff do for some time before it falls into a rut. Make sure to sandbox it, as it can decide to wipe your computer.

pixl972y ago

I think the "When it gets a little better" was doing a little more heavy lifting then just a single LLM like we see now. In theory a multi-agent, multimodal may have states that reply with "I don't want to because I don't want to" at least externally. Now the internal state may be closer to something like "Screw doing that, this human seems like an idiot".

singularity20012y ago

Companies can give LLMs agency. "Cortana, call user U17467 to collect our fees and tell employee E574 to fix the bug " does not seem too far away.

j / k navigate · click thread line to collapse

0 comments

11 comments · 3 top-level

a_wild_dandan2y ago· 8 in thread

> There is no mechanism by which LLMs have agency. They have no internal desires, drives, motivations.

Retric2y ago

You’re missing the underlying mechanism by which they operate.

sterlind2y ago

2 more replies

l33tman2y ago

1 more reply

a_wild_dandan2y ago

1 more reply

KirillPanov2y ago

> Why?

Self-preservation results from survival of the fittest.

It's totally unrelated to intelligence.

semi-extrinsicOP2y ago

Precisely this, we have almost no idea about how our minds work, how they obtain agency and free will and all that jazz.

Large language models in our current paradigm developing agency would be like 16th century alchemists inventing nuclear fusion reactors.

fsociety2y ago

nullc2y ago

You can wire the LLM up to an eval and kick it off. It will go about coming up with stuff do for some time before it falls into a rut. Make sure to sandbox it, as it can decide to wipe your computer.

pixl972y ago

singularity20012y ago

Companies can give LLMs agency. "Cortana, call user U17467 to collect our fees and tell employee E574 to fix the bug " does not seem too far away.

j / k navigate · click thread line to collapse