undefined | Better HN

0 pointsheavyset_go2y ago0 comments

Another factor to consider is that neural nets can function as lossy compression, which becomes extremely evident when using models that are overfit.

Sometimes they're so overfit that the compression isn't even lossy, and the data is encoded verbatim in the NN.

0 comments

20 comments · 2 top-level

TeMPOraL2y ago· 18 in thread

Yes, but this then hits against learning/understanding and compression being fundamentally the same thing. I can't think of a better way to argue in favor of "it's fine if human does it, therefore it's fine if LLM does it", than from the "lossy compression" angle.

heavyset_goOP2y ago

It's not okay for a human to pirate, plagiarize, violate IP rights and laws, etc.

But I disagree with the underlying assumption that you can anthropomorphize LLMs. Gradient descent and backpropagation don't take place in the brain. LLMs "learn" in the same way that Excel sheets "learn".

Humans are living beings with needs and rights. A person being able to legally squat in a home doesn't mean that a drone occupying property for some amount of time also has squatter's rights, even though you could easily and affordably automate and scale the deployment of drones to live and hide away on properties long enough to attain rights regarding properties all over the country.

ben_w2y ago

> But I disagree with the underlying assumption that you can anthropomorphize LLMs. Gradient descent and backpropagation don't take place in the brain. LLMs "learn" in the same way that Excel sheets "learn".

Backprop doesn't happen in us, but I think our neurones still do gradient descent – synapses that fire together, wire together.

And ultimately, at the deepest level we can analyse, our brains' atoms are doing quantum field diffusion equations, which you can also do in an Excel spreadsheet, so that kind of reductionism doesn't help either.

> Humans are living beings with needs and rights. A person being able to legally squat in a home doesn't mean that a drone occupying property for some amount of time also has squatter's rights, even though you could easily and affordably automate and scale the deployment of drones to live and hide away on properties long enough to attain rights regarding properties all over the country.

Yes, but we can also do tissue cultures and crude bioprinting, so it's a very foreseeable future where exactly the same argument will also be true for living organisms rather than digital minds.

We need to figure out what the deeper rules are that lead to the status quo, not merely mimic the superficial result. The latter is how cargo cults function.

2 more replies

vanviegen2y ago

> Gradient descent and backpropagation don't take place in the brain.

Not exactly, no, but the 'neurons that fire together wire together' way of learning has a pretty similar effect.

> LLMs "learn" in the same way that Excel sheets "learn".

I've never seen an excel sheet do anything like backpropagation.

2 more replies

pas2y ago

sure, but if I use an LLM to write a novel/article, I can be sued in civil court not the LLM.

but, more importantly, OpenAI can also be sued for tortious interference? (basically the civil equivalent of accessory)

2 more replies

TeMPOraL2y ago

> anthropomorphize LLMs (...) gradient descent (...) backpropagation (...) needs and rights

You misunderstood me. I was talking about something more fundamental.

Understanding is data compression. They are the same thing. Learning patterns, building mental models, creating abstractions, generalizing, gaining intuition/a feel for something - all the things humans engage in as part of learning and understanding the world - are all acts of lossy data compression.

mensetmanusman2y ago

"It's not okay for a human to pirate, plagiarize, violate IP rights and laws, etc."

most of the world disagrees with this view, and that means they will create the AI that wins.

sgt1012y ago

also if I write and article and quote some "text like this" [1] then that's not plagerism, but if my arguement is that the underlying assumption that you can anthropomorphize LLMs. Gradient descent and backpropagation don't take place in the brain. LLMs "learn" in the same way that Excel sheets "learn". Well, that's plagiarism and it's not allowed and people will get peeved and my career might get damaged.

I await the HN ban with fear..

[1] I'm not even doing referencing - so I am surely an LLM.

anileated2y ago

I can’t think of a better way to argue in favor of “LLMs are copyright laundering machines” than from the humanness angle.

Humans have rights, software tools don’t.

If you grant an LLM the full set of human rights, then it can consume information, regurgitate copyrighted works, and use it to generate money for itself. However, considering blatantly obvious theft as “homage” goes hand in hand with free will, agency, being in control of yourself, not being enslaved and abused, etc. Pondering various scenarios along those lines really gets to the heart of why an LLM is so very much not a human, and how subjecting it to the same treatment as humans is a ridiculous notion.

If you don’t grant LLM human rights, then ClosedAI’s stance is basically that pirating works is OK because they pass them through a black box of if conditions and it leads to results that they can monetize. That’s such a solid argument, it’ll surely play well in the court of law.

Training data is not an “LLM does it”; first because “it” here is not “learning” or understanding in human sense (otherwise you would have to presume that an LLM is a human), and second because a software tool doesn’t have agency and it’s really just Microsoft using a tool based on copyrighted works to generate profit.

kelseyfrog2y ago

Humans don't exactly have the greatest track record of granting other humans rights. I don't presume they'll get it any better with AI.

What I expect to happen is whoever has the most influence and power will get what they want and we'll end up raising a generation with the implicit understanding of "that's just how things are," natural order, truth, reality, and all that jazz.

The only thing that ever changes outcomes is if the contradiction status quo is incapable of being managed.

1 more reply

devsda2y ago

Humans are defined not just by their abilities but by their limitations too. We celebrate our achievements because sometimes they surpass the limitations of an average human.

Our collective human limitations(physical, mental and temporal) are sort of invisible implicit rules that we all follow in one way or the other. If an entity is not bound by those rules then I don't see why that entity should be treated the same as a human.

Companies already make this differentiation.

For example take captcha and bot detection. Some of the heuristics are based on inherent human limitations like response time, click time, mouse acceleration etc.

I doubt youtube or any other streaming service will be happy if you want to stream all their videos to train a hypothetical human like AI(which views and prepares notes like a human) at a hugely accelerated speed compared to a regular human. You can guess how quickly they will cite fair usage policies.

What I want to say is there are fundamental differences between a human and an AI. So, we should not be quick to dismiss any concerns just because AI can "mimic" humans in certain areas.

RandomLensman2y ago

We can have different rules for humans than for machines. In fact, that happens all the time.

wokwokwok2y ago

Is there some LLM meta where understanding and compression are argued to be the same thing I’m not aware of?

Anyone got more details on this?

Superficially it sounds like total BS; a highly compressed zip file does not exhibit any characteristics of learning.

Algorithmically derived highly compressed video streams do not exhibit characteristics of learning.

I’ve vaguely heard the learning can be considered to exhibit the characteristics of compression in that understanding of content (eg. segmentation of video content resulting in more highly compressed videos) can lead to better compression schemes.

…but saying you can “do a with b” and “a and b are fundamentally the same thing” seems like a leap…?

It seems self evident you can have compression without comprehension.

adroniser2y ago

Suppose you wanted to train an LLM to do addition.

An LLM has limited parameters. If an LLM had infinite parameters it could just memorize the results of every single addition question in existence and could not claim to have understood anything. Because it has finite parameters, if an LLM wants to get a lower loss on all addition questions, it needs to come up with a general algorithm to perform addition. Indeed, Neel Nanda trained a transformer to do addition mod 113 on relatively few examples, and it eventually learned some cursed Fourier transform mumbo jumbo to get 0 loss https://twitter.com/robertskmiles/status/1663534255249453056.

And the fact it has developed this "understanding" as an ability to learn a general pattern in the training data enables it to compress. I claim that the number of bits required to encode the general algorithm is fewer than the number of bits required to memorize every single example. If it weren't then the transformer would simply memorize every single example. But if it doesn't have space then it is forced to try to compress by developing a general model.

And the ability to compress enables you to construct a language model. Essentially, the more things compress, the higher the likelihood you assign them. Given a sequence of tokens say "the cat sat on the", we should expect "the cat sat on the mat" to compress into fewer bits than "the cat sat on the door". This is because the latter is far more common and intuitively more common sequences should compress more. You can then look at the number of bits used for every single choice of token following "the cat sat on the" and thus develop a probability distribution for the next token. The exact details of this I'm unclear on. https://www.hendrik-erz.de/post/why-gzip-just-beat-a-large-l... this gives a good summary.

1 more reply

amoss2y ago

The idea precedes LLMs by a couple of decades and is thought to apply more broadly within ML/AI than being a specific meta for LLMs. http://prize.hutter1.net/ has been around for a while, there is a link in there to the earlier work (called AIXI?).

vidarh2y ago

Even something as simple as LZW starts developing a dictionary. Not all compression is sufficient for understanding, but the more you compress a stream of data, the more dependent you are on understanding the source, because understanding the source allows you to take more shortcuts and still be able to reconstruct the data.

dns_snek2y ago

> fundamentally the same thing

I fundamentally disagree. That's not some established fact, just a narrative used by those who wish to plagiarize using "AI".

cyborgx72y ago

It's fine for a human to remember it. It's not fine for a human redistribute it for money (legally speaking). That's copyright infringement.

Robotbeat2y ago

Correct, just like it’s infringement to reproduce an article from memory using pen and paper intentionally. The person deciding to do that bears responsibility. OpenAI would be liable IFF they were intentionally facilitating that, instead of it being an undesired artifact from overfitting.

2 more replies

accrual2y ago

> Sometimes they're so overfit that the compression isn't even lossy, and the data is encoded verbatim in the NN.

Here's an article from November 2023 that discusses this:

https://not-just-memorization.github.io/extracting-training-...

j / k navigate · click thread line to collapse

0 comments

20 comments · 2 top-level

TeMPOraL2y ago· 18 in thread

heavyset_goOP2y ago

It's not okay for a human to pirate, plagiarize, violate IP rights and laws, etc.

ben_w2y ago

Backprop doesn't happen in us, but I think our neurones still do gradient descent – synapses that fire together, wire together.

Yes, but we can also do tissue cultures and crude bioprinting, so it's a very foreseeable future where exactly the same argument will also be true for living organisms rather than digital minds.

We need to figure out what the deeper rules are that lead to the status quo, not merely mimic the superficial result. The latter is how cargo cults function.

2 more replies

vanviegen2y ago

> Gradient descent and backpropagation don't take place in the brain.

Not exactly, no, but the 'neurons that fire together wire together' way of learning has a pretty similar effect.

> LLMs "learn" in the same way that Excel sheets "learn".

I've never seen an excel sheet do anything like backpropagation.

2 more replies

pas2y ago

sure, but if I use an LLM to write a novel/article, I can be sued in civil court not the LLM.

but, more importantly, OpenAI can also be sued for tortious interference? (basically the civil equivalent of accessory)

2 more replies

TeMPOraL2y ago

> anthropomorphize LLMs (...) gradient descent (...) backpropagation (...) needs and rights

You misunderstood me. I was talking about something more fundamental.

mensetmanusman2y ago

"It's not okay for a human to pirate, plagiarize, violate IP rights and laws, etc."

most of the world disagrees with this view, and that means they will create the AI that wins.

sgt1012y ago

I await the HN ban with fear..

[1] I'm not even doing referencing - so I am surely an LLM.

anileated2y ago

I can’t think of a better way to argue in favor of “LLMs are copyright laundering machines” than from the humanness angle.

Humans have rights, software tools don’t.

kelseyfrog2y ago

Humans don't exactly have the greatest track record of granting other humans rights. I don't presume they'll get it any better with AI.

The only thing that ever changes outcomes is if the contradiction status quo is incapable of being managed.

1 more reply

devsda2y ago

Humans are defined not just by their abilities but by their limitations too. We celebrate our achievements because sometimes they surpass the limitations of an average human.

Companies already make this differentiation.

For example take captcha and bot detection. Some of the heuristics are based on inherent human limitations like response time, click time, mouse acceleration etc.

What I want to say is there are fundamental differences between a human and an AI. So, we should not be quick to dismiss any concerns just because AI can "mimic" humans in certain areas.

RandomLensman2y ago

We can have different rules for humans than for machines. In fact, that happens all the time.

wokwokwok2y ago

Is there some LLM meta where understanding and compression are argued to be the same thing I’m not aware of?

Anyone got more details on this?

Superficially it sounds like total BS; a highly compressed zip file does not exhibit any characteristics of learning.

Algorithmically derived highly compressed video streams do not exhibit characteristics of learning.

…but saying you can “do a with b” and “a and b are fundamentally the same thing” seems like a leap…?

It seems self evident you can have compression without comprehension.

adroniser2y ago

Suppose you wanted to train an LLM to do addition.

1 more reply

amoss2y ago

vidarh2y ago

dns_snek2y ago

> fundamentally the same thing

I fundamentally disagree. That's not some established fact, just a narrative used by those who wish to plagiarize using "AI".

cyborgx72y ago

It's fine for a human to remember it. It's not fine for a human redistribute it for money (legally speaking). That's copyright infringement.

Robotbeat2y ago

2 more replies

accrual2y ago

> Sometimes they're so overfit that the compression isn't even lossy, and the data is encoded verbatim in the NN.

Here's an article from November 2023 that discusses this:

https://not-just-memorization.github.io/extracting-training-...

j / k navigate · click thread line to collapse