To Understand Language Is to Understand Generalization (opens in new tab)

(evjang.com)

91 pointsericjang4y ago38 comments

38 comments

19 comments · 8 top-level

enderm4y ago· 4 in thread

I like the design of your website!

What do you mean when you say words are disentangled, standalone concepts? I see words as being very much related to each other.

I assume I may be misinterpreting what you mean by "disentangled, standalone concepts”.

Barbara Tversky's research seems to contradict linguistic relativism. I definitely don’t think language is the foundation of cognition.

ericjangOP4y ago

Thanks!

Words are considered a "discrete unit of meaning", i.e. 3/4 of a word doesn't really mean much. So words like "red" and "grass" are "standalone" in the sense that the mean something by themselves. I agree that words are very much related to each other, in the sense that you can combine them.

I was trying to draw a connection that the "disentangled representations" ML folks often talk about are but a special few-word case of grammars for combining distinct concept.

solarmist4y ago

Unfortunately, words aren't that simple, but it's close. Prefixes, suffixes, in-fixes, endings, etc., all have discrete meaning as well. And going into Asian language, this is much more obvious.

The discrete unit of meaning level is generally somewhere between a syllable and a word, with a few exceptions for shorter modifiers.

Unfortunately, in linguistics, the concept of a "word" is only as well defined as "planet" was pre-pluto losing its status.

Similarly when you look at riddles and crossword puzzle clues the idea of words being discrete also falls apart. Words, very much like variables in algebra only have meaning in relation to the other pieces of the context they are attached to.

While the mechanics (all the pieces of language, syntax and semantics are not discretizable. Just talk to anyone working on a dictionary.) you talk about don't seem to hold, I do think the idea you're talking about does hold.

3 more replies

Mezzie4y ago

Actually, the discrete unit of meaning, linguistically, is the morpheme. It's a small difference, but it matters. Some words are morphemes, but not all, and not all morphemes are words.

Language, man. It's weird.

enderm4y ago

I can see how this could work in English. I’m not sure if there are other languages in which 3/4 of a word carries more meaning. (I’m a primary English speaker, so this concern could be unfounded.)

2 more replies

motohagiography4y ago· 3 in thread

An AI could be said to understand language if it used language as one of a selection of tools to operate on itself, a peer or other being, or its environment. The idea of "meaning = co-ocurrance" overlooks things like need, cause, and effect that appear when language is used as a tool to operate on its environment.

Most of what I read about ML and AI is about creating these monolithic models that treat networks and clusters of neurons as a single entity, but that would be like treating a species of individuals with lifecycles as a single entity. The comment in the article about how GPT models are like a shadow compared to a 3D world suggests the bottleneck to evolving them is really us, as we're trying to make just one that emmulates many of us, instead of letting one loose on the internet to divide and proliferate to evolve millions where the best few will be exponentially better. Right now we're building expert systems that are individual specimens without an ecosystem.

There isn't yet a botnet of GPT nodes compromising machines and harvesting compute for training and evolving through participating in forums, but then again how would I know? (There's nothing worse than failing a modern catchpa and having a flash of existential dread at the stark possibility I may have indeed been a robot all along. Now I do them at random just to be sure.)

visarga4y ago

> instead of letting one loose on the internet to divide and proliferate to evolve millions where the best few will be exponentially better

I've said this before, what current AI agents lack is a dick (& pussy). If they had a dick they could have an internal goal to motivate their evolution, a goal not dependent on us anymore. The battlefield of self replication vs death is the great school of evolution, where humanity is currently the top student. AI only sent the likes of AlphaGo to the school.

verisimi4y ago

You can program them to have a dick (or pussy). Program an AI to get the highest score in a game etc, and it will be more incessant than a teenager - it won't stop ever. What AI agents lack is animation - they are inanimate, they are software; they are machines. And will always be so.

We can (and do) labour under the impression of our senses that all there is in reality is physical matter. This is an erroneous assumption imo, but if that is your bedrock, you will struggle to understand why the damn machines can't do what we want. You can be unhappy about this but it won't change reality.

It is true metaphysics is hard to discern perhaps (by definition) but it won't change the reality that metaphysics is a genuine element of the human existence. In fact, its the most important part of human existence - we don't feel to be automatons after all, even if we make a pretence of it sometimes.

The best we will do with machines is to create a simulation of the human experience, one that might pass the Turing test even. And even then, despite all indications and evidence, the machine will not be animated by spirit.

1 more reply

justtologin4y ago

They do have a loss function, that is analogous to human desire.

I think we are fundamentally missing something, that there is an irreconcilable difference between a mathematical expression and actual conscious desire, and until we figure out what they is, we won't crack AGI

gsjbjt4y ago· 3 in thread

Nice post! I work on NLP and I think a lot of ideas in this post resonate with what I find exciting about working on the intersection of language + the real world: large text datasets as sources of abundant prior knowledge about the world, structure of language ~ structure of concepts that matter to humans, etc.

I feel like the bottleneck is getting access to paired (language, other modality) data though (if your other modality isn't images). i.e. "bolt on generalization" is an intuitively appealing concept, but then it reduces to the hard problem of "how do I learn to ground language to e.g. my robot action space?" I haven't seen a robotics + language paper that actually grapples with the grounding problem / tries to think about how to scale the data collection process for language-conditioned robotics beyond annotating your own dataset as a proof-of-concept. Unlike language modeling / CLIP-type pretraining, it seems (fundamentally?) more difficult to find natural sources of supervision of (language, action). I'd be curious about your thoughts on this!

> When it comes to combining natural language with robots, the obvious take is to use it as an input-output modality for human-robot interaction. The robot would understand human language inputs and potentially converse with the human. But if you accept that “generalization is language”, then language models have a far bigger role to play than just being the “UX layer for robots”.

You should check out Jacob Andreas's work, if you haven't seen it already - esp. his stuff on learning from latent language (https://arxiv.org/abs/1711.00482).

ericjangOP4y ago

My hope is that sufficiently rich language models obviate the need for a lot of robot-language grounding data.

LfP (https://learning-from-play.github.io/) was a work that inspired me a lot. They relabel a few hours of open-ended demonstrations (humans instructed to play with anything in the environment) with a lot of hindsight language descriptions, and show some degree of general capability acquired through this richer language. You can describe the same action with a lot of different descriptions, e.g. "pick up the leftmost object unless it is a cup" could also be relabeled as "pick up an apple".

That being said, the LfP paper stops short of testing whether we can improve robotics solely by only scaling language - a confounding factor and central to their narrative was the role of "open-ended play data". We do need some paired data to ground (language, robot-specific sensor/actuator modalities), but perhaps we can scale everything else with language only data.

Thanks to the pointer on the Andreas paper! This is indeed quite relevant to the spirit of what I'm arguing for, though I prefer the implementation realized by the Lu et al '21 paper.

visarga4y ago

> We do need some paired data

A couple of under-explored rich sources of training data on actions are videos and code. Videos, showing how people interact with objects in the world to achieve goals, might also come with captions and metadata, while code comes with comments, messages and variable names that relate to real world concepts, including millions of tables and business logic.

Maybe in the future we will add rich brain scans as an alternative to text. That kind of annotation would be so easy to collect in large quantities, provided we can wear neural sensors. If it's impractical to scan the brain, we can wear sensors and video cameras and use eye tracking and body tracking to train the system.

I am optimistic that language modelling can become the core engine of AI agents, but we need a system that has both a generator and a critic, going back and forth for a few rounds, doing multi-step problem solving. Another must is to allow search engine queries in order to make more efficient and correct models, not all knowledge must be burned into the weights.

solarmist4y ago

> My hope is that sufficiently rich language models obviate the need for a lot of robot-language grounding data.

I feel like this is “missing the trees for the forest.” In my experience, generality only emerges after a critical mass of detailed low-level examples is collected and arranged into a pattern. Humans can’t actually reason about purely abstract ideas very well. Experts always have specifics in mind they are working from.

So I'm not convinced leaving it to the model gets you anything new.

1 more reply

Gimpei4y ago· 1 in thread

What are your thoughts on the externalism of Putnam and Kripke, i.e. that meanings aren't just defined by use, but that they are also determined by objects themselves? It feels like that puts a crimp in meaning = co-occurance, but maybe not?

solarmist4y ago

I agree.

Or put another way a set (or sets) of concrete examples grounds every abstract idea (including words as abstract objects). And it's turtles all the way down (or up depending).

solarmist4y ago

This is a neat idea, but I think it's missing a large and important area for generalization, and that's the process of seeking and exploring exceptions or counter-examples (see my other comments for examples).

Language defines things through subtraction, inversion, comparison, and contrast as much as construction and straightforward language.

Engineering and computer science rely too heavily on induction, but deduction and other non-linear processes are largely missing from these kinds of analyses/approaches. And until they are accounted for I don't think we'll reach any kind of true approach to generalization.

synquid4y ago

This seems very similar to the research program led by the late Patrick Henry Winton: https://groups.csail.mit.edu/genesis/index.html

Besides, I wish that causality had been mentioned more than once in passing. Due to the existence of the ladder of causality, many important queries cannot be answered by mere observation, or even by intervention; such queries require counterfactual reasoning, and structural causal models generalize because they describe something that is very invariant in the world.

iamgopal4y ago

Does reverse is true ? To understand generalisation is to understand language ?

ncmncm4y ago

Not to understand generalization, therefore, is not to understand language.

Q E D.

j / k navigate · click thread line to collapse

38 comments

19 comments · 8 top-level

enderm4y ago· 4 in thread

I like the design of your website!

What do you mean when you say words are disentangled, standalone concepts? I see words as being very much related to each other.

I assume I may be misinterpreting what you mean by "disentangled, standalone concepts”.

Barbara Tversky's research seems to contradict linguistic relativism. I definitely don’t think language is the foundation of cognition.

ericjangOP4y ago

Thanks!

I was trying to draw a connection that the "disentangled representations" ML folks often talk about are but a special few-word case of grammars for combining distinct concept.

solarmist4y ago

Unfortunately, words aren't that simple, but it's close. Prefixes, suffixes, in-fixes, endings, etc., all have discrete meaning as well. And going into Asian language, this is much more obvious.

The discrete unit of meaning level is generally somewhere between a syllable and a word, with a few exceptions for shorter modifiers.

Unfortunately, in linguistics, the concept of a "word" is only as well defined as "planet" was pre-pluto losing its status.

3 more replies

Mezzie4y ago

Actually, the discrete unit of meaning, linguistically, is the morpheme. It's a small difference, but it matters. Some words are morphemes, but not all, and not all morphemes are words.

Language, man. It's weird.

enderm4y ago

2 more replies

motohagiography4y ago· 3 in thread

visarga4y ago

> instead of letting one loose on the internet to divide and proliferate to evolve millions where the best few will be exponentially better

verisimi4y ago

1 more reply

justtologin4y ago

They do have a loss function, that is analogous to human desire.

gsjbjt4y ago· 3 in thread

You should check out Jacob Andreas's work, if you haven't seen it already - esp. his stuff on learning from latent language (https://arxiv.org/abs/1711.00482).

ericjangOP4y ago

My hope is that sufficiently rich language models obviate the need for a lot of robot-language grounding data.

Thanks to the pointer on the Andreas paper! This is indeed quite relevant to the spirit of what I'm arguing for, though I prefer the implementation realized by the Lu et al '21 paper.

visarga4y ago

> We do need some paired data

solarmist4y ago

> My hope is that sufficiently rich language models obviate the need for a lot of robot-language grounding data.

So I'm not convinced leaving it to the model gets you anything new.

1 more reply

Gimpei4y ago· 1 in thread

solarmist4y ago

I agree.

Or put another way a set (or sets) of concrete examples grounds every abstract idea (including words as abstract objects). And it's turtles all the way down (or up depending).

solarmist4y ago

Language defines things through subtraction, inversion, comparison, and contrast as much as construction and straightforward language.

synquid4y ago

This seems very similar to the research program led by the late Patrick Henry Winton: https://groups.csail.mit.edu/genesis/index.html

iamgopal4y ago

Does reverse is true ? To understand generalisation is to understand language ?

ncmncm4y ago

Not to understand generalization, therefore, is not to understand language.

Q E D.

j / k navigate · click thread line to collapse