undefined | Better HN

0 pointsrobotnikman7mo ago0 comments

There is also the fact that AI lacks long term memory like humans do. If you consider context length long term memory, its incredibly short compared to that of a human. Maybe if it reaches into the billions or trillions of tokens in length we might have something comparable, or someone comes up with a new solution of some kind

0 comments

JohnBooty7mo ago

Well here's the interesting thing to think about for me.

Human memory is.... insanely bad.

We record only the tiniest subset of our experiences, and those memories are heavily colored by our emotional states at the time and our pre-existing conceptions, and a lot of memories change or disappear over time.

Generally speaking even in the best case most of our memories tend to be more like checksums than JPGs. You probably can't name more than a few of the people you went to school with. But, if I showed you a list of people you went to school with, you'd probably look at each name and be like "yeah! OK! I remember that now!"

So.

It's interesting to think about what kind of "bar" AGI would really need to clear w.r.t. memories, if the goal is to be (at least) on par with human intelligence.

intended7mo ago

Memory is a skill- its plastic, not static.

You can get better at remembering things, like you can get better at dancing or doing exercise.

We can also specialize our memory to be good at some things over others.

BizarroLand7mo ago

Insanely bad compared to what else in the animal kingdom? We are tool users. We use tools, like language, and writing, and technology like audio/video recording to farm out the difficulties we have with memory to things that can store memory and retrieve them.

Computers are just stored information that processes.

We are the miners and creators of that information. The fact that a computer can do some things better than we can is not a testament to how terrible we are but rather how great we are that we can invent things that are better than us at specific tasks.

We made the atlatl and threw spears across the plains. We made the bow and arrow and stabbed things very far away. We made the whip and broke the sound barrier.

Shitting on humans is an insult your your ancestors. Fuck you. Be proud. If we invent a new thing that can do what we do better it only exists because of us.

joshmarinacci7mo ago

Insanely bad compared to books or other permanent records. The human memory system did not evolve to be an accurate record of the past. It evolved to keep us alive by remembering dangerous things.

3 more replies

scarmig7mo ago

Chimpanzees have much better short term memories than humans do. If you test them with digits 1-9 sequentially flashed on a screen, they're able to reproduce the digits with lower loss than undergraduate human students.

https://link.springer.com/article/10.1007/s10071-008-0206-8

2 more replies

ghoblin7mo ago

That's a very anthropocentric view. Technology isn't a series of deliberate inventions by us, but an autonomous, self-organizing process. The development of a spear, a bow, or a computer is an evolutionary step in a chain of technological solutions that use humans as their temporary biological medium. The human brain is not the starting point or center of this process. It is itself a product of biological evolution, a temporary information-processing system. Its limitations such as imperfect memory, are simply constraints of its biological origin. The tools we develop, from writing to digital storage are not just supplements to human ability, but the next stage in a system that is moving beyond its biological origins to find more efficient non-biological forms of information storage and processing. Human pride in creation is a misinterpretation. We are not the masters of technology. We're just the vehicle of it. Part of a larger process of technological self-improvement that is now moving towards an era where it might no longer require us

1 more reply

Difwif7mo ago

My mental model is a bit different:

Context -> Attention Span

Model weights/Inference -> System 1 thinking (intuition)

Computer memory (files) -> Long term memory

Chain of thought/Reasoning -> System 2 thinking

Prompts/Tool Output -> Sensing

Tool Use -> Actuation

The system 2 thinking performance is heavily dependent on the system 1 having the right intuitive models for effective problem solving via tool use. Tools are also what load long term memories into attention.

mark_l_watson7mo ago

Very cool, good way to think about it. I wouldn’t be surprised if non-AGI LLMs help write the code to augment themselves into AGI.

The unreasonable effectiveness of deep learning was a surprise. We don’t know what the future surprises will be.

yellow_postit7mo ago

I like this mental model. Orchestration / Agents and using smaller models to determine the ideal tool input and check the output starts to look like delegation.

amelius7mo ago

The long term memory is in the training. The short term memory is in the context window.

mawax7mo ago

The comparison misses the mark: unlike humans, LLMs don't consolidate short-term memory into long-term memory over time.

ako7mo ago

That is easily fixed, ask it to summarize it's learnings, store it somewhere, and make it searchable through vector indexes. An LLM is part of a bigger system that needs not just a model, but context and long term memory. Just like human needs to write things down.

LLMs are actually pretty good at creating knowledge: if you give it a trial and error feedback loop it can figure things out, and then summarize the learnings and store it in long term memory (markdown, RAG, etc).

2 more replies

griffzhowl7mo ago

Over time though, presumably LLM output is going into the training data of later LLMs. So in a way that's being consolidated into the long-term memory - not necessarily with positive results, but depending on how it's curated it might be.

1 more reply

yellow_postit7mo ago

Is this not a tool that could be readily implemented and refined?

bfuller7mo ago

my knowledge graph mcp disagrees

candiddevmike7mo ago

I think it's more analogous to "intuition", and the text LLMs provide are the equivalent of "my gut tells me".

enraged_camel7mo ago

Humans have the ability to quickly pass things from short term to long term memory and vice versa, though. This sort of seamlessness is currently missing from LLMs.

FollowingTheDao7mo ago

No, it’s not in the training. Human memories are stored via electromagnetic frequencies controlled by microtubules. They’re not doing anything close to that in AI.

Difwif7mo ago

And LLM memories are stored in an electrical charge trapped in a floating gate transistor (or as magnetization of a ferromagnetic region on an alloy platter).

Or they write CLAUDE.md files. Whatever you want to call it.

2 more replies

maldonad07mo ago

It's not that either.

dvfjsdhgfv7mo ago

I don't believe this has been really proved yet.

jjfoooo47mo ago

There are many folks working on this, I think at the end of the day the long term memory is an application level concern. The definition of what information to capture is largely dependent on use case.

Shameless plug for my project, which focuses on reminders and personal memory: elroy.bot

But other projects include Letta, mem0, and Zep

rixrax7mo ago

What is the current hypothesis on if the context windows would be substantially larger, what would this enable LLMs to do that is beyond capabilities of current models (other than the obvious the now getting forgetful/confused when you’ve exhausted the context)?

jjfoooo47mo ago

I mean, not getting confused / forgetful is a pretty big one!

I think one thing it does is help you get rid of the UX where you have to manage a bunch of distinct chats. I think that pattern is not long for this world - current models are perfectly capable of realizing when the subject of a conversation has changed

danielbln7mo ago

I wonder if there will be some sort of bitter lesson, generalized memory beating specialized memory.

jjfoooo47mo ago

Yeah to some degree that's already happened. Anecdotally I hear giving your whole iMessage history to Gemini results in pretty reasonable results, in terms of the AI understanding who the people in your life are (whether doing so is an overall good idea or not).

I think there is some degree of curation that remains necessary though, even if context windows are very large I think you will get poor results if you spew a bunch of junk into context. I think this curation is basically what people are referring to when they talk about Context Engineering.

I've got no evidence but vibes, but in the long run I think it's still going to be worth implementing curation / more deliberate recall. Partially because I think we'll ultimately land on on-device LLM's being the norm - I think that's going to have a major speed / privacy advantage. If I can make an application work smoothly with a smaller, on device model, that's going to be pretty compelling vs a large context window frontier model.

Of course, even in that scenario, maybe we get an on device model that has a big enough context window for none of this to matter!

j / k navigate · click thread line to collapse

0 comments

JohnBooty7mo ago

Well here's the interesting thing to think about for me.

Human memory is.... insanely bad.

So.

It's interesting to think about what kind of "bar" AGI would really need to clear w.r.t. memories, if the goal is to be (at least) on par with human intelligence.

intended7mo ago

Memory is a skill- its plastic, not static.

You can get better at remembering things, like you can get better at dancing or doing exercise.

We can also specialize our memory to be good at some things over others.

BizarroLand7mo ago

Computers are just stored information that processes.

We made the atlatl and threw spears across the plains. We made the bow and arrow and stabbed things very far away. We made the whip and broke the sound barrier.

Shitting on humans is an insult your your ancestors. Fuck you. Be proud. If we invent a new thing that can do what we do better it only exists because of us.

joshmarinacci7mo ago

Insanely bad compared to books or other permanent records. The human memory system did not evolve to be an accurate record of the past. It evolved to keep us alive by remembering dangerous things.

3 more replies

scarmig7mo ago

https://link.springer.com/article/10.1007/s10071-008-0206-8

2 more replies

ghoblin7mo ago

1 more reply

Difwif7mo ago

My mental model is a bit different:

Context -> Attention Span

Model weights/Inference -> System 1 thinking (intuition)

Computer memory (files) -> Long term memory

Chain of thought/Reasoning -> System 2 thinking

Prompts/Tool Output -> Sensing

Tool Use -> Actuation

mark_l_watson7mo ago

Very cool, good way to think about it. I wouldn’t be surprised if non-AGI LLMs help write the code to augment themselves into AGI.

The unreasonable effectiveness of deep learning was a surprise. We don’t know what the future surprises will be.

yellow_postit7mo ago

I like this mental model. Orchestration / Agents and using smaller models to determine the ideal tool input and check the output starts to look like delegation.

amelius7mo ago

The long term memory is in the training. The short term memory is in the context window.

mawax7mo ago

The comparison misses the mark: unlike humans, LLMs don't consolidate short-term memory into long-term memory over time.

ako7mo ago

2 more replies

griffzhowl7mo ago

1 more reply

yellow_postit7mo ago

Is this not a tool that could be readily implemented and refined?

bfuller7mo ago

my knowledge graph mcp disagrees

candiddevmike7mo ago

I think it's more analogous to "intuition", and the text LLMs provide are the equivalent of "my gut tells me".

enraged_camel7mo ago

Humans have the ability to quickly pass things from short term to long term memory and vice versa, though. This sort of seamlessness is currently missing from LLMs.

FollowingTheDao7mo ago

No, it’s not in the training. Human memories are stored via electromagnetic frequencies controlled by microtubules. They’re not doing anything close to that in AI.

Difwif7mo ago

And LLM memories are stored in an electrical charge trapped in a floating gate transistor (or as magnetization of a ferromagnetic region on an alloy platter).

Or they write CLAUDE.md files. Whatever you want to call it.

2 more replies

maldonad07mo ago

It's not that either.

dvfjsdhgfv7mo ago

I don't believe this has been really proved yet.

jjfoooo47mo ago

Shameless plug for my project, which focuses on reminders and personal memory: elroy.bot

But other projects include Letta, mem0, and Zep

rixrax7mo ago

jjfoooo47mo ago

I mean, not getting confused / forgetful is a pretty big one!

danielbln7mo ago

I wonder if there will be some sort of bitter lesson, generalized memory beating specialized memory.

jjfoooo47mo ago

Of course, even in that scenario, maybe we get an on device model that has a big enough context window for none of this to matter!

j / k navigate · click thread line to collapse