Hopfield Networks Is All You Need

(ml-jku.github.io)

184 points · meiji163 · 5y ago · 40 comments

31 comments · 12 top-level

einpoklum · 5y ago · 7 in thread

Brief abstract for the lay person (like me):

1. Hopfield Networks are also known as "associative memory networks", a neural network model developed decades ago by a guy named Hopfield.

2. It's useful to plug these in somehow as layers in Deep Neural Networks today (particularly, in PyTorch).

I hate non-informative titles!

SneakyTornado29 · 5y ago

The title is a reference to the famous machine learning paper "Attention Is All You Need", which introduced the concept of transformers. Transformers have revolutionized how we process sequential data (e.g. natural language processing).

bjourne · 5y ago

And recently, a paper titled Attention Is Not All You Need has made the rounds arguing that some of the claims made in the AIAYN paper may have been overstated. https://arxiv.org/abs/2103.03404

einpoklum · 5y ago

> famous machine learning paper "Attention Is All You Need"

1. It's a paper from 2017. Unless you follow academic ML research, you will not have heard of it.

2. That paper's title is also inscrutable unless you've gone and read at least the abstract.

bonoboTP · 5y ago

Which itself is a reference to the 1967 Beatles song "All You Need Is Love" (which also includes the line "Love is all you need").

isoprophlex · 5y ago

Also... While cute, I found the examples of storing and retrieving images of The Simpsons characters not very informative about what goes on in that weight matrix that stores patterns.

Edit: the linked PyTorch implementation looks interesting; these layer types promise pretty incredible things: https://github.com/ml-jku/hopfield-layers

virgil_disgr4ce · 5y ago

Not to mention grammatically incorrect ones :/

skrebbel · 5y ago

I think it's correct, in the same way that you can say "Rolling Stones is a great band". It's about the tech called "Hopfield Networks", not about any particular number of networks that are all you need.

tediousdemise · 5y ago · 5 in thread

Off-topic, but does anyone know what Jekyll theme this is? Absolutely beautiful formatting and color scheme.

ansk · 5y ago

Further off-topic, but do people actually consider this to be beautiful design? Looks like a rendered markdown document with MathJax and green headers. Perfectly appropriate for the content of the post, but beautiful isn't the first word that comes to mind for me.

mhh__ · 5y ago

I don't think it's awful but I don't like it.

I really wish I could literally just dump LaTeX onto the web and be done with it. Everything I've tried either doesn't work properly or isn't 1:1 (Pandoc is cute), or does work but yields enormous amounts of HTML (pdf2htmlEX).

I am fairly happy with [insert MD->Book tool of your choice], but sometimes I want citations and things like that.

tediousdemise · 5y ago

Beauty is in the eye of the beholder, isn’t it? I like the font, as well as the greens, blues, and header gradient. Green is my favorite color.

I also like dark themes (although I wouldn’t force those on my viewership).

dmix · 5y ago

No, I very much dislike it.

phab · 5y ago

https://pages-themes.github.io/cayman/

aparsons · 5y ago · 4 in thread

I’ve seen a lot of efforts to add a notion of associative memory into neural networks. Have any exciting applications of such architectures been publicised?

truth_ · 5y ago

Just some days ago researchers from Peking U and Microsoft published a paper[0] saying they can access "knowledge neurons" in pretrained embeddings that will enable "fact editing"[1].

[0]: https://arxiv.org/pdf/2104.08696.pdf

[1]: https://medium.com/syncedreview/microsoft-peking-u-researche...

ilaksh · 5y ago

I thought that Transformers were a type of associative memory.

SneakyTornado29 · 5y ago

https://arxiv.org/search/cs?searchtype=author&query=Hochreit...

orange3xchicken · 5y ago

Relevant paper from Misha Belkin's group https://arxiv.org/abs/1909.12362

ArtWomb · 5y ago · 1 in thread

Trending as John Hopfield is scheduled to present his "biologically plausible" response to the Modern Hopfield Network at ICLR next week:

Large Associative Memory Problem in Neurobiology and Machine Learning

https://arxiv.org/abs/2008.06996

MHNs seem ideal for prediction problems based purely on data, such as chemical reactions and drug discovery:

Modern Hopfield Networks for Few- and Zero-Shot Reaction Prediction

https://arxiv.org/abs/2104.03279

SpaceManNabs · 5y ago

Krotov (Hopfield's co-author in this set of papers) has a tweetutorial for the paper in your first link:

https://twitter.com/DimaKrotov/status/1387770672542269449

kdavis · 5y ago · 1 in thread

“Sooner or later, everything old is new again.” -Stephen King

mhh__ · 5y ago

"I’m fashionable once every 15 years, for about three months" - John Cooper Clarke

scrubs · 5y ago · 1 in thread

Quoting: "We introduce a new energy function and a corresponding new update rule which is guaranteed to converge to a local minimum of the energy function."

Is this a minimum in a local area or local in the range of some function? I could see that perhaps being an advantage if you happen to know that local part of the range.

In contrast, we're usually looking for a global min/max, say with annealing algorithms. How is local better than global in the context of this paper?

rubatuga · 5y ago

They mean local minimum as in an attractor state. Each "memory" is an attractor state stored in the Hopfield network.
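
To make the attractor picture concrete, here is a minimal sketch. It deliberately uses the classical binary Hopfield network (Hebbian weights, asynchronous sign updates) and not the paper's new continuous energy function, so treat it as an illustration of "memory = local minimum" only. The names `train_hebbian`, `energy`, and `recall`, and the two toy patterns, are made up for this example.

```python
def train_hebbian(patterns):
    """Build a symmetric, zero-diagonal weight matrix from +1/-1 patterns."""
    n = len(patterns[0])
    w = [[0.0] * n for _ in range(n)]
    for p in patterns:
        for i in range(n):
            for j in range(n):
                if i != j:
                    w[i][j] += p[i] * p[j] / n
    return w


def energy(w, s):
    """Classical Hopfield energy: E = -1/2 * sum_ij w_ij * s_i * s_j."""
    n = len(s)
    return -0.5 * sum(w[i][j] * s[i] * s[j] for i in range(n) for j in range(n))


def recall(w, state, max_sweeps=10):
    """Update units one at a time until nothing changes; log energy per sweep."""
    s = list(state)
    energies = [energy(w, s)]
    for _ in range(max_sweeps):
        changed = False
        for i in range(len(s)):
            field = sum(w[i][j] * s[j] for j in range(len(s)))
            new_value = 1 if field >= 0 else -1
            if new_value != s[i]:
                s[i] = new_value
                changed = True
        energies.append(energy(w, s))
        if not changed:
            break  # fixed point reached: the state sits in an attractor
    return s, energies


# Two toy memories (hypothetical data, chosen to be orthogonal).
memories = [
    [1, 1, 1, 1, -1, -1, -1, -1],
    [1, -1, 1, -1, 1, -1, 1, -1],
]
w = train_hebbian(memories)

noisy = [-1, 1, 1, 1, -1, -1, -1, -1]  # first memory with one unit flipped
recovered, energies = recall(w, noisy)
```

With these toy patterns, `recovered` equals the first memory and the logged energies never increase. The network is not searching for a global optimum; it slides downhill from the query into the nearest stored pattern, which is exactly what you want from a memory lookup.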

zibzab · 5y ago

I looked at the paper but it was way over my head.

Can anyone explain it in simpler terms to a person who barely understands attention models and has no idea what associative memory means here?

mark_l_watson · 5y ago

Nice paper! I used Hopfield networks in the 1980s. I hope that I can clear a few hours of time this week to work through this. I admit that, for machine learning, I have fallen into the “deep learning for everything” pit in the last six or seven years. Probably because DL is what I usually get paid for.

gyre007 · 5y ago

This reminded me of a very old fun side project of mine [1] that made me look at neural networks from a different perspective.

[1] https://github.com/milosgajdos/gopfield

EVa5I7bHFq9mnYK · 5y ago

If I understood them correctly, they store all the training samples and then select one most similar to a given input.
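
That reading can be sketched in a few lines, assuming the update rule given in the linked post: new state = X softmax(β Xᵀ ξ), where the columns of X are the stored patterns and ξ is the query. The function name `retrieve`, the toy patterns, and the two β values below are illustrative choices, not taken from the post or from the hopfield-layers code.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def retrieve(stored, query, beta):
    """One update step: average the stored patterns, weighted by
    softmax(beta * <pattern, query>)."""
    scores = [beta * sum(p_d * q_d for p_d, q_d in zip(p, query)) for p in stored]
    weights = softmax(scores)
    dim = len(query)
    new_state = [sum(w * p[d] for w, p in zip(weights, stored)) for d in range(dim)]
    return new_state, weights


# Three stored patterns and a query closest to the first one (toy data).
stored = [
    [1.0, 0.0, 0.0],
    [0.0, 1.0, 0.0],
    [0.0, 0.0, 1.0],
]
query = [0.9, 0.2, 0.1]

sharp_state, sharp_weights = retrieve(stored, query, beta=50.0)  # near one-hot
soft_state, soft_weights = retrieve(stored, query, beta=1.0)     # a blend
```

With a large β the softmax weights are close to one-hot, so the result is essentially "the stored sample most similar to the input". With a small β the result is a weighted mix of several stored patterns. The post argues that this same softmax-weighted lookup is what transformer attention computes, with the stored patterns playing the role of keys and values.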

SneakyTornado29 · 5y ago

Are*

komalghori22 · 5y ago

Amazing
