1. Hopfield Networks are also known as "associative memory networks", a neural network model developed decades ago by a guy named Hopfield.
2. It's useful to plug these in somehow as layers in Deep Neural Networks today (particularly, in PyTorch).
I hate non-informative titles!
1. It's a paper from 2017. Unless you follow academic ML research, you will not have heard of it.
2. That paper's title is also inscrutable unless you've gone and read at least the abstract.
Edit: the linked pytorch implementation looks interesting, these layer types promise pretty incredible things https://github.com/ml-jku/hopfield-layers
I really wish I could literally just dump LaTeX onto the web and be done with it. Everything I've tried either doesn't work (Pandoc is cute) properly / isn't 1:1, or does work but yields enormous amounts of html (pdf2htmlex).
I am fairly happy with [insert MD->Book tool of your choice], but sometimes I want citations and things like that.
I also like dark themes (although I wouldn’t force those on my viewership).
[0]: https://arxiv.org/pdf/2104.08696.pdf
[1]: https://medium.com/syncedreview/microsoft-peking-u-researche...
Large Associative Memory Problem in Neurobiology and Machine Learning
https://arxiv.org/abs/2008.06996
MHN seem ideal for prediction problems based purely on data, such as chemical reactions and drug discovery:
Modern Hopfield Networks for Few- and Zero-Shot Reaction Prediction
Is this a minimum in a local area or local in the range of some function? I could see perhaps that'd being an advantage if you happen to know that local part of the range
In contrast we're usually looking for global min/max say with annealing algorithms. How is local is better in the context of this paper than global?
Can anyone explain it in simpler terms to a person who barely understands attention models and has no idea what associative memory means here?