Show HN: PILF, The ultimate solution to catastrophic oblivion on AI models (opens in new tab)

(github.com)

31 pointsNetRunnerSu0y ago12 comments

12 comments

9 comments · 4 top-level

Ifkaluva0y ago· 2 in thread

It’s an interesting idea, I have two questions.

- Surprise is detected by the norm of the gradients. So, doesn’t this suggest that the model already has a way of adjusting to surprise?

- Is there a danger of model instability when the gradients become larger and the learning rate is also increased?

NetRunnerSuOP0y ago

1. an overly strong surprise is like PTSD in humans - it changes the model's previously learned experience forever, this is what we want to avoid

2. it's bound to happen, and our PILR-S is designed to keep the learning rate within the bell curve and decreasing as the surprise decreases (less new information, less learning).

derefr0y ago

But doesn’t this lead to the opposite problem: creating a model that can never learn to let go of an early-life mental model picked up from a skewed dataset?

By analogy to humans: if this model were raised in a cult, and then let out into the real world, it would be seemingly incapable of unlearning the cult’s indoctrination, despite the real-world data all contradicting it — as all of this real-world data would be too surprising for the model to accept.

Or, for a maybe-more-likely situation you might encounter in e.g. incremental model re-training of old models for chronologically-newer info: a model trained this way would “stubbornly” refuse to accept any major shift in scientific consensus on a topic.

The human cognitive architecture seems to solve this problem by 1. buffering this rejected-for-being-too-out-there info in a way where it can at least be pattern-recognized; and then 2. noticing when a lot of different, seemingly independent, seemingly trustworthy sources begin matching on the rejected pattern. At that point, the human brain seems to swing the other way — experiencing a “crisis of faith” per se.

1 more reply

vermilingua0y ago· 2 in thread

Caution: this appears to be part of a very involved sci-fi LARP (as I understand it), so I’d take whatever claims it makes with a grain of salt.

NetRunnerSuOP0y ago

You can Git clone down and run around on your own - science fiction with enough precision is futurology

alienbaby0y ago

Ooohhhh.

upghost0y ago· 1 in thread

This looks absolutely fantastic, please accept my meagre professional jealousy. I have long bemoaned manual hyperparam fiddling . I have on occasion dabbled with nonparametric ("genetic") methods of hyperparam tuning inspired by AutoML... but then you still have to manually tune the evolutionary hyperparams.

Finding a way to derive this from the gradients is amazing.

NetRunnerSuOP0y ago

This is definitely not just another machine learning method. It comes from a complete cognitive science theory, rooted in a complete understanding of intelligence and consciousness.

https://github.com/dmf-archive/IPWT

hackingonempty0y ago

Parameters I'd Like to Fiddle

j / k navigate · click thread line to collapse

12 comments

9 comments · 4 top-level

Ifkaluva0y ago· 2 in thread

It’s an interesting idea, I have two questions.

- Surprise is detected by the norm of the gradients. So, doesn’t this suggest that the model already has a way of adjusting to surprise?

- Is there a danger of model instability when the gradients become larger and the learning rate is also increased?

NetRunnerSuOP0y ago

1. an overly strong surprise is like PTSD in humans - it changes the model's previously learned experience forever, this is what we want to avoid

2. it's bound to happen, and our PILR-S is designed to keep the learning rate within the bell curve and decreasing as the surprise decreases (less new information, less learning).

derefr0y ago

But doesn’t this lead to the opposite problem: creating a model that can never learn to let go of an early-life mental model picked up from a skewed dataset?

1 more reply

vermilingua0y ago· 2 in thread

Caution: this appears to be part of a very involved sci-fi LARP (as I understand it), so I’d take whatever claims it makes with a grain of salt.

NetRunnerSuOP0y ago

You can Git clone down and run around on your own - science fiction with enough precision is futurology

alienbaby0y ago

Ooohhhh.

upghost0y ago· 1 in thread

Finding a way to derive this from the gradients is amazing.

NetRunnerSuOP0y ago

This is definitely not just another machine learning method. It comes from a complete cognitive science theory, rooted in a complete understanding of intelligence and consciousness.

https://github.com/dmf-archive/IPWT

hackingonempty0y ago

Parameters I'd Like to Fiddle

j / k navigate · click thread line to collapse