Abstract: Artificial neural networks are universal function approximators. They can forecast dynamics, but they may need impractically many neurons to do so, especially if the dynamics is chaotic. We use neural networks that incorporate Hamiltonian dynamics to efficiently learn phase space orbits even as nonlinear systems transition from order to chaos. We demonstrate Hamiltonian neural networks on a widely used dynamics benchmark, the Hénon-Heiles potential, and on nonperturbative dynamical billiards. We introspect to elucidate the Hamiltonian neural network forecasting.
http://www.catb.org/~esr/jargon/html/koans.html
In the days when Sussman was a novice, Minsky once came to him as he sat hacking at the PDP-6. “What are you doing?”, asked Minsky.
“I am training a randomly wired neural net to play Tic-Tac-Toe” Sussman replied.
“Why is the net wired randomly?”, asked Minsky.
“I do not want it to have any preconceptions of how to play”, Sussman said. Minsky then shut his eyes. “Why do you close your eyes?”, Sussman asked his teacher.
“So that the room will be empty.”
At that moment, Sussman was enlightened.
A baby is not born with the knowledge of body movement, for example, but through natural exploration of the body and environment, almost all physically capable humans learn to walk.
"We are seeking exceptional candidates to join our growing Autonomous Vehicle (AV) business team!"
https://techcrunch.com/2019/03/13/ford-is-expanding-its-self...
...oh that makes so much more sense! -.-
I'm probably misunderstanding what they accomplished, but it sounds like they've increased the accuracy of a neural network model of a system, notably for edge cases, by training it on a complete model of said system.
Not quite. It's really just that they require the dynamics to be Hamiltonian, which would be highly atypical of the kind of dynamics an otherwise unconstrained neural network would learn. This is reflected in their loss functions: the first learns an arbitrary second-order differential equation, while the second enforces Hamiltonian dynamics.
I don't understand how this was considered novel enough to warrant a PRE paper.
Here is a link to the paper:
https://journals.aps.org/pre/pdf/10.1103/PhysRevE.101.062207
In general the idea of including model or context-based information into neural networks goes along the line of Kahneman's System I and System II of the human mind. System I is the "emotional" brain that is fast and makes decisions quickly while System II is the "rational" brain that is slow and expensive and takes time to compute a response. Researchers have been trying to develop ML models that utilize this dichotomy by building corresponding dual modules but the major challenge remains in efficiently embedding the assumptions of the world dynamics into the models.
[0] https://arxiv.org/abs/1906.01563 [1] https://en.wikipedia.org/wiki/Thinking,_Fast_and_Slow
ML non-expert here. Is this the same as having an extra column of your input data that's the Hamiltonian of the raw input? Or a kind of neuron that can compute a Hamiltonian on an observation? Or something more complicated?
is this like a specialized 'functional region' in a biological brain? (broca's area, cerebellum)
A Hamiltonian neural network (HNN) intakes positions and momenta {q, p}, outputs the scalar function H, takes its gradient to find the position and momentum rates of change, and minimizes the loss

\mathcal{L}_{HNN} = \left\langle \left( \dot{q} - \frac{\partial H}{\partial p} \right)^2 + \left( \dot{p} + \frac{\partial H}{\partial q} \right)^2 \right\rangle,

which enforces Hamilton's equations of motion.
https://journals.aps.org/pre/abstract/10.1103/PhysRevE.101.0...
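A minimal sketch of that training signal, under stand-in assumptions: a known Hamiltonian (a 1-D harmonic oscillator, H = p²/2 + q²/2) plays the role of the network's scalar output, and central finite differences stand in for the automatic differentiation a real HNN would use.

```python
def hamiltonian(q, p):
    # Stand-in for the network's learned scalar H(q, p):
    # a 1-D harmonic oscillator in units where m = k = 1.
    return 0.5 * p**2 + 0.5 * q**2

def grad_H(q, p, eps=1e-6):
    # Central finite differences stand in for autodiff on a real HNN.
    dH_dq = (hamiltonian(q + eps, p) - hamiltonian(q - eps, p)) / (2 * eps)
    dH_dp = (hamiltonian(q, p + eps) - hamiltonian(q, p - eps)) / (2 * eps)
    return dH_dq, dH_dp

def hnn_loss(q, p, q_dot, p_dot):
    # Penalize deviation from Hamilton's equations:
    # q_dot = dH/dp and p_dot = -dH/dq.
    dH_dq, dH_dp = grad_H(q, p)
    return (q_dot - dH_dp)**2 + (p_dot + dH_dq)**2

# On a true orbit of the oscillator (q_dot = p, p_dot = -q) the loss is ~0.
print(hnn_loss(q=1.0, p=0.5, q_dot=0.5, p_dot=-1.0))
```

Because the flow is derived from one scalar H, Hamiltonian structure (and hence energy conservation) is baked in, rather than something the network has to discover.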
So, here it is: https://github.com/thesz/nn/tree/master/series
It's a proof-of-concept implementation of neural network training where the loss function is the potential energy in a Lagrangian, and I even incorporated a "speed of light": the particle's "mass" gets corrected by the Lorentz factor, m = m0/sqrt(1 - v^2/c^2).
Everything is done using ideas from a quite interesting paper about the power of lazy semantics: https://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.32....
PS: "Proof of concept" here means it is grossly inefficient, mainly due to the amount of symbolic computation. Yet it works. In some cases. ;)
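The mass correction mentioned above, in isolation (a toy sketch in natural units with c = 1 by default, not the repo's actual code):

```python
import math

# Relativistic mass correction: m = m0 / sqrt(1 - v^2/c^2).
# The factor 1/sqrt(1 - v^2/c^2) is the Lorentz factor gamma.
def relativistic_mass(m0, v, c=1.0):
    return m0 / math.sqrt(1.0 - (v / c)**2)

print(relativistic_mass(1.0, 0.6))  # gamma = 1/0.8 = 1.25 at v = 0.6c
```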
Sutton is saying 'over a slightly longer time'.
You can wait 20 more years for super-duper-deep-NNs-on-steroids, on hardware a million times as big and powerful, to rediscover all of theoretical physics.
Or you could inject some theoretical physics acquired by humans and make DNNs smarter today.
If so, this would be dramatic, no?
If you could teach a translation service 'grammar' and then also leverage the pattern matching, could this be a 'fundamental' new idea in AI application?
Or is this just something specific?
I don’t see a way to generalize this to the procedural rule-based systems you describe, unless they too are governed by a fairly simple continuous function like the Hamiltonian.
I don’t know if it was “dramatic”, but it made me really happy.