Could we construct a neural net from nodes with more complex behaviour? Probably, but in computing we’ve generally found that it’s best to build up a system from simple building blocks. So what if it takes many ML nodes to simulate a neuron? That’s probably still an efficient way to do it, especially in this early phase where we’re not quite sure which architecture is best. It’s easier to experiment with various neural net architectures when the building blocks are simple.
This is probably what you're remembering: https://www.sciencedirect.com/science/article/pii/S089662732...
Well there's spiking neural networks (SNN)[1], which are modeled more closely to how neurons actually work.
Main obstacle is still, as far as I know, that there's no way to train an SNN as efficiently as a "regular" neural network, which lends itself very nicely to gradient descent and the like[2].
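To make the contrast concrete, here's a toy leaky integrate-and-fire neuron, the simplest spiking model SNNs build on. All the constants (time constant, threshold, input level) are illustrative choices, not from any particular paper:

```python
import numpy as np

# Minimal leaky integrate-and-fire (LIF) neuron. The membrane potential
# leaks toward rest, integrates input, and emits a discrete spike when
# it crosses threshold -- the discontinuity that makes SNNs awkward for
# plain gradient descent.
def simulate_lif(input_current, dt=1.0, tau=20.0, v_rest=0.0,
                 v_thresh=1.0, v_reset=0.0):
    v = v_rest
    spikes = []
    for t, i_in in enumerate(input_current):
        # Leak toward rest while integrating the input current.
        v += dt / tau * (-(v - v_rest) + i_in)
        if v >= v_thresh:          # threshold crossing -> spike
            spikes.append(t)
            v = v_reset            # hard reset after the spike
    return spikes

# Constant drive above threshold produces a regular spike train.
spike_times = simulate_lif(np.full(200, 1.5))
```

The spike/reset step is not differentiable, which is exactly why backprop doesn't apply directly and people resort to surrogate gradients or conversion tricks.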
An infinitely-fast computer wouldn't meaningfully change the "expensive training vs fast, static inference" workflow that neural networks have always been developed around (except in the most brute force-y "retrain on the entire world, every single nanosecond" sense).
The brain is supremely efficient at what the brain has evolved to do. It's almost tautological: if it weren't, it wouldn't have evolved that way.
Silicon is an alien substrate, and it's emulating. Even with the best algorithms there has to be a limit on how efficient a computer-based intelligence can be without changing how the chips work.
You could spin it around and say, well computers are better at many things than humans, and there is no way you could get a biological brain to be as good for the same amount of power (e.g. a raspberry pi can do calculations our brain couldn't possibly do).
The primary difference, and likely the reason that brains are unreasonably effective, is the specifics of the architecture and internal representations (in the rigorous, information-theoretic sense) of its computational systems. It's not quite analog but it uses analog means. It's not quite digital but it does process via abstractions.
You can still reasonably call the brain a "computer" if you decide it can shed the laden history of that word and its close association with binary operations using transistors. You can do so because it uses internal structures to process inputs and emit outputs. But like I said above, it requires a generalized interpretation of the word to start to understand where and how the two fields of study may be unified.
Neural networks fundamentally aren't designed to be otherwise. The workflow that has guided their entire development for over a decade is based around expensive training and static inference.
Let’s say a single A100 has a peak power draw of 250W, and you need 100 to train an LLM. So each hour of training consumes 25,000 Wh of energy. 15 MWh / 25,000 W = 600 hours, or 25 days, which is probably pretty close to the true training time.
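The arithmetic above can be sanity-checked in a few lines (using the same assumed figures: 250 W per A100, 100 GPUs, a 15 MWh budget):

```python
# Back-of-the-envelope check of the numbers above. All inputs are the
# assumed figures from the comment, not measured values.
gpus = 100
watts_per_gpu = 250                      # assumed peak draw of one A100
total_watts = gpus * watts_per_gpu       # 25,000 W = 25 kW
budget_wh = 15_000_000                   # 15 MWh expressed in watt-hours

hours = budget_wh / total_watts          # Wh / W = hours
days = hours / 24
print(hours, days)                       # 600.0 25.0
```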
So the numbers are actually pretty close. But a human brain doesn’t start out as a set of random weights like an LLM. The human brain has predefined structure that’s the result of an extremely long evolutionary process.
To me that just means nobody has figured out how to do that effectively. The majority will simply make use of what's been done and proven, so we got a plateau at object recognition, and again at generative AI (with applications in several domains). One problem with continuous adaptation and learning is providing an "entity" and "environment" for it to "live" in while doing the adaptive learning. Some researchers are doing that, either with robots or with simulations. That's much harder to set up than a lot of cloud compute resources. I do agree with you that these aspects are missing and things will be much more interesting when they get addressed.
Please don't claim things the author didn't. What I read was "ergo (artificial) neural networks may be missing a trick"
I mean, sure, but the topology is exactly what makes both work, so we only really care about the topology.
Agreed, a widespread category error.
But computers still do some pretty cool things. Powerful tools.
Their entire article hinges on the complaint "brain seems shallow and neural networks are deep, ergo neural networks are doing it wrong."
Neurologists seem to have a really hard time comprehending that researchers working on neural networks aren't as clueless about computers as neurology is about the brain. They also vastly overestimate how much engineers working on neural networks even care about how biological brains work.
Virtually every attempt at making neural networks mimic biological neurons has been a miserable failure. Neural networks, despite their name, don't work anything like biological neurons and their development is guided by a combination of
A) practical experimentation and refinement, and
B) real, actual understanding about how they work.
The concept of resnets didn't come from biology. It came from observations about the flow of gradients between nodes in the computational graph. The concept of CNNs didn't come from biology, it came from old knowledge of convolutional filters. The current form and function of neural networks is grounded in repeated practical experimentation, not an attempt to mimic the slabs of meat that we place on pedestals. Neural networks are deep because it turns out hierarchical feature detectors work really well, and it doesn't really matter if the brain doesn't do things that way.
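The gradient-flow observation behind resnets is easy to demonstrate. Here's a toy residual block, y = x + f(x), with deliberately tiny weights; everything about it (sizes, the tanh, the scale 0.01) is an illustrative choice:

```python
import numpy as np

# Toy residual block: y = x + f(x). The identity path is the point --
# the Jacobian is I + df/dx, so even when f's weights are near zero
# (as they can be early in training, or deep in a stack), the signal
# and its gradient still pass through the "+ x" term.
rng = np.random.default_rng(0)
W = rng.normal(scale=0.01, size=(4, 4))   # deliberately tiny weights

def f(x):
    return np.tanh(W @ x)

def residual_block(x):
    return x + f(x)

# Finite-difference Jacobian of the block at a random point: it should
# be close to the identity matrix despite W being near zero.
x = rng.normal(size=4)
eps = 1e-6
jac = np.empty((4, 4))
for j in range(4):
    e = np.zeros(4); e[j] = eps
    jac[:, j] = (residual_block(x + e) - residual_block(x - e)) / (2 * eps)
print(np.round(jac, 3))
```

Without the skip connection the same block's Jacobian would be roughly zero, and stacking many of them would starve earlier layers of gradient.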
And then you have the nitwits searching the brain for transformer networks. Might as well look for mercury delay line memory while you're at it. Quantum entanglement too.
There are insights that can come from studying the brain, that do indeed apply. Some researchers may not glean anything from such studies, and some may. I have no doubt that as neural networks get more and more powerful, we will continue to find more ways they are similar to the brain, and apply things we've learned about the brain to them.
I certainly prefer to see people making comparisons of neural networks to the brain than the old "it's just a glorified autocomplete" and the like.
Relax.
1. https://braininitiative.nih.gov/sites/default/files/document...
I certainly have many critiques of methods used in neuroscience right now (as a working neuroscientist), but to reduce those to the conclusion that the entire project of neuroscience is hopeless is absurd. We understand certain things quite well, actually, and it's not at all obvious what "understanding" at a larger scale would look like. It is very possible that the brain is irreducibly complex, and that the model you would need to construct to describe it would itself be so complex as to be useless in providing insight. Considering that the brain is by far the most complex object in the universe, I think we're doing pretty well.
Furthermore, there is quite a lot of disagreement about the utility of connectomics. Outside of the extremists (Sebastian Seung and his ilk), no one thinks that connectomics is going to be the key that brings earth-shattering insight. It's just another tool. There is already a complete connectome for part of the drosophila brain (privately funded, btw), which is in daily use in many fly labs. It tells you which other neurons a given neuron is connected to. Incredibly useful. Not earth-shattering.
also you might want to measure the neuroscience funding you deem wasteful up against the tens of billions NASA is spending to send humans (and not robots) back to the moon for "the spirit of adventure". cold war's over. robots will do just fine for the moon.
It seems the whole point is to bring in additional details of how brains work that they think may be relevant to artificial NNs.
Lots of graph nodes, with weighted connections, performing distributed computation (mainly hierarchical pattern matching), learning from data by gradually updating weights, using selective attention (and/or recurrence, and/or convolutional filters).
Which of the above is not happening in our brains? Which of the above is not biologically inspired?
In fact this description equally applies to both a brain and GPT4.
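The ingredients listed above fit in a dozen lines. Purely illustrative: a single "neuron" with weighted connections and a nonlinearity, learning the AND function by gradually nudging its weights along the gradient:

```python
import numpy as np

# Weighted connections + nonlinearity + gradual weight updates from
# data: the shared vocabulary of the description above, in miniature.
# A single logistic unit learning AND (a toy, linearly separable task).
rng = np.random.default_rng(1)
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], float)
y = np.array([0, 0, 0, 1], float)        # AND truth table
w, b, lr = rng.normal(size=2), 0.0, 1.0

sigmoid = lambda z: 1 / (1 + np.exp(-z))
for _ in range(5000):
    p = sigmoid(X @ w + b)               # weighted sum -> nonlinearity
    grad = p - y                         # dLoss/dlogit for cross-entropy
    w -= lr * X.T @ grad / len(X)        # gradually update the weights
    b -= lr * grad.mean()

print(np.round(sigmoid(X @ w + b)))      # learned AND outputs
```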
Would you have preferred I emulate your style, and complain while providing no support for my complaint?
Ok.
> The concept of CNNs didn't come from biology
I just opened a survey paper on CNNs and literally the first sentence of the paper reads:
> “Convolutional Neural Network (CNN) is a well-known deep learning architecture inspired by the natural visual perception mechanism of the living creatures. In 1959, Hubel & Wiesel [1] found that cells in animal visual cortex are responsible for detecting light in receptive fields. Inspired by this discovery…”
Source: https://arxiv.org/pdf/1512.07108.pdf
The C in CNN isn't "Convolution" for no reason. It came from work with convolutional filters (yay Sobel kernels!), which at its height gave us filter banks, Gabor filters, and so on, before neural networks pretty much killed off handcrafted feature development. Every explanation of how CNNs work still falls back to the original convolutional kernel intuition.
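That intuition is a few lines of code. Here's the classic handcrafted starting point: convolving an image with a Sobel kernel to pick out vertical edges. A CNN learns kernels like this instead of having them designed by hand (the 5x8 test image here is made up for illustration):

```python
import numpy as np

# Hand-designed Sobel kernel for vertical edges -- the kind of filter
# that predates CNNs, and that CNNs now learn from data.
sobel_x = np.array([[-1, 0, 1],
                    [-2, 0, 2],
                    [-1, 0, 1]], float)

def conv2d_valid(img, k):
    """Plain 'valid' 2D convolution (really cross-correlation), no padding."""
    kh, kw = k.shape
    out = np.empty((img.shape[0] - kh + 1, img.shape[1] - kw + 1))
    for i in range(out.shape[0]):
        for j in range(out.shape[1]):
            out[i, j] = np.sum(img[i:i + kh, j:j + kw] * k)
    return out

# Test image: dark left half, bright right half. The filter responds
# strongly only where the vertical edge sits.
img = np.zeros((5, 8)); img[:, 4:] = 1.0
edges = conv2d_valid(img, sobel_x)
print(edges)
```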
Before that: comparing the brain with hydraulic machines. There has been a tendency to compare the brain with the most complex machine known to us at any particular time.
"Descartes was impressed by the hydraulic figures in the royal gardens, and developed a hydraulic theory of the action of the brain. We have since had telephone theories, electrical field theories, and now theories based on computing machines… . We are more likely to find out how the brain works by studying the brain itself, and the phenomenon of behavior, than by indulging in far-fetched physical analogies." -- Karl Lashley 1951
There's little sense in ignoring the whole basic mode of operation, physics, chemistry and biology of the brain in order to analogise it to another system without any of those properties.
This, at best, provides a set of inspirations for engineers -- it does nothing for science.
Sure there is. People had a feel for it back in "clockworks" times, nowadays we have a much better grasp because of progress of physics and math, particularly CS - mode of operation is an implementation detail. Whatever the mode, once you understand the behavior enough to model it in computational terms, you can implement it in anything you like - gears and levers, pistons, water flowing between buckets, electrons in silicon, photons going through lenses, photons diffusing through metamaterials, sound waves diffusing through metamaterials - and yes, also via a person locked in a room full of books telling them what to draw in response to a drawing they receive, and also via a billion kids following a game to the letter, via corporate bureaucracy, via board game rules, etc.
Substrate. Does. Not. Matter.
The only thing limiting your choice here is practical one. Humanity is getting a good mileage out of electrons in silicon, so that's the way to go for now. Gears would work too, they're just too annoying to handle at scale.
Of course, today we don't have a full understanding of the biological substrate - we can't model it fully in terms of computation, because it's a piece of spontaneously evolved nanotech and we've barely begun being able to observe things at those scales. We have a lot of studying in front of us - but this is about learning how the gooey stuff ticks, what it computes and how. It's not about some new dimension of computation.
The deepest fundamental structures in the brain[0] are quantum fields, which are also the deepest fundamental structures in everything else.
There is no known quantum field of "soul" or "intelligence".
The right abstraction is higher, and could still be a whole lot of things; but as maths can be implemented in logic, which can be implemented in electronics or clockwork or hydraulics, it doesn't matter what analogy is used — and my mild disagreement here is that such inspiration has been useful and gotten us this far.
[0] that we know of
[1] - https://en.wikipedia.org/wiki/Convolutional_neural_network
It's not surprising that we found out later the brain also uses such a fundamental element of signal theory.
[1] https://www.amazon.com/Biophysics-Computation-Information-Co...
But in reality, we’re equipped exactly to exist, and we still wonder why in a backwards way, even with education (guilty!)
AI is the task of playing God like toddlers at recess, and LLMs the tower of babel. I still wanna play, it’s fun
Second, there is no need to compare brains to neural networks because brains are neural networks. Neurons form the vertices and axons the edges connecting them. What you are perhaps thinking of are artificial neural networks - most of which are very dissimilar to brains. But even then you are wrong. Artificial Izhikevich and Hodgkin-Huxley neural networks attempt to closely mimic the behavior of real neurons.
While deep, hierarchical artificial neural networks have been more successful than biologically plausible ones, that may be because the technology isn't ready yet. After all, the perceptron was invented in the 1950s but didn't become prominent until the 2010s (or so). Perhaps we need new memories that better map to (real) neural network topologies, or perhaps 3D chips that can pack transistors the way brains pack neurons.
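For reference, the Izhikevich model mentioned above fits in a handful of lines. This is a straightforward Euler simulation of the published equations with the standard "regular spiking" parameters; the input current I=10 and the 1-second duration are arbitrary choices:

```python
# Izhikevich neuron model (Izhikevich, 2003), "regular spiking" regime:
#   v' = 0.04 v^2 + 5 v + 140 - u + I
#   u' = a (b v - u)
#   spike when v >= 30 mV: v <- c, u <- u + d
a, b, c, d = 0.02, 0.2, -65.0, 8.0      # standard RS parameters
v, u = -65.0, b * -65.0                 # start at the resting state
I, dt = 10.0, 0.25                      # assumed drive, Euler step (ms)

spikes = 0
for step in range(int(1000 / dt)):      # simulate ~1000 ms
    v += dt * (0.04 * v * v + 5 * v + 140 - u + I)
    u += dt * a * (b * v - u)
    if v >= 30.0:                       # spike: reset v, bump u
        v, u = c, u + d
        spikes += 1
print(spikes)
```

With I=10 the system has no stable resting point, so the neuron fires tonically - the regular-spiking behaviour the model is named for.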
Changes in mechanical pressure, electric fields, the attachment of other molecules, or photon absorption can control the conductivity.
Organic semiconductors designed to fit like lego bricks to naturally build the desired structure are IMHO the way to go to produce 3d circuits, rather than layered silicone lithography.
I've seen this particular mistake a lot recently. New and exciting auto-corrupt from the latest version of iOS?
Given that our brains rewire themselves live, which ANNs can only do by being excessively connected and updating weights to/from zero, silicone (I'm thinking mainly the oil form) may be a better inspiration than lego.
As a developmental neuroscientist, I found the article insightful and thought provoking. Further, it is quite consistent with major hypotheses in psychology, how the hippocampus works (a subcortical structure) and combines information into memories: See fuzzy trace theory [1], for example.
Your dismissive tone is unappreciated, ill-informed, and crass.
Value of this comment aside, it kind of makes me chuckle how casually it (and other comments in this thread) just drops the word "artificial" from neural networks here, specifically when comparing with neurology. The irony is funny. Like, somehow we've forgotten why we call them that in the first place, exactly when talking about the thing that inspired the approach.
There are things the brain does that we have not yet been able to reproduce with a neural network, or have only reproduced with seemingly excessive training resources and network size. Therefore there is some salient feature of neurology that has been overlooked. I don't think it is necessary to mimic biology down to the exact function of real neurons, but there must in fact be something we are neglecting to mimic.
"Book smart, not street smart" (to use a catchphrase) would apply perfectly to GPT models: brain the size of a rodent's, with 50,000 year's experience of reading Reddit, Wikipedia, and StackOverflow, but no "real life" experiences of its own.
It is more useful to use AI to develop more ecologically valid measurement methods for biology.
Even theoretically, no they can't. They can theoretically model any continuous function.
Plus, even for continuous functions, the theorem only proves that, for any function, there exists some NN that approximates it to arbitrary precision. It is not known whether there is some base NN + finite training set that could be used to arrive at that target NN using some algorithm in a finite number of steps.
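The flavor of that existence result is easy to show by construction rather than training: a one-hidden-layer ReLU network that interpolates f(x) = x² on [0, 1]. The knot count and target function are arbitrary illustrative choices; the theorem promises such a network exists, not that any algorithm finds it:

```python
import numpy as np

# Universal approximation by construction: express the piecewise-linear
# interpolant of f(x) = x**2 at n knots as a single hidden layer of
# ReLU "hinges":  g(x) = f(0) + sum_i c_i * relu(x - t_i),
# where c_i is the change in slope at knot t_i.
def relu(z):
    return np.maximum(z, 0.0)

n = 32
knots = np.linspace(0.0, 1.0, n + 1)
f = knots ** 2
slopes = np.diff(f) / np.diff(knots)               # slope per segment
coeffs = np.concatenate([[slopes[0]], np.diff(slopes)])  # slope changes

def g(x):
    # One hidden ReLU layer (n units), one linear output unit.
    return f[0] + np.sum(coeffs * relu(x[:, None] - knots[:-1]), axis=1)

xs = np.linspace(0.0, 1.0, 1000)
err = np.max(np.abs(g(xs) - xs ** 2))
print(err)   # shrinks as n grows (roughly like 1/n**2 here)
```

Doubling n roughly quarters the error, but nothing in this construction tells you how to reach those weights by gradient descent from a random start - which is exactly the gap the comment above points at.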
Of course you then need to compensate with residuals, initialisation, normalisation, and all that, but it’s a small price to pay for scaling much much better with compute.
This is why today, if you need a low-latency NN, which means a shallow one, often your best bet is to train a deep one first and then distill or prune it down into a shallow one. Because the deep one is so much easier, while training a shallow one from scratch without relying on depth may be an open research question and effectively impossible.
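A minimal sketch of that distillation step, with a stand-in teacher. Everything here is illustrative: the "teacher" is just a fixed function playing the role of a trained deep net, and the shallow student is fit to the teacher's outputs rather than to raw labels:

```python
import numpy as np

# Distillation sketch: fit a shallow "student" to match the outputs of
# a "teacher" (here a fixed function standing in for a pretrained deep
# model). Sizes, learning rate, and iteration count are arbitrary.
rng = np.random.default_rng(0)

def teacher(x):                          # stand-in for a trained deep net
    return np.sin(3 * x)

# Shallow student: one hidden tanh layer, trained by full-batch
# gradient descent on the teacher's outputs.
W1 = rng.normal(size=(1, 16)); b1 = np.zeros(16)
W2 = rng.normal(size=(16, 1)); b2 = np.zeros(1)
x = np.linspace(-1, 1, 256)[:, None]
t = teacher(x)                           # soft targets from the teacher
lr = 0.05
for _ in range(3000):
    h = np.tanh(x @ W1 + b1)
    pred = h @ W2 + b2
    err = pred - t                       # match the teacher, not labels
    gW2 = h.T @ err / len(x); gb2 = err.mean(0)
    dh = (err @ W2.T) * (1 - h ** 2)     # backprop through tanh
    gW1 = x.T @ dh / len(x); gb1 = dh.mean(0)
    W2 -= lr * gW2; b2 -= lr * gb2
    W1 -= lr * gW1; b1 -= lr * gb1

mse = float(np.mean((np.tanh(x @ W1 + b1) @ W2 + b2 - t) ** 2))
print(mse)
```

Real distillation matches logits or softened class probabilities rather than a scalar function, but the structure is the same: the student's training signal comes from the bigger model, not from the dataset directly.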
It doesn't. You can speak perfectly fine with children. And in fact some teenagers think they know everything.
As I understand it the thalamus is basically a giant switchboard though. I see no reason to believe that it never connects the output of one cortical area to the input of another, thus doubling the effective depth of the neural network. (I haven’t read this paper though, as it was behind a paywall.)
Your comment would be very valuable to me if it included pointers to better sources. I have sufficient background to see gaps in Jeff's book, and would be interested in exploring these, perhaps through the references you seem to be aware of.