Is your brain more complicated than your fridge?

6 minute read

Published: May 28, 2022

The human brain is often described as “the most complex object in the universe” - justified by the number of synapses it contains or similar. But a glass of water contains lots of molecules and degrees of freedom as well.¹ While the amount of relevant computation a brain performs during its lifetime may be high,² the amount of information needed to describe it is upper-bounded by how much relevant data

was contained in the fertilized egg it developed from, and
flowed into it after conception.

Even by computing standards of the 1990s, these are modest quantities - what’s more, obtaining them is essentially a solved problem:

How large is a brain’s relevant genome?

According to common wisdom, a fertilized egg - containing a human’s biological algorithm³ - is mostly described by its genome.⁴ With caveats, a reference human’s DNA sequence had been determined by 2003⁵. It is also believed that at most 10-15% of the genome encodes relevant information,⁶ corresponding to about 75-115 megabytes of uncompressed data.⁷ Of course, some of this (QUESTION: How much?) describes other body parts than the brain.⁸

How much data is a brain trained on?

The data a brain receives and develops with comes either from other humans (mostly in the form of language), or from observing and interacting with everyday physics directly. Obtaining it would be as easy as recording actual infants. Estimating the quantities:

150 megabytes worth of text is on the high end of how much language an American child overhears before age 10 - and according to many parents, much of this information doesn’t change its mind. Formal higher education takes even less space: For example, paltry 5.5 megabytes suffice to encode a college degree’s worth of study material.⁹
The information - as opposed to computation - an infant experimenting in its surroundings benefits from can be generated from sufficiently accurate descriptions of the laws of everyday physics, and the objects the infant interacts with. It seems fair to me to assume that, to the accuracy necessary to develop intelligence, these descriptions are largely contained in what adults talked about while it grew up - particularly considering that intelligence doesn’t require eyesight to develop.

In conclusion, adult human intelligence probably fits into less - and plausibly much less - than 300 megabytes of data mostly available to humanity today - if one doesn’t count (pseudo)randomness, redundancy, “character traits” that could be different without changing basic mental functioning, and helpful intermediate computations the brain performed during development.¹⁰ If you have a smart fridge with an LED screen, it probably runs more code.

Neither are we bottlenecked by our fundamental physics knowledge: To get to a point where physics couldn’t predict a brain’s behaviour anymore on an atomic scale, one would need to throw it into a black hole or subject it to similarly extreme conditions.

The bottom line is that describing the human brain’s algorithm, both before and after training, can’t be that complex. The question is: How to understand and reproduce the algorithm, not in a thought experiment but in practice?

Footnotes

This problem is not just hypothetical: As of 2016, surrounding water molecules were a major expense in biochemical simulations according to what I heard. I don’t know the current status (QUESTION). ↩
See Joseph Carlsmith’s report for a thorough attempt at estimation. The number of operations needed may be much higher or much lower than the number of spikes occuring in a natural brain. Furthermore, as they become more complex, artificial and natural computation systems tend to become bottle-necked by communication rather than computation - so the “number of operations required” may turn out to be irrelevant. TODO either elaborate, or remove this? ↩
Strictly speaking, the distinction between “algorithm” and “data” is blurred in an adult brain, as it changes its algorithm based on what it observed during development. ↩
While some more data may be stored in other places, for example the epigenome (QUESTION: more data on that?), it is believed that DNA encodes the bulk. ↩
With more caveats, the hypothetical cost of repeating that feat with modern methods declines exponentially over the years, in analogy to other trends in information processing. ↩
Less that 2 % of human DNA is coding, i.e. describes the amino acid sequences comprising proteins found in humans. According to Graur et al., 2013, another 3-13 % changed slower in evolutionary history than the rest, suggesting that mutations in these places typically had a negative effect on the organism’s functioning and evolutionary fitness. These regions are known or suspected (QUESTION: What is the status on that?) to encode, for example, “control logic” activating or deactivating protein-coding genes. The remainder is given the title “junk DNA”, and assumed to be completely useless. ↩
The DNA in a healthy human brain cell contains one sequence per biological parent, but the difference between these sequences is small - the difference between an individual genome and a reference fits into about 4 MB. ↩
It seems plausible to me that a minimal, abstract algorithm reproducing human intelligence takes order of magnitude less space, (here)[/posts/2022/06/neuro-stochasticity/] is a list of reasons why. ↩
A college student taking 32 courses over the course of their degree, each of which is based on a 500-page book containing 250 words per page, studies 4 000 000 words in total. The original word gap study resulted in 11.5 million words per year overheard by “an average child in a professional family”, which would correspond to 290 million words over 25 years. A raw King James Bible, compressed with lzip, occupies 1.35 bytes/word of disk space. See here for a discussion of raw sensory throughput. ↩
Of course, this depends on the exact definitions and cut-off between “computation” and “information” - for example, human-level intelligence may become much simpler to describe if we allowed evolution as part of the algorithm, rather than requiring the algorithm to store evolution’s outcome at the moment of conception. ↩

Share on

Twitter Facebook LinkedIn

Types of stochasticity and errors brains need to deal with/ways in which proteins in water are suboptimal for computation

6 minute read

Published: June 11, 2022

Together, these may add orders of magnitude to the complexity and resources a brain has to use to accomplish tasks.

mutations in evolutionary history - preferable to have a brain structure surviving some mutations
diffusion times of neurotransmitters
stochasticity of reactions facilitated by proteins
protein folding: to implement the equivalent of a logic gate, one needs to invent some protein that folds in just the right way to facilitate a computation. That doesn’t sound easy at all, may introduce overhead
leakage (of current through axon/neuron walls)
stochasticity of neuronal growth and arborization?
Energy:
- using too much energy is a much bigger concern for animals than for computers, so it is plausible that there are tradeoffs towards using less energy vs. better performance
- relatedly: Mitochondrial volume limits energy inflow into brain; power density of biomatter is much lower than achievable in e.g. microchips
- warm-blooded brain temperature can’t rise more than a few K before damage, limiting sustained energy expenditure. Can’t cool with liquid nitrogen either…
Evolution: There must have been a continuous evolutionary path from a bacterium to the human, or any other, brain. This probably doesn’t mix well with many “brittle” algorithms typical in CS, like cryptography, compression, maybe error correction…
because proteins and neurons are so incredibly slow compared to transistors, algorithms that can’t be parallelized well may not even be worth it
Indeed, there is little evidence for the sort of algorithms computer scientists devise (as in cryptography, compression, …) in biology. The most involved algorithm I heard of is the use of “modified Bloom filters” in flies remembering odors.
Parasites

Duyal Yolcu

Is your brain more complicated than your fridge?

How large is a brain’s relevant genome?

How much data is a brain trained on?

Footnotes

Share on

You May Also Enjoy

Neuroscience tech tree

Neuroscience tech tree

Types of stochasticity and errors brains need to deal with/ways in which proteins in water are suboptimal for computation

Project: Neuroscience under a quantitative progress studies perspective

Reachable states in quantum phase estimation