Skip to content

← Back to the notes index

Knowledge and Entropy — From Living Systems to AI

Working draft — under revision. Numerical claims are illustrative and not yet fully cited.

Executive Summary

Knowledge systems on Earth follow a recurring thermodynamic pattern: compact encodings (DNA sequences, written symbols, algorithms) amplify into vast ordered outcomes while exporting entropy to maintain the second law of thermodynamics. This framing helps explain the accelerating evolution from genetic inheritance to artificial intelligence as a sequence of increasingly energy-intensive amplification cycles, each creating local order at the cost of greater entropy export.


Theoretical Foundation

Core Principle: Knowledge as Active Order

Knowledge systems differ from passive order (crystals, minerals) through their capacity for active proliferation. While crystals form stable structures, knowledge systems use information to actively reshape their environment, creating what we term "knowledge in action."

Universal Amplification Pattern

All knowledge systems follow a similar thermodynamic cycle:

  1. Compact Encoding: Small information input (e.g., genome of order 10⁹ base pairs)
  2. Energy Import (ΔH < 0): Required to drive the process
  3. Amplification Process: Replication (biological), execution (algorithmic), dissemination (cultural)
  4. Local Order Creation (ΔS_local < 0): The system becomes more organised
  5. Entropy Export: Heat / waste expelled to the environment
  6. Planetary Impact: Geological, atmospheric, biological effects
  7. Evolutionary Pressure: Selection for efficiency feeds back to step 1

Governing Equations

Gibbs Free Energy Constraint:

$$\Delta G = \Delta H - T\,\Delta S < 0$$

  • All persistent knowledge systems must export more entropy than they create locally.
  • Energy input ($\Delta H < 0$) compensates for local order creation ($\Delta S < 0$).
  • Temperature $T$ sets the required entropy-export rate.

Shannon Entropy Reduction:

$$H = -\sum_i p_i \log_2 p_i$$

  • Knowledge reduces uncertainty by making outcomes more predictable.
  • Lower $H$ values indicate higher information content and organisation.

Amplification Ratio:

$$A = \frac{\text{Output Scale}}{\text{Input Scale}} \times \frac{t_{\text{reference}}}{t_{\text{epoch}}}$$

  • Normalised across time scales for cross-epoch comparison.
  • Higher $A$ values indicate more efficient knowledge leverage.

Historical Evolution: Six Major Epochs

Epoch 1: Genetic Knowledge Systems (~3.8 billion years ago)

Knowledge Structure: DNA-encoded instincts guiding survival behaviours without learning.

Compact Encoding:

  • Human genome ~3 × 10⁹ base pairs
  • Bacterial genome ~10⁶ base pairs

Amplification Mechanism:

  • Single genome → on the order of 10³⁰ cells globally in biomass
  • Vertical transmission through reproduction
  • Horizontal gene transfer in prokaryotes

Thermodynamic Analysis:

  • Local order: molecules organised into complex cellular structures
  • Energy source: solar flux and chemical gradients
  • Entropy export: metabolic waste, respiratory heat
  • Efficiency: roughly 1% solar conversion in photosynthesis

Planetary Correlation: Great Oxygenation Event (~2.4 billion years ago).

  • Cyanobacterial photosynthesis processes hundreds of gigatons of carbon per year (order of magnitude).
  • Atmospheric shift from methane-rich to ~21% oxygen.
  • Enabled aerobic metabolism with a substantial energy-efficiency gain.

Epoch 2: Social Learning Systems (~500 million years ago)

Knowledge Structure: Neural networks enabling flexible learning through imprinting, conditioning and social interaction.

Compact Encoding:

  • Bird alarm calls — small bit budgets
  • Pheromone trails — small bit budgets
  • Behavioural imprints — short structured templates

Amplification Mechanism:

  • Single cue → coordinated group behaviour (10²–10³ individuals)
  • Horizontal transmission through observation and imitation
  • Cultural inheritance in higher mammals

Thermodynamic Analysis:

  • Local order: synchronised group movements, foraging efficiency
  • Energy source: coordinated hunting, reduced individual expenditure
  • Entropy export: movement heat, territorial conflicts
  • Efficiency: measurable improvement in resource acquisition

Planetary Correlation: Cambrian Explosion (~541 million years ago).

  • Social coordination enabled complex predator-prey dynamics.
  • Diversification into ecological niches.
  • Evolution of communication systems (echolocation, visual displays).

Epoch 3: Human Linguistic Systems (~300,000 years ago)

Knowledge Structure: Spoken language creating distributed, synchronised knowledge across tribal groups.

Compact Encoding:

  • Creation myths — short narratives, large cumulative effect
  • Technical knowledge (toolmaking) — compact procedural sequences
  • Social norms — small rule-sets governing large populations

Amplification Mechanism:

  • Single narrative → tribal coordination (50–150 individuals)
  • Real-time knowledge synchronisation through speech
  • Cultural transmission across generations

Thermodynamic Analysis:

  • Local order: synchronised hunts, resource sharing, conflict resolution
  • Energy source: communal labour specialisation, cooperative childcare
  • Entropy export: controlled fires, tool-manufacturing waste
  • Efficiency: substantial survival advantage through cooperation

Examples:

  • Australian Aboriginal songlines: short verse cycles → navigation across thousands of kilometres
  • Inuit hunting knowledge: seasonal patterns → Arctic survival
  • San people tracking: animal behaviour → desert resource location

Epoch 4: Written Knowledge Systems (~5,500 years ago)

Knowledge Structure: External storage in durable media enabling asynchronous transmission across vast distances and time periods.

Sub-epochs:

  • 4a: Early Writing (3500 BCE) — Cuneiform with on the order of 600 symbols, used in early empire administration
  • 4b: Alphabetic Systems (1050 BCE) — Phoenician alphabet (~22 letters) → broader literacy
  • 4c: Paper and Codex (105–400 CE) — Portable knowledge → wider distribution
  • 4d: Universities (1088 CE) — Institutional knowledge → scholarly networks
  • 4e: Scientific Method (1620 CE) — Empirical methodology → technological breakthroughs

Monetary Systems Integration (~5000 BCE – 600 BCE):

  • Tokens → coins → standardised value encoding
  • Enabled knowledge specialist support through patronage
  • Decoupled intellectual labour from direct production

Epoch 5: Print and Industrial Systems (1440–1950 CE)

Printing Revolution (1440 CE):

  • Single manuscript → many printed copies
  • Standardised information → wider literacy
  • Sharply reduced unit cost of reproduction

Scientific-Industrial Complex (1620–1950 CE):

  • Dalton's atomic theory (1803) → chemical industry
  • Faraday's electromagnetic induction (1831) → electrical power
  • Mendeleev's periodic table (1869) → materials science
  • Einstein's $E = mc^2$ (1905) → nuclear technology

Electronic Communication (1837–1950):

  • Telegraph, telephone, radio, television
  • Single broadcast → very large audiences
  • Near-instantaneous global communication

Epoch 6: Digital and AI Systems (1945–Present)

Computing Era (1945–1990):

  • Mainframes → time-sharing → personal computers
  • Algorithms → automated processing at machine speed

Internet Era (1969–2010):

  • TCP/IP → HTTP/HTML → search engines
  • Protocol standards → global connectivity (billions of users)
  • Wikipedia, video platforms, social media → crowdsourced knowledge

Mobile Computing (2007–Present):

  • Smartphone apps → ubiquitous access (billions of devices)
  • Substantial improvements in computational performance per watt over earlier decades

Cryptocurrency Systems (2009–Present):

  • Bitcoin whitepaper (9 pages) → trustless value transfer
  • Mining network → global consensus at large continuous power draw
  • Energy efficiency relative to information output is very low

AI Era (2017–Present):

  • Training of frontier models is energy-intensive (orders of magnitude beyond a small server cluster)
  • Inference workloads now constitute a significant share of data-centre demand
  • Multi-agent systems → collaborative intelligence

Comparative Analysis Across Epochs

Energy Scale Evolution

Era Energy input (order) Entropy export Amplification factor Time scale
Genetic $\sim10^{17}$ W (solar) Metabolic heat $\sim10^{23}$ (cells/genome) Millions of years
Social per-animal metabolic Movement heat $\sim10^{2}$ (individuals/cue) Thousands of years
Tribal per-tribe metabolic Fire, tools $\sim10^{2}$ (people/narrative) Hundreds of years
Written per-document craft Mining, paper $\sim10^{6}$ (citizens/script) Decades
Industrial per-factory MW–GW CO₂, thermal $\sim10^{6}$ (products/design) Years
Digital global TWh-class IT load Heat, e-waste $\sim10^{9}$ (users/protocol) Months
AI global data-centre load Heat, cooling water $\sim10^{12}$ (interactions/model) Weeks

Numbers are order-of-magnitude indicators only.


Thermodynamic Principles

Dissipative Structures (Prigogine, 1977)

Knowledge systems are dissipative structures — they maintain order through continuous energy flow, like hurricanes or living cells:

  1. Far from equilibrium: require constant energy input
  2. Self-organisation: spontaneous pattern formation
  3. Entropy export: must dissipate waste to environment
  4. Structural stability: persist despite material flux

Landauer's Principle (1961)

Every irreversible computation requires a minimum energy:

$$E_{\min} = k_B T \ln 2 \approx 3 \times 10^{-21}\ \text{J per bit at room temperature.}$$

This sets a fundamental floor on the energy cost of erasing information — even an idealised computer must dissipate at least this much per bit erased. Practical computing is many orders of magnitude above this floor; closing that gap is a long-running research direction.

Schrödinger's Negentropy Concept (1944)

"Life feeds on negative entropy" — organisms import order (food, sunlight) and export disorder (waste, heat). Modern computational systems follow the same principle at digital scales.


Contemporary Challenges

Energy and Verification Constraints on AI

Two pressures dominate the current AI-systems regime:

  • Energy: Training and serving frontier models is increasingly constrained by data-centre power, cooling, and grid availability. The gap between practical energy-per-operation and the Landauer floor remains very large, leaving substantial room for architectural improvement.
  • Verification: As model outputs influence high-stakes decisions, the cost of checking what a system did rises. Verifiable, inspectable reasoning becomes a first-class requirement, not an afterthought.

Possible Architectural Directions

Public research literature explores a range of approaches to these constraints:

  • Near-term: specialised hardware for AI workloads, model compression (pruning, quantisation, distillation), neuromorphic architectures.
  • Medium-term: photonic neural networks, federated learning to distribute energy concentration, partially reversible computation.
  • Long-term: novel substrates (DNA storage, biological computation), bio-hybrid systems.

These directions are not predictions; they are the design space the field is exploring.


Conclusion

The AI era makes knowledge production increasingly energy-bound and verification-bound. The next transition is therefore likely to combine more efficient compute substrates with verifiable reasoning architectures.


References

  • Landauer, R. (1961). "Irreversibility and heat generation in the computing process." IBM Journal of Research and Development, 5(3), 183–191.
  • Prigogine, I. (1977). "Self-Organization in Nonequilibrium Systems." Wiley-Interscience.
  • Schrödinger, E. (1944). "What Is Life? The Physical Aspect of the Living Cell." Cambridge University Press.
  • Luccioni, A. S., et al. (2023). "Power Hungry Processing: Watts Driving the Cost of AI Deployment?" arXiv:2310.10399.
  • Patterson, D., et al. (2021). "Carbon Emissions and Large Neural Network Training." Stanford HAI.
  • Smil, V. (2017). "Energy and Civilization: A History." MIT Press.
  • Strubell, E., et al. (2019). "Energy and policy considerations for deep learning in NLP." ACL Proceedings.

Dr. Danil Gorinevski, cybiont GmbH, 2026


← Back to the notes index