The Physics, Information, and Computation of Perennial Learning: Kolmogorov Complexity, Information Distance, and Port-Hamiltonian Thermodynamics
Abstract
1. Introduction
1.1. Dedication
1.2. The Need for Perennial Learning
1.3. Computational Hardness: Why Perennial Refinement Is Necessary
1.4. Learning Has Its Own Hamiltonian
1.5. Contributions
- Status map
- (i)
- A clean separation between the ideal Kolmogorov-complexity description of model revision and the practical compressor/MDL surrogate used in optimization, together with a locally scoped calibration statement on the finite serialized model family actually visited by the curriculum.
- (ii)
- A PH “perennial inference engine” architecture that interprets reversible transport, dissipative forgetting, safety barriers, and Casimir locks as separate channels rather than as a single monolithic update.
- (iii)
- A standard passivity/dissipation estimate specialized to curriculum-induced feasible-set contraction, whose novelty lies in its interpretation as a rate limiter for safe curriculum scheduling.
- (iv)
- A local Fisher/Laplace/MDL proxy for posterior codelength contraction and a sequential information-budget proposition that clarifies when warm starts can reduce the effective per-stage sample requirement.
- (v)
- A concrete moving-obstacle double-integrator toy case study with NCD-gated curriculum scheduling, actual simulated trajectories, practical entropy/Fisher/overwrite diagnostics, a passivity-violation stress test, and a compact baseline table.
1.6. Roadmap
- Related works
2. Background, Scope, and Standing Assumptions
2.1. Description Complexity, Proxies, and Landauer Cost
2.2. Port-Hamiltonian Systems
2.3. Standing Assumptions
3. Thermodynamic Formulation of Perennial Learning
3.1. Ideal and Practical Lifelong Regularizers
3.2. From Algorithmic Rate Distortion to a Computable Optimal Control Problem (OCP)
3.3. Algorithmic Entropy as a Discovery Diagnostic
3.4. Maxwell’s Demon: The Learning Agent as a Thermodynamic Engine
3.5. Information Distance and the Geometry of Discovery
- (a)
- Task relatedness for curriculum design: measures how much the task structure changes between curriculum steps.
- (b)
- Model-change tracking: measures how much the PH solver parameters have changed; a large NID implies a higher Landauer cost.
- (c)
- Discovery progress: monotonically decreases, signaling convergence toward the ground truth.
- (d)
- Safe feasibility-set change: bounds the safe curriculum step size, where parameterizes the constraint geometry (defined in Example 1) and is the NCD-derived threshold (defined in Section 5.2).
4. The Perennial Inference Engine
4.1. Architecture
- Known: are given; low ; Casimir-protectable.
- Partial: some structure is given, some is learned; moderate .
- Unknown: everything is learned from data; high ; full Landauer cost.
4.2. Running Toy Example: A Planar Double Integrator with a Moving Obstacle
4.3. Safety Hierarchy: Casimir, Barrier, Dissipation
- (1)
- Instantaneous safety: at every t.
- (2)
- Transitional safety: when contracts, the system reaches the new set without leaving the old one during transition.
- (3)
- Informational safety: the agent’s model of the boundary has low enough K to reliably distinguish safe from unsafe.
- Casimir lock: a permanent structural invariant that should never be overwritten during normal operation.
- Barrier shaping: a tunable potential that changes when the geometry of the safe set changes.
- Dissipation shaping: a graded slowdown that governs how aggressively the state may approach the boundary.
- Mechanism 1: Dynamic potential shaping
- Mechanism 2: Anisotropic dissipation shaping
- Mechanism 3: Casimir invariants as hard safety constraints
4.4. Passivity Speed Limit for Constraint Tightening
4.5. Coverage, Identifiability, and Entropy Reduction Rate
5. Identifiability, Curriculum, and Sequential Complexity
5.1. Fisher Information as a Local Codelength Proxy
5.2. Curriculum Scheduling via NCD and NID
- Notations used in this section
- : the effective feasible set at curriculum time , defined in (29). Always written with the time argument; bare refers to the abstract set parameterized by , and (without subscript eff) is not used.
- : the admissible NCD threshold at step , derived from the Landauer budget and defined in (42).
- : practical conditional code length (not the algorithmic conditional ); defined precisely in (40).
- : total information change in bits at stage t (Proposition 5).
- : number of materially changed model coordinates at stage t (Assumption 4). These two quantities are distinct: under the warm-start assumptions.
- Implementation and sensitivity.
- The core problem
- Compression distance as a curriculum metric
- The budget constraint
- Curriculum Lifecycle
- Early phase (Phase 1 in Figure 4). The agent has limited knowledge of obstacle geometry, so it maintains a large, conservative that includes a generous safety margin. The feasibility set is over-sized: the agent trades performance for safety by staying well away from the true boundary. Algorithmically, entropy is high (many microstates consistent with the agent’s current model), and between consecutive observations is small because nothing surprising is happening. Transition trigger: once the Fisher information rises above (Assumption 4), the boundary geometry is sufficiently identified and Phase 2 begins.
- Learning phase (Phase 2). With reliable boundary estimates, the agent begins tightening —increasing barrier steepness (the scale in the log-barrier potential (31)) and raising boundary dissipation—so that contracts toward the true obstacle contour. Each contraction step satisfies (42): the model description shortens as uncertainty collapses, so falls. The Landauer cost per step is low because the agent is mostly deleting uncertainty (erasing microstates it has ruled out) rather than encoding a genuinely new structure. Transition trigger: stops decreasing, signaling that further tightening requires new observations rather than inference.
- Constraint shift (Phase 3). An external event changes the true constraint—in the double-integrator example, the obstacle moves (red arrow in Figure 4). This injects new information: the old model of obstacle geometry is partially wrong, so spikes. The scheduler checks (42): if the spike exceeds , the transition is broken into smaller steps. Even so, the agent must pay the Landauer cost (using the conditional code notation (40)) for each bit of old geometry it overwrites, and the rate of deformation of is bounded by Proposition 3. This is the phase in which the thermodynamic cost is highest, and the safety of curriculum ordering matters most.
- Steady state (Phase 4). The agent has re-adapted to the shifted obstacle and now tracks slow constraint drift with the amortized cost measurements per update (Corollary 1), conditional on the warm-start assumptions of Proposition 5, where is the number of parameters that have changed. The Casimir lock ensures that the invariants established in Phase 2 (momentum-budget conservation, obstacle-clearance structure) are not inadvertently erased during re-adaptation. oscillates gently around the moving boundary rather than making discrete jumps. Return trigger: if spikes again above , the lifecycle re-enters Phase 3.
5.3. Sequential Information Budget
5.4. Where Differential Policy Optimization (DPO) Fits
- Thermodynamic interpretation of the regret bound
- Inference regret: excess cost from suboptimal transport, i.e., from the speed–dissipation tradeoff (Remark 4).
- Safety regret: excess cost from unnecessary conservatism (staying too far from boundaries before learning constraint geometry).
- Stochastic extension
- State-constrained DPO for safety
- Bilevel optimization
6. Numerical Realization
6.1. Discretization, Passivity Margin, and Stiffness Handling
- Conservative step. Integrate using a symplectic method such as Störmer–Verlet or symplectic Euler [55].
- Dissipative/barrier step. Integrate with a discrete-gradient or average-vector-field method [56], so that a discrete passivity inequality is preserved at the substep level.
- Stochastic step. Integrate the Stratonovich noise using a midpoint/Heun-type method compatible with the chain rule [47].
6.2. Algorithmic Template
| Algorithm 1 Perennial PH learning with code-budgeted curriculum updates |
|
6.3. Validation Protocol
- Constrained autonomy
- Sequential data assimilation
6.4. Numerical Case Study: NCD-Gated Moving-Obstacle Curriculum
- Environment encoding and NCD gate
- Diagnostics
- Passivity stress test
- Compact baseline summary

| Method | Violation % | Min. Clearance | MPL | Measurements | Comment |
|---|---|---|---|---|---|
| PH + NCD | 0.00 | 0.043 | 3.771 | 49 | Admitted chain |
| Forced jump | 6.20 | −0.234 | 3.921 | 148 | Direct exceeds with lagged barrier |
| No barrier | 3.40 | −0.088 | 4.102 | 90 | Goal controller without log-barrier |
| Cold start | 0.00 | 0.063 | 4.073 | 1419 | Cubic cold-start identification baseline |
7. Integration and Validation
7.1. The Perennial Inference Engine: Full Assembly
- One inference step
- Observation (port ): new data arrives; the demon observes molecules.
- Transport (DPO update via ): the policy pushes the current posterior toward the updated posterior via the learned transport map. This is reversible (zero Landauer cost).
- Dissipation (Landauer cost via ): outdated beliefs are overwritten; energy per erased bit flows through the dissipation port. This is irreversible.
- Conservation (Casimir C): structural invariants—symmetries, sparsity patterns, discovered physical laws—are preserved automatically (no memory, no measurement, no Landauer cost).
- Safety check (energy certificate): is verified via passivity. If approaching the boundary, increases, slowing the trajectory.
- Headline quantitative result
7.2. Numerical Validation Protocol and Broader Benchmark Agenda
- Validation Class 1: Constrained autonomy and navigation
- Validation Class 2: Sequential inference and data assimilation
- Safety-specific evaluation metrics
- (i)
- Constraint-violation frequency.
- (ii)
- Minimum distance to constraint boundary over trajectory.
- (iii)
- Energy-budget utilization .
- (iv)
- Casimir-invariant drift .
- (v)
- Hausdorff distance between and .
8. Discussion, Limitations, and Conclusions
8.1. Thermodynamic Learning by Lifting, Reduction, and PH Model Enrichment
8.2. Limitations
- Proxy calibration is local and task-family dependent. Exact K is never optimized directly. Every implementable program statement uses or another explicit code proxy, and Assumption 3 is deliberately restricted to the finite serialized model family actually visited by the curriculum.
- The Fisher codelength proxy is local. Proposition 4 is a Laplace/MDL-style curvature diagnostic; it can fail for multimodal, aliased, or strongly misspecified posteriors.
- The comparison is conditional. Corollary 1 is relative to a chosen cold-start baseline and depends on warm start, active-block drift, and informative local curvature. It is not a universal law of perennial learning.
- NCD admissibility is a screen, not a guarantee. Curriculum gating is compressor- and representation-dependent. It becomes a validated metric only after correlation with transfer cost, overwrite cost, or safety violations is measured in the task family.
- Numerical stiffness and solver dependence remain practical constraints. Barriers near the boundary require regularization and careful time stepping, and DPO is one plausible transport solver rather than a theorem of necessity.
8.3. Open Problems
- Adaptive Casimir selection. Can the system learn which invariants to protect? If a constraint has not changed in N curriculum steps, , does this suggest promotion to Casimir status?
- The safety–identifiability frontier. Does the optimal boundary dissipation minimize the total Landauer cost, balancing extra-measurement cost (poor identifiability at high ) against constraint-violation recovery cost (at low )?
- Tighter regret under PH structure. Does the bound tighten when DPO is applied to a system with known PH structure?
- Quantum extension. How can we extend the framework to quantum inference, where K of pure quantum states replaces classical K ([32], §8.7)?
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
Appendix A. Reproducibility Details for the NCD-Gated Toy Case Study
- Primary compressor
- Canonical serialization
- Toy simulation hyperparameters
- Curriculum thresholds and outcomes
References
- Li, M.; Vitányi, P.M. A new approach to formal language theory by Kolmogorov complexity. SIAM J. Comput. 1995, 24, 398–410. [Google Scholar] [CrossRef]
- Li, M.; Vitányi, P.M. Reversibility and adiabatic computation: Trading time and space for energy. Proc. R. Soc. Lond. Ser. A Math. Phys. Eng. Sci. 1996, 452, 769–789. [Google Scholar]
- Li, M.; Chen, X.; Li, X.; Ma, B.; Vitányi, P.M.B. The Similarity Metric. IEEE Trans. Inf. Theory 2004, 50, 3250–3264. [Google Scholar] [CrossRef]
- Vereshchagin, N.K.; Vitányi, P.M. Kolmogorov’s structure functions and model selection. IEEE Trans. Inf. Theory 2004, 50, 3265–3290. [Google Scholar] [CrossRef]
- Cilibrasi, R.; Vitányi, P.M.B. Clustering by Compression. IEEE Trans. Inf. Theory 2005, 51, 1523–1545. [Google Scholar] [CrossRef]
- Parisi, G.I.; Kemker, R.; Part, J.L.; Kanan, C.; Wermter, S. Continual Lifelong Learning with Neural Networks: A Review. Neural Netw. 2019, 113, 54–71. [Google Scholar] [CrossRef]
- Thrun, S.; Mitchell, T.M. Lifelong Robot Learning. Robot. Auton. Syst. 1995, 15, 25–46. [Google Scholar] [CrossRef]
- Ellendula, A.S.; Wang, Y.; Nguyen, M.P.; Bajaj, C.L. GRL-SNAM: Geometric Reinforcement Learning with Differential Hamiltonians for Navigation and Mapping in Unknown Environments. In Proceedings of the Fourteenth International Conference on Learning Representations, Rio de Janeiro, Brazil, 23–26 April 2026. [Google Scholar]
- Kirkpatrick, J.; Pascanu, R.; Rabinowitz, N.; Veness, J.; Desjardins, G.; Rusu, A.A.; Milan, K.; Quan, J.; Ramalho, T.; Grabska-Barwinska, A.; et al. Overcoming Catastrophic Forgetting in Neural Networks. Proc. Natl. Acad. Sci. USA 2017, 114, 3521–3526. [Google Scholar] [CrossRef] [PubMed]
- Zenke, F.; Poole, B.; Ganguli, S. Continual Learning Through Synaptic Intelligence. In Proceedings of the 34th International Conference on Machine Learning, Sydney, Australia, 6–11 August 2017; pp. 3987–3995. [Google Scholar]
- Lopez-Paz, D.; Ranzato, M. Gradient Episodic Memory for Continual Learning. In Proceedings of the Advances in Neural Information Processing Systems 30, Long Beach, CA, USA, 4–9 December 2017; pp. 6467–6476. [Google Scholar]
- Schwarz, J.; Luketina, J.; Czarnecki, W.M.; Grabska-Barwinska, A.; Teh, Y.W.; Pascanu, R.; Gretton, A. Progress & Compress: A Scalable Framework for Continual Learning. In Proceedings of the 35th International Conference on Machine Learning, Stockholm, Sweden, 10–15 July 2018; pp. 4528–4537. [Google Scholar]
- Reif, J.H. Complexity of the Mover’s Problem and Generalizations. In Proceedings of the 20th Annual Symposium on Foundations of Computer Science, San Juan, Puerto Rico, 29–31 October 1979; pp. 421–427. [Google Scholar] [CrossRef]
- Canny, J.F. The Complexity of Robot Motion Planning; MIT Press: Cambridge, MA, USA, 1988. [Google Scholar]
- Goldstein, H.; Poole, C.; Safko, J. Classical Mechanics, 3rd ed.; Addison-Wesley: San Francisco, CA, USA, 2001. [Google Scholar]
- Arnold, V.I. Mathematical Methods of Classical Mechanics, 2nd ed.; Springer: New York, NY, USA, 1989. [Google Scholar] [CrossRef]
- Still, S.; Sivak, D.A.; Bell, A.J.; Crooks, G.E. Thermodynamics of Prediction. Phys. Rev. Lett. 2012, 109, 120604. [Google Scholar] [CrossRef]
- Wolpert, D.H. Information Theory—The Bridge Connecting Bounded Rational Game Theory and Statistical Physics. In Complex Engineered Systems; Understanding Complex Systems; Springer: Berlin/Heidelberg, Germany, 2006; pp. 262–290. [Google Scholar] [CrossRef]
- Parrondo, J.M.R.; Horowitz, J.M.; Sagawa, T. Thermodynamics of Information. Nat. Phys. 2015, 11, 131–139. [Google Scholar] [CrossRef]
- van der Schaft, A.; Jeltsema, D. Port-Hamiltonian Systems Theory: An Introductory Overview. Found. Trends Syst. Control 2014, 1, 173–378. [Google Scholar] [CrossRef]
- Todorov, E. Efficient Computation of Optimal Actions. Proc. Natl. Acad. Sci. USA 2009, 106, 11478–11483. [Google Scholar] [CrossRef]
- Kappen, H.J. Path Integrals and Symmetry Breaking for Optimal Control Theory. J. Stat. Mech. Theory Exp. 2005, 2005, P11011. [Google Scholar] [CrossRef]
- Eysenbach, B.; Levine, S. Maximum Entropy RL (Provably) Solves Some Robust RL Problems. In Proceedings of the International Conference on Learning Representations, Virtual, 25 April 2022. [Google Scholar]
- Berkenkamp, F.; Turchetta, M.; Schoellig, A.P.; Krause, A. Safe Model-Based Reinforcement Learning with Stability Guarantees. In Proceedings of the Advances in Neural Information Processing Systems 30, Long Beach, CA, USA, 4–9 December 2017; pp. 908–918. [Google Scholar]
- Fisac, J.F.; Akametalu, A.K.; Zeilinger, M.N.; Kaynama, S.; Gillula, J.; Tomlin, C.J. A General Safety Framework for Learning-Based Control in Uncertain Robotic Systems. IEEE Trans. Autom. Control 2019, 64, 2737–2752. [Google Scholar] [CrossRef]
- Desai, S.A.; Mattheakis, M.; Sondak, D.; Protopapas, P.; Roberts, S.J. Port-Hamiltonian Neural Networks for Learning Explicit Time-Dependent Dynamical Systems. Phys. Rev. E 2021, 104, 034312. [Google Scholar] [CrossRef]
- Sosanya, A.; Greydanus, S. Dissipative Hamiltonian Neural Networks: Learning Dissipative and Conservative Dynamics Separately. arXiv 2022, arXiv:2201.10085. [Google Scholar] [CrossRef]
- Zhong, Y.D.; Dey, B.; Chakraborty, A. Dissipative SymODEN: Encoding Hamiltonian Dynamics with Dissipation and Control into Deep Learning. arXiv 2020, arXiv:2002.08860. [Google Scholar] [CrossRef]
- Bajaj, C. Proving geometric algorithm non-solvability: An application of factoring polynomials. J. Symb. Comput. 1986, 2, 99–102. [Google Scholar] [CrossRef]
- Bajaj, C. Geometric optimization and the polynomial hierarchy. Theor. Comput. Sci. 1987, 54, 87–102. [Google Scholar] [CrossRef]
- Bajaj, C.; Li, M. Geometric optimization and DP-completeness. Discret. Comput. Geom. 1989, 4, 3–13. [Google Scholar] [CrossRef]
- Li, M.; Vitányi, P.M.B. An Introduction to Kolmogorov Complexity and Its Applications, 4th ed.; Springer: Cham, Switzerland, 2019. [Google Scholar] [CrossRef]
- Vitányi, P.M. Conditional Kolmogorov complexity and universal probability. Theor. Comput. Sci. 2013, 505, 93–100. [Google Scholar] [CrossRef]
- Grünwald, P.D. The Minimum Description Length Principle; MIT Press: Cambridge, MA, USA, 2007. [Google Scholar]
- Landauer, R. Irreversibility and Heat Generation in the Computing Process. IBM J. Res. Dev. 1961, 5, 183–191. [Google Scholar] [CrossRef]
- Bennett, C.H. The Thermodynamics of Computation—A Review. Int. J. Theor. Phys. 1982, 21, 905–940. [Google Scholar] [CrossRef]
- Fredkin, E.; Toffoli, T. Conservative Logic. Int. J. Theor. Phys. 1982, 21, 219–253. [Google Scholar] [CrossRef]
- Boyd, A.B.; Mandal, D.; Riechers, P.M.; Crutchfield, J.P. Transient Dissipation and Structural Costs of Physical Information Transduction. Phys. Rev. Lett. 2017, 118, 220602. [Google Scholar] [CrossRef]
- Morrison, P.J. A paradigm for joined Hamiltonian and dissipative systems. Phys. D Nonlinear Phenom. 1986, 18, 410–419. [Google Scholar] [CrossRef]
- Bajaj, C. Computer Algebra Meets Hamiltonian Geometry. Maple Trans. 2026, 6, 24248. [Google Scholar] [CrossRef]
- Li, M.; Vitányi, P.M.B. Inductive reasoning and Kolmogorov complexity. J. Comput. Syst. Sci. 1992, 44, 343–384. [Google Scholar] [CrossRef]
- Nguyen, M.P.; Bajaj, C.L. A Differential and Pointwise Control Approach to Reinforcement Learning. In Proceedings of the Thirty-Ninth Annual Conference on Neural Information Processing Systems, San Diego, CA, USA, 2–7 December 2025. [Google Scholar]
- Boyd, A.B.; Mandal, D.; Crutchfield, J.P. Thermodynamics of Modularity: Structural Costs Beyond the Landauer Bound. Phys. Rev. X 2018, 8, 031036. [Google Scholar] [CrossRef]
- Crutchfield, J.P. The calculi of emergence: Computation, dynamics and induction. Phys. D Nonlinear Phenom. 1994, 75, 11–54. [Google Scholar] [CrossRef]
- Szilard, L. Über die Entropieverminderung in einem thermodynamischen System bei Eingriffen intelligenter Wesen. Z. Phys. 1929, 53, 840–856. [Google Scholar] [CrossRef]
- Boyd, A.B.; Mandal, D.; Crutchfield, J.P. Correlation-Powered Information Engines and the Thermodynamics of Self-Correction. Phys. Rev. E 2017, 95, 012152. [Google Scholar] [CrossRef]
- Lázaro-Camí, J.A.; Ortega, J.P. Stochastic Hamiltonian Dynamical Systems. Rep. Math. Phys. 2008, 61, 65–122. [Google Scholar] [CrossRef]
- Cordoni, F.; Di Persio, L.; Muradore, R. Stochastic port-Hamiltonian systems. J. Nonlinear Sci. 2022, 32, 91. [Google Scholar] [CrossRef]
- Bhardwaj, S.; Bajaj, C. PHAST: Port-Hamiltonian Architecture for Structured Temporal Dynamics Forecasting. arXiv 2026, arXiv:2602.17998. [Google Scholar] [CrossRef]
- Hsu, A.S.; Chater, N.; Vitányi, P. Language Learning From Positive Evidence, Reconsidered: A Simplicity-Based Approach. Top. Cogn. Sci. 2013, 5, 35–55. [Google Scholar] [CrossRef]
- Nguyen, M.P.; Bajaj, C.L. Stochastic Differential Policy Optimization: A Rough Path Approach to Reinforcement Learning. In Proceedings of the Workshop on the Theory of AI for Scientific Computing, Lyon, France, 30 June 2025. [Google Scholar]
- Pontryagin, L.S.; Boltyanskii, V.G.; Gamkrelidze, R.V.; Mishchenko, E.F. The Mathematical Theory of Optimal Processes; Interscience: New York, NY, USA, 1962. [Google Scholar]
- Bajaj, C.; Nguyen, M. Physics-informed neural networks via stochastic hamiltonian dynamics learning. In Proceedings of the Intelligent Systems Conference; Springer: Berlin/Heidelberg, Germany, 2024; pp. 182–197. [Google Scholar]
- Baez, J.C.; Stay, M. Algorithmic Thermodynamics. Math. Struct. Comput. Sci. 2012, 22, 771–787. [Google Scholar] [CrossRef]
- Hairer, E.; Lubich, C.; Wanner, G. Geometric Numerical Integration: Structure-Preserving Algorithms for Ordinary Differential Equations, 2nd ed.; Springer: Berlin/Heidelberg, Germany, 2006. [Google Scholar] [CrossRef]
- Quispel, G.R.W.; McLaren, D.I. A New Class of Energy-Preserving Numerical Integration Methods. J. Phys. A Math. Theor. 2008, 41, 045206. [Google Scholar] [CrossRef]
- McLennan, L.; Wang, Y.; Farell, R.; Nguyen, M.; Bajaj, C. Learning Generalized Hamiltonian Dynamics with Stability from Noisy Trajectory Data. arXiv 2025, arXiv:2509.07280. [Google Scholar] [CrossRef]
- Evensen, G. Data Assimilation: The Ensemble Kalman Filter, 2nd ed.; Springer: Berlin/Heidelberg, Germany, 2009. [Google Scholar] [CrossRef]
- Reich, S.; Cotter, C. Probabilistic Forecasting and Bayesian Data Assimilation; Cambridge University Press: Cambridge, UK, 2015. [Google Scholar] [CrossRef]
- Ellendula, A.S.; Wang, Y.; Bajaj, C.L. Learning Material-Aware Hamiltonian Risk Fields for Safe Navigation. arXiv 2026, arXiv:2605.07038. [Google Scholar] [CrossRef]
- Wang, Y.; Bajaj, C. When Descent Is Too Stable: Event-Triggered Hamiltonian Learning to Optimize. arXiv 2026, arXiv:2605.06868. [Google Scholar] [CrossRef]
- Bédard, C.; Bergeron, D. An Algorithmic Approach to Emergence. Entropy 2022, 24, 985. [Google Scholar] [CrossRef] [PubMed]







| Demon Component | Learning-Agent Analog |
|---|---|
| Demon’s memory | Meta-parameters (policy/design variables) |
| Measuring molecule speed | Observing data |
| Sorting fast/slow | Updating posterior |
| Memory erasure | Overwriting old beliefs () |
| Heat-bath temperature T | Environment stochasticity |
| Szilard cycle | One episode of online learning |
| PH Term | Thermodynamic Role | Kolmogorov Cost |
|---|---|---|
| (symplectic) | Reversible inference; reorganizes beliefs | Zero Landauer cost |
| (dissipation) | Irreversible forgetting; erases outdated beliefs | per erased bit |
| (control port) | Observation intake; new data enters the system | Information gain |
| (noise port) | Environmental stochasticity | Irreducible uncertainty |
| Type | Enforcement | Landauer Cost | Use for |
|---|---|---|---|
| Casimir () | Permanent, exact | Zero | Conservation laws, hard actuator limits |
| Barrier () | Soft, adjustable | per update | Moving obstacles, tightening clearances |
| Dissipation () | Graduated slowdown | Moderate (operating cost) | Safety margins, approach speed limits |
| Component | Static Learner | Perennial PH + DPO |
|---|---|---|
| Proposal Q | Fixed product distribution | Learned transport via DPO |
| Search space | Discrete subset | Casimir-locked phase space |
| Safety | Post-hoc constraint check | Energy certificate + barrier + dissipation |
| Scaling | (global) | (warm-start, conditional) |
| Forgetting cost | Uncontrolled (catastrophic) | Budgeted by |
| Structure | None | Casimir invariants (zero Landauer cost) |
| Curriculum | None | NID/NCD-guided feasibility-set evolution |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Bajaj, C. The Physics, Information, and Computation of Perennial Learning: Kolmogorov Complexity, Information Distance, and Port-Hamiltonian Thermodynamics. Entropy 2026, 28, 551. https://doi.org/10.3390/e28050551
Bajaj C. The Physics, Information, and Computation of Perennial Learning: Kolmogorov Complexity, Information Distance, and Port-Hamiltonian Thermodynamics. Entropy. 2026; 28(5):551. https://doi.org/10.3390/e28050551
Chicago/Turabian StyleBajaj, Chandrajit. 2026. "The Physics, Information, and Computation of Perennial Learning: Kolmogorov Complexity, Information Distance, and Port-Hamiltonian Thermodynamics" Entropy 28, no. 5: 551. https://doi.org/10.3390/e28050551
APA StyleBajaj, C. (2026). The Physics, Information, and Computation of Perennial Learning: Kolmogorov Complexity, Information Distance, and Port-Hamiltonian Thermodynamics. Entropy, 28(5), 551. https://doi.org/10.3390/e28050551

