Symbolic Regression for Air Transport Delay Analysis: A Viable Alternative to Classical Approaches?
Abstract
1. Introduction
2. Materials and Methods
2.1. Delay Data
2.2. Symbolic Regression
3. Analysis of Individual Airports
4. Analysis of Delay Propagation Between Airports
5. Discussion and Conclusions
Funding
Data Availability Statement
Conflicts of Interest
Abbreviations
| EGLL | London Heathrow |
| LFPG | Paris Charles de Gaulle |
| EHAM | Amsterdam Schiphol |
| EDDF | Frankfurt |
| LEMD | Madrid Barajas |
| LEBL | Barcelona El Prat International |
| EDDM | Munich |
| EGKK | London Gatwick |
| LIRF | Rome Fiumicino |
| LFPO | Paris Orly |
| EIDW | Dublin |
| LSZH | Zurich |
| EKCH | Copenhagen |
| LEPA | Palma De Mallorca |
| LPPT | Lisboa |
| ENGM | Oslo Gardermoen |
| EGCC | Manchester |
| EGSS | London Stansted |
| LOWW | Vienna International Airport |
| ESSA | Stockholm Arlanda |
References
- Cook, A.; Tanner, G.; Williams, V.; Meise, G. Dynamic cost indexing–Managing airline delay costs. J. Air Transp. Manag. 2009, 15, 26–35. [Google Scholar] [CrossRef]
- Britto, R.; Dresner, M.; Voltes, A. The impact of flight delays on passenger demand and societal welfare. Transp. Res. Part E Logist. Transp. Rev. 2012, 48, 460–469. [Google Scholar] [CrossRef]
- Wang, Y.; Cao, Y.; Zhu, C.; Wu, F.; Hu, M.; Duong, V.; Watkins, M.; Barzel, B.; Stanley, H.E. Universal patterns in passenger flight departure delays. Sci. Rep. 2020, 10, 6890. [Google Scholar] [CrossRef] [PubMed]
- Mitsokapas, E.; Schäfer, B.; Harris, R.J.; Beck, C. Statistical characterization of airplane delays. Sci. Rep. 2021, 11, 7855. [Google Scholar] [CrossRef]
- Sun, L.L.; Hu, Y.P.; Zhu, C.P. Scaling invariance in domestic passenger flight delays in the United States. Phys. A Stat. Mech. Its Appl. 2023, 611, 128445. [Google Scholar] [CrossRef]
- Wu, Y.; Wu, J.; Zhang, Y.; Cui, Y.; Zhang, Q.; Sun, L. Universality, criticality and complexity of flight delay propagation. Phys. A Stat. Mech. Its Appl. 2025, 683, 131209. [Google Scholar] [CrossRef]
- Olivares, F.; Sun, X.; Wandelt, S.; Zanin, M. Measuring landing independence and interactions using statistical physics. Transp. Res. Part E Logist. Transp. Rev. 2023, 170, 102998. [Google Scholar] [CrossRef]
- Olivares, F.; Zanin, M. Beyond Classical Models: Statistical Physics Tools for the Analysis of Time Series in Modern Air Transport. arXiv 2025, arXiv:2507.20927. [Google Scholar] [CrossRef]
- Tang, Z.; Huang, S.; Han, S. Recent progress about flight delay under complex network. Complexity 2021, 2021, 5513093. [Google Scholar] [CrossRef]
- Cai, Q.; Alam, S.; Duong, V.N. A spatial–temporal network perspective for the propagation dynamics of air traffic delays. Engineering 2021, 7, 452–464. [Google Scholar] [CrossRef]
- Gurtner, G.; Delgado, L.; Valput, D. An agent-based model for air transportation to capture network effects in assessing delay management mechanisms. Transp. Res. Part C Emerg. Technol. 2021, 133, 103358. [Google Scholar] [CrossRef]
- Gil-Rodrigo, S.; Zanin, M. Low Cost Carriers induce specific and identifiable delay propagation patterns: An analysis of the EU and US systems. IEEE Access 2024, 12, 75323–75336. [Google Scholar] [CrossRef]
- Zanin, M. Reconstructing functional networks of air transport delay propagations with minimal information. Phys. A Stat. Mech. Its Appl. 2025, 659, 130318. [Google Scholar] [CrossRef]
- Zanin, M.; Belkoura, S.; Zhu, Y. Network analysis of Chinese air transport delay propagation. Chin. J. Aeronaut. 2017, 30, 491–499. [Google Scholar] [CrossRef]
- Mazzarisi, P.; Zaoli, S.; Lillo, F.; Delgado, L.; Gurtner, G. New centrality and causality metrics assessing air traffic network interactions. J. Air Transp. Manag. 2020, 85, 101801. [Google Scholar] [CrossRef]
- Truong, D. Using causal machine learning for predicting the risk of flight delays in air transportation. J. Air Transp. Manag. 2021, 91, 101993. [Google Scholar] [CrossRef]
- Jia, Z.; Cai, X.; Hu, Y.; Ji, J.; Jiao, Z. Delay propagation network in air transport systems based on refined nonlinear Granger causality. Transp. B Transp. Dyn. 2022, 10, 586–598. [Google Scholar] [CrossRef]
- Pastorino, L.; Zanin, M. Air delay propagation patterns in Europe from 2015 to 2018: An information processing perspective. J. Phys. Complex. 2022, 3, 015001. [Google Scholar] [CrossRef]
- Zhu, Q.; Chen, S.; Guo, T.; Lv, Y.; Du, W. A spatio-temporal approach with self-corrective causal inference for flight delay prediction. IEEE Trans. Intell. Transp. Syst. 2024, 25, 20820–20831. [Google Scholar] [CrossRef]
- Yu, B.; Guo, Z.; Asian, S.; Wang, H.; Chen, G. Flight delay prediction for commercial air transport: A deep learning approach. Transp. Res. Part E Logist. Transp. Rev. 2019, 125, 203–221. [Google Scholar] [CrossRef]
- Gui, G.; Liu, F.; Sun, J.; Yang, J.; Zhou, Z.; Zhao, D. Flight delay prediction based on aviation big data and machine learning. IEEE Trans. Veh. Technol. 2019, 69, 140–150. [Google Scholar] [CrossRef]
- Tan, X.; Liu, Y.; Liu, D.; Zhu, D.; Zeng, W.; Wang, H. An attention-based deep convolution network for mining airport delay propagation causality. Appl. Sci. 2022, 12, 10433. [Google Scholar] [CrossRef]
- Qu, J.; Wu, S.; Zhang, J. Flight delay propagation prediction based on deep learning. Mathematics 2023, 11, 494. [Google Scholar] [CrossRef]
- Zhong, Q.; Yu, Y.; Huang, Y.; Zhang, T. Prediction and optimization of civil aviation flight delays based on machine learning algorithms. Int. J. Comput. Intell. Syst. 2025, 18, 189. [Google Scholar] [CrossRef]
- Langley, P. Data-driven discovery of physical laws. Cogn. Sci. 1981, 5, 31–54. [Google Scholar] [CrossRef]
- Langley, P. Scientific Discovery: Computational Explorations of the Creative Processes; MIT Press: Cambridge, MA, USA, 1987. [Google Scholar]
- Koza, J.R. Genetic programming as a means for programming computers by natural selection. Stat. Comput. 1994, 4, 87–112. [Google Scholar] [CrossRef]
- Schmidt, M.; Lipson, H. Distilling free-form natural laws from experimental data. Science 2009, 324, 81–85. [Google Scholar] [CrossRef]
- La Cava, W.; Burlacu, B.; Virgolin, M.; Kommenda, M.; Orzechowski, P.; de França, F.O.; Jin, Y.; Moore, J.H. Contemporary symbolic regression methods and their relative performance. Adv. Neural Inf. Process. Syst. 2021, 2021, 1. [Google Scholar]
- Makke, N.; Chawla, S. Interpretable scientific discovery with symbolic regression: A review. Artif. Intell. Rev. 2024, 57, 2. [Google Scholar] [CrossRef]
- Kronberger, G.; Burlacu, B.; Kommenda, M.; Winkler, S.M.; Affenzeller, M. Symbolic Regression; Chapman and Hall/CRC: Boca Raton, FL, USA, 2024. [Google Scholar]
- Wu, C.; Zhang, Y. Enhancing the shear-stress-transport turbulence model with symbolic regression: A generalizable and interpretable data-driven approach. Phys. Rev. Fluids 2023, 8, 084604. [Google Scholar] [CrossRef]
- Song, H.; Ma, M.; Yi, C.; Shao, Z.; Bai, R.; Yan, C. An innovative modification to the Menter shear-stress transport turbulence model employing the symbolic regression approach. Phys. Fluids 2024, 36, 065108. [Google Scholar] [CrossRef]
- Ding, D.; Wang, Q.; Chen, Q.; He, L. Symbolic Regression-Based Modeling for Aerodynamic Ground-to-Flight Deviation Laws of Aerospace Vehicles. Aerospace 2025, 12, 455. [Google Scholar] [CrossRef]
- Wu, C.; Zhang, S.; Zhang, Y. Development of a generalizable data-driven turbulence model: Conditioned field inversion and symbolic regression. AIAA J. 2025, 63, 687–706. [Google Scholar] [CrossRef]
- Kefalas, M.; Santiago Rojo, J.d., Jr.; Apostolidis, A.; Van Den Herik, D.; van Stein, B.; Bäck, T. Explainable artificial intelligence for exhaust gas temperature of turbofan engines. J. Aerosp. Inf. Syst. 2022, 19, 447–454. [Google Scholar] [CrossRef]
- Sun, Y.P.; Feng, H.; Zheng, B.; Wen, J.R.; Chao, A.F.; Fei, C.W. Multi-Agent Reinforcement Symbolic Regression for the Fatigue Life Prediction of Aircraft Landing Gear. Aerospace 2025, 12, 718. [Google Scholar] [CrossRef]
- Brandejsky, T. The Train Delay Model Developed by the Genetic Programming Algorithm. J. Adv. Transp. 2022, 2022, 8858756. [Google Scholar] [CrossRef]
- Diebold, F.X. Elements of Forecasting; South-Western College Pub.: Cincinnati, OH, USA, 1998. [Google Scholar]
- Granger, C.W. Investigating causal relations by econometric models and cross-spectral methods. Econom. J. Econom. Soc. 1969, 37, 424–438. [Google Scholar] [CrossRef]
- Granger, C.W. Some recent development in a concept of causality. J. Econom. 1988, 39, 199–211. [Google Scholar] [CrossRef]
- Acharya, K.; Zanin, M. Information decomposition of airport dynamics: A study of Europe and US. Transp. Res. Interdiscip. Perspect. 2025, 31, 101466. [Google Scholar] [CrossRef]
- Olivares, F.; Zanin, M. Quantifying deviations from Gaussianity with application to flight delay distributions. Entropy 2025, 27, 354. [Google Scholar] [CrossRef]
- Du, W.; Zhu, Q.; Chen, S.; Guo, T.; Zhu, Y. Flight Delay Prediction via Learning Long Short-Term Relationship Between Airports. IEEE Intell. Transp. Syst. Mag. 2024, 17, 6–19. [Google Scholar] [CrossRef]
- Wandelt, S.; Chen, X.; Sun, X. Networks for flight delay analysis: A scoping review and research agenda. IEEE Trans. Netw. Sci. Eng. 2025, 12, 1250–1266. [Google Scholar] [CrossRef]
- Noble, W.S. What is a support vector machine? Nat. Biotechnol. 2006, 24, 1565–1567. [Google Scholar] [CrossRef]
- Tian, Y.; Zhou, W.; Viscione, M.; Dong, H.; Kammer, D.S.; Fink, O. Interactive symbolic regression with co-design mechanism through offline reinforcement learning. Nat. Commun. 2025, 16, 3930. [Google Scholar] [CrossRef]
- Guimerà, R.; Sales-Pardo, M. Bayesian symbolic regression: Automated equation discovery from a physicist’s perspective. Philos. Trans. R. Soc. A Math. Phys. Eng. Sci. 2026, 384, 20250089. [Google Scholar] [CrossRef]
- Kamienny, P.A.; Lample, G.; Lamprier, S.; Virgolin, M. Deep generative symbolic regression with Monte-Carlo-tree-search. In Proceedings of the International Conference on Machine Learning, Honolulu, HI, USA, 23–29 July 2023; Curran Associates, Inc.: Red Hook, NY, USA, 2023; pp. 15655–15668. [Google Scholar]
- Cranmer, M. Interpretable machine learning for science with PySR and SymbolicRegression.jl. arXiv 2023, arXiv:2305.01582. [Google Scholar]
- Nocedal, J.; Wright, S.J. Numerical Optimization; Springer: Berlin/Heidelberg, Germany, 2006. [Google Scholar]
- Natekin, A.; Knoll, A. Gradient boosting machines, a tutorial. Front. Neurorobot. 2013, 7, 63623. [Google Scholar] [CrossRef]
- Bentéjac, C.; Csörgő, A.; Martínez-Muñoz, G. A comparative analysis of gradient boosting algorithms. Artif. Intell. Rev. 2021, 54, 1937–1967. [Google Scholar] [CrossRef]
- Breiman, L. Random forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef]
- Biau, G.; Scornet, E. A random forest guided tour. Test 2016, 25, 197–227. [Google Scholar] [CrossRef]
- Sarker, M.K.; Zhou, L.; Eberhart, A.; Hitzler, P. Neuro-symbolic artificial intelligence: Current trends. AI Commun. 2022, 34, 197–209. [Google Scholar] [CrossRef]
- Bhuyan, B.P.; Ramdane-Cherif, A.; Tomar, R.; Singh, T. Neuro-symbolic artificial intelligence: A survey. Neural Comput. Appl. 2024, 36, 12809–12844. [Google Scholar] [CrossRef]
- Cranmer, M.; Sanchez Gonzalez, A.; Battaglia, P.; Xu, R.; Cranmer, K.; Spergel, D.; Ho, S. Discovering symbolic models from deep learning with inductive biases. Adv. Neural Inf. Process. Syst. 2020, 33, 17429–17442. [Google Scholar]
- Wei, W.; Lu, Q.; Huang, C.; Luo, J. Refining Neural Network with Symbolic Regression. In Proceedings of the Genetic and Evolutionary Computation Conference Companion, Malaga, Spain, 14–18 July 2025; pp. 923–926. [Google Scholar]
- Neill, J.O. An overview of neural network compression. arXiv 2020, arXiv:2006.03669. [Google Scholar] [CrossRef]







| Type | Operators |
|---|---|
| Unary operators | cos, sin, exp, , relu, erf |
| Binary operators | +, −, ×, / |
| Airport | = | MSE | rMSE | Z-Score |
|---|---|---|---|---|
| EGLL | 923,917 | 1,194,536 | −24.91 | |
| LFPG | 366,611 | 374,345 | −11.55 | |
| EHAM | 328,457 | 581,255 | −15.81 | |
| EDDF | 236,559 | 333,209 | −9.62 | |
| LEMD | 425,536 | 444,047 | −2.39 | |
| LEBL | 409,199 | −11.94 | ||
| EDDM | 185,971 | 201,343 | −4.64 | |
| EGKK | 1,114,729 | 1,661,682 | −2.69 | |
| LIRF | 311,055 | 319,162 | −4.53 | |
| LFPO | 245,960 | −13.70 | ||
| EIDW | 314,252 | −16.00 | ||
| LSZH | 203,443 | 222,616 | −14.40 | |
| EKCH | 158,183 | 160,043 | −5.14 | |
| LEPA | 251,191 | 334,020 | −11.72 | |
| LPPT | 344,055 | 434,945 | −13.62 | |
| ENGM | 301,199 | 316,442 | −6.39 | |
| EGCC | 372,992 | 402,244 | −15.70 | |
| EGSS | 290,065 | 342,221 | −18.31 | |
| LOWW | 410,424 | 423,151 | −4.15 | |
| ESSA | 210,976 | 217,236 | −5.60 |
| Source | Destination | Equation | MSE | rMSE | Z-Score |
|---|---|---|---|---|---|
| EDDF | LSZH | 199,602 | 234,105 | −31.09 | |
| EDDF | LOWW | 409,969 | 425,604 | −14.75 | |
| EDDM | LFPG | 375,623 | 382,648 | −10.75 | |
| EDDF | LFPG | 372,717 | 381,726 | −8.02 | |
| LSZH | EGLL | 904,237 | 1,227,787 | −5.91 | |
| LFPO | EGLL | 917,219 | 1,294,352 | −5.63 | |
| LSZH | LEMD | 407,774 | 461,389 | −5.47 | |
| EGLL | LIRF | 300,818 | 310,930 | −4.67 | |
| EDDM | LOWW | 409,724 | 424,019 | ||
| LSZH | LIRF | 302,063 | 353,367 | −3.86 |
| Equation | MSE | rMSE |
|---|---|---|
| 903,928 | 923,596 | |
| 292,575 | 299,627 | |
| 203,428 | 203,428 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Zanin, M. Symbolic Regression for Air Transport Delay Analysis: A Viable Alternative to Classical Approaches? Aerospace 2026, 13, 535. https://doi.org/10.3390/aerospace13060535
Zanin M. Symbolic Regression for Air Transport Delay Analysis: A Viable Alternative to Classical Approaches? Aerospace. 2026; 13(6):535. https://doi.org/10.3390/aerospace13060535
Chicago/Turabian StyleZanin, Massimiliano. 2026. "Symbolic Regression for Air Transport Delay Analysis: A Viable Alternative to Classical Approaches?" Aerospace 13, no. 6: 535. https://doi.org/10.3390/aerospace13060535
APA StyleZanin, M. (2026). Symbolic Regression for Air Transport Delay Analysis: A Viable Alternative to Classical Approaches? Aerospace, 13(6), 535. https://doi.org/10.3390/aerospace13060535
