Air Traffic Demand Forecasting for Origin–Destination Airport Pairs Using Artificial Intelligence †
Abstract
1. Introduction
2. Materials and Methods
2.1. Data Sources and Preprocessing
2.2. General Modeling and Evaluation Strategy
2.2.1. Phase 1: Pre-COVID Benchmarking
2.2.2. Phase 2: Post-COVID Resilience Test
2.2.3. Evaluation Metrics
2.3. Model Prototypes
2.3.1. Initial Developments: SARIMA and Custom Neural Network
- SARIMA (Local Univariate): The first approach was to treat each OD pair as an independent time series using the Seasonal AutoRegressive Integrated Moving Average (SARIMA) model [8]. SARIMA is a well-established statistical benchmark that explicitly models trend, seasonality, and autoregressive components. While a strong baseline, this local approach has limitations: it is computationally expensive to fit thousands of individual models, it cannot learn patterns from other similar routes and the basic implementation is purely univariate and therefore unable to incorporate exogenous factors, though variations of the method enable it.
- Custom Neural Network (Global Multivariate): The second prototype represented a shift to a global, multivariate Deep Learning model. LSTMs [9] are specifically designed to capture long-term temporal dependencies. As a global model, trained on all time series simultaneously, it can learn general patterns of seasonality and demand drivers from all routes. The architecture was designed to integrate a set of different types of features, including temporal-dependent (e.g., lagged passengers, GDP, population) and static features (e.g., route distance). Categorical features like origin and destination airport were fed through embedding layers, allowing the model to learn nuanced relationships between locations. Despite its theoretical power, this approach introduced significant practical complexity in feature engineering, architecture design, and hyperparameter tuning. The tuning process was deemed too time-intensive for the benchmarking phase, leading to a pivot towards a more streamlined framework.
2.3.2. Standardized Benchmarking with Darts: Naive Models, Exponential Smoothing, Prophet, LightGBM
- Naive Model Baselines (Local, Univariate): We first established baselines using NaiveMean (projects the mean of the past data—in this case, of the last full year available), NaiveDrift (extrapolates the linear trend), and NaiveSeasonal (repeats the values from the last seasonal period, considering 12-month seasonality).
- Exponential Smoothing (Local, Univariate): The Holt–Winters variant was implemented, which applies exponentially decreasing weights to past observations and can model level, trend and seasonality [20].
- Prophet (Local, Univariate and Multivariate): This is a decompositional model [21], which models a time series as a sum of trend, seasonality, and holiday components. It was tested as a univariate model and with GDP and population as exogenous covariates.
- LightGBM (Global, Multivariate): We implemented a global model of the gradient boosting framework. A single model was trained on all time series, using past lags of passenger data, future covariates (GDP, population), and static covariates, both categorical and numerical (e.g., origin/destination country, route distance), as features.
3. Results
3.1. Pre-COVID Benchmarking
3.2. Covariate Data Analysis
3.3. Post-COVID Resilience
3.4. Final Modeling and Forecast Analysis
- Strong Trend: On routes with consistent growth (Figure 1a), the model captures the momentum and projects it forward. It is visible that the trend flattens, preventing unrealistic exponential growth.
- Strong Seasonality: The model effectively identifies and extrapolates regular seasonal patterns, a core strength of the Holt–Winters method (Figure 1b).
- Post-COVID Recovery: The model successfully fits the sharp recovery trend observed from 2021, interpreting this recent steep growth as the new prevailing trend and blending it with historical seasonality (Figure 1a,b).
4. Discussion
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- OECD/ITF. Airport Demand Forecasting for Long-Term Planning; ITF Round Tables, No. 159; OECD Publishing: Paris, France, 2016. [Google Scholar]
- Javanmard, M.E.; Tang, Y.; Martínez-Hernández, J.A. Forecasting air transportation demand and its impacts on energy consumption and emission. Appl. Energy 2024, 364, 123031. [Google Scholar] [CrossRef]
- Banerjee, N.; Morton, A.; Akartunalı, K. Passenger demand forecasting in scheduled transportation. Eur. J. Oper. Res. 2020, 286, 797–810. [Google Scholar] [CrossRef]
- Kincaid, I.; Tretheway, M.; Gros, S.; Lewis, D. Addressing Uncertainty About Future Airport Activity Levels in Airport Decision Making; ACRP Report 76; Transportation Research Board: Washington, DC, USA, 2012. [Google Scholar]
- Grosche, T.; Rothlauf, F.; Heinzl, A. Gravity models for airline passenger volume estimation. J. Air Transp. Manag. 2007, 13, 175–183. [Google Scholar] [CrossRef]
- Terekhov, I.; Gollnick, V. A concept of forecasting origin-destination air passenger demand between global city pairs using future socio-economic scenarios. In Proceedings of the 53rd AIAA Aerospace Sciences Meeting, Kissimmee, FL, USA, 5–9 January 2015. AIAA 2015-1640. [Google Scholar]
- Brown, R.G. Exponential Smoothing for Predicting Demand and Short-Range Forecasting; Research Paper No. 15; Office of Naval Research, U.S. Navy: Washington, DC, USA, 1959. [Google Scholar]
- Box, G.E.P.; Jenkins, G.M. Time Series Analysis: Forecasting and Control; Holden-Day: San Francisco, CA, USA, 1970. [Google Scholar]
- Hochreiter, S.; Schmidhuber, J. Long short-term memory. Neural Comput. 1997, 9, 1735–1780. [Google Scholar] [CrossRef] [PubMed]
- Friedman, J.H. Greedy Function Approximation: A Gradient Boosting Machine. Ann. Stat. 2001, 29, 1189–1232. [Google Scholar] [CrossRef]
- Chen, T.; Guestrin, C. XGBoost: A Scalable Tree Boosting System. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, 13–17 August 2016; pp. 785–794. [Google Scholar]
- Ke, G.; Meng, Q.; Finley, T.; Wang, T.; Chen, W.; Ma, W.; Liu, T.Y. LightGBM: A highly efficient gradient boosting decision tree. In Proceedings of the 31st Conference on Neural Information Processing Systems (NIPS 2017), Long Beach, CA, USA, 4–9 December 2017. [Google Scholar]
- Prokhorenkova, L.; Gusev, G.; Vorobev, A.; Dorogush, A.V.; Gulin, A. CatBoost: Unbiased boosting with categorical features. In Proceedings of the 32nd Conference on Neural Information Processing Systems (NeurIPS 2018), Montréal, QC, Canada, 3–8 December 2018. [Google Scholar]
- Nochai, R.; Titisa, P.; Kerdprasop, N.; Kerdprasop, K. Deep Learning vs. Gradient Boosting: Optimizing Transport Energy Forecasts in Thailand Through LSTM and XGBoost. Energies 2024, 17, 1685. [Google Scholar]
- Sabre Inc. Available online: https://www.sabreairlinesolutions.com (accessed on 1 October 2025).
- OAG Aviation Worldwide Limited. Available online: https://www.oag.com/ (accessed on 1 October 2025).
- Curto, J.; Hernández, G.; Alonso-García, M.; Serrano-Ortega, A.; Toledo-Garrote, A.; Chamoso, P. Flow disaggregation: Underdetermined non-negative linear systems. In International Symposium on Ambient Intelligence; Springer Nature: Cham, Switzerland, 2023; pp. 22–32. [Google Scholar]
- S&P Global Market Intelligence. Available online: https://www.spglobal.com/marketintelligence/en/ (accessed on 1 October 2025).
- Herzen, J.; Lässig, F.; Piazzetta, S.G.; Neuer, T.; Tafti, L.; Raille, G.; Van Pottelbergh, T.; Pasieka, M.; Skrodzki, A.; Huguenin, N.; et al. Darts: User-Friendly Modern Machine Learning for Time Series. J. Mach. Learn. Res. 2022, 23, 1–6. [Google Scholar]
- Winters, P.R. Forecasting Sales by Exponentially Weighted Moving Averages. Manag. Sci. 1960, 6, 324–342. [Google Scholar] [CrossRef]
- Taylor, S.J.; Letham, B. Forecasting at scale. Am. Stat. 2017, 72, 37–45. [Google Scholar] [CrossRef]

| Model | Type 1 | Type 2 | WMAPE (2019) |
|---|---|---|---|
| NaiveSeasonal (K = 12) | Local | Univariate | 12.40% |
| Prophet | Local | Univariate | 12.49% |
| Prophet | Local | Multivariate | 12.84% |
| LightGBM | Global | Multivariate | 11.86% |
| Exponential Smoothing | Local | Univariate | 11.41% |
| Model | WMAPE (2019) | WMAPE (2023) | (pp) |
|---|---|---|---|
| NaiveSeasonal () | 12.40% | 41.55% | +29.15 |
| Exponential Smoothing | 11.41% | 18.23% | +6.82 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Ortega, A.S.; Martín, A.R.; Argerich Martín, C. Air Traffic Demand Forecasting for Origin–Destination Airport Pairs Using Artificial Intelligence. Eng. Proc. 2026, 133, 25. https://doi.org/10.3390/engproc2026133025
Ortega AS, Martín AR, Argerich Martín C. Air Traffic Demand Forecasting for Origin–Destination Airport Pairs Using Artificial Intelligence. Engineering Proceedings. 2026; 133(1):25. https://doi.org/10.3390/engproc2026133025
Chicago/Turabian StyleOrtega, Alicia Serrano, Albert Ruiz Martín, and Clara Argerich Martín. 2026. "Air Traffic Demand Forecasting for Origin–Destination Airport Pairs Using Artificial Intelligence" Engineering Proceedings 133, no. 1: 25. https://doi.org/10.3390/engproc2026133025
APA StyleOrtega, A. S., Martín, A. R., & Argerich Martín, C. (2026). Air Traffic Demand Forecasting for Origin–Destination Airport Pairs Using Artificial Intelligence. Engineering Proceedings, 133(1), 25. https://doi.org/10.3390/engproc2026133025

