A Data-Driven Framework for Flight Delay Propagation Forecasting During Extreme Weather
Abstract
1. Introduction
- (1)
- Extreme Delay as Reference Exemplars: Unlike conventional approaches that discard extreme delays as outliers, we repurpose them as reference exemplars through similarity retrieval. This design alleviates the data sparsity in extreme weather scenarios and allows the model to learn from rare but high-impact disruptions rather than discarding them.
- (2)
- Learnable Delay Propagation Mechanism: We explicitly model the “preceding delay × exponential time decay” as a learnable feature and feed it directly into the model. Enabling end-to-end optimization of delay propagation effects.
- (3)
- Integrated Prediction Framework: Building upon the Temporal Fusion Transformer, we develop an integrated architecture that comprises a historical retrieval module for rare-delay cases, a multi-source channel-attention embedding fusion module (MS-CA-EFM) to integrate different types of feature embeddings, and a flight chain delay-propagation module that embeds flight chain dependencies.
- (4)
- Empirical Validation Across Diverse Conditions: Our experiments on U.S. flight data demonstrate consistent performance improvements across airports with varying traffic densities, meteorological conditions, and operational complexities, with MAE (Mean Absolute Error) of 3.23 min, RMSE (Root Mean Square Error) of 6.25 min, and R2 of 0.92—improving by 8.8%, 26.0%, and 5.75% compared to the best benchmark.
2. Literature Review
- (1)
- Imputation-based methods, such as GRU-D [26], use missingness masks and time-gap encoding, but often introduce distributional distortions;
- (2)
- (1)
- Deep models often assume dense, regularly sampled inputs, which are incompatible with sparse and incomplete flight logs;
- (2)
- Graph-based approaches tend to aggregate delays at the airport level, overlooking micro-level flight chain interactions;
- (3)
- Extreme delays, while rare, are frequently discarded during training due to class imbalance or noise suppression.
3. Data and Methodology
- Temporal flight chain dependencies: capturing how upstream delays propagate across flight legs;
- External disruptions: incorporating meteorological and airport operational data to model real-time conditions;
- Sparse and irregular observations: enabling robust prediction even under limited or irregular flight records.
3.1. Data
3.1.1. Analysis of Flight Departure Delay Time Characteristics
3.1.2. Extreme Delay Definition and Identification
3.1.3. Data Processing Workflow
- (1)
- Preserving extreme delay events;
- (2)
- Capturing airport-level operational states to reflect congestion and capacity changes;
- (3)
- Aligning multi-source features, including meteorological and flight chain information, for consistent model input.
- Step 1: Anomaly Detection
- Step 2: Data Cleaning
- Step 3: Airport Status Feature Extraction
- Step 4: Multi-source Data Integration
3.1.4. Flight Chain Data Architecture
- (1)
- Chain Construction Protocol
- (2)
- Delay Propagation Encoding
3.2. Prediction Model
3.2.1. Feature Encoding
- (1)
- Dynamic Feature Embedding
- (2)
- Static Feature Embedding
3.2.2. Embedded Historical Retrieval
- Step 1: Creation of the Historical Database
- Step 2: Similarity-Based Retrieval
- Step 3: Historical Fusion
3.2.3. Multi-Source Channel-Attention Embedding Fusion Module
- (1)
- Multi-Source Feature Concatenation
- (2)
- Channel-Attention Fusion
- (3)
- Residual Enhancement and Prediction
3.2.4. Flight Chain Delay Propagation Module
3.2.5. Computational Complexity Analysis
- (1)
- Embedded Historical Retrieval Module
- (2)
- Multi-Source Channel-Attention Embedding Fusion Module
- (3)
- Flight Chain Delay Propagation Module
4. Results
4.1. Input Features
4.2. Experimental Environment and Settings
4.2.1. Computational Environment
4.2.2. Hyperparameter Configuration and Experimental Design
- (1)
- Benchmark Comparison: Performance was contrasted with classical time-series forecasting models to demonstrate the superiority of the proposed method.
- (2)
- Ablation Study: Component-level ablations were performed to assess the indispensability of each architectural module.
- (3)
- Cross-Airport Evaluation: Predictive accuracy was analyzed across different airports to verify robustness under diverse operational environments.
4.2.3. Evaluation Metrics
4.3. Experimental Validation
- (1)
- Benchmark comparisons against classical time-series models to verify performance advantages;
- (2)
- Ablation studies evaluating the essential contribution of each component;
- (3)
- Cross-airport analyses assessing robustness under diverse operational conditions.
4.3.1. Baseline-Model Comparison
4.3.2. Ablation Study
4.3.3. Comparative Performance Analysis Across Airports
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
Abbreviations
| DMAN | Departure Management |
| A-CDM | Airport Collaborative Decision Making |
| TFT | Temporal Fusion Transformer |
| TFT-DCP | Temporal Fusion Transformer with Dynamic Chain Propagation |
| TCN | Temporal Convolutional Network |
| GRN | Gated Residual Network |
| MS-CA-EFM | Multi-Source Channel-Attention Embedding Fusion Module |
| MAE | Mean Absolute Error |
| RMSE | Root Mean Square Error |
| R2 | Coefficient of Determination |
| HA | Historical Average |
| LSTM | Long Short-Term Memory |
| FAA | Federal Aviation Administration |
| BTS | Bureau of Transportation Statistics |
| NOAA | National Oceanic and Atmospheric Administration |
| LCD | Local Climatological Data |
| ELU | Exponential Linear Unit |
| LayerNorm | Layer Normalization |
References
- Du, W.B.; Zhang, M.Y.; Zhang, Y.; Cao, X.B.; Zhang, J. Delay causality network in air transport systems. Transp. Res. Part E Logist. Transp. Rev. 2018, 118, 466–476. [Google Scholar] [CrossRef]
- Xue, D.; Wu, L.; Xu, T.; Wu, C.L.; Wang, Z.; He, Z. Space weather effects on transportation systems: A review of current understanding and future outlook. Space Weather 2024, 22, e2024SW004055. [Google Scholar] [CrossRef]
- Qu, J.; Wu, S.; Zhang, J. Flight delay propagation prediction based on deep learning. Mathematics 2023, 11, 494. [Google Scholar] [CrossRef]
- Zheng, Y.; Miao, J.; Le, N.; Jiang, Y.; Li, Y. Intelligent airport collaborative decision making (A-CDM) system. In Proceedings of the 2019 IEEE 1st International Conference on Civil Aviation Safety and Information Technology (ICCASIT), Kunming, China, 17–19 October 2019; pp. 616–620. [Google Scholar] [CrossRef]
- Xu, N.; Kosma, C.; Vazirgiannis, M. TimeGNN: Temporal dynamic graph learning for time series forecasting. In Proceedings of the International Conference on Complex Networks and Their Applications, Cham, Switzerland, 28–30 November 2023; Springer Nature: Cham, Switzerland, 2023; pp. 87–99. [Google Scholar] [CrossRef]
- Sun, X.; Wandelt, S.; Zanin, M. Worldwide air transportation networks: A matter of scale and fractality? Transp. A Transp. Sci. 2017, 13, 607–630. [Google Scholar] [CrossRef]
- Henriques, R.; Feiteira, I. Predictive modelling: Flight delays and associated factors, Hartsfield–Jackson Atlanta International Airport. Procedia Comput. Sci. 2018, 138, 638–645. [Google Scholar] [CrossRef]
- Wesonga, R.; Nabugoomu, F.; Jehopio, P. Parameterized framework for the analysis of probabilities of aircraft delay at an airport. J. Air Transp. Manag. 2012, 23, 1–4. [Google Scholar] [CrossRef]
- Kim, Y.J.; Choi, S.; Briceno, S.; Mavris, D. A deep learning approach to flight delay prediction. In Proceedings of the 2016 IEEE/AIAA 35th Digital Avionics Systems Conference (DASC), Sacramento, CA, USA, 25–29 September 2016; pp. 1–6. [Google Scholar] [CrossRef]
- Gui, G.; Liu, F.; Sun, J.; Yang, J.; Zhou, Z.; Zhao, D. Flight delay prediction based on aviation big data and machine learning. IEEE Trans. Veh. Technol. 2019, 69, 140–150. [Google Scholar] [CrossRef]
- Yu, B.; Guo, Z.; Asian, S.; Wang, H.; Chen, G. Flight delay prediction for commercial air transport: A deep learning approach. Transp. Res. Part E Logist. Transp. Rev. 2019, 125, 203–221. [Google Scholar] [CrossRef]
- Qu, J.; Zhao, T.; Ye, M.; Li, J.; Liu, C. Flight delay prediction using deep convolutional neural network based on fusion of meteorological data. Neural Process. Lett. 2020, 52, 1461–1484. [Google Scholar] [CrossRef]
- Li, Q.; Guan, X.; Liu, J. A CNN-LSTM framework for flight delay prediction. Expert Syst. Appl. 2023, 227, 120287. [Google Scholar] [CrossRef]
- Shi, T.; Lai, J.; Gu, R.; Wei, Z. An improved artificial neural network model for flights delay prediction. Int. J. Pattern Recognit. Artif. Intell. 2021, 35, 2159027. [Google Scholar] [CrossRef]
- Lian, G.; Zhang, Y.; Desai, J.; Xing, Z.; Luo, X. Predicting taxi-out time at congested airports with optimization--based support vector regression methods. Math. Probl. Eng. 2018, 2018, 7509508. [Google Scholar] [CrossRef]
- Guo, Z.; Yu, B.; Hao, M.; Wang, W.; Jiang, Y.; Zong, F. A novel hybrid method for flight departure delay prediction using random forest regression and maximal information coefficient. Aerosp. Sci. Technol. 2021, 116, 106822. [Google Scholar] [CrossRef]
- Mamdouh, M.; Ezzat, M.; Hefny, H. Improving flight delays prediction by developing attention-based bidirectional LSTM network. Expert Syst. Appl. 2024, 238, 121747. [Google Scholar] [CrossRef]
- Khan, W.A.; Chung, S.H.; Eltoukhy, A.E.E.; Khurshid, F. A novel parallel series data-driven model for IATA-coded flight delays prediction and features analysis. J. Air Transp. Manag. 2024, 114, 102488. [Google Scholar] [CrossRef]
- Kim, S.; Park, E. Prediction of flight departure delays caused by weather conditions adopting data-driven approaches. J. Big Data 2024, 11, 11. [Google Scholar] [CrossRef]
- Zeng, Y.; Hu, M.; Chen, H.; Yuan, L.; Alam, S.; Xue, D. Improved air traffic flow prediction in terminal areas using a multimodal spatial–temporal network for weather-aware (MST-WA) model. Adv. Eng. Inform. 2024, 62, 102935. [Google Scholar] [CrossRef]
- Zhang, Y.; Xu, S.; Zhang, L.; Jiang, W.; Alam, S.; Xue, D. Short-term multi-step-ahead sector-based traffic flow prediction based on the attention-enhanced graph convolutional LSTM network (AGCLSTM). Transp. Res. Part C Emerg. Technol. 2024, 160, 104461. [Google Scholar] [CrossRef]
- Cai, K.; Li, Y.; Fang, Y.P.; Zhu, Y. A deep learning approach for flight delay prediction through time-evolving graphs. IEEE Trans. Intell. Transp. Syst. 2021, 23, 11397–11407. [Google Scholar] [CrossRef]
- Rebollo, J.J.; Balakrishnan, H. Characterization and prediction of air traffic delays. Transp. Res. Part C Emerg. Technol. 2014, 44, 231–241. [Google Scholar] [CrossRef]
- Li, Q.; Jing, R. Flight delay prediction from spatial and temporal perspective. Expert Syst. Appl. 2022, 205, 117662. [Google Scholar] [CrossRef]
- Evler, J.; Lindner, M.; Fricke, H.; Schultz, M. Integration of turnaround and aircraft recovery to mitigate delay propagation in airline networks. Comput. Oper. Res. 2022, 138, 105602. [Google Scholar] [CrossRef]
- Che, Z.; Purushotham, S.; Cho, K.; Sontag, D.; Liu, Y. Recurrent neural networks for multivariate time series with missing values. Sci. Rep. 2018, 8, 6085. [Google Scholar] [CrossRef] [PubMed]
- Shukla, S.N.; Marlin, B.M. Multi-time attention networks for irregularly sampled time series. arXiv 2021, arXiv:2101.10318. [Google Scholar] [CrossRef]
- Hajisafi, A.; Siampou, M.D.; Azarijoo, B.; Shahabi, C. WaveGNN: Modeling irregular multivariate time series for accurate predictions. arXiv 2024, arXiv:2412.10621. [Google Scholar] [CrossRef]
- Liu, C.; Chen, Y.R.; Wang, H.; Zhang, Y.; Dai, X.; Luo, Q.; Chen, L. Airport flight ground service time prediction with missing data using graph convolutional neural network imputation and bidirectional sliding mechanism. Appl. Soft Comput. 2023, 133, 109941. [Google Scholar] [CrossRef]
- Wang, D.; Sherry, L.; Xu, N.; Larson, M. Statistical comparison of passenger trip delay and flight delay metrics. Transp. Res. Rec. 2008, 2052, 72–78. [Google Scholar] [CrossRef]
- Bombelli, A.; Sallan, J.M. Analysis of the effect of extreme weather on the US domestic air network: A delay and cancellation propagation network approach. J. Transp. Geogr. 2023, 107, 103541. [Google Scholar] [CrossRef]
- January 13–16, 2024 North American Winter Storm. Available online: https://en.wikipedia.org/wiki/January_13%E2%80%9316,_2024_North_American_winter_storm (accessed on 29 June 2025).
- Destructive Wind and Giant Hail Impact the South Plains. Available online: https://www.weather.gov/lub/events-2024-20240528-storms (accessed on 29 June 2025).
- Thursday Webinar Briefing. Available online: https://www.weather.gov/media/tbw/dssbriefs/ThursdayWebinar.pdf (accessed on 29 June 2025).
- Lim, B.; Arık, S.Ö.; Loeff, N.; Pfister, T. Temporal fusion transformers for interpretable multi-horizon time series forecasting. Int. J. Forecast. 2021, 37, 1748–1764. [Google Scholar] [CrossRef]
- Bai, S.; Kolter, J.Z.; Koltun, V. An empirical evaluation of generic convolutional and recurrent networks for sequence modeling. arXiv 2018, arXiv:1803.01271. [Google Scholar] [CrossRef]
- Clevert, D.A.; Unterthiner, T.; Hochreiter, S. Fast and accurate deep network learning by exponential linear units (ELUs). arXiv 2015, arXiv:1511.07289. [Google Scholar]
- Ba, J.L.; Kiros, J.R.; Hinton, G.E. Layer normalization. arXiv 2016, arXiv:1607.06450. [Google Scholar] [CrossRef]
- Hu, J.; Shen, L.; Sun, G. Squeeze-and-excitation networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA, 18–23 June 2018; pp. 7132–7141. [Google Scholar] [CrossRef]
- Gopalakrishnan, K.; Balakrishnan, H. Control and optimization of air traffic networks. Annu. Rev. Control Robot. Auton. Syst. 2021, 4, 397–424. [Google Scholar] [CrossRef]
- Wang, Y.; Li, M.Z.; Gopalakrishnan, K.; Liu, T. Timescales of delay propagation in airport networks. Transp. Res. Part E Logist. Transp. Rev. 2022, 161, 102687. [Google Scholar] [CrossRef]
- Hochreiter, S.; Schmidhuber, J. Long short-term memory. Neural Comput. 1997, 9, 1735–1780. [Google Scholar] [CrossRef] [PubMed]
- Zhou, H.; Zhang, S.; Peng, J.; Zhang, S.; Li, J.; Xiong, H.; Zhang, W. Informer: Beyond efficient transformer for long sequence time-series forecasting. In Proceedings of the AAAI Conference on Artificial Intelligence, Virtual, 2–9 February 2021; Volume 35, pp. 11106–11115. [Google Scholar] [CrossRef]







| Feature Source | Static Features | Dynamic Features |
|---|---|---|
| Flight schedule data | Departure airport, scheduled departure time, etc. | Actual taxi-out time, actual departure time, preceding flight delay time, etc. |
| Airport operational status data | Airport classification level, etc. | Airport congestion level, airport density, cumulative departure delay in the past hour, etc. |
| Airport meteorological data | None | Cloud conditions, visibility, precipitation, wind direction, etc. |
| Model | HA | LSTM | Informer | TCN | TFT | TFT-DCP |
|---|---|---|---|---|---|---|
| MAE | 5.87 | 4.81 | 3.99 | 3.78 | 3.54 | 3.23 |
| RMSE | 12.59 | 10.23 | 8.86 | 8.82 | 8.45 | 6.25 |
| R2 | 0.68 | 0.74 | 0.84 | 0.85 | 0.87 | 0.92 |
| Scenario | Period | Model | MAE | RMSE | R2 |
|---|---|---|---|---|---|
| Winter Storm | Normal | TFT-DCP | 3.12 | 6.08 | 0.94 |
| Informer | 3.56 | 7.54 | 0.86 | ||
| TFT | 3.35 | 8.12 | 0.88 | ||
| Peak | TFT-DCP | 3.36 | 6.45 | 0.89 | |
| Informer | 4.31 | 9.81 | 0.83 | ||
| TFT | 3.77 | 9.13 | 0.85 | ||
| Convective Weather | Normal | TFT-DCP | 2.94 | 6.13 | 0.94 |
| Informer | 3.52 | 8.66 | 0.85 | ||
| TFT | 3.42 | 8.21 | 0.87 | ||
| Peak | TFT-DCP | 3.44 | 6.52 | 0.91 | |
| Informer | 4.17 | 9.25 | 0.79 | ||
| TFT | 3.81 | 8.97 | 0.84 | ||
| High-Pressure Event | Normal | TFT-DCP | 3.08 | 6.06 | 0.93 |
| Informer | 3.57 | 8.57 | 0.87 | ||
| TFT | 3.24 | 7.95 | 0.89 | ||
| Peak | TFT-DCP | 3.46 | 6.25 | 0.88 | |
| Informer | 4.64 | 9.34 | 0.81 | ||
| TFT | 3.67 | 8.3 | 0.86 |
| Experiment Number | Dynamic Embedding Module | Embedded Historical Data Retrieval Module | MS-CA-EF Module | Flight-Chain Module | MAE | RMSE | R2 |
|---|---|---|---|---|---|---|---|
| 1 | √ | × | × | × | 3.86 | 7.62 | 0.79 |
| 2 | √ | √ | × | × | 3.51 | 6.49 | 0.85 |
| 3 | √ | √ | √ | × | 3.36 | 6.47 | 0.87 |
| 4 | √ | √ | √ | √ | 3.23 | 6.25 | 0.92 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Guo, J.; Li, J.; Yuan, J.; Yang, Y.; Ren, Z. A Data-Driven Framework for Flight Delay Propagation Forecasting During Extreme Weather. Mathematics 2025, 13, 3551. https://doi.org/10.3390/math13213551
Guo J, Li J, Yuan J, Yang Y, Ren Z. A Data-Driven Framework for Flight Delay Propagation Forecasting During Extreme Weather. Mathematics. 2025; 13(21):3551. https://doi.org/10.3390/math13213551
Chicago/Turabian StyleGuo, Jiuxia, Jingyuan Li, Jiang Yuan, Yungui Yang, and Zihao Ren. 2025. "A Data-Driven Framework for Flight Delay Propagation Forecasting During Extreme Weather" Mathematics 13, no. 21: 3551. https://doi.org/10.3390/math13213551
APA StyleGuo, J., Li, J., Yuan, J., Yang, Y., & Ren, Z. (2025). A Data-Driven Framework for Flight Delay Propagation Forecasting During Extreme Weather. Mathematics, 13(21), 3551. https://doi.org/10.3390/math13213551
