Traffic Flow Prediction in Complex Transportation Networks via a Spatiotemporal Causal–Trend Network
Abstract
1. Introduction
- (1)
- Static-Dynamic Spatial Decoupling: Current graph-based methods typically rely on predefined static graphs (e.g., road connectivity) or learn dynamic graphs separately, failing to jointly model both global topological stability and localized traffic-induced spatial dynamics.
- (2)
- Temporal Causality Neglect: Most attention-based temporal modules treat all historical time steps equally, ignoring the fundamental causal constraints where future traffic states cannot influence past observations.
- (3)
- Information Degradation: The sequential message-passing paradigm in spatiotemporal models causes progressive information loss during feature propagation, particularly detrimental for long-range forecasting.
- We propose an adaptive graph learning layer that requires no prior knowledge to jointly model the spatial dependencies of the traffic network. This is achieved by constructing both a static and a dynamic graph: the static graph captures the stable, global spatial topology, while the dynamic graph focuses on capturing localized spatial dynamics that vary with time and traffic conditions.
- We design a Gated Temporal Attention Module (GTAM) that integrates a novel causal-trend attention mechanism. This module not only effectively captures long-range temporal dependencies but also, through its specialized attention mechanism, enables the model to precisely extract causal relationships and local trend information from the time series data.
- We conduct extensive experiments on multiple real-world traffic datasets. The results validate the effectiveness of our proposed method, demonstrating that the SCTN model achieves superior and more robust prediction performance compared to current state-of-the-art baseline models.
2. Materials and Methods
2.1. Problem Formulation
2.2. The SCTN Framework
2.2.1. Static Graph Learning
2.2.2. Dynamic Graph Learning
2.2.3. Adaptive Graph Convolution Module
2.2.4. Gated Temporal Attention Module
2.2.5. Spatiotemporal Embedding
2.2.6. Loss Function
3. Results
3.1. Datasets
3.2. Experimental Settings
3.3. Baseline Models
3.4. Analysis of Experimental Results
3.5. Ablation Study
3.6. Visualization Analysis
3.7. Cost Experiment
4. Discussion
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Lu, J.; Li, B.; Li, H.; Al-Barakani, A. Expansion of city scale, traffic modes, traffic congestion, and air pollution. Cities 2021, 108, 102974. [Google Scholar] [CrossRef]
- Pojani, D.; Stead, D. Sustainable urban transport in the developing world: Beyond megacities. Sustainability 2015, 7, 7784–7805. [Google Scholar] [CrossRef]
- Zhu, L.; Yu, F.R.; Wang, Y.; Ning, B.; Tang, T. Big data analytics in intelligent transportation systems: A survey. IEEE Trans. Intell. Transp. Syst. 2018, 20, 383–398. [Google Scholar] [CrossRef]
- Yang, S. On feature selection for traffic congestion prediction. Transp. Res. Part C Emerg. Technol. 2013, 26, 160–169. [Google Scholar] [CrossRef]
- Musa, A.A.; Malami, S.I.; Alanazi, F.; Ounaies, W.; Alshammari, M.; Haruna, S.I. Sustainable traffic management for smart cities using internet-of-things-oriented intelligent transportation systems (ITS): Challenges and recommendations. Sustainability 2023, 15, 9859. [Google Scholar] [CrossRef]
- Liu, R.; Shin, S.Y. A Review of Traffic Flow Prediction Methods in Intelligent Transportation System Construction. Appl. Sci. 2025, 15, 3866. [Google Scholar] [CrossRef]
- Liu, H.; Dong, Z.; Jiang, R.; Deng, J.; Deng, J.; Chen, Q.; Song, X. Spatio-temporal adaptive embedding makes vanilla transformer sota for traffic forecasting. In Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, Birmingham, UK, 21–25 October 2023; pp. 4125–4129. [Google Scholar]
- Bruna, J.; Zaremba, W.; Szlam, A.; LeCun, Y. Spectral networks and locally connected networks on graphs. arXiv 2013, arXiv:1312.6203. [Google Scholar] [CrossRef]
- Liu, B.; Tang, X.; Cheng, J.; Shi, P. Traffic flow combination forecasting method based on improved LSTM and ARIMA. Int. J. Embed. Syst. 2020, 12, 22–30. [Google Scholar] [CrossRef]
- Ting, C.-C.; Wu, K.-T.; Lin, H.-T.C.; Lin, S. MixModel: A Hybrid TimesNet–Informer Architecture with 11-Dimensional Time Features for Enhanced Traffic Flow Forecasting. Mathematics 2025, 13, 3191. [Google Scholar] [CrossRef]
- Long, W.; Xiao, Z.; Wang, D.; Jiang, H.; Chen, J.; Li, Y.; Alazab, M. Unified spatial-temporal neighbor attention network for dynamic traffic prediction. IEEE Trans. Veh. Technol. 2022, 72, 1515–1529. [Google Scholar] [CrossRef]
- Chen, J.; Zhang, S.; Xu, W. Scalable Prediction of Heterogeneous Traffic Flow with Enhanced Non-Periodic Feature Modeling. Expert Syst. Appl. 2025, 294, 128847. [Google Scholar] [CrossRef]
- Lana, I.; Del Ser, J.; Velez, M.; Vlahogianni, E.I. Road traffic forecasting: Recent advances and new challenges. IEEE Intell. Transp. Syst. Mag. 2018, 10, 93–109. [Google Scholar] [CrossRef]
- Qi, P.; Pan, C.; Xu, X.; Wang, J.; Liang, J.; Zhou, W. A review of dynamic traffic flow prediction methods for global energy-efficient route planning. Sensors 2025, 25, 5560. [Google Scholar] [CrossRef]
- Toan, T.D.; Truong, V.H. Support vector machine for short-term traffic flow prediction and improvement of its model training using nearest neighbor approach. Transp. Res. Rec. 2021, 2675, 362–373. [Google Scholar] [CrossRef]
- Luo, X.; Li, D.; Zhang, S. Traffic flow prediction during the holidays based on DFT and SVR. J. Sens. 2019, 2019, 6461450. [Google Scholar] [CrossRef]
- Emami, A.; Sarvi, M.; Asadi Bagloee, S. Using Kalman filter algorithm for short-term traffic flow prediction in a connected vehicle environment. J. Mod. Transp. 2019, 27, 222–232. [Google Scholar] [CrossRef]
- Luo, X.; Li, D.; Yang, Y.; Zhang, S. Spatiotemporal traffic flow prediction with KNN and LSTM. J. Adv. Transp. 2019, 2019, 4145353. [Google Scholar] [CrossRef]
- Wang, Q.; Chen, J.; Song, Y.; Li, X.; Xu, W. Fusing visual quantified features for heterogeneous traffic flow prediction. Promet-Traffic Transp. 2024, 36, 1068–1077. [Google Scholar] [CrossRef]
- Lu, S.; Zhang, Q.; Chen, G.; Seng, D. A combined method for short-term traffic flow prediction based on recurrent neural network. Alex. Eng. J. 2021, 60, 87–94. [Google Scholar] [CrossRef]
- Sun, S.; Zhang, C.; Yu, G. A Bayesian network approach to traffic flow forecasting. IEEE Trans. Intell. Transp. Syst. 2006, 7, 124–132. [Google Scholar] [CrossRef]
- Shu, W.; Cai, K.; Xiong, N.N. A short-term traffic flow prediction model based on an improved gate recurrent unit neural network. IEEE Trans. Intell. Transp. Syst. 2021, 23, 16654–16665. [Google Scholar] [CrossRef]
- Zhang, W.; Yu, Y.; Qi, Y.; Shu, F.; Wang, Y. Short-term traffic flow prediction based on spatio-temporal analysis and CNN deep learning. Transp. A Transp. Sci. 2019, 15, 1688–1711. [Google Scholar] [CrossRef]
- Wang, T.; Chen, J.; Lü, J.; Liu, K.; Zhu, A.; Snoussi, H.; Zhang, B. Synchronous spatiotemporal graph transformer: A new framework for traffic data prediction. IEEE Trans. Neural Netw. Learn. Syst. 2022, 34, 10589–10599. [Google Scholar] [CrossRef]
- Yingran, Z.; Chao, L.; Rui, S. Enhancing Traffic Flow Forecasting with Delay Propagation: Adaptive Graph Convolution Networks for Spatio-Temporal Data. IEEE Trans. Intell. Transp. Syst. 2025, 26, 650–660. [Google Scholar] [CrossRef]
- Jiang, J.; Han, C.; Zhao, W.X.; Wang, J. Pdformer: Propagation delay-aware dynamic long-range transformer for traffic flow prediction. In Proceedings of the AAAI Conference on Artificial Intelligence, Washington, DC, USA, 7–14 February 2023; Volume 37, pp. 4365–4373. [Google Scholar] [CrossRef]
- Li, Y.; Yu, R.; Shahabi, C.; Liu, Y. Diffusion convolutional recurrent neural network: Data-driven traffic forecasting. arXiv 2017. [Google Scholar] [CrossRef]
- Fu, R.; Zhang, Z.; Li, L. Using LSTM and GRU neural network methods for traffic flow prediction. In Proceedings of the 2016 31st Youth Academic Annual Conference of Chinese Association of Automation (YAC); IEEE: Piscataway, NJ, USA, 2016; pp. 324–328. [Google Scholar] [CrossRef]
- Yu, B.; Yin, H.; Zhu, Z. Spatio–temporal graph convolutional networks: A deep learning framework for traffic forecasting. In Proceedings of the 27th International Joint Conference on Artificial Intelligence, Stockholm, Sweden, 13–19 July 2018; pp. 3634–3640. [Google Scholar] [CrossRef]
- Guo, S.; Lin, Y.; Feng, N.; Song, C.; Wan, H. Attention based spatial-temporal graph convolutional networks for traffic flow forecasting. In Proceedings of the AAAI Conference on Artificial Intelligence, Honolulu, HI, USA, 27 January–1 February 2019; Volume 33, pp. 922–929. [Google Scholar] [CrossRef]
- Song, C.; Lin, Y.; Guo, S.; Wan, H. Spatial-temporal synchronous graph convolutional networks: A new framework for spatial-temporal network data forecasting. In Proceedings of the AAAI Conference on Artificial Intelligence, New York, NY, USA, 7–12 February 2020; Volume 34, pp. 914–921. [Google Scholar] [CrossRef]
- Shin, Y.; Yoonjin, Y. PGCN: Progressive graph convolutional networks for spatial–temporal traffic forecasting. IEEE Trans. Intell. Transp. Syst. 2024, 25, 7633–7644. [Google Scholar] [CrossRef]
- Bai, L.; Yao, L.; Li, C.; Wang, X.; Wang, C. Adaptive graph convolutional recurrent network for traffic forecasting. Adv. Neural Inf. Process. Syst. 2020, 33, 17804–17815. [Google Scholar] [CrossRef]
- Li, M.; Zhu, Z. Spatial-temporal fusion graph neural networks for traffic flow forecasting. In Proceedings of the AAAI Conference on Artificial Intelligence, Virtually, 2–9 February 2021; Volume 35, pp. 4189–4196. [Google Scholar] [CrossRef]








| Notations | Description | Traffic Characteristic |
|---|---|---|
| Directed graph representation | : Set of N traffic sensors/detectors; : Spatial correlation matrix between road segments | |
| Number of nodes | Total count of deployed traffic sensors in the network | |
| Static adaptive adjacency matrix | Learned static spatial dependencies between sensors based on global traffic patterns | |
| Dynamic adjacency matrix | Time-varying correlations between sensors at each step | |
| , | (Normalized) adjacency matrix | Final graph topology for information propagation |
| Node feature matrix | Traffic measurements at all sensors at time | |
| Feature vector of node / | Multi-dimensional traffic state at specific sensor location | |
| Node features at layer , Initial transformed features | Encoded traffic representations after -hop spatial aggregation, MLP-projected traffic features before graph convolution | |
| Attention parameter matrix | Weights for adaptive layer-wise importance scoring | |
| Number of diffusion steps | Maximum hop distance for spatial information propagation | |
| Temporal embedding, Spatial embedding | Learnable time-of-day/week patterns for periodic traffic trends, Learnable sensor-specific geographic/road context | |
| , | Ground truth and predicted values | Traffic flow measurements (vehicles per time interval) |
| Dataset | PeMS04 | PeMS08 |
|---|---|---|
| Nodes | 307 | 170 |
| Edges | 340 | 295 |
| Samples | 16,992 | 17,856 |
| Traffic Pattern | Medium-scale urban area | Suburban with fluctuations |
| Missing Rate | 3.18% | 0.69% |
| Aggregation Interval | 5 min | 5 min |
| Data Types | Flow, Speed, Occupancy | Flow, Speed, Occupancy |
| Location | Bay Area, San Francisco, CA, USA | San Bernardino County, CA, USA |
| Methods | PeMS04 | PeMS08 | ||||
|---|---|---|---|---|---|---|
| RMSE | MAE | MAPE (%) | RMSE | MAE | MAPE (%) | |
| SVR | 44.56 | 28.70 | 19.20 | 36.16 | 23.25 | 14.64 |
| FC-LSTM | 41.59 | 27.14 | 18.20 | 34.06 | 22.20 | 14.20 |
| DCRNN | 38.12 | 24.70 | 17.12 | 27.83 | 17.86 | 11.45 |
| STGCN | 35.55 | 22.70 | 14.59 | 26.71 | 18.02 | 11.40 |
| ASTGCN | 35.22 | 22.93 | 16.56 | 28.16 | 18.61 | 13.08 |
| STSGCN | 33.65 | 21.19 | 13.90 | 26.80 | 17.13 | 10.96 |
| PGCN | 32.02 | 20.00 | 13.96 | 25.19 | 15.26 | 10.02 |
| PDFormer | 29.98 | 18.35 | 12.26 | 24.18 | 14.98 | 9.89 |
| AGCRN | 32.30 | 19.83 | 12.97 | 25.22 | 15.95 | 10.09 |
| DPSTGC | 30.99 | 19.07 | 12.52 | 24.81 | 15.16 | 9.76 |
| STFGNN | 31.88 | 19.64 | 12.69 | 26.22 | 16.64 | 10.60 |
| SCTN | 30.75 | 18.87 | 12.23 | 24.32 | 15.36 | 9.79 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Feng, X.; Sheng, L.; Zhu, L.; Feng, Y.; Wei, C.; Xiao, X.; Wang, H. Traffic Flow Prediction in Complex Transportation Networks via a Spatiotemporal Causal–Trend Network. Mathematics 2026, 14, 443. https://doi.org/10.3390/math14030443
Feng X, Sheng L, Zhu L, Feng Y, Wei C, Xiao X, Wang H. Traffic Flow Prediction in Complex Transportation Networks via a Spatiotemporal Causal–Trend Network. Mathematics. 2026; 14(3):443. https://doi.org/10.3390/math14030443
Chicago/Turabian StyleFeng, Xingyu, Lina Sheng, Linglong Zhu, Yishan Feng, Chen Wei, Xudong Xiao, and Haochen Wang. 2026. "Traffic Flow Prediction in Complex Transportation Networks via a Spatiotemporal Causal–Trend Network" Mathematics 14, no. 3: 443. https://doi.org/10.3390/math14030443
APA StyleFeng, X., Sheng, L., Zhu, L., Feng, Y., Wei, C., Xiao, X., & Wang, H. (2026). Traffic Flow Prediction in Complex Transportation Networks via a Spatiotemporal Causal–Trend Network. Mathematics, 14(3), 443. https://doi.org/10.3390/math14030443

