The Nonlinear Causal Effect Estimation of the Built Environment on Urban Rail Transit Station Flow Under Emergency
Abstract
1. Introduction
2. Literature Review
2.1. The Influencing Mechanism of URT Station-Level Ridership
2.2. URT Passenger Flow Under Incident and Emergency Conditions
2.3. Research Gap
3. Methodology
3.1. Research Framework
- First, URT network structure indicators (e.g., a station’s connectivity, centrality, transfer degree, or availability of alternative lines);
- Second, station-level built environment features around the station (e.g., land-use mix, job accessibility, presence of nearby transit options);
- Third, external factors (e.g., characteristics of the disruption incident, service suspension duration, weather, or policy interventions).
3.2. Causal Discovery and Structure Learning
3.2.1. Integration with Domain Knowledge
3.2.2. Domain-Constrained NOTEARS Optimization
3.3. Double Machine Learning for Causal Effect Estimation
3.3.1. Causal Effect Estimation
- Firstly, learn the influencing functions and using a machine learning algorithm, such as the gradient boosting decision tree [37].
- Secondly, residualize and to remove variation due to .
- Thirdly, regress the residualized outcome on residualized treatment to estimate the causal effect. We also utilize cross-fitting to avoid overfitting in the estimation.
3.3.2. Average Treatment Effect and Conditional Average Treatment Effect
4. Study Area and Dataset
4.1. Study Area
4.2. Dataset
5. Results
5.1. Causal Effect Analysis
5.2. Heterogeneous Causal Effect Analysis
5.3. Policy Discussion
6. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
Appendix A. Explanation of the NOTEARS Algorithm
Algorithm A1. Pseudocode of the NOTEARS algorithm. Non-combinatorial optimization via trace exponential and augmented Lagrangian for structure learning |
Input: - : Data matrix, where each row is an observation, and each column is a variable (). - : Regularization coefficient (controls sparsity in the learned structure). - : Initial guess for the dual variables. - : Progress rate (), controls the update rate of the optimization. - : Tolerance for convergence, stopping criterion for the optimization. - : Threshold parameter used for edge selection after optimization. Output: - : Estimated weighted adjacency matrix for the DAG. Stage 1: Initialize parameters Initialize as random matrix (size ). Set as the initial guess for the dual variables. Define function h(W) for acyclicity constraint using matrix exponential: where ⊙ is the Hadamard product, exp(W) is the matrix exponential, and tr is the trace. Define the loss function (L1 regularization). Stage 2: Continuous optimization loop For each iteration t = 0, 1, 2, ... a. Solve the primal problem: b. Update dual variables: c. Check convergence condition: If , break the loop and set . Stage 3: Post-processing and thresholding Apply threshold ω to the learned weights matrix : = ⊙ 1(|M| > ω), where 1 is the indicator function. Return the estimated DAG structure with edges defined by . |
Appendix B. Prior Constraint Matrix (PCM)
Potential Treatment Variable | Potential Outcome Variable |
---|---|
Land use entropy | Bus connectivity |
Job accessibility | Bus connectivity |
Job–housing balance level | Bus connectivity |
University dummy | Bus connectivity |
CBD dummy | Bus connectivity |
Transportation hub dummy | Bus connectivity |
Non-incident URT ridership | Bus connectivity |
Degree (network connectivity) | Bus connectivity |
Land use entropy | Non-incident URT ridership |
Job accessibility | Non-incident URT ridership |
Job–housing balance level | Non-incident URT ridership |
University dummy | Non-incident URT ridership |
CBD dummy | Non-incident URT ridership |
Transportation hub dummy | Non-incident URT ridership |
Degree (network connectivity) | Non-incident URT ridership |
Betweenness (network centrality) | Non-incident URT ridership |
Weather dummy | Non-incident URT ridership |
Power incident dummy | Disruption duration |
Intrusion incident dummy | Disruption duration |
Line incident dummy | Disruption duration |
Signal incident dummy | Disruption duration |
Train incident dummy | Disruption duration |
Door incident dummy | Disruption duration |
Weekday dummy | Disruption duration |
Peak-hour dummy | Disruption duration |
Accident station dummy | Disruption duration |
Weather dummy | Disruption duration |
CBD dummy | Disruption duration |
University dummy | Disruption duration |
Transportation hub dummy | Disruption duration |
Degree (network connectivity) | Incident URT ridership decline |
Betweenness (network centrality) | Incident URT ridership decline |
Bus connectivity | Incident URT ridership decline |
Non-incident URT ridership | Incident URT ridership decline |
Disruption duration | Incident URT ridership decline |
Appendix C. Kurtosis and Skewness of the Individual Treatment Effects
Source | Target | Kurtosis | Skewness |
---|---|---|---|
Power incident dummy | Disruption duration | −0.653 | 1.160 |
Intrusion incident dummy | Disruption duration | 0.721 | 1.557 |
Line incident dummy | Disruption duration | −1.907 | 0.306 |
Signal incident dummy | Disruption duration | −0.985 | −0.879 |
Train incident dummy | Disruption duration | 3.382 | 2.181 |
Door incident dummy | Disruption duration | −1.447 | −0.528 |
Weekday dummy | Disruption duration | −0.373 | −0.025 |
Peak-hour dummy | Disruption duration | −0.098 | 0.116 |
Accident station dummy | Disruption duration | 0.607 | −0.116 |
Weather dummy | Disruption duration | −0.099 | 0.119 |
Weather dummy | Non-incident URT ridership | −0.003 | −0.359 |
Degree | Incident URT ridership decline | 0.747 | −0.520 |
Betweenness | Incident URT ridership decline | 1.375 | 0.783 |
Bus connectivity | Incident URT ridership decline | 1.135 | 0.466 |
Non-incident URT ridership | Incident URT ridership decline | −0.301 | 0.385 |
Disruption duration | Incident URT ridership decline | 0.457 | −1.132 |
Land use entropy | Bus connectivity | 0.016 | −0.115 |
Job accessibility | Bus connectivity | 0.067 | −0.157 |
Job–housing balance level | Bus connectivity | 0.244 | 0.726 |
University dummy | Bus connectivity | 1.632 | −0.753 |
CBD dummy | Bus connectivity | 1.275 | 0.916 |
Transportation hub dummy | Bus connectivity | 0.168 | 0.194 |
Land use entropy | Non-incident URT ridership | 1.041 | 1.029 |
Job accessibility | Non-incident URT ridership | −0.056 | 0.038 |
Job–housing balance level | Non-incident URT ridership | 0.188 | 0.744 |
University dummy | Non-incident URT ridership | 0.116 | 0.005 |
CBD dummy | Non-incident URT ridership | 0.535 | 0.115 |
Transportation hub dummy | Non-incident URT ridership | 2.774 | 0.934 |
References
- Wang, W.; Wang, Y.; Wang, G.; Li, M.; Jia, L. Identification of the critical accident causative factors in the urban rail transit system by complex network theory. Phys. A Stat. Mech. Its Appl. 2023, 610, 128404. [Google Scholar] [CrossRef]
- Zhang, X.; Deng, Y.; Li, Q.; Skitmore, M.; Zhou, Z. An incident database for improving metro safety: The case of shanghai. Saf. Sci. 2016, 84, 88–96. [Google Scholar] [CrossRef]
- He, Y.; Zhao, Y.; Tsui, K.-L. Geographically modeling and understanding factors influencing transit ridership: An empirical study of Shenzhen metro. Appl. Sci. 2019, 9, 4217. [Google Scholar] [CrossRef]
- Ge, S.; Shan, M.; Zhai, Z. Emergency Preparedness in China’s Urban Rail Transit System: A Systematic Review. Sustainability 2025, 17, 524. [Google Scholar] [CrossRef]
- Yang, C.; Yu, C.; Dong, W.; Yuan, Q. Substitutes or complements? Examining effects of urban rail transit on bus ridership using longitudinal city-level data. Transp. Res. Part A Policy Pract. 2023, 174, 103728. [Google Scholar] [CrossRef]
- Zhang, Y.; Bian, Y.; Zhao, X.; Zhao, X. Integrating Safety and Efficiency: Design and Evaluation of Dynamic Emergency Evacuation Sign System in Urban Rail Transit. Sustainability 2024, 16, 10921. [Google Scholar] [CrossRef]
- Wang, W.; Ji, Y.; Zhao, Z.; Yin, H. Simulation Optimization of Station-Level Control of Large-Scale Passenger Flow Based on Queueing Network and Surrogate Model. Sustainability 2024, 16, 7502. [Google Scholar] [CrossRef]
- Zhang, Y.; Chengcheng, Y.; Chao, Y.; Wentao, D.; Li, W. Spend more but get less: Unravelling the causes of bus ridership decline using a data-driven machine learning approach. Transp. B Transp. Dyn. 2025, 13, 2438246. [Google Scholar] [CrossRef]
- Kuby, M.; Barranda, A.; Upchurch, C. Factors influencing light-rail station boardings in the United States. Transp. Res. Part A Policy Pract. 2004, 38, 223–247. [Google Scholar] [CrossRef]
- Liu, S.; Yao, E.; Li, B. Exploring urban rail transit station-level ridership growth with network expansion. Transp. Res. Part D Transp. Environ. 2019, 73, 391–402. [Google Scholar] [CrossRef]
- Yu Li, X.; Krishna Sinniah, G.; Li, R. Identify impacting factor for urban rail ridership from built environment spatial heterogeneity. Case Stud. Transp. Policy 2022, 10, 1159–1171. [Google Scholar] [CrossRef]
- Mohamad Zulkifli, S.N.A.; Kadar Hamsa, A.A.; Noor, N.M.; Ibrahim, M. Evaluation of land use density, diversity and ridership of Rail Based Public Transportation System. Transp. Res. Procedia 2017, 25, 5266–5281. [Google Scholar] [CrossRef]
- He, Y.; Zhao, Y.; Tsui, K.L. An Analysis of Factors Influencing Metro Station Ridership: Insights from Taipei Metro. In Proceedings of the 2018 21st International Conference on Intelligent Transportation Systems (ITSC), Maui, HI, USA, 4–7 November 2018; pp. 1598–1603. [Google Scholar]
- Wang, J.; Wan, F.; Dong, C.; Yin, C.; Chen, X. Spatiotemporal effects of built environment factors on varying rail transit station ridership patterns. J. Transp. Geogr. 2023, 109, 103597. [Google Scholar] [CrossRef]
- Yu, C.; Dong, W.; Liu, Y.; Yang, C.; Yuan, Q. Rethinking bus ridership dynamics: Examining nonlinear effects of determinants on bus ridership changes using city-level panel data from 2010 to 2019. Transp. Policy 2024, 151, 85–100. [Google Scholar] [CrossRef]
- Choi, J.; Lee, Y.J.; Kim, T.; Sohn, K. An analysis of Metro ridership at the station-to-station level in Seoul. Transportation 2012, 39, 705–722. [Google Scholar] [CrossRef]
- Gutiérrez, J.; Cardozo, O.D.; García-Palomares, J.C. Transit ridership forecasting at station level: An approach based on distance-decay weighted regression. J. Transp. Geogr. 2011, 19, 1081–1092. [Google Scholar] [CrossRef]
- Wang, Z.; Song, J.; Zhang, Y.; Li, S.; Jia, J.; Song, C. Spatial Heterogeneity Analysis for Influencing Factors of Outbound Ridership of Subway Stations Considering the Optimal Scale Range of “7D” Built Environments. Sustainability 2022, 14, 16314. [Google Scholar] [CrossRef]
- Cardozo, O.D.; García-Palomares, J.C.; Gutiérrez, J. Application of geographically weighted regression to the direct forecasting of transit ridership at station-level. Appl. Geogr. 2012, 34, 548–558. [Google Scholar] [CrossRef]
- Qian, Q.; Liu, Y.; He, M.; He, M.; Qian, H.; Shi, Z. Understanding the Spatial Heterogeneity Impact of Determinants on Ridership of Urban Rail Transit Across Different Passenger Groups. J. Adv. Transp. 2024, 2024, 9933244. [Google Scholar] [CrossRef]
- Yang, L.; Yu, B.; Liang, Y.; Lu, Y.; Li, W. Time-varying and non-linear associations between metro ridership and the built environment. Tunn. Undergr. Space Technol. 2023, 132, 104931. [Google Scholar] [CrossRef]
- An, D.; Tong, X.; Liu, K.; Chan, E.H. Understanding the impact of built environment on metro ridership using open source in Shanghai. Cities 2019, 93, 177–187. [Google Scholar] [CrossRef]
- Gan, Z.; Yang, M.; Feng, T.; Timmermans, H.J. Examining the relationship between built environment and metro ridership at station-to-station level. Transp. Res. Part D Transp. Environ. 2020, 82, 102332. [Google Scholar] [CrossRef]
- Li, S.; Lyu, D.; Liu, X.; Tan, Z.; Gao, F.; Huang, G.; Wu, Z. The varying patterns of rail transit ridership and their relationships with fine-scale built environment factors: Big data analytics from Guangzhou. Cities 2020, 99, 102580. [Google Scholar] [CrossRef]
- Liu, L.; Chen, J.; Wu, H.; Zhen, J.; Li, G.; Lin, L. Physical-virtual collaboration modeling for intra-and inter-station metro ridership prediction. IEEE Trans. Intell. Transp. Syst. 2020, 23, 3377–3391. [Google Scholar] [CrossRef]
- Sun, D.J.; Guan, S. Measuring vulnerability of urban metro network from line operation perspective. Transp. Res. Part A Policy Pract. 2016, 94, 348–359. [Google Scholar] [CrossRef]
- Silva, R.; Kang, S.M.; Airoldi, E.M. Predicting traffic volumes and estimating the effects of shocks in massive transportation systems. Proc. Natl. Acad. Sci. USA 2015, 112, 5643–5648. [Google Scholar] [CrossRef] [PubMed]
- Yap, M.; Cats, O. Predicting disruptions and their passenger delay impacts for public transport stops. Transportation 2021, 48, 1703–1731. [Google Scholar] [CrossRef]
- Cui, H.; Si, B.; Wang, J.; Zhao, B.; Pan, W. Short-term origin–destination flow prediction for urban rail network: A deep learning method based on multi-source big data. Complex Intell. Syst. 2024, 10, 4675–4696. [Google Scholar] [CrossRef]
- Xu, X.; Zhang, K.; Mi, Z.; Wang, X. Short-term passenger flow prediction during station closures in subway systems. Expert Syst. Appl. 2024, 236, 121362. [Google Scholar] [CrossRef]
- Ni, M.; He, Q.; Gao, J. Forecasting the subway passenger flow under event occurrences with social media. IEEE Trans. Intell. Transp. Syst. 2016, 18, 1623–1632. [Google Scholar] [CrossRef]
- Zhang, D.-m.; Du, F.; Huang, H.; Zhang, F.; Ayyub, B.M.; Beer, M. Resiliency assessment of urban rail transit networks: Shanghai metro as an example. Saf. Sci. 2018, 106, 230–243. [Google Scholar] [CrossRef]
- Zhang, N.; Graham, D.J.; Bansal, P.; Hörcher, D. Detecting metro service disruptions via large-scale vehicle location data. Transp. Res. Part C Emerg. Technol. 2022, 144, 103880. [Google Scholar] [CrossRef]
- Zhang, N.; Graham, D.J.; Hörcher, D.; Bansal, P. A causal inference approach to measure the vulnerability of urban metro systems. Transportation 2021, 48, 3269–3300. [Google Scholar] [CrossRef]
- Su, G.; Si, B.; Zhi, K.; Zhao, B.; Zheng, X. Simulation-based method for the calculation of passenger flow distribution in an urban rail transit network under interruption. Urban Rail Transit 2023, 9, 110–126. [Google Scholar] [CrossRef]
- Zeng, J.; Zhang, G.; Rong, C.; Ding, J.; Yuan, J.; Li, Y. Causal learning empowered OD prediction for urban planning. In Proceedings of the 31st ACM International Conference on Information & Knowledge Management, Atlanta, GA, USA, 17–21 October 2022; pp. 2455–2464. [Google Scholar]
- Zheng, X.; Aragam, B.; Ravikumar, P.K.; Xing, E.P. Dags with no tears: Continuous optimization for structure learning. Adv. Neural Inf. Process. Syst. 2018, 31, 9492–9503. [Google Scholar] [CrossRef]
Variable | Type | Unit | Mean | Std | Min | Max |
---|---|---|---|---|---|---|
URT network structure factors | ||||||
Degree | Discrete | - | 2.855 | 1.352 | 1.000 | 8.000 |
Betweenness | Continuous | - | 0.047 | 0.051 | 0.000 | 0.256 |
Station-level built environment factors | ||||||
Land use entropy | Continuous | - | 0.895 | 0.114 | 0.348 | 1.172 |
Job accessibility | Continuous | hundred jobs | 3.364 | 2.851 | 0.037 | 12.399 |
Job–housing balance level | Continuous | - | 1.209 | 0.980 | 0.210 | 7.424 |
University dummy | Discrete | - | 0.224 | 0.417 | 0.000 | 1.000 |
CBD dummy | Discrete | - | 0.304 | 0.460 | 0.000 | 1.000 |
Transportation hub dummy | Discrete | - | 0.151 | 0.358 | 0.000 | 1.000 |
Bus connectivity | Discrete | stations | 6.769 | 7.889 | 0.000 | 56.000 |
Incident andexternal factors | ||||||
Disruption duration | Continuous | minutes | 27.398 | 14.035 | 5.500 | 67.200 |
Power incident dummy | Discrete | - | 0.100 | 0.300 | 0.000 | 1.000 |
Intrusion incident dummy | Discrete | - | 0.142 | 0.349 | 0.000 | 1.000 |
Line incident dummy | Discrete | - | 0.174 | 0.380 | 0.000 | 1.000 |
Signal incident dummy | Discrete | - | 0.224 | 0.417 | 0.000 | 1.000 |
Train incident dummy | Discrete | - | 0.249 | 0.433 | 0.000 | 1.000 |
Door incident dummy | Discrete | - | 0.111 | 0.314 | 0.000 | 1.000 |
Weekday dummy | Discrete | - | 0.184 | 0.388 | 0.000 | 1.000 |
Peak-hour dummy | Discrete | - | 0.394 | 0.489 | 0.000 | 1.000 |
Accident station dummy | Discrete | - | 0.273 | 0.446 | 0.000 | 1.000 |
Weather dummy | Discrete | - | 0.405 | 0.491 | 0.000 | 1.000 |
URT ridership | ||||||
Non-incident URT ridership | Continuous | hundred people/hour | 25.507 | 32.976 | 0.770 | 245.570 |
Incident URT ridership decline | Continuous | hundred people/hour | 10.737 | 18.594 | 0.058 | 216.155 |
Treatment Variable | Outcome Variable | Average Treatment Effect |
---|---|---|
Land use entropy | Bus connectivity | −1.128 |
Job accessibility | Bus connectivity | 0.813 |
Job–housing balance level | Bus connectivity | 1.357 |
University dummy | Bus connectivity | 4.788 |
CBD dummy | Bus connectivity | 1.250 |
Transportation hub dummy | Bus connectivity | 4.655 |
Land use entropy | Non-incident URT ridership | 16.558 |
Job accessibility | Non-incident URT ridership | 3.788 |
Job–housing balance level | Non-incident URT ridership | 6.477 |
University dummy | Non-incident URT ridership | 2.441 |
CBD dummy | Non-incident URT ridership | 8.955 |
Transportation hub dummy | Non-incident URT ridership | 19.556 |
Weather dummy | Non-incident URT ridership | −3.280 |
Power incident dummy | Disruption duration | 40.552 |
Intrusion incident dummy | Disruption duration | 25.334 |
Line incident dummy | Disruption duration | 17.974 |
Signal incident dummy | Disruption duration | 2.038 |
Train incident dummy | Disruption duration | −2.509 |
Door incident dummy | Disruption duration | −24.286 |
Weekday dummy | Disruption duration | 11.37 |
Peak-hour dummy | Disruption duration | 11.028 |
Accident station dummy | Disruption duration | 20.018 |
Weather dummy | Disruption duration | −12.448 |
Degree | Incident URT ridership decline | −0.135 |
Betweenness | Incident URT ridership decline | 77.991 |
Bus connectivity | Incident URT ridership decline | −0.304 |
Non-incident URT ridership | Incident URT ridership decline | 0.467 |
Disruption duration | Incident URT ridership decline | 0.761 |
Built Environment Conditions | CATE of the Low Group | CATE of the Medium Group | CATE of the High Group |
---|---|---|---|
Land use entropy | 0.407 | 0.454 | 0.519 |
Job accessibility | 0.441 | 0.463 | 0.488 |
Job–housing balance level | 0.423 | 0.471 | 0.495 |
Built Environment Conditions | Treatment Group (Dummy = 1) | Control Group (Dummy = 0) |
---|---|---|
University dummy | 0.477 | 0.462 |
CBD dummy | 0.485 | 0.446 |
Transportation hub dummy | 0.527 | 0.391 |
Built Environment Conditions | CATE of the Low Group | CATE of the Medium Group | CATE of the High Group |
---|---|---|---|
Land use entropy | −0.267 | −0.339 | −0.310 |
Job accessibility | −0.299 | −0.305 | −0.312 |
Job–housing balance level | −0.287 | −0.301 | −0.323 |
Built Environment Conditions | Treatment Group (Dummy = 1) | Control Group (Dummy = 0) |
---|---|---|
University dummy | −0.341 | −0.264 |
CBD dummy | −0.315 | −0.282 |
Transportation hub dummy | −0.339 | −0.272 |
Incident and External Factors Conditions | Treatment Group (Dummy = 1) | Control Group (Dummy = 0) |
---|---|---|
Power incident dummy | 1.117 | 0.536 |
Intrusion incident dummy | 1.029 | 0.524 |
Line incident dummy | 0.997 | 0.505 |
Signal incident dummy | 0.919 | 0.633 |
Train incident dummy | 0.728 | 0.774 |
Door incident dummy | 0.680 | 0.793 |
Weekday dummy | 0.825 | 0.732 |
Peak-hour dummy | 0.821 | 0.745 |
Accident station dummy | 0.986 | 0.671 |
Weather dummy | 0.702 | 0.793 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Fan, Q.; Yu, C.; Zuo, J. The Nonlinear Causal Effect Estimation of the Built Environment on Urban Rail Transit Station Flow Under Emergency. Sustainability 2025, 17, 5829. https://doi.org/10.3390/su17135829
Fan Q, Yu C, Zuo J. The Nonlinear Causal Effect Estimation of the Built Environment on Urban Rail Transit Station Flow Under Emergency. Sustainability. 2025; 17(13):5829. https://doi.org/10.3390/su17135829
Chicago/Turabian StyleFan, Qianqi, Chengcheng Yu, and Jianyong Zuo. 2025. "The Nonlinear Causal Effect Estimation of the Built Environment on Urban Rail Transit Station Flow Under Emergency" Sustainability 17, no. 13: 5829. https://doi.org/10.3390/su17135829
APA StyleFan, Q., Yu, C., & Zuo, J. (2025). The Nonlinear Causal Effect Estimation of the Built Environment on Urban Rail Transit Station Flow Under Emergency. Sustainability, 17(13), 5829. https://doi.org/10.3390/su17135829