Estimating Urban Travel Intensity from Ambient Seismic Signals via a Hybrid CatBoost–LSTM Framework
Abstract
1. Introduction
2. Materials and Methods
3. Results
4. Discussion
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Deng, Z.; Ciais, P.; Tzompa-Sosa, Z.A.; Saunois, M.; Qiu, C.J.; Tan, C.; Sun, T.C.; Ke, P.Y.; Cui, Y.N.; Tanaka, K.; et al. Comparing national greenhouse gas budgets reported in UNFCCC inventories against atmospheric inversions. Earth Syst. Sci. Data 2022, 14, 1639–1675. [Google Scholar] [CrossRef]
- Kongboon, R.; Gheewala, S.H.; Sampattagul, S. Greenhouse gas emissions inventory data acquisition and analytics for low carbon cities. J. Clean. Prod. 2022, 343, 130711. [Google Scholar] [CrossRef]
- Wang, J.; Azam, W. Natural resource scarcity, fossil fuel energy consumption, and total greenhouse gas emissions in top emitting countries. Geosci. Front. 2024, 15, 101757. [Google Scholar] [CrossRef]
- Yangtianzheng, Z.; Ying, G. Spatial patterns and trends of inter-city population mobility in China—Based on Baidu migration big data. Cities 2024, 151, 105124. [Google Scholar] [CrossRef]
- Alshahrani, R.; Babour, A. An Infodemiology and Infoveillance Study on COVID-19: Analysis of Twitter and Google Trends. Sustainability 2021, 13, 8528. [Google Scholar] [CrossRef]
- Chen, Q.-F.; Li, L.; Li, G.; Chen, L.; Peng, W.-T.; Tang, Y.; Chen, Y.; Wang, F.-Y. Seismic features of vibration induced by train. Acta Seismol. Sin. 2004, 17, 715–724. [Google Scholar] [CrossRef]
- Riahi, N.; Gerstoft, P. The seismic traffic footprint: Tracking trains, aircraft, and cars seismically. Geophys. Res. Lett. 2015, 42, 2674–2681. [Google Scholar] [CrossRef]
- Scafetta, N.; Mazzarella, A. Cultural noise and the night-day asymmetry of the seismic activity recorded at the Bunker-East (BKE) Vesuvian Station. J. Volcanol. Geotherm. Res. 2018, 349, 117–127. [Google Scholar] [CrossRef]
- Diaz, J.; Ruiz, M.; Sanchez-Pastor, P.S.; Romero, P. Urban seismology: On the origin of earth vibrations within a city. Sci. Rep. 2017, 7, 15296. [Google Scholar] [CrossRef]
- Chen, Y.K.; Savvaidis, A.; Fomel, S.; Saad, O.M.; Chen, Y.F. RFloc3D: A machine-learning method for 3-D microseismic source location using P- and S-wave arrivals. IEEE Trans. Geosci. Remote Sens. 2023, 61, 5901310. [Google Scholar] [CrossRef]
- Elsayed, H.S.; Saad, O.M.; Soliman, M.S.; Chen, Y.K.; Youness, H.A. EQConvMixer: A deep-learning approach for earthquake location from single-station waveforms. IEEE Geosci. Remote Sens. Lett. 2023, 20, 7504905. [Google Scholar] [CrossRef]
- Min, R.; Chen, Y.F.; Wang, H.; Chen, Y.K. DAS vehicle signal extraction using machine learning in urban traffic monitoring. IEEE Trans. Geosci. Remote Sens. 2024, 62, 5908510. [Google Scholar] [CrossRef]
- Mousavi, S.M.; Ellsworth, W.L.; Zhu, W.; Chuang, L.Y.; Beroza, G.C. Earthquake transformer-an attentive deep-learning model for simultaneous earthquake detection and phase picking. Nat. Commun. 2020, 11, 3952. [Google Scholar] [CrossRef]
- Zhu, W.; Mousavi, S.M.; Beroza, G.C. Seismic Signal Denoising and Decomposition Using Deep Neural Networks. IEEE Trans. Geosci. Remote Sens. 2019, 57, 9476–9488. [Google Scholar]
- Yang, L.; Liu, X.; Zhu, W.; Zhao, L.; Beroza, G.C. Toward improved urban earthquake monitoring through deep-learning-based noise suppression. Sci. Adv. 2022, 8, eabl3564. [Google Scholar] [CrossRef]
- Diaz, J.; DeFelipe, I.; Ruiz, M.; Andres, J.; Ayarza, P.; Carbonell, R. Identification of natural and anthropogenic signals in controlled source seismic experiments. Sci. Rep. 2022, 12, 3171. [Google Scholar] [CrossRef] [PubMed]
- Diaz, J.; Ventosa, S.; Schimmel, M.; Ruiz, M.; Macau, A.; Gabas, A.; Marti, D.; Akin, O.; Verges, J. Mapping the basement of the Cerdanya Basin (eastern Pyrenees) using seismic ambient noise. Solid Earth 2023, 14, 499–514. [Google Scholar] [CrossRef]
- Diaz, J.; Ruiz, M.; Jara, J.-A. Seismic monitoring of urban activity in Barcelona during the COVID-19 lockdown. Solid Earth 2021, 12, 725–739. [Google Scholar] [CrossRef]
- Zhao, Y.; Li, Y.E.; Nilot, E.; Fang, G. Urban Running Activity Detected Using a Seismic Sensor during COVID-19 Pandemic. Seismol. Res. Lett. 2022, 93, 181–192. [Google Scholar] [CrossRef]
- Dias, F.L.; Assumpção, M.; Peixoto, P.S.; Bianchi, M.B.; Collaço, B.; Calhau, J. Using seismic noise levels to monitor social isolation: An example from Rio de Janeiro, Brazil. Geophys. Res. Lett. 2020, 47, e2020GL088748. [Google Scholar] [CrossRef]
- Grecu, B.; Borleanu, F.; Tiganescu, A.; Poiata, N.; Dinescu, R.; Tataru, D. The effect of 2020 COVID-19 lockdown measures on seismic noise recorded in Romania. Solid Earth 2021, 12, 2351–2368. [Google Scholar] [CrossRef]
- Hayashida, T.; Yoshimi, M.; Suzuki, H.; Mori, S.; Kagawa, T.; Ichii, K.; Yamada, M. Tracking the effect of human activity on MeSO-net noise using seismic data traffic: Did seismic noise in Tokyo truly decrease during the COVID-19 state of emergency? Seismol. Res. Lett. 2023, 94, 2750–2764. [Google Scholar] [CrossRef]
- Li, Y.E.; Nilot, E.A.; Zhao, Y.; Fang, G. Quantifying Urban Activities Using Nodal Seismometers in a Heterogeneous Urban Space. Sensors 2023, 23, 1322. [Google Scholar] [CrossRef]
- Lecocq, T.; Hicks, S.P.; Van Noten, K.; van Wijk, K.; Koelemeijer, P.; De Plaen, R.S.M.; Massin, F.; Hillers, G.; Anthony, R.E.; Apoloner, M.T.; et al. Global quieting of high-frequency seismic noise due to COVID-19 pandemic lockdown measures. Science 2020, 369, 1338–1343. [Google Scholar] [CrossRef]
- Nimiya, H.; Ikeda, T.; Tsuji, T. Temporal changes in anthropogenic seismic noise levels associated with economic and leisure activities during the COVID-19 pandemic. Sci. Rep. 2021, 11, 20439. [Google Scholar] [CrossRef]
- Rahman, S.I.B.A.; Lythgoe, K.; Muktadir, M.G.; Akhter, S.H.; Hubbard, J. Characterization and spatiotemporal variations of ambient seismic noise in eastern Bangladesh. Front. Earth Sci. 2024, 12, 1334248. [Google Scholar] [CrossRef]
- Wilson, D. Broadband seismic background noise at temporary seismic stations observed on a regional scale in the southwestern United States. Bull. Seismol. Soc. Am. 2002, 92, 3335–3342. [Google Scholar] [CrossRef]
- Kotov, A.N.; Agibalov, A.O.; Sentsov, A.A. Low-Frequency Noise Pollution in the Northeastern Part of Mosrentgen (Moscow). Izv. Atmos. Ocean. Phys. 2023, 59, 959–970. [Google Scholar] [CrossRef]
- Saadia, B.; Fotopoulos, G. Characterizing ambient seismic noise in an urban park environment. Sensors 2023, 23, 2446. [Google Scholar] [CrossRef] [PubMed]
- Smith, K.; Tape, C. Seismic Noise in Central Alaska and Influences From Rivers, Wind, and Sedimentary Basins. J. Geophys. Res. Solid Earth 2019, 124, 11678–11704. [Google Scholar] [CrossRef]
- McNamara, D.E. Ambient Noise Levels in the Continental United States. Bull. Seismol. Soc. Am. 2004, 94, 1517–1527. [Google Scholar] [CrossRef]
- Shi, S.; Pain, K.; Chen, X. Looking into mobility in the COVID-19 ‘eye of the storm’: Simulating virus spread and urban resilience in the Wuhan city-region travel flow network. Cities 2022, 126, 103675. [Google Scholar] [CrossRef] [PubMed]
- Sohrabi, C.; Alsafi, Z.; O’Neill, N.; Khan, M.; Kerwan, A.; Al-Jabir, A.; Iosifidis, C.; Agha, R. World Health Organization declares global emergency: A review of the 2019 novel coronavirus (COVID-19). Int. J. Surg. 2020, 76, 71–76. [Google Scholar] [CrossRef]
- Gibney, E. Coronavirus lockdowns have changed the way Earth moves. Nature 2020, 580, 176–177. [Google Scholar] [CrossRef]
- Guo, K.; Li, J.H.; Shi, L. Intelligent urban travel evaluation model driven by seismic data. High Technol. Lett. 2025, 35, 1300–1310. [Google Scholar]
- Chen, T.; Guestrin, C. XGBoost: A scalable tree boosting system. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, 13–17 August 2016; pp. 785–794. [Google Scholar]
- Ke, G.L.; Meng, Q.; Finley, T.; Wang, T.F.; Chen, W.; Ma, W.D.; Ye, Q.W.; Liu, T.Y. LightGBM: A Highly Efficient Gradient Boosting Decision Tree. In Proceedings of the Advances in Neural Information Processing Systems (NeurIPS), Long Beach, CA, USA, 4–9 December 2017; pp. 3140–3148. [Google Scholar]
- Prokhorenkova, L.; Gusev, G.; Vorobev, A.; Dorogush, A.V.; Gulin, A. CatBoost: Unbiased boosting with categorical features. In Proceedings of the 32nd Conference on Neural Information Processing Systems (NeurIPS), Montreal, QC, Canada, 2–8 December 2018. [Google Scholar]






| Use in This Study | Calendar Span | Cities | Stage |
|---|---|---|---|
| Internal forward validation | Training: 1 January 2020–11 February 2020 (42 days) Validation: 12 February 2020–25 February 2020 (14 days) | 10 Hubei cities | Fold 1 |
| Internal forward validation | Training: 1 January 2020–25 February 2020 (56 days) Validation: 26 February 2020–10 March 2020 (14 days) | 10 Hubei cities | Fold 2 |
| Internal forward validation | Training: 1 January 2020–10 March 2020 (70 days) Validation: 11 March 2020–24 March 2020 (14 days) | 10 Hubei cities | Fold 3 |
| Internal forward validation | Training: 1 January 2020–24 March 2020 (84 days) Validation: 25 March 2020–7 April 2020 (14 days) | 10 Hubei cities | Fold 4 |
| Full-period refit after fold selection | 1 January 2020–30 April 2020 | 10 Hubei cities | Final refit |
| Completely unseen-city validation | Same calendar interval after preprocessing and sequence warm-up | 84 non-Hubei cities | External validation |
| Model | Internal RMSE | Internal R2 | Internal Pearson r | Internal MAE | External RMSE | External R2 | External Pearson r | External MAE |
|---|---|---|---|---|---|---|---|---|
| CatBoost-static | 0.982 ± 0.226 | −2.569 ± 3.879 | 0.618 ± 0.123 | 0.708 ± 0.134 | 0.835 ± 0.216 | −0.238 ± 1.281 | 0.572 ± 0.349 | 0.659 ± 0.206 |
| LSTM-only | 0.575 ± 0.143 | 0.371 ± 0.312 | 0.560 ± 0.402 | 0.426 ± 0.096 | 0.804 ± 0.222 | −0.025 ± 0.706 | 0.701 ± 0.256 | 0.658 ± 0.215 |
| FusionA | 0.871 ± 0.171 | −1.443 ± 2.651 | 0.676 ± 0.144 | 0.638 ± 0.082 | 0.865 ± 0.234 | −0.283 ± 1.307 | 0.563 ± 0.359 | 0.682 ± 0.215 |
| FusionB (final UTScan) | 0.537 ± 0.214 | 0.533 ± 0.126 | 0.768 ± 0.076 | 0.403 ± 0.131 | 0.789 ± 0.229 | −0.071 ± 1.128 | 0.605 ± 0.370 | 0.630 ± 0.210 |
| Role in Manuscript | Selected Configuration | Definition | Model |
|---|---|---|---|
| Static comparator | Tree d8; lr 0.05; L2 = 5; 800 iter | Same-day baseline-subtracted PSD bands → actual_t | CatBoost-static |
| Sequence comparator | LSTM h16; 1 layer; batch64; lr 0.001 | Seven-day PSD feature sequence → actual_t | LSTM-only |
| Diagnostic hybrid | Tree d8; lr 0.05; L2 = 3; LSTM h16; 1 layer; batch64; lr 0.001 | Seven-day [baseline, historical actual] sequence → residual_t | FusionA |
| Final model | Tree d8; lr 0.05; L2 = 3; LSTM h16; 1 layer; batch32; lr 0.001 | Seven-day baseline-prediction sequence → actual_t | FusionB (UTScan) |
| Analysis | Settings/Data | Key Quantitative Outcome | Interpretation |
|---|---|---|---|
| Validation-window sensitivity | 7-, 14-, and 21-day forward-validation windows within the 10 Hubei development cities. | 7 d: RMSE 0.339, Pearson 0.737; 14 d: RMSE 0.537, Pearson 0.768; 21 d: RMSE 0.583, Pearson 0.755. | The main conclusions do not depend on a single validation-window choice. |
| Baseline-normalization sensitivity | Final FusionB external evaluation on the 84-city external set; file-based station-specific low-activity baseline versus a percentile-based (P5) alternative. | File baseline: RMSE 0.789, MAE 0.630, Pearson 0.605, bias −0.147; P5 baseline: RMSE 1.122, MAE 0.908, Pearson 0.675, bias −0.639. | Baseline choice affects calibration and absolute error more strongly than rank-order association; the file-based baseline was therefore retained for the main analysis. |
| Transient-outlier robustness | Median aggregation with/without outlier filtering; mean aggregation with outlier filtering on the 84-city external set. | median + filter: RMSE 0.789, Pearson 0.605; median − filter: RMSE 0.798, Pearson 0.608; mean + filter: RMSE 0.748, Pearson 0.625. | Alternative light-touch preprocessing choices changed the external summary metrics only modestly, indicating that isolated spikes were not the dominant driver of the daily predictors. |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Guo, K.; Hou, J. Estimating Urban Travel Intensity from Ambient Seismic Signals via a Hybrid CatBoost–LSTM Framework. Appl. Sci. 2026, 16, 3407. https://doi.org/10.3390/app16073407
Guo K, Hou J. Estimating Urban Travel Intensity from Ambient Seismic Signals via a Hybrid CatBoost–LSTM Framework. Applied Sciences. 2026; 16(7):3407. https://doi.org/10.3390/app16073407
Chicago/Turabian StyleGuo, Kai, and Jianmin Hou. 2026. "Estimating Urban Travel Intensity from Ambient Seismic Signals via a Hybrid CatBoost–LSTM Framework" Applied Sciences 16, no. 7: 3407. https://doi.org/10.3390/app16073407
APA StyleGuo, K., & Hou, J. (2026). Estimating Urban Travel Intensity from Ambient Seismic Signals via a Hybrid CatBoost–LSTM Framework. Applied Sciences, 16(7), 3407. https://doi.org/10.3390/app16073407

