SA-PhyGRU: A Self-Attention-Enhanced Physics-Informed GRU for Structural Seismic Response Prediction with Small Datasets
Abstract
1. Introduction
2. The Foundation of the SA-PhyGRU Model
2.1. Gated Recurrent Unit (GRU)
2.2. Self-Attention Mechanism
2.3. Loss Function
2.4. SA-PhyGRU Network
3. The Numerical Case
3.1. The Dataset Generation
3.2. Numerical Validation
4. The Experimental Case
4.1. The Dataset
4.2. Experimental Validation
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Chen, X.; Wang, H.; Gao, H.; Liu, Y.; Pan, Z.; Song, Q.; Qin, H.; Jiang, Y. Prior-Knowledge-Guided Missing Data Imputation for Bridge Cracks: A Temperature-Driven SP-VMD-CNN-GRU Framework. Buildings 2026, 16, 669. [Google Scholar] [CrossRef]
- Takagi, J.; Wada, A. Higher performance seismic structures for advanced cities and societies. Engineering 2019, 5, 184–189. [Google Scholar] [CrossRef]
- Kazama, M.; Noda, T. Damage statistics (Summary of the 2011 off the Pacific Coast of Tohoku Earthquake damage). Soils Found. 2012, 52, 780–792. [Google Scholar] [CrossRef]
- Huang, X.; Wang, R.; Zhang, X.; Huang, G.; Teng, D.; Zhang, X. Damage Inspection and Seismic Assessment of Lingzhao Xuan in the Palace Museum: A Case Study. Buildings 2024, 14, 3311. [Google Scholar] [CrossRef]
- Kang, X.; Chen, H.; Zhao, G.; Lin, X.; Zheng, L.; Chen, Y.; Liu, Q.; Zhao, Z.; Chen, X.; Wang, F. Nearby Real-Time Earthquake Simulation on an Urban Scale Based on Structural Monitoring. Buildings 2024, 14, 3574. [Google Scholar] [CrossRef]
- Gao, Y.; Xiao, Z.; Gong, Z.; Huang, S.; Zhu, H. Spatiotemporal Deformation Prediction Model for Retaining Structures Integrating ConvGRU and Cross-Attention Mechanism. Buildings 2025, 15, 2537. [Google Scholar] [CrossRef]
- Liu, Y.; Zhang, B.; Wang, T.; Su, T.; Chen, H. Dynamic analysis of multilayer-reinforced concrete frame structures based on NewMark-β method. Rev. Adv. Mater. Sci. 2021, 60, 567–577. [Google Scholar] [CrossRef]
- Geller, R.; Jackson, D.; Kagan, Y.; Mulargia, F. Earthquakes cannot be predicted. Science 1997, 275, 1616–1617. [Google Scholar] [CrossRef]
- Zhang, Y.; Wu, G. Seismic vulnerability analysis of rc bridges based on kriging model. J. Earthq. Eng. 2019, 23, 242–260. [Google Scholar] [CrossRef]
- Zheng, Z.; Tian, Y.; Yang, Z.; Lu, X. Hybrid framework for simulating building collapse and ruin scenarios using finite element method and physics engine. Appl. Sci. 2020, 10, 4408. [Google Scholar] [CrossRef]
- Cha, Y.; Choi, W.; Suh, G. Autonomous Structural Visual Inspection Using Region-Based Deep Learning for Detecting Multiple Damage Types. Comput.-Aided Civ. Infrastruct. Eng. 2018, 33, 731–747. [Google Scholar] [CrossRef]
- Xu, Y.; Lu, X.; Cetiner, B.; Taciroglu, E. Real-time regional seismic damage assessment framework based on long short-term memory neural network. Comput.-Aided Civ. Infrastruct. Eng. 2020, 36, 504–521. [Google Scholar] [CrossRef]
- Li, Y.; Bao, T.; Gao, Z.; Shu, X.; Zhang, K.; Xie, L. A new dam structural response estimation paradigm powered by deep learning and transfer learning techniques. Struct. Health Monit. 2021, 21, 770–787. [Google Scholar] [CrossRef]
- Xu, Y.; Quan, Q.; Zhang, Z. Research on Long-Term Structural Response Time-Series Prediction Method Based on the Informer-SEnet Model. Buildings 2026, 16, 189. [Google Scholar] [CrossRef]
- Liu, D.; Yang, J.; Li, J.; Shen, J.; Zhang, Y.; Chen, L.; Zhou, L. LGSTA-GNN: A Local-Global Spatiotemporal Attention Graph Neural Network for Bridge Structural Damage Detection. Buildings 2026, 16, 348. [Google Scholar] [CrossRef]
- Li, H.; Wang, T.; Gang, W. Dynamic response prediction of vehicle-bridge interaction system using feedforward neural network and deep long short-term memory network. Structures 2021, 34, 2415–2431. [Google Scholar] [CrossRef]
- Bengio, Y.; Simard, P.; Frasconi, P. Learning long-term dependencies with gradient descent is difficult. IEEE Trans. Neural Netw. 1994, 5, 157–166. [Google Scholar] [CrossRef] [PubMed]
- Hochreiter, S.; Schmidhuber, J. Long short-term memory. Neural Comput. 1997, 9, 1735–1780. [Google Scholar] [CrossRef]
- Li, J.; Chen, W.; Fan, G. Structural health monitoring data anomaly detection by transformer enhanced densely connected neural networks. Smart Struct. Syst. 2022, 30, 613. [Google Scholar] [CrossRef]
- Zhang, R.; Zhao, C.; Su, C.; Jing, Z.; Oral, B.; Hao, S. Deep long short-term memory networks for nonlinear structural seismic response prediction. Comput. Struct. 2019, 220, 55–68. [Google Scholar] [CrossRef]
- Huang, P.; Chen, Z. Deep learning for nonlinear seismic responses prediction of subway station. Eng. Struct. 2021, 244, 112735. [Google Scholar] [CrossRef]
- Wu, W.; Ge, S.; Yuan, Y.; Ding, W.; Anastaspoulos, I. Seismic response of subway station in soft soil: Shaking tabletesting versus numerical analysis. Tunn. Undergr. Space Technol. 2020, 100, 103389. [Google Scholar] [CrossRef]
- Vaswani, A.; Shazeer, N.; Parmar, N.; Uszkoreit, J.; Jones, L.; Gomez, A.N.; Kaiser, Ł.; Polosukhin, I. Attention Is All You Need. In Proceedings of the 31st Conference on Neural Information Processing Systems (NIPS 2017), Long Beach, CA, USA, 4–9 December 2017; Advances in Neural Information Processing Systems. pp. 5998–6008. Available online: https://arxiv.org/abs/1706.03762 (accessed on 9 April 2026).
- Li, T.; Pan, Y.; Tong, K.; Ventura, C.; de Silva, C. A multi-scale attention neural network for sensor location selection and nonlinear structural seismic response prediction. Comput. Struct. 2021, 248, 106507. [Google Scholar] [CrossRef]
- Liao, Y.; Lin, R.; Zhang, R.; Wu, G. Attention-based LSTM (AttLSTM) neural network for Seismic Response Modeling of Bridges. Comput. Struct. 2023, 275, 106915. [Google Scholar] [CrossRef]
- Meng, S.; Zhou, Y.; Gao, Z. Refined self-attention mechanism based real-time structural response prediction method under seismic action. Eng. Appl. Artif. Intell. 2024, 129, 107380. [Google Scholar] [CrossRef]
- Hu, S.; Guo, T.; Shahria, A.; Koetaka, Y.; Ghafoori, E.; Karavasilis, T.L. Machine learning in earthquake engineering: A review on recent progress and future trends in seismic performance evaluation and design. Eng. Struct. 2025, 340, 120721. [Google Scholar] [CrossRef]
- Karniadakis, G.E.; Kevrekidis, I.G.; Lu, L.; Perdikaris, P.; Wang, S.; Yang, L. Physics-informed machine learning. Nat. Rev. Phys. 2021, 3, 422−440. [Google Scholar] [CrossRef]
- Xu, Y.; Sara, K.; Jessica, B.; Paolo, G. Physics-informed machine learning for reliability and systems safety applications: State of the art and challenges. Reliab. Eng. Syst. Saf. 2021, 230, 108900. [Google Scholar] [CrossRef]
- Raissi, M.; Perdikaris, P.; Karniadakis, G. Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations. J. Comput. Phys. 2021, 378, 686–707. [Google Scholar] [CrossRef]
- Raissi, M.; Yazdani, A.; Karniadakis, G. Hidden fluid mechanics: Learning velocity and pressure fields from flow visualizations. Science 2020, 367, 1026–1030. [Google Scholar] [CrossRef]
- Li, J.; Chen, Y. A physics-constrained deep residual network for solving the sine-Gordon equation. Commun. Theor. Phys. 2021, 73, 015001. [Google Scholar] [CrossRef]
- Soheil, S.; Martin, T.; Shamim, N.; Majid, J. Dyn Net: Physics-based neural architecture design for nonlinear structural response modeling and prediction. Eng. Struct. 2021, 229, 111582. [Google Scholar] [CrossRef]
- Zhang, R.; Liu, Y.; Sun, H. Physics-informed multi-LSTM networks for metamodeling of nonlinear structures. Comput. Methods Appl. Mech. Eng. 2020, 369, 113226. [Google Scholar] [CrossRef]
- Zhang, R.; Liu, Y.; Sun, H. Physics-guided convolutional neural network (PhyCNN) for data-driven seismic response modeling. Eng. Struct. 2021, 215, 110704. [Google Scholar] [CrossRef]
- Liu, F.; Li, J.; Wang, L. PI-LSTM: Physics-informed long short-term memory network for structural response modeling. Eng. Struct. 2023, 292, 116500. [Google Scholar] [CrossRef]
- Liao, Y.; Tang, H.; Li, R.; Ran, L.; Xie, L. Seismic response prediction and parameters estimation of the frame structure equipped with the base isolation-fluid inerter system (FS-BIFI) based on the PhyLSTM model. Eng. Struct. 2024, 309, 118077. [Google Scholar] [CrossRef]
- Hu, Y.; Tsang, H.H.; Lam, N.; Lumantarna, E. Physics-informed neural networks for enhancing structural seismic response prediction with pseudo-labelling. Arch. Civ. Mech. Eng. 2024, 24, 7. [Google Scholar] [CrossRef]
- Wang, X.W.; Zheng, W.; Wei, Z.H.; Li, S.Q. Explainable AI-Driven Optimal Feature Selection for the Identification of Structural Damage. Struct. Control Health Monit. 2025, 23, 7253150. [Google Scholar] [CrossRef]
- Wang, X.W.; Wang, Z.H.; Yang, S.X.; Wei, S.Q.; Wang, T.L. An incremental broad ensemble learning framework for the identification of structural damage in conditions of data scarcity and non-stationarity. Eng. Struct. 2026, 353, 122261. [Google Scholar] [CrossRef]
- Wang, X.W.; Zhao, Y.H.; Wei, Z.H.; Hu, N. An ultrafast and robust structural damage identification framework enabled by an optimized extreme learning machine. Mech. Syst. Signal Process. 2026, 216, 111509. [Google Scholar] [CrossRef]
- Kyunghyun, C.; Bartvan, M.; Caglar, G.; Dzmitry, B.; Fethi, B.; Holger, S.; Yoshua, B. Learning phrase representations using RNN encoder-decoder for statistical machine translation. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP); Association for Computational Linguistics: Doha, Qatar; pp. 1724–1734. [CrossRef]














| SA-PhyGRU | |
|---|---|
| Physical loss | |
| Data loss | |
| Total loss |
| Datasets | Total Duration | Sample Duration | Sample Number | Proportion |
|---|---|---|---|---|
| Training | 50 s | 0.05 s | 1000 | 10% |
| Testing | 50 s | 0.05 s | 1000 | 90% |
| Response | Metric | GRU | PhyGRU | SA-PhyGRU |
|---|---|---|---|---|
| MAE | 0.0227 | 0.0140 | 0.0125 | |
| Displacement | RMSE | 0.0459 | 0.0287 | 0.0255 |
| 0.8891 | 0.9523 | 0.9724 | ||
| MAE | 0.0896 | 0.0696 | 0.0630 | |
| Velocity | RMSE | 0.1823 | 0.1497 | 0.1375 |
| 0.8979 | 0.9525 | 0.9737 | ||
| MAE | 0.6211 | 0.4225 | 0.3676 | |
| Acceleration | RMSE | 1.2575 | 0.8511 | 0.7421 |
| 0.8528 | 0.9436 | 0.9652 |
| Response | Metric | 30% Dataset | 60% Dataset |
|---|---|---|---|
| MAE | 0.0205 | 0.0142 | |
| Displacement | RMSE | 0.0423 | 0.0286 |
| 0.9134 | 0.9425 | ||
| MAE | 0.0803 | 0.0709 | |
| Velocity | RMSE | 0.1700 | 0.1502 |
| 0.9174 | 0.9478 | ||
| MAE | 0.5371 | 0.4239 | |
| Acceleration | RMSE | 1.0762 | 0.8564 |
| 0.8816 | 0.9361 |
| Response | Metric | SA-PhyGRU | PhyLSTM | LSTM |
|---|---|---|---|---|
| MAE | 0.0125 | 0.0141 | 0.0246 | |
| Displacement | RMSE | 0.0255 | 0.0293 | 0.0483 |
| 0.9724 | 0.9512 | 0.8713 | ||
| MAE | 0.0630 | 0.0688 | 0.0991 | |
| Velocity | RMSE | 0.1375 | 0.1483 | 0.1992 |
| 0.9737 | 0.9575 | 0.8867 | ||
| MAE | 0.3676 | 0.3988 | 0.6134 | |
| Acceleration | RMSE | 0.7421 | 0.8773 | 1.2120 |
| 0.9652 | 0.9521 | 0.8529 |
| Third Floor | Metric | SA-PhyGRU | PI-LSTM | PhyGRU | GRU |
|---|---|---|---|---|---|
| MAE | 0.0007 | 0.001 | 0.0009 | 0.0010 | |
| Displacement | RMSE | 0.0014 | 0.001 | 0.0019 | 0.0021 |
| 0.9734 | 0.965 | 0.9589 | 0.9521 | ||
| MAE | 0.0887 | 0.096 | 0.0994 | 0.1112 | |
| Acceleration | RMSE | 0.1756 | 0.210 | 0.2143 | 0.2357 |
| 0.9551 | 0.937 | 0.9315 | 0.9280 | ||
| Rooftop | |||||
| MAE | 0.0008 | 0.001 | 0.0010 | 0.0012 | |
| Displacement | RMSE | 0.0016 | 0.002 | 0.0020 | 0.0023 |
| 0.9806 | 0.930 | 0.9482 | 0.9213 | ||
| MAE | 0.1935 | 0.145 | 0.2633 | 0.3478 | |
| Acceleration | RMSE | 0.3956 | 0.280 | 0.5026 | 0.6783 |
| 0.9536 | 0.977 | 0.9228 | 0.8796 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Gan, C.-W.; Li, B.; Wang, Y.-Y.; Yang, D. SA-PhyGRU: A Self-Attention-Enhanced Physics-Informed GRU for Structural Seismic Response Prediction with Small Datasets. Buildings 2026, 16, 1738. https://doi.org/10.3390/buildings16091738
Gan C-W, Li B, Wang Y-Y, Yang D. SA-PhyGRU: A Self-Attention-Enhanced Physics-Informed GRU for Structural Seismic Response Prediction with Small Datasets. Buildings. 2026; 16(9):1738. https://doi.org/10.3390/buildings16091738
Chicago/Turabian StyleGan, Cheng-Wu, Bo Li, Yao-Yue Wang, and Dong Yang. 2026. "SA-PhyGRU: A Self-Attention-Enhanced Physics-Informed GRU for Structural Seismic Response Prediction with Small Datasets" Buildings 16, no. 9: 1738. https://doi.org/10.3390/buildings16091738
APA StyleGan, C.-W., Li, B., Wang, Y.-Y., & Yang, D. (2026). SA-PhyGRU: A Self-Attention-Enhanced Physics-Informed GRU for Structural Seismic Response Prediction with Small Datasets. Buildings, 16(9), 1738. https://doi.org/10.3390/buildings16091738
