Research on Arch Dam Deformation Safety Early Warning Method Based on Effect Separation of Regional Environmental Variables and Knowledge-Driven Approach
Abstract
1. Introduction
2. Deformation Partitioning of Arch Dams Based on Optimized ISODATA Clustering
2.1. Selection of Initial Clustering Centers
- (1)
- Randomly select a sample from the sample set as the first initial cluster center;
- (2)
- For each remaining sample , first calculate the distance between it and the existing cluster centers, record the shortest distance, and represent it as ;
- (3)
- For each sample , its probability of being selected as the next cluster center is . Based on this probability, the cluster center for this round is selected. The larger the sample. , the higher its probability of being selected;
- (4)
- Return to step (2) until the selection of initial clustering centers is made.
2.2. High-Dimensional Feature Extraction of Arch Dam Deformation Monitoring Curves Based on the Kernel Method
2.3. Sensitivity Analysis of the Initial Cluster Number K in the K–L–ISODATA
3. PPCA-Based Expression and Isolation of Environmental Impacts
3.1. The Relationship Between Environmental Variables and Dam Effect Quantities
3.2. PPCA-Based Expression and Separation of the Effects of Environmental Variables in Arch Dams
- (1)
- Sort all feature values in descending order and plot them on the x-axis and y-axis with numbers. The number of the feature value corresponding to the location of the mutation point in the graph is the value of z. This method is mainly based on the understanding that the effects of noise and environmental variables are fundamentally different, and sudden changes occur in the eigenvalue spectrum;
- (2)
- Calculate the cumulative contribution of each PPC by arranging all eigenvalues in descending order.
4. Knowledge-Driven Knowledge-Based Monitoring of Deformation Properties of Arch Dams
4.1. Using KDE to Determine the Form of Distribution and Control Limits of the SPE
4.2. Anomaly Diagnosis Using Contribution Maps
4.3. Knowledge-Driven Arch Dam Deformation Safety Warning and Outlier Causation Diagnosis
- (1)
- The control chart is in mode (a), indicating normal monitoring data.
- (2)
- The control chart is model (b), and the monitoring data are normal.
- (3)
- The control chart is for mode (c), the contribution chart is for mode I, and the monitoring data is anomalous for a short period, possibly due to extreme environmental quantities.
- (4)
- The control chart is for mode (c), and the contribution chart is for mode II, which may be the result of a malfunction of an individual instrument that returns to normal.
- (5)
- The control chart is for mode (d), and the contribution chart is for mode I. The monitoring data is abnormal and shows a trend of growth. If there are no extreme environmental variables, it may be due to significant temporal changes in the dam structure, which should be noted.
- (6)
- The control chart is for mode (d), and the contribution chart for mode II has shown a trend of growth, and monitoring data anomalies may be caused by individual instrument failures.
5. A Method for Deformation Safety Warning and Anomaly Diagnosis of Arch Dams Based on the Separation of Environmental Variable Effects and Knowledge-Driven Approach
- (1)
- Data acquisition: Extract all the monitoring data of the deformation monitoring program from the arch dam safety monitoring database.
- (2)
- Coarse error removal and data normalization: Remove erroneous extreme value points, count the dates of all monitoring data for all measurement points, organize the monitoring data for each measurement point on this basis, and replace them with null values if they are missing.
- (3)
- Spatial clustering: Optimized ISODATA clustering was used to classify measurement points into multiple categories, which in turn classified the dam into multiple regions.
- (4)
- Partition PPCA: The maximum expectation algorithm was used to reconstruct the missing sequences, and the PPCA was used to calculate the PPCs of multiple measurement points in each partition.
- (5)
- Separation of influence of environmental quantity and noise: According to the inflection point or cumulative contribution rate of eigenvalue sequence, z PPCs that can best describe the influence of environmental variables on arch dam displacement are obtained. The residual component is noise impact or abnormal monitoring.
- (6)
- Offline monitoring: Use reference data to calculate the norm, and use the KDE estimated probability density function to calculate diagnostic criteria for monitoring quantity.
- (7)
- Online monitoring: Standardize new monitoring data using the mean and variance of reference data, calculate the corresponding norm of the data, and compare it with diagnostic criteria.
- (8)
- Outlier location: If the norm of the measured data exceeds the control limit, a contribution graph is used to locate the outlier and analyze the cause of the anomaly.
6. Engineering Examples
6.1. Project Overview
6.2. Spatial Clustering of Deformation Monitoring Points of a Concrete Arch Dam
6.3. Partitioned Principal Component Analysis
6.4. Offline Analysis
6.4.1. Partitioned Offline Analysis
- (1)
- Affected by the lack of insulation board on the downstream surface, the seasonal variation amplitude of the five measuring points in partition 1 increased significantly from July 2013 to December 2014. Among them, PL2-1, PL3-1, PL4-1, and PL5-1 are also accompanied by an obvious growth trend, while PL6-1 is stable, and only the phenomenon of seasonal displacement increases. After the insulation board was repaired in 2015, the seasonal displacement of each measuring point decreased significantly.
- (2)
- Due to the lack of monitoring data for about 4 months at the PL3-1 measuring point during the water level rise in 2012, when using the traditional PCA method to extract the main city, only the measured values at the corresponding time of the other four measuring points can be deleted, resulting in a large number of missing monitoring information. The PPCA method can extract the PPCs of multiple measurement points with partial missing data, and well retain the monitoring information during water level rise.
- (3)
- The SPE distribution of reference data extracted by the PPCA method is more concentrated, and its control limits UCL1 and UCL2 are slightly smaller than the results of the PCA method.
- (1)
- From the control chart, it can be seen that the dam crest displacement was gradually affected by the failure of the insulation board and increased significantly in November 2013, February to April 2014, and June to September 2014, respectively. It can be seen that the abnormal displacement of the arch dam affected by temperature can be divided into three stages in the 18 months of aging of the insulation board, and the abnormal displacement in the three stages is getting larger and larger.
- (2)
- From July 2013 to November 2013, the insulation board just failed and the temperature gradually decreased. At this time, the dam crest still did not show obvious abnormalities. The control diagram of the dam crest area belongs to class (c) in Figure 4, and the system does not need to alarm, but the SPE value of the PPCA method is closer to UCL1 and more sensitive. The period from November 2013 to March 2014 belongs to category (d) in Figure 4, that is, the system should alarm in November 2013, and the SPE value of the PPCA method is higher and also more sensitive. From April 2014 to December 2014, the repair of the insulation board was gradually completed, but the dam deformation was still affected by the failure of the insulation board in the early stages. At this time, the SPE value of the PPCA method was slightly less than that of the PCA method, which reflected the role of insulation board repair construction.
- (3)
- From the contribution diagram, the two methods are quite different. Since the PL6-1 measuring point is not within the influence range of the loss of insulation board, both methods show that the SPE contribution of the PL6-1 measuring point is the most obvious, indicating that there is a large gap between the deformation law of this measuring point and other measuring points, which is consistent with the above analysis.
6.4.2. Offline Analysis for All Monitoring Points
6.5. Online Monitoring
6.5.1. Online Monitoring of Partitions
- (1)
- Both PPC1 and PC1 can completely encompass the maximum and minimum values of each measurement point over the years, and the growth trend and the amplitude of cyclic displacement are greater.
- (2)
- PPCA can fully utilize the available information to extract the effects of environmental variables. Since PCA needs to extract PCs based on the data of the public date of each measurement point, only 418 sets of data are available. In contrast, PPCA can estimate the PPCs of the missing segments based on the probability distribution of the original data, so that 540 sets of partially missing monitoring data can be used.
- (3)
- Because the PPCA utilizes a portion of the monitoring data from the 2011 water level rise period, the overall trend shows a more pronounced increase in the PPC1 than in the PC1.
6.5.2. Online Monitoring for All Measuring Points
6.6. Discussion
7. Conclusions
- (1)
- By optimizing the selection of initial cluster centers and extracting high-dimensional features of arch dam deformation monitoring curves based on the kernel method, the traditional ISODATA is optimized, which effectively solves the problems of uncertain cluster number and difficult-to-capture high-dimensional features of data. The deformation monitoring points of the arch dam are divided into multiple categories, and then the dam body is divided into multiple regions, which provides the basis for the subsequent zoning monitoring.
- (2)
- PPCA can deal with the missing problem in the monitoring data of multiple monitoring points, calculate the missing data through the correlation relationship, and extract the PPCs representing the overall performance of the dam, which effectively retains the monitoring information and can better separate the impact of environmental variables and noise interference.
- (3)
- The displacement law of all measuring points is more complex, which is difficult to characterize by fewer PCs or PPCs. The SPE control limit of all measuring points is significantly higher than that of the partition. Moreover, there are great differences in the expression of the impact of arch dam environmental variables in each region of the arch dam. Including all measuring points in the analysis will seriously affect the separation results of the impact of environmental variables, and it is easy to misjudge the safety state of the arch dam.
- (4)
- By analyzing the different mode combinations of the control chart and contribution chart of the SPE norm, we can analyze the anomalies of offline data and online data. The control chart can quickly analyze the overall safety status of each area of the dam, and the contribution chart can more accurately analyze the abnormal conditions of different measuring points in the area.
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Zhao, E.; Wu, C. Centroid Deformation-Based Nonlinear Safety Monitoring Model for Arch Dam Performance Evaluation. Eng. Struct. 2021, 243, 112652. [Google Scholar] [CrossRef]
- Yang, G. Deformation Similarity Characteristics-Considered Hybrid Panel Model for Multi-Point Deformation Monitoring of Super-High Arch Dams in Operating Conditions. Measurement 2022, 192, 110908. [Google Scholar] [CrossRef]
- Ma, C.; Chen, L.; Yang, K.; Yang, J.; Tu, Y.; Cheng, L. Intelligent Calibration Method for Microscopic Parameters of Soil—rock Mixtures Based on Measured Landslide Accumulation Morphology. Comput. Methods Appl. Mech. Eng. 2024, 422, 116835. [Google Scholar] [CrossRef]
- Ghannadi, P.; Kourehli, S.S.; Nguyen, A.; Oterkus, E. Letter to the Editor: A brief insight into the NDT in the UK. e-J. Nondestruct. Test. 2024, 29. [Google Scholar] [CrossRef] [PubMed]
- Masi, M.; Ferdos, F.; Losito, G.; Solari, L. Monitoring of internal erosion processes by time-lapse electrical resistivity tomography. J. Hydrol. 2020, 590, 125340. [Google Scholar] [CrossRef]
- Dai, Q.; Lin, F.; Wang, X.; Feng, D.; Bayless, R.C. Detection of concrete dam leakage using an integrated geophysical technique based on flow-field fitting method. J. Appl. Geophys. 2017, 140, 168–176. [Google Scholar] [CrossRef]
- Bolzon, G.; Frigerio, A.; Hajjar, M.; Nogara, C.; Zappa, E. Structural health assessment of existing dams based on non-destructive testing, physics-based models and machine learning tools. NDT E Int. 2025, 150, 103271. [Google Scholar] [CrossRef]
- Bigman, D.P.; Day, D.J. Ground penetrating radar inspection of a large concrete spillway: A case study using SFCW GPR at a hydroelectric dam. Case Stud. Constr. Mater. 2022, 16, e00975. [Google Scholar] [CrossRef]
- Innocenti, A.; Pazzi, V.; Napoli, M.; Ciampalini, R. Electrical resistivity tomography: A reliable tool to monitor the efficiency of different irrigation systems in horticulture fields. J. Appl. Geophys. 2024, 230, 105527. [Google Scholar] [CrossRef]
- Zhao, S.; Kang, F.; Li, J.; Ma, C. Structural Health Monitoring and Inspection of Dams Based on UAV Photogrammetry with Image 3D Reconstruction. Autom. Constr. 2021, 130, 103832. [Google Scholar] [CrossRef]
- Li, M.; Li, M.; Ren, Q.; Li, H.; Song, L. DRLSTM: A Dual-Stage Deep Learning Approach Driven by Raw Monitoring Data for Dam Displacement Prediction. Adv. Eng. Inform. 2022, 51, 101510. [Google Scholar] [CrossRef]
- Xiao, S.; Cheng, L.; Ma, C.; Yang, J.; Xu, X.; Chen, J. An Adaptive Identification Method for Outliers in Dam Deformation Monitoring Data Based on Bayesian Model Selection and Least Trimmed Squares Estimation. J. Civil. Struct. Health Monit. 2024, 14, 763–779. [Google Scholar] [CrossRef]
- Wang, S.; Gu, C.; Liu, Y.; Gu, H.; Xu, B.; Wu, B. Displacement Observation Data-Based Structural Health Monitoring of Concrete Dams: A State-of-Art Review. Structures 2024, 68, 107072. [Google Scholar] [CrossRef]
- Gu, C.; Fu, X.; Shao, C.; Shi, Z.; Su, H. Application of spatiotemporal hybrid model of deformation in safety monitoring of high arch dams: A case study. Int. J. Environ. Res. Public Health 2020, 17, 319. [Google Scholar] [CrossRef]
- Ren, Q.; Li, M.; Kong, T.; Ma, J. Multi-Sensor Real-Time Monitoring of Dam Behavior Using Self-Adaptive Online Sequential Learning. Autom. Constr. 2022, 140, 104365. [Google Scholar] [CrossRef]
- Shao, C.F.; Gu, C.S.; Yang, M.; Xu, Y.X.; Su, H.Z. A novel model of dam displacement based on panel data. Struct. Control Health Monit. 2018, 25, e2037. [Google Scholar] [CrossRef]
- Wang, S.; Xu, C.; Liu, Y.; Gu, H.; Xu, B.; Hu, K. Spatial Association-Considered Real-Time Risk Rate Assessment of High Arch Dams Using Observed Displacement and Combination Prediction Model. Structures 2023, 53, 1108–1121. [Google Scholar] [CrossRef]
- Chen, B.; Hu, T.Y.; Huang, Z.S.; Fang, C.H. A spatio-temporal clustering and diagnosis method for concrete arch dams using deformation monitoring data. Struct. Health Monit. 2019, 18, 1355–1371. [Google Scholar] [CrossRef]
- Cao, W.; Wen, Z.; Su, H. Spatiotemporal Clustering Analysis and Zonal Prediction Model for Deformation Behavior of Super-High Arch Dams. Expert Syst. Appl. 2023, 216, 119439. [Google Scholar] [CrossRef]
- Cheng, M.Y.; Cao, M.T.; Huang, I.F. Hybrid artificial intelligence-based inference models for accurately predicting dam body displacements: A case study of the Fei Tsui dam. Struct. Health Monit. 2021, 21, 1738–1756. [Google Scholar] [CrossRef]
- Xu, X.; Yang, J.; Ma, C.; Qu, X.; Chen, J.; Cheng, L. Segmented modeling method of dam displacement based on BEAST time series decomposition. Measurement 2022, 202, 111811. [Google Scholar] [CrossRef]
- Ren, Q.; Li, H.; Li, M.; Zhang, J.; Kong, T. Towards Online Monitoring of Concrete Dam Displacement Subject to Time-Varying Environments: An Improved Sequential Learning Approach. Adv. Eng. Inform. 2023, 55, 101881. [Google Scholar] [CrossRef]
- Li, B.; Ning, J.; Yang, S.; Zhang, L. Prediction Model for High Arch Dam Stress during the Operation Period Using LightGBM with MSSA and SHAP. Adv. Eng. Softw. 2024, 192, 103635. [Google Scholar] [CrossRef]
- Zhang, K.; Gu, C.; Zhu, Y.; Li, Y.; Shu, X. A Mathematical-Mechanical Hybrid Driven Approach for Determining the Deformation Monitoring Indexes of Concrete Dam. Eng. Struct. 2023, 277, 115353. [Google Scholar] [CrossRef]
- Rong, Z.; Pang, R.; Xu, B.; Zhou, Y. Dam Safety Monitoring Data Anomaly Recognition Using Multiple-Point Model with Local Outlier Factor. Autom. Constr. 2024, 159, 105290. [Google Scholar] [CrossRef]
- Cheng, L.; Zheng, D. Two Online Dam Safety Monitoring Models Based on the Process of Extracting Environmental Effect. Adv. Eng. Softw. 2013, 57, 48–56. [Google Scholar] [CrossRef]
- Yang, G.; Zhao, A.; Sun, J.; Niu, J.; Zhang, J.; Wang, L. Progressive Failure Process-Considered Deformation Safety Diagnosis Method for in-Service High Arch Dam. Eng. Fail. Anal. 2024, 163, 108570. [Google Scholar] [CrossRef]
- Zhan, M.; Chen, B.; Wu, Z. Deformation Warning Index for Reinforced Concrete Dam Based on Structural Health Monitoring Data and Numerical Simulation. Water Sci. Eng. 2023, 16, 408–418. [Google Scholar] [CrossRef]
- Chen, W.; Wang, X.; Tong, D.; Cai, Z.; Zhu, Y.; Liu, C. Dynamic Early-Warning Model of Dam Deformation Based on Deep Learning and Fusion of Spatiotemporal Features. Knowl.-Based Syst. 2021, 233, 107537. [Google Scholar] [CrossRef]
- Pan, W. Characterization Model Research on Deformation of Arch Dam Based on Correlation Analysis Using Monitoring Data. Mathematics 2024, 12, 3110. [Google Scholar] [CrossRef]
- Chen, R.; Wu, Z. Construction and selection of deformation monitoring model for high arch dam using separate modeling technique and composite decision criterion. Struct. Health Monit. 2024, 23, 2509–2530. [Google Scholar] [CrossRef]
- Wang, H.; Yi, Z.; Xu, Y.; Cai, Q.; Li, Z.; Wang, H.; Bai, X. Data-driven distributionally robust optimization approach for the coordinated dispatching of the power system considering the correlation of wind power. Electr. Power Syst. Res. 2024, 230, 110224. [Google Scholar] [CrossRef]
- Ma, Y.; Huang, Y.; Yuan, Y. The total factor characteristics evaluation of photovoltaic power by coarse-fine-grained method. Sustain. Energy Grids Netw. 2024, 38, 101371. [Google Scholar] [CrossRef]
- Zhu, M.; Chen, B.; Gu, C.; Wu, Y.; Chen, W. Optimized Multi-Output LSSVR Displacement Monitoring Model for Super High Arch Dams Based on Dimensionality Reduction of Measured Dam Temperature Field. Eng. Struct. 2022, 268, 114686. [Google Scholar] [CrossRef]
- Yu, H.; Wu, Z.; Bao, T.; Zhang, L. Multivariate Analysis in Dam Monitoring Data with PCA. Sci. China Technol. Sci. 2010, 53, 1088–1097. [Google Scholar] [CrossRef]
- Zhu, D. Application of SVM model based on probabilistic statistical analysis in dam deformation prediction. Urban Geotech. Investig. Surv. 2015, 125–128. [Google Scholar]
- Loh, C.H.; Weng, J.H.; Chen, C.H.; Chang, Y.W. Feature extraction within the Fei-Tsui arch dam under environmental variations. In Proceedings of the IUTAM Symposium on Nonlinear Stochastic Dynamics and Control, Hangzhou, China, 10–14 May 2010; Springer Press: Dordrecht, The Netherlands, 2011; pp. 45–54. [Google Scholar]
- Matrtin, E.B.; Morris, A.J. Non-parametric confidence bounds for process performance monitoring charts. J. Stat. Process Control Charts 1996, 6, 349–358. [Google Scholar] [CrossRef]























| Number of PCs | Partition 1 | Partition 2 | Partition 3 | All Measurement Points | ||||
|---|---|---|---|---|---|---|---|---|
| PPC | PC | PPC | PC | PPC | PC | PPC | PC | |
| 1 | 0.9645 | 0.9508 | 0.9955 | 0.9872 | 0.9861 | 0.9839 | 0.9463 | 0.9276 |
| 2 | 0.9869 | 0.9706 | 0.9983 | 0.9942 | 0.9954 | 0.9956 | 0.9897 | 0.9898 |
| UCL | PCA | PPCA |
|---|---|---|
| ) | 0.4517 | 0.4311 |
| ) | 0.6255 | 0.5822 |
| PCA | 4PPCA | |
|---|---|---|
| UCL1 | 0.3826 | 0.4282 |
| UCL2 | 0.4536 | 0.4553 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Wang, J.; Tong, F.; Gao, Z.; Cheng, L.; Zhao, S. Research on Arch Dam Deformation Safety Early Warning Method Based on Effect Separation of Regional Environmental Variables and Knowledge-Driven Approach. Water 2025, 17, 3217. https://doi.org/10.3390/w17223217
Wang J, Tong F, Gao Z, Cheng L, Zhao S. Research on Arch Dam Deformation Safety Early Warning Method Based on Effect Separation of Regional Environmental Variables and Knowledge-Driven Approach. Water. 2025; 17(22):3217. https://doi.org/10.3390/w17223217
Chicago/Turabian StyleWang, Jianxue, Fei Tong, Zhiwei Gao, Lin Cheng, and Shuaiyin Zhao. 2025. "Research on Arch Dam Deformation Safety Early Warning Method Based on Effect Separation of Regional Environmental Variables and Knowledge-Driven Approach" Water 17, no. 22: 3217. https://doi.org/10.3390/w17223217
APA StyleWang, J., Tong, F., Gao, Z., Cheng, L., & Zhao, S. (2025). Research on Arch Dam Deformation Safety Early Warning Method Based on Effect Separation of Regional Environmental Variables and Knowledge-Driven Approach. Water, 17(22), 3217. https://doi.org/10.3390/w17223217

