Author Contributions
Conceptualization, J.Z., N.J. and C.H.; methodology, J.Z., N.J. and C.H.; software, J.Z., N.J. and C.H.; validation, J.Z., C.H. and M.X.; investigation, M.X.; resources, J.Z., C.H. and M.X.; data curation, J.Z.; writing—original draft preparation, J.Z.; writing—review and editing, J.Z., N.J., C.H. and M.X.; visualization, J.Z. and M.X.; supervision, N.J. and C.H.; project administration, N.J.; funding acquisition, N.J. All authors have read and agreed to the published version of the manuscript.
Figure 1.
Feature Dataset Construction Pipeline.
Figure 1.
Feature Dataset Construction Pipeline.
Figure 2.
Typical anomalies in landslide monitoring data: (a) Jump anomaly caused by nearby disturbance, manifested as an abrupt deviation of crack-width observations from the local variation trend. (b) Continuous missing data caused by equipment malfunction; the shaded region indicates the period of observation interruption.
Figure 2.
Typical anomalies in landslide monitoring data: (a) Jump anomaly caused by nearby disturbance, manifested as an abrupt deviation of crack-width observations from the local variation trend. (b) Continuous missing data caused by equipment malfunction; the shaded region indicates the period of observation interruption.
Figure 3.
Performance comparison of outlier detection algorithms on representative landslide monitoring time series. Precision, recall, and F1-score were calculated against manually checked anomaly labels. The Savitzky–Golay residual method achieved the highest F1-score (0.81), indicating the best balance between missed detections and false positives for transient jump anomalies.
Figure 3.
Performance comparison of outlier detection algorithms on representative landslide monitoring time series. Precision, recall, and F1-score were calculated against manually checked anomaly labels. The Savitzky–Golay residual method achieved the highest F1-score (0.81), indicating the best balance between missed detections and false positives for transient jump anomalies.
Figure 4.
Comparison of anomaly detection results obtained by the four representative methods: Cross markers denote the anomalies identified by each method. Among the four methods, the Savitzky–Golay Filter Residual Method achieved the best overall performance, identifying 24 anomaly points with an F1-score of 0.81. The Moving Average method detected 37 anomalies but produced 16 false positives, yielding an F1-score of 0.62. The Savitzky–Golay method showed relatively few false detections but missed more true anomalies, with an F1-score of 0.60. Isolation Forest exhibited both frequent missed detections and false detections, resulting in the lowest performance among the four methods (F1 = 0.11).
Figure 4.
Comparison of anomaly detection results obtained by the four representative methods: Cross markers denote the anomalies identified by each method. Among the four methods, the Savitzky–Golay Filter Residual Method achieved the best overall performance, identifying 24 anomaly points with an F1-score of 0.81. The Moving Average method detected 37 anomalies but produced 16 false positives, yielding an F1-score of 0.62. The Savitzky–Golay method showed relatively few false detections but missed more true anomalies, with an F1-score of 0.60. Isolation Forest exhibited both frequent missed detections and false detections, resulting in the lowest performance among the four methods (F1 = 0.11).
Figure 5.
Comparison of denoising results obtained by eight filtering methods on the same displacement time series: All methods reduce high-frequency fluctuations and local abnormal spikes to different extents, but they differ markedly in trend preservation, local fluctuation retention, and smoothing strength. Median + Savitzky–Golay, Savitzky–Golay, Median-Butterworth, and EMD maintain high consistency with the main trend of the original sequence while preserving local variations. Among them, EMD shows the highest agreement with the original curve, indicating a better balance between noise suppression and detail preservation. By contrast, Moving Average, Kalman, and Wavelet produce stronger smoothing effects, whereas Trend Fitting mainly captures the long-term evolution trend but responds less effectively to short-term fluctuations and local anomalies.
Figure 5.
Comparison of denoising results obtained by eight filtering methods on the same displacement time series: All methods reduce high-frequency fluctuations and local abnormal spikes to different extents, but they differ markedly in trend preservation, local fluctuation retention, and smoothing strength. Median + Savitzky–Golay, Savitzky–Golay, Median-Butterworth, and EMD maintain high consistency with the main trend of the original sequence while preserving local variations. Among them, EMD shows the highest agreement with the original curve, indicating a better balance between noise suppression and detail preservation. By contrast, Moving Average, Kalman, and Wavelet produce stronger smoothing effects, whereas Trend Fitting mainly captures the long-term evolution trend but responds less effectively to short-term fluctuations and local anomalies.
Figure 6.
Performance Comparison of Different Noise Filtering Methods.
Figure 6.
Performance Comparison of Different Noise Filtering Methods.
Figure 7.
Feature contribution in displacement prediction. The bars summarize normalized F-scores and average correlations between candidate physics-informed features and the GNSS displacement target; higher values indicate stronger explanatory contribution to the displacement-prediction task.
Figure 7.
Feature contribution in displacement prediction. The bars summarize normalized F-scores and average correlations between candidate physics-informed features and the GNSS displacement target; higher values indicate stronger explanatory contribution to the displacement-prediction task.
Figure 8.
Computational flowchart of the dynamic stability index evolution model.
Figure 8.
Computational flowchart of the dynamic stability index evolution model.
Figure 9.
Computational Flowchart of the Time-Dependent Parameter Creep Constitutive Model.
Figure 9.
Computational Flowchart of the Time-Dependent Parameter Creep Constitutive Model.
Figure 10.
Neural Network Architecture Diagram with Loss Function.
Figure 10.
Neural Network Architecture Diagram with Loss Function.
Figure 11.
Architecture of the Phys-LSTM model, including masked input encoding, physics-informed feature extraction, LSTM encoder–decoder forecasting, physics-guided attention, and physics-consistency loss.
Figure 11.
Architecture of the Phys-LSTM model, including masked input encoding, physics-informed feature extraction, LSTM encoder–decoder forecasting, physics-guided attention, and physics-consistency loss.
Figure 12.
Three-Stage Model Training and Prediction Strategy.
Figure 12.
Three-Stage Model Training and Prediction Strategy.
Figure 13.
Evolution of the adaptive physics-constraint weight across the three-stage online learning process.
Figure 13.
Evolution of the adaptive physics-constraint weight across the three-stage online learning process.
Figure 14.
Online Learning Workflow of Phys-LSTM.
Figure 14.
Online Learning Workflow of Phys-LSTM.
Figure 15.
Comparative 3D Visualization of Multi-Metric Performance for Different Models.
Figure 15.
Comparative 3D Visualization of Multi-Metric Performance for Different Models.
Figure 16.
Huangcaoping deformation mass landslide. (a) Landslide zoning and monitoring point arrangement, with a lower-right DEM inset showing the regional position of the HCP landslide and Chengdu in Sichuan Province; (b) A–B geological profile.
Figure 16.
Huangcaoping deformation mass landslide. (a) Landslide zoning and monitoring point arrangement, with a lower-right DEM inset showing the regional position of the HCP landslide and Chengdu in Sichuan Province; (b) A–B geological profile.
Figure 17.
Comparison Chart of HP02 Landslide Displacement Prediction. (a) The July 2017 deformation event, (b) The September 2017 Deformation Event, (c) The July 2018 deformation event, (d) The deformation event in August 2019, and (e) The September 2020 Deformation Event.
Figure 17.
Comparison Chart of HP02 Landslide Displacement Prediction. (a) The July 2017 deformation event, (b) The September 2017 Deformation Event, (c) The July 2018 deformation event, (d) The deformation event in August 2019, and (e) The September 2020 Deformation Event.
Table 1.
Multi-source monitoring data details.
Table 1.
Multi-source monitoring data details.
| Data Category | Monitoring Equipment | Monitoring Parameter (Unit) | Physical Significance | Sampling Frequency | Data Volume (Records) |
|---|
| Surface Displacement | GNSS | 3D Displacement (mm) | Direct representation of macroscopic deformation of the landslide body | once per day to once per hour | 1,899,424 |
| Deep Deformation | Micro-pile inclinometer/inclinometer | Inclination Angle (°), Deep Displacement (mm) | Indication of slip zone location and deep shear deformation | once per day to once per hour | 639,727 |
| Crack Dynamics | Crack Gauge | Crack Width (mm) | Quantitative indicator of surface tensile/shear deformation | once per day to once per second | 15,808,074 |
| Rainfall | Rain Gauge | Cumulative Rainfall (mm), Intensity (mm/h) | Primary triggering factor for landslides | once per day to once per hour | 8,801,127 |
| Groundwater Level | Water Level Gauge | Water Level Elevation (m), Pressure (kPa) | Key parameter for seepage-stress field coupling | once per hour | 4831 |
Table 2.
Common Core Characteristics.
Table 2.
Common Core Characteristics.
| Feature Category | Data Source Equipment | Physical Significance | Relevance to Landslide Evolution |
|---|
| Core Displacement Features | GNSS Displacement Monitor | Three-dimensional surface displacement of landslide (N, E, U) | Directly characterizes the surface deformation state of the landslide body, serving as the core target and key input for model prediction. |
| Surface Deformation Features | Crack Gauge | Crack opening/closure width, dislocation amount | Indicates the expansion of surface tension/shear zones, serving as a sensitive indicator of impending landslide. |
| Triggering Factor Features | Rain Gauge | Real-time rainfall, cumulative rainfall | Main external factors triggering/accelerating landslide deformation, affecting pore water pressure and soil strength of landslide mass. |
| Deep Displacement Features | Micro-pile inclinometer | Inclination angle or displacement at different depths in the borehole | Reflects the location of the slip zone and deep deformation mechanism, providing direct evidence of slip surface formation. |
| Environmental Auxiliary Features | (Optional) Soil Moisture Content/Pore Water Pressure Sensor | Soil volumetric water content, matric suction, or pore water pressure | Reflects hydrological response state, closely related to rainfall infiltration and slip zone softening processes. |
Table 3.
Physics-Informed Feature System.
Table 3.
Physics-Informed Feature System.
| Feature Category | Feature Name | Calculation Method/Formula | Physical Meaning | Data Source |
|---|
| Temporal Decomposition Features | Displacement Trend Strength | Trend component T(t) from STL decomposition | Long-term creep trend of the landslide, reflecting macroscopic deformation driven by gravity | GNSS |
| | Displacement Periodic Oscillation | Periodic component S(t) from STL decomposition | Response to periodic external loads (e.g., seasonal rainfall) | GNSS, Rain gauge, Water level gauge |
| Differential Dynamics Features | Displacement Rate | (First-order differentiation) | Instantaneous velocity of landslide movement, identifying the accelerated creep stage | GNSS |
| | Creep Acceleration | (Second-order differentiation) | Key indicator for accelerated creep, used for early warning and critical state judgment | GNSS |
| Cross-Coupling Features | Rainfall-Displacement Response Ratio | (∆P: Rainfall increment) | Quantifies the triggering efficiency of rainfall infiltration on displacement rate | Crack gauge, GNSS |
| | Crack-Displacement Synergy Coefficient | | Spatial correlation between surface rupture and deep-seated movement | Rain gauge, GNSS |
Table 4.
Overview of Experimental Models.
Table 4.
Overview of Experimental Models.
| Model Category | Model Name | Core Characteristics |
|---|
| Classical Statistical Model | SARIMA | Excels in capturing linear trends and seasonal patterns |
| Traditional Machine Learning | XGBoost | Gradient boosting decision-tree regression using flattened 30-step historical windows, squared-error objective, early stopping, and L2 regularization. |
| Deep Learning Sequence Model | LSTM | Standard Long Short-Term Memory network without physical constraints |
| GRU | A lightweight variant of LSTM with higher computational efficiency |
| Transformer | Modern sequence model based on the self-attention mechanism |
| Proposed Model in This Study | Phys-LSTM | Hybrid LSTM model enhanced by physical equations |
Table 5.
Key hyperparameter settings of the compared models. Common parameters define the unified input-output protocol. For XGBoost, reg_lambda denotes the L2 regularization coefficient. For Phys-LSTM, temporal_decay_rate denotes the temporal attention decay coefficient, target_weight_ratio denotes the feature-attention emphasis assigned to GNSS target features, and L2 denotes kernel regularization.
Table 5.
Key hyperparameter settings of the compared models. Common parameters define the unified input-output protocol. For XGBoost, reg_lambda denotes the L2 regularization coefficient. For Phys-LSTM, temporal_decay_rate denotes the temporal attention decay coefficient, target_weight_ratio denotes the feature-attention emphasis assigned to GNSS target features, and L2 denotes kernel regularization.
| Model | Key Hyperparameters | Value |
|---|
| Common | Historical input window length/Forecasting horizon | 30/5 |
| Common | Training/validation split | 80%/20% |
| SARIMA | Order selection | auto_arima |
| XGBoost | n_estimators, max_depth, learning_rate, objective, reg_lambda | 800, 6, 0.03, squared error, 1.0 |
| LSTM | Hidden units, dropout, learning rate, training control | 128-64, 0.2, adaptive LR, early stopping |
| GRU | Hidden units, dropout, learning rate, training control | 128-64, 0.2, adaptive LR, early stopping |
| Transformer | Heads, key dimension, dropout | 4, 32, 0.2 |
| Phys-LSTM | Encoder hidden units, decoder hidden units, dropout, L2, optimizer, learning rate, batch size, epochs, temporal_decay_rate, target_weight_ratio | 128, 64, 0.3, 0.001, Adam, 5 × 10−4 with warm-up/cosine decay, 512, 100–300 with early stopping, 0.95, 4.0 |
Table 6.
Geological and Monitoring Background.
Table 6.
Geological and Monitoring Background.
| Landslide | Geological and Geomorphological Setting | Deformation/Structural Type | Monitoring Background |
|---|
| Shibanwo | Upper-steep and lower-gentle slope; Quaternary residual/colluvial deposits over mudstone and sandstone | Cover-material landslide developed above weak bedrock | 3 GNSS sensors, 2 acceleration sensors, 2 tiltmeters and 1 rain gauge |
| Pengjiawan | Broken-line slope; loose cover deposits over weathered phyllite | Shallow accumulation-layer landslide sliding along a weak cover-bedrock contact | 4 GNSS sensors, 1 deep-displacement sensor, 1 rain gauge and 1 soil-moisture sensor |
| Caiyuanzi | Steep bedding slope; shale, sandstone, and Quaternary colluvial gravelly soil | Bedding-controlled landslide with a clear slip surface along weak structural interfaces | 3 GNSS sensors and 1 rain gauge |
| Guoluojiao | Gently descending slope; Quaternary loose deposits over bedrock | Cover-material landslide sliding along the cover-bedrock interface | 1 crack gauge, 6 GNSS sensors, 1 acceleration sensor, 1 tiltmeter and 2 rain gauges |
| Houshan | Relatively straight but locally undulating slope; gravelly soil cover over phyllite bedrock | Large accumulation-layer landslide deforming along the cover-bedrock interface | 4 crack gauges, 2 GNSS sensors, 2 acceleration sensors, 2 tiltmeters, 1 rain gauge |
| Dashuzi | Middle-mountain slope; Holocene gravelly soil over phyllite; stepped topography | Soil landslide with upper-steep and lower-gentle morphology, influenced by rainfall and human activities | 1 crack gauge, 5 GNSS sensors and 1 rain gauge |
Table 7.
Standard Deviation of RMSE for Various Models Across Different Landslide Masses.
Table 7.
Standard Deviation of RMSE for Various Models Across Different Landslide Masses.
| Model | Shibanwo Landslide | Pengjiawan Landslide | Caiyuanzi Landslide | Guoluojiao Landslide | Houshan Landslide | Dashuzi Landslide | Standard Deviation |
|---|
| SARIMA | 2.16 | 0.48 | 1.37 | 1.02 | 1.38 | 6.63 | 2.25 |
| XGBoost | 4.16 | 3.37 | 4.23 | 4.31 | 4.41 | 16.12 | 4.92 |
| LSTM | 0.76 | 29.02 | 0.72 | 0.90 | 1.23 | 12.27 | 11.49 |
| GRU | 0.78 | 32.23 | 0.57 | 0.64 | 1.22 | 13.53 | 12.84 |
| Transformer | 1.57 | 77.79 | 1.18 | 3.30 | 1.47 | 34.32 | 31.18 |
| Phys-LSTM | 2.13 | 0.49 | 1.36 | 1.01 | 1.38 | 4.29 | 1.34 |