Next Article in Journal
Revealing the Impact of Urban Morphology Evolution on the Urban Heat Island Effect in the Main Urban Area of Guangzhou: Insights from Local Climate Zones
Previous Article in Journal
Multi-Task Mixture-of-Experts Model for Underwater Target Localization and Recognition
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Missing Data Imputation Method for Waste Dump Landslide Deformation Monitoring Based on a Seq2Seq LSTM–Posterior Correction Model

1
State Key Laboratory of Deep Earth Exploration and Imaging, College of Construction Engineering, Jilin University, Changchun 130026, China
2
School of Earth Sciences and Engineering, Hohai University, Nanjing 210098, China
*
Author to whom correspondence should be addressed.
Remote Sens. 2025, 17(17), 2962; https://doi.org/10.3390/rs17172962
Submission received: 19 July 2025 / Revised: 21 August 2025 / Accepted: 23 August 2025 / Published: 26 August 2025
(This article belongs to the Topic Remote Sensing and Geological Disasters)

Abstract

Surface deformation monitoring is essential for controlling instability processes such as urban infrastructure deformation, mining-induced subsidence, and landslide deformation. However, missing data often disrupt the continuity of the various deformation time series and compromise the reliability of monitoring results. This issue is particularly critical in long-term landslide studies, where conventional missing data imputation methods often neglect the nonlinear characteristics of slope deformation and fail to account for external influences under complex environmental conditions. To address these limitations, this study proposes a deep learning-based imputation method that integrates multi-source monitoring data. A Seq2Seq LSTM (sequence-to-sequence long short-term memory) model is constructed to reconstruct missing deformation values, and a posterior correction module is integrated to optimize the preliminary outputs, thereby enhancing imputation accuracy. The proposed approach is validated using a case study of the southern dump slope landslide at the Hesigewula South Open-Pit Coal Mine in Inner Mongolia, China. Experimental results on the test set demonstrate that the Seq2Seq LSTM–Posterior Correction model significantly outperforms traditional methods such as linear regression and baseline LSTM models. This method offers an effective solution to data gaps in landslide deformation monitoring, demonstrating strong potential for accurate nonlinear imputation in complex environments and providing a practical approach for long-term InSAR-based landslide studies in areas affected by missing SAR data.

1. Introduction

Natural resource exploration and exploitation form a critical foundation for human development and industrial progress [1]. In recent years, rising energy demand has intensified mineral resource extraction, with open-pit mining adopted as a primary method for coal production worldwide due to its high production efficiency, low operating costs, and efficient resource recovery [2]. This trend has resulted in a significant increase in both the number and distribution of open-pit mine slopes. On 22 February 2023, a disastrous landslide at the Xinjing Open-Pit Coal Mine in Inner Mongolia, China, resulted in 53 fatalities and significant infrastructure damage [3]. Such events highlight the critical importance of effective slope deformation monitoring to support early warning and risk mitigation in open-pit mining areas. Landslide deformation monitoring provides a critical foundation for forecasting, early warning, and disaster risk reduction [4]. Remote sensing technologies such as InSAR (Interferometric Synthetic Aperture Radar), GNSS (Global Navigation Satellite System), and UAV (Unmanned Aerial Vehicle) photogrammetry have been extensively applied and are now regarded as indispensable tools for landslide surveillance and geological hazard assessment [5,6,7,8,9,10]. These techniques enable the effective detection of surface displacement and facilitate the analysis of spatiotemporal evolution patterns, susceptibility zones, and rainfall-induced triggering mechanisms [11,12,13,14,15]. However, InSAR-based landslide monitoring is subject to substantial limitations. Influenced by both external and internal factors—such as phase decorrelation, atmospheric disturbances, and sensor malfunctions—InSAR time-series data often suffer from frequent data gaps. These missing values significantly compromise the continuity and interpretability of long-term landslide deformation monitoring, thereby constraining the advancement of landslide-related research [16,17,18].
Currently, mainstream approaches to imputing missing data can be broadly classified into three categories: (1) linear interpolation and conventional statistical time-series models such as ARIMA and spline interpolation; (2) machine learning algorithms including KNN (k-nearest neighbors) and random forests; and (3) deep learning architectures such as LSTM (long short-term memory) and transformer networks [19,20,21]. Although linear interpolation and conventional statistical time-series models offer computational efficiency and ease of implementation, they are generally limited in capturing dynamic patterns over extended gap periods. As such, they are more appropriate for handling short-term, high-frequency data gaps [22]. Machine learning models, characterized by structural transparency, fast training speed, and strong interpretability, are well-suited for imputing missing data over short- to medium-term time scales. However, their performance in more complex tasks is often limited by their insufficient capacity for modeling temporal dynamics [23]. Deep learning models demonstrate strong capabilities in capturing nonlinear patterns in time-series data. Architectures such as LSTM, CNN (Convolutional Neural Network), and RNN (Recurrent Neural Network) have been widely applied to tasks including time-series forecasting and missing data imputation [24,25,26] and have exhibited robust performance in various domains such as traffic flow prediction [27], power load estimation [28], and rainfall–runoff modeling [29]. Although shallow deep learning architectures are effective in capturing nonlinear trends, their structural simplicity often limits their effectiveness in modeling complex systems such as landslide deformation. Models with more complex architectures, such as Seq2Seq LSTM (sequence-to-sequence long short-term memory) [30], STGCN (Spatio-Temporal Graph Convolutional Network) [31], and SAITS (Self-Attention-based Imputation for Time Series) [32], have demonstrated superior performance in both missing data imputation and time-series forecasting. These models not only better capture nonlinear variation patterns but also significantly outperform their shallow counterparts when applied to tasks involving complex systems such as landslide deformation [33]. Li, M. et al. proposed a landslide multivariate displacement prediction method based on Seq2Seq LSTM, which exhibited enhanced capability in capturing local peaks and produced prediction curves that better matched the actual deformation patterns [34].
In landslide deformation data imputation, long-term dependencies—such as the influence of historical deformation on missing segments—are critically important. However, due to gradient-related issues, encoders often struggle to capture such dependencies, leading to poor retention of early sequence information and excessive sensitivity to more recent inputs [35]. In the absence of ground-truth constraints, error accumulation can lead to severe distribution shift, resulting in a dramatic decline in imputation accuracy [36,37,38]. Integrating deep learning with posterior correction offers an effective solution to the distribution shift caused by error accumulation. This multi-method fusion strategy combines the nonlinear modeling capacity of complex deep neural networks with the error suppression and trend alignment advantages of posterior correction [39], thereby enhancing the effectiveness of missing data imputation. Cascade-based approaches that model and correct the prediction errors of LSTM outputs have been successfully employed in time-series forecasting, resulting in significant improvements in prediction accuracy [40]. The integrated model conducts imputation of missing values in time-series monitoring data through a three-stage framework comprising data preprocessing, sequence learning–based reconstruction, and posterior correction. It exhibits strong capability in trend representation learning and delivers high imputation accuracy. In this framework, the Seq2Seq LSTM module captures nonlinear deformation trends and generates preliminary imputation results. The posterior correction module adjusts for accumulated errors through residual modeling, improving the accuracy of long-term landslide deformation reconstruction.
In this case, studying the southern dump slope landslide of the Hesigewula South Open-Pit Coal Mine in Inner Mongolia, we address an extensive data gap between December 2021 and March 2023. A Seq2Seq LSTM–Posterior Correction model is then employed for missing data imputation by fusing Sentinel-1B imagery from the European Space Agency with GNSS observations supplied by the mine operator. Compared with conventional imputation methods, our approach demonstrates superior capability in capturing complex nonlinear relationships between external factors and landslide kinematics. In comparative experiments involving multiple methods, the proposed approach achieved lower MSE (mean squared error) and MAE (Mean Absolute Error), along with higher PCC (Pearson Correlation Coefficients) on the test set, highlighting its superior performance in missing data imputation. The consistency between field-based engineering geological investigations and the imputed results further supports the reliability of the proposed method. This study offers a feasible solution for enabling long-term landslide deformation monitoring in areas affected by SAR data gaps.

2. Study Area

The Hesigewula South Open-Pit Coal Mine, as shown in Figure 1, is located in Xilingol League, Inner Mongolia Autonomous Region, along the southern ridge of the Greater Khingan Mountains, where marshland geomorphology is widely developed. The regional stratigraphy is primarily composed of the Upper Jurassic Manketouebo, Manitu, and Baiyingaolao Formations, the Lower Cretaceous Damoguaihe Formation, and Quaternary deposits. The regional tectonic structure is characterized by an asymmetrical, broad syncline, with a gently dipping western limb and a steeply dipping eastern limb. The region exhibits a typical continental climate, with annual temperature variations ranging from −42.1 °C in winter to 35.3 °C in summer. Most precipitation falls between June and August, accounting for approximately 70% of the yearly total.
The southern dump slope landslide at the Hesigewula South Open-Pit Coal Mine has a height of 69 m and a slope angle of 5.93°. The landslide extends from an altitude of approximately 915 m at the toe to 980 m at the crown. The landslide mass is primarily composed of waste soil dumped above the bedrock. The material is loosely structured with low cohesion and contains mudstone that undergoes a rapid loss of strength upon water infiltration, resulting in overall low slope stability. Historically, multiple minor deformation events have occurred at the southern dump slope landslide during the thawing period from April to May and in the months following heavy rainfall, particularly from August to October.

3. Data and Methods

In this study, we propose a Seq2Seq LSTM–Posterior Correction model for imputing missing data in long-term SAR deformation time series. By integrating Sentinel-1 SAR imagery with GNSS displacement measurements, the model leverages multi-source monitoring data to achieve high-accuracy reconstruction of deformation signals. The overall modeling workflow is shown in Figure 2.

3.1. Data

The dataset used in this study includes 79 descending C-band SAR scenes acquired by the European Space Agency’s Sentinel-1B satellite from 15 April 2018 to 7 December 2021. The Copernicus DEM with a spatial resolution of 8 m was employed for image co-registration and topographic phase removal. In addition, three-dimensional displacement data from 21 March to 17 May 2023 were obtained from nine GNSS stations deployed across the southern dump slope landslide area (Figure 3a), and the vertical displacement component was used in this study for analysis.
The SAR dataset for this area ends on 7 December 2021, due to the decommissioning of the Sentinel-1B satellite, resulting in a series of extended data gaps from December 2021 to March 2023. This interval severely compromises the continuity and reliability of a long-term deformation analysis for the landslide.

3.2. InSAR Processing

The SBAS-InSAR technique was applied to multi-temporal SAR imagery under small baseline constraints. Redundant differential interferograms were generated and analyzed using time series methods to effectively separate atmospheric delays, residual topography, and noise. This process enabled millimeter-level retrieval of surface deformation [41].

3.3. Data Preprocessing

The core processing steps include geocoding, image co-registration with a master image, differential interferometry, adaptive filtering, minimum-cost flow phase unwrapping, GACOS atmospheric correction, and phase-to-deformation conversion, followed by final geocoding [42]. This technique is particularly well-suited for time-series deformation monitoring. The use of redundant interferograms helps suppress errors and increases temporal sampling, resulting in high-density surface deformation measurements.
To ensure that the model effectively captures temporal patterns and to reduce interference from long-term trends, the original InSAR deformation time series was subjected to a series of preprocessing steps prior to imputation.

3.3.1. Uniformization of the Time Series Step Length

We performed temporal alignment and Gaussian interpolation on the original InSAR deformation series to unify the time-step intervals and improve the sequence continuity. This step was necessary to accommodate the input requirements of the Seq2Seq–Posterior Correction model, which relies on evenly sampled time series for effective learning.
A unified time index set, T = {t1, t2, …, tn}, was constructed to resample the original time series of each monitoring point to a consistent daily temporal resolution. For any time point tk with missing observations, the corresponding value was estimated using a temporal linear regression approach as follows:
x ^ t k = x t i + x t j x t i t j t i ( t k t i ) , t i < t k < t j
where x t i and x t j denote the nearest available observations before and after the missing time point, respectively. To avoid residual gaps at the tail of the time series following the regression-based interpolation, a forward-filling strategy was additionally implemented.

3.3.2. Trend Decomposition and Normalization

After interpolation, we conducted trend decomposition and normalization to further preprocess the InSAR time series data. To isolate the underlying trend, we applied linear regression to the training sequences and decomposed the original series Xt into a long-term trend component Tt and a short-term residual component Rt, as shown below:
X t   =   T t   +   R t
where Tt is estimated through linear fitting, and Rt is further normalized as follows:
R ˜ t = R t μ σ
The normalized residual series was segmented into paired input–output samples (X, Y) using a sliding window strategy. Upon the completion of model training, we restored the predicted deformation values to their original physical scale through inverse normalization and trend reconstruction, yielding the final reconstructed deformation time series.

3.4. Imputation Model for Landslide Deformation Monitoring Data

Given the frequent occurrence of missing data within the InSAR deformation time series in long-term landslide deformation monitoring, we propose a hybrid missing data imputation model that integrates a sequence-to-sequence (Seq2Seq) long short-term memory (LSTM) network with a posterior correction module based on trend inversion and residual adjustment. The proposed model performs missing data imputation for time-series monitoring data through a three-stage framework consisting of trend decomposition and normalization, sequence-based imputation, and posterior trend correction. It leverages multi-source monitoring data to reconstruct complete historical deformation records and demonstrates strong capabilities in trend learning and imputation accuracy.

3.4.1. Seq2Seq LSTM Module

To enable multi-step missing data imputation while preserving temporal continuity in the deformation time series, we adopt a Seq2Seq architecture based on long short-term memory networks [43].
The overall model consists of three main components. First, the encoder processes a normalized input sequence of dimension D through a stack of L LSTM layers, extracting temporal features and encoding them into the final hidden and cell states (hT, cT). Second, the decoder initializes its internal state using these encoded vectors and iteratively generates the target sequence in an autoregressive manner. Finally, a linear output layer maps the decoder outputs at each time step back to the original feature space. To enhance training stability and accelerate convergence, we employ teacher forcing [44] with a scheduled sampling mechanism. This approach enables a smooth transition from fully supervised learning to autoregressive inference during training. The teacher forcing ratio r decayed exponentially over training epochs, following the recursive formulation below:
r t + 1 = max ( r t γ , r min ) , γ = 0.95
This mechanism initially relies on InSAR deformation results to stabilize training and gradually transitions toward autonomous sequence generation—facilitating effective imputation without external guidance.
The model incorporates a multi-objective loss function during training, aiming to maintain continuity between imputed and historical segments by balancing missing data imputation accuracy and boundary smoothness. The total loss is defined as follows:
L total = L main + λ L cont
Specifically, L main represents the MSE between the imputed and ground truth sequences, while L cont introduces a penalty for discontinuity between the final time step of the input sequence and the initial point of the imputed segment. The balancing coefficient λ is empirically set to balance the two terms. This loss design effectively alleviates boundary artifacts and enhances consistency in both trend and amplitude between the imputed and original sequences. The AdamW optimizer is employed for model training, with early stopping (patience = 5) applied to avoid overfitting. The dataset was split chronologically into training, validation, and test sets in an 8:1:1 ratio.

3.4.2. Posterior Correction Module

Although the Seq2Seq LSTM model can effectively capture temporal trends in InSAR deformation time series, it inevitably suffers from error accumulation during extended imputations, often resulting in trend drift and deviations from the InSAR deformation time series. To improve the trend fidelity of the imputed segments, we introduce GNSS displacement records as external references and develop a posterior correction strategy based on deformation rate differences. Specifically, linear regression is applied over a time window preceding the imputation start point to estimate the slope of the actual deformation trend from GNSS data, which is then used to guide the correction of the predicted trajectory.
Let y ^ t denote the imputed InSAR time series and gt the GNSS displacement series. To quantify trend deviation, we perform linear regression on the GNSS data over a Tm-day window preceding the imputation start to estimate the reference slope βGNSS. Similarly, the slope of the imputed segment, βimp, is obtained by fitting the tail of the predicted sequence. The trend offset is then defined as the difference between these two slopes:
Δ β = β GNSS β imp
A linear correction term is then applied to each time step t at the tail of the imputed segment to realign its trend with the reference slope:
y ˜ t = y ^ t + Δ β t t s t e t s η
The variables tₛ and tₑ represent the start and end of the imputed interval, respectively, while η  (0, 1) is a smoothing coefficient that governs the strength of correction. The adjustment is applied in a linearly increasing manner, allowing the imputed sequence to gradually align with the GNSS-derived trend. This progressive correction effectively mitigates cumulative prediction errors and suppresses trend drift over time.

3.5. Evaluation Metrics for Model Performance

To evaluate the model’s missing data imputation performance, we applied three commonly used error metrics on the test set: MSE, MAE, and PCC. MSE and MAE quantify the deviation between the imputed deformation and the original InSAR deformation time series, which are defined as follows:
MSE = 1 n i = 1 n y ^ i y i 2
MAE = 1 n i = 1 n y ^ i y i
The PCC is used to assess the linear relationship between the imputed values and the original InSAR deformation time series, with a particular focus on capturing overall trends. The PCC is defined as follows:
PCC = i = 1 n ( y ^ i y ^ ¯ ) ( y i y ¯ ) i = 1 n ( y ^ i y ^ ¯ ) 2 i = 1 n ( y i y ¯ ) 2
The terms y ^ i and y ¯ represent the mean values of the imputed results and the original InSAR deformation time series, respectively. All metrics are computed independently at each monitoring point, and the results are organized into structured tables to facilitate further comparative analysis and error visualization.

4. Results

4.1. SBAS-InSAR Deformation Results

Following InSAR processing, the results were decomposed, and the vertical displacement component was used for subsequent analysis. Consequently, we obtained the average deformation rate and cumulative displacement of the landslide in the vertical direction from 15 April 2018 to 7 December 2021, which were subsequently used for time-series deformation analysis.
Figure 3a illustrates the spatial distribution of average deformation rates for the southern dump slope landslide, with vertical surface displacements ranging from −47 mm/year to 28 mm/year. The head of the landslide exhibits relatively high subsidence rates, while most of the central area shows a slow downward movement. Notably, uplift is observed at the boundary between the head and central area, as well as at the toe. The surface fractures and displaced drainage infrastructure identified at the landslide head during field surveys (Figure 3b,c) correspond well with the high deformation rates indicated by the InSAR analysis, thereby corroborating the reliability of the InSAR results.
To capture both regional and site-specific deformation dynamics, we utilized six cumulative InSAR deformation maps acquired between 2018 and 2021 (Figure 4), alongside three representative monitoring points—P078, P117, and P477—located at the landslide’s toe, central body, and head scarp, respectively (Figure 5). These locations, each coinciding with a GNSS station (NP03, NC32, and NC30), were examined across the full observation span from 15 April 2018 to 17 May 2023 to characterize long-term spatiotemporal deformation patterns. The results indicate that the Hesigewula southern dump slope landslide has undergone sustained deformation over the observation period, with both displacement magnitude and spatial extent progressively increasing year by year. The displacement magnitude in the subsidence direction is greater at the main scarp of the landslide, while most areas of the central body exhibit smaller deformations. A slight uplift is observed in some localized zones within the main body and at the toe.
Representative monitoring points P078 and P117 exhibited a relatively steady subsidence trend from April to October 2018, with a low and nearly linear deformation rate, indicative of slow, uniform movement. Between October 2018 and November 2019, displacement fluctuated but remained limited in magnitude, suggesting a state of quasi-stability. In November 2019, deformation accelerated noticeably, followed by a gradual reduction in displacement rate through April 2021, transitioning into a slower but persistent subsidence phase. On 8 May 2023, during the GNSS monitoring period, P117 experienced a sharp displacement surge, indicating the occurrence of a landslide event accompanied by significant structural failure at the main scarp. Meanwhile, P078 showed no significant displacement during the same period. The representative monitoring point P477 exhibited relatively minor deformation during the InSAR monitoring period, following a slow and near-linear subsidence trend. On 15 May 2023, during the GNSS monitoring phase, a moderate uplift was detected.
Figure 5 shows that the deformation at the three representative points is temporally correlated with external factors. During the 2018–2021 InSAR monitoring period, deformation acceleration typically occurs 3–10 days after the annual maximum 12-day average rainfall, indicating a strong coupling between landslide activity and peak precipitation events. Similarly, around April of each year, when surface temperatures rise and remain above 0 °C—marking the thawing period—the deformation rates at the monitoring points also increase steadily, often producing secondary displacement peaks in certain years.

4.2. Missing Data Imputation in Landslide Deformation Monitoring

Following the preprocessing of the original InSAR deformation time series as outlined in the methodology, the Seq2Seq LSTM–Posterior Correction model was applied to reconstruct the missing deformation intervals.
Specifically, the model is configured to take a 90-day historical deformation sequence as input and predict a 30-day missing segment. During training, input–output pairs are generated using a sliding window strategy, ensuring complete coverage of the available time series for learning. Table 1 summarizes the detailed parameter settings of the model. The training was conducted using the AdamW optimizer, and an L2 regularization term was incorporated to mitigate overfitting. The loss function consists of two components: a primary loss, defined as the mean squared error between the imputed and InSAR deformation time series, and a continuity constraint term that penalizes the deviation between the first point of the filled segment and the last point of the observed sequence. This design effectively addresses the boundary shift issue introduced by the sliding window approach.
After training, the model performed missing data imputation on the missing segments using a sliding window approach. Each output was restored to the original scale through trend reconstruction and inverse normalization. Following the initial missing data imputation, the results were further refined through a posterior correction procedure, which leveraged short-term deformation trends extracted from GNSS monitoring data. By enforcing trend consistency and smoothing the transition across the gap period, this step ensured that the reconstructed time series maintained temporal continuity and remained aligned with the physical deformation patterns observed in the GNSS data.
To enable a more comprehensive and visual assessment of the reconstructed long-term deformation, nine cumulative deformation surface maps were produced at 200-day intervals, covering the period from 1 November 2018 to 20 March 2023 (Figure 6). Meanwhile, the time series of representative monitoring points were completed by filling in the missing segments, thus providing a continuous deformation record for further analysis (Figure 7). Details of the missing data imputation (quarterly sampling for representative monitoring points) are provided in Appendix A (Table A1).
As shown in Figure 6, the reconstructed deformation field captures the full progression of the landslide process. During this period, the southern dump slope landslide experienced a cumulative subsidence of −234 mm and an uplift of up to 85.1 mm. Deformation initially emerged at the main scarp, gradually propagated through the main body, and eventually reached the toe. The most pronounced subsidence occurred near the boundary of the main scarp, while a small uplift zone was detected within the adjacent part of the main body. The majority of the toe experienced only slight deformation, while more noticeable uplift occurred along the toe boundary and at the junction between the toe and the main body. Considering the timing and intensity of deformation across different landslide zones, along with the observed surface morphology, the landslide can be classified as a slowly evolving translational landslide [45]. Initial movement occurred at the main scarp, exerting a downslope driving force on the main body and toe, which subsequently underwent progressive deformation. The uplift deformation observed at the junctions between different zones of the landslide may be attributed to compressive forces acting along the internal boundaries [46,47].
During the gap period in July and August 2022, the representative points entered an accelerated deformation phase several days after the heavy rainfall events. In April 2023, during the thaw period, deformation at the monitoring points accelerated rapidly, followed shortly by slope failure. Comparison between the imputed results and original InSAR monitoring indicates that the deformation behavior during the gap period closely mirrors the patterns recorded in the InSAR monitoring timeframe.
Representative monitoring points at different locations across the landslide body exhibited consistent deformation responses to external factors, highlighting the system’s sensitivity to such influences. Considering the deformation and failure mechanisms of landslides under intense rainfall and freeze–thaw conditions [48,49], we infer that (1) heavy rainfall increases pore water pressure through infiltration, which undermines the overall mechanical strength of the slope material and subsequently accelerates slope movement [50], and (2) freeze–thaw processes promote the development of tensile cracks within the slope, particularly in areas with poor surface drainage or heterogeneous distributions of ice-rich materials, further compromising slope stability [51].
Although the internal structure and material composition of the landslide fundamentally determine its deformation mechanisms, external factors often play a dominant role during critical stages, significantly influencing its development trajectory and deformation trends [52]. Coupling the imputed long-term deformation time series with rainfall and temperature data not only reveals the driving effects of external factors on landslide evolution but also confirms the effectiveness of the proposed missing data imputation approach in reconstructing missing deformation data.

5. Discussion

5.1. Model Evaluation for Missing Data Imputation

To evaluate the performance of different imputation methods for missing deformation data in the study area, we conducted a comparative analysis at three representative points—P078 (toe), P117 (main scarp), and P477 (main body) of the landslide. The methods assessed include linear regression, baseline LSTM, and the proposed Seq2Seq LSTM–Posterior Correction model. These three approaches, respectively, represent a conventional empirical method, a basic deep learning model, and the improved method proposed in this study—offering representative and meaningful comparisons.
Figure 8a–c clearly show that the linear regression method relies solely on extrapolating historical trends, making it inadequate for capturing the dynamic characteristics of deformation. It performs reasonably well only during periods of strong continuity in the displacement signal. The baseline LSTM model generally performs well in the early stages of the test period. However, as the time steps progress and deformation signals become increasingly variable, the single-layer architecture struggles to capture complex temporal patterns. In some cases, its performance in the later stages even falls behind that of the linear regression approach [53].
The Seq2Seq LSTM–Posterior Correction model consistently demonstrates superior performance across all three monitoring points, particularly during the later stages of the test period. It effectively captures the underlying deformation trends and exhibits significantly lower error accumulation over time compared to the other two methods.
To further quantify the performance of the three methods, we calculated the MSE, MAE, and PCC at each of the three representative monitoring points (Figure 9a–c). The results indicate that the Seq2Seq LSTM–Posterior Correction model exhibits a more concentrated error distribution with fewer outliers, demonstrating superior stability and generalization capability. In contrast, the linear regression and baseline LSTM models yield larger errors and more outliers at certain monitoring points. Although they show relatively good fits at specific locations, their overall accuracy and temporal consistency across the test period are inferior to those of the Seq2Seq LSTM–Posterior Correction model.
In summary, the Seq2Seq LSTM–Posterior Correction model demonstrates clear advantages in addressing missing data in landslide deformation monitoring. It effectively captures both local variability and overall trend patterns, making it well-suited for missing data imputation in nonlinear and complex time-series deformation scenarios.

5.2. Field Validation of Reconstruction Results

To further verify the accuracy of the imputed deformation data, we carried out a detailed field survey and UAV photogrammetry at the southern dump landslide of the Hesigewula South Open-Pit Coal Mine in May 2023. Based on integrated field and UAV data interpretation, the landslide was subdivided into five zones: Backscarp Source Zone, Compression Failure Zone, Transverse Sliding Zone, Central Slide Zone, and Frontal Rupture Zone.
In the Backscarp Source Zone (I), intense deformation has produced two stepped scarps at the backscarp, with lengths ranging from approximately 330 to 640 m and step heights between 0.3 and 15 m. A set of extensional cracks was observed up to approximately 70 m from the backscarp. Severe deformation is evident in this zone, where the downward displacement of two scarps imposed tensile stress on the upslope area. As a result, a series of arcuate transverse cracks developed behind the rear scarp and extended further upslope. In the Compressive Deformation Zone (II), pronounced compressional failure is observed. Due to the concave geometry of the sliding surface—gently inclined in the front and steep at the rear—the downslope movement of the Backscarp Source Zone exerts thrusting forces on the central slope, leading to extensive surface cracking and the development of two distinct uplifted zones (① and ②) within the mid-slope (Figure 10. II), with respective uplift heights of approximately 8 m and 10 m. The downslope movement in the Central Slide Zone exerted lateral tension on the adjacent Transverse Sliding Zone (III), inducing a horizontal sliding process across the entire area. A scarp has formed along the right boundary, accompanied by measurable displacements in both horizontal and vertical directions. In the Central Slide Zone (IV), the slope exhibits an overall translational movement driven by the displacement of the Backscarp Source Zone. Although the degree of deformation is relatively moderate, extensive surface cracking is observed throughout the area. A well-defined offset occurs along the left boundary, accompanied by echelon shear–tensile cracks. In the Frontal Rupture Zone (V), the ground surface is extensively fractured, indicating weak material properties. Uplift is evident at the interface with the Central Slide Zone due to compressional forces. Large-scale cracks are widely distributed, reflecting severe deformation and structural damage across the area.
The deformation features interpreted from field investigations—including the two-stepped scarps and localized uplift at zone boundaries—together with the spatiotemporal evolution they imply, closely correspond to the deformation trends reconstructed by the Seq2Seq LSTM–Posterior Correction model. Specifically, the landslide evolution interpreted from the reconstructed deformation time series shows strong spatial and kinematic agreement with the field-based interpretations in terms of both deformation intensity and spatial extent. The observed consistency underscores the accuracy of the Seq2Seq LSTM–Posterior Correction model in imputing missing deformation data across both time and space. The complete time series enables reliable identification of critical deformation areas and provides a faithful representation of the landslide developmental trajectory.

5.3. Limitations and Future Work

While the proposed Seq2Seq LSTM–Posterior Correction model shows strong performance in imputing missing data for dump-slope landslides in open-pit mining areas, its applicability to other geomorphic settings and deformation regimes remains to be systematically assessed. Different types of landslides exhibit significant variations in triggering mechanisms, failure processes, and internal structures [54,55], which may necessitate structural adaptations of the model or the use of more diverse training samples to effectively capture their respective deformation characteristics.
The model relies on long-term continuous monitoring data for training. Although trend decomposition and teacher forcing strategies have been incorporated to enhance temporal stability, their imputing performance may be compromised in cases of severe data loss or limited early-stage observations, thereby constraining their applicability under such conditions.
Comprehensive parameter uncertainty and sensitivity analyses for the posterior correction were not conducted due to data limitations and will be addressed in future work. The current model is entirely data-driven and does not incorporate the physical mechanisms underlying landslide initiation and evolution. Future work could integrate external influencing factors—such as rainfall, freeze–thaw cycles, and geotechnical structure—into the model as auxiliary variables via attention mechanisms [56], thereby enhancing its interpretability.

6. Conclusions

In this study, we integrated SBAS-InSAR and GNSS monitoring techniques to impute the data gaps (8 December 2021 to 20 March 2023) of the south dump landslide at the Hesigewula South Open-Pit Coal Mine, using the Seq2Seq LSTM–Posterior Correction model to enable long-term time-series analysis.
(1)
The Seq2Seq LSTM–Posterior Correction model first employs a residual-coupled Seq2Seq LSTM to extract nonlinear deformation features and impute the missing segments under continuity constraints. Subsequently, a posterior correction module based on deformation rate discrepancies was applied to align the imputed trend with GNSS-derived rates. This adjustment effectively reduced cumulative prediction errors over time and mitigated trend drift in the imputed sequences;
(2)
The SBAS–InSAR results and the imputed deformation data during the gap period exhibited strong consistency in their response to both heavy rainfall and freeze–thaw processes. The reconstructed long-term deformation series indicated a strong correlation between slope evolution and external climatic factors, particularly precipitation and freeze–thaw cycles. We infer that intense rainfall gradually increases pore water pressure through infiltration, thereby weakening the mechanical integrity of the dump slope landslide. Meanwhile, freeze–thaw processes induce the development of tensile cracks within the slope body, further accelerating deformation;
(3)
Compared with traditional imputation approaches, the proposed model achieved lower errors and higher trend consistency, further validated by field surveys and UAV-based optical modeling in May 2023, which confirmed high consistency between observed and imputed deformation;
(4)
To enhance the interpretability and applicability of the model, future research could incorporate explainable attention mechanisms to improve the model’s responsiveness to external factors and its ability to capture the stage-specific deformation characteristics of landslides, while parameter uncertainty and sensitivity analyses for the Posterior Correction module will also be explored when more data are available.

Author Contributions

Conceptualization, T.J. and C.C.; methodology, T.J. and X.A.; software, M.L.; validation, J.B., K.Z. and C.W.; formal analysis, K.Z., Y.J. and T.J.; investigation, C.C. and M.L.; resources, K.Z.; data curation, Y.J., J.B. and M.L.; writing—original draft preparation, T.J.; writing—review and editing, C.C., K.Z. and T.J.; visualization, M.L. and C.W.; supervision, C.C.; project administration, C.C.; funding acquisition, C.C. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the Natural Science Foundation of Jilin Province, China (grant number: 20220101158JC); the Jilin University Outstanding Youth Training Program; and the Jilin University Innovation & Entrepreneurship Practice Capacity Development Program (2025).

Data Availability Statement

The original contributions presented in the study are included in the article; further inquiries can be directed to the corresponding author.

Acknowledgments

We thank the editors and the reviewers for their outstanding comments and suggestions, which greatly helped us improve the technical quality of the manuscript.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A

Table A1. Cumulative displacements of representative monitoring points after missing data imputation (quarterly sampling).
Table A1. Cumulative displacements of representative monitoring points after missing data imputation (quarterly sampling).
DateP078 (mm)P117 (mm)P477 (mm)
30 April 2018−9.08625−8.8155−4.357
31 July 2018−19.15342−20.557−14.8835
31 October 2018−16.90858−35.10675−24.71433
31 January 20197.65175−25.08475−29.94925
30 April 20190.933−30.70267−42.58533
31 July 2019−0.568−37.97467−31.51033
31 October 2019−5.8745−38.1775−40.5775
31 January 2020−40.60367−57.877−39.899
30 April 2020−81.99587−96.74223−46.54307
31 July 2020−88.94933−110.66167−54.37
31 October 2020−119.02725−125.54475−65.25675
31 January 2021−133.01722−144.48028−61.17356
30 April 2021−155.04006−169.93448−54.76952
31 July 2021−151.6105−162.56767−57.44433
31 October 2021−158.48758−175.65467−64.32953
31 January 2022−167.56662−189.83428−69.2904
30 April 2022−151.99594−183.37887−71.32601
31 July 2022−154.26318−190.10727−79.95189
31 October 2022−142.47695−185.82915−81.39671
31 January 2023−155.81659−194.569−80.71175
30 April 2023−156.42469−241.52894−94.27434
17 May 2023−159.03469−3280.81894−37.02434

References

  1. Wang, Q.; Xing, A.; Xu, X.; Zhou, Y.; Yang, Q.; Song, H.; Peng, J.; Zhai, L.; Bilal, M.; Liu, Y. Formation mechanism and dynamic process of open-pit coal mine landslides: A case study of the Xinjing landslide in Inner Mongolia, China. Landslides 2024, 21, 541–556. [Google Scholar] [CrossRef]
  2. Huertas, J.I.; Huertas, M.E.; Izquierdo, S.; González, E.D. Air quality impact assessment of multiple open pit coal mines in northern Colombia. J. Environ. Manag. 2012, 93, 121–129. [Google Scholar] [CrossRef]
  3. Zhang, N.; Wang, Y.; Zhao, F.; Wang, T.; Zhang, K.; Fan, H.; Zhou, D.; Zhang, L.; Yan, S.; Diao, X. Monitoring and analysis of the collapse at Xinjing Open-Pit Mine, Inner Mongolia, China, using multi-source remote sensing. Remote Sens. 2024, 16, 993. [Google Scholar] [CrossRef]
  4. Zhou, C.; Cao, Y.; Yin, K.; Intrieri, E.; Catani, F.; Wu, L. Characteristic comparison of seepage-driven and buoyancy-driven landslides in Three Gorges Reservoir area, China. Eng. Geol. 2022, 301, 106590. [Google Scholar] [CrossRef]
  5. Jia, W.; Wang, Z. Landslide activity characteristics and stability analysis based on high-resolution remote sensing image: A case study of Dongmiaojia landslide. Remote Sens. Land Resour. 2019, 31, 174–181. [Google Scholar]
  6. Ju, L.-Y.; Xiao, T.; He, J.; Xu, W.-F.; Xiao, S.-H.; Zhang, L.-M. A simulation-enabled slope digital twin for real-time assessment of rain-induced landslides. Eng. Geol. 2025, 353, 108116. [Google Scholar] [CrossRef]
  7. Cao, C.; Zhu, K.; Song, T.; Bai, J.; Zhang, W.; Chen, J.; Song, S. Comparative study on potential landslide identification with ALOS-2 and sentinel-1A data in heavy forest reach, upstream of the Jinsha River. Remote Sens. 2022, 14, 1962. [Google Scholar] [CrossRef]
  8. Zhu, K.; Xu, P.; Cao, C.; Zheng, L.; Liu, Y.; Dong, X. Preliminary identification of geological hazards from Songpinggou to Feihong in Mao County along the Minjiang River using SBAS-InSAR technique integrated multiple spatial analysis methods. Sustainability 2021, 13, 1017. [Google Scholar] [CrossRef]
  9. Cao, C.; Zhu, K.; Xu, P.; Shan, B.; Yang, G.; Song, S. Refined landslide susceptibility analysis based on InSAR technology and UAV multi-source data. J. Clean. Prod. 2022, 368, 133146. [Google Scholar] [CrossRef]
  10. Zhang, W.; Yin, H.; Chen, J.; Lu, C.; Li, T.; Han, B.; Zhao, Z.; Wang, J.; Chen, J. Identification and thermal characteristics of linear discontinuities on a high-steep slope using UAV with thermal infrared imager. Int. J. Rock Mech. Min. Sci. 2025, 186, 106025. [Google Scholar] [CrossRef]
  11. Bozzano, F.; Cipriani, I.; Mazzanti, P.; Prestininzi, A. Displacement patterns of a landslide affected by human activities: Insights from ground-based InSAR monitoring. Nat. Hazards 2011, 59, 1377–1396. [Google Scholar] [CrossRef]
  12. Béjar-Pizarro, M.; Ezquerro, P.; Herrera, G.; Tomás, R.; Guardiola-Albert, C.; Ruiz Hernández, J.M.; Fernández Merodo, J.A.; Marchamalo, M.; Martínez, R. Mapping groundwater level and aquifer storage variations from InSAR measurements in the Madrid aquifer, Central Spain. J. Hydrol. 2017, 547, 678–689. [Google Scholar] [CrossRef]
  13. Xie, W.; Yang, J.; Li, Y.; Lei, J.; Zhong, J.; Li, J. Discriminative Feature Learning Constrained Unsupervised Network for Cloud Detection in Remote Sensing Imagery. Remote Sens. 2020, 12, 456. [Google Scholar] [CrossRef]
  14. Tsironi, V.; Ganas, A.; Karamitros, I.; Efstathiou, E.; Koukouvelas, I.; Sokos, E. Kinematics of active landslides in Achaia (Peloponnese, Greece) through InSAR time series analysis and relation to rainfall patterns. Remote Sens. 2022, 14, 844. [Google Scholar] [CrossRef]
  15. Chen, J.; Zhang, W.; Lu, C.; Zheng, Y.; Liu, H.; Li, W.; Yin, H.; Wang, J.; Wang, Y. Evolution and migration patterns of sediments in an earthquake-affected catchment in Wenchuan, Sichuan Province, China. Catena 2025, 249, 108712. [Google Scholar] [CrossRef]
  16. Li, S.; Xu, W.; Li, Z. Review of the SBAS InSAR Time-series algorithms, applications, and challenges. Geod. Geodyn. 2022, 13, 114–126. [Google Scholar] [CrossRef]
  17. Moretto, S.; Bozzano, F.; Mazzanti, P. The role of satellite InSAR for landslide forecasting: Limitations and openings. Remote Sens. 2021, 13, 3735. [Google Scholar] [CrossRef]
  18. Höser, T. Analysing the Capabilities and Limitations of InSAR Using Sentinel-1 Data for Landslide Detection and Monitoring. Ph.D. Thesis, University of Bonn, Bonn, Germany, 2018. [Google Scholar]
  19. Hippert-Ferrer, A.; Yan, Y.; Bolon, P. EM-EOF: Gap-filling in incomplete SAR displacement time series. IEEE Trans. Geosci. Remote Sens. 2020, 59, 5794–5811. [Google Scholar] [CrossRef]
  20. Appel, M. Efficient data-driven gap filling of satellite image time series using deep neural networks with partial convolutions. Artif. Intell. Earth Syst. 2024, 3, 220055. [Google Scholar] [CrossRef]
  21. Jancevičius, J. Enhancing land use classification with hybrid machine learning and satellite imagery. New Trends Comput. Sci. 2025, 3, 1–17. [Google Scholar] [CrossRef]
  22. Lepot, M.; Aubin, J.-B.; Clemens, F.H. Interpolation in time series: An introductive overview of existing methods, their performance criteria and uncertainty assessment. Water 2017, 9, 796. [Google Scholar] [CrossRef]
  23. Ball, J.E.; Anderson, D.T.; Chan, C.S. Comprehensive survey of deep learning in remote sensing: Theories, tools, and challenges for the community. J. Appl. Remote Sens. 2017, 11, 042609. [Google Scholar] [CrossRef]
  24. Borovykh, A.; Bohte, S.; Oosterlee, C.W. Conditional time series forecasting with convolutional neural networks. arXiv 2017, arXiv:1703.04691. [Google Scholar]
  25. Bai, S.; Kolter, J.Z.; Koltun, V. An empirical evaluation of generic convolutional and recurrent networks for sequence modeling. arXiv 2018, arXiv:1803.01271. [Google Scholar] [CrossRef]
  26. Prater, R.; Hanne, T.; Dornberger, R. Generalized Performance of LSTM in Time-Series Forecasting. Appl. Artif. Intell. 2024, 38, 2377510. [Google Scholar] [CrossRef]
  27. Kim, T.; Sharda, S.; Zhou, X.; Pendyala, R.M. A stepwise interpretable machine learning framework using linear regression (LR) and long short-term memory (LSTM): City-wide demand-side prediction of yellow taxi and for-hire vehicle (FHV) service. Transp. Res. Part C Emerg. Technol. 2020, 120, 102786. [Google Scholar] [CrossRef]
  28. Buratto, W.G.; Muniz, R.N.; Nied, A.; González, G.V. Seq2Seq-LSTM with attention for electricity load forecasting in Brazil. IEEE Access 2024, 12, 30020–30029. [Google Scholar] [CrossRef]
  29. Xiang, Z.; Yan, J.; Demir, I. A rainfall-runoff model with LSTM-based sequence-to-sequence learning. Water Resour. Res. 2020, 56, e2019WR025326. [Google Scholar] [CrossRef]
  30. Masood, Z.; Gantassi, R.; Choi, Y. A multi-step time-series clustering-based Seq2Seq LSTM learning for a single household electricity load forecasting. Energies 2022, 15, 2623. [Google Scholar] [CrossRef]
  31. Li, Z.; Chen, J.; Cao, C.; Zhang, W.; Zhu, K.; Bai, J.; Wu, C. Enhancing long-term prediction of non-homogeneous landslides incorporating spatiotemporal graph convolutional networks and InSAR. Eng. Geol. 2025, 347, 107917. [Google Scholar] [CrossRef]
  32. Du, W. A Deep Learning Model to Impute Missing Data in Time Series. Master’s Thesis, Concordia University, Montreal, QC, Canada, 2021. [Google Scholar]
  33. Stefenon, S.F.; Seman, L.O.; Aquino, L.S.; dos Santos Coelho, L. Wavelet-Seq2Seq-LSTM with attention for time series forecasting of level of dams in hydroelectric power plants. Energy 2023, 274, 127350. [Google Scholar] [CrossRef]
  34. Li, M.; Ren, Q.; Li, M.; Qi, Z.; Tan, D.; Wang, H. Multivariate probabilistic prediction of dam displacement behaviour using extended Seq2Seq learning and adaptive kernel density estimation. Adv. Eng. Inform. 2025, 65, 103343. [Google Scholar] [CrossRef]
  35. Sun, G.; Zhao, Y.; Qi, X. Sequence to sequence architecture based on hybrid LSTM global and local encoders approach for meteorological factors forecasting. Sci. Rep. 2025, 15, 22753. [Google Scholar] [CrossRef] [PubMed]
  36. Jang, E.; Han, D.; Im, J.; Sung, T.; Kim, Y.J. Deep learning-based gap filling for near real-time seamless daily global sea surface salinity using satellite observations. Int. J. Appl. Earth Obs. Geoinf. 2024, 132, 104029. [Google Scholar] [CrossRef]
  37. Casolaro, A.; Capone, V.; Iannuzzo, G.; Camastra, F. Deep learning for time series forecasting: Advances and open problems. Information 2023, 14, 598. [Google Scholar] [CrossRef]
  38. Wen, Q.; Chen, W.; Sun, L.; Zhang, Z.; Wang, L.; Jin, R.; Tan, T. Onenet: Enhancing time series forecasting models under concept drift by online ensembling. Adv. Neural Inf. Process. Syst. 2023, 36, 69949–69980. [Google Scholar]
  39. Cao, H.; Leng, H.; Zhao, J.; Xu, X.; Yang, J.; Li, B.; Zhou, Y.; Huang, L. Exploration of Deep-Learning-Based Error-Correction Methods for Meteorological Remote-Sensing Data: A Case Study of Atmospheric Motion Vectors. Remote Sens. 2024, 16, 3522. [Google Scholar] [CrossRef]
  40. Liu, Y.; Duan, J.; Meng, J. Difference attention based error correction LSTM model for time series prediction. J. Phys. Conf. Ser. 2020, 1550, 032121. [Google Scholar] [CrossRef]
  41. Berardino, P.; Fornaro, G.; Lanari, R.; Sansosti, E. A new algorithm for surface deformation monitoring based on small baseline differential SAR interferograms. IEEE Trans. Geosci. Remote Sens. 2003, 40, 2375–2383. [Google Scholar] [CrossRef]
  42. Shan, B.; Qi, J.; Tian, W.; Zhu, K.; Jin, T.; Yang, Q.; An, X.; Yang, G.; Hu, Q.; Cao, C. Early Identification of Geological Hazards Along the Power Transmission Line in Weinan Based on SBAS-InSAR. Appl. Sci. 2025, 15, 920. [Google Scholar] [CrossRef]
  43. Sutskever, I.; Vinyals, O.; Le, Q.V. Sequence to sequence learning with neural networks. In Proceedings of the Advances in Neural Information Processing Systems, Montreal, QC, Canada, 8–13 December 2014; p. 27. [Google Scholar]
  44. Lamb, A.M.; ALIAS PARTH GOYAL, A.G.; Zhang, Y.; Zhang, S.; Courville, A.C.; Bengio, Y. Professor forcing: A new algorithm for training recurrent networks. In Proceedings of the Advances in Neural Information Processing Systems, Barcelona, Spain, 5–10 December 2016; p. 29. [Google Scholar]
  45. Petley, D.N.; Bulmer, M.H.; Murphy, W. Patterns of movement in rotational and translational landslides. Geology 2002, 30, 719–722. [Google Scholar] [CrossRef]
  46. Liu, Y.; Wang, C.; Gao, G.; Wang, P.; Hou, Z.; Jiao, Q. Analysis of the instability conditions and failure mode of a special type of translational landslide using long-term monitoring data: A case study of the Wobaoshi landslide (in Bazhong, China). Nat. Hazards Earth Syst. Sci. 2020, 20, 1305–1319. [Google Scholar] [CrossRef]
  47. Xin, P.; Kang, X.; Wu, W.; Scaringi, G.; Wang, X.; Wu, Q. Centrifuge modelling of a roto-translational landslide in stiff clay formation. Eng. Geol. 2025, 349, 107964. [Google Scholar] [CrossRef]
  48. Yang, J.; Wu, G.; Jiao, J.; Dyck, M.; He, H. Freeze-thaw induced landslides on grasslands in cold regions. Catena 2022, 219, 106650. [Google Scholar] [CrossRef]
  49. Wang, G.; Sassa, K. Pore-pressure generation and movement of rainfall-induced landslides: Effects of grain size and fine-particle content. Eng. Geol. 2003, 69, 109–125. [Google Scholar] [CrossRef]
  50. Pelascini, L.; Steer, P.; Mouyen, M.; Longuevergne, L. Finite-hillslope analysis of landslides triggered by excess pore water pressure: The roles of atmospheric pressure and rainfall infiltration during typhoons. Nat. Hazards Earth Syst. Sci. 2022, 22, 3125–3141. [Google Scholar] [CrossRef]
  51. Li, T.; Zhang, L.; Gong, W.; Tang, H.; Jiang, R. Initiation mechanism of landslides in cold regions: Role of freeze-thaw cycles. Int. J. Rock Mech. Min. Sci. 2024, 183, 105906. [Google Scholar] [CrossRef]
  52. Chen, T.; Shu, J.; Han, L.; Tovele, G.S.; Li, B. Landslide mechanism and stability of an open-pit slope: The Manglai open-pit coal mine. Front. Earth Sci. 2023, 10, 1038499. [Google Scholar] [CrossRef]
  53. Hill, P.; Biggs, J.; Ponce-López, V.; Bull, D. Time-series prediction approaches to forecasting deformation in Sentinel-1 INSAR data. J. Geophys. Res. Solid Earth 2021, 126, e2020JB020176. [Google Scholar] [CrossRef]
  54. Guerriero, L.; Prinzi, E.P.; Calcaterra, D.; Ciarcia, S.; Di Martire, D.; Guadagno, F.M.; Ruzza, G.; Revellino, P. Kinematics and geologic control of the deep-seated landslide affecting the historic center of Buonalbergo, southern Italy. Geomorphology 2021, 394, 107961. [Google Scholar] [CrossRef]
  55. Crippa, C.; Valbuzzi, E.; Frattini, P.; Crosta, G.B.; Spreafico, M.C.; Agliardi, F. Semi-automated regional classification of the style of activity of slow rock-slope deformations using PS InSAR and SqueeSAR velocity data. Landslides 2021, 18, 2445–2463. [Google Scholar] [CrossRef]
  56. Zhou, C.; Ye, M.; Xia, Z.; Wang, W.; Luo, C.; Muller, J.-P. An interpretable attention-based deep learning method for landslide prediction based on multi-temporal InSAR time series: A case study of Xinpu landslide in the TGRA. Remote Sens. Environ. 2025, 318, 114580. [Google Scholar] [CrossRef]
Figure 1. Location and basic conditions of the study area: (a) location and layout of the Hesigewula South Open-Pit Coal Mine; (b) panoramic view of the southern dump slope landslide; (c) UAV-based optical reconstruction of the southern dump slope landslide; and (d) engineering geological cross-section of the landslide.
Figure 1. Location and basic conditions of the study area: (a) location and layout of the Hesigewula South Open-Pit Coal Mine; (b) panoramic view of the southern dump slope landslide; (c) UAV-based optical reconstruction of the southern dump slope landslide; and (d) engineering geological cross-section of the landslide.
Remotesensing 17 02962 g001
Figure 2. Workflow of the missing data imputation method for InSAR monitoring data.
Figure 2. Workflow of the missing data imputation method for InSAR monitoring data.
Remotesensing 17 02962 g002
Figure 3. (a) SBAS-InSAR average deformation rates of the southern dump slope landslide in the vertical direction, with GNSS monitoring locations. (b,c) Surface damage features, including a displaced drainage ditch and a surface crack.
Figure 3. (a) SBAS-InSAR average deformation rates of the southern dump slope landslide in the vertical direction, with GNSS monitoring locations. (b,c) Surface damage features, including a displaced drainage ditch and a surface crack.
Remotesensing 17 02962 g003
Figure 4. (af). Six SBAS-InSAR cumulative deformation results of the southern dump slope landslide in the vertical direction at Hesigewula from 2018 to 2021.
Figure 4. (af). Six SBAS-InSAR cumulative deformation results of the southern dump slope landslide in the vertical direction at Hesigewula from 2018 to 2021.
Remotesensing 17 02962 g004
Figure 5. Cumulative displacement of three representative monitoring points from 2018 to 2023, with overlaid temperature and precipitation (I: InSAR monitoring period; II: gap period; III: GNSS period). The red dashed boxes highlight slope deformation acceleration associated with the thawing period, while the blue dashed boxes highlight slope deformation acceleration associated with short-term responses to heavy rainfall events.
Figure 5. Cumulative displacement of three representative monitoring points from 2018 to 2023, with overlaid temperature and precipitation (I: InSAR monitoring period; II: gap period; III: GNSS period). The red dashed boxes highlight slope deformation acceleration associated with the thawing period, while the blue dashed boxes highlight slope deformation acceleration associated with short-term responses to heavy rainfall events.
Remotesensing 17 02962 g005
Figure 6. Cumulative deformation results of the landslide over nine periods following missing data imputation.
Figure 6. Cumulative deformation results of the landslide over nine periods following missing data imputation.
Remotesensing 17 02962 g006
Figure 7. Cumulative deformation of representative monitoring points after missing data imputation is overlaid with temperature and precipitation (I: InSAR monitoring period; II: gap period; III: GNSS period).
Figure 7. Cumulative deformation of representative monitoring points after missing data imputation is overlaid with temperature and precipitation (I: InSAR monitoring period; II: gap period; III: GNSS period).
Remotesensing 17 02962 g007
Figure 8. (ac) Imputation results at three representative points during the test period.
Figure 8. (ac) Imputation results at three representative points during the test period.
Remotesensing 17 02962 g008
Figure 9. (ac) Corresponding performance metrics at three representative points.
Figure 9. (ac) Corresponding performance metrics at three representative points.
Remotesensing 17 02962 g009
Figure 10. Landslide zonation map derived from field investigation and UAV-based optical modeling, along with representative failure features observed in Zones I–V.
Figure 10. Landslide zonation map derived from field investigation and UAV-based optical modeling, along with representative failure features observed in Zones I–V.
Remotesensing 17 02962 g010
Table 1. Hyperparameters of the Seq2Seq LSTM model.
Table 1. Hyperparameters of the Seq2Seq LSTM model.
HyperparametersParameter
LSTM hidden size128
Number of LSTM layers2
Batch size32
Loss functionMSE + 0.5 × continuity loss
OptimizerAdamW
Initial learning rate1 × 10−4
Weight decay1 × 10−4
Teacher forcing (init/min/decay)0.8/0.01/0.95
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Jin, T.; Cao, C.; Li, M.; Zhu, K.; Jing, Y.; Wu, C.; An, X.; Bai, J. A Missing Data Imputation Method for Waste Dump Landslide Deformation Monitoring Based on a Seq2Seq LSTM–Posterior Correction Model. Remote Sens. 2025, 17, 2962. https://doi.org/10.3390/rs17172962

AMA Style

Jin T, Cao C, Li M, Zhu K, Jing Y, Wu C, An X, Bai J. A Missing Data Imputation Method for Waste Dump Landslide Deformation Monitoring Based on a Seq2Seq LSTM–Posterior Correction Model. Remote Sensing. 2025; 17(17):2962. https://doi.org/10.3390/rs17172962

Chicago/Turabian Style

Jin, Tie, Chen Cao, Ming Li, Kuanxing Zhu, Yaxuan Jing, Chenyang Wu, Xiguan An, and Ji Bai. 2025. "A Missing Data Imputation Method for Waste Dump Landslide Deformation Monitoring Based on a Seq2Seq LSTM–Posterior Correction Model" Remote Sensing 17, no. 17: 2962. https://doi.org/10.3390/rs17172962

APA Style

Jin, T., Cao, C., Li, M., Zhu, K., Jing, Y., Wu, C., An, X., & Bai, J. (2025). A Missing Data Imputation Method for Waste Dump Landslide Deformation Monitoring Based on a Seq2Seq LSTM–Posterior Correction Model. Remote Sensing, 17(17), 2962. https://doi.org/10.3390/rs17172962

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop