An Automatic Identification Method for Large-Scale Landslide Hazard Potential Integrating InSAR and CRF-Faster RCNN: A Case Study of Ahai Reservoir Area in Jinsha River Basin

Dong, Yujuan; Li, Yongfa; Zuo, Xiaoqing; Liu, Na; Gu, Xiaona; Shi, Haoyi; Jiang, Rukun; Guo, Fangzhen; Gu, Zhengxiong; Chen, Yongzhi

doi:10.3390/rs18020283

Open AccessArticle

An Automatic Identification Method for Large-Scale Landslide Hazard Potential Integrating InSAR and CRF-Faster RCNN: A Case Study of Ahai Reservoir Area in Jinsha River Basin

by

Yujuan Dong

^1,2,3,

Yongfa Li

^1,2,*

,

Xiaoqing Zuo

^1,2,

Na Liu

^1,3,

Xiaona Gu

¹,

Haoyi Shi

¹,

Rukun Jiang

³,

Fangzhen Guo

³,

Zhengxiong Gu

⁴ and

Yongzhi Chen

⁴

¹

Faculty of Land Resources Engineering, Kunming University of Science and Technology, Kunming 650093, China

²

Yunnan Key Laboratory of Intelligent Monitoring and Spatiotemporal Big Data Governance of Natural Resources, Kunming 650051, China

³

Chongqing Institute of Geology and Mineral Surveying and Mapping Co., Ltd., Chongqing 401120, China

⁴

Yunnan Institute of Geology and Mineral Surveying and Mapping Co., Ltd., Kunming 650051, China

^*

Author to whom correspondence should be addressed.

Remote Sens. 2026, 18(2), 283; https://doi.org/10.3390/rs18020283

Submission received: 13 November 2025 / Revised: 26 December 2025 / Accepted: 30 December 2025 / Published: 15 January 2026

(This article belongs to the Special Issue State of the Art of GNSS and SAR/InSAR Techniques for Geomatic Applications)

Download

Browse Figures

Versions Notes

Highlights

What are the main findings?

Adopting a “deformation-driven” identification paradigm, a collaborative approach integrating “InSAR deformation monitoring, intelligent identification of deformation anomaly areas, and multi-source validation” was constructed, which achieved the early detection of landslides.
An improved Faster RCNN model (CRF Faster RCNN) integrating CBAM, ResNet-50, and FPN was proposed, which achieved the automatic identification of landslide anomaly areas from InSAR deformation maps.

What are the implications of the main findings?

Compared with Faster RCNN, this method significantly improves the accuracy of landslide detection and provides important theoretical support for the identification of large-scale landslide hazards.
This provides a scientific basis for the prevention and control of geological disasters in the Ahai Reservoir area of the Jinsha River Basin.

Abstract

Currently, the manual delineation of landslide anomalies from Interferometric Synthetic Aperture Radar(InSAR )deformation data is labor-intensive and time-consuming, creating a major bottleneck for operational large-scale landslide mapping. This study proposes an automated approach for large-scale landslide identification by integrating InSAR technology with an improved Faster Regional Convolutional Neural Network (Faster R-CNN). First, surface deformation over the study area was obtained using the Small Baseline Subset Interferometric Synthetic Aperture Radar (SBAS-InSAR) technique. An enhanced CRF-Faster R-CNN model was then developed by incorporating a Residual Network with 50 layers (ResNet-50)-based backbone, strengthened with a Convolutional Block Attention Module (CBAM), within a Feature Pyramid Network (FPN) framework. This model was applied to deformation velocity maps for the automated detection of landslide-prone areas. Preliminary results were subsequently validated and refined using optical images to produce a final landslide inventory. The proposed method was evaluated in the Ahai Reservoir area of the Jinsha River Basin using 248 ascending and descending Sentinel-1A images acquired between January 2019 and December 2021. Its performance was compared with that of the standard Faster R-CNN model. The results indicate that the CRF-Faster R-CNN model outperforms the conventional approach in terms of landslide anomaly detection, convergence speed, and overall accuracy. A total of 38 potential landslide hazards were identified in the Ahai Reservoir area, with an 84% validation accuracy confirmed through field investigations. This study provides crucial technical support for the rapid identification and operational application of large-scale potential landslide hazards.

Keywords:

SBAS-InSAR; Ahai reservoir area; deformation monitoring; deep learning; automatic landslide identification

1. Introduction

Landslides constitute one of the most prevalent global geological hazards, whose formation and evolution are shaped by multiple factors such as topography, geological structures, geotechnical properties, and hydrology. Owing to their high suddenness, strong destructiveness, and widespread impact, landslides pose severe threats to human life, property safety, and socio-economic development [1,2,3]. Owing to its unique topography and climate, Southwest China is the most severely affected region, experiencing the highest frequency and most substantial losses from landslide disasters [4,5,6].

The Jinsha River Basin serves as a pivotal base for hydropower energy development. Nevertheless, the interplay of its distinctive topography, vertical climate zones, fragile geology, and intensive human engineering activities has resulted in a high frequency of landslides. These events seriously threaten hydropower project safety and regional sustainable development [7,8,9]. The reservoir area of the Ahai Hydropower Station, situated in the middle reaches of the Jinsha River, constitutes a critical node in the West-East Electricity Transmission project. A combination of typical alpine canyon topography, active neotectonic movements, fragmented rock masses, and frequent heavy rainfall creates an environment predisposing the area to pronounced reservoir bank slope failures [10,11]. Consequently, research focused on landslide identification in this area is of paramount importance for ensuring the safe operation of the hydropower station and for protecting the lives and property within the surrounding environment [12].

Surface deformation monitoring is crucial for landslide prevention. While traditional techniques, such as leveling and Global Navigation Satellite System (GNSS) offer high accuracy, their application is often hindered by poor accessibility in high-altitude terrain, long observation cycles, and limited spatial coverage, making them inefficient for monitoring large reservoir areas [13,14]. In recent years, InSARhas emerged as a pivotal tool for large-area deformation monitoring due to its broad coverage, high precision, and non-contact nature [15,16,17,18]. The evolution of time series InSAR techniques, including Persistent Scatterer Interferometric Synthetic Aperture Radar (PS-InSAR) [19,20], SBAS-InSAR [21,22], and Distributed Scatterer InSAR (DS-InSAR) [23,24], has effectively mitigated decorrelation issues, significantly advancing landslide applications. For instance, studies have successfully coupled these methods with machine learning for susceptibility mapping, identified numerous landslides in reservoir areas [25], and the updating of a regional landslide inventory in Tuscany using PS-InSAR, leading to the identification of 672 active landslides [26]. Further innovation is seen in the combination of multi-temporal InSAR with meta-learning to improve slow landslide detection in complex terrains such as Hong Kong [27]. Despite these advancements, large-scale landslide identification still heavily relies on manual interpretation of InSAR deformation data, which is a labor-intensive and time-consuming process. There is a pronounced lack of efficient and accurate methods for the automated identification of landslide anomalies.

In recent years, deep learning has made significant progress in the field of geological hazard identification, providing a new technological approach for large-scale landslide identification. The landslide recognition method based on deep learning utilizes a hierarchical feature learning mechanism to automatically extract key features such as morphology and texture, effectively improving the low efficiency and subjectivity of traditional manual interpretation methods [28], and has achieved a series of important results in the field of landslide recognition [29,30,31,32,33]. In addition, the integration of InSAR technology and Convolutional Neural Network (CNN) algorithm has demonstrated good applicability in multiple fields, including seismic deformation monitoring [34], mining subsidence assessment [35], and ground subsidence detection [36]. Research has shown that CNN has significant advantages in automatic identification and monitoring of geological hazards based on InSAR datasets [37]. However, in alpine valley regions, influenced by complex topography and vegetation coverage, existing methods still exhibit limitations in feature representation and model generalization capabilities. The development of intelligent identification techniques tailored to complex terrain conditions remains an important research direction.

This study proposes an automated method for identifying large-scale potential landslide hazards by integrating InSAR technology with an improved CRF-Faster R-CNN model. Taking the Ahai Reservoir area in the Jinsha River Basin as a case study, we processed 248 ascending and descending Sentinel-1A images acquired between January 2019 and December 2021. The SBAS-InSAR technique was employed to derive surface deformation information. The Faster R-CNN architecture was enhanced through the incorporation of ResNet-50 integrated with CBAM and FPN. The model was trained using monthly deformation velocity maps generated through SBAS-InSAR processing to enable automated detection of deformation anomalies. Potential landslide hazards were systematically identified through the integration of high-resolution optical remote sensing imagery and field validation, with detection accuracy rigorously evaluated. This research provides significant contributions to landslide prevention and mitigation efforts, offering valuable insights for geological disaster risk management in southwestern China.

2. Study Area and Datasets

2.1. Study Area

The Ahai Hydropower Station reservoir area in the Jinsha River Basin is located at the junction of Yulong County and Ninglang County in Lijiang City, Yunnan Province, China (27°21′N–27°46′N, 100°15′E–100°31′E), as shown in Figure 1a. As the fourth cascade development in the middle reaches of the Jinsha River, it connects hydrologically with the upstream Liyuan Station and neighbors the downstream Jin’anqiao Station. This region features a warm-temperate plateau mountain climate with pronounced vertical zonation, where approximately 80% of the annual precipitation occurs from May to October, peaking in July and August. These marked seasonal hydrological fluctuations substantially increase landslide risks. The geological framework of the study area is located within the southwestern periphery of the Yangtze Paraplatform, adjacent to its tectonic junction with the southern prolongation of the Songpan-Garzê Fold System, and lies within the interior of the Sichuan-Yunnan rhombic block. The region exhibits intense Cenozoic tectonic activity, characterized by densely developed folds and faults, with frequent seismic events occurring along the Xianshuihe–Eastern Yunnan seismic belt, as shown in Figure 1b. The reservoir area exhibits a deeply incised V-shaped valley geomorphology, with elevations on both banks exceeding 3000 m, and gullies developing into steep cliff terrain. The bedrock in the reservoir area is dominated by the lower Devonian Alengchuzu Formation of low-grade metamorphic rocks, with a lithological combination of sandstone, siltstone, and slate interbedding. Superficial deposits include gravelly clay, colluvial debris, and cretaceous sands, with weathered basalt blocks prone to softening due to hydro-sensitivity. This distinctive geological setting has fostered the extensive development of collapse deposits, mixed accumulations, and a variety of landslide types across the reservoir region. Coupled with intense and concentrated rainfall events, the reservoir slopes are subjected to substantial compound geohazard risks, including debris flows and slope failures, which pose critical threats to the stability of the riverbanks.

2.2. Datasets

This study employs Sentinel-1A Single Look Complex (SLC) images in Interferometric Wide Swath (IW) mode with VV polarization as the primary SAR data source. The basic parameter information is shown in Table 1. A total of 88 ascending and 160 descending image datasets from January 2019 to December 2021 were processed to derive surface deformation measurements. Topographic correction was performed using the 12.5 m resolution ALOS DEM provided by the Japan Aerospace Exploration Agency (JAXA), while atmospheric correction was implemented with Generic Atmospheric Correction Online Service (GACOS) data obtained from http://www.gacos.net/ (accessed on 9 August 2023). Additionally, optical remote sensing images from Google Earth were utilized to provide auxiliary support for landslide hazard identification. Furthermore, fault data, lithological information, and hydrographic networks were obtained from China’s National Fundamental Geographic Database and the China Geological Survey. Historical landslide disaster point data provided by Yunnan Geological Hazard Prevention authorities served as reliable validation references for automated landslide identification results.

3. Materials and Methods

Figure 2 illustrates the technical flowchart of this study, which consists of three key components: (1) acquisition of surface deformation information using SBAS-InSAR technology, based on 88 ascending and 160 descending Sentinel-1A images acquired between January 2019 and December 2021; (2) an improved Faster R-CNN model incorporating the CBAM attention mechanism, ResNet-50, and FPN was adopted to automatically identify landslide anomalies from deformation velocity maps; (3) accuracy assessment was conducted through model comparison, while optical imagery and field surveys were integrated to confirm the landslide hazard inventory in the study area and validate the accuracy of landslide identification results.

3.1. Technical Principle of SBAS-InSAR

SBAS-InSAR technology, capable of monitoring large-scale, long-time-series surface deformation with millimeter-scale precision, has become an essential tool for observing slow surface displacement and estimating geophysical parameters [11]. In contrast to D-InSAR and PS-InSAR, the SBAS method performs interferometric analysis on multiple SAR images acquired over the same area at different times. Unlike PS-InSAR, which depends on permanent scatterers, SBAS-InSAR utilizes distributed scatterers with stable characteristics, providing superior performance in natural terrain monitoring and enhanced resistance to decorrelation.

First, precise orbit data were applied to geometrically rectify the SAR images, thereby improving their geolocation accuracy. During the interferometric processing stage, the input SAR data undergoes registration of interferometric pairs. Given

N + 1

single-look complex (SLC) SAR images acquired at times

t_{0}, t_{1} \dots t_{N}

, one image is selected as the master and the remaining images were co-registered to it. During interferogram generation, appropriate spatial and temporal baseline thresholds were set, ultimately resulting in

M

interferometric pairs. These interferometric pairs can be expressed as:

\frac{N + 1}{2} < M < N (\frac{N + 1}{2})

(1)

The experiment discarded interferometric pairs with low coherence, resulting in the generation of 216 ascending and 238 descending interferograms. For each interferometric pair, the phase difference was calculated to extract interferometric phase information. Phase unwrapping was performed using the minimum cost flow (MCF) method, and Goldstein filtering was applied to smooth the interferometric phase. To mitigate atmospheric phase delays, GACOS data were used for atmospheric correction, and the topographic phase was removed using a high-accuracy DEM. Through the selection of ground control points (GCPs) for orbit refinement and re-flattening processing, followed by two inversion and geocoding, we ultimately obtained line-of-sight (LOS) deformation rates and associated results.

Building upon the aforementioned processing, using

t_{0}

as the initial time and defining

t_{A}

and

t_{B}

as subsequent time intervals (

t_{B}

>

t_{A}

), the differential interferometric phase

(∆ \emptyset)

at any pixel coordinates

(x, r)

in the

j

-th differential interferogram can be expressed as:

δ ϕ_{j} (x, r) = ϕ_{B} (x, r) - ϕ_{A} (x, r) \approx \frac{4 π}{λ} [d (t_{B}, x, r) - d (t_{A}, x, r)]

(2)

where

λ

represents the radar wavelength, and

d (t_{B}, x, r)

and

d (t_{A}, x, r)

denote the cumulative deformation of LOS at times

t_{B}

and

t_{A}

relative to the reference epoch

t_{0}

.

3.2. Faster RCNN Model

Proposed by Ross Girshick in 2015, Faster R-CNN stands as one of the most representative achievements in the R-CNN series and a classical example among two-stage object detection algorithms [38]. In contrast to single-stage object detection algorithms such as Single Shot MultiBox Detector (SSD) [39], You Only Look Once (YOLO) [40,41], Faster R-CNN exhibits higher detection accuracy and demonstrates superior adaptability to multi-scale objects. The algorithm introduces a Region Proposal Network (RPN) and performs refined classification and regression through Region of Interest Pooling (RoI Pooling), achieving end-to-end object detection with significant advantages in complex scene applications.

The Faster R-CNN network architecture comprises four principal components, the backbone feature extraction network, the Region Proposal Network (RPN), the ROI pooling layer, and the detection head, as illustrated in Figure 3. The processing involves first the input image feeds through a convolutional network to extract high-level feature maps enriched with semantic information. Subsequently, the RPN performs sliding-window detection across these feature maps, generating object proposals along with their corresponding confidence scores through a predefined anchor box mechanism. These variable-sized candidate regions are then normalized into fixed-dimensional feature blocks via ROI pooling. Finally, the corrected features are input into the detection head, which comprises two parallel branches: the classification branch that employs softmax function to calculate the probability of each proposal belonging to the landslide category, and the regression branch that further optimizes the bounding box coordinates to output the precise spatial location of the landslide body.

3.3. CRF-Faster RCNN Model

Aiming at the problems of the traditional Faster RCNN model, such as insufficient feature extraction capability caused by gradient vanishing in deep networks and difficulty in adapting single-layer feature output to multi-scale target detection, this study constructs a novel network model (CRF-Faster RCNN) based on the basic algorithm for the identification of landslide anomaly areas. This model replaces the traditional Visual Geometry Group (VGG) network with ResNet-50 and FPN as the backbone feature extraction network. With the help of the residual connection mechanism, it effectively solves the gradient vanishing problem in deep network training and improves the quality and stability of feature extraction. Meanwhile, it innovatively integrates the CBAM module, utilizing a channel-spatial dual attention mechanism to enhance the model’s adaptability to complex scenarios and improve the detection robustness in vegetation-covered areas. This model can more efficiently capture the semantic information and detailed features of landslide areas, thereby improving the accuracy and reliability of landslide monitoring. Figure 4 shows the schematic diagram of the CRF-Faster RCNN model structure.

ResNet-50 is a deep convolutional neural network architecture based on residual learning mechanisms designed to address the critical challenges of vanishing gradients and performance degradation in deep network training, representing a milestone achievement in computer vision [42]. By introducing residual connection mechanism, this architecture significantly enhances both optimization efficiency and feature representation capability in deep networks. The ResNet-50 architecture comprises five feature extraction stages (Conv1~Conv5), each implemented through stacked bottleneck modules that facilitate multi-level feature representation, as illustrated in Figure 5. Different from the traditional network that directly fits the target mapping function

H (x

), ResNet-50 reconstructs the target function into a residual learning form:

H (x) = F (x) + W_{s} x

(3)

where

x

is the input vector of the module, and

F (x)

is the residual function; when the input and output dimensions are consistent,

W_{s}

is an identity matrix (i.e.,

W_{s} (x) = x

), and the element-wise superposition of the input and residual features is achieved through Shortcut Connection; when the dimensions mismatch, W_s adopts a linear projection matrix composed of

1 \times 1

convolution to adjust the number of input channels and ensure dimensional consistency.

To enhance the multi-scale object detection capability, this study introduces FPN as the multi-scale feature fusion architecture [43]. Its core design includes the Top-Down Pathway, Lateral Connections, and feature fusion modules. Based on the feature maps (C2~C5) extracted from different stages of ResNet-50, FPN performs channel alignment and element-wise addition through bilinear interpolation sampling in the top-down pathway and lateral connections, thereby fusing high-semantic features from deep layers (e.g., C5) with high-resolution features from shallow layers (e.g., C4) across levels. It iteratively generates a multi-scale feature pyramid (P2~P6) with rich semantic and spatial details to support the detection of objects of different sizes. ResNet-50-FPN integrates deep semantic features and multi-scale features, which significantly improves detection and recognition performance. The combined architecture of ResNet50-FPN is specifically shown in Figure 6.

In target detection, traditional network structures are limited by local feature extraction, making it difficult to capture global contextual information and prone to losing key features, which results in insufficient detection accuracy. Therefore, based on the ResNet-50 and FPN architectures, this study introduces the CBAM module, which consists of a Channel Attention Module (CAM) and a Spatial Attention Module (SAM) in series. As shown in Figure 7, the input feature map

P

undergoes global average pooling and max pooling through the CAM module to capture channel dependencies and generate weights for weighting, resulting in an intermediate feature map

P^{'}

; subsequently, P′ is processed by the SAM module to generate spatial weights through spatial pooling and convolution, further calibrate the feature map, and ultimately output the optimized feature map

P^{″}

. This dual attention mechanism can effectively focus on the key regions of the target, enhancing the model’s recognition accuracy in complex scenarios.

3.4. Construction of Time-Series Deformation Dataset

This study constructed a sample dataset based on ascending and descending InSAR deformation measurements from the Ahai Reservoir area. The experimental data comprised SAR imagery acquired between 2019 and 2021, with monthly deformation data systematically incorporated into the dataset. The original deformation images, which comprise a total of 248 scenes (combining ascending and descending), each possess dimensions of 2285 × 3565 pixels. Owing to the considerable spatial extent of the study area, which far exceeds the model input size, a cropping procedure was implemented in order to resize individual samples to 512 × 512 pixels, ensuring complete representation of deformation-intensive regions in the processed images. This preprocessing stage generated 6560 image samples. To maintain the requisite level of detection accuracy, we excluded data with poor quality, retaining 410 high-quality samples as the foundational dataset. Subsequent to this, the dataset was expanded through the implementation of data augmentation techniques, resulting in the augmentation of the dataset to 1230 samples for comprehensive model training and testing.

3.5. Loss Function and Evaluation Metrics

This study employs a joint loss function to optimize the model parameters. The total loss (

L_{t o t a l}

) is composed of the classification loss (

L_{c l s}

) and the bounding box regression loss (

L_{b b o x}

), with the aim of improving the classification accuracy and spatial positioning ability of the landslide target simultaneously. The model performance is evaluated using mAP as the core metric, comprehensively reflecting the balance between precision and recall in landslide detection.

3.5.1. Joint Loss Function Design

The training of the Faster R-CNN model is achieved by optimizing the loss function, with the objective of minimizing the discrepancy between the predictions and the ground truth, thereby enhancing the model’s generalization capability and robustness. The overall loss function in this study consists of two components: the classification loss

L_{c l s}

and the bounding box regression loss function

L_{b b o x}

. These two components coordinate the optimization weights through the balancing coefficient

λ

.

L_{t o t a l} = L_{c l s} + λ L_{b b o x} (λ = 1)

(4)

The classification task employs the cross-entropy loss function, which quantifies the discrepancy between the probability distribution of model predictions and the distribution of ground-truth labels, thereby driving the optimization of classifier parameters. Combined with the Sigmoid function to map the outputs into a probability distribution, its mathematical expression is as follows:

L_{c l s} = - \frac{1}{N} \sum_{i = 1}^{N} \sum_{c = 1}^{C} y_{i, c} \log (p_{i, c})

(5)

where

N

represents the total number of samples (including both positive and negative samples),

C

denotes the total number of categories,

y_{i, c}

indicates the true label of the i-th sample for class

c

, and

p_{i, c}

represents the predicted probability that the i-th sample belongs to class

c

. The cross-entropy loss generates gradient signals proportional to the error through backpropagation, leading to substantial parameter updates when predictions are incorrect. In Faster R-CNN, this function is employed to determine whether candidate regions correspond to landslide targets or the background, enhancing the classifier’s discriminative ability though end-to-end training.

The bounding box regression task employs the Smooth

L 1

Loss function. By adopting a piecewise strategy, it maintains the smoothness of the

L 2

loss function for small errors while inheriting the robustness of the

L 1

loss function for large errors, making it the core loss function for bounding box regression in object detection models. Its mathematical expression is as follows:

L_{s m o o t h - L 1} (t, \hat{t}) = \{\begin{cases} 0.5 {(t - \hat{t})}^{2} i f |t - \hat{t}| < 1 \\ |t - \hat{t}| - 0.5 o t h e r w i s e \end{cases}

(6)

where

t

denotes the true value,

\hat{t}

represents the model’s predicted value, and

|t - \hat{t}|

indicates the absolute value of the prediction error.

3.5.2. Evaluation Metrics

The model performance is evaluated using the mean Average Precision (

m A P

) as the core metric, comprehensively assessing detection accuracy in multi-class scenarios. The

m A P

combines the Precision-Recall (

P - R

) curve metric and computes the average of the Average Precision (

A P

) values across all classes. A higher

m A P

value indicates greater model accuracy. The calculation process is as follows:

A P = \int_{0}^{1} P (R) d r

(7)

m A P = \frac{1}{n} \sum_{i = 1}^{n} {(A P)}_{i}

(8)

In evaluating the performance of the model in automatically identifying landslide anomaly areas, since the model’s prediction results for image data include only two categories: landslide anomaly areas and non-landslide targets; the task is treated as a binary classification problem. Calculations are performed using a binary confusion matrix. Combined with the confusion matrix (Table 2), further analysis of the model’s false positives and false negatives can be conducted.

4. Results and Analysis

4.1. InSAR Deformation Results

This study integrates both ascending and descending orbit datasets to obtain InSAR deformation information, achieving comprehensive monitoring of surface deformation in the reservoir area and significantly enhancing the accuracy and reliability of the monitoring results. Figure 8 displays the spatial distribution of the annual average deformation rate along the radar LOS, where blue indicates surface displacement toward the satellite and red denotes displacement away from the satellite. Constrained by the high mountain-valley terrain and the side-looking imaging geometry of radar, the positive and negative deformation values serve merely as preliminary characterization of surface deformation activity.

As shown in Figure 8a, the deformation results from the ascending data reveal significant spatial heterogeneity in the average annual deformation rate across the study area. The maximum deformation rate reaches −79.2 mm/yr, primarily concentrated near Fengke Town and Donglian Village on the west side of the Jinsha River. The deformation results from the descending, presented in Figure 8b, reveal an asymmetric distribution of extreme deformation values. The maximum deformation rate is −95.8 mm/y, concentrated along the eastern side of the Jinsha River from Baiya to Fengke. The average annual deformation rate within the region primarily ranges from −35 to −30 mm/yr. Deformation activities are spatially clustered in a belt-like pattern along both banks of the Jinsha River, showing strong correlation with regional fault structures and the orientation of the reservoir shoreline.

Prominent deformation zones identified from the ascending data are primarily distributed on the western side of the Jinsha River, encompassing villages such as Labo, Gukongmei, and Baiya. The segment from Meigudi to Shuzhi, in particular, shows potential landslide instability due to the effects of the reservoir bank. In contrast, the descending data reveal more active deformation on the eastern side of the river basin. Regions such as Ligu, Ruziluo, and Kuzhi generally exhibit higher subsidence rates compared to the left bank of the Jinsha River, which may be strongly associated with rock mass creep on steep eastern slopes and well-developed fold-related faults. Notably, regions including Baiya, Xinjian, and Shuzhi show significant deformation signals in both ascending and descending data, suggesting the potential presence of multi-directional superimposed deformation in these areas.

4.2. Automatic Identification Results of Landslide Anomaly Areas Based on the CRF-Faster RCNN

4.2.1. Experimental Setup and Training Visualization

The experiments were implemented in the Python 3.9 programming environment and developed based on the PyTorch 1.10 deep learning framework. The hardware environment for running the experiments was a Linux 18.04 64-bit operating system, an AMD Ryzen 5 3600X CPU @ 3.70 GHz, 128 GB of memory, and an NVIDIA GeForce RTX 3080 Ti GPU. The specific experimental parameters are listed in Table 3.

4.2.2. Identification Results of Abnormal Landslide Areas in Ahai Reservoir Area

This study employs the Faster RCNN and CRF-Faster RCNN models for the automatic identification of landslide anomaly areas in InSAR deformation datasets, with the results shown in Figure 9 and Figure 10. Specifically, Figure 9 presents the identification results from the ascending images, while Figure 10 displays those from the descending images. In the Figure 9a,b and Figure 10a,b represent the identification results of the Faster RCNN and CRF-Faster RCNN models, respectively.

As shown in Figure 9 and Figure 10, the CRF-Faster R-CNN model demonstrates significant improvement in identifying landslide anomaly areas. In the ascending images, Faster RCNN identified 42 landslide anomaly areas, whereas the CRF-Faster RCNN model identified 57. In the descending images, Faster RCNN detected 74 landslide anomaly areas, while the CRF-Faster RCNN model identified 82. These results indicate that the proposed CRF-Faster RCNN model offers superior capability in detailed landslide recognition. However, overlapping detection boxes can be observed in the results, which is primarily due to the IoU threshold being set to 0.7 in this study—meaning any prediction box with a probability exceeding 0.7 is recognized and retained as a landslide anomaly area. Furthermore, the study area is located in a high mountain valley region with substantial vegetation coverage, which may lead to decorrelation issues in the deformation results obtained by the SBAS-InSAR technique. Additionally, vegetation growth may also cause phase changes that might be misinterpreted as surface deformation, leading to false identifications of landslides. Consequently, it remains challenging to completely exclude the influence of decorrelated areas during detection, which may introduce certain errors in the results.

4.3. Model Performance Evaluation

The identification capability of the model is evaluated using the loss function derived from the model testing data. In the experiments, the classification loss function L_cls employs the cross-entropy loss function, and the L1 loss function is adopted for the bounding box regression loss L_bbox. Figure 11 illustrates the variation trends of the loss functions for both the Faster RCNN and the proposed CRF-Faster RCNN models. As observed, the CRF-Faster RCNN model converges faster and achieves a lower loss value. The Faster RCNN model shows local peaks after 2000 iterations, accompanied by significant fluctuations in the loss value, as indicated by the red dashed rectangle in Figure 11a. In contrast, the CRF-Faster RCNN model shows only minor local peaks before 2000 iterations, with slight fluctuations in loss values, indicating that the CRF-Faster RCNN model performs better in object recognition tasks.

Figure 12 illustrates the trend of mAP for the Faster RCNN and CRF-Faster RCNN models. The two models employ different initial learning rates. The CRF-Faster RCNN model, with a lower initial learning rate, enabling it to more thoroughly learn of image features and facilitating a stable rapid convergence towards the optimal solution. When the patience value is set to 10 epochs, the Faster RCNN model begins to converge after 20 epochs, whereas the CRF-Faster RCNN model converges after only 15 epochs. Furthermore, at the same convergence iteration count, the CRF-Faster RCNN model demonstrates higher convergence accuracy. The mAP of the Faster RCNN model eventually stabilized at 0.856, whereas the mAP of the CRF-Faster RCNN model converged to 0.888, approaching 0.9 asymptotically. These results demonstrate that the proposed CRF-Faster RCNN model not only converges faster but also achieves significantly improved identification accuracy, rendering it more suitable for landslide identification tasks.

4.4. Results of Landslide Hazard Identification Using Combined Optical Imaging and Deep Learning

To address the limitations of relying solely on InSAR deformation data for landslide identification, we conducted comprehensive interpretation of automatically detected landslide anomaly zones using optical remote sensing imagery. This analysis incorporated textural features, vegetation coverage patterns, and topographic characteristics to validate potential landslides. Through systematic elimination of other surface deformation anomalies, 38 potential landslides were identified, as shown in Figure 13. Field verification confirmed an overall accuracy of 84%, comprising 15 confirmed historical landslides and 17 showing varying degrees of landslide evidence.

The ascending data detected 24 deformation zones, including 7 prominent deformation areas identified on the western bank of the Jinsha River. The maximum deformation rate observed was −79.2 mm/yr, as shown in Figure 13a. The descending detected 22 deformation zones, with a maximum deformation rate of −53.59 mm/yr, and 8 significant deformation areas located on the eastern side of the river basin, as shown in Figure 13b.

Due to the right-side looking geometry of the radar system perpendicular to its flight direction, the ground projection of the InSAR observation vector for ascending trends from west to east, whereas that for descending trends from east to west. This difference in observation geometry results in substantial variations in the surface coverage monitored by each track. As a consequence, eight landslides (H3, H5, H8, H10, H11, H14, H15 and H16) were detected by both ascending and descending in this study.

A total of nine typical landslides (H1, H8, H10, H11, H14, H16, H22, H30 and H31) were selected from the ascending and descending orbit data for in-depth analysis. Verification was conducted by overlaying and comparing InSAR-derived deformation monitoring points with high-resolution remote sensing images. Among these, the landslides at Baiya (H8), Xinjian (H10), Gukongmei (H14), Shuzhi (H16), and Ladingli (H30) landslides are paleolandslide masses that remain continuously active to date. These landslides primarily develop in surface slopes, residual layers and saturated or strongly weathered rock, classifying them as shallow or medium-thick landslides, and are closely associated with human activities. Figure 14 presents the InSAR deformation rates and corresponding optical images for some of the potential landslide sites. The red solid lines delineate the landslide boundaries, the yellow dashed lines denote landslide subsidence areas, and the arrows represent the sliding directions.

4.5. Analysis of Typical Landslides

To further verify the accuracy of landslide identification, the Ligu (H3) landslide, which exhibits significant deformation, was selected for analysis. The H3 landslide is a giant ancient landslide located on the eastern bank of the Jinsha River Basin, adjacent to the river itself and in close proximity to the Ahai and Liyuan Hydropower Stations. Given its particular geographical setting, a potential reactivation of landslide movement at this location could trigger a cascade of adverse consequences. Specifically, it might result in river blockage, causing extensive damage to the downstream hydropower stations. This, in turn, could lead to subsequent flooding events and the onset of secondary geohazards. Such occurrences would pose a severe threat to the livelihoods and safety of the residents in downstream towns.

Figure 15 presents the descending deformation velocity along the LOS direction for the H3 landslide during the 2019–2021 period, derived using the SBAS-InSAR technique. The results reveal prominent deformation anomalies adjacent to the Jinsha River, corresponding to a newly identified landslide mass whose basal portion connects directly with the river channel. The maximum deformation rate of this newly identified landslide mass reaches −92.8 mm/yr. The landslide exhibits characteristic geomorphology with elevated margins and a depressed central portion, forming a distinct depression approximately 800 m wide, as shown in the white dashed line area in Figure 15c. The landslide boundaries are clearly delineated, within which the deformation rates show significant acceleration.

Figure 16a shows the optical characteristics of the H3 landslide. The landslide area exhibits low vegetation coverage, with predominantly exposed ground surface and locally steep slopes. The red dashed line marks the original boundary of the H3 landslide, measuring approximately 3000 m in length, 2500 m in width, and covering an area of about 7 km². Multiple gullies have developed along the longitudinal deposit zone of the landslide body, while tensile-shear cracks are sporadically distributed across its surface. The yellow dashed line indicates the boundary of the newly identified landslide section, and the white dashed lines mark typical gullies and cracks.

Figure 16b–f present field investigation photographs of the landslide. Figure 16b,c show panoramic views of the newly identified landslide mass taken at different times, clearly illustrating the overall subsidence displacement and areal expansion of the landslide body. Figure 16d–f depict the surrounding rock layers and internal debris condition of the landslide. Evidence of sliding events is preserved on both the ancient and new landslide surfaces, showing extensive landslide traces, including multiple scratch marks and mirror-like features. The lithology of the landslide mass is dominated by argillaceous slate and metamorphic sandstone, with surface cracks approximately 10 cm wide observed along the periphery. These phenomena indicate that the landslide is in a highly unstable state, and under the influence of factors such as river erosion and rainfall, it is highly susceptible to further instability.

5. Discussion

This study proposes a CRF-Faster RCNN model integrating CBAM, ResNet-50, and FPN, which realizes the automatic identification of landslide anomaly areas based on InSAR deformation features. This method significantly improves the recognition accuracy in complex terrains. By combining high-resolution optical image verification and field investigation, it effectively ensures the reliability of identification results while guaranteeing identification efficiency, providing a feasible technical approach for the precise identification of landslide hazards. However, this method still has certain limitations: first, the recognition results are highly dependent on the quality and spatial continuity of InSAR data, limiting its application in areas with severe decoherence, and the model’s ability to distinguish non-landslide deformation signals such as engineering activities remains inadequate; in addition, the current research is mainly based on a single model architecture, and systematic multi-dimensional performance comparison with other mainstream landslide identification models has not yet been carried out.

Based on the current achievements and limitations, future research will further deepen in the following aspects: First, construct a multi-source remote sensing data fusion analysis framework to integrates optical imagery, LiDAR topographic features, and regional geological structure data, improving the model’s feature representation capability and generalization in complex environments. Second, carry out multi-model coupling and comparison research to enhance the method’s versatility and engineering practical value by integrating the advantages of different models. Third, explore weakly supervised and cross-regional transfer learning strategies in small-sample scenarios to reduce the model’s dependence on large labeled samples, enhancing its applicability in data-scarce areas, and providing more accurate and reliable technical support for geological disaster risk prevention and control.

6. Conclusions

This study proposes a large-scale landslide automatic identification method that integrates InSAR and an improved Faster RCNN model. The approach utilizes effective deformation information derived from ascending and descending to construct sample dataset. A modified Faster R-CNN architecture (CRF-Faster R-CNN) was established, featuring a backbone network composed of ResNet-50 and FPN, further enhanced by integrating the CBAM. The model was applied to automatically identify landslides in the Jinsha River Basin’s Ahai Reservoir area within Yunnan’s complex mountainous terrain. To evaluate its performance, we conducted comparative experiments with the baseline Faster R-CNN model. Landslide identification results were validated using optical imagery and field surveys, confirming the model’s accuracy in landslide detection. The main conclusions are as follows:

(1): Time-series InSAR technology was employed to obtain surface deformation monitoring results for the Ahai Reservoir area and its surrounding regions. The monitoring data revealed that the maximum average annual deformation rate reached −79.2 mm/yr in the ascending and −95.8 mm/yr in the descending. Deformation anomalies within the study area are predominantly concentrated in several locations along both banks of the Jinsha River, including Ligu, Baiya, Meigudi, Xinjian, Gukongmei, Shuzhi, Donglian Village, and Ladinli.
(2): The large-scale landslide identification method proposed in this study, based on InSAR and CRF-Faster RCNN, effectively detects landslide anomaly areas, achieves accurate separation between deformed and non-deformed areas, and generates a landslide anomaly detection map. Compared with existing Faster RCNN models, the proposed method demonstrates superior performance in landslide identification capability, convergence speed, and overall accuracy. The overall accuracy (mAP) reached 88.8%.
(3): A total of 38 potential landslide were identified through integrated analysis with optical images, with a field validation accuracy of 84%. Eight landslides were consistently detected by both ascending and descending. Further analysis and interpretation of the typical landslide body H3 revealed the coexistence of old and new landslide bodies in this area, which still exhibits a continuing deformation trend, necessitating enhanced monitoring in the future. The study demonstrates that the proposed method achieves high accuracy in identifying landslide anomaly zones, providing robust technical support for the early identification and monitoring of landslide hazards.

Author Contributions

Conceptualization, Y.D. and Y.L.; methodology, Y.D.; software, Y.D., Y.L. and X.Z.; validation, N.L., X.G. and H.S.; formal analysis, Y.D.; resources, X.Z. and Y.L.; data curation, Y.D. and H.S.; writing—original draft preparation, Y.D.; writing—review and editing Y.L. and X.Z.; visualization, R.J. and F.G.; supervision, Z.G. and Y.C.; funding acquisition, Y.L. and X.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by the National Natural Science Foundation of China (grant numbers 42471483, 42161067 and 42361054), the Open Fund Program of Yunnan Key Laboratory of Intelligent Monitoring and Spatiotemporal Big Data Governance of Natural Resources (grant numbers 202449CE340023), the Pilot Cooperation Project between the Ministry of Natural Resources of China and Yunnan Province (grant numbers 2023ZRBSHZ048), the Yunnan Fundamental Research Projects (grant numbers 202501AT070310 and 202401AU070173), the Scientific Research Fund of Yunnan Provincial Department of Education (grant numbers 2024J0067) and the Talent Development Program of Kunming University of Science and Technology (grant numbers KKZ3202421128).

Data Availability Statement

The Sentinel-1 raw data can be downloaded from https://vertex.daac.asf.alaska.edu accessed on 1 August 2023. The ALOS 12.5m DEM data are available at http://vertex.daac.asf.alaska.edu accessed on 1 August 2023. The Precipitation Orbit Data (POD) can be downloaded from https://s1qc.asf.alaska.edu/aux_poeorb/ accessed on 1 August 2023. The GACOS data can be downloaded free of charge from the website: http://www.gacos.net/ accessed on 9 August 2023. The Rainfall data can be obtained from https://www.ncei.noaa.gov/maps-and-geospatial-products accessed on 9 August 2023. Optical imagery was accessed through Google Earth.

Acknowledgments

The authors would like to express their sincere gratitude to the various websites and organizations that provided the data. The authors also extend their heartfelt thanks to the reviewers for their invaluable comments and suggestions.

Conflicts of Interest

Author Yujuan Dong was employed by the company Chongqing Institute of Geology and Mineral Surveying and Mapping Co., Ltd. Author Zhengxiong Gu was employed by the company Yunnan Institute of Geology and Mineral Surveying and Mapping Co., Ltd. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

Chen, X.L.; Zhou, Q.; Ran, H.; Dong, R. Earthquake-triggered landslides in southwest China. Nat. Hazards Earth Syst. Sci. 2012, 12, 351–363. [Google Scholar] [CrossRef]
Salmi, E.F.; Nazem, M.; Karakus, M. Numerical analysis of a large landslide induced by coal mining subsidence. Eng. Geol. 2017, 217, 141–152. [Google Scholar] [CrossRef]
Li, W.L.; Xu, Q.; Lu, H.Y.; Dong, H.J.; Zhu, Y.Q. Tracking the Deformation History of Large-Scale Rocky Landslides and Its Enlightenment. Geomat. Inf. Sci. Wuhan Univ. 2019, 44, 1043–1053. [Google Scholar]
Zhang, L.; Dai, K.; Deng, J.; Ge, D.Q.; Liang, R.B.; Li, W.L.; Xu, Q. Identifying potential landslides by stacking-InSAR in southwestern China and its performance comparison with SBAS-InSAR. Remote Sens. 2021, 13, 3662. [Google Scholar] [CrossRef]
Dai, K.; Deng, J.; Xu, Q.; Li, Z.H.; Shi, X.L.; Hancock, C.; Wen, N.L.; Zhang, L.L.; Zhuo, G.C. Interpretation and sensitivity analysis of the InSAR line of sight displacements in landslide measurements. GISci. Remote Sens. 2022, 59, 1226–1242. [Google Scholar] [CrossRef]
Scaringi, G.; Fan, X.; Xu, Q.; Liu, C.; Ouyang, C.J.; Domènech, G.; Yang, F.; Dai, L.X. Some considerations on the use of numerical methods to simulate past landslides and possible new failures: The case of the recent Xinmo landslide (Sichuan, China). Landslides 2018, 15, 1359–1375. [Google Scholar] [CrossRef]
Kang, Y.; Lu, Z.; Zhao, C.; Qu, W. Inferring slip-surface geometry and volume of creeping landslides based on InSAR: A case study in Jinsha River basin. Remote Sens. Environ. 2023, 294, 113620. [Google Scholar] [CrossRef]
Liu, X.; Zhao, C.; Zhang, Q.; Lu, Z.; Li, Z.H.; Yang, C.S.; Zhu, W.; Zeng, J.L.; Chen, L.Q.; Liu, C.J. Integration of Sentinel-1 and ALOS/PALSAR-2 SAR datasets for mapping active landslides along the Jinsha River corridor, China. J. Eng. Geol. 2021, 284, 106033. [Google Scholar] [CrossRef]
Zhang, C.L.; Li, Z.H.; Yu, C.; Song, C.; Xiao, R.Y.; Peng, J.B. Landslide Detection of the Jinsha River Region Using GACOS Assisted InSAR Stacking. Geomat. Inf. Sci. Wuhan Univ. 2021, 46, 1649–1657. [Google Scholar]
Gu, X.; Li, Y.; Zuo, X.; Bu, J.W.; Yang, F.; Yang, X.; Li, Y.N.; Zhang, J.M.; Huang, C.; Shi, C.; et al. Image compression–based DS-InSAR method for landslide identification and monitoring of alpine canyon region: A case study of Ahai Reservoir area in Jinsha River Basin. Landslides 2024, 21, 2501–2517. [Google Scholar] [CrossRef]
Li, Y.; Zuo, X.; Zhu, D.; Wu, W.H.; Yang, X.; Guo, S.P.; Shi, C.; Li, F.; Liu, X.Y. Identification and analysis of landslides in the Ahai reservoir area of the Jinsha River Basin using a combination of DS-InSAR, optical images, and field surveys. Remote Sens. 2022, 14, 6274. [Google Scholar] [CrossRef]
Li, Y.; Fan, X.; Cheng, G. Landslide and rockfall distribution by reservior of stepped hydropower station in the Jinsha River. Geomat. Inf. Sci. Wuhan Univ. 2006, 11, 801–805. [Google Scholar]
Komac, M.; Holley, R.; Mahapatra, P.; Marel, H.; Bavec, M. Coupling of GPS/GNSS and radar interferometric data for a 3D surface displacement monitoring of landslides. Landslides 2015, 12, 241–257. [Google Scholar] [CrossRef]
Wang, S.; Zhang, G.; Chen, Z.; Xu, Z.X.; Liu, Y.T.; Zhao, R.S. Evaluating expressway stability using interferometric synthetic aperture radar and measuring its impact on the occurrence of geohazards: A case study of Shanxi Province, China. GISci. Remote Sens. 2023, 60, 2161200. [Google Scholar] [CrossRef]
Casagli, N.; Catani, F.; Del Ventisette, C.; Luzi, G. Monitoring, prediction, and early warning using ground-based radar interferometry. Landslides 2010, 7, 291–301. [Google Scholar] [CrossRef]
Li, Z.; Song, C.; Yu, C.; Xiao, R.Y.; Chen, L.F.; Luo, H.; Dai, K.R.; Ge, D.Q.; Ding, Y.; Zhang, Y.X.; et al. Application of Satellite Radar Remote Sensing to Landslide Detection and Monitoring: Challenges and Solutions. Geomat. Inf. Sci. Wuhan Univ. 2019, 44, 967–979. [Google Scholar]
Schlögel, R.; Doubre, C.; Malet, J.P.; Masson, F. Landslide deformation monitoring with ALOS/PALSAR imagery: A D-InSAR geomorphological interpretation method. Geomorphology 2015, 231, 314–330. [Google Scholar] [CrossRef]
Singleton, A.; Li, Z.; Hoey, T.; Muller, J.P. Evaluating sub-pixel offset techniques as an alternative to D-InSAR for monitoring episodic landslide movements in vegetated terrain. Remote Sens. Environ. 2014, 147, 133–144. [Google Scholar] [CrossRef]
Ferretti, A.; Prati, C.; Rocca, F. Permanent scatterers in SAR interferometry. IEEE Trans. Geosci. Remote Sens. 2001, 39, 8–20. [Google Scholar] [CrossRef]
Ferretti, A.; Prati, C.; Rocca, F. Nonlinear subsidence rate estimation using permanent scatterers in differential SAR interferometry. IEEE Trans. Geosci. Remote Sens. 2000, 38, 2202–2212. [Google Scholar] [CrossRef]
Berardino, P.; Fornaro, G.; Lanari, R.; Sansosti, E. A new algorithm for surface deformation monitoring based on small baseline differential SAR interferograms. IEEE Trans. Geosci. Remote Sens. 2003, 40, 2375–2383. [Google Scholar] [CrossRef]
Cao, N.; Lee, H.; Jung, H.C. A phase-decomposition-based PSInSAR processing method. IEEE Trans. Geosci. Remote Sens. 2015, 54, 1074–1090. [Google Scholar] [CrossRef]
Jiang, M.; Ding, X.; Hanssen, R.F.; Malhotra, R.; Chang, L. Fast statistically homogeneous pixel selection for covariance matrix estimation for multitemporal InSAR. IEEE Trans. Geosci. Remote Sens. 2014, 53, 1213–1224. [Google Scholar] [CrossRef]
Parizzi, A.; Brcic, R. Adaptive InSAR stack multilooking exploiting amplitude statistics: A comparison between different techniques and practical results. IEEE Geosci. Remote Sens. Lett. 2010, 8, 441–445. [Google Scholar] [CrossRef]
Guo, R.; Li, S.; Chen, Y.; Li, X.X.; Yuan, L.W. Identification and monitoring landslides in Longitudinal Range-Gorge Region with InSAR fusion integrated visibility analysis. Landslides 2021, 18, 551–568. [Google Scholar] [CrossRef]
Rosi, A.; Tofani, V.; Tanteri, L.; Agostini, A.; Catani, F.; Casagli, N. The new landslide inventory of Tuscany (Italy) updated with PS-InSAR: Geomorphological features and landslide distribution. Landslides 2018, 15, 5–19. [Google Scholar] [CrossRef]
Chen, L.; Ma, P.; Yu, C.; Zheng, Y.; Zhu, Q.; Ding, Y.L. Landslide susceptibility assessment in multiple urban slope settings with a landslide inventory augmented by InSAR techniques. Eng. Geol. 2023, 327, 107342. [Google Scholar] [CrossRef]
Ghorbanzadeh, O.; Blaschke, T.; Gholamnia, K.; Meena, S.; Tiede, D.; Aryal, J. Evaluation of different machine learning methods and deep-learning convolutional neural networks for landslide detection. Remote Sens. 2019, 11, 196. [Google Scholar] [CrossRef]
Bhuyan, K.; Meena, S.R.; Nava, L.; Westen, C.V.; Floris, M.; Catani, F. Mapping landslides through a temporal lens: An insight toward multi-temporal landslide mapping using the u-net deep learning model. GISci. Remote Sens. 2023, 60, 2182057. [Google Scholar] [CrossRef]
Cai, J.; Zhang, L.; Dong, J.; Guo, J.C.; Wang, Y.A.; Liao, M.S. Automatic identification of active landslides over wide areas from time-series InSAR measurements using Faster RCNN. Int. J. Appl. Earth Obs. 2023, 124, 103516. [Google Scholar] [CrossRef]
Liu, Y.; Yao, X.; Gu, Z.; Li, R.J.; Zhou, Z.K.; Liu, X.H.; Jiang, S.; Yao, C.; Wei, S.F. Research on automatic recognition of active landslides using InSAR deformation under digital morphology: A case study of the Baihetan reservoir, China. Remote Sens. Environ. 2024, 304, 114029. [Google Scholar] [CrossRef]
Radman, A.; Akhoondzadeh, M.; Hosseiny, B. Integrating InSAR and deep-learning for modeling and predicting subsidence over the adjacent area of Lake Urmia, Iran. GISci. Remote Sens. 2021, 58, 1413–1433. [Google Scholar] [CrossRef]
Zhang, C.; Luo, J.; Li, Z. An Automatic Detection Method of Slow-Moving Landslides Using an Improved Faster R-CNN Model Based on InSAR Deformation Rates. Remote Sens. 2025, 17, 3243. [Google Scholar] [CrossRef]
Brengman, C.M.J.; Barnhart, W.D. Identification of surface deformation in InSAR using machine learning. Geochem. Geophys. Geosyst. 2021, 22, e2020GC009204. [Google Scholar] [CrossRef]
Wu, Z.; Wang, T.; Wang, Y.; Wang, R.; Ge, D.Q. Deep learning for the detection and phase unwrapping of mining-induced deformation in large-scale interferograms. IEEE Geosci. Remote Sens. 2022, 60, 5216318. [Google Scholar] [CrossRef]
Wu, Z.; Ma, P.; Zheng, Y.; Gu, F.; Liu, L.; Lin, H. Automatic detection and classification of land subsidence in deltaic metropolitan areas using distributed scatterer InSAR and Oriented R-CNN. Remote Sens. Environ. 2023, 290, 113545. [Google Scholar] [CrossRef]
Zhu, X.X.; Montazeri, S.; Ali, M.; Hua, Y.S.; Wang, Y.Y.; Mou, L.C. Deep learning meets SAR: Concepts, models, pitfalls, and perspectives. IEEE Geosci. Remote Sens. Mag. 2021, 9, 143–172. [Google Scholar] [CrossRef]
Ren, S.; He, K.; Girshick, R.; Sun, J. Faster R-CNN: Towards real-time object detection with region proposal networks. IEEE Trans. Pattern Anal. Mach. Intell. 2017, 39, 1137–1149. [Google Scholar] [CrossRef]
Li, Y.; Dong, H.; Li, H.; Zhang, X.Y.; Zhang, B.C.; Xiao, Z.F. Multi-block SSD based on small object detection for UAV railway scene surveillance. Chin. J. Aeronaut. 2020, 33, 1747–1755. [Google Scholar] [CrossRef]
Redmon, J.; Divvala, S.; Girshick, R.; Farhadi, A. You only look once: Unified, real-time object detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 27–30 June 2016; pp. 779–788. [Google Scholar]
Terven, J.; Córdova-Esparza, D.M.; Romero-González, J.A. A comprehensive review of yolo architectures in computer vision: From yolov1 to yolov8 and yolo-nas. Mach. Learn. Knowl. Extr. 2023, 5, 1680–1716. [Google Scholar] [CrossRef]
He, K.; Zhang, X.; Ren, S.; Sun, J. Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 27–30 June 2016; pp. 770–778. [Google Scholar]
Lin, T.Y.; Dollár, P.; Girshick, R.; He, K.M.; Hariharan, B.; Belongie, S. Feature pyramid networks for object detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA, 21–26 July 2017; pp. 2117–2125. [Google Scholar]

Figure 1. A geographical overview of the study area: (a) indicates the geographical location of the study area; (b) shows the distribution of geological structures within the study area, with the base map derived from DEM data.

Figure 2. A technical flowchart of this study.

Figure 3. A schematic diagram of the fully connected layer structure.

Figure 4. A schematic diagram of the CRF-Faster RCNN model structure.

Figure 5. Basic module of ResNet-50.

Figure 6. ResNet 50-FPN Network Architecture Diagram.

Figure 7. CBAM network structure diagram.

Figure 8. A surface deformation rate map of the study area: (a) is ascending, (b) is descending.

Figure 9. Identification results of landslide anomaly zone in ascending order: (a) Faster RCNN; (b) CRF-Faster RCNN.

Figure 10. Identification results of landslide anomaly zone in descending order: (a) Faster RCNN; (b) CRF-Faster RCNN.

Figure 11. Loss Function Comparison. (a) Faster RCNN. (b) CRF-Faster RCNN.

Figure 12. mAP comparison chart of the model.

Figure 13. Identification results of potential landslides in ascending and descending order. (a) shows the results for the ascending order. (b) shows the results for the descending order.

Figure 14. Partial Landslide InSAR Deformation (a) and Corresponding Optical Images (b).

Figure 15. A deformation rate map of the H3 landslide obtained using SBAS-InSAR. (a) shows the overall deformation rate map; (b) represents the local map of significant landslide displacement rates; (c) denotes the local map of remote sensing image features.

Figure 16. The optical image characteristics and field photographs of the H3 landslide. (a) shows the overall remote sensing image map; (b,c) represent the overall field survey maps; (d–f) denote the field maps of landslide characteristics.

Table 1. Sentinel-1A image data parameter information.

Parameters	Sentinel-1A
Orbital Direction	Ascending	Descending
Orbit Number	99	135
Band	C	C
Incidence Angle/(°)	42.50	39.88
Number of Images	88	160
Radar Wavelength/cm	5.60	5.60
Resolution/(m × m)	5 × 20	5 × 20
Observation Mode	Interference width	Interference width
Image Time Range	January 2019–December 2021	January 2019–December 2021

Table 2. Confusion matrix.

Category	Landslide (Positive Example P)	Non-Landslide (Negative Example N)
T (True)	TP	TN
F (False)	FP	FN

Table 3. Experimental parameters.

Parameters	Value
Epochs	300
Batch_Size	5
Optimizer	SGD
Optimizer	0.0025
Initial Learning Rate	10
Validation Interval	0.9
Weight_Decay	0.001

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Dong, Y.; Li, Y.; Zuo, X.; Liu, N.; Gu, X.; Shi, H.; Jiang, R.; Guo, F.; Gu, Z.; Chen, Y. An Automatic Identification Method for Large-Scale Landslide Hazard Potential Integrating InSAR and CRF-Faster RCNN: A Case Study of Ahai Reservoir Area in Jinsha River Basin. Remote Sens. 2026, 18, 283. https://doi.org/10.3390/rs18020283

AMA Style

Dong Y, Li Y, Zuo X, Liu N, Gu X, Shi H, Jiang R, Guo F, Gu Z, Chen Y. An Automatic Identification Method for Large-Scale Landslide Hazard Potential Integrating InSAR and CRF-Faster RCNN: A Case Study of Ahai Reservoir Area in Jinsha River Basin. Remote Sensing. 2026; 18(2):283. https://doi.org/10.3390/rs18020283

Chicago/Turabian Style

Dong, Yujuan, Yongfa Li, Xiaoqing Zuo, Na Liu, Xiaona Gu, Haoyi Shi, Rukun Jiang, Fangzhen Guo, Zhengxiong Gu, and Yongzhi Chen. 2026. "An Automatic Identification Method for Large-Scale Landslide Hazard Potential Integrating InSAR and CRF-Faster RCNN: A Case Study of Ahai Reservoir Area in Jinsha River Basin" Remote Sensing 18, no. 2: 283. https://doi.org/10.3390/rs18020283

APA Style

Dong, Y., Li, Y., Zuo, X., Liu, N., Gu, X., Shi, H., Jiang, R., Guo, F., Gu, Z., & Chen, Y. (2026). An Automatic Identification Method for Large-Scale Landslide Hazard Potential Integrating InSAR and CRF-Faster RCNN: A Case Study of Ahai Reservoir Area in Jinsha River Basin. Remote Sensing, 18(2), 283. https://doi.org/10.3390/rs18020283

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

An Automatic Identification Method for Large-Scale Landslide Hazard Potential Integrating InSAR and CRF-Faster RCNN: A Case Study of Ahai Reservoir Area in Jinsha River Basin

Highlights

Abstract

1. Introduction

2. Study Area and Datasets

2.1. Study Area

2.2. Datasets

3. Materials and Methods

3.1. Technical Principle of SBAS-InSAR

3.2. Faster RCNN Model

3.3. CRF-Faster RCNN Model

3.4. Construction of Time-Series Deformation Dataset

3.5. Loss Function and Evaluation Metrics

3.5.1. Joint Loss Function Design

3.5.2. Evaluation Metrics

4. Results and Analysis

4.1. InSAR Deformation Results

4.2. Automatic Identification Results of Landslide Anomaly Areas Based on the CRF-Faster RCNN

4.2.1. Experimental Setup and Training Visualization

4.2.2. Identification Results of Abnormal Landslide Areas in Ahai Reservoir Area

4.3. Model Performance Evaluation

4.4. Results of Landslide Hazard Identification Using Combined Optical Imaging and Deep Learning

4.5. Analysis of Typical Landslides

5. Discussion

6. Conclusions

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI