Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Article Types

Countries / Regions

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Search Results (1,659)

Search Parameters:
Keywords = deep learning reconstruction

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
23 pages, 386 KiB  
Article
Balancing Tradition, Reform, and Constraints: A Study of Principal Leadership Practices in Chinese Primary Schools
by Chenzhi Li, Edmond Hau-Fai Law, Yunyun Huang and Ke Ding
Educ. Sci. 2025, 15(8), 988; https://doi.org/10.3390/educsci15080988 (registering DOI) - 3 Aug 2025
Abstract
It is well-established that principal leadership significantly influences student learning in developed countries, yet much less is known about how leadership practices manifest in complex systems like China’s, where rapid modernization intersects with deep-rooted educational traditions. In particular, Chinese principals face multiple challenges [...] Read more.
It is well-established that principal leadership significantly influences student learning in developed countries, yet much less is known about how leadership practices manifest in complex systems like China’s, where rapid modernization intersects with deep-rooted educational traditions. In particular, Chinese principals face multiple challenges in balancing the implementation of educational reform policies, high parental expectations, and their own educational ideology, all within limited resources. The current study examines these challenges in Shenzhen, a city which typically manifests them through its rapid development. Specifically, we took a phenomenographic approach and interviewed the principals and staff from five prestigious primary schools to extract the key components behind the diverse school leaders’ styles and practices. Results showed that, the Chinese leadership practice model consists of five key components: mission setting, infrastructure reconstruction, teacher development, learning improvement, and educators’ networking. Although the first four components in this model align with established theories in developed countries, networking was identified as a distinctive and critical element for securing resources and fostering collaboration. These findings may broaden the scope of leadership theories and underscore the need to contextualize leadership practices based on local challenges and dynamics. It also offers practical insights for school leaders on navigating challenges to improve teacher and student outcomes. Full article
(This article belongs to the Special Issue School Leadership and School Improvement)
Show Figures

Figure 1

20 pages, 8858 KiB  
Article
Compressed Sensing Reconstruction with Zero-Shot Self-Supervised Learning for High-Resolution MRI of Human Embryos
by Kazuma Iwazaki, Naoto Fujita, Shigehito Yamada and Yasuhiko Terada
Tomography 2025, 11(8), 88; https://doi.org/10.3390/tomography11080088 (registering DOI) - 2 Aug 2025
Abstract
Objectives: This study investigates whether scan time in the high-resolution magnetic resonance imaging (MRI) of human embryos can be reduced without compromising spatial resolution by applying zero-shot self-supervised learning (ZS-SSL), a deep-learning-based reconstruction method. Methods: Simulations using a numerical phantom were [...] Read more.
Objectives: This study investigates whether scan time in the high-resolution magnetic resonance imaging (MRI) of human embryos can be reduced without compromising spatial resolution by applying zero-shot self-supervised learning (ZS-SSL), a deep-learning-based reconstruction method. Methods: Simulations using a numerical phantom were conducted to evaluate spatial resolution across various acceleration factors (AF = 2, 4, 6, and 8) and signal-to-noise ratio (SNR) levels. Resolution was quantified using a blur-based estimation method based on the Sparrow criterion. ZS-SSL was compared to conventional compressed sensing (CS). Experimental imaging of a human embryo at Carnegie stage 21 was performed at a spatial resolution of (30 μm)3 using both retrospective and prospective undersampling at AF = 4 and 8. Results: ZS-SSL preserved spatial resolution more effectively than CS at low SNRs. At AF = 4, image quality was comparable to that of fully sampled data, while noticeable degradation occurred at AF = 8. Experimental validation confirmed these findings, with clear visualization of anatomical structures—such as the accessory nerve—at AF = 4; there was reduced structural clarity at AF = 8. Conclusions: ZS-SSL enables significant scan time reduction in high-resolution MRI of human embryos while maintaining spatial resolution at AF = 4, assuming an SNR above approximately 15. This trade-off between acceleration and image quality is particularly beneficial in studies with limited imaging time or specimen availability. The method facilitates the efficient acquisition of ultra-high-resolution data and supports future efforts to construct detailed developmental atlases. Full article
Show Figures

Figure 1

19 pages, 2359 KiB  
Article
Research on Concrete Crack Damage Assessment Method Based on Pseudo-Label Semi-Supervised Learning
by Ming Xie, Zhangdong Wang and Li’e Yin
Buildings 2025, 15(15), 2726; https://doi.org/10.3390/buildings15152726 (registering DOI) - 1 Aug 2025
Abstract
To address the inefficiency of traditional concrete crack detection methods and the heavy reliance of supervised learning on extensive labeled data, in this study, an intelligent assessment method of concrete damage based on pseudo-label semi-supervised learning and fractal geometry theory is proposed to [...] Read more.
To address the inefficiency of traditional concrete crack detection methods and the heavy reliance of supervised learning on extensive labeled data, in this study, an intelligent assessment method of concrete damage based on pseudo-label semi-supervised learning and fractal geometry theory is proposed to solve two core tasks: one is binary classification of pixel-level cracks, and the other is multi-category assessment of damage state based on crack morphology. Using three-channel RGB images as input, a dual-path collaborative training framework based on U-Net encoder–decoder architecture is constructed, and a binary segmentation mask of the same size is output to achieve the accurate segmentation of cracks at the pixel level. By constructing a dual-path collaborative training framework and employing a dynamic pseudo-label refinement mechanism, the model achieves an F1-score of 0.883 using only 50% labeled data—a mere 1.3% decrease compared to the fully supervised benchmark DeepCrack (F1 = 0.896)—while reducing manual annotation costs by over 60%. Furthermore, a quantitative correlation model between crack fractal characteristics and structural damage severity is established by combining a U-Net segmentation network with the differential box-counting algorithm. The experimental results demonstrate that under a cyclic loading of 147.6–221.4 kN, the fractal dimension monotonically increases from 1.073 (moderate damage) to 1.189 (failure), with 100% accuracy in damage state identification, closely aligning with the degradation trend of macroscopic mechanical properties. In complex crack scenarios, the model attains a recall rate (Re = 0.882), surpassing U-Net by 13.9%, with significantly enhanced edge reconstruction precision. Compared with the mainstream models, this method effectively alleviates the problem of data annotation dependence through a semi-supervised strategy while maintaining high accuracy. It provides an efficient structural health monitoring solution for engineering practice, which is of great value to promote the application of intelligent detection technology in infrastructure operation and maintenance. Full article
Show Figures

Figure 1

19 pages, 3654 KiB  
Article
Longitudinal Displacement Reconstruction Method of Suspension Bridge End Considering Multi-Type Data Under Deep Learning Framework
by Xiaoting Yang, Chao Wu, Youjia Zhang, Wencai Shao, Linyuan Chang, Kaige Kong and Quan Cheng
Buildings 2025, 15(15), 2706; https://doi.org/10.3390/buildings15152706 (registering DOI) - 31 Jul 2025
Viewed by 113
Abstract
Suspension bridges, as a type of long-span bridge, usually have a larger longitudinal displacement at the end of the beam (LDBD). LDBD can be used to evaluate the safety of bridge components at the end of the beam. However, due to factors such [...] Read more.
Suspension bridges, as a type of long-span bridge, usually have a larger longitudinal displacement at the end of the beam (LDBD). LDBD can be used to evaluate the safety of bridge components at the end of the beam. However, due to factors such as sensor failure and system maintenance, LDBD in the bridge health monitoring system is often missing. Therefore, this study reconstructs the missing part of LDBD based on the long short-term memory network (LSTM) and various data. Specifically, first, the monitoring data that may be related to LDBD in a suspension bridge is analyzed, and the temperature and beam end rotation angle data (RDBD) at representative locations are selected. Then, the temperature data at different places of the bridge are used as the input of the LSTM model to compare and analyze the prediction effect of LDBD. Next, RDBD is used as the input of the LSTM model to observe the prediction effect of LDBD. Finally, temperature and RDBD are used as the input of the LSTM model to observe whether the prediction effect of the LSTM model is improved. The results show that compared with other parts of the bridge, the prediction effect of the temperature inside the box girder in the main span as the model input is better; when RDBD is used as the input of the LSTM model, it is better than the prediction effect of temperature as the model input; temperature and RDBD have higher prediction accuracy when used as the input of the LSTM model together than when used separately as the input of the LSTM model. Full article
(This article belongs to the Section Building Structures)
Show Figures

Figure 1

17 pages, 5062 KiB  
Article
DropDAE: Denosing Autoencoder with Contrastive Learning for Addressing Dropout Events in scRNA-seq Data
by Wanlin Juan, Kwang Woo Ahn, Yi-Guang Chen and Chien-Wei Lin
Bioengineering 2025, 12(8), 829; https://doi.org/10.3390/bioengineering12080829 (registering DOI) - 31 Jul 2025
Viewed by 240
Abstract
Single-cell RNA sequencing (scRNA-seq) has revolutionized molecular biology and genomics by enabling the profiling of individual cell types, providing insights into cellular heterogeneity. Deep learning methods have become popular in single cell analysis for tasks such as dimension reduction, cell clustering, and data [...] Read more.
Single-cell RNA sequencing (scRNA-seq) has revolutionized molecular biology and genomics by enabling the profiling of individual cell types, providing insights into cellular heterogeneity. Deep learning methods have become popular in single cell analysis for tasks such as dimension reduction, cell clustering, and data imputation. In this work, we introduce DropDAE, a denoising autoencoder (DAE) model enhanced with contrastive learning, to specifically address the dropout events in scRNA-seq data, where certain genes show very low or even zero expression levels due to technical limitations. DropDAE uses the architecture of a denoising autoencoder to recover the underlying data patterns while leveraging contrastive learning to enhance group separation. Our extensive evaluations across multiple simulation settings based on synthetic data and a real-world dataset demonstrate that DropDAE not only reconstructs data effectively but also further improves clustering performance, outperforming existing methods in terms of accuracy and robustness. Full article
Show Figures

Figure 1

17 pages, 920 KiB  
Article
Enhancing Early GI Disease Detection with Spectral Visualization and Deep Learning
by Tsung-Jung Tsai, Kun-Hua Lee, Chu-Kuang Chou, Riya Karmakar, Arvind Mukundan, Tsung-Hsien Chen, Devansh Gupta, Gargi Ghosh, Tao-Yuan Liu and Hsiang-Chen Wang
Bioengineering 2025, 12(8), 828; https://doi.org/10.3390/bioengineering12080828 - 30 Jul 2025
Viewed by 272
Abstract
Timely and accurate diagnosis of gastrointestinal diseases (GIDs) remains a critical bottleneck in clinical endoscopy, particularly due to the limited contrast and sensitivity of conventional white light imaging (WLI) in detecting early-stage mucosal abnormalities. To overcome this, this research presents Spectrum Aided Vision [...] Read more.
Timely and accurate diagnosis of gastrointestinal diseases (GIDs) remains a critical bottleneck in clinical endoscopy, particularly due to the limited contrast and sensitivity of conventional white light imaging (WLI) in detecting early-stage mucosal abnormalities. To overcome this, this research presents Spectrum Aided Vision Enhancer (SAVE), an innovative, software-driven framework that transforms standard WLI into high-fidelity hyperspectral imaging (HSI) and simulated narrow-band imaging (NBI) without any hardware modification. SAVE leverages advanced spectral reconstruction techniques, including Macbeth Color Checker-based calibration, principal component analysis (PCA), and multivariate polynomial regression, achieving a root mean square error (RMSE) of 0.056 and structural similarity index (SSIM) exceeding 90%. Trained and validated on the Kvasir v2 dataset (n = 6490) using deep learning models like ResNet-50, ResNet-101, EfficientNet-B2, both EfficientNet-B5 and EfficientNetV2-B0 were used to assess diagnostic performance across six key GI conditions. Results demonstrated that SAVE enhanced imagery and consistently outperformed raw WLI across precision, recall, and F1-score metrics, with EfficientNet-B2 and EfficientNetV2-B0 achieving the highest classification accuracy. Notably, this performance gain was achieved without the need for specialized imaging hardware. These findings highlight SAVE as a transformative solution for augmenting GI diagnostics, with the potential to significantly improve early detection, streamline clinical workflows, and broaden access to advanced imaging especially in resource constrained settings. Full article
Show Figures

Figure 1

18 pages, 4452 KiB  
Article
Upper Limb Joint Angle Estimation Using a Reduced Number of IMU Sensors and Recurrent Neural Networks
by Kevin Niño-Tejada, Laura Saldaña-Aristizábal, Jhonathan L. Rivas-Caicedo and Juan F. Patarroyo-Montenegro
Electronics 2025, 14(15), 3039; https://doi.org/10.3390/electronics14153039 - 30 Jul 2025
Viewed by 229
Abstract
Accurate estimation of upper-limb joint angles is essential in biomechanics, rehabilitation, and wearable robotics. While inertial measurement units (IMUs) offer portability and flexibility, systems requiring multiple inertial sensors can be intrusive and complex to deploy. In contrast, optical motion capture (MoCap) systems provide [...] Read more.
Accurate estimation of upper-limb joint angles is essential in biomechanics, rehabilitation, and wearable robotics. While inertial measurement units (IMUs) offer portability and flexibility, systems requiring multiple inertial sensors can be intrusive and complex to deploy. In contrast, optical motion capture (MoCap) systems provide precise tracking but are constrained to controlled laboratory environments. This study presents a deep learning-based approach for estimating shoulder and elbow joint angles using only three IMU sensors positioned on the chest and both wrists, validated against reference angles obtained from a MoCap system. The input data includes Euler angles, accelerometer, and gyroscope data, synchronized and segmented into sliding windows. Two recurrent neural network architectures, Convolutional Neural Network with Long-short Term Memory (CNN-LSTM) and Bidirectional LSTM (BLSTM), were trained and evaluated using identical conditions. The CNN component enabled the LSTM to extract spatial features that enhance sequential pattern learning, improving angle reconstruction. Both models achieved accurate estimation performance: CNN-LSTM yielded lower Mean Absolute Error (MAE) in smooth trajectories, while BLSTM provided smoother predictions but underestimated some peak movements, especially in the primary axes of rotation. These findings support the development of scalable, deep learning-based wearable systems and contribute to future applications in clinical assessment, sports performance analysis, and human motion research. Full article
(This article belongs to the Special Issue Wearable Sensors for Human Position, Attitude and Motion Tracking)
Show Figures

Figure 1

31 pages, 11269 KiB  
Review
Advancements in Semantic Segmentation of 3D Point Clouds for Scene Understanding Using Deep Learning
by Hafsa Benallal, Nadine Abdallah Saab, Hamid Tairi, Ayman Alfalou and Jamal Riffi
Technologies 2025, 13(8), 322; https://doi.org/10.3390/technologies13080322 - 30 Jul 2025
Viewed by 385
Abstract
Three-dimensional semantic segmentation is a fundamental problem in computer vision with a wide range of applications in autonomous driving, robotics, and urban scene understanding. The task involves assigning semantic labels to each point in a 3D point cloud, a data representation that is [...] Read more.
Three-dimensional semantic segmentation is a fundamental problem in computer vision with a wide range of applications in autonomous driving, robotics, and urban scene understanding. The task involves assigning semantic labels to each point in a 3D point cloud, a data representation that is inherently unstructured, irregular, and spatially sparse. In recent years, deep learning has become the dominant framework for addressing this task, leading to a broad variety of models and techniques designed to tackle the unique challenges posed by 3D data. This survey presents a comprehensive overview of deep learning methods for 3D semantic segmentation. We organize the literature into a taxonomy that distinguishes between supervised and unsupervised approaches. Supervised methods are further classified into point-based, projection-based, voxel-based, and hybrid architectures, while unsupervised methods include self-supervised learning strategies, generative models, and implicit representation techniques. In addition to presenting and categorizing these approaches, we provide a comparative analysis of their performance on widely used benchmark datasets, discuss key challenges such as generalization, model transferability, and computational efficiency, and examine the limitations of current datasets. The survey concludes by identifying potential directions for future research in this rapidly evolving field. Full article
(This article belongs to the Section Information and Communication Technologies)
Show Figures

Figure 1

21 pages, 711 KiB  
Systematic Review
Recent Developments in Image-Based 3D Reconstruction Using Deep Learning: Methodologies and Applications
by Diana-Carmen Rodríguez-Lira, Diana-Margarita Córdova-Esparza, Juan Terven, Julio-Alejandro Romero-González, José Manuel Alvarez-Alvarado, José-Joel González-Barbosa and Alfonso Ramírez-Pedraza
Electronics 2025, 14(15), 3032; https://doi.org/10.3390/electronics14153032 - 30 Jul 2025
Viewed by 320
Abstract
Three-dimensional (3D) reconstruction from images has significantly advanced due to recent developments in deep learning, yet methodological variations and diverse application contexts pose ongoing challenges. This systematic review examines the state-of-the-art deep learning techniques employed for image-based 3D reconstruction from 2019 to 2025. [...] Read more.
Three-dimensional (3D) reconstruction from images has significantly advanced due to recent developments in deep learning, yet methodological variations and diverse application contexts pose ongoing challenges. This systematic review examines the state-of-the-art deep learning techniques employed for image-based 3D reconstruction from 2019 to 2025. Through an extensive analysis of peer-reviewed studies, predominant methodologies, performance metrics, sensor types, and application domains are identified and assessed. Results indicate multi-view stereo and monocular depth estimation as prevailing methods, while hybrid architectures integrating classical and deep learning techniques demonstrate enhanced performance, especially in complex scenarios. Critical challenges remain, particularly in handling occlusions, low-texture areas, and varying lighting conditions, highlighting the importance of developing robust, adaptable models. Principal conclusions highlight the efficacy of integrated quantitative and qualitative evaluations, the advantages of hybrid methods, and the pressing need for computationally efficient and generalizable solutions suitable for real-world applications. Full article
(This article belongs to the Special Issue 3D Computer Vision and 3D Reconstruction)
Show Figures

Figure 1

26 pages, 8762 KiB  
Article
Clustered Rainfall-Induced Landslides in Jiangwan Town, Guangdong, China During April 2024: Characteristics and Controlling Factors
by Ruizeng Wei, Yunfeng Shan, Lei Wang, Dawei Peng, Ge Qu, Jiasong Qin, Guoqing He, Luzhen Fan and Weile Li
Remote Sens. 2025, 17(15), 2635; https://doi.org/10.3390/rs17152635 - 29 Jul 2025
Viewed by 189
Abstract
On 20 April 2024, an extreme rainfall event occurred in Jiangwan Town Shaoguan City, Guangdong Province, China, where a historic 24 h precipitation of 206 mm was recorded. This triggered extensive landslides that destroyed residential buildings, severed roads, and drew significant societal attention. [...] Read more.
On 20 April 2024, an extreme rainfall event occurred in Jiangwan Town Shaoguan City, Guangdong Province, China, where a historic 24 h precipitation of 206 mm was recorded. This triggered extensive landslides that destroyed residential buildings, severed roads, and drew significant societal attention. Rapid acquisition of landslide inventories, distribution patterns, and key controlling factors is critical for post-disaster emergency response and reconstruction. Based on high-resolution Planet satellite imagery, landslide areas in Jiangwan Town were automatically extracted using the Normalized Difference Vegetation Index (NDVI) differential method, and a detailed landslide inventory was compiled. Combined with terrain, rainfall, and geological environmental factors, the spatial distribution and causes of landslides were analyzed. Results indicate that the extreme rainfall induced 1426 landslides with a total area of 4.56 km2, predominantly small-to-medium scale. Landslides exhibited pronounced clustering and linear distribution along river valleys in a NE–SW orientation. Spatial analysis revealed concentrations on slopes between 200–300 m elevation with gradients of 20–30°. Four machine learning models—Logistic Regression, Support Vector Machine (SVM), Random Forest (RF), and Extreme Gradient Boosting (XGBoost)—were employed to assess landslide susceptibility mapping (LSM) accuracy. RF and XGBoost demonstrated superior performance, identifying high-susceptibility zones primarily on valley-side slopes in Jiangwan Town. Shapley Additive Explanations (SHAP) value analysis quantified key drivers, highlighting elevation, rainfall intensity, profile curvature, and topographic wetness index as dominant controlling factors. This study provides an effective methodology and data support for rapid rainfall-induced landslide identification and deep learning-based susceptibility assessment. Full article
(This article belongs to the Special Issue Study on Hydrological Hazards Based on Multi-Source Remote Sensing)
Show Figures

Figure 1

17 pages, 4324 KiB  
Article
Anomaly Detection on Laminated Composite Plate Using Self-Attention Autoencoder and Gaussian Mixture Model
by Olivier Munyaneza and Jung Woo Sohn
Mathematics 2025, 13(15), 2445; https://doi.org/10.3390/math13152445 - 29 Jul 2025
Viewed by 148
Abstract
Composite laminates are widely used in aerospace, automotive, construction, and luxury industries, owing to their superior mechanical properties and design flexibility. However, detecting manufacturing defects and in-service damage remains a vital challenge for structural safety. While traditional unsupervised machine learning methods have been [...] Read more.
Composite laminates are widely used in aerospace, automotive, construction, and luxury industries, owing to their superior mechanical properties and design flexibility. However, detecting manufacturing defects and in-service damage remains a vital challenge for structural safety. While traditional unsupervised machine learning methods have been used in structural health monitoring (SHM), their high false positive rates limit their reliability in real-world applications. This issue is mostly inherited from their limited ability to capture small temporal variations in Lamb wave signals and their dependence on shallow architectures that suffer with complex signal distributions, causing the misclassification of damaged signals as healthy data. To address this, we suggested an unsupervised anomaly detection framework that integrates a self-attention autoencoder with a Gaussian mixture model (SAE-GMM). The model is solely trained on healthy Lamb wave signals, including high-quality synthetic data generated via a generative adversarial network (GAN). Damages are detected through reconstruction errors and probabilistic clustering in the latent space. The self-attention mechanism enhances feature representation by capturing subtle temporal dependencies, while the GMM enables a solid separation among signals. Experimental results demonstrated that the proposed model (SAE-GMM) achieves high detection accuracy, a low false positive rate, and strong generalization under varying noise conditions, outperforming traditional and deep learning baselines. Full article
Show Figures

Figure 1

20 pages, 2776 KiB  
Article
Automatic 3D Reconstruction: Mesh Extraction Based on Gaussian Splatting from Romanesque–Mudéjar Churches
by Nelson Montas-Laracuente, Emilio Delgado Martos, Carlos Pesqueira-Calvo, Giovanni Intra Sidola, Ana Maitín, Alberto Nogales and Álvaro José García-Tejedor
Appl. Sci. 2025, 15(15), 8379; https://doi.org/10.3390/app15158379 - 28 Jul 2025
Viewed by 182
Abstract
This research introduces an automated 3D virtual reconstruction system tailored for architectural heritage (AH) applications, contributing to the ongoing paradigm shift from traditional CAD-based workflows to artificial intelligence-driven methodologies. It reviews recent advancements in machine learning and deep learning—particularly neural radiance fields (NeRFs) [...] Read more.
This research introduces an automated 3D virtual reconstruction system tailored for architectural heritage (AH) applications, contributing to the ongoing paradigm shift from traditional CAD-based workflows to artificial intelligence-driven methodologies. It reviews recent advancements in machine learning and deep learning—particularly neural radiance fields (NeRFs) and its successor, Gaussian splatting (GS)—as state-of-the-art techniques in the domain. The study advocates for replacing point cloud data in heritage building information modeling workflows with image-based inputs, proposing a novel “photo-to-BIM” pipeline. A proof-of-concept system is presented, capable of processing photographs or video footage of ancient ruins—specifically, Romanesque–Mudéjar churches—to automatically generate 3D mesh reconstructions. The system’s performance is assessed using both objective metrics and subjective evaluations of mesh quality. The results confirm the feasibility and promise of image-based reconstruction as a viable alternative to conventional methods. The study successfully developed a system for automated 3D mesh reconstruction of AH from images. It applied GS and Mip-splatting for NeRFs, proving superior in noise reduction for subsequent mesh extraction via surface-aligned Gaussian splatting for efficient 3D mesh reconstruction. This photo-to-mesh pipeline signifies a viable step towards HBIM. Full article
Show Figures

Figure 1

25 pages, 17505 KiB  
Article
A Hybrid Spatio-Temporal Graph Attention (ST D-GAT Framework) for Imputing Missing SBAS-InSAR Deformation Values to Strengthen Landslide Monitoring
by Hilal Ahmad, Yinghua Zhang, Hafeezur Rehman, Mehtab Alam, Zia Ullah, Muhammad Asfandyar Shahid, Majid Khan and Aboubakar Siddique
Remote Sens. 2025, 17(15), 2613; https://doi.org/10.3390/rs17152613 - 28 Jul 2025
Viewed by 294
Abstract
Reservoir-induced landslides threaten infrastructures and downstream communities, making continuous deformation monitoring vital. Time-series InSAR, notably the SBAS algorithm, provides high-precision surface-displacement mapping but suffers from voids due to layover/shadow effects and temporal decorrelation. Existing deep-learning approaches often operate on fixed-size patches or ignore [...] Read more.
Reservoir-induced landslides threaten infrastructures and downstream communities, making continuous deformation monitoring vital. Time-series InSAR, notably the SBAS algorithm, provides high-precision surface-displacement mapping but suffers from voids due to layover/shadow effects and temporal decorrelation. Existing deep-learning approaches often operate on fixed-size patches or ignore irregular spatio-temporal dependencies, limiting their ability to recover missing pixels. With this objective, a hybrid spatio-temporal Graph Attention (ST-GAT) framework was developed and trained on SBAS-InSAR values using 24 influential features. A unified spatio-temporal graph is constructed, where each node represents a pixel at a specific acquisition time. The nodes are connected via inverse distance spatial edges to their K-nearest neighbors, and they have bidirectional temporal edges to themselves in adjacent acquisitions. The two spatial GAT layers capture terrain-driven influences, while the two temporal GAT layers model annual deformation trends. A compact MLP with per-map bias converts the fused node embeddings into normalized LOS estimates. The SBAS-InSAR results reveal LOS deformation, with 48% of missing pixels and 20% located near the Dasu dam. ST D-GAT reconstructed fully continuous spatio-temporal displacement fields, filling voids at critical sites. The model was validated and achieved an overall R2 (0.907), ρ (0.947), per-map R2 ≥ 0.807 with RMSE ≤ 9.99, and a ROC-AUC of 0.91. It also outperformed the six compared baseline models (IDW, KNN, RF, XGBoost, MLP, simple-NN) in both RMSE and R2. By combining observed LOS values with 24 covariates in the proposed model, it delivers physically consistent gap-filling and enables continuous, high-resolution landslide monitoring in radar-challenged mountainous terrain. Full article
Show Figures

Figure 1

17 pages, 6870 KiB  
Article
Edge- and Color–Texture-Aware Bag-of-Local-Features Model for Accurate and Interpretable Skin Lesion Diagnosis
by Dichao Liu and Kenji Suzuki
Diagnostics 2025, 15(15), 1883; https://doi.org/10.3390/diagnostics15151883 - 27 Jul 2025
Viewed by 358
Abstract
Background/Objectives: Deep models have achieved remarkable progress in the diagnosis of skin lesions but face two significant drawbacks. First, they cannot effectively explain the basis of their predictions. Although attention visualization tools like Grad-CAM can create heatmaps using deep features, these features [...] Read more.
Background/Objectives: Deep models have achieved remarkable progress in the diagnosis of skin lesions but face two significant drawbacks. First, they cannot effectively explain the basis of their predictions. Although attention visualization tools like Grad-CAM can create heatmaps using deep features, these features often have large receptive fields, resulting in poor spatial alignment with the input image. Second, the design of most deep models neglects interpretable traditional visual features inspired by clinical experience, such as color–texture and edge features. This study aims to propose a novel approach integrating deep learning with traditional visual features to handle these limitations. Methods: We introduce the edge- and color–texture-aware bag-of-local-features model (ECT-BoFM), which limits the receptive field of deep features to a small size and incorporates edge and color–texture information from traditional features. A non-rigid reconstruction strategy ensures that traditional features enhance rather than constrain the model’s performance. Results: Experiments on the ISIC 2018 and 2019 datasets demonstrated that ECT-BoFM yields precise heatmaps and achieves high diagnostic performance, outperforming state-of-the-art methods. Furthermore, training models using only a small number of the most predictive patches identified by ECT-BoFM achieved diagnostic performance comparable to that obtained using full images, demonstrating its efficiency in exploring key clues. Conclusions: ECT-BoFM successfully combines deep learning and traditional visual features, addressing the interpretability and diagnostic accuracy challenges of existing methods. ECT-BoFM provides an interpretable and accurate framework for skin lesion diagnosis, advancing the integration of AI in dermatological research and clinical applications. Full article
Show Figures

Figure 1

28 pages, 3794 KiB  
Article
A Robust System for Super-Resolution Imaging in Remote Sensing via Attention-Based Residual Learning
by Rogelio Reyes-Reyes, Yeredith G. Mora-Martinez, Beatriz P. Garcia-Salgado, Volodymyr Ponomaryov, Jose A. Almaraz-Damian, Clara Cruz-Ramos and Sergiy Sadovnychiy
Mathematics 2025, 13(15), 2400; https://doi.org/10.3390/math13152400 - 25 Jul 2025
Viewed by 192
Abstract
Deep learning-based super-resolution (SR) frameworks are widely used in remote sensing applications. However, existing SR models still face limitations, particularly in recovering contours, fine features, and textures, as well as in effectively integrating channel information. To address these challenges, this study introduces a [...] Read more.
Deep learning-based super-resolution (SR) frameworks are widely used in remote sensing applications. However, existing SR models still face limitations, particularly in recovering contours, fine features, and textures, as well as in effectively integrating channel information. To address these challenges, this study introduces a novel residual model named OARN (Optimized Attention Residual Network) specifically designed to enhance the visual quality of low-resolution images. The network operates on the Y channel of the YCbCr color space and integrates LKA (Large Kernel Attention) and OCM (Optimized Convolutional Module) blocks. These components can restore large-scale spatial relationships and refine textures and contours, improving feature reconstruction without significantly increasing computational complexity. The performance of OARN was evaluated using satellite images from WorldView-2, GaoFen-2, and Microsoft Virtual Earth. Evaluation was conducted using objective quality metrics, such as Peak Signal-to-Noise Ratio (PSNR), Structural Similarity Index Measure (SSIM), Edge Preservation Index (EPI), and Perceptual Image Patch Similarity (LPIPS), demonstrating superior results compared to state-of-the-art methods in both objective measurements and subjective visual perception. Moreover, OARN achieves this performance while maintaining computational efficiency, offering a balanced trade-off between processing time and reconstruction quality. Full article
Show Figures

Figure 1

Back to TopTop