Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Article Types

Countries / Regions

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Search Results (375)

Search Parameters:
Keywords = unsupervised image segmentation

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
22 pages, 3271 KB  
Article
TextureCLIP: Cross-Dataset Zero-Shot Texture Anomaly Segmentation with Triadic Descriptive Prompting
by Xin Peng Ooi and Seong G. Kong
Electronics 2026, 15(10), 2220; https://doi.org/10.3390/electronics15102220 - 21 May 2026
Abstract
Texture anomaly segmentation aims to localize irregularities on textured surfaces, a task critical for industrial quality control. Supervised methods require extensive labeled data, while unsupervised approaches often struggle to generalize to unseen target domains. Recent zero-shot methods based on vision-language models such as [...] Read more.
Texture anomaly segmentation aims to localize irregularities on textured surfaces, a task critical for industrial quality control. Supervised methods require extensive labeled data, while unsupervised approaches often struggle to generalize to unseen target domains. Recent zero-shot methods based on vision-language models such as Contrastive Language-Image Pretraining (CLIP) enable anomaly detection through text prompts without target-domain training data. However, existing approaches typically rely on generic prompts and show limited sensitivity to fine-grained texture variations. To address these limitations, we propose TextureCLIP, a cross-dataset zero-shot framework with auxiliary training for texture anomaly segmentation. The framework is trained on source texture data from the MVTec AD texture subset using annotated source-domain samples and directly evaluated on six unseen target datasets without access to target-domain training images, annotations, or fine-tuning. The proposed Triadic Descriptive Prompting (TriDP) integrates normal prompts, generic anomaly prompts, and descriptive anomaly prompts to provide complementary semantic cues for improved cross-domain generalization. To enhance spatial sensitivity, Dual Attention Modules (DAMs) are incorporated into the CLIP image encoder to refine local feature representations. In addition, Softmax-Weighted Averaging (SMWA) aggregates multiple anomaly cues by emphasizing the prompt responses with higher similarity scores. Experimental results demonstrate that TextureCLIP achieves strong and consistent performance across diverse texture datasets, attaining 67.06% AP and 65.69% F1-max, with improvements of 5.17 and 2.66 percentage points over the competitive baselines, respectively. Full article
(This article belongs to the Section Artificial Intelligence)
Show Figures

Figure 1

19 pages, 2285 KB  
Article
In Vivo Classification of Patellar Motion Trajectories in Individuals: A 4D-CT-Based Study with Unsupervised Clustering
by Jiaying Wei, Ziyi Jiang, Xinhao Zhang, Weigen Ye, Bowen Guo, Weilin Wu, Jia Li, Mao Yuan, Dehua Wang, Hong Cheng, Wei Huang, Chen Zhao and Ke Li
Diagnostics 2026, 16(10), 1517; https://doi.org/10.3390/diagnostics16101517 - 16 May 2026
Viewed by 176
Abstract
Background: Patellar motion trajectory (PMT) is a key kinematic parameter for evaluating patellofemoral joint (PFJ) stability, but traditional static imaging indices are unable to capture the dynamic six-degrees-of-freedom (6-DOF) characteristics of patellar motion throughout the entire knee flexion–extension cycle. Four-dimensional computed tomography (4D-CT) [...] Read more.
Background: Patellar motion trajectory (PMT) is a key kinematic parameter for evaluating patellofemoral joint (PFJ) stability, but traditional static imaging indices are unable to capture the dynamic six-degrees-of-freedom (6-DOF) characteristics of patellar motion throughout the entire knee flexion–extension cycle. Four-dimensional computed tomography (4D-CT) facilitates in vivo dynamic imaging of the PFJ, while the systematic classification of PMT in asymptomatic populations has remained underexplored. Methods: A retrospective cross-sectional study was performed on 64 asymptomatic and functionally normal knees that underwent 4D-CT dynamic scanning from March 2021 to December 2025. Patellar 6-DOF kinematic data during 0° to 90° of knee flexion–extension were extracted through manifold optimization, automatic segmentation, and spatial registration. Following standardization of the motion cycle, unsupervised K-means clustering was employed to classify PMT phenotypes, with nonparametric tests used to analyze intergroup kinematic differences and evaluate clustering quality. Results: Three distinct PMT types were identified based on clustering validity indices, including a silhouette score of 0.381, a Davies-Bouldin index of 0.916, and a Calinski–Harabasz index of 44.06: Type 1 (7.81%, 35.11 ± 6.56 mm), Type 2 (56.25%, 15.67 ± 6.59 mm), and Type 3 (35.94%, 2.82 ± 2.41 mm). Lateral translation (Tx) served as the dominant determinant for PMT typing (p < 0.001), whereas non-lateral DOF parameters exhibited no consistent intergroup differences. Postural DOFs exhibited coupled fluctuations with Tx but had no independent stratification effect. Traditional static imaging parameters demonstrated no consistent correlation with these dynamic subtypes. Conclusions: Functionally asymptomatic knees exhibited three in vivo patellar 6-DOF motion trajectory phenotypes dominated by lateral translation amplitude. This 4D-CT-based typing framework provides a dynamic kinematic baseline for PFJ stability evaluation and lays a foundation for individualized optimization of ligament reconstruction and pathophysiological research of patellofemoral disorders. Full article
(This article belongs to the Special Issue Applications of Artificial Intelligence in Orthopedics)
Show Figures

Figure 1

18 pages, 8612 KB  
Article
Unsupervised Segmentation of Wear Surface Defects in Hydroturbine Bearing Pads Guided by Local Anomaly Scores
by Xiaolong Yang, Jingxuan Han, Gang Wan, Fengdi Zhu, Chuangji Qin, Ning Xu and Shuo Wang
Lubricants 2026, 14(5), 202; https://doi.org/10.3390/lubricants14050202 - 14 May 2026
Viewed by 114
Abstract
Vision-based defect detection on bearing-pad wear surfaces is essential for quantifying damage geometry and assessing condition in hydroturbine units. Compared with 2D color images, depth images can suppress disturbances caused by complex textures, surface color variations, and specular reflections, thereby providing a more [...] Read more.
Vision-based defect detection on bearing-pad wear surfaces is essential for quantifying damage geometry and assessing condition in hydroturbine units. Compared with 2D color images, depth images can suppress disturbances caused by complex textures, surface color variations, and specular reflections, thereby providing a more reliable basis for precise damage localization. Nevertheless, depth-based damage segmentation under a large field of view remains challenging, mainly due to fine-scale texture noise and weak defect saliency; moreover, robust defect probability estimation is often hindered by limited labeled data. To address these challenges, this paper proposes an unsupervised defect segmentation framework for hydroturbine friction components guided by local anomaly score distributions. First, a salient damage detection module is developed based on topography–texture separation, which mitigates the interference of local micro-texture noise on defect segmentation. Then, a normal reference dataset is constructed using defect-free bearing-pad depth images, and an unsupervised network is employed as the core to generate anomaly score representations of potential damage regions for coarse localization. Finally, the obtained anomaly score distribution is used as adaptive weights to fuse depth-based defect cues with morphological processing, enabling self-adaptive refinement of the damage regions. Experiments on real depth images acquired from hydroturbine bearing pads demonstrate that the proposed method achieves accurate defect extraction and reliable geometric quantification. Quantitative evaluations on the testing set yield a mean surface area error of 9.39% ± 4.25% and a volume error of 4.91% ± 2.85%, with best-case errors dropping as low as 3.67% and 1.03%, respectively. Crucially, these results demonstrate that our framework goes beyond mere visual detection; by operating entirely without pixel-level annotations, it offers a highly practical tool for diagnosing specific lubrication failure modes and driving predictive maintenance in actual hydroturbine engineering. Full article
(This article belongs to the Special Issue Advanced Methods for Wear Monitoring)
Show Figures

Figure 1

22 pages, 4940 KB  
Article
Enhanced Marine Radar Oil Spill Detection via Feature Guidance and BBO-SA Hybrid Optimization
by Baozhu Jia, Zekun Guo, Jin Xu, Xinru Dong, Lilin Chu, Zheng Li and Haixia Wang
Remote Sens. 2026, 18(10), 1551; https://doi.org/10.3390/rs18101551 - 13 May 2026
Viewed by 166
Abstract
X-band marine radar offers unique advantages for monitoring nearshore oil spills. However, oil films and sea clutter exhibit high pixel intensity overlap in radar images. Traditional threshold segmentation and machine learning methods have certain limitations in terms of feature extraction, Region of Interest [...] Read more.
X-band marine radar offers unique advantages for monitoring nearshore oil spills. However, oil films and sea clutter exhibit high pixel intensity overlap in radar images. Traditional threshold segmentation and machine learning methods have certain limitations in terms of feature extraction, Region of Interest (ROI) guidance, threshold optimization adaptability, and unsupervised capabilities. To address these issues, a method of oil film detection for ship radar based on multi-dimensional feature-guided extraction and hybrid optimization search is proposed. By combining Density-Based Spatial Clustering of Applications with Noise (DBSCAN) clustering with multidimensional features, this method automatically extracts ROIs under unlabeled conditions, effectively suppressing sea clutter interference. Subsequently, an improved Beaver Behavior Optimizer (BBO) and simulated annealing (SA) hybrid algorithm (BBO-SA) is introduced within the ROIs, along with a designed adaptive temperature update strategy, to achieve coordinated optimization of global and local searches. The experimental results demonstrate that the method described in this paper performs exceptionally well across all evaluation metrics, confirming its accuracy and robustness in oil film detection. It provides a viable technical approach for emergency monitoring of nearshore oil spills. Full article
Show Figures

Figure 1

28 pages, 6084 KB  
Article
Symmetric Cross-Entropy: A Novel Multi-Level Thresholding Method and Comprehensive Study of Entropy for High-Precision Arctic Ecosystem Segmentation
by Thaweesak Trongtirakul, Sos S. Agaian, Sheli Sinha Chauhuri, Khalifa Djemal and Amir A. Feiz
Information 2026, 17(4), 373; https://doi.org/10.3390/info17040373 - 16 Apr 2026
Viewed by 326
Abstract
Arctic sea ice is a critical indicator of global climate dynamics, directly influencing maritime navigation, polar biodiversity, and offshore engineering safety. The precise mapping of diverse ice types, such as frazil ice, slush, melt ponds, and open water, is essential for environmental monitoring; [...] Read more.
Arctic sea ice is a critical indicator of global climate dynamics, directly influencing maritime navigation, polar biodiversity, and offshore engineering safety. The precise mapping of diverse ice types, such as frazil ice, slush, melt ponds, and open water, is essential for environmental monitoring; however, it remains a formidable challenge in satellite remote sensing. These difficulties arise from low-contrast imagery, overlapping spectral signatures, and the subtle textural nuances characteristic of polar regions. Traditional entropy-based thresholding techniques often falter when segmenting these complex scenes, as they typically rely on Gaussian distribution assumptions that do not align with the stochastic nature of Arctic data. To address these limitations, this paper presents a novel unsupervised segmentation framework based on symmetric cross-entropy (SCE). Unlike standard directional measures, SCE provides a more robust objective function for multi-level thresholding by simultaneously maximizing intra-class cohesion and minimizing inter-class ambiguity. The proposed method uses an optimized search strategy to identify intensity levels that best delineate complex Arctic features. We conducted an extensive entropy-based comparative study that benchmarked SCE against 25 state-of-the-art entropy measures, including Shannon, Kapur, Rényi, Tsallis, and Masi entropies. Our experimental results demonstrate that the SCE method: (i) achieves superior accuracy by consistently outperforming established models in segmentation precision and boundary definition; (ii) provides visual clarity by producing segments with significantly reduced noise, making them ideal for identifying small-scale melt ponds and slush zones; and (iii) demonstrates computational robustness by providing stable threshold values even in datasets with non-Gaussian class distributions and poor illumination. Ultimately, these improvements deliver high-quality ice feature data that enhance risk assessment, operational planning, and predictive modeling. This research marks a major step forward in Arctic sea studies and introduces a valuable new tool for wider image processing and computer vision communities. Full article
(This article belongs to the Section Information Systems)
Show Figures

Figure 1

47 pages, 3286 KB  
Review
LiDAR-Based Road Surface Damage Classification: A Survey
by Trevor Greene, Meisam Shayegh Moradi, Muhammad Umair, Nafiul Nawjis, Naima Kaabouch and Timothy Pasch
Sensors 2026, 26(8), 2338; https://doi.org/10.3390/s26082338 - 10 Apr 2026
Viewed by 553
Abstract
Unlike image-only systems that falter in shadows, glare, and low contrast, LiDAR directly records surface geometry and supports depth-aware quantification. This survey examines LiDAR-based road surface damage classification across the entire pipeline, encompassing acquisition with mobile and terrestrial laser scanning, preprocessing and representation [...] Read more.
Unlike image-only systems that falter in shadows, glare, and low contrast, LiDAR directly records surface geometry and supports depth-aware quantification. This survey examines LiDAR-based road surface damage classification across the entire pipeline, encompassing acquisition with mobile and terrestrial laser scanning, preprocessing and representation choices, supervised, semi-supervised, and unsupervised learning techniques, as well as multisensor fusion at early, mid, and late stages. A consistent thread is measurement, not just detection: we describe how LiDAR damage classification maps to agency practices such as the Distress Identification Manual and the Pavement Condition Index. We summarize datasets and evaluation protocols for detection, segmentation, 3D reconstruction, and ride quality. We outline practical concerns for corridor-scale deployment: calibration and timing, intensity normalization, tiling/streaming, and runtime budgeting. The review concludes with open problems and outlines directions for robust, severity-aware, and scalable field systems. Full article
(This article belongs to the Section Remote Sensors)
Show Figures

Figure 1

24 pages, 4332 KB  
Article
Depth-Aware Adversarial Domain Adaptation for Cross-Domain Remote Sensing Segmentation
by Lulu Niu, Xiaoxuan Liu, Enze Zhu, Yidan Zhang, Hanru Shi, Xiaohe Li, Hong Wang, Jie Jia and Lei Wang
Remote Sens. 2026, 18(7), 1099; https://doi.org/10.3390/rs18071099 - 7 Apr 2026
Viewed by 511
Abstract
As a key task in remote sensing analysis, semantic segmentation of remote sensing images (RSI) underpins many practical applications. Despite its importance, obtaining dense pixel-wise annotations remains labor-intensive and time-consuming. Unsupervised domain adaptation (UDA) offers a promising solution by utilizing knowledge from labeled [...] Read more.
As a key task in remote sensing analysis, semantic segmentation of remote sensing images (RSI) underpins many practical applications. Despite its importance, obtaining dense pixel-wise annotations remains labor-intensive and time-consuming. Unsupervised domain adaptation (UDA) offers a promising solution by utilizing knowledge from labeled source domains for unlabeled target domains, yet its effectiveness is often compromised by significant distribution shifts arising from variations in imaging conditions. To address this challenge, we propose a depth-aware adaptation network (DAAN), a novel two-branch network that explicitly leverages complementary depth information from a digital surface model (DSM) to enhance cross-domain remote sensing segmentation. Unlike conventional UDA methods that primarily focus on semantic features, DAAN incorporates depth data to build a more generalized feature space. This network introduces three key components: an adaptive feature aggregator (AFA) for progressive semantic-depth feature fusion, a gated prediction selection unit (GPSU) that selectively integrates predictions to mitigate the impact of noisy depth measurements, and misalignment-focused residual refinement (MFRR) module that emphasizes poorly aligned target regions during training. Experiments on the ISPRS and GAMUS datasets demonstrate the effectiveness of the proposed method. In particular, DAAN achieves an mIoU of 50.53% and an F1 score of 65.75% for cross-domain segmentation on ISPRS to GAMUS, outperforming models without depth information by 9.17% and 8.99%, respectively. These results demonstrate the advantage of integrating auxiliary geometric information to improve model generalization on unlabeled remote sensing datasets, contributing to higher mapping accuracy, more reliable automated analysis, and enhanced decision-making support. Full article
Show Figures

Figure 1

31 pages, 12141 KB  
Article
A Reliability-Guided Unsupervised Domain Adaptation Framework for Robust Semantic Segmentation Under Adverse Driving Conditions
by Nan Xia and Guoqing Hu
Appl. Sci. 2026, 16(6), 3036; https://doi.org/10.3390/app16063036 - 20 Mar 2026
Viewed by 350
Abstract
Adverse weather and low illumination remain major challenges for autonomous driving perception, where semantic segmentation must stay reliable despite severe appearance degradation. In unsupervised domain adaptation without target annotations, self-training is widely used, but it is often limited by the inconsistent quality of [...] Read more.
Adverse weather and low illumination remain major challenges for autonomous driving perception, where semantic segmentation must stay reliable despite severe appearance degradation. In unsupervised domain adaptation without target annotations, self-training is widely used, but it is often limited by the inconsistent quality of teacher-generated pseudo labels across samples, regions, and training stages. This paper presents RaDA, a reliability-aware self-training framework that regulates pseudo supervision at three levels. First, a progressive exposure strategy determines which target images are admitted for training. Second, spatial reliability weighting suppresses gradients from degraded regions while retaining informative supervision. Third, adaptive teacher update scheduling stabilizes pseudo label generation over time. Experiments on real-world adverse driving benchmarks show that RaDA improves robustness, training stability, and cross-dataset generalization compared with strong baselines. Compared with the previous state-of-the-art method MIC, RaDA achieves mIoU gains of 10.6 percentage points on Foggy Zurich and 8.8 percentage points on the Foggy Driving benchmark. These results indicate that explicit reliability regulation can strengthen self-training domain adaptation for semantic segmentation in autonomous driving under challenging environmental conditions. Full article
Show Figures

Figure 1

25 pages, 29036 KB  
Article
Task-Oriented Unsupervised SAR Image Enhancement with Semantic Preservation for Robust Target Recognition
by Chengyu Wan, Siqian Zhang, Lingjun Zhao, Tao Tang and Gangyao Kuang
Remote Sens. 2026, 18(6), 930; https://doi.org/10.3390/rs18060930 - 19 Mar 2026
Viewed by 353
Abstract
Synthetic aperture radar (SAR) images often suffer from coupled degradations such as speckle noise, background clutter, and system disturbances, which distort target structure and reduce feature discriminability for target recognition. Most existing enhancement methods typically optimize perceptual quality and may produce visually appealing [...] Read more.
Synthetic aperture radar (SAR) images often suffer from coupled degradations such as speckle noise, background clutter, and system disturbances, which distort target structure and reduce feature discriminability for target recognition. Most existing enhancement methods typically optimize perceptual quality and may produce visually appealing yet recognition-inconsistent results, especially when paired supervision is unavailable. To address this, an unsupervised SAR image quality enhancement framework is proposed in this study, formulating the degradation as a domain shift problem between low- and high-quality SAR data. A DualGAN-based architecture is adopted to learn bidirectional mappings with reconstruction regularization, enabling enhancement without paired samples. To explicitly preserve task-relevant features and enforce structural consistency, a segmentation-guided recognition-oriented constraint is introduced to embed task awareness into the enhancement process. Furthermore, to mitigate semantic drift during unpaired translation, a semantic preservation constraint based on contrastive learning is proposed to align the enhanced, original, and smoothed images, which can maintain semantic fidelity and reinforce structural cues. Experimental results demonstrate that the proposed framework effectively bridges the domain gap between low- and high-quality SAR images, producing semantically consistent enhancement and improving robustness in target recognition. Evaluations on the GMVT dataset show that the proposed method achieves an average recognition accuracy improvement of over 10% across six recognition networks and four imaging conditions. Full article
(This article belongs to the Special Issue SAR Images Processing and Analysis (3rd Edition))
Show Figures

Figure 1

20 pages, 4542 KB  
Article
Hierarchy Clustering for Cloud Detection Assisted by Spectral Features of Ground Covers
by Wanxin Song, Shilong Jia, Tianjin Liu and Xiaoyu He
Remote Sens. 2026, 18(5), 698; https://doi.org/10.3390/rs18050698 - 26 Feb 2026
Viewed by 304
Abstract
Cloud detection is an important procedure for the processing of remote sensing images. A cloud detection scheme driven by the spectral and the temporal features is presented in this paper, where an unsupervised hierarchy clustering approach is proposed for large scale image segmentation. [...] Read more.
Cloud detection is an important procedure for the processing of remote sensing images. A cloud detection scheme driven by the spectral and the temporal features is presented in this paper, where an unsupervised hierarchy clustering approach is proposed for large scale image segmentation. The potential cloudy pixels are identified by means of the spectral matching, in which the spectral data of the clustering centers are compared to the patterns in the spectral dataset of ground covers. The matched pixels are regarded as cloudless pixels, whose category can be recognized accordingly. In contrast, the bright temperatures corresponding to the unmatched pixels are used to exclude the interference of the occasional hotspots, enabling the final cloud detection result. Landsat 8 and Sentinel-2 satellite data are used in the validation to demonstrate the precision and stability of the proposed scheme for the data at different spatial resolutions. Full article
Show Figures

Figure 1

26 pages, 29834 KB  
Article
Self-Training Based Image–Text Multimodal Unsupervised Domain Adaptation Segmentation Model for Remote Sensing Images
by Qianqian Liu and Xili Wang
Remote Sens. 2026, 18(4), 651; https://doi.org/10.3390/rs18040651 - 20 Feb 2026
Viewed by 570
Abstract
Deep self-training-based unsupervised domain adaptation (UDA) semantic segmentation methods learn from labeled source domain images and unlabeled target domain images, performing more stably than those based on adversarial training. We propose a self-training-based image–text multimodal unsupervised domain adaptation semantic segmentation model (SIT-UDA) for [...] Read more.
Deep self-training-based unsupervised domain adaptation (UDA) semantic segmentation methods learn from labeled source domain images and unlabeled target domain images, performing more stably than those based on adversarial training. We propose a self-training-based image–text multimodal unsupervised domain adaptation semantic segmentation model (SIT-UDA) for remote sensing images. Unlike UDA methods, which rely solely on images, SIT-UDA enhances generalization performance by integrating category hint information from textual descriptions with image data to segment images. SIT-UDA employs a teacher–student self-training framework consisting of two components: the teacher multimodal segmentation model, which predicts pseudo-labels for target domain images, and the student multimodal segmentation model, which is trained to learn feature representations from both the source and target domains with guidance from the teacher model. To enhance the adaptability of image–text pretrained models in remote sensing domains, SIT-UDA introduces text prompt tuning to optimize the text features in the student model, and two learning strategies are proposed to optimize the model’s training objectives: One is the entropy-guided pixel-level weighting (EGPW) strategy, which adaptively weights the loss obtained by self-training on target domain images, leveraging the pseudo-labels rationally according to the entropy value at the pixel level. The other is the contrastive text constraint (CTC) strategy, which maximizes the similarity of text features for the same category between teacher and student models while minimizing the similarity of text features across different categories, improving text feature discriminability to promote cross-domain image–text alignment. Experiments in various domain adaptation scenarios among three remote sensing datasets (Potsdam, Vaihingen and LoveDA) demonstrate that the SIT-UDA is superior to the comparative domain adaptation semantic segmentation methods in terms of qualitative and quantitative segmentation results. Full article
Show Figures

Figure 1

34 pages, 3680 KB  
Article
A Semi-Supervised Transformer with a Curriculum Training Pipeline for Remote Sensing Image Semantic Segmentation
by Peizhuo Liu, Hongbo Zhu, Xiaofei Mi, Yuke Meng, Huijie Zhao and Xingfa Gu
Remote Sens. 2026, 18(3), 480; https://doi.org/10.3390/rs18030480 - 2 Feb 2026
Viewed by 616
Abstract
Semantic segmentation of remote sensing images is crucial for geospatial applications but is severely hampered by the prohibitive cost of pixel-level annotations. Although semi-supervised learning (SSL) offers a solution by leveraging unlabeled data, its application to Vision Transformers (ViTs) often encounters overfitting and [...] Read more.
Semantic segmentation of remote sensing images is crucial for geospatial applications but is severely hampered by the prohibitive cost of pixel-level annotations. Although semi-supervised learning (SSL) offers a solution by leveraging unlabeled data, its application to Vision Transformers (ViTs) often encounters overfitting and even training instability under extreme label scarcity. To tackle these challenges, we propose a Curriculum-based Self-supervised and Semi-supervised Pipeline (CSSP). The pipeline adopts a staged, easy-to-hard training strategy, commencing with in-domain pretraining for robust feature representation, followed by a carefully designed finetuning stage to prevent overfitting. The pipeline further integrates a novel Difficulty-Adaptive ClassMix (DA-ClassMix) augmentation that dynamically reinforces underperforming categories and a Progressive Intensity Adaptation (PIA) strategy that systematically escalates augmentation strength to maximize model generalization. Extensive evaluations on the Potsdam, Vaihingen, and Inria datasets demonstrate state-of-the-art performance. Notably, with only 1/32 of the labeled data on the Potsdam dataset, the CSSP reaches 82.16% mIoU, nearly matching the fully supervised result (82.24%). Furthermore, we extend the CSSP to a semi-supervised domain adaptation (SSDA) scenario, termed Cross-Domain CSSP (CDCSSP), which outperforms existing SSDA and unsupervised domain adaptation (UDA) methods. This work establishes a stable and highly effective framework for training ViT-based segmentation models with minimal annotation overhead. Full article
Show Figures

Figure 1

17 pages, 10961 KB  
Article
Optimizing Image Segmentation for Microstructure Analysis of High-Strength Steel: Histogram-Based Recognition of Martensite and Bainite
by Filip Hallo, Tomasz Jażdżewski, Piotr Bała, Grzegorz Korpała and Krzysztof Regulski
Materials 2026, 19(2), 429; https://doi.org/10.3390/ma19020429 - 22 Jan 2026
Viewed by 592
Abstract
This study systematically compares three unsupervised segmentation algorithms (Simple Linear Iterative Clustering (SLIC), Felzenszwalb’s graph-based method, and the Watershed algorithm) in combination with two classification approaches: Random Forest using histogram-based features and Convolutional Neural Networks (CNNs). The study employs Bayesian optimization to jointly [...] Read more.
This study systematically compares three unsupervised segmentation algorithms (Simple Linear Iterative Clustering (SLIC), Felzenszwalb’s graph-based method, and the Watershed algorithm) in combination with two classification approaches: Random Forest using histogram-based features and Convolutional Neural Networks (CNNs). The study employs Bayesian optimization to jointly tune segmentation parameters and model hyperparameters, investigating how segmentation quality impacts downstream classification performance. The methodology is validated using light optical microscopy images of a high-strength steel sample, with performance evaluated through stratified cross-validation and independent test sets. The findings demonstrate the critical importance of segmentation algorithm selection and provide insights into the trade-offs between feature-engineered and end-to-end learning approaches for microstructure analysis. Full article
(This article belongs to the Section Metals and Alloys)
Show Figures

Figure 1

26 pages, 7486 KB  
Article
ADAM-Net: Anatomy-Guided Attentive Unsupervised Domain Adaptation for Joint MG Segmentation and MGD Grading
by Junbin Fang, Xuan He, You Jiang and Mini Han Wang
J. Imaging 2026, 12(1), 50; https://doi.org/10.3390/jimaging12010050 - 21 Jan 2026
Viewed by 645
Abstract
Meibomian gland dysfunction (MGD) is a leading cause of dry eye disease, assessable through gland atrophy degree. While deep learning (DL) has advanced meibomian gland (MG) segmentation and MGD classification, existing methods treat these tasks independently and suffer from domain shift across multi-center [...] Read more.
Meibomian gland dysfunction (MGD) is a leading cause of dry eye disease, assessable through gland atrophy degree. While deep learning (DL) has advanced meibomian gland (MG) segmentation and MGD classification, existing methods treat these tasks independently and suffer from domain shift across multi-center imaging devices. We propose ADAM-Net, an attention-guided unsupervised domain adaptation multi-task framework that jointly models MG segmentation and MGD classification. Our model introduces structure-aware multi-task learning and anatomy-guided attention to enhance feature sharing, suppress background noise, and improve glandular region perception. For the cross-domain tasks MGD-1K→{K5M, CR-2, LV II}, this study systematically evaluates the overall performance of ADAM-Net from multiple perspectives. The experimental results show that ADAM-Net achieves classification accuracies of 77.93%, 74.86%, and 81.77% on the target domains, significantly outperforming current mainstream unsupervised domain adaptation (UDA) methods. The F1-score and the Matthews correlation coefficient (MCC-score) indicate that the model maintains robust discriminative capability even under class-imbalanced scenarios. t-SNE visualizations further validate its cross-domain feature alignment capability. These demonstrate that ADAM-Net exhibits strong robustness and interpretability in multi-center scenarios and provide an effective solution for automated MGD assessment. Full article
(This article belongs to the Special Issue Imaging in Healthcare: Progress and Challenges)
Show Figures

Figure 1

31 pages, 15918 KB  
Article
Cross-Domain Landslide Mapping in Remote Sensing Images Based on Unsupervised Domain Adaptation Framework
by Jing Yang, Mingtao Ding, Wubiao Huang, Qiang Xue, Ying Dong, Bo Chen, Lulu Peng, Fuling Zhang and Zhenhong Li
Remote Sens. 2026, 18(2), 286; https://doi.org/10.3390/rs18020286 - 15 Jan 2026
Viewed by 832
Abstract
Rapid and accurate acquisition of landslide inventories is essential for effective disaster relief. Deep learning-based pixel-wise semantic segmentation of remote sensing imagery has greatly advanced in landslide mapping. However, the heavy dependance on extensive annotated labels and sensitivity to domain shifts severely constrain [...] Read more.
Rapid and accurate acquisition of landslide inventories is essential for effective disaster relief. Deep learning-based pixel-wise semantic segmentation of remote sensing imagery has greatly advanced in landslide mapping. However, the heavy dependance on extensive annotated labels and sensitivity to domain shifts severely constrain the model performance in unseen domains, leading to poor generalization. To address these limitations, we propose LandsDANet, an innovative unsupervised domain adaptation framework for cross-domain landslide identification. Firstly, adversarial learning is employed to reduce the data distribution discrepancies between the source and target domains, thereby achieving output space alignment. The improved SegFormer serves as the segmentation network, incorporating hierarchical Transformer blocks and an attention mechanism to enhance feature representation capabilities. Secondly, to alleviate inter-domain radiometric discrepancies and attain image-level alignment, a Wallis filter is utilized to perform image style transformation. Considering the class imbalance present in the landslide dataset, a Rare Class Sampling strategy is introduced to mitigate bias towards common classes and strengthen the learning of the rare landslide class. Finally, a contrastive loss is adopted to further optimize and enhance the model’s ability to delineate fine-grained class boundaries. The proposed model is validated on the Potsdam and Vaihingen benchmark datasets, followed by validation in two landslide scenarios induced by earthquakes and rainfall to evaluate its adaptability across different disaster domains. Compared to the source-only model, LandsDANet achieved improvements in IoU of 27.04% and 35.73% in two cross-domain landslide disaster recognition tasks, respectively. This performance not only showcases its outstanding capabilities but also underscores its robust potential to meet the demands for rapid response. Full article
(This article belongs to the Section AI Remote Sensing)
Show Figures

Graphical abstract

Back to TopTop