MDPI - Publisher of Open Access Journals

26 pages, 11464 KB

Open AccessArticle

Differentiable Superpixel Generation with Complexity-Aware Initialization and Edge Reconstruction for SAR Imagery

by Hang Yu, Jiaye Liang, Gao Han and Lei Wang

Remote Sens. 2026, 18(8), 1213; https://doi.org/10.3390/rs18081213 - 17 Apr 2026

Viewed by 374

Synthetic Aperture Radar (SAR) imagery is inherently degraded by multiplicative speckle noise, rendering traditional superpixel methods—which rely on hard assignment and uniform initialization—suboptimal for boundary preservation. This study proposes a complexity-aware superpixel generation framework featuring differentiable soft-assignment optimization. The approach employs an F-LGRP [...] Read more.

Synthetic Aperture Radar (SAR) imagery is inherently degraded by multiplicative speckle noise, rendering traditional superpixel methods—which rely on hard assignment and uniform initialization—suboptimal for boundary preservation. This study proposes a complexity-aware superpixel generation framework featuring differentiable soft-assignment optimization. The approach employs an F-LGRP (Fusion of Local Gradient Pattern Representation) feature descriptor that fuses regional gradient statistics via Gaussian filtering to suppress speckle, coupled with a complexity-driven recursive quadtree initialization strategy yielding non-uniform seed density. A U-Net architecture predicts soft pixel–superpixel association maps within a 9-neighborhood constraint, supervised by a multi-objective loss integrating edge information reconstruction and boundary feature reconstruction. Comprehensive evaluations on simulated and real SAR images (WHU-OPT-SAR and Munich) demonstrate that the proposed method achieves state-of-the-art performance across Boundary Recall, Undersegmentation Error, Compactness, and Achievable Segmentation Accuracy compared to SLIC, SNIC, Mean-Shift, PILS, and SSN. Validation on downstream segmentation tasks further confirms superior accuracy and computational efficiency, establishing the framework as an effective solution for end-to-end SAR image analysis. Full article

(This article belongs to the Section Remote Sensing Image Processing)

► Show Figures

Figure 1

20 pages, 29969 KB

Open AccessArticle

A Study on Integration of Topographic Clustering and Physical Constraints for Flood Propagation Simulation

by Xu Zhang, Xiaotao Li, Yingwei Sun, Qiaomei Su, Shifan Yuan, Mei Yang, Qianfang Lou and Bingyuan Chen

Remote Sens. 2026, 18(6), 885; https://doi.org/10.3390/rs18060885 - 13 Mar 2026

Viewed by 423

Abstract

Global climate change is increasing extreme rainfall events, and severe floods are becoming more frequent. Flood storage and detention basins (FSDBs) are an important part of the flood control system in China. They play a key role in regional flood emergency response and [...] Read more.

Global climate change is increasing extreme rainfall events, and severe floods are becoming more frequent. Flood storage and detention basins (FSDBs) are an important part of the flood control system in China. They play a key role in regional flood emergency response and regulation. Therefore, accurate simulation of flood evolution after the activation of FSDBs is urgently needed. This study proposes a high-accuracy flood evolution simulation method that combines terrain clustering and physical propagation constraints. We first build a 2 m resolution digital elevation model (DEM) using GF-7 stereo imagery and laser altimetry data. We then introduce an improved superpixel segmentation algorithm (TSLIC). This method reduces the number of computational units while preserving key micro-topographic features. It groups high-resolution grids into terrain units with similar elevation characteristics and continuous spatial structure. Based on these terrain units, we develop a flood evolution model called RS-CFPM. The model combines flow velocity estimated from the Manning equation with flood propagation speed derived from radar remote sensing. It uses a water balance framework and includes a propagation time delay constraint. This design helps overcome the limitation of traditional static inundation methods that ignore flood travel time. We apply the proposed method to simulate the flood inundation process during the “23·7” extreme basin-scale flood event in the Haihe River Basin. Comparison with multi-temporal radar observations shows that the errors of simulated water level and inundation extent in the Dongdian FSDB are both within 10%. The computational efficiency is also improved by more than 60% compared with traditional methods. This study provides a new approach for rapid and accurate simulation of flood inundation processes in FSDBs under emergency conditions. The method can support flood emergency operation and decision-making. Full article

(This article belongs to the Topic Advances in Earth Observation Technologies to Support Water-Related Sustainable Development Goals (SDGs))

► Show Figures

Figure 1

22 pages, 23521 KB

Open AccessArticle

Superpixel-Tokenized and Frequency-Modulated Hybrid CNN–Transformer for Remote Sensing Semantic Segmentation

by Xinlin Xie, Chenhao Chang, Yunyun Yang and Gang Xie

Remote Sens. 2026, 18(5), 754; https://doi.org/10.3390/rs18050754 - 2 Mar 2026

Cited by 1 | Viewed by 639

Abstract

Remote sensing semantic segmentation is fundamental for fine-grained urban scene understanding, which in turn provides pixel-level semantic insights for urban development and environmental surveillance. However, existing hybrid segmentation architectures fail to incorporate intrinsic geometric and physical priors, inevitably leading to structural fragmentation, boundary [...] Read more.

Remote sensing semantic segmentation is fundamental for fine-grained urban scene understanding, which in turn provides pixel-level semantic insights for urban development and environmental surveillance. However, existing hybrid segmentation architectures fail to incorporate intrinsic geometric and physical priors, inevitably leading to structural fragmentation, boundary ambiguity, and spatial misalignment of heterogeneous features. Therefore, we propose a Superpixel-Tokenized and Frequency-Modulated Hybrid CNN–Transformer network (SFCT-Net) for remote sensing semantic segmentation. The proposed network integrates superpixel tokens and high-frequency constraints to preserve structural integrity and boundary precision. First, our Superpixel-Tokenized Linear Position Attention (STLPA) module replaces rigid window tokens with semantic superpixels to ensure object integrity with linear computational complexity. Second, we construct a Frequency-Modulated Deformable Edge Refinement (FMDER) module that leverages high-frequency spectral priors to modulate deformable sampling, achieving robust boundary recovery. Finally, we develop the Spatial–Semantic Feature Coupling (SSFC) module, which employs a dual-branch strategy to correct spatial drift and align deep semantic features with shallow details. Experiments conducted on our self-built Taiyuan Satellite Remote Sensing Dataset (TSRSD) along with the ISPRS Vaihingen and Potsdam benchmark datasets demonstrate that our proposed SFCT-Net delivers state-of-the-art performance and efficiency by fusing superpixel and frequency priors for robust structural and boundary recovery. Full article

► Show Figures

Figure 1

22 pages, 5115 KB

Open AccessArticle

Intelligent Detection Method of Defects in High-Rise Building Facades Using Infrared Thermography

by Daiming Liu, Yongqiang Jin, Yuan Yang, Zhenyang Xiao, Zeming Zhao, Changling Gao and Dingcheng Zhang

Sensors 2026, 26(2), 694; https://doi.org/10.3390/s26020694 - 20 Jan 2026

Cited by 1 | Viewed by 1113

Abstract

High-rise building facades are prone to defects due to prolonged exposure to complex environments. Infrared detection, as a commonly employed method for facade defect inspection, often results in low accuracy owing to abundant interferences and blurred defect boundaries. In this work, an intelligent [...] Read more.

High-rise building facades are prone to defects due to prolonged exposure to complex environments. Infrared detection, as a commonly employed method for facade defect inspection, often results in low accuracy owing to abundant interferences and blurred defect boundaries. In this work, an intelligent defect detection method for high-rise building facades is proposed. In the first stage of the proposed method, a segmentation model based on DeepLabV3+ is proposed to remove interferences in infrared images using masks. The model incorporates a Post-Decoder Dual-Branch Boundary Refinement Module, which is subdivided into a boundary feature optimization branch and a boundary-guided attention branch. Sub-pixel-level contour refinement and boundary-adaptive weighting are hence achieved to mitigate edge blurring induced by thermal diffusion and to enhance the perception of slender cracks and cavity edges. A triple constraint mechanism is also introduced, combining cross-entropy, multi-scale Dice, and boundary-aware losses to address class imbalance and enhance segmentation performance for small targets. Furthermore, superpixel linear iterative clustering (SLIC) is utilized to enforce regional consistency, hence improving the smoothness and robustness of predictions. In the second stage of the proposed method, a defect detection model based on YOLOV11 is proposed to process masked infrared images for detecting hollow, seepage, cracks and detachment. This work validates the proposed method using 180 infrared images collected via unmanned aerial vehicles. The experimental results demonstrate that the proposed method achieves a detection precision of 89.7%, an mAP@0.5 of 87.9%, and a 57.8 mAP@50-95. surpassing other algorithms and confirming its effectiveness and superiority. Full article

(This article belongs to the Section Fault Diagnosis & Sensors)

► Show Figures

Figure 1

26 pages, 1856 KB

Open AccessArticle

Spectral–Spatial Superpixel Bi-Stochastic Graph Learning for Large-Scale and High-Dimensional Hyperspectral Image Clustering

by Cheng Chen, Nian Wang, Shengming Wang, Jiping Cao, Tao Wang, Zhigao Cui and Yanzhao Su

Remote Sens. 2025, 17(23), 3799; https://doi.org/10.3390/rs17233799 - 23 Nov 2025

Cited by 1 | Viewed by 833

Abstract

Despite the substantial body of work that has achieved large-scale data expansion using anchor-based strategies, these methods incur linear complexity relative to the sample size during iterative processes, making them quite time-consuming. Moreover, as feature dimensionality reduction is often overlooked in this procedure, [...] Read more.

Despite the substantial body of work that has achieved large-scale data expansion using anchor-based strategies, these methods incur linear complexity relative to the sample size during iterative processes, making them quite time-consuming. Moreover, as feature dimensionality reduction is often overlooked in this procedure, most of them suffer from the “curse of dimensionality”. To address all these issues simultaneously, we introduce a novel paradigm with a superpixel encoding and data projecting strategy, which learns a small-scale bi-stochastic graph from the data matrix with large-scale pixels and high-dimensional spectral features to achieve effective clustering. Moreover, a symmetric neighbor search strategy is integrated into our framework to ensure the sparsity of graph and further improve the calculation efficiency. For optimization, a simple yet effective strategy is designed, which simultaneously satisfies all bi-stochastic constraints while ensuring convergence to the optimal solution. To validate our model’s effectiveness and scalability, we conduct extensive experiments on various-scale hyperspectral images (HSIs). The results demonstrate that our method achieves the state-of-the-art clustering performance, and can be better extended to large-scale and high-dimensional HSIs. Full article

(This article belongs to the Section Remote Sensing Image Processing)

► Show Figures

Figure 1

23 pages, 8750 KB

Open AccessArticle

Semi-BSU: A Boundary-Aware Semi-Supervised Semantic Segmentation Framework with Superpixel Refinement for Coastal Aquaculture Pond Extraction from Remote Sensing Images

by Yaocan Gan, Bo Cheng, Chunbo Li, Weilong Fu and Xiaoping Zhang

Remote Sens. 2025, 17(22), 3733; https://doi.org/10.3390/rs17223733 - 17 Nov 2025

Viewed by 1206

Abstract

Accurate segmentation of coastal aquaculture ponds from high-resolution remote sensing images is critical for applications such as coastal environmental monitoring, land use mapping, and infrastructure management. Semi-supervised learning (SSL) has emerged as a promising paradigm by leveraging labeled and unlabeled data to reduce [...] Read more.

Accurate segmentation of coastal aquaculture ponds from high-resolution remote sensing images is critical for applications such as coastal environmental monitoring, land use mapping, and infrastructure management. Semi-supervised learning (SSL) has emerged as a promising paradigm by leveraging labeled and unlabeled data to reduce annotation costs. However, existing SSL methods often suffer from pseudo-label quality degradation, manifested as boundary adhesion and intra-class inconsistencies, which significantly affect segmentation accuracy. To address these challenges, we propose Semi-BSU, a boundary-aware semi-supervised semantic segmentation framework based on the mean teacher architecture. Semi-BSU integrates two novel components: (1) a Boundary Consistency Constraint (BCC), which employs an auxiliary boundary classifier to enhance contour accuracy in pseudo labels, and (2) a Superpixel Refinement Module (SRM), which refines pseudo labels at the superpixel level to improve intra-class consistency. Comprehensive experiments conducted on GF6 and ZY1E high-resolution remote sensing imagery, covering diverse coastal environments with complex geomorphological features, demonstrate the effectiveness of our approach. With half of the training set labeled, Semi-BSU achieves an MIOU of 0.8606, F1 score of 0.8896, and Kappa coefficient of 0.8080, outperforming state-of-the-art methods including CPS, GCT, and UniMatch by 0.3–4.9% in MIOU. The method maintains a compact computational footprint with only 1.81 M parameters and 55.71 GFLOPs. Even with only 1/8 labeled data, it yields a 3.57% MIOU improvement over the supervised baseline. The results demonstrate that combining boundary-aware learning with superpixel-based refinement offers an effective and efficient strategy for high-quality pseudo-label generation and accurate mapping of coastal aquaculture ponds in remote sensing imagery. Full article

► Show Figures

Figure 1

19 pages, 1948 KB

Open AccessArticle

Graph-MambaRoadDet: A Symmetry-Aware Dynamic Graph Framework for Road Damage Detection

by Zichun Tian, Xiaokang Shao and Yuqi Bai

Symmetry 2025, 17(10), 1654; https://doi.org/10.3390/sym17101654 - 5 Oct 2025

Cited by 1 | Viewed by 1549

Abstract

Road-surface distress poses a serious threat to traffic safety and imposes a growing burden on urban maintenance budgets. While modern detectors based on convolutional networks and Vision Transformers achieve strong frame-level performance, they often overlook an essential property of road environments—structural symmetry [...] Read more.

Road-surface distress poses a serious threat to traffic safety and imposes a growing burden on urban maintenance budgets. While modern detectors based on convolutional networks and Vision Transformers achieve strong frame-level performance, they often overlook an essential property of road environments—structural symmetry within road networks and damage patterns. We present Graph-MambaRoadDet (GMRD), a symmetry-aware and lightweight framework that integrates dynamic graph reasoning with state–space modeling for accurate, topology-informed, and real-time road damage detection. Specifically, GMRD employs an EfficientViM-T1 backbone and two DefMamba blocks, whose deformable scanning paths capture sub-pixel crack patterns while preserving geometric symmetry. A superpixel-based graph is constructed by projecting image regions onto OpenStreetMap road segments, encoding both spatial structure and symmetric topological layout. We introduce a Graph-Generating State–Space Model (GG-SSM) that synthesizes sparse sample-specific adjacency in

O (M)

time, further refined by a fusion module that combines detector self-attention with prior symmetry constraints. A consistency loss promotes smooth predictions across symmetric or adjacent segments. The full INT8 model contains only 1.8 M parameters and 1.5 GFLOPs, sustaining 45 FPS at 7 W on a Jetson Orin Nano—eight times lighter and 1.7× faster than YOLOv8-s. On RDD2022, TD-RD, and RoadBench-100K, GMRD surpasses strong baselines by up to +6.1 mAP_50:95 and, on the new RoadGraph-RDD benchmark, achieves +5.3 G-mAP and +0.05 consistency gain. Qualitative results demonstrate robustness under shadows, reflections, back-lighting, and occlusion. By explicitly modeling spatial and topological symmetry, GMRD offers a principled solution for city-scale road infrastructure monitoring under real-time and edge-computing constraints. Full article

(This article belongs to the Section Computer)

► Show Figures

Figure 1

23 pages, 3488 KB

Open AccessArticle

Unsupervised Hyperspectral Band Selection Using Spectral–Spatial Iterative Greedy Algorithm

by Xin Yang and Wenhong Wang

Sensors 2025, 25(18), 5638; https://doi.org/10.3390/s25185638 - 10 Sep 2025

Cited by 1 | Viewed by 1379

Abstract

Hyperspectral band selection (BS) is an important technique to reduce data dimensionality for the classification applications of hyperspectral remote sensing images (HSIs). Recently, searching-based BS methods have received increasing attention for their ability to select the best subset of bands while preserving the [...] Read more.

Hyperspectral band selection (BS) is an important technique to reduce data dimensionality for the classification applications of hyperspectral remote sensing images (HSIs). Recently, searching-based BS methods have received increasing attention for their ability to select the best subset of bands while preserving the essential information of the original data. However, existing searching-based BS methods neglect effective exploitation of the spatial and spectral prior information inherent in the data, thus limiting their performance. To address this problem, in this study, a novel unsupervised BS method called Spectral–Spatial Iterative Greedy Algorithm (SSIGA) is proposed. Specifically, to facilitate efficient local search using spectral information, SSIGA conducts clustering on all the bands by employing a K-means clustering method with balanced cluster size constraints and constructs a K-nearest neighbor graph for each cluster. Based on the nearest neighbor graphs, SSIGA can effectively explore the neighborhood solutions in local search. In addition, to efficiently evaluate the discriminability and information redundancy of the solution given by SSIGA using the spatial and spectral information of HSIs, we designed an effective objective function for SSIGA. The value of the objective function is derived by calculating the Fisher score for each band in the solution based on the results of the superpixel segmentation performed on the target HSI, as well as by computing the average information entropy and mutual information of the bands in the solution. Experimental results on three publicly available real HSI datasets demonstrate that the SSIG algorithm achieves superior performance compared to several state-of-the-art methods. Full article

(This article belongs to the Section Sensing and Imaging)

► Show Figures

Figure 1

20 pages, 4058 KB

Open AccessArticle

SCSU–GDO: Superpixel Collaborative Sparse Unmixing with Graph Differential Operator for Hyperspectral Imagery

by Kaijun Yang, Zhixin Zhao, Qishen Yang and Ruyi Feng

Remote Sens. 2025, 17(17), 3088; https://doi.org/10.3390/rs17173088 - 4 Sep 2025

Cited by 1 | Viewed by 1567

Abstract

In recent years, remarkable advancements have been achieved in hyperspectral unmixing (HU). Sparse unmixing, in which models mix pixels as linear combinations of endmembers and their corresponding fractional abundances, has become a dominant paradigm in hyperspectral image analysis. To address the inherent limitations [...] Read more.

In recent years, remarkable advancements have been achieved in hyperspectral unmixing (HU). Sparse unmixing, in which models mix pixels as linear combinations of endmembers and their corresponding fractional abundances, has become a dominant paradigm in hyperspectral image analysis. To address the inherent limitations of spectral-only approaches, spatial contextual information has been integrated into unmixing. In this article, a superpixel collaborative sparse unmixing algorithm with graph differential operator (SCSU–GDO), is proposed, which effectively integrates superpixel-based local collaboration with graph differential spatial regularization. The proposed algorithm contains three key steps. First, superpixel segmentation partitions the hyperspectral image into homogeneous regions, leveraging boundary information to preserve structural coherence. Subsequently, a local collaborative weighted sparse regression model is formulated to jointly enforce data fidelity and sparsity constraints on abundance estimation. Finally, to enhance spatial consistency, the Laplacian matrix derived from graph learning is decomposed into a graph differential operator, adaptively capturing local smoothness and structural discontinuities within the image. Comprehensive experiments on three datasets prove the accuracy, robustness, and practical efficacy of the proposed method. Full article

(This article belongs to the Special Issue Imagery Classification and Feature Extraction Based on Hyperspectral Remote Sensing)

► Show Figures

Figure 1

21 pages, 4595 KB

Open AccessArticle

Weakly Supervised Semantic Segmentation of Remote Sensing Images Using Siamese Affinity Network

by Zheng Chen, Yuheng Lian, Jing Bai, Jingsen Zhang, Zhu Xiao and Biao Hou

Remote Sens. 2025, 17(5), 808; https://doi.org/10.3390/rs17050808 - 25 Feb 2025

Cited by 11 | Viewed by 7050

Abstract

In recent years, weakly supervised semantic segmentation (WSSS) has garnered significant attention in remote sensing image analysis due to its low annotation cost. To address the issues of inaccurate and incomplete seed areas and unreliable pseudo masks in WSSS, we propose a novel [...] Read more.

In recent years, weakly supervised semantic segmentation (WSSS) has garnered significant attention in remote sensing image analysis due to its low annotation cost. To address the issues of inaccurate and incomplete seed areas and unreliable pseudo masks in WSSS, we propose a novel WSSS method for remote sensing images based on the Siamese Affinity Network (SAN) and the Segment Anything Model (SAM). First, we design a seed enhancement module for semantic affinity, which strengthens contextual relevance in the feature map by enforcing a unified constraint principle of cross-pixel similarity, thereby capturing semantically similar regions within the image. Second, leveraging the prior notion of cross-view consistency, we employ a Siamese network to regularize the consistency of CAMs from different affine-transformed images, providing additional supervision for weakly supervised learning. Finally, we utilize the SAM segmentation model to generate semantic superpixels, expanding the original CAM seeds to more completely and accurately extract target edges, thereby improving the quality of segmentation pseudo masks. Experimental results on the large-scale remote sensing datasets DRLSD and ISPRS Vaihingen demonstrate that our method achieves segmentation performance close to that of fully supervised semantic segmentation (FSSS) methods on both datasets. Ablation studies further verify the positive optimization effect of each module on segmentation pseudo labels. Our approach exhibits superior localization accuracy and precise visualization effects across different backbone networks, achieving state-of-the-art localization performance. Full article

► Show Figures

Figure 1

24 pages, 8674 KB

Open AccessArticle

Superpixel Classification with the Aid of Neighborhood for Water Mapping in SAR Imagery

by Tomokazu Miyamoto

Remote Sens. 2024, 16(23), 4576; https://doi.org/10.3390/rs16234576 - 6 Dec 2024

Cited by 2 | Viewed by 1936

Abstract

Water mapping for satellite imagery has been an active research field for many applications, in particular natural disasters such as floods. Synthetic Aperture Radar (SAR) provides high-resolution imagery without constraints on weather conditions. The single-date SAR approach is less accurate than the multi-temporal [...] Read more.

Water mapping for satellite imagery has been an active research field for many applications, in particular natural disasters such as floods. Synthetic Aperture Radar (SAR) provides high-resolution imagery without constraints on weather conditions. The single-date SAR approach is less accurate than the multi-temporal approach but can produce results more promptly. This paper proposes novel segmentation schemes that are designed to process both a target superpixel and its surrounding ones for the input for machine learning. Mixture-based Superpixel-Shallow Deit-Ti/XGBoost (MISP-SDT/XGB) schemes are devised to generate, annotate, and classify superpixels, and perform the land/water segmentation of SAR imagery. These schemes are applied to Sentinel-1 SAR data to examine segmentation performances. Single/mask/neighborhood models and single/neighborhood models are introduced in the MISP-SDT scheme and the MISP-XGB scheme, respectively. The effects of the contextual information about the target and its neighbor superpixels are assessed on its segmentation performances. Regarding polarization, it is shown that the VH mode produces more encouraging results than the VV, which is consistent with previous studies. Also, under our MISP-SDT/XGP schemes, the neighborhood models show better performances than FCNN models. Overall, the neighborhood model gives better performances than the single model. Results from attention maps and feature importance scores show that neighbor regions are looked at or used by the algorithms in the neighborhood models. Our findings suggest that under our schemes, the contextual information has positive effects on land/water segmentation. Full article

► Show Figures

Figure 1

22 pages, 4894 KB

Open AccessArticle

SMALE: Hyperspectral Image Classification via Superpixels and Manifold Learning

by Nannan Liao, Jianglei Gong, Wenxing Li, Cheng Li, Chaoyan Zhang and Baolong Guo

Remote Sens. 2024, 16(18), 3442; https://doi.org/10.3390/rs16183442 - 17 Sep 2024

Cited by 5 | Viewed by 2333

Abstract

As an extremely efficient preprocessing tool, superpixels have become more and more popular in various computer vision tasks. Nevertheless, there are still several drawbacks in the application of hyperspectral image (HSl) processing. Firstly, it is difficult to directly apply superpixels because of the [...] Read more.

As an extremely efficient preprocessing tool, superpixels have become more and more popular in various computer vision tasks. Nevertheless, there are still several drawbacks in the application of hyperspectral image (HSl) processing. Firstly, it is difficult to directly apply superpixels because of the high dimension of HSl information. Secondly, existing superpixel algorithms cannot accurately classify the HSl objects due to multi-scale feature categorization. For the processing of high-dimensional problems, we use the principle of PCA to extract three principal components from numerous bands to form three-channel images. In this paper, a novel superpixel algorithm called Seed Extend by Entropy Density (SEED) is proposed to alleviate the seed point redundancy caused by the diversified content of HSl. It also focuses on breaking the dilemma of manually setting the number of superpixels to overcome the difficulty of classification imprecision caused by multi-scale targets. Next, a space–spectrum constraint model, termed Hyperspectral Image Classification via superpixels and manifold learning (SMALE), is designed, which integrates the proposed SEED to generate a dimensionality reduction framework. By making full use of spatial context information in the process of unsupervised dimension reduction, it could effectively improve the performance of HSl classification. Experimental results show that the proposed SEED could effectively promote the classification accuracy of HSI. Meanwhile, the integrated SMALE model outperforms existing algorithms on public datasets in terms of several quantitative metrics. Full article

(This article belongs to the Special Issue Geospatial Object Detection and Geographic Image Classification Based on Remote Sensing Imagery)

► Show Figures

Figure 1

19 pages, 4407 KB

Open AccessArticle

Superpixels with Content-Awareness via a Two-Stage Generation Framework

by Cheng Li, Nannan Liao, Zhe Huang, He Bian, Zhe Zhang and Long Ren

Symmetry 2024, 16(8), 1011; https://doi.org/10.3390/sym16081011 - 8 Aug 2024

Viewed by 2265

Abstract

The superpixel usually serves as a region-level feature in various image processing tasks, and is known for segmentation accuracy, spatial compactness and running efficiency. However, since these properties are intrinsically incompatible, there is still a compromise within the overall performance of existing superpixel [...] Read more.

The superpixel usually serves as a region-level feature in various image processing tasks, and is known for segmentation accuracy, spatial compactness and running efficiency. However, since these properties are intrinsically incompatible, there is still a compromise within the overall performance of existing superpixel algorithms. In this work, the property constraint in superpixels is relaxed by in-depth understanding of the image content, and a novel two-stage superpixel generation framework is proposed to produce content-aware superpixels. In the global processing stage, a diffusion-based online average clustering framework is introduced to efficiently aggregate image pixels into multiple superpixel candidates according to color and spatial information. During this process, a centroid relocation strategy is established to dynamically guide the region updating. According to the area feature in manifold space, several superpixel centroids are then split or merged to optimize the regional representation of image content. Subsequently, local updating is adopted on pixels in those superpixel regions to further improve the performance. As a result, the dynamic centroid relocating strategy offers online averaging clustering the property of content awareness through coarse-to-fine label updating. Extensive experiments verify that the produced superpixels achieve desirable and comprehensive performance on boundary adherence, visual satisfactory and time consumption. The quantitative results are on par with existing state-of-the-art algorithms in terms with several common property metrics. Full article

(This article belongs to the Special Issue Image Processing and Symmetry: Topics and Applications)

► Show Figures

Figure 1

18 pages, 5080 KB

Open AccessArticle

SPT-UNet: A Superpixel-Level Feature Fusion Network for Water Extraction from SAR Imagery

by Teng Zhao, Xiaoping Du, Chen Xu, Hongdeng Jian, Zhipeng Pei, Junjie Zhu, Zhenzhen Yan and Xiangtao Fan

Remote Sens. 2024, 16(14), 2636; https://doi.org/10.3390/rs16142636 - 18 Jul 2024

Cited by 9 | Viewed by 2658

Abstract

Extracting water bodies from synthetic aperture radar (SAR) images plays a crucial role in the management of water resources, flood monitoring, and other applications. Recently, transformer-based models have been extensively utilized in the remote sensing domain. However, due to regular patch-partition and weak [...] Read more.

Extracting water bodies from synthetic aperture radar (SAR) images plays a crucial role in the management of water resources, flood monitoring, and other applications. Recently, transformer-based models have been extensively utilized in the remote sensing domain. However, due to regular patch-partition and weak inductive bias, transformer-based models face challenges such as edge serration and high data dependency when used for water body extraction from SAR images. To address these challenges, we introduce a new model, the Superpixel-based Transformer (SPT), based on the adaptive characteristic of superpixels and knowledge constraints of the adjacency matrix. (1) To mitigate edge serration, the SPT replaces regular patch partition with superpixel segmentation to fully utilize the internal homogeneity of superpixels. (2) To reduce data dependency, the SPT incorporates a normalized adjacency matrix between superpixels into the Multi-Layer Perceptron (MLP) to impose knowledge constraints. (3) Additionally, to integrate superpixel-level learning from the SPT with pixel-level learning from the CNN, we combine these two deep networks to form SPT-UNet for water body extraction. The results show that our SPT-UNet is competitive compared with other state-of-the-art extraction models, both in terms of quantitative metrics and visual effects. Full article

(This article belongs to the Special Issue Recent Advances in Water Resources and Water Environmental Monitoring with Remote Sensing Techniques)

► Show Figures

Figure 1

14 pages, 5585 KB

Open AccessArticle

Vision-Based On-Road Nighttime Vehicle Detection and Tracking Using Improved HOG Features

by Li Zhang, Weiyue Xu, Cong Shen and Yingping Huang

Sensors 2024, 24(5), 1590; https://doi.org/10.3390/s24051590 - 29 Feb 2024

Cited by 22 | Viewed by 5223

Abstract

The lack of discernible vehicle contour features in low-light conditions poses a formidable challenge for nighttime vehicle detection under hardware cost constraints. Addressing this issue, an enhanced histogram of oriented gradients (HOGs) approach is introduced to extract relevant vehicle features. Initially, vehicle lights [...] Read more.

The lack of discernible vehicle contour features in low-light conditions poses a formidable challenge for nighttime vehicle detection under hardware cost constraints. Addressing this issue, an enhanced histogram of oriented gradients (HOGs) approach is introduced to extract relevant vehicle features. Initially, vehicle lights are extracted using a combination of background illumination removal and a saliency model. Subsequently, these lights are integrated with a template-based approach to delineate regions containing potential vehicles. In the next step, the fusion of superpixel and HOG (S-HOG) features within these regions is performed, and the support vector machine (SVM) is employed for classification. A non-maximum suppression (NMS) method is applied to eliminate overlapping areas, incorporating the fusion of vertical histograms of symmetrical features of oriented gradients (V-HOGs). Finally, the Kalman filter is utilized for tracking candidate vehicles over time. Experimental results demonstrate a significant improvement in the accuracy of vehicle recognition in nighttime scenarios with the proposed method. Full article

(This article belongs to the Special Issue Object Detection and IOU Based on Sensors: Methods and Applications)

► Show Figures

Figure 1

Search Results (43)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Saved Queries

Search Filter Reset All

Years

Feature Papers

Subjects

Journals

Article Types

Countries / Regions

Search Results (43)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI