Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Article Types

Countries / Regions

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Search Results (5,684)

Search Parameters:
Keywords = local feature enhancement

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
17 pages, 16241 KB  
Article
Morphokinematic Structure of the Planetary Nebula NGC 6563
by Zahra Al, Federico Soto-Badilla, Yüksel Karataş, Gerardo Ramos-Larios and Roberto Vázquez
Galaxies 2026, 14(3), 60; https://doi.org/10.3390/galaxies14030060 (registering DOI) - 15 Jun 2026
Abstract
We present a morphokinematic analysis based on high-resolution long-slit echelle spectroscopy of the [N ii]λ6583 line and narrowband imaging. Position–velocity diagrams reveal asymmetric expansion and localized kinematic features. We derive a systemic velocity of [...] Read more.
We present a morphokinematic analysis based on high-resolution long-slit echelle spectroscopy of the [N ii]λ6583 line and narrowband imaging. Position–velocity diagrams reveal asymmetric expansion and localized kinematic features. We derive a systemic velocity of VsysLSR=25±1 km s−1 (VsysHEL=34±1 km s−1) and a main shell expansion velocity of Vexp=22±1 km s−1. Three-dimensional modeling indicates an ellipsoidal main body surrounded by a thin shell, two ear-like protrusions, and additional small-scale structures. The corresponding kinematic ages are 3600±700 yr for the ellipsoid and ring, and 7500±1000 yr and 8800±1500 yr for the two opposite ear-like protrusions, respectively, indicating that these outer structures predate the main nebular envelope. The kinematic asymmetry and enhanced emission regions suggest evolution within a non-uniform ambient medium. At the same time, the presence of collimated ear-like structures is consistent with shaping influenced by binary interaction, where earlier outflows preceded the ejection of the dense shell. NGC 6563 therefore appears to be a dynamically evolved system shaped by the combined effects of episodic mass ejection and environmental interaction. Full article
(This article belongs to the Special Issue Origins and Models of Planetary Nebulae, 2nd Edition)
Show Figures

Figure 1

36 pages, 32050 KB  
Article
Semantic Segmentation of Pegmatite Dikes in High-Resolution Remote Sensing Imagery Using GAD-UNet++ in the Yilanlike Area, South Tianshan
by Zirui Wu, Chuan Chen, Yuanjun Yu, Yong Tian, Jian Yu and Fang Xia
Remote Sens. 2026, 18(12), 1988; https://doi.org/10.3390/rs18121988 (registering DOI) - 15 Jun 2026
Abstract
Pegmatite dikes are important prospecting indicators for rare-metal deposits, whereas traditional methods for pegmatite dike identification are constrained by the limited capability of human visual interpretation to capture information from remote sensing imagery, resulting in low identification accuracy and efficiency. In recent years, [...] Read more.
Pegmatite dikes are important prospecting indicators for rare-metal deposits, whereas traditional methods for pegmatite dike identification are constrained by the limited capability of human visual interpretation to capture information from remote sensing imagery, resulting in low identification accuracy and efficiency. In recent years, global research on semantic segmentation of different surface features and remote sensing-based mineral exploration using deep learning methods and high-resolution remote sensing imagery has made significant progress; however, studies on surface-exposed geological bodies such as pegmatite dikes remain highly insufficient. To address the key problem of efficiently identifying pegmatite dikes in remote sensing imagery, this study proposes an improved model based on UNet++, termed GAD-UNet++. In the field of remote sensing geology, this study constructed a pegmatite dike semantic segmentation dataset based on high-resolution RGB imagery by using 0.66 m RGB imagery for visual delineation and ZY1F hyperspectral data for spectral constraint and label refinement; on this basis, semantic segmentation of surface pegmatite dikes in the Yilanlike area of the South Tianshan Mountains, Xinjiang, was conducted using RGB remote sensing image patches as model input. Specifically, because pegmatite dikes are small targets characterized by slender structures, indistinct boundaries, and sparse regional distribution, this study introduced a lightweight feature extraction structure (GhostNetV2) and a long-range dependency attention module (DFC) at the encoder stage, and further incorporated the Coordinate Attention module (CA) to enhance spatial localization and boundary representation of the targets. Finally, focal cross-entropy loss and a deep supervision strategy were adopted to improve the accuracy of semantic information extraction for pegmatite dikes, as well as the training stability and segmentation accuracy under class-imbalance conditions. The results show that the proposed model achieved an mIoU of 93.11% and an F1-score of 94.95% on the test set. Compared with existing semantic segmentation models, the proposed model achieved superior performance in both identification accuracy and computational efficiency for pegmatite dikes. In addition, this study delineated 18 potential pegmatite dike enrichment zones in the Yilanlike area, providing technical support for remote sensing-based rare-metal prospecting and geological interpretation in the study area. Full article
(This article belongs to the Section Remote Sensing in Geology, Geomorphology and Hydrology)
Show Figures

Figure 1

26 pages, 6707 KB  
Article
BDRNet: Background-Aware Dynamic-Scale Routing Network for UAV Remote Sensing Object Detection
by Xuelong Zheng, Faming Shao, Qing Liu, Juying Dai, Yiming Yue, Tao Zhang and Caian Chen
Remote Sens. 2026, 18(12), 1987; https://doi.org/10.3390/rs18121987 (registering DOI) - 15 Jun 2026
Abstract
Object detection in UAV remote sensing imagery remains challenging due to severe scale variation, dense object distributions, complex background clutter, and localization ambiguity caused by extremely small objects. To address these issues, this paper proposes BDRNet, a lightweight background-aware dynamic-scale routing network for [...] Read more.
Object detection in UAV remote sensing imagery remains challenging due to severe scale variation, dense object distributions, complex background clutter, and localization ambiguity caused by extremely small objects. To address these issues, this paper proposes BDRNet, a lightweight background-aware dynamic-scale routing network for UAV remote sensing object detection. First, a background-aware feature enhancement (BAFE) module is introduced into the backbone to enhance feature representation through horizontal and vertical contextual modeling, improving target-related responses in complex aerial scenes. Second, a dynamic-scale routing pyramid (DSRP) is designed to retain the high-resolution P2 branch and adaptively integrate multi-scale features through spatially dynamic routing, alleviating the loss of fine-grained information and improving the representation of small and scale-varied objects. Third, a scale- and geometry-aware normalized Wasserstein distance (SGNW) loss is proposed by modeling bounding boxes as two-dimensional Gaussian distributions. By incorporating aspect-ratio-guided geometric weighting and scale-aware dynamic fusion, SGNW improves regression stability for small objects while preserving geometric constraints for medium and large targets. Extensive experiments on the VisDrone2019 and UAVDT datasets demonstrate that BDRNet consistently improves detection accuracy over the YOLOv10s detector while maintaining a comparable model size and computational cost. Compared with several mainstream lightweight detectors, BDRNet achieves a favorable accuracy–efficiency trade-off, demonstrating its effectiveness for UAV remote sensing object detection in complex aerial scenarios. Full article
Show Figures

Figure 1

22 pages, 7177 KB  
Article
Optimization-Oriented Vision-Guided Robotic Grasping for Bolt Handling in Intelligent Manufacturing
by Pengzhan Fu, Zhenlin Zhang, Long Liu, Yingze Xi, Xingwei Zhao and Xuan Wang
Mathematics 2026, 14(12), 2133; https://doi.org/10.3390/math14122133 (registering DOI) - 15 Jun 2026
Abstract
Accurate detection and reliable grasping of small bolts are essential for intelligent manufacturing and automated assembly. However, this remains a challenge due to the small size, slender geometry, and metallic reflective surfaces of bolts. In this paper, we propose a vision-guided robotic bolt [...] Read more.
Accurate detection and reliable grasping of small bolts are essential for intelligent manufacturing and automated assembly. However, this remains a challenge due to the small size, slender geometry, and metallic reflective surfaces of bolts. In this paper, we propose a vision-guided robotic bolt handling framework that integrates lightweight object detection, optimization-oriented grasp execution, and collision-aware trajectory planning. The lightweight YOLOv8n-BoltLite detector, improved with E-C2f, LCA, SA-PAN, and WD-IoU loss, enhances localization accuracy and feature representation for small and slender bolts. A robotic grasping framework is designed to transform detection results into executable robotic actions through 3D pose estimation, mid-shank grasp point generation, and optimization-oriented execution formulation. Additionally, a five-segment trajectory planning strategy ensures safe and efficient robot motion. Experimental results show that YOLOv8n-BoltLite achieves a five-run average mAP of 99.64 ± 0.05% with 198 FPS, and 3.02 M parameters. On an additional challenging external test set involving illumination variation, clutter, partial occlusion, reflection, and clustered bolts, the proposed detector achieves 94.62 ± 0.18%, outperforming recent lightweight detectors under the same training protocol. Robotic experiments involving 1000 controlled grasping trials and 300 multi-target grasping attempts demonstrate a controlled-condition success rate of 97.0% and improved target-selection reliability in multi-bolt scenes. These results suggest that the proposed framework offers a practical and efficient solution for automated bolt handling in intelligent manufacturing environments. Full article
Show Figures

Figure 1

21 pages, 3582 KB  
Article
An Improved YOLOv8n Method for Small Thermal Defect Detection of Photovoltaic Modules in UAV Infrared Inspection
by Tengfei He, Zhongyuan Mao and Yuanchang Zhong
Remote Sens. 2026, 18(12), 1986; https://doi.org/10.3390/rs18121986 (registering DOI) - 15 Jun 2026
Abstract
To address missed detections, false alarms, and deployment limitations in thermal defect detection of photovoltaic modules from unmanned aerial vehicle (UAV) infrared images, this paper proposes an improved detection method based on You Only Look Once version 8 nano (YOLOv8n). The proposed method [...] Read more.
To address missed detections, false alarms, and deployment limitations in thermal defect detection of photovoltaic modules from unmanned aerial vehicle (UAV) infrared images, this paper proposes an improved detection method based on You Only Look Once version 8 nano (YOLOv8n). The proposed method is optimized according to the characteristics of UAV infrared photovoltaic inspection, including small thermal targets, weak and diffuse thermal responses, complex backgrounds, and lightweight deployment requirements. Specifically, a P2 shallow feature layer is introduced to enhance fine-grained feature perception for small thermal defects, while Ghost Convolution (GhostConv) is incorporated into the backbone to reduce model complexity. In addition, C2f-Large Separable Kernel Attention (C2f-LSKA) is embedded in the neck to strengthen contextual and spatial feature modeling under complex infrared backgrounds, and Wise-IoU version 3 (WIoUv3) is adopted to improve bounding box regression and localization stability for boundary-ambiguous thermal anomalies. Experiments are conducted on a self-constructed UAV infrared thermal imaging dataset. From nearly 10,000 inspection images, 3000 representative images are selected and manually annotated, covering typical challenges such as small hot spots, low-contrast defects, complex background interference, and diffuse abnormal temperature-rise regions. Compared with the baseline YOLOv8n, the proposed method improves Precision, Recall, mean average precision at an IoU threshold of 0.5 (mAP@0.5), and mean average precision averaged over IoU thresholds from 0.5 to 0.95 (mAP@0.5:0.95) by 5.1, 11.4, 9.6, and 13.2 percentage points, respectively, while reducing the number of parameters and model size by 65.8% and 61.9%, respectively. These results indicate that the proposed method improves detection accuracy and localization quality under the evaluated UAV infrared inspection setting while maintaining lightweight characteristics. Full article
Show Figures

Figure 1

26 pages, 4334 KB  
Article
RKF-YOLO: A Lightweight Dual-Task Model for Illegal Parking Detection and License Plate Recognition on Edge Devices
by Hao Chen, Yao Li, Yong Jia, Guangle Yao and Ruipeng Zhu
Electronics 2026, 15(12), 2638; https://doi.org/10.3390/electronics15122638 (registering DOI) - 15 Jun 2026
Abstract
To address the joint requirements of illegal parking detection and license plate recognition under complex traffic scenarios and limited edge-device resources, this study proposes RKF-YOLO, a lightweight dual-task model based on improved YOLOv11n that integrates Rep-CSP structural optimization, knowledge-transfer-enhanced training (KTET), and Focal-CIoU [...] Read more.
To address the joint requirements of illegal parking detection and license plate recognition under complex traffic scenarios and limited edge-device resources, this study proposes RKF-YOLO, a lightweight dual-task model based on improved YOLOv11n that integrates Rep-CSP structural optimization, knowledge-transfer-enhanced training (KTET), and Focal-CIoU loss. Compared with YOLOv11n, RKF-YOLO reduces parameters and FLOPs by 38.2% and 38.1%, respectively, while improving mAP@0.5 and mAP@0.5:0.95 by 0.6 and 1.1 percentage points for parking detection; for plate detection, Focal-CIoU improves mAP@0.5:0.95 by 1.3 percentage points and contributes to a recognition accuracy of 95.7%. The unified framework uses a shared backbone and task-oriented detection heads to support vehicle-level illegal parking detection and license-plate-oriented localization. Rep-CSP enhances multi-scale feature representation, asymmetric channel reduction with feature compensation reduces redundant computation, and KTET improves convergence through optimizer and learning-rate migration. Deployment on RK3588 achieves 59.5 FPS for parking detection and 95.1% recognition accuracy, demonstrating real-time performance and practical applicability on resource-constrained edge devices. Full article
Show Figures

Figure 1

17 pages, 27637 KB  
Article
Precise Geometric-Priors-Guided 3D Point Cloud Segmentation Network for Auricle Region: GeoPriors-3DEarSeg
by Li Yuan, Wenhao Zuo, Anting Guo and Wenjiang Huang
Appl. Sci. 2026, 16(12), 6033; https://doi.org/10.3390/app16126033 (registering DOI) - 15 Jun 2026
Abstract
This paper addresses the core challenges in the 3D point cloud segmentation of the human auricle region, including the low distinguishability of weak-boundary features, the lack of anatomical priors in existing methods, the tendency of boundary features to be overwhelmed by background noise, [...] Read more.
This paper addresses the core challenges in the 3D point cloud segmentation of the human auricle region, including the low distinguishability of weak-boundary features, the lack of anatomical priors in existing methods, the tendency of boundary features to be overwhelmed by background noise, and the weakening of supervisory signals. We propose GeoPriors-3DEarSeg, a precise geometric-priors-guided 3D point cloud segmentation network for auricle regions. The network incorporates complementary geometric features from three dimensions, normal vector orientation, local Gaussian curvature, and shape diameter function, to characterize the intrinsic geometric differences at the auricle boundary. A geometric-priors-guided QKV gated attention mechanism is designed to selectively enhance the expression of weak-boundary features. Additionally, we introduce a boundary-aware loss function, NVBLoss, which does not rely on extra annotations to strengthen the supervision of boundary features. The experimental results on a human ear dataset demonstrate that our method achieves a mean intersection over union (mIoU) of 0.9857, outperforms the comparison methods, and precisely segments of weak boundaries in the auricle region. This work provides a technical basis for accurate auricle-region point cloud segmentation and offesr methodological insights for future studies on the weak-boundary segmentation of other medical anatomical structures. Full article
Show Figures

Figure 1

23 pages, 54838 KB  
Article
MMARNet: Two-Stage Remote Sensing Image Registration with Multimodal Attention Mechanism
by Xiangzeng Liu, Guanglu Shi, Zhipeng Huang, Jian Ji and Qiguang Miao
Remote Sens. 2026, 18(12), 1983; https://doi.org/10.3390/rs18121983 (registering DOI) - 15 Jun 2026
Abstract
Multimodal image registration is a fundamental yet challenging task, particularly in remote sensing scenarios involving cross-platform, multi-temporal, and cross-modal data. The primary difficulty arises from the coexistence of large-scale geometric distortions and complex local appearance variations across modalities, which makes it difficult for [...] Read more.
Multimodal image registration is a fundamental yet challenging task, particularly in remote sensing scenarios involving cross-platform, multi-temporal, and cross-modal data. The primary difficulty arises from the coexistence of large-scale geometric distortions and complex local appearance variations across modalities, which makes it difficult for a single-stage model to achieve both global alignment and fine-grained correspondence simultaneously. To address this issue, we propose MMARNet, a task-driven coarse-to-fine registration framework that explicitly decomposes multimodal registration into global geometric alignment and local correspondence refinement. Instead of treating registration as a unified problem, the proposed framework sequentially resolves distinct sources of error, leading to improved robustness and accuracy under challenging conditions. In the first stage, MMARNet learns geometry-aware global alignment by identifying structurally reliable regions across modalities and estimating large-scale transformations, effectively reducing the initial misalignment and normalizing the geometric space. In the second stage, the model focuses on residual local discrepancies by learning context-enhanced feature representations, enabling robust keypoint-level matching even under severe modality differences and nonlinear distortions. The two stages are designed to work in a complementary manner, where global alignment significantly simplifies the subsequent local matching process. Extensive experiments on three challenging multimodal datasets demonstrate that MMARNet achieves superior performance in both accuracy and robustness compared to existing methods. The results validate the effectiveness of the proposed problem decomposition and highlight the advantage of the coarse-to-fine optimization strategy for multimodal remote sensing image registration. Full article
Show Figures

Figure 1

20 pages, 4196 KB  
Article
GHM-DEIM: An Improved DEIM-Based Framework for Subtle and Scale-Variant Thermal Anomaly Detection in Photovoltaic UAV Infrared Imagery
by Jianxiang Li, Lang Yang, Wei Huang, Feng Ren and Jing Hu
Sensors 2026, 26(12), 3796; https://doi.org/10.3390/s26123796 (registering DOI) - 14 Jun 2026
Abstract
With the increasing demand for low-carbon energy, automated defect detection using unmanned aerial vehicle (UAV)-based thermal inspection has become essential for maintaining the reliability of photovoltaic systems. However, existing methods still suffer from low-contrast thermal imagery, large-scale variations of defects, and subtle thermal [...] Read more.
With the increasing demand for low-carbon energy, automated defect detection using unmanned aerial vehicle (UAV)-based thermal inspection has become essential for maintaining the reliability of photovoltaic systems. However, existing methods still suffer from low-contrast thermal imagery, large-scale variations of defects, and subtle thermal anomalies. To address these challenges, this study proposes Grouped-Hypergraph-Modulation DEIM (GHM-DEIM), a robust end-to-end detection framework based on an improved DEIM architecture. Specifically, a grouped multi-scale aggregation attention network is introduced to enhance global thermal perception and recover discriminative features from blurred backgrounds. In addition, an enhanced encoder incorporating a hypergraph-based context encoding mechanism is designed to model high-order non-local relationships and improve feature representation across different defect scales. Furthermore, a modulation fusion module is employed to adaptively refine multi-scale feature responses and suppress environmental noise interference. Extensive experiments conducted on the ThermoSolar-PV and PV-HSD-2025 datasets demonstrate that the proposed method consistently outperforms state-of-the-art detectors, achieving mAP@50 values of 88.6% and 74.2%, respectively, with improvements of 4.7% and 2.9% over the baseline. These results demonstrate the effectiveness and robustness of GHM-DEIM for UAV-based PV thermal defect inspection. Full article
(This article belongs to the Section Sensors and Robotics)
Show Figures

Figure 1

15 pages, 3948 KB  
Article
Machine Learning-Based Analysis of Elastic Springback in Bending of SS, Al, and Cu Sheets with Localized Heating
by Naser A. Alsaleh
J. Manuf. Mater. Process. 2026, 10(6), 207; https://doi.org/10.3390/jmmp10060207 (registering DOI) - 14 Jun 2026
Abstract
Elastic springback is a critical challenge in sheet metal bending that directly affects dimensional accuracy and manufacturing efficiency. This study presents a comparative experimental and machine learning-based analysis of elastic springback behavior in three widely used sheet metals like stainless steel, aluminum, and [...] Read more.
Elastic springback is a critical challenge in sheet metal bending that directly affects dimensional accuracy and manufacturing efficiency. This study presents a comparative experimental and machine learning-based analysis of elastic springback behavior in three widely used sheet metals like stainless steel, aluminum, and copper, which are subjected to folding bending. The influence of key process parameters, namely sheet thickness (0.5 to 1.5 mm) and bending temperature (room temperature to 200 °C), was systematically examined under cold working. A cost-effective localized heating approach using a direct flame was introduced to enhance process control and reduce elastic recovery without the complexity associated with heated dies. Experimental results revealed substantial variability in elastic springback, ranging from 0.15% to 12.41%, emphasizing the fact that they are nonlinear in nature. Statistical evaluation confirmed that sheet thickness is the dominant factor governing elastic springback, while material type and temperature exhibit secondary yet meaningful effects. To improve predictive capability, five regression models (Linear, Polynomial, Support Vector, Random Forest, and Gradient Boosting) were developed and assessed. Among them, Random Forest demonstrated superior performance with the lowest prediction errors and strongest explanatory power, achieving an R2 of approximately 0.85. Cross-validation further validated its robustness and generalization capability. Feature importance and SHapley Additive exPlanations (SHAP) analyses reinforced the primary role of thickness in determining elastic recovery behavior. The findings provide practical insights for selecting materials and process conditions to minimize elastic springback while highlighting the effectiveness of ensemble learning techniques for accurate prediction. This work contributes a consistent framework for enhancing bending precision and supports data-driven decision-making in modern manufacturing environments. Full article
Show Figures

Figure 1

25 pages, 5937 KB  
Article
CGSTA-Net: A Cross-Domain Generative Prior-Assisted Structure–Texture Adaptive Network for Remote Sensing Image Dehazing
by Xiaoyan Li, Yankun Zhao and Na Niu
Symmetry 2026, 18(6), 1027; https://doi.org/10.3390/sym18061027 (registering DOI) - 14 Jun 2026
Abstract
Dehazing of images is important for proper interpretation of optical images in remote sensing. However, current dehazing networks tend to have limited receptive field and texture information loss caused by conventional downsampling and complementary cross-domain information not being utilized in dehazing frameworks. In [...] Read more.
Dehazing of images is important for proper interpretation of optical images in remote sensing. However, current dehazing networks tend to have limited receptive field and texture information loss caused by conventional downsampling and complementary cross-domain information not being utilized in dehazing frameworks. In order to cope with these problems, we propose a Cross-domain Generative Prior-assisted Structure–Texture Adaptive Network for remote sensing image dehazing. It is a dual-stream encoder–decoder framework, which enhances the domain-specific information of RGB and generated prior, and then integrates them adaptively for haze-free reconstruction. In order to minimize information loss in downsampling, wavelet pooling is introduced to consider the frequency-aware structural and textural features. Additionally, a Structure–Texture Calibration Block is designed to simultaneously improve the local frequency textures and construct sparse long-range dependencies of structures, so as to achieve better restoration performance under spatially non-uniform haze. To appropriately fuse the various representations from RGB and generated prior images, a Prior-aware Gated Adaptive Fusion module is developed to balance the domain-specific features dynamically and keep the fine details at multi-level feature fusion. Finally, we utilize pixel-level contrastive learning to guide the latent space away from hazy distributions, thus enhancing the discriminability of the features. Extensive experiments on the three datasets, namely RSID, RICE-I and HRSD, demonstrate that CGSTA-Net can effectively restore images under varying haze conditions and significantly outperforms the latest dehazing methods in terms of visual quality and quantitative performance. Specifically, compared with the most effective competitive method, CGSTA-Net increased the PSNR by 22.9% on RSID, by 13.2% on RICE-I, and by 7.2% on HRSD. Full article
(This article belongs to the Section Computer)
Show Figures

Figure 1

26 pages, 4861 KB  
Article
Class-Aware Semantic Calibration for Cross-Scene Hyperspectral Image Classification
by Boshan Shi, Yanbo Liu, Youqiang Zhang and Guo Cao
Remote Sens. 2026, 18(12), 1976; https://doi.org/10.3390/rs18121976 (registering DOI) - 14 Jun 2026
Abstract
Cross-scene Hyperspectral Image (HSI) classification faces substantial domain shifts caused by sensor heterogeneity, acquisition variation, and scene diversity. While benchmark annotations are assigned to individual center pixels, local patches often contain implicit multi-label semantics due to spectral mixing and spatial overlap. This mismatch [...] Read more.
Cross-scene Hyperspectral Image (HSI) classification faces substantial domain shifts caused by sensor heterogeneity, acquisition variation, and scene diversity. While benchmark annotations are assigned to individual center pixels, local patches often contain implicit multi-label semantics due to spectral mixing and spatial overlap. This mismatch distorts prediction structure, exacerbates generalization errors, and limits the effectiveness of standard domain generalization (DG) techniques focused solely on feature or prediction invariance. We propose Class-Aware Semantic Calibration (CASC), a systematic semantic structure calibration framework that addresses three complementary distortions induced by mismatched patch supervision: (i) Balance corrects class frequency bias via reweighted supervision; (ii) Separability enhances boundary decision stability through margin-based logit calibration; and (iii) Independence reduces domain-specific spurious co-occurrence via prediction covariance decorrelation. To preserve calibrated semantics under pseudo-source shift, we further introduce a complementary DualAlign (DA) module, which jointly aligns feature statistics and prediction distributions, enforcing consistency at both representation and semantic levels. Extensive experiments on three cross-scene benchmarks (Houston, Pavia, and WHU-Hi) demonstrate that CASC-DA consistently improves performance over strong baselines, achieving an average gain of 3.0% in overall accuracy and 4.9% in Kappa coefficient compared with the best-performing baseline on each dataset. These results underscore the importance of semantic structure calibration for domain-generalized HSI classification. Full article
(This article belongs to the Section Remote Sensing Image Processing)
23 pages, 2717 KB  
Article
3DWaFusion: Three-Dimensional Multiscale Wavelet Convolutional Neural Network for Multimodal Medical Image Fusion
by Yu Wang, Rui Zhang, Zhiqiang Zhang, Ningzhong Liu and Xiulai Wang
Sensors 2026, 26(12), 3784; https://doi.org/10.3390/s26123784 (registering DOI) - 14 Jun 2026
Abstract
Multimodal image fusion is a promising technology designed to fuse information from different medical sensors, which offer structured insights for disease diagnosis and treatment. However, existing 2D-centric fusion methods fail to capture 3D spatial continuity, and conventional wavelet-based approaches lack adaptability to diverse [...] Read more.
Multimodal image fusion is a promising technology designed to fuse information from different medical sensors, which offer structured insights for disease diagnosis and treatment. However, existing 2D-centric fusion methods fail to capture 3D spatial continuity, and conventional wavelet-based approaches lack adaptability to diverse lesion regions and suffer from background artifacts. To address this issue, we propose a 3D multiscale wavelet convolutional neural network for multimodal medical image fusion. Specifically, a 3D Discrete Wavelet Transformation (3D DWT) is introduced to decompose input volumes into multi-frequency bands, isolating anatomical structures and lesion details while reducing 3D spatial redundancy. We embed hierarchical multiple frequency band into a Global and Local Feature Calibration (GLFC) module to adaptively enhance single-modal features by fusing global contextual information and local details. Furthermore, a pyramid group-wise multiscale feature interaction is proposed for capturing complementary features across different spatial scales. Finally, a voxel-wise weighted averaging strategy reconstructs the fused image by adaptively assigning contributions to each modality at every spatial position, effectively eliminating artifacts and improving the visual fidelity of the result. Extensive experiments on the BraTS2020 and Hecktor datasets demonstrate that our proposed method outperforms state-of-the-art (SOTA) fusion methods in both subjective visual quality and objective metrics. Moreover, downstream segmentation validation confirms that fused images from our method significantly improve tumor segmentation accuracy. The source code and pre-trained models will be publicly available. Full article
(This article belongs to the Section Biomedical Sensors)
Show Figures

Figure 1

28 pages, 6836 KB  
Article
Flange Trajectory Prediction for LNG Unloading Arms Using KSE-GRU
by Guicai Liu, Wei Wang, Wuwei Feng, Rongsheng Lin, Chuanyu Wu, Shujie Yang, Zhujun Zhang, Jiahang Du and Liangan Zhang
Appl. Sci. 2026, 16(12), 6013; https://doi.org/10.3390/app16126013 (registering DOI) - 13 Jun 2026
Abstract
To autonomously dock LNG unloading arms under adverse sea states, this study formulates a dynamic docking process as a trajectory forecasting task. By integrating visual-perception-based spatial localization with trajectory acquisition and forecasting, a comprehensive operational pipeline is established. To predict the dynamic trajectory [...] Read more.
To autonomously dock LNG unloading arms under adverse sea states, this study formulates a dynamic docking process as a trajectory forecasting task. By integrating visual-perception-based spatial localization with trajectory acquisition and forecasting, a comprehensive operational pipeline is established. To predict the dynamic trajectory of the vessel flange, an improved KSE-GRU model is proposed. By extracting implicit kinematic features, the model effectively enhances trajectory characterization under extreme sea states, thereby significantly improving forecasting accuracy and worst-case error constraints. To ensure the operational feasibility of autonomous docking, a robust control strategy is introduced to complement the trajectory predictions. The experimental results demonstrate that the proposed model outperforms traditional time-series forecasting models across all evaluation metrics. Compared with the baseline neural network models, the Mean-3D error is reduced by 19.14%, and the Max-3D error is capped at 348.77 mm, representing an 8.8% improvement over the baseline. Furthermore, the model demonstrates clear advantages in maintaining trajectory consistency and forecasting reliability. In summary, in this study, a robust trajectory forecasting model is developed for vessel target flanges integrated with a comprehensive control framework, thereby offering a practical approach to autonomous docking under dynamic oceanic conditions. Full article
Show Figures

Figure 1

23 pages, 19029 KB  
Article
CETransUNet: An Intelligent Landslide Identification Method Based on Collaborative Optimization of Global Context and Dual Attention Mechanisms
by Tianli Sun, Chengsheng Yang, Jifeng Wu, Zewei Liu, Ziqian Wang and Xiaoqiang Cheng
Remote Sens. 2026, 18(12), 1974; https://doi.org/10.3390/rs18121974 (registering DOI) - 13 Jun 2026
Abstract
Accurate landslide identification is crucial for enhancing emergency response capabilities during destructive geological hazards. Although deep-learning-based semantic segmentation has demonstrated effectiveness, substantial variations in landslide scales and environmental similarities continue to challenge existing methods. This paper systematically constructs a new co-seismic landslide dataset [...] Read more.
Accurate landslide identification is crucial for enhancing emergency response capabilities during destructive geological hazards. Although deep-learning-based semantic segmentation has demonstrated effectiveness, substantial variations in landslide scales and environmental similarities continue to challenge existing methods. This paper systematically constructs a new co-seismic landslide dataset for the Yarlung Zangbo River basin based on the 2017 Nyingchi earthquake, effectively filling a critical regional data gap. This paper proposes CETransUNet (coordinate attention and edge-guided attention transformer UNet), a novel landslide detection model that integrates ResNet and Transformer architectures. Specifically, a coordinate attention (CA) module is introduced within the skip connections between the encoder and decoder. This module encodes positional information along both horizontal and vertical spatial directions and dynamically re-weights the feature maps, thereby effectively suppressing background noise caused by semantic gaps and enhancing the model’s ability to localize landslide regions. Additionally, an edge-guided attention (EGA) module is incorporated into the decoder. This module extracts explicit edge priors from the input image using a Laplacian operator and imposes geometric constraints on the predictions via a boundary reverse attention mechanism, thereby significantly alleviating boundary ambiguity and morphological distortion of landslides. Evaluations across datasets from the Yarlung Zangbo River, Iburi-Tobu, and Bijie regions demonstrate that CETransUNet significantly outperforms state-of-the-art models—including TransUNet, SegFormer, and SwinUNet—in terms of IoU, MIoU, and F1-score. Overall, through the synergistic optimization of the coordinate attention and edge-guided attention modules, the CETransUNet model achieves synchronous enhancement of boundary integrity and geometric precision in complex scenarios, providing a reliable technical solution for large-scale intelligent landslide identification. Full article
Show Figures

Figure 1

Back to TopTop