Search Results (3,645)

Search Parameters:
Keywords = UNet

15 pages, 2097 KB  
Article
A Comparative Study on Ocean Front Detection in the Northwestern Pacific Using U-Net and Mask R-CNN
by Caixia Shao, Dianjun Zhang and Xuefeng Zhang
Oceans 2026, 7(2), 29; https://doi.org/10.3390/oceans7020029 (registering DOI) - 31 Mar 2026
Abstract
Ocean fronts play a vital role in modulating climate variability, driving material transport, and maintaining the stability of marine ecosystems. Therefore, accurate identification of ocean fronts is of great significance for marine environmental monitoring and resource management. This study focuses on the Northwestern Pacific region and conducts a systematic comparison between two representative deep learning models—U-Net and Mask R-CNN—for automated ocean front detection. The objective is to evaluate the adaptability and strengths of different network architectures in handling multi-scale features, complex background conditions, and boundary delineation, thereby providing a theoretical basis for model selection and application-specific deployment. Experimental results show that U-Net achieves superior spatial consistency in large-scale frontal segmentation, with an IoU of 0.81 and a Dice coefficient of 0.76, while maintaining relatively high computational efficiency. In contrast, Mask R-CNN demonstrates stronger boundary modeling capabilities in detecting small-scale fronts and handling heterogeneous backgrounds, achieving an IoU of 0.78 and a Dice score of 0.73, though at the cost of increased computational demand. Overall, U-Net is more suitable for broad-scale automatic detection of ocean fronts, whereas Mask R-CNN exhibits greater potential in complex scene recognition. Integrating the structural advantages of both models holds promise for further enhancing the stability and accuracy of frontal detection, thereby offering robust technical support for ocean remote sensing analysis and environmental forecasting. Full article
(This article belongs to the Special Issue Recent Progress in Ocean Fronts)
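The IoU and Dice scores quoted in this abstract can be illustrated with a minimal sketch (not the authors' code; the 4x4 masks below are hypothetical examples):

```python
import numpy as np

def iou_and_dice(pred, target):
    """IoU and Dice coefficient for binary segmentation masks (illustrative sketch)."""
    pred, target = pred.astype(bool), target.astype(bool)
    inter = np.logical_and(pred, target).sum()
    union = np.logical_or(pred, target).sum()
    total = pred.sum() + target.sum()
    iou = inter / union if union else 1.0
    dice = 2 * inter / total if total else 1.0
    return float(iou), float(dice)

# Hypothetical masks: prediction covers columns 0-2, ground truth columns 1-3
pred = np.zeros((4, 4), dtype=int); pred[:, :3] = 1
target = np.zeros((4, 4), dtype=int); target[:, 1:] = 1
iou, dice = iou_and_dice(pred, target)  # overlap 8 px, union 16 px
```

Both metrics reward overlap between predicted and true frontal pixels; Dice weights the intersection more heavily, so it is never smaller than IoU for the same pair of masks.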

32 pages, 2453 KB  
Article
An Improved MSEM-Deeplabv3+ Method for Intelligent Detection of Rock Mass Fractures
by Chi Zhang, Shu Gan, Xiping Yuan, Weidong Luo, Chong Ma and Yi Li
Remote Sens. 2026, 18(7), 1041; https://doi.org/10.3390/rs18071041 - 30 Mar 2026
Abstract
Fractures, as critical discontinuous structural planes in rock masses, directly govern their stability and serve as the core controlling factor in rock mechanics engineering. Existing deep learning models for fracture extraction face persistent challenges, including imbalanced integration of deep and shallow features, limited suppression of background noise, inadequate multi-scale feature representation, and large parameter sizes—making it difficult to strike a balance between detection accuracy and deployment efficiency. Focusing on the Wanshanshan quarry in Yunnan, this study first constructs a high-precision digital model using close-range photogrammetry and 3D real-scene reconstruction. A lightweight yet high-accuracy intelligent detection method, termed MSEM-Deeplabv3+, is then proposed for rock mass fracture extraction. The model adopts lightweight MobileNetV2 as the backbone network, incorporating inverted residual modules and depthwise separable convolutions, resulting in a parameter size of only 6.02 MB and FLOPs of 30.170 G—substantially reducing computational overhead. Furthermore, the proposed MAGF (Multi-Scale Attention Gated Fusion) and SCSA (Spatial-Channel Synergistic Attention) modules are integrated to enhance the representation of fracture details and semantic consistency while effectively suppressing multi-source and multi-scale background interference. Experimental results demonstrate that the proposed model achieves an mPA of 89.69%, mIoU of 83.71%, F1-Score of 90.41%, and Kappa coefficient of 80.81%, outperforming the classic Deeplabv3+ model by 5.81%, 6.18%, 4.53%, and 9.2%, respectively. It also significantly surpasses benchmark models such as U-Net and HRNet. The method accurately captures fine and continuous fracture details, preserves the spatial distribution of long-range continuous fractures, and maintains robust performance on the CFD cross-scene dataset, showcasing strong adaptability and generalization capability.
This approach effectively mitigates the risks associated with manual high-altitude inspections and provides a lightweight, high-precision, non-contact intelligent solution for fracture detection in high-steep rock slopes. Full article
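The parameter savings attributed to depthwise separable convolutions above can be sketched with a quick count (bias terms omitted; the 128-channel, 3x3 configuration is a hypothetical example, not a layer from MSEM-Deeplabv3+):

```python
def standard_conv_params(c_in, c_out, k):
    # a standard k x k convolution mixes all input channels for every output channel
    return c_in * c_out * k * k

def depthwise_separable_params(c_in, c_out, k):
    # one depthwise k x k filter per input channel, then a 1 x 1 pointwise projection
    return c_in * k * k + c_in * c_out

std = standard_conv_params(128, 128, 3)        # 147,456 weights
sep = depthwise_separable_params(128, 128, 3)  # 17,536 weights
ratio = std / sep                              # roughly 8x fewer parameters
```

This roughly k*k-fold reduction is the mechanism behind MobileNetV2-style backbones keeping the parameter size small.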
17 pages, 3863 KB  
Article
SemiWaferNet: Efficient Semi-Supervised Hybrid CNN–Transformer Models for Wafer Defect Classification and Segmentation
by Ruiwen Shi, Ruihan Liu, Zhiguo Zhou and Xuehua Zhou
Electronics 2026, 15(7), 1437; https://doi.org/10.3390/electronics15071437 (registering DOI) - 30 Mar 2026
Abstract
Wafer defect analysis is important for semiconductor manufacturing, but labeled data are limited, and class distributions are highly imbalanced. We present a semi-supervised framework with two lightweight hybrid CNN–Transformer models for wafer defect classification and segmentation. For classification, HybridCNN-ViT combines CNN-based local feature extraction with Transformer-based global context modeling, and adopts a three-stage progressive pseudo-labeling strategy to leverage unlabeled samples. The pseudo-label selection mechanism is systematically calibrated to improve pseudo-label reliability under limited labeled data. For segmentation, ConvoFormer-UNet integrates convolution-enhanced embeddings with Transformer blocks to balance boundary detail and global context. On the public WM-811K dataset, HybridCNN-ViT achieves 98.72% accuracy and 0.9985 macro-AUC under the semi-supervised setting for classification, while ConvoFormer-UNet reaches 99.19% IoU for segmentation with fewer parameters than several baselines. We also report efficiency on a single GPU to illustrate practical inference speed. Full article
(This article belongs to the Section Artificial Intelligence)
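Confidence-thresholded pseudo-label selection of the kind this abstract describes can be sketched as follows (the 0.95 threshold and the toy probabilities are assumptions for illustration; the paper calibrates its own selection mechanism):

```python
import numpy as np

def select_pseudo_labels(probs, threshold=0.95):
    """Keep unlabeled samples whose top class probability clears the threshold."""
    confidence = probs.max(axis=1)
    labels = probs.argmax(axis=1)
    keep = confidence >= threshold
    return labels[keep], np.flatnonzero(keep)

probs = np.array([[0.97, 0.03],   # confident -> pseudo-labeled as class 0
                  [0.60, 0.40],   # uncertain -> discarded
                  [0.02, 0.98]])  # confident -> pseudo-labeled as class 1
labels, kept = select_pseudo_labels(probs)
```

A progressive strategy then repeats this selection over several stages, typically relaxing the threshold as the model improves.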

22 pages, 7692 KB  
Article
SSF-TransUnet: Fine-Grained Crop Classification via Cross-Source Spatial Spectral Fusion
by Jian Yan, Xueke Chen, Rongrong Ren, Xiaofei Mi, Zhanliang Yuan, Jian Yang, Xianhong Meng, Zhenzhao Jiang, Hongbo Zhu and Yong Liu
Remote Sens. 2026, 18(7), 1034; https://doi.org/10.3390/rs18071034 - 30 Mar 2026
Abstract
Accurate exploitation of spatial structures and spectral characteristics is essential for fine-grained crop classification using remote sensing imagery. Although multi-source remote sensing data provide complementary information, most existing methods implicitly assume homogeneous data sources with consistent spatial resolution. In practice, high spatial resolution and rich spectral information are usually provided by different sensors, making cross-source spatial–spectral fusion a non-trivial challenge. To address this issue, we propose SSF-TransUnet, a dual-branch spatial–spectral joint modeling framework for fine crop classification. The proposed network explicitly decouples spatial structure extraction and spectral discriminability learning by jointly utilizing high spatial resolution imagery and multi-spectral observations acquired from different satellite sensors within a unified architecture. To support model training and evaluation, we construct SSCR-Agri, a spatial–spectral complementary resolution agricultural dataset integrating meter-level GF-2 imagery and multi-spectral Sentinel-2 data from five representative agricultural regions in northern China, covering five crop categories including corn, rice, wheat, potato, and others. Extensive experiments demonstrate that SSF-TransUnet consistently outperforms representative CNN-based and hybrid CNN–Transformer models. The proposed method achieves an overall accuracy (OA) of 81.84% and a mean Intersection over Union (mIoU) of 0.6954 in fine-grained crop classification, effectively distinguishing crops. These results highlight the effectiveness of spatial–spectral joint modeling for high-resolution crop mapping, demonstrate its potential for precision agriculture and large-scale agricultural monitoring applications, and show promise when combined with multi-temporal observations. Full article

26 pages, 4917 KB  
Article
A Comprehensive Clinical Decision Support System for the Early Diagnosis of Axial Spondyloarthritis: Multi-Sequence MRI, Clinical Risk Integration, and Explainable Segmentation
by Fatih Tarakci, Ilker Ali Ozkan, Musa Dogan, Halil Ozer, Dilek Tezcan and Sema Yilmaz
Diagnostics 2026, 16(7), 1037; https://doi.org/10.3390/diagnostics16071037 - 30 Mar 2026
Abstract
Background/Objectives: This study aims to develop a comprehensive Clinical Decision Support System (CDSS) that integrates multi-sequence sacroiliac joint (SIJ) MRIs with rheumatological, clinical, and laboratory findings into the decision-making process for the early diagnosis of axial spondyloarthritis (axSpA), incorporating segmentation-supported explainability. Methods: Multi-sequence SIJ MRI data (T1-WI, T2-WI, STIR, and PD-WI) were analysed from 367 participants (n = 193 axSpA; n = 174 non-axSpA controls). Sequence-based classification was performed using VGG16, ResNet50, DenseNet121, and InceptionV3 models; additionally, a lightweight and parameter-efficient SacroNet architecture was developed. Slice-level probability scores were converted to patient-level scores using the Dynamic Top-K Averaging method. Image-based scores were combined with a logistic regression-based clinical risk score using weighted linear integration (0.60 image/0.40 clinical) and a conservative threshold (τ = 0.70). Grad-CAM was applied for visual interpretability. Furthermore, to support the diagnostic outcomes with precise spatial data, active inflammation in STIR and T2-WI sequences was segmented. For this purpose, the MDC-UNet model was employed and compared with baseline U-Net derivatives. Results: Sequence-specific analysis showed VGG16 performing best on T1-WI (AUC = 0.920; Accuracy = 0.878) and DenseNet121 on STIR (AUC = 0.793; Accuracy = 0.771). The SacroNet architecture provided competitive classification performance at the patient level despite its low number of parameters (~110 K). Furthermore, MDC-UNet successfully segmented active inflammation, yielding Dice scores of 0.752 (HD95: 19.25) for STIR and 0.682 (HD95: 26.21) for T2-WI. 
Conclusions: The findings demonstrate that patient-level decision integration based on multi-sequence MRI, when used in conjunction with clinical risk scoring and segmentation-assisted interpretability, can provide a feasible and interpretable DSS framework for the early diagnosis of axSpA. Full article
(This article belongs to the Section Machine Learning and Artificial Intelligence in Diagnostics)
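The slice-to-patient aggregation and the weighted image/clinical integration described above can be sketched as follows. This is a simplification: the paper's Dynamic Top-K Averaging chooses K dynamically, whereas this toy version fixes K, and the slice scores are hypothetical; the 0.60/0.40 weights and τ = 0.70 threshold are taken from the abstract.

```python
import numpy as np

def topk_average(slice_scores, k):
    """Average the k highest slice-level probabilities into a patient-level score."""
    return float(np.sort(np.asarray(slice_scores))[-k:].mean())

def fuse(image_score, clinical_score, w_img=0.60, w_clin=0.40, tau=0.70):
    """Weighted linear integration of image and clinical scores with a conservative threshold."""
    combined = w_img * image_score + w_clin * clinical_score
    return combined, combined >= tau

patient = topk_average([0.2, 0.9, 0.8, 0.95, 0.1], k=3)  # mean of 0.95, 0.9, 0.8
score, positive = fuse(patient, clinical_score=0.5)
```

Averaging only the top slices lets a few strongly positive MRI slices dominate the patient-level decision while ignoring uninformative ones.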

29 pages, 6898 KB  
Article
MDE-UNet: A Physically Guided Asymmetric Fusion Network for Multi-Source Meteorological Data Lightning Identification
by Yihua Chen, Yuanpeng Han, Yujian Zhang, Yi Liu, Lin Song, Jialei Wang, Xinjue Wang and Qilin Zhang
Remote Sens. 2026, 18(7), 1027; https://doi.org/10.3390/rs18071027 - 29 Mar 2026
Abstract
Utilizing multi-source meteorological data for lightning identification is crucial for monitoring severe convective weather. However, several key challenges persist in this field: dimensional imbalance and modal competition among multi-source heterogeneous data, model training bias caused by the extreme sparsity of lightning samples, and an imbalance between false alarms and missed detections resulting from complex background noise. To address these challenges, this paper proposes a lightning identification network guided by physical priors and constrained by supervision. First, to tackle the issue of modal competition in fusing satellite (high-dimensional) and radar (low-dimensional) data, a physical prior-guided asymmetric radar information enhancement mechanism is introduced. This mechanism uses radar physical features as contextual guidance to selectively enhance the latent weak radar signatures. Second, at the architectural level, a multi-source multi-scale feature fusion module and a weighted sliding window–multilayer perceptron (MLP) enhanced decoding unit are constructed. The former achieves the coupling of multi-scale physical features at a 2 km grid scale through cross-level semantic alignment, building a highly consistent feature field that effectively improves the model’s ability to detect lightning signals. The latter leverages adaptive receptive fields and the nonlinear modeling capability of MLPs to effectively smooth spatially discrete noise, ensuring spatial continuity in the reconstructed results. Finally, to address the model bias caused by severe class imbalance between positive and negative samples—resulting from the extreme sparsity of lightning events—an asymmetrically weighted BCE-DICE loss function is designed. Its “asymmetric” characteristic is implemented by assigning different penalty weights to false-positive and false-negative predictions. 
This loss function balances pixel-level accuracy and inter-class equilibrium while imposing high-weight penalties on false-positive predictions, achieving synergistic optimization of feature enhancement and directional suppression. Experimental results show that the proposed method effectively increases the hit rate while substantially reducing the false alarm rate, enabling efficient utilization of multi-source data and high-precision identification of lightning strike areas. Full article
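An asymmetrically weighted BCE-Dice loss of the kind described can be sketched as below (the weights `w_fp`, `w_fn` and the mixing factor `alpha` are illustrative assumptions, not the paper's values):

```python
import numpy as np

def asymmetric_bce_dice(pred, target, w_fp=2.0, w_fn=1.0, alpha=0.5, eps=1e-7):
    """BCE with separate false-positive / false-negative weights plus a soft Dice term."""
    pred = np.clip(pred, eps, 1 - eps)
    # w_fn scales the miss term (target = 1), w_fp scales the false-alarm term (target = 0)
    bce = -(w_fn * target * np.log(pred) + w_fp * (1 - target) * np.log(1 - pred))
    intersection = (pred * target).sum()
    dice = 1 - (2 * intersection + eps) / (pred.sum() + target.sum() + eps)
    return alpha * bce.mean() + (1 - alpha) * dice
```

With `w_fp > w_fn`, a confident false alarm costs more than an equally confident miss, which is how the asymmetry steers the detector toward a lower false-alarm rate on sparse lightning pixels.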
15 pages, 1771 KB  
Article
Deep Learning-Based Generation of Retinal Nerve Fibre Layer Thickness Maps from Fundus Photographs: A Comparative Analysis of U-Net Architectures for Accessible Glaucoma Assessment
by Kyoung Ohn, Harin Jun, Yong-Sik Kim and Woong-Joo Whang
Life 2026, 16(4), 559; https://doi.org/10.3390/life16040559 (registering DOI) - 29 Mar 2026
Abstract
Introduction: Optical coherence tomography (OCT) is the gold standard for retinal nerve fibre layer (RNFL) assessment, but its high cost and limited accessibility hinder widespread use. This study aims to develop deep learning models that generate RNFL thickness maps from fundus images, providing a cost-effective alternative to OCT. Methods: A dataset of 5000 fundus-OCT image pairs from 5000 unique glaucoma patients was used to train and compare the following four U-Net-based deep learning models: ResU-Net, R2U-Net, Nested U-Net, and Dense U-Net. All models were trained for up to 1000 epochs with early stopping (patience = 50 epochs). Performance was evaluated using Mean Squared Error (MSE), Mean Absolute Error (MAE), Peak Signal-to-Noise Ratio (PSNR), Structural Similarity Index Measure (SSIM), and Fréchet Inception Distance (FID). Results: ResU-Net demonstrated the best performance, achieving MSE = 0.00061, MAE = 0.01877, SSIM = 0.9163, PSNR = 32.19 dB, and FID = 30.08. These results represent a 108% improvement in SSIM and a 67% improvement in PSNR compared to the previously published benchmark for this task. Conclusions: This study demonstrates that deep learning models, particularly ResU-Net, can generate high-fidelity RNFL thickness maps from fundus photographs, substantially outperforming prior published benchmarks. This approach represents a potential contribution toward accessible glaucoma assessment, contingent upon prospective clinical validation and regulatory evaluation. Full article
(This article belongs to the Special Issue Vision Science and Optometry: 2nd Edition)
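As a consistency check, the reported PSNR can be recovered from the reported MSE via the standard relation for images scaled to [0, 1] (the formula is textbook, not specific to this paper; small differences arise from per-image averaging):

```python
import math

def psnr_db(mse, max_val=1.0):
    """Peak signal-to-noise ratio in decibels for a given mean squared error."""
    return 10 * math.log10(max_val ** 2 / mse)

db = psnr_db(0.00061)  # close to the reported 32.19 dB
```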

16 pages, 13705 KB  
Article
PRefiner: Enhancing Overlapped Cervical Cell Segmentation Through Progressive Refinement
by Linlin Zhu, Jiaxun Li and Jiaxi Liu
Electronics 2026, 15(7), 1418; https://doi.org/10.3390/electronics15071418 - 28 Mar 2026
Abstract
Cervical cancer is one of the most prevalent cancers among women, significantly impacting their daily lives. Computer vision-based cervical cell morphology diagnosis technology can offer robust support for cervical cell analysis at a lower cost. However, the presence of a substantial number of overlapping cells in cervical images renders existing cell segmentation methods less accurate, thereby complicating the guidance of medical diagnosis. In this paper, we introduce a three-stage Progressive Refinement method (PRefiner) for overlapping cell segmentation that decouples the traditional end-to-end pipeline, with the final stage specifically correcting anomalous results to enhance precision. We achieve separable overlapping cervical cell segmentation results through a cell nucleus locator, a single-cell segmenter, and a Segmentation Result Mask Refiner. Specifically, we employ a hybrid U-Net as the primary network for the cell nucleus locator and single-cell segmenter, which determines the position of the cell nucleus and produces the initial coarse segmentation result. In the mask refiner, we incorporate a conditional generation framework to address the perception decision problem and design a local–global dual-scale discriminator to ensure that the segmentation result aligns with the prior of a single-cell mask. Experimental results on CCEDD and ISBI2015 demonstrate that PRefiner achieves optimal performance by effectively resolving abnormal segmentations. Notably, our method improves the Dice coefficient of abnormal results from five different models by an average of 2.62% (ranging from 1.0% to 5.1%). Full article
(This article belongs to the Special Issue AI-Driven Image Processing: Theory, Methods, and Applications)

50 pages, 10525 KB  
Article
Passable Area Evaluation of Tractor Road Based on Improved YOLOv5s and Multi-Factor Fusion
by Qian Zhang, Wenjie Xu, Wenfei Wu, Lizhang Xu, Zhenghui Zhao and Shaowei Liang
Agriculture 2026, 16(7), 752; https://doi.org/10.3390/agriculture16070752 - 28 Mar 2026
Abstract
The tractor road, as the core scene for autonomous driving of grain transport vehicles, is unstructured, complex, and obstacle-rich, leading to poor real-time performance and accuracy of joint road and obstacle detection with existing YOLOv5s. Furthermore, the reliability of passable area evaluation is low solely based on environmental factors. Therefore, YOLOv5s-C2S is proposed, fusing multi-scale features, attention mechanism, and dynamic features for joint detection. Firstly, YOLOv5s-CC is proposed for road detection by fusing context and spatial details and introducing Criss-Cross attention. Secondly, YOLOv5s-SGA is proposed for obstacle detection by grouped and spatial convolution, parameter-free attention, and adaptive feature fusion. By reusing YOLOv5s-CC weights, YOLOv5s-C2S shares low-level features and decouples high-level specificity. Based on the tractor road and obstacle information, combined with vehicle factors, a weighted scoring–based comprehensive method for passable area evaluation is proposed. Finally, the method was verified through experiments with an intelligent tracked grain transport vehicle using self-constructed datasets, including VOC_Road (11,927 images) and VOC_Obstacle (21,779 images). Compared with existing YOLOv5s, Deeplabv3+, FCN, Unet and SegNet, the mAP50 of road detection by YOLOv5s-CC increased by over 1.2%. Compared with existing YOLOv5s, R-CNN, YOLOv7, SSD and YOLOv8n, the mAP50 of obstacle detection by YOLOv5s-SGA increased by over 2%. Compared with YOLOv5s-SD, the mAP50 of joint detection by YOLOv5s-C2S increased by 9.3%, and the frame rate increased by 7.0 FPS. The proposed passable area evaluation method exhibits strong robustness and reliability in complex environments, meeting the accuracy and real-time requirements in autonomous driving of grain transport vehicles. Full article
(This article belongs to the Section Artificial Intelligence and Digital Agriculture)
24 pages, 3376 KB  
Article
EMDiC: Physics-Informed Conditional Diffusion Denoising for Frequency-Domain Electromagnetic Signals
by Zhenlin Du, Miaomiao Gao, Zhijie Qu and Xiaojuan Zhang
Appl. Sci. 2026, 16(7), 3249; https://doi.org/10.3390/app16073249 - 27 Mar 2026
Abstract
Frequency-domain electromagnetic (FDEM) measurements for shallow subsurface exploration are frequently corrupted by noise, which masks weak secondary-field responses and degrades interpretation. We propose an electromagnetic diffusion CNN (EMDiC) for 1D multi-frequency FDEM denoising, where denoising is formulated as conditional diffusion-based generation. EMDiC combines an analytic frequency–spatial encoder, a Feature-wise Linear Modulation (FiLM)-conditioned convolutional hourglass backbone, and a physics-informed composite loss built on velocity loss to improve waveform reconstruction under severe noise. A reproducible synthetic dataset is constructed through layered-earth forward modeling with concentric Transmitter–Receiver (TX–RX) geometry, multiple target categories, and mixed noise waveforms. On synthetic benchmarks covering multiple noise levels and material types, EMDiC achieves the best overall performance in Root Mean Square Error (RMSE), Signal-to-Noise Ratio (SNR), and Normalized cross-correlation (NCC) among 1D U-Net, diffusion-based variants, and representative neural baselines, with the clearest gains under medium-to-strong noise and for targets with pronounced induction responses. Ablation experiments verify the complementary contributions of electromagnetic positional encoding (EMPE), FiLM conditioning, and the composite loss. Field data validation with a self-developed GEM-3 system further shows that EMDiC improves cross-frequency coherence and suppresses oscillations while preserving the main response characteristics. Full article
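FiLM conditioning, mentioned in the EMDiC backbone above, reduces to a per-channel scale and shift predicted from the conditioning input. A minimal sketch (shapes and values are hypothetical; in the paper the gamma/beta parameters come from a learned network):

```python
import numpy as np

def film(features, gamma, beta):
    """Feature-wise Linear Modulation: per-channel affine transform of a feature map."""
    # features: (channels, length); gamma, beta: (channels,), derived from the condition
    return gamma[:, None] * features + beta[:, None]

x = np.ones((2, 4))  # two channels of a 1D feature map
out = film(x, gamma=np.array([2.0, 0.5]), beta=np.array([1.0, 0.0]))
```

Because the modulation is per-channel, the conditioning signal can amplify or suppress entire feature channels without disturbing spatial structure.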

15 pages, 2219 KB  
Article
One Patch Is All You Need: Joint Surface Material Reconstruction and Classification from Minimal Visual Cues
by Sindhuja Penchala, Gavin Money, Gabriel Marques, Samuel Wood, Jessica Kirschman, Travis Atkison, Shahram Rahimi and Noorbakhsh Amiri Golilarz
Sensors 2026, 26(7), 2083; https://doi.org/10.3390/s26072083 - 27 Mar 2026
Abstract
Understanding material surfaces from sparse visual cues is critical for applications in robotics, simulation and material perception. However, most existing methods rely on dense or full scene observations, limiting their effectiveness in constrained or partial view environments. This gap highlights the need for models capable of inferring surface properties from extremely limited visual information. To address this challenge, we introduce SMARC, a unified model for Surface MAterial Reconstruction and Classification from minimal visual input. Given only a single contiguous patch covering 10% of the image, SMARC reconstructs the full RGB surface while simultaneously classifying the material category. Our architecture combines a Partial Convolutional U-Net with a classification head, enabling both spatial inpainting and semantic understanding under extreme observation sparsity. We compared SMARC against five models including convolutional autoencoders, Vision Transformer (ViT), Masked Autoencoder (MAE), Swin Transformer and DETR using the Touch and Go dataset of real-world surface textures. SMARC achieves the highest performance among the evaluated methods with a PSNR of 17.55 dB and a surface classification accuracy of 85.10%. These results validate the effectiveness of SMARC in relation to surface material understanding and highlight its potential for deployment in robotic perception tasks where visual access is inherently limited. Full article
(This article belongs to the Special Issue Advanced Sensors and AI Integration for Human–Robot Teaming)

28 pages, 8120 KB  
Article
Genetic Programming Algorithm Evolving Robust Unary Costs for Efficient Graph Cut Segmentation
by Reem M. Mostafa, Emad Mabrouk, Ahmed Ayman, Hamdy Z. Zidan and Abdelmonem M. Ibrahim
Algorithms 2026, 19(4), 256; https://doi.org/10.3390/a19040256 - 27 Mar 2026
Abstract
Accurate cell and nuclei segmentation remains challenging due to the sensitivity of classical graph-cut methods to parameter tuning. While deep learning models like U-Net offer strong performance, they require large annotated datasets and substantial GPU resources. This work presents a cost-effective alternative: a genetic programming (GP) framework that jointly optimizes unary cost functions and regularization parameters for graph-cut segmentation, coupled with automatic seed selection. Evaluation is conducted under two distinct protocols: (1) oracle-guided per-image optimization, establishing upper-bound performance (mean Dice 0.822, IoU 0.733), and (2) true generalization via train/test split, where expressions learned on 50 images are applied to 50 unseen images (mean Dice 0.695, IoU 0.588). The fixed-model generalization still significantly outperforms the baseline graph cut (+0.158 Dice, p<0.001). Cross-dataset validation on MoNuSeg (H&E histopathology) achieves a Dice score of 0.823 with the fixed GP model, significantly outperforming the baseline (+0.272). This result uses a single fixed model—the best-performing expression from BBBC038 training—applied in a zero-shot manner to MoNuSeg without any retraining or domain adaptation. All 100 images showed non-negative improvement under oracle optimization in the experiments. The method requires no GPU training, runs in 550 s per image for oracle search, and offers interpretable symbolic cost functions. Code and annotations are provided to ensure reproducibility. This approach offers a practical, interpretable alternative in resource-constrained biomedical imaging settings. Full article
(This article belongs to the Special Issue Bio-Inspired Algorithms: 2nd Edition)

32 pages, 43453 KB  
Article
ABHNet: An Attention-Based Deep Learning Framework for Building Height Estimation Fusing Multimodal Data
by Zhanwu Zhuang, Ning Li, Weiye Xiao, Jiawei Wu and Lei Zhou
ISPRS Int. J. Geo-Inf. 2026, 15(4), 146; https://doi.org/10.3390/ijgi15040146 - 26 Mar 2026
Abstract
Building height is a key indicator of vertical urbanization and urban morphological complexity, yet accurately mapping building height at fine spatial resolution and large spatial scales remains challenging. This study proposes an attention-based deep learning framework (ABHNet) for building height estimation at a 10 m spatial resolution by integrating multi-source remote sensing data and socioeconomic information. The model jointly exploits Sentinel-1 synthetic aperture radar data, Sentinel-2 multispectral imagery, and point of interest (POI) data. The proposed framework is evaluated in Shanghai, a megacity with dense and vertically complex urban structures, using Baidu Maps-derived building height data as reference information. The results demonstrate that the proposed method achieves accurate building height estimation, with a root mean squared error (RMSE) of 3.81 m and a mean absolute error (MAE) of 0.96 m for 2023, and an RMSE of 3.30 m and an MAE of 0.78 m for 2019, indicating robust performance across different time periods. Also, this model is applied in two other cities (Changzhou and Guiyang) and the results indicate good performance. In addition, the expandability of the framework is examined by incorporating higher-resolution ZY-3 imagery, for which the spatial resolution was increased to 2.5 m, highlighting the potential extension of the model to heterogeneous data sources. Overall, this study demonstrates the effectiveness of attention-based deep learning and multimodal data fusion for large-scale and fine-resolution building height estimation using open-source data. Full article
24 pages, 1740 KB  
Article
A Skip-Free Collaborative Residual U-Net for Secure Multi-Center Liver and Tumor Segmentation
by Omar Ibrahim Alirr
Eng 2026, 7(4), 151; https://doi.org/10.3390/eng7040151 - 26 Mar 2026
Abstract
Accurate liver and tumor segmentation from abdominal computed tomography (CT) scans is essential for diagnosis and treatment planning; however, centralized deep learning approaches are often constrained by privacy regulations and inter-institution data-sharing limitations. To address these challenges, we propose a skip-free feature-forward collaborative segmentation framework called Feature-Forward Residual U-Net (FF-ResUNet), in which each institution executes the encoder locally and transmits only compact bottleneck representations to a central server. High-resolution encoder features and skip connections remain strictly within institutional boundaries, reducing privacy exposure and communication overhead. The server reconstructs segmentation masks using a multi-scale dilated residual decoder with progressive upsampling and returns lightweight updates for encoder refinement. FF-ResUNet is evaluated on the Liver Tumor Segmentation (LiTS) Challenge dataset, with cross-domain testing on 3D-IRCADb and AMOS-CT to assess robustness under distribution shifts and simulated multi-institution collaboration. On LiTS, the proposed framework achieves a liver Dice score of 0.952 ± 0.015 and a tumor Dice score of 0.737 ± 0.060, with a tumor HD95 of 10.9 ± 4.1 mm. Cross-domain experiments demonstrate stable generalization to unseen datasets, while multi-client simulations show improved performance as the number of participating institutions increases before saturation. Compared with skip-based collaborative U-Net architectures, FF-ResUNet reduces communication payload by 92–98% per training iteration while maintaining competitive segmentation accuracy. These results indicate that FF-ResUNet provides an effective balance between segmentation performance, communication efficiency, and privacy preservation, as evaluated under simulated multi-institution collaborative settings, supporting practical multi-center clinical deployment in bandwidth- and policy-constrained healthcare environments. Full article
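The split-computation pattern the abstract describes (encoder runs on the client, only a compact bottleneck crosses the network, the decoder runs on the server) can be illustrated with a deliberately tiny sketch. The pooling/upsampling "layers" and all names below are hypothetical stand-ins for the actual residual U-Net blocks, chosen only to show why the transmitted payload shrinks:

```python
# Toy sketch of a skip-free "feature-forward" split: the client ships only a
# compact bottleneck, never its high-resolution features or skip connections.

def client_encode(signal, factor=4):
    """Client-side encoder: average-pool to a bottleneck 'factor' x smaller."""
    return [sum(signal[i:i + factor]) / factor
            for i in range(0, len(signal), factor)]

def server_decode(bottleneck, factor=4):
    """Server-side decoder: nearest-neighbour upsampling back to full size."""
    return [v for v in bottleneck for _ in range(factor)]

scan = [0.0, 0.0, 1.0, 1.0, 1.0, 1.0, 0.0, 0.0]  # toy 1-D "CT scanline"
bottleneck = client_encode(scan)                  # only this leaves the client
mask = server_decode(bottleneck)                  # reconstructed at the server

# The payload is 'factor' x smaller than the raw input, the same mechanism
# behind the 92-98% communication reduction reported for FF-ResUNet.
print(len(bottleneck), len(mask))  # 2 8
```

In the real framework the compression comes from the encoder's strided convolutions rather than plain pooling, but the privacy and bandwidth argument is the same: the server only ever sees the low-dimensional representation.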
33 pages, 783 KB  
Systematic Review
A Systematic Review of Deep Learning Approaches for Hepatopancreatic Tumor Segmentation
by Razeen Hussain, Muhammad Mohsin, Dadan Khan and Mohammad Zohaib
J. Imaging 2026, 12(4), 147; https://doi.org/10.3390/jimaging12040147 - 26 Mar 2026
Abstract
Deep learning has advanced rapidly in medical image segmentation, yet hepatopancreatic tumor delineation remains challenging due to low contrast, small lesion size, organ variability, and limited high-quality annotations. Existing reviews are outdated or overly broad, leaving recent architectural developments, training strategies, and dataset limitations insufficiently synthesized. To address this gap, we conducted a PRISMA 2020 systematic literature review of studies published between 2021 and 2026 on deep learning-based liver and pancreatic tumor segmentation. From 2307 records, 84 studies met inclusion criteria. U-Net variants continue to dominate, achieving strong liver segmentation but inconsistent tumor accuracy, while transformer-based and hybrid models improve global context modeling at higher computational cost. Attention mechanisms, boundary-refinement modules, and semi-supervised learning offer incremental gains, yet pancreatic tumor segmentation remains notably difficult. Persistent issues, including domain shift, class imbalance, and limited generalization across datasets, underscore the need for more robust architectures, standardized benchmarks, and clinically oriented evaluation. This review consolidates recent progress and highlights key challenges that must be addressed to advance reliable hepatopancreatic tumor segmentation. Full article
(This article belongs to the Section Medical Imaging)
