Next Article in Journal
An Interval Prediction Method Based on TSKANMixer Architecture for Predicting the State of Health of Lithium-Ion Batteries
Previous Article in Journal
Blockchain-Based Identity Management System Prototype for Enhanced Privacy and Security
Previous Article in Special Issue
Dual-Task Learning for Long-Range Classification in Single-Pixel Imaging Under Atmospheric Turbulence
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Article

A Remote Sensing Image Object Detection Model Based on Improved YOLOv11

1
Heilongjiang Province Key Laboratory of Laser Spectroscopy Technology and Application, Harbin University of Science and Technology, Harbin 150080, China
2
School of Integrated Circuit, Shenzhen Polytechnic University, Shenzhen 518055, China
*
Author to whom correspondence should be addressed.
Electronics 2025, 14(13), 2607; https://doi.org/10.3390/electronics14132607 (registering DOI)
Submission received: 4 June 2025 / Revised: 25 June 2025 / Accepted: 26 June 2025 / Published: 27 June 2025
(This article belongs to the Special Issue Machine Learning and Computational Intelligence in Remote Sensing)

Abstract

Due to the challenges posed by high resolution, substantial background noise, significant object scale variation, and long-tailed data distribution in remote sensing images, traditional techniques often struggle to maintain both high accuracy and low latency. This paper proposes YOLO11-FSDAT, an advanced object detection framework tailored for remote sensing imagery, which integrates not only modular enhancements but also theoretical and architectural innovations to address these limitations. First, we propose the frequency–spatial feature extraction fusion module (Freq-SpaFEFM), which breaks the conventional paradigm of spatial-domain-dominated feature learning by introducing a multi-branch architecture that fuses frequency- and spatial-domain features in parallel. This design provides a new processing paradigm for multi-scale object detection, particularly enhancing the model’s capability in handling dense and small-object scenarios with complex backgrounds. Second, we introduce the deformable attention-based global–local fusion module (DAGLF), which combines fine-grained local features with global context through deformable attention and residual connections. This enables the model to adaptively capture irregularly oriented objects (e.g., tilted aircraft) and effectively mitigates the issue of information dilution in deep networks. Third, we develop the adaptive threshold focal loss (ATFL), which is the first loss function to systematically address the long-tailed distribution in remote sensing datasets by dynamically adjusting focus based on sample difficulty. Unlike traditional focal loss with fixed hyperparameters, ATFL decouples hard and easy samples and automatically adapts to varying class distributions. Experimental results on the public DOTAv1, SIMD, and DIOR datasets demonstrated that YOLO11-FSDAT achieved 75.22%, 82.79%, and 88.01% mAP, respectively, outperforming baseline YOLOv11n by up to 4.11%. These results confirm the effectiveness, robustness, and broader theoretical value of the proposed framework in addressing key challenges in remote sensing object detection.
Keywords: remote sensing images; multi-scale object detection; frequency–spatial feature fusion; deformable attention mechanism; adaptive threshold focal loss (ATFL) remote sensing images; multi-scale object detection; frequency–spatial feature fusion; deformable attention mechanism; adaptive threshold focal loss (ATFL)

Share and Cite

MDPI and ACS Style

Wang, A.; Fu, Z.; Zhao, Y.; Chen, H. A Remote Sensing Image Object Detection Model Based on Improved YOLOv11. Electronics 2025, 14, 2607. https://doi.org/10.3390/electronics14132607

AMA Style

Wang A, Fu Z, Zhao Y, Chen H. A Remote Sensing Image Object Detection Model Based on Improved YOLOv11. Electronics. 2025; 14(13):2607. https://doi.org/10.3390/electronics14132607

Chicago/Turabian Style

Wang, Aili, Zhijia Fu, Yanran Zhao, and Haisong Chen. 2025. "A Remote Sensing Image Object Detection Model Based on Improved YOLOv11" Electronics 14, no. 13: 2607. https://doi.org/10.3390/electronics14132607

APA Style

Wang, A., Fu, Z., Zhao, Y., & Chen, H. (2025). A Remote Sensing Image Object Detection Model Based on Improved YOLOv11. Electronics, 14(13), 2607. https://doi.org/10.3390/electronics14132607

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop