Next Article in Journal
A UAV Hyperspectral Inversion Framework for Mapping Soil Heavy Metals Based on Spectral Harmonization, Weighted Ensemble Learning, and Environmental Variable Integration
Previous Article in Journal
A Dual-Branch CNN with Depthwise Separable Fusion for Hyperspectral Image Classification
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Article

FLKFormer: Frequency-Enhanced Large-Kernel Framework for Object Detection in UAV Imagery

1
School of Electronic Information, Xijing University, Xi’an 710123, China
2
School of Computer Science, Northwestern Polytechnical University, Xi’an 710129, China
3
School of Computer Science and Engineering, Xi’an University of Technology, Xi’an 710048, China
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Remote Sens. 2026, 18(11), 1686; https://doi.org/10.3390/rs18111686
Submission received: 27 March 2026 / Revised: 19 May 2026 / Accepted: 20 May 2026 / Published: 22 May 2026

Abstract

UAV object detection remains challenging due to large scale variation, dense small objects, frequent occlusion, and complex background interference. Existing CNN-based detectors are often limited by weak small-object representation, while Transformer-based detectors may not adequately preserve local details in dense aerial scenes. This paper proposes a dual-path detection framework that integrates frequency-domain enhancement with large-kernel convolution and Transformer-based global modeling. An FFT Large-Kernel Convolution (FFLKC) module is introduced to enhance high-frequency details and enlarge the effective receptive field. A Transformer pathway with Full-Process Feature Attention (FPFA) is designed to strengthen long-range dependency modeling and semantic representation. A Frequency-Semantic Memory-guided Adaptive Fusion (FMSAF) module is further employed to integrate local detail features and global contextual information. Experiments on UAVDT and VisDrone demonstrate that the proposed method achieves superior overall detection performance and stronger small-object perception than mainstream detectors. The method reaches 58.7 AP and 51.8 APS on UAVDT, and 39.4 AP and 30.5 APS on VisDrone. Qualitative and quantitative results verify the effectiveness of the proposed design in improving detection quality under complex UAV backgrounds.
Keywords: UAV object detection; small-object detection; frequency-domain enhancement; large-kernel convolution; Transformer; dual-path architecture UAV object detection; small-object detection; frequency-domain enhancement; large-kernel convolution; Transformer; dual-path architecture

Share and Cite

MDPI and ACS Style

Chen, Y.; Huang, W.-Z.; Wang, Z.; Zeng, S.; Yang, C. FLKFormer: Frequency-Enhanced Large-Kernel Framework for Object Detection in UAV Imagery. Remote Sens. 2026, 18, 1686. https://doi.org/10.3390/rs18111686

AMA Style

Chen Y, Huang W-Z, Wang Z, Zeng S, Yang C. FLKFormer: Frequency-Enhanced Large-Kernel Framework for Object Detection in UAV Imagery. Remote Sensing. 2026; 18(11):1686. https://doi.org/10.3390/rs18111686

Chicago/Turabian Style

Chen, Yunhao, Wen-Zhun Huang, Zhen Wang, Sihao Zeng, and Chen Yang. 2026. "FLKFormer: Frequency-Enhanced Large-Kernel Framework for Object Detection in UAV Imagery" Remote Sensing 18, no. 11: 1686. https://doi.org/10.3390/rs18111686

APA Style

Chen, Y., Huang, W.-Z., Wang, Z., Zeng, S., & Yang, C. (2026). FLKFormer: Frequency-Enhanced Large-Kernel Framework for Object Detection in UAV Imagery. Remote Sensing, 18(11), 1686. https://doi.org/10.3390/rs18111686

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop