This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Open AccessArticle
Multi-Feature Fusion Method Based on Adaptive Dilation Convolution for Small-Object Detection
by
Lin Cao
Lin Cao 1,2
,
Jin Wu
Jin Wu 1,2,
Zongmin Zhao
Zongmin Zhao 1,2,*,
Chong Fu
Chong Fu
Prof. Chong Fu is a professor at the Department of Communication and Electronics Engineering, School [...]
Prof. Chong Fu is a professor at the Department of Communication and Electronics Engineering, School of Computer Science and Engineering, Northeastern University. He received an M.S. degree in telecommunication and information systems and a Ph.D. degree in computer software and theory from Northeastern University, Shenyang, China, in 2001 and 2006, respectively. In 2001, He joined Northeastern University. In 2010, he spent three months as a Visiting Researcher at the Department of Electronics and Information Engineering at the Hong Kong Polytechnic University. His research interests include multimedia security and computer vision.
3
and
Dongfeng Wang
Dongfeng Wang 4
1
School of Information and Communication Engineering, Beijing Information Science and Technology University, Beijing 100101, China
2
Key Laboratory of the Ministry of Education for Optoelectronic Measurement Technology and Instrument, Beijing Information Science and Technology University, Beijing 100101, China
3
School of Computer Science and Engineering, Northeastern University, Shenyang 110169, China
4
Beijing TransMicrowave Technology Company, Beijing 100080, China
*
Author to whom correspondence should be addressed.
Sensors 2025, 25(10), 3182; https://doi.org/10.3390/s25103182 (registering DOI)
Submission received: 7 March 2025
/
Revised: 14 May 2025
/
Accepted: 15 May 2025
/
Published: 18 May 2025
Abstract
This paper addresses the challenge of small-object detection in traffic surveillance by proposing a hybrid network architecture that combines attention mechanisms with convolutional layers. The network introduces an innovative attention mechanism into the YOLOv8 backbone, which effectively enhances the detection accuracy and robustness of small objects through fine-grained and coarse-grained attention routing on feature maps. During the feature fusion stage, we employ adaptive dilated convolution, which dynamically adjusts the dilation rate spatially based on frequency components. This adaptive convolution kernel helps preserve the details of small objects while strengthening their feature representation. It also expands the receptive field, which is beneficial for capturing contextual information and the overall features of small objects. Our method demonstrates an improvement in Average Precision (AP) by on the UA-DETRAC-test dataset and on the VisDrone-test dataset when compared to state-of-the-art methods. The experiments indicate that the new architecture achieves significant performance improvements across various evaluation metrics. To fully leverage the potential of our approach, we conducted extended research on radar–camera systems.
Share and Cite
MDPI and ACS Style
Cao, L.; Wu, J.; Zhao, Z.; Fu, C.; Wang, D.
Multi-Feature Fusion Method Based on Adaptive Dilation Convolution for Small-Object Detection. Sensors 2025, 25, 3182.
https://doi.org/10.3390/s25103182
AMA Style
Cao L, Wu J, Zhao Z, Fu C, Wang D.
Multi-Feature Fusion Method Based on Adaptive Dilation Convolution for Small-Object Detection. Sensors. 2025; 25(10):3182.
https://doi.org/10.3390/s25103182
Chicago/Turabian Style
Cao, Lin, Jin Wu, Zongmin Zhao, Chong Fu, and Dongfeng Wang.
2025. "Multi-Feature Fusion Method Based on Adaptive Dilation Convolution for Small-Object Detection" Sensors 25, no. 10: 3182.
https://doi.org/10.3390/s25103182
APA Style
Cao, L., Wu, J., Zhao, Z., Fu, C., & Wang, D.
(2025). Multi-Feature Fusion Method Based on Adaptive Dilation Convolution for Small-Object Detection. Sensors, 25(10), 3182.
https://doi.org/10.3390/s25103182
Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details
here.
Article Metrics
Article Access Statistics
For more information on the journal statistics, click
here.
Multiple requests from the same IP address are counted as one view.