Multi-Module Collaborative Optimization for SAR Image Aircraft Recognition: The SAR-YOLOv8l-ADE Network
Highlights
- We proposed the SAR-ACGAN network architecture to generate high-quality, diversified virtual aircraft target images, effectively expanding the scale of the SAR image dataset, addressing sample scarcity, and significantly improving aircraft target recognition accuracy.
- We constructed the SAR-YOLOv8l-ADE network structure. Through the collaborative optimization of three modules, namely SAR-DFE, SAR-C2f, and 4SDC, it strengthens detailed feature extraction, adapts to multi-scale targets, and enhances the recognition capability of small targets.
- To address the common problem of scarce target samples in the SAR field, the SAR-ACGAN network provides an efficient dataset expansion solution, laying the foundation for performance breakthroughs in similar SAR target recognition tasks.
- The optimization of the SAR-YOLOv8l-ADE network for feature extraction and small-target recognition not only improves the overall performance of SAR aircraft target detection but also provides a methodological reference for other small-target recognition tasks in the SAR field, helping SAR technology be more accurately applied in practical scenarios such as aviation monitoring and target reconnaissance.
Abstract
1. Introduction
- (1)
- A SAR image sample generation network (SAR-Auxiliary Classifier Generative Adversarial Network, SAR-ACGAN) is designed. Based on the ACGAN architecture, the model integrates a self-attention mechanism to enhance adaptability and robustness in noisy environments, enabling the generation of high-quality, diverse virtual aircraft target images. It effectively expands the dataset scale and further improves the recognition accuracy of aircraft targets.
- (2)
- A SAR image detail feature extraction module (SAR-Detail Feature Extraction, SAR-DFE) is designed as a parameter-learnable, adaptive deep learning component. Integrating central difference convolution and adaptive filtering algorithms enhances the extraction of edge and structural features while suppressing speckle noise. The dual residual structure avoids information loss, and the three-branch concatenation expands single-channel images into three-channel images for deeper feature extraction.
- (3)
- A multi-scale target detection module (SAR-Coarse-to-Fine, SAR-C2f) is designed. Based on the original C2f structure of YOLOv8, it integrates a residual structure with multi-receptive-field adaptive fusion (Multi-Scale Adaptive Fusion Resnet, SAR-MAFR), which dynamically adapts receptive fields. This effectively improves the detection performance of multi-scale targets and is particularly conducive to the extraction of small-target features.
- (4)
- A four-branch adaptive fusion detection head (Four-scale Detectors with Coordinate Attention, 4SDC) is designed. It adds a small-target detection branch and dynamically allocates branch weights via an attention mechanism, thereby enhancing the fusion of shallow detail features and deep semantic features, thereby effectively improving the recognition accuracy of small targets.
2. Methods
2.1. Dataset Sample Augmentation Network SAR-ACGAN
2.2. Detailed Feature Extraction Module SAR-DFE
2.3. Multi-Scale Feature Extraction Module SAR-C2f
2.4. Four-Scale Adaptive Fusion Detection Head 4SDC
3. Results
3.1. Dataset Construction
3.1.1. SAR-Aircraft and SAR-Aircraft-EXT Dataset
3.1.2. SAR-Aircraft-Gen Dataset
3.2. Experimental Environment
3.3. Evaluation Metrics
3.3.1. Evaluation Metrics for Generated Image Quality
3.3.2. Evaluation Metrics for Aircraft Target Recognition Capability
3.4. Analysis of Experimental Results
3.4.1. Analysis of Experimental Results for the SAR-ACGAN Network
3.4.2. Analysis of Experimental Results for the SAR-DFE Module
3.4.3. Analysis of Experimental Results for the SAR-C2f and 4SDC Module
3.4.4. Ablation Experiment
3.4.5. Analysis of Experimental Results for Comparisons Among Different Models
3.4.6. Experimental Effect Diagrams
4. Discussion
4.1. Analysis of Cross-Dataset Application
4.2. Future Research Directions
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
Abbreviations
| SAR | Synthetic Aperture Radar |
| SAR-ACGAN | SAR-Auxiliary Classifier Generative Adversarial Network |
| SAR-DFE | SAR-Detail Feature Extraction |
| SAR-C2f | SAR-Coarse-to-Fine |
| 4SDC | Four-Scale Detectors with Coordinate Attention |
| CDC | Central Difference Convolution |
| A-LEE | Adaptive LEE |
| FID | Fréchet Inception Distance |
| SFE | Structural Feature Extraction |
| SAR-ID | SAR Image Denoising |
References
- Girshick, R. Fast R-CNN. In Proceedings of the IEEE International Conference on Computer Vision, Santiago, Chile, 7–13 December 2015; pp. 1440–1448. [Google Scholar] [CrossRef]
- Ren, S.; He, K.; Girshick, R.; Sun, J. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. IEEE Trans. Pattern Anal. Mach. Intell. 2017, 39, 1137–1149. [Google Scholar] [CrossRef]
- He, K.; Gkioxari, G.; Dollár, P.; Girshick, R. Mask R-CNN. In Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy, 22–29 October 2017; pp. 2961–2969. [Google Scholar] [CrossRef]
- Liu, W.; Anguelov, D.; Erhan, D.; Szegedy, C.; Reed, S.; Fu, C.-Y.; Berg, A.C. SSD: Single Shot Multibox Detector. In Proceedings of the European Conference on Computer Vision, Amsterdam, The Netherlands, 8–16 October 2016; pp. 21–37. [Google Scholar] [CrossRef]
- Lin, T.Y.; Goyal, P.; Girshick, R.; He, K.; Dollár, P. Focal Loss for Dense Object Detection. IEEE Trans. Pattern Anal. Mach. Intell. 2020, 42, 318–327. [Google Scholar] [CrossRef]
- Redmon, J.; Divvala, S.; Girshick, R.; Farhadi, A. You only look once: Unified, real-time object detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 27–30 June 2016; pp. 779–788. [Google Scholar] [CrossRef]
- Zhou, L.; Ran, H.; Xiong, R.; Tan, R. NWD-YOLOv5: A YOLOv5 Model for Small Target Detection Based on NWD Loss. In Proceedings of the IEEE International Conference on Robotics, Intelligent Control and Artificial Intelligence, Nanjing, China, 6–8 December 2024. [Google Scholar] [CrossRef]
- Zhang, H.; Xiong, A.; Lai, L.; Chen, C.; Liang, J. AMME-YOLOv7: Improved YOLOv7 Based on Attention Mechanism and Multiscale Expansion for Electric Vehicle Driver and Passenger Helmet Wearing Detection. In Proceedings of the IEEE International Conference on Smart Internet of Things, Xining, China, 25–27 August 2023. [Google Scholar] [CrossRef]
- Jiang, X.N.; Niu, X.Q.; Wu, F.L.; Fu, Y.; Bao, H.; Fan, Y.C.; Zhang, Y.; Pei, J.Y. A Fine-Grained Aircraft Target Recognition Algorithm for Remote Sensing Images Based on YOLOV8. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2025, 18, 4060–4073. [Google Scholar] [CrossRef]
- Khanam, R.; Hussain, M. Yolov11: An overview of the key architectural enhancements. arXiv 2024, arXiv:2410.17725. [Google Scholar] [CrossRef]
- Xiao, X.; Jia, H.; Xiao, P.; Wang, H. Aircraft Detection in SAR Images Based on Peak Feature Fusion and Adaptive Deformable Network. Remote Sens. 2022, 14, 6077. [Google Scholar] [CrossRef]
- Zhao, C.; Zhang, S.; Luo, R.; Feng, S.; Kuang, G. Scattering features spatial-structural association network for aircraft recognition in SAR images. IEEE Geosci. Remote Sens. Lett. 2023, 20, 4006505. [Google Scholar] [CrossRef]
- Zhu, W.; Zhang, L.; Lu, C.; Fan, G.; Song, Y.; Sun, J.; Lv, X. FEMSFNet: Feature Enhancement and Multi-Scales Fusion Network for SAR Aircraft Detection. Remote Sens. 2024, 16, 1589. [Google Scholar] [CrossRef]
- Hu, J.; Shen, L.; Sun, G. Squeeze-and-Excitation Networks. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA, 18–22 June 2018; pp. 7132–7141. [Google Scholar] [CrossRef]
- Yasir, M.; Liu, S.; Xu, M.; Wan, J.; Nazir, S.; Islam, Q.U.; Dang, K.B. SwinYOLOv7: Robust Ship Detection in Complex Synthetic Aperture Radar Images. Appl. Soft Comput. 2024, 160, 111704. [Google Scholar] [CrossRef]
- He, K.; Zhang, X.; Ren, S.; Sun, J. Deep Residual Learning for Image Recognition. In Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 27–30 June 2016; pp. 770–778. [Google Scholar] [CrossRef]
- Guo, X.; Xu, B. SAR-NTV-YOLOv8: A Neural Network Aircraft Detection Method in SAR Images Based on Despeckling Preprocessing. Remote Sens. 2024, 16, 3420. [Google Scholar] [CrossRef]
- Huang, B.; Zhang, T.; Quan, S.; Wang, W.; Guo, W.; Zhang, Z. Scattering Enhancement and Feature Fusion Network for Aircraft Detection in SAR Images. IEEE Trans. Circuits Syst. Video Technol. 2024, 35, 1936–1950. [Google Scholar] [CrossRef]
- Zhao, Y.; Zhao, L.; Liu, Z.; Hu, D.; Kuang, G.; Liu, L. Attentional Feature Refinement and Alignment Network for Aircraft Detection in SAR Imagery. IEEE Trans. Geosci. Remote Sens. 2022, 60, 5220616. [Google Scholar] [CrossRef]
- Chen, L.; Luo, R.; Xing, J.; Li, Z.; Xing, X.; Yuan, Z.; Tan, S.; Cai, X. Geospatial transformer is what you need for aircraft detection in SAR Imagery. IEEE Trans. Geosci. Remote Sens. 2022, 60, 5225715. [Google Scholar] [CrossRef]
- Nie, Y.; Bian, C.; Li, L.; Chen, H.; Chen, S. LFC-SSD: Multiscale aircraft detection based on local feature correlation. IEEE Geosci. Remote Sens. Lett. 2022, 19, 6510505. [Google Scholar] [CrossRef]
- Kang, Y.; Wang, Z.; Fu, J.; Sun, X.; Fu, K. SFR-Net: Scattering Feature Relation Network for Aircraft Detection in Complex SAR Images. IEEE Trans. Geosci. Remote Sens. 2022, 60, 5218317. [Google Scholar] [CrossRef]
- Han, K.; Wang, Y.; Tian, Q.; Guo, J.; Xu, C.; Xu, C. Ghostnet: More features from cheap operations. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA, 13–19 June 2020; pp. 1580–1589. [Google Scholar] [CrossRef]
- Zou, L.; Zhang, H.; Wang, C.; Wu, F.; Gu, F. MW-ACGAN: Generating Multiscale High-Resolution SAR Images for Ship Detection. Sensors 2020, 20, 6673. [Google Scholar] [CrossRef] [PubMed]
- Odena, A.; Olah, C.; Shlens, J. Conditional image synthesis with auxiliary classifier GANs. In Proceedings of the International Conference on Machine Learning, New York, NY, USA, 19–24 June 2016; pp. 2642–2651. [Google Scholar] [CrossRef]
- Sun, X.; Li, W.; Huang, C.; Fu, J.; Feng, J. Multi-step-ahead Prediction of Grenade Trajectory Based on CNN-LSTM Enhanced by Deep Learning and Self-attention Mechanism. Acta Armamentarii 2024, 44, 240659. [Google Scholar]
- Li, Z.; Zhang, S.; Qiao, Y.; Wang, Q.; Jiang, Y.; Zhang, F. Maneuvering trajectory prediction of air combat targets based on self-attention mechanism and CNN-LSTM. J. Ordnance Equip. Eng. 2023, 44, 209–216. [Google Scholar] [CrossRef]
- Geng, J.; Jiang, W.; Deng, X. Multi-scale deep feature learning network with bilateral filtering for SAR image classification. ISPRS J. Photogramm. Remote Sens. 2020, 167, 201–213. [Google Scholar] [CrossRef]
- Xie, S.; Girshick, R.; Dollár, P.; Tu, Z.; He, K. Aggregated residual transformations for deep neural networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA, 21–26 July 2017; pp. 1492–1500. [Google Scholar] [CrossRef]
- Veit, A.; Wilber, M.J.; Belongie, S. Residual networks behave like ensembles of relatively shallow networks. In Advances in Neural Information Processing Systems; Curran Associates, Inc.: Red Hook, NY, USA, 2016; pp. 550–558. [Google Scholar] [CrossRef]
- Yu, Z.; Zhao, C.; Wang, Z.; Qin, Y.; Su, Z.; Li, X.; Zhou, F.; Zhao, G. Searching central difference convolutional networks for face anti-spoofing. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA, 14–19 June 2020; pp. 5295–5305. [Google Scholar] [CrossRef]
- Lee, S.; Kim, S.-W. Recognition of Targets in SAR Images Based on a WVV Feature Using a Subset of Scattering Centers. Sensors 2022, 22, 8528. [Google Scholar] [CrossRef]
- Zhang, L.; Wu, J.; Fan, Y.; Gao, H.; Shao, Y. An Efficient Building Extraction Method from High Spatial Resolution Remote Sensing Images Based on Improved Mask R-CNN. Sensors 2020, 20, 1465. [Google Scholar] [CrossRef] [PubMed]
- Hoorfar, H.; Puche, A.; Merchenthaler, I. Thermal image edge detection for AI-powered medical research imaging. J. Supercomput. 2025, 81, 629. [Google Scholar] [CrossRef]
- Rumelhart, D.; Hinton, G.; Williams, R. Learning representations by back-propagating errors. Nature 1986, 323, 533–536. [Google Scholar] [CrossRef]
- Szegedy, C.; Liu, W.; Jia, Y.Q.; Sermanet, P.; Reed, S.; Anguelov, D.; Erhan, D.; Vanhoucke, V.; Rabinovich, A.; Liu, W.; et al. Going Deeper with Convolutions. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, MA, USA, 7–12 June 2015; IEEE: New York, NY, USA, 2015. [Google Scholar] [CrossRef]
- Wang, X.; Hong, W.; Liu, Y.; Hu, D.; Xin, P. SAR Image Aircraft Target Recognition Based on Improved YOLOv5. Appl. Sci. 2023, 13, 6160. [Google Scholar] [CrossRef]
- Wang, X.; Hong, W.; Liu, Y.; Yan, G.; Hu, D.; Jing, Q. Improved YOLOv8 Network of Aircraft Target Recognition Based on Synthetic Aperture Radar Imaging Feature. Sensors 2025, 25, 3231. [Google Scholar] [CrossRef]
- Hou, Q.; Zhou, D.; Feng, J. Coordinate Attention for Efficient Mobile Network Design. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Nashville, TN, USA, 19–25 June 2021. [Google Scholar] [CrossRef]
- IEEE-GRSS. 2021 Gaofen Challenge on Automated High-Resolution Earth Observation Image Interpretation; IEEE-GRSS: Piscataway, NJ, USA, 2021; Available online: https://www.grss-ieee.org/publications/call-for-papers/2021-gaofen-challenge-on-automated-high-resolution-earth-observation-image-interpretatio/ (accessed on 1 October 2021).
- Wang, Z.; Kang, Y.; Zeng, X.; Wang, Y.; Zhang, D.; Sun, X. SAR-AIRcraft-1.0: High-resolution SAR Aircraft Detection and Recognition Dataset. J. Radars 2023, 12, 906–922. [Google Scholar] [CrossRef]
- Heusel, M.; Ramsauer, H.; Unterthiner, T.; Nessler, B.; Hochreiter, S. GANs Trained by a Two Time-Scale Update Rule Converge to a Local Nash Equilibrium. In Proceedings of the Advances in Neural Information Processing Systems 30 (NIPS 2017), Long Beach, CA, USA, 4–9 December 2017. [Google Scholar]
- Gayathri, R.; Lincy, R. Transfer learning based handwritten character recognition of tamil script using inception-V3 Model. J. Intell. Fuzzy Syst. 2022, 42, 6102. [Google Scholar] [CrossRef]
- Liu, Z.; Lin, Y.; Cao, Y.; Hu, H.; Wei, Y.; Zhang, Z.; Lin, S.; Guo, B. Swin Transformer: Hierarchical Vision Transformer using Shifted Windows. In Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision (ICCV), Montreal, QC, Canada, 10–17 October 2021; pp. 9992–10002. [Google Scholar]
- Ye, X.; Du, C. Integrated Multi-Scale Aircraft Detection and Recognition with Scattering Point Intensity Adaptiveness in Complex Background Clutter SAR Images. Remote Sens. 2024, 16, 2471. [Google Scholar] [CrossRef]
- Chen, J.; Shen, Y.; Liang, Y.; Wang, Z.; Zhang, Q. YOLO-SAD: An Efficient SAR Aircraft Detection Network. Appl. Sci. 2024, 14, 3025. [Google Scholar] [CrossRef]




















| Layer | Algorithm | Input Size | Output Size | |
|---|---|---|---|---|
| Generator | Layer1 | ConvT + BN + ReLU | (768, 1, 1) | (384, 4, 4) |
| Layer2 | (384, 4, 4) | (256, 8, 8) | ||
| Layer3 | (256, 8, 8) | (128, 16, 16) | ||
| Layer4 | (128, 16, 16) | (64, 32, 32) | ||
| Layer5 | Self-Attention (ratio = 0.5) | (64, 32, 32) | (32, 32, 32) | |
| Layer6 | ConvT + BN + ReLU | (32, 32, 32) | (16, 64, 64) | |
| Layer7 | ConvT + Tanh | (16, 64, 64) | (1, 128, 128) | |
| Discriminator | Layer1 | Conv + BN + LeakyReLU + Dropout | (1, 128, 128) | (1, 128, 128) |
| Layer2 | (1, 128, 128) | (16, 64, 64) | ||
| Layer3 | (16, 64, 64) | (32, 32, 32) | ||
| Layer4 | Self-Attention (ratio = 2) | (32, 32, 32) | (64, 32, 32) | |
| Layer5 | Conv + BN + LeakyReLU + Dropout | (64, 32, 32) | (128, 16, 16) | |
| Layer6 | (128, 16, 16) | (256, 8, 8) | ||
| Layer7 | (256, 8, 8) | (512, 4, 4) |
| Characteristics | CDC | Sobel |
|---|---|---|
| Parameter | Trainable | Fixed |
| Difference directions | 8 | 4 |
| Nonlinear capability | Yes | No |
| Adaptability | Yes | No |
| A220 | A320/321 | A330 | ARJ21 | B737 | B787 | Other |
|---|---|---|---|---|---|---|
| 307 | 207 | 65 | 201 | 300 | 278 | 242 |
| A220 | A320/321 | A330 | ARJ21 | B737 | B787 | Other | Mean | |
|---|---|---|---|---|---|---|---|---|
| ACGAN | 62.23 | 83.36 | 103.56 | 82.82 | 56.87 | 70.18 | 75.63 | 76.25 |
| SAR-ACGAN | 29.42 | 49.61 | 60.38 | 51.37 | 26.29 | 37.64 | 43.97 | 42.67 |
| SAR-DFE | A220 AP50 | A320/321 AP50 | A330 AP50 | ARJ21 AP50 | B737 AP50 | B787 AP50 | Other AP50 | Mean mAP50 | |
|---|---|---|---|---|---|---|---|---|---|
| YOLOv8s | × | 90.8 | 89.3 | 86.3 | 94.5 | 94.6 | 91.8 | 89.4 | 91.0 |
| YOLOv8s | √ | 94.8 | 90.9 | 88.7 | 96.2 | 95.9 | 94.7 | 91.2 | 93.2 (+2.2) |
| YOLOv8l | × | 91.9 | 90.3 | 87.4 | 95.6 | 95.7 | 92.9 | 90.5 | 92.0 |
| YOLOv8l | √ | 94.1 | 96.2 | 92.4 | 96.0 | 96.8 | 95.3 | 91.4 | 94.6 (+2.6) |
| SAR-DFE | Params | GFLOPs | Mean | |||
|---|---|---|---|---|---|---|
| Conv | SFE | SAR-ID | (M) | (G) | mAP50 (%) | mAP50–95 (%) |
| × | × | × | 43.6 | 165.4 | 92.0 | 70.6 |
| √ | Sobel | × | 43.9 | 165.6 | 92.7 (+0.7) | 72.1 (+1.5) |
| √ | CDC | × | 43.9 | 165.6 | 93.2 (+1.2) | 73.8 (+3.2) |
| √ | × | LEE | 45.6 | 165.8 | 92.9 (+0.9) | 73.1 (+2.5) |
| √ | × | A-LEE | 45.9 | 166.0 | 93.6 (+1.6) | 74.8 (+4.2) |
| √ | Sobel | LEE | 45.9 | 166.0 | 93.3 (+1.3) | 74.2 (+3.6) |
| √ | CDC | A-LEE | 46.2 | 166.2 | 94.6 (+2.6) | 77.2 (+6.6) |
| SAR-DFE | SAR-Aircraft-EXT | SAR-Aircraft-200-LQ | |||
|---|---|---|---|---|---|
| (CDC + A-LEE) | (Sobel + LEE) | mAP50 (%) | mAP50–95 (%) | mAP50 (%) | mAP50–95 (%) |
| × | × | 92.0 | 70.6 | 83.2 | 59.3 |
| × | √ | 93.3 (+1.3) | 74.2 (+3.6) | 86.9 (+3.7) | 64.9 (+5.6) |
| √ | × | 94.6 (+2.6) | 77.2 (+6.6) | 89.8 (+6.6) | 69.8 (+10.5) |
| Model | SAR-DFE | SAR-Aircraft-EXT | SAR-Aircraft-200-NT | SAR-Aircraft-200-ST | |||
|---|---|---|---|---|---|---|---|
| mAP50 (%) | mAP50–95 (%) | mAP50 (%) | mAP50–95 (%) | mAP50 (%) | mAP50–95 (%) | ||
| Faster R-CNN | × | 88.9 | 66.3 | 90.1 (+1.2) | 68.1 (+1.8) | 82.7 (−6.2) | 56.1 (−10.2) |
| SSD | × | 86.5 | 63.6 | 87.5 (+1.0) | 65.2 (+1.6) | 81.4 (−5.1) | 52.6 (−11.0) |
| YOLOv8s | √ | 93.2 | 74.0 | 93.9 (+0.7) | 75.1 (+1.1) | 89.6 (−3.6) | 67.2 (−6.8) |
| YOLOv8l | √ | 94.6 | 77.2 | 95.2 (+0.6) | 78.5 (+1.3) | 91.3 (−3.3) | 69.1 (−8.1) |
| SAR-DFE | SAR-C2f | 4SDC | SAR-Aircraft-EXT | SAR-Aircraft-200-ST | FPS | ||
|---|---|---|---|---|---|---|---|
| mAP50 (%) | mAP50–95 (%) | mAP50 (%) | mAP50–95 (%) | ||||
| √ | × | × | 94.6 | 77.2 | 91.3 | 69.1 | 20.4 |
| √ | √ | × | 95.3 (+0.7) | 78.9 (+1.7) | 92.9 (+1.6) | 73.0 (+3.9) | 21.2 |
| √ | × | √ | 95.8 (+1.2) | 80.0 (+2.8) | 93.9 (+2.6) | 75.6 (+6.5) | 19.8 |
| √ | √ | √ | 96.5 (+1.9) | 81.7 (+4.5) | 95.2 (+3.9) | 79.5 (+10.4) | 20.3 |
| SAR-ACGAN | SAR-DFE | SAR-C2f | 4SDC | Params (M) | GFLOPs (G) | SAR-Aircraft-EXT mAP50 (%) | SAR-Aircraft mAP50 (%) |
|---|---|---|---|---|---|---|---|
| × | × | × | × | 43.6 | 165.4 | - | 90.4 (Baseline) |
| √ | × | × | × | 43.6 (+0.0) | 165.4 (+0.0) | 92.0 (+1.6) | - |
| × | √ | × | × | 46.2 (+2.6) | 166.2 (+0.8) | - | 92.5 (+2.1) |
| × | × | √ | × | 41.5 (−2.1) | 165.2 (−0.2) | - | 91.0 (+0.6) |
| × | × | × | √ | 44.7 (+1.1) | 165.8 (+0.4) | - | 91.4 (+1.0) |
| × | × | √ | √ | 42.6 (−1.0) | 165.6 (+0.2) | - | 92.4 (+2.0) |
| × | √ | √ | √ | 45.2 (+1.6) | 166.4 (+1.0) | - | 94.4 (+4.0) |
| √ | √ | × | × | 46.2 (+2.6) | 166.2 (+0.8) | 94.6 (+4.2) | - |
| √ | × | √ | × | 41.5 (−2.1) | 165.2 (−0.2) | 92.7 (+2.3) | - |
| √ | × | × | √ | 44.7 (+1.1) | 165.8 (+0.4) | 93.2 (+2.8) | - |
| √ | × | √ | √ | 42.6 (−1.0) | 165.6 (+0.2) | 94.2 (+3.8) | - |
| √ | √ | √ | √ | 45.2 (+1.6) | 166.4 (+1.0) | 96.5 (+6.1) | - |
| Model | A220 AP50 (%) | A320/321 AP50 (%) | A330 AP50 (%) | ARJ21 AP50 (%) | B737 AP50 (%) | B787 AP50 (%) | Other AP50 (%) | P (%) | R (%) | Mean mAP50 (%) |
|---|---|---|---|---|---|---|---|---|---|---|
| Faster R-CNN | 85.7 | 86.6 | 80.9 | 88.4 | 86.0 | 91.9 | 87.5 | 84.2 | 89.5 | 86.7 |
| Retina-Net | 81.9 | 80.6 | 76.8 | 85.3 | 82.5 | 86.0 | 82.1 | 83.4 | 77.1 | 82.2 |
| SSD | 83.1 | 76.4 | 91.0 | 73.3 | 85.6 | 88.7 | 81.2 | 84.4 | 79.2 | 82.8 |
| YOLOv5s | 90.9 | 81.8 | 91.8 | 84.4 | 86.1 | 93.3 | 88.3 | 89.7 | 83.4 | 88.1 |
| YOLOv8l | 93.3 | 84.2 | 93.3 | 86.8 | 88.5 | 95.7 | 90.7 | 91.3 | 87.5 | 90.4 |
| Swin Transformer | 80.9 | 100 | 77.4 | 74.6 | 73.8 | 86.1 | 84.8 | - | - | 82.5 |
| SADRN | 95.3 | 96.4 | 96.4 | 94.8 | 95.7 | 95.8 | 94.1 | 85.6 | 93.0 | 95.0 |
| YOLOv11L | 93.2 | 88.5 | 94.0 | 90.8 | 89.0 | 94.5 | 90.2 | 93.0 | 88.3 | 91.5 |
| ResNeXt-101 | 80.9 | 100 | 87.1 | 74.9 | 71.1 | 83.9 | 87.7 | - | - | 83.7 |
| SAR-NTV-YOLOv8 | - | - | - | - | - | - | - | 93.5 | 92.2 | 83.4 |
| YOLO-SAD | 93.5 | 87.9 | 92.7 | 90.1 | 88.2 | 94.3 | 90.9 | 89.3 | 87.9 | 90.8 |
| SA-Net | 80.3 | 94.3 | 88.6 | 78.6 | 59.7 | 70.8 | 71.3 | - | - | 77.7 |
| SFSA | 90.0 | 96.0 | 97.0 | 99.0 | 95.0 | 89.0 | 81.0 | - | - | 92.4 |
| SAR-YOLOv8l-ADE | 96.3 | 96.3 | 97.1 | 95.9 | 96.7 | 98.0 | 95.1 | 93.9 | 91.1 | 96.5 |
| Dataset | Model | Aircraft | Oil Tanks | Bridges | Ships | Mean |
|---|---|---|---|---|---|---|
| AP50 (%) | AP50 (%) | AP50 (%) | AP50 (%) | mAP50 (%) | ||
| MSAR-1.0 | Baseline | 73.9 | 93.9 | 90.9 | 96.1 | 88.7 |
| SAR-YOLOv8l-ADE | 78.4 | 94.8 | 91.6 | 97.6 | 90.6 (+1.9) |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Wang, X.; Hong, W.; Li, Q.; Liu, Y.; Zhang, Q.; Xin, P. Multi-Module Collaborative Optimization for SAR Image Aircraft Recognition: The SAR-YOLOv8l-ADE Network. Remote Sens. 2026, 18, 236. https://doi.org/10.3390/rs18020236
Wang X, Hong W, Li Q, Liu Y, Zhang Q, Xin P. Multi-Module Collaborative Optimization for SAR Image Aircraft Recognition: The SAR-YOLOv8l-ADE Network. Remote Sensing. 2026; 18(2):236. https://doi.org/10.3390/rs18020236
Chicago/Turabian StyleWang, Xing, Wen Hong, Qi Li, Yunqing Liu, Qiong Zhang, and Ping Xin. 2026. "Multi-Module Collaborative Optimization for SAR Image Aircraft Recognition: The SAR-YOLOv8l-ADE Network" Remote Sensing 18, no. 2: 236. https://doi.org/10.3390/rs18020236
APA StyleWang, X., Hong, W., Li, Q., Liu, Y., Zhang, Q., & Xin, P. (2026). Multi-Module Collaborative Optimization for SAR Image Aircraft Recognition: The SAR-YOLOv8l-ADE Network. Remote Sensing, 18(2), 236. https://doi.org/10.3390/rs18020236

