A Review: Remote Sensing Image Object Detection Algorithm Based on Deep Learning
Abstract
1. Introduction
2. Object Detection Algorithms for Optical Remote Sensing Images
2.1. Remote Sensing Object Detection Algorithm Based on Anchor Frame
2.1.1. YOLO Series Object Detection Algorithms
2.1.2. Remote Sensing Object Detection Algorithm of the YOLO Series
2.1.3. Application of SSD Framework in Remote Sensing Detection
2.2. Remote Sensing Object Detection Algorithm Based on Candidate Box and Regional Convolutional Neural Network
2.3. End-to-End Remote Sensing Object Detection Algorithm Based on Transformer Network
2.4. Remote Sensing Object Detection Method for Specific Scenes
2.4.1. Object Detection in Remote Sensing Images Based on Supervision
2.4.2. Remote Sensing Image Object Detection Method Based on Attention Mechanism
2.4.3. Remote Sensing Image Object Detection Method Based on Multi-Scale Processing
2.4.4. Based on Deep Learning and Traditional Manual Feature Extraction Methods
2.4.5. Fast Image Processing Method Based on VHR
3. Performance Evaluation and Comparison of Optical Remote Sensing Image Object Detection
3.1. Optical Remote Sensing Image Data Sets
3.2. Algorithm Performance Evaluation and Comparison
4. Challenge and Improvement Direction
5. Conclusions and Prospect
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Cheng, G.; Han, J.W. A survey on object detection in optical remote sensing images. ISPRS J. Photogramm. Remote Sens. 2016, 117, 11–28.
- Chen, Q.; Tang, S.; Yang, Q.; Fu, S. Cooper: Cooperative perception for connected autonomous vehicles based on 3d point clouds. In Proceedings of the 2019 IEEE 39th International Conference on Distributed Computing Systems (ICDCS), Dallas, TX, USA, 7–9 July 2019; IEEE: Piscataway Township, NJ, USA, 2019; pp. 514–524.
- Najibi, M.; Samangouei, P.; Chellappa, R.; Davis, L.S. Ssh: Single stage headless face detector. In Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy, 22–29 October 2017; pp. 4875–4884.
- Zhang, L.; Lin, L.; Liang, X.; He, K. Is faster RCNN doing well for pedestrian detection? In Proceedings of the Computer Vision–ECCV 2016: 14th European Conference, Amsterdam, The Netherlands, 11–14 October 2016; Springer International Publishing: Berlin/Heidelberg, Germany, 2016; pp. 443–457.
- Litjens, G.; Kooi, T.; Bejnordi, B.E.; Setio, A.A.A.; Ciompi, F.; Ghafoorian, M.; Van Der Laak, J.A.; Van Ginneken, B.; Sánchez, C.I. A survey on deep learning in medical image analysis. Med. Image Anal. 2017, 42, 60–88.
- Minaee, S.; Boykov, Y.; Porikli, F.; Plaza, A.; Kehtarnavaz, N.; Terzopoulos, D. Image segmentation using deep learning: A survey. IEEE Trans. Pattern Anal. Mach. Intell. 2021, 44, 3523–3542.
- Vedantam, R.; Lawrence Zitnick, C.; Parikh, D. Cider: Consensus-based image description evaluation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA, 7–12 June 2015; pp. 4566–4575.
- Reddy, K.R.; Priya, K.H.; Neelima, N. Object Detection and Tracking—A Survey. In Proceedings of the 2015 International Conference on Computational Intelligence and Communication Networks (CICN), Jabalpur, India, 12–14 December 2015; IEEE: Piscataway Township, NJ, USA, 2015; pp. 418–421.
- Kuehne, H.; Jhuang, H.; Garrote, E.; Poggio, T.; Serre, T. HMDB: A large video database for human motion recognition. In Proceedings of the 2011 International Conference on Computer Vision, Barcelona, Spain, 6–13 November 2011; IEEE: Piscataway Township, NJ, USA, 2011; pp. 2556–2563.
- Lowe, D.G. Distinctive Image Features from Scale-Invariant Keypoints. Int. J. Comput. Vis. 2004, 60, 91–110.
- Dalal, N.; Triggs, B. Histograms of oriented gradients for human detection. In Proceedings of the 2005 IEEE Conference on Computer Vision and Pattern Recognition, San Diego, CA, USA, 20–26 June 2005; IEEE Computer Society: Washington, DC, USA, 2005; pp. 886–893.
- Felzenszwalb, P.; McAllester, D.; Ramanan, D. A discriminatively trained, multiscale, deformable part model. In Proceedings of the 2008 IEEE Conference on Computer Vision and Pattern Recognition, Anchorage, AK, USA, 23–28 June 2008; IEEE: Piscataway Township, NJ, USA, 2008; pp. 1–8.
- Ren, S.; He, K.; Girshick, R.; Sun, J. Faster RCNN: Towards real-time object detection with region proposal networks. Adv. Neural Inf. Process. Syst. 2015, 28.
- Redmon, J.; Divvala, S.; Girshick, R.; Farhadi, A. You only look once: Unified, real-time object detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 27–30 June 2016; pp. 779–788.
- Liu, W.; Anguelov, D.; Erhan, D.; Szegedy, C.; Reed, S.; Fu, C.Y.; Berg, A.C. Ssd: Single shot multibox detector. In Proceedings of the Computer Vision–ECCV 2016: 14th European Conference, Amsterdam, The Netherlands, 11–14 October 2016; Springer International Publishing: Berlin/Heidelberg, Germany, 2016; pp. 21–37.
- Redmon, J.; Farhadi, A. YOLO9000: Better, faster, stronger. In Proceedings of the IEEE Conference on Computer Vision & Pattern Recognition, Honolulu, HI, USA, 21–26 July 2017; IEEE: Piscataway Township, NJ, USA, 2017.
- Redmon, J.; Farhadi, A. Yolov3: An incremental improvement. arXiv 2018, arXiv:1804.02767.
- Bochkovskiy, A.; Wang, C.Y.; Liao, H.Y.M. Yolov4: Optimal speed and accuracy of object detection. arXiv 2020, arXiv:2004.10934.
- Zhu, X.; Lyu, S.; Wang, X.; Zhao, Q. TPH-YOLOv5: Improved YOLOv5 based on transformer prediction head for object detection on drone-captured scenarios. In Proceedings of the IEEE/CVF International Conference on Computer Vision, Montreal, QC, Canada, 11–17 October 2021; pp. 2778–2788.
- Ge, Z.; Liu, S.; Wang, F.; Li, Z.; Sun, J. Yolox: Exceeding yolo series in 2021. arXiv 2021, arXiv:2107.08430.
- Li, C.; Li, L.; Jiang, H.; Weng, K.; Geng, Y.; Li, L.; Ke, Z.; Li, Q.; Cheng, M.; Nie, W.; et al. YOLOv6: A single-stage object detection framework for industrial applications. arXiv 2022, arXiv:2209.02976.
- Wang, C.Y.; Bochkovskiy, A.; Liao, H.Y.M. YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Vancouver, BC, Canada, 18–22 June 2023; pp. 7464–7475.
- Li, Y.; Fan, Q.; Huang, H.; Han, Z.; Gu, Q. A Modified YOLOv8 Detection Network for UAV Aerial Image Recognition. Drones 2023, 7, 304.
- Tang, X.; Zhang, X.; Shi, J.; Wei, S. A moving object detection method based on YOLO for dual-beam SAR. In Proceedings of the 2021 IEEE International Geoscience and Remote Sensing Symposium IGARSS, Brussels, Belgium, 11–16 July 2021; IEEE: Piscataway Township, NJ, USA, 2021; pp. 5315–5318.
- Jindal, M.; Raj, N.; Saranya, P.; Sundarabalan, V. Aircraft Detection from Remote Sensing Images using YOLOV5 Architecture. In Proceedings of the 2022 6th International Conference on Devices, Circuits and Systems (ICDCS), Coimbatore, India, 21–22 April 2022; IEEE: Piscataway Township, NJ, USA, 2022; pp. 332–336.
- Shi, Y. An Underwater Target Wake Detection in Multi-Source Images Based on Improved YOLOv5. IEEE Access 2023, 11, 31990–31996.
- Ding, W.; Zhang, L. Building detection in remote sensing image based on improved YOLOv5. In Proceedings of the 2021 17th International Conference on Computational Intelligence and Security (CIS), Chengdu, China, 19–22 November 2021; IEEE: Piscataway Township, NJ, USA, 2021; pp. 133–136.
- Sun, Y.; Liu, W.; Hou, X.; Bi, F. FRN-YOLO: A Feature Re-fusion Network for Remote Sensing object detection. In Proceedings of the 2021 2nd International Conference on Computer Science and Management Technology (ICCSMT), Shanghai, China, 12–14 November 2021; IEEE: Piscataway Township, NJ, USA, 2021; pp. 372–375.
- Wei, J.; Liu, Y.; Li, L.; Xie, W.; Zhao, S.; Zhao, Z. Improved YOLO X with Bilateral Attention for Small Object Detection. In Proceedings of the 2023 International Conference on Applied Intelligence and Sustainable Computing (ICAISC), Zakopane, Poland, 16–17 June 2023; IEEE: Piscataway Township, NJ, USA, 2023; pp. 1–6.
- Ma, L.; He, T.; Sun, Y.; Hu, B.B. Lightweight YOLOv4 Algorithm for Remote Sensing Image Detection. In Proceedings of the 2022 14th International Conference on Signal Processing Systems (ICSPS), Jiangsu, China, 18–20 November 2022; IEEE: Piscataway Township, NJ, USA, 2022; pp. 793–797.
- Li, X.; Cai, K. Method research on ship detection in remote sensing image based on Yolo algorithm. In Proceedings of the 2020 International Conference on Information Science, Parallel and Distributed Systems (ISPDS), Xi’an, China, 14–16 August 2020; IEEE: Piscataway Township, NJ, USA, 2020; pp. 104–108.
- Hong, Z.; Yang, T.; Tong, X.; Zhang, Y.; Jiang, S.; Zhou, R.; Han, Y.; Wang, J.; Yang, S.; Liu, S. Multi-scale ship detection from SAR and optical imagery via a more accurate YOLOv3. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2021, 14, 6083–6101.
- Yang, Y.; Liao, Y.; Cheng, L.; Zhang, K.; Wang, H.; Chen, S. Remote sensing image aircraft object detection based on giou-yolo v3. In Proceedings of the 2021 6th International Conference on Intelligent Computing and Signal Processing (ICSP), Xi’an, China, 9–11 April 2021; IEEE: Piscataway Township, NJ, USA, 2021; pp. 474–478.
- Xin, L.; Xie, X.; Lv, J. Research on Remote Sensing Image object detection Algorithm Based on YOLOv5. In Proceedings of the 2022 IEEE 5th Advanced Information Management, Communicates, Electronic and Automation Control Conference (IMCEC), Chongqing, China, 16–18 December 2022; IEEE: Piscataway Township, NJ, USA, 2022; Volume 5, pp. 1497–1501.
- Wang, S.; Sun, H.; Zhu, Y.; Li, M.; Xu, Q. SA-YOLO: The Saliency Adjusted Deep Network for Optical Satellite Image Ship Detection. In Proceedings of the IGARSS 2022–2022 IEEE International Geoscience and Remote Sensing Symposium, Kuala Lumpur, Malaysia, 17–22 July 2022; IEEE: Piscataway Township, NJ, USA, 2022; pp. 2131–2134.
- Zhang, X.; Zhang, Z. Ship detection based on improved YOLO algorithm. In Proceedings of the 2023 3rd International Conference on Consumer Electronics and Computer Engineering (ICCECE), Guangzhou, China, 5–8 January 2023; IEEE: Piscataway Township, NJ, USA, 2023; pp. 98–101.
- Zhang, X.; Yuan, S.; Luan, F.; Lv, J.; Liu, G. Similarity Mask Mixed Attention for YOLOv5 Small Ship Detection of Optical Remote Sensing Images. In Proceedings of the 2022 WRC Symposium on Advanced Robotics and Automation (WRC SARA), Beijing, China, 20 August 2022; IEEE: Piscataway Township, NJ, USA, 2022; pp. 263–268.
- Zhu, F.; Wang, Y.; Cui, J.; Liu, G.; Li, H. Object detection for remote sensing based on the enhanced YOLOv4 with improved BiFPN. Egypt. J. Remote Sens. Space Sci. 2023, 26, 351–360.
- Zhou, L.; Liu, J.; Chen, L. Vehicle detection based on remote sensing image of Yolov3. In Proceedings of the 2020 IEEE 4th Information Technology, Networking, Electronic and Automation Control Conference (ITNEC), Chongqing, China, 12–14 June 2020; IEEE: Piscataway Township, NJ, USA, 2020; Volume 1, pp. 468–472.
- Liu, W.; Tian, J.; Tian, T. YOLM: A Remote Sensing Aircraft Detection Model. In Proceedings of the IGARSS 2022–2022 IEEE International Geoscience and Remote Sensing Symposium, Kuala Lumpur, Malaysia, 17–22 July 2022; IEEE: Piscataway Township, NJ, USA, 2022; pp. 1708–1711.
- Sharma, M.; Markopoulos, P.P.; Saber, E. YOLOrs-lite: A lightweight cnn for real-time object detection in remote-sensing. In Proceedings of the 2021 IEEE International Geoscience and Remote Sensing Symposium IGARSS, Brussels, Belgium, 11–16 July 2021; IEEE: Piscataway Township, NJ, USA, 2021; pp. 2604–2607.
- Wang, Y.K.; Jiang, H.X.; Lin, K.Y. Remote sensing image ship detection based on modified YOLO algorithm. J. Beijing Univ. Aeronaut. Astronaut. 2020, 46, 1184–1191.
- Wang, Z.; Du, L.; Mao, J.; Liu, B.; Yang, D. SAR object detection based on SSD with data augmentation and transfer learning. IEEE Geosci. Remote Sens. Lett. 2018, 16, 150–154.
- Liu, Y.; Yang, J.; Cui, W. Simple, Fast, Accurate Object Detection based on Anchor-Free Method for High Resolution Remote Sensing Images. In Proceedings of the IGARSS 2020–2020 IEEE International Geoscience and Remote Sensing Symposium, Waikoloa, HI, USA, 26 September–2 October 2020; IEEE: Piscataway Township, NJ, USA, 2020; pp. 2443–2446.
- Yu, Y.; Wang, J.; Qiang, H.; Jiang, M.; Tang, E.; Yu, C.; Zhang, Y.; Li, J. Sparse anchoring guided high-resolution capsule network for geospatial object detection from remote sensing imagery. Int. J. Appl. Earth Obs. Geoinf. 2021, 104, 102548.
- Wang, Z.H.; Yang, C. Improved SSD model in extraction application of expressway toll station locations from GaoFen 2 remote sensing image. J. Traffic Transp. Eng. 2021, 21, 278–286.
- Yang, Y.; Gu, H.; Han, Y.; Li, H. An end-to-end deep learning change detection framework for remote sensing images. In Proceedings of the IGARSS 2020–2020 IEEE International Geoscience and Remote Sensing Symposium, Waikoloa, HI, USA, 26 September–2 October 2020; IEEE: Piscataway Township, NJ, USA, 2020; pp. 652–655.
- Lu, X.; Ji, J.; Xing, Z.; Miao, Q. Attention and feature fusion SSD for remote sensing object detection. IEEE Trans. Instrum. Meas. 2021, 70, 1–9.
- Qu, J.; Su, C.; Zhang, Z.; Razi, A. Dilated convolution and feature fusion SSD network for small object detection in remote sensing images. IEEE Access 2020, 8, 82832–82843.
- Suidong, L.; Lei, Z.H.U.; Wenwu, W. Improving SSD for detecting small target in Remote Sensing Image. In Proceedings of the 2020 Chinese Automation Congress (CAC), Shanghai, China, 6–8 November 2020; IEEE: Piscataway Township, NJ, USA, 2020; pp. 567–571.
- Liu, S.; Shi, H.; Guo, Z. Remote sensing image object detection based on improved SSD. In Proceedings of the 2022 3rd International Conference on Computer Vision, Image and Deep Learning & International Conference on Computer Engineering and Applications (CVIDL & ICCEA), Changchun, China, 20–22 May 2022; IEEE: Piscataway Township, NJ, USA, 2022; pp. 421–424.
- Han, J.; Zhang, D.; Cheng, G.; Guo, L.; Ren, J. Object Detection in Optical Remote Sensing Images Based on FFC-SSD Model. Acta Opt. Sin. 2022, 42, 138–148.
- Wang, H.T. Research on Target Detection Algorithm of Optical Aircraft Remote Sensing Image Based on Improved SSD. Master’s Thesis, Ningxia University, Yinchuan, China, 2022.
- Shi, W.X.; Tan, D.L.; Bao, S.L. Feature Enhancement SSD Algorithm and Its Application in Remote Sensing Images Target Detection. Acta Photonica Sin. 2020, 49, 154–163.
- Yin, F.L.; Wang, T.Y. Target detection of remote sensing image based on attention feature fusion SSD algorithm. Netw. Secur. Data Gov. 2022, 41, 67–73.
- Girshick, R.; Donahue, J.; Darrell, T.; Malik, J. Rich feature hierarchies for accurate object detection and semantic segmentation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, USA, 23–28 June 2014; pp. 580–587.
- Girshick, R. Fast RCNN. In Proceedings of the IEEE International Conference on Computer Vision, Santiago, Chile, 7–13 December 2015; pp. 1440–1448.
- He, K.; Gkioxari, G.; Dollár, P.; Girshick, R. Mask RCNN. In Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy, 22–29 October 2017; pp. 2961–2969.
- Yu, D.; Ji, S. A new spatial-oriented object detection framework for remote sensing images. IEEE Trans. Geosci. Remote Sens. 2021, 60, 1–16.
- Huiming, Y.; Fuxin, X. A remote sensing image target recognition method based on improved Mask-RCNN model. In Proceedings of the 2021 IEEE 2nd International Conference on Big Data, Artificial Intelligence and Internet of Things Engineering (ICBAIE), Nanchang, China, 26–28 March 2021; IEEE: Piscataway Township, NJ, USA, 2021; pp. 436–439.
- Miao, W.X.; Luo, Z. Aircraft detection based on multiple scale faster-RCNN. In Proceedings of the 2018 International Conference on Virtual Reality and Visualization (ICVRV), Qingdao, China, 22–24 October 2018; IEEE: Piscataway Township, NJ, USA, 2018; pp. 90–93.
- Sha, M.M.; Li, Y.; Li, A. Multiscale aircraft detection in optical remote sensing imagery based on advanced Faster R-CNN. Natl. Remote Sens. Bull. 2022, 26, 1624–1635.
- Wang, Y.; Bashir, S.M.A.; Khan, M.; Ullah, Q.; Wang, R.; Song, Y.; Guo, Z.; Niu, Y. Remote sensing image super-resolution and object detection: Benchmark and state of the art. Expert Syst. Appl. 2022, 197, 116793.
- Singh, A.K.; Dwivedi, A.K.; Sumanth, M.; Singh, D. An efficient approach for instance segmentation of railway track sleepers in low altitude UAV images using mask RCNN. In Proceedings of the IGARSS 2022–2022 IEEE International Geoscience and Remote Sensing Symposium, Kuala Lumpur, Malaysia, 17–22 July 2022; IEEE: Piscataway Township, NJ, USA, 2022; pp. 4895–4898.
- He, Z.; Peng, P.; Wang, L.; Jiang, Y. Enhancing seismic p-wave arrival picking by target-oriented detection of the local windows using faster-rcnn. IEEE Access 2020, 8, 141733–141747.
- Feng, J.; Liang, Y.; Ye, Z.; Wu, X.; Zeng, D.; Zhang, X.; Tang, X. Small object detection in optical remote sensing video with motion guided RCNN. In Proceedings of the IGARSS 2020–2020 IEEE International Geoscience and Remote Sensing Symposium, Waikoloa, HI, USA, 26 September–2 October 2020; IEEE: Piscataway Township, NJ, USA, 2020; pp. 272–275.
- Zhang, C.; Liu, T.; Lam, K.M. Angle Tokenization Guided Multi-Scale Vision Transformer for Oriented Object Detection in Remote Sensing Imagery. In Proceedings of the IGARSS 2022–2022 IEEE International Geoscience and Remote Sensing Symposium, Kuala Lumpur, Malaysia, 17–22 July 2022; IEEE: Piscataway Township, NJ, USA, 2022; pp. 3063–3066.
- Hu, Z.; Gao, K.; Zhang, X.; Wang, J.; Wang, H.; Yang, Z.; Li, C.; Li, W. EMO2-DETR: Efficient-Matching Oriented Object Detection with Transformers. IEEE Trans. Geosci. Remote Sens. 2023, 61, 5616814.
- Li, N.; Cheng, L.; Ji, C.; Chen, H.; Geng, W.; Yang, W. Airport detection in remote sensing real-open world using deep learning. Eng. Appl. Artif. Intell. 2023, 122, 106083.
- Wu, X.; Yang, L.; Ma, Y.; Wu, C.; Guo, C.; Yan, H.; Qiao, Z.; Yao, S.; Fan, Y. An end-to-end multiple side-outputs fusion deep supervision network based remote sensing image change detection algorithm. Signal Process. 2023, 213, 109203.
- Li, Y.; Li, X.; Zhang, Y.; Peng, D.; Bruzzone, L. Cost-efficient information extraction from massive remote sensing data: When weakly supervised deep learning meets remote sensing big data. Int. J. Appl. Earth Obs. Geoinf. 2023, 120, 103345.
- Wu, Z.Z.; Xu, J.; Wang, Y.; Sun, F.; Tan, M.; Weise, T. Hierarchical fusion and divergent activation based weakly supervised learning for object detection from remote sensing images. Inf. Fusion 2022, 80, 23–43.
- Zhang, Z.; Feng, Z.; Yang, S. Semi-supervised object detection framework with object first mixup for remote sensing images. In Proceedings of the 2021 IEEE International Geoscience and Remote Sensing Symposium IGARSS, Brussels, Belgium, 11–16 July 2021; IEEE: Piscataway Township, NJ, USA, 2021; pp. 2596–2599.
- Zha, W.; Hu, L.; Duan, C.; Li, Y. Semi-supervised learning-based satellite remote sensing object detection method for power transmission towers. Energy Rep. 2023, 9, 15–27.
- Li, W.P.; Yang, X.G.; Li, C.X.; Lu, R.; Huang, P. An improved semi-supervised transfer learning method for infrared object detection neural network. Infrared Laser Eng. 2021, 50, 243–250.
- Du, L.; Wei, D.; Li, L.; Guo, Y. SAR Target Detection Network via Semi-supervised Learning. J. Electron. Inf. Technol. 2020, 42, 154–163.
- Lv, J.D.; Wang, T.; Tang, X.B. Semi-supervised SAR Ship Target Detection with Graph Attention Network. J. Electron. Inf. Technol. 2023, 45, 1541–1549.
- Wang, X.; Yan, X.; Tan, K.; Pan, C.; Ding, J.; Liu, Z.; Dong, X. Double U-Net (W-Net): A change detection network with two heads for remote sensing imagery. Int. J. Appl. Earth Obs. Geoinf. 2023, 122, 103456.
- Qingyun, F.; Zhaokui, W. Cross-modality attentive feature fusion for object detection in multispectral remote sensing imagery. Pattern Recognit. 2022, 130, 108786.
- Hua, X.; Wang, X.; Rui, T.; Zhang, H.; Wang, D. A fast self-attention cascaded network for object detection in large scene remote sensing images. Appl. Soft Comput. 2020, 94, 106495.
- Ma, C.; Weng, L.; Xia, M.; Lin, H.; Qian, M.; Zhang, Y. Dual-branch network for change detection of remote sensing image. Eng. Appl. Artif. Intell. 2023, 123, 106324.
- Zhang, R.T.; Jiang, X.J.; An, J.S.; Cui, T.S. Design of global-contextual detection model for optical remote sensing targets. Chin. Opt. 2020, 13, 1302–1313.
- Nong, Y.J.; Wang, J.J. Spatial Relationship Detection Method of Remote Sensing Objects. Acta Opt. Sin. 2021, 41, 212–217.
- Yin, H.; Weng, L.; Li, Y.; Xia, M.; Hu, K.; Lin, H.; Qian, M. Attention-guided siamese networks for change detection in high resolution remote sensing images. Int. J. Appl. Earth Obs. Geoinf. 2023, 117, 103206.
- Han, W.; Li, J.; Wang, S.; Wang, Y.; Yan, J.; Fan, R.; Zhang, X.; Wang, L. A context-scale-aware detector and a new benchmark for remote sensing small weak object detection in unmanned aerial vehicle images. Int. J. Appl. Earth Obs. Geoinf. 2022, 112, 102966.
- Dong, Z.; Zhang, M.; Li, L.; Liu, Q.; Wen, Q.; Wang, W.; Luo, W.; Wu, Z.; Tang, T.; Ji, W. A multiscale building detection method based on boundary preservation for remote sensing images: Taking the Yangbi M6.4 earthquake as an example. Nat. Hazards Res. 2022, 2, 121–131.
- Zhang, Q.; Tang, J.; Zheng, H.; Lin, C. Efficient object detection method based on aerial optical sensors for remote sensing. Displays 2022, 75, 102328.
- Song, Z.; Li, X.; Zhu, R.; Wang, Z.; Yang, Y.; Zhang, X. ERMF: Edge refinement multi-feature for change detection in bitemporal remote sensing images. Signal Process. Image Commun. 2023, 2023, 116964.
- Gao, T.; Niu, Q.; Zhang, J.; Chen, T.; Mei, S.; Jubair, A. Global to Local: A Scale-Aware Network for Remote Sensing Object Detection. IEEE Trans. Geosci. Remote Sens. 2023, 61, 5615614.
- Chen, S.; Zhao, J.; Zhou, Y.; Wang, H.; Yao, R.; Zhang, L.; Xue, Y. Info-FPN: An Informative Feature Pyramid Network for object detection in remote sensing images. Expert Syst. Appl. 2023, 214, 119132.
- Su, H.; You, Y.; Meng, G. Multi-Scale Context-Aware RCNN for Few-Shot Object Detection in Remote Sensing Images. In Proceedings of the IGARSS 2022–2022 IEEE International Geoscience and Remote Sensing Symposium, Kuala Lumpur, Malaysia, 17–22 July 2022; IEEE: Piscataway Township, NJ, USA, 2022; pp. 1908–1911.
- Dong, X.; Qin, Y.; Fu, R.; Gao, Y.; Liu, S.; Ye, Y.; Li, B. Multiscale deformable attention and multilevel features aggregation for remote sensing object detection. IEEE Geosci. Remote Sens. Lett. 2022, 19, 1–5.
- Zhang, H.; Li, J.; Song, R.; Li, Y. Multi-Scale Structure-Conditioned Feature Transform Network for Object Detection in Remote Sensing Imagery. In Proceedings of the 2021 IEEE International Geoscience and Remote Sensing Symposium IGARSS, Brussels, Belgium, 11–16 July 2021; IEEE: Piscataway Township, NJ, USA, 2021; pp. 4208–4211.
- Dong, Z.; Wang, M.; Wang, Y.; Zhu, Y.; Zhang, Z. Object detection in high resolution remote sensing imagery based on convolutional neural networks with suitable object scale features. IEEE Trans. Geosci. Remote Sens. 2019, 58, 2104–2114.
- Meng, Y.B.; Wang, F.; Liu, G.H. Remote sensing multi-scale object detection based on multivariate feature extraction and characterization optimization. Opt. Precis. Eng. 2023, 31, 2465–2482.
- Yao, Q.L.; Hu, X.; Lei, H. Object Detection in Remote Sensing Images Using Multiscale Convolutional Neural Networks. Acta Opt. Sin. 2019, 48, 1266–1274.
- Zhang, Y.Z.; Guo, W.; Li, W.B. Omnidirectional accurate detection algorithm for dense small objects in remote sensing images. J. Jilin Univ. 2023, 1–9.
- Zhang, M.; Zheng, H.; Gong, M.; Wu, Y.; Li, H.; Jiang, X. Self-structured pyramid network with parallel spatial-channel attention for change detection in VHR remote sensed imagery. Pattern Recognit. 2023, 138, 109354.
- Zhou, J.; Zhang, R.; Zhao, W.; Shen, S.; Wang, N. APS-Net: An Adaptive Point Set Network for Optical Remote-Sensing Object Detection. IEEE Geosci. Remote Sens. Lett. 2022, 20, 1–5.
- Wang, C.; Sun, W.; Fan, D.; Liu, X.; Zhang, Z. Adaptive feature weighted fusion nested U-Net with discrete wavelet transform for change detection of high-resolution remote sensing images. Remote Sens. 2021, 13, 4971.
- Xu, R.; Tao, Y.; Lu, Z.; Zhong, Y. Attention-mechanism-containing neural networks for high-resolution remote sensing image classification. Remote Sens. 2018, 10, 1602.
- Liu, M.; Jiao, L.; Liu, X.; Li, L.; Liu, F.; Yang, S. C-CNN: Contourlet convolutional neural networks. IEEE Trans. Neural Netw. Learn. Syst. 2020, 32, 2636–2649.
- Hu, X.D.; Wang, X.Q.; Meng, F.J.; Hua, X.; Yan, Y.J.; Li, Y.Y.; Huang, J.; Jiang, X.L. Gabor-CNN for object detection based on small samples. Def. Technol. 2020, 16, 1116–1129.
- Chen, Y.; Zhu, L.; Ghamisi, P.; Jia, X.; Li, G.; Tang, L. Hyperspectral images classification with Gabor filtering and convolutional neural network. IEEE Geosci. Remote Sens. Lett. 2017, 14, 2355–2359.
- Zheng, S.; Wu, Z.; Xu, Y.; Wei, Z.; Plaza, A. Learning orientation information from frequency-domain for oriented object detection in remote sensing images. IEEE Trans. Geosci. Remote Sens. 2022, 60, 1–12.
- El-Khamy, S.E.; Al-Kabbany, A.; Shimaa, E.L.B. MLRS-CNN-DWTPL: A new enhanced multi-label remote sensing scene classification using deep neural networks with wavelet pooling layers. In Proceedings of the 2021 International Telecommunications Conference (ITC-Egypt), Alexandria, Egypt, 13–15 July 2021; IEEE: Piscataway Township, NJ, USA, 2021; pp. 1–5.
- El-Gayar, M.M. Automatic Generation of Image Caption Based on Semantic Relation using Deep Visual Attention Prediction. Int. J. Adv. Comput. Sci. Appl. 2023, 14.
- He, Y.; Zhou, S.; Quan, X. Remote Sensing Image Scene Classification Based on ECA Attention Mechanism Convolutional Neural Network. In Proceedings of the 2022 IEEE 4th International Conference on Civil Aviation Safety and Information Technology (ICCASIT), Dali, China, 12–14 October 2022; IEEE: Piscataway Township, NJ, USA, 2022; pp. 1265–1269.
- Tsourounis, D.; Kastaniotis, D.; Theoharatos, C.; Kazantzidis, A.; Economou, G. SIFT-CNN: When Convolutional Neural Networks Meet Dense SIFT Descriptors for Image and Sequence Classification. J. Imaging 2022, 8, 256.
- Li, J.; Wang, T.; Gao, M.; Zhu, A.; Shan, G.; Snoussi, H. Two Stream Neural Networks with Traditional CNN and Gabor CNN for Object Classification. In Proceedings of the 2018 37th Chinese Control Conference (CCC), Wuhan, China, 25–27 July 2018; IEEE: Piscataway Township, NJ, USA, 2018; pp. 9350–9355.
- Huo, C.; Zhou, Z.; Lu, H.; Pan, C.; Chen, K. Fast object-level change detection for VHR images. IEEE Geosci. Remote Sens. Lett. 2009, 7, 118–122.
- Mboga, N.; Georganos, S.; Grippa, T.; Lennert, M.; Vanhuysse, S.; Wolff, E. Fully convolutional networks and geographic object-based image analysis for the classification of VHR imagery. Remote Sens. 2019, 11, 597.
- Zhang, Y.; Zhou, P.; Ren, Y.; Zou, Z. GPU-accelerated large-size VHR images registration via coarse-to-fine matching. Comput. Geosci. 2014, 66, 54–65.
- Saha, S.; Bovolo, F.; Bruzzone, L. Unsupervised deep change vector analysis for multiple-change detection in VHR images. IEEE Trans. Geosci. Remote Sens. 2019, 57, 3677–3693.
- Chen, X.; Zhang, Q.; Han, J.; Han, X.; Liu, Y.; Fang, Y. Research progress of deep learning-based object detection of optical remote sensing image. J. Commun. 2022, 43, 190–203.
- Heitz, G.; Koller, D. Learning spatial context: Using stuff to find things. In Lecture Notes in Computer Science; Springer: Berlin/Heidelberg, Germany, 2008; pp. 30–43.
- Tanner, F.; Colder, B.; Pullen, C.; Heagy, D.; Eppolito, M.; Carlan, V.; Oertel, C.; Sallee, P. Overhead imagery research data set—An annotated data library & tools to aid in the development of computer vision algorithms. In Proceedings of the 2009 IEEE Applied Imagery Pattern Recognition Workshop (AIPR 2009), Washington, DC, USA, 14–16 October 2009; IEEE Press: Piscataway Township, NJ, USA, 2009; pp. 1–8.
- Maggiori, E.; Tarabalka, Y.; Charpiat, G.; Alliez, P. Can semantic labeling methods generalize to any city? The Inria aerial image labeling benchmark. In Proceedings of the 2017 IEEE International Geoscience and Remote Sensing Symposium, Fort Worth, TX, USA, 23–28 July 2017; IEEE Press: Piscataway Township, NJ, USA, 2017; pp. 3226–3229.
- Zhu, H.; Chen, X.; Dai, W.; Fu, K.; Ye, Q.; Jiao, J. Orientation robust object detection in aerial images using deep convolutional neural network. In Proceedings of the 2015 IEEE International Conference on Image Processing, Quebec City, QC, Canada, 27–30 September 2015; IEEE Press: Piscataway Township, NJ, USA, 2015; pp. 3735–3739.
- Cheng, G.; Han, J.; Zhou, P.; Guo, L. Multi-class geospatial object detection and geographic image classification based on collection of part detectors. ISPRS J. Photogramm. Remote Sens. 2014, 98, 119–132.
- Razakarivony, S.; Jurie, F. Vehicle detection in aerial imagery: A small object detection benchmark. J. Vis. Commun. Image Represent. 2016, 34, 187–203.
- Liu, Z.; Yuan, L.; Weng, L.; Yang, Y. A high resolution optical satellite image dataset for ship recognition and some new baselines. In Proceedings of the 6th International Conference on Pattern Recognition Applications and Methods, Porto, Portugal, 24–26 February 2017; SciTePress: Francisco, Italy, 2017; pp. 324–331.
- Liu, K.; Mattyus, G. Fast multiclass vehicle detection on aerial images. IEEE Geosci. Remote Sens. Lett. 2015, 12, 1938–1942.
- Long, Y.; Gong, Y.; Xiao, Z.; Liu, Q. Accurate object localization in remote sensing images based on convolutional neural networks. IEEE Trans. Geosci. Remote Sens. 2017, 55, 2486–2498.
- Zhang, Y.; Yuan, Y.; Feng, Y.; Lu, X. Hierarchical and robust convolutional neural network for very high-resolution remote sensing object detection. IEEE Trans. Geosci. Remote Sens. 2019, 57, 5535–5548.
- Zou, Z.X.; Shi, Z.W. Random access memories: A new paradigm for object detection in high resolution aerial remote sensing images. IEEE Trans. Image Process. 2018, 27, 1100–1111.
- Yang, Y.M. ITCVD Dataset[EB]. 2018. Available online: https://research.utwente.nl/en/datasets/itcvd-dataset (accessed on 28 November 2023).
- Li, K.; Wan, G.; Cheng, G.; Meng, L.; Han, J. Object detection in optical remote sensing images: A survey and a new benchmark. ISPRS J. Photogramm. Remote Sens. 2020, 159, 296–307. [Google Scholar] [CrossRef]
- Xia, G.S.; Bai, X.; Ding, J.; Zhu, Z.; Belongie, S.; Luo, J.; Datcu, M.; Pelillo, M.; Zhang, L. DOTA: A large-scale dataset for object detection in aerial images. In Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, 18–22 June 2018; IEEE Press: Piscataway Township, NJ, USA, 2018; pp. 3974–3983. [Google Scholar]
- Sun, X.; Wang, P.; Yan, Z.; Xu, F.; Wang, R.; Diao, W.; Chen, J.; Li, J.; Feng, Y.; Xu, T.; et al. FAIR1M: A benchmark dataset for fine-grained object recognition in high-resolution remote sensing imagery. ISPRS J. Photogramm. Remote Sens. 2022, 184, 116–130. [Google Scholar] [CrossRef]




| Version | Year | Characteristics | Weakness | Advantage |
|---|---|---|---|---|
| YOLOv1 [14] | 2015 | Fully convolutional network; highly real-time, simple, and effective. | Poor localization accuracy and poor detection of small objects. | Fast; easy to implement and deploy. |
| YOLOv2 [16] | 2016 | Uses anchor boxes, multi-scale prediction, and Darknet-19 as the backbone; significantly improved accuracy. | Detection of small objects remains difficult. | Fast, with high overall accuracy and good robustness. |
| YOLOv3 [17] | 2018 | Multi-scale prediction, FPN, and better object detection. | Relatively slow; needs more computing resources. | High detection accuracy, improved small object detection, and strong robustness. |
| YOLOv4 [18] | 2020 | Powerful detection performance, faster speed, higher accuracy, and the CSPDarknet53 network. | Requires more computational resources and has higher complexity. | Excellent detection accuracy, good detection of small and occluded objects, and strong robustness. |
| YOLOv5 [19] | 2020 | Lightweight network; fast, precise, and easy to train and deploy. | Relatively new; may still be unstable and leaves room for improvement. | High speed and accuracy, efficient training and deployment, multi-language support, and a smaller model size. |
| YOLOX [20] | 2021 | Decoupled head, anchor-free design, and SimOTA. | Currently, only 640 × 640 pre-trained weights are available. | Better performance at lower image resolutions. |
| YOLOv6 [21] | 2022 | Supports model training, inference, and multi-platform deployment; improves and optimizes the network structure and training strategy. | High false detection rate; version maintenance and updates are too slow; not suitable for industrial fields. | Targets full-chain industrial application requirements, with improvements and optimizations at the network structure and algorithm levels. |
| YOLOv7 [22] | 2023 | Better precision and speed, with high accuracy and fast detection. | The newer concepts may be difficult for some users to learn. | Improves speed while maintaining accuracy and can process video and images in real time. |
| YOLOv8 [23] | 2023 | Fast and efficient, with high accuracy; supports multi-category object detection and suits real-time applications. | Poor detection of small objects; requires a large amount of training data and training time. | Fast and accurate; fits real-time scenarios and supports multi-category object detection. |

| Literature | Paper Highlights | Applicability |
|---|---|---|
| Tang et al. [24] | YOLO-based moving object detection method for processing dual-beam SAR images. | Moving object detection in dual-beam SAR images. |
| Jindal et al. [25] | Uses the YOLOv5 architecture to detect aircraft in remote sensing images. | Aircraft detection tasks. |
| Shi et al. [26] | An improved YOLOv5 detects the wakes of underwater objects from multi-source images. | Underwater object wake detection; has some practical value. |
| Ding et al. [27] | Building detection in remote sensing images with high accuracy and robustness. | Building detection in remote sensing images for urban planning, resource management, and other fields. |
| Sun et al. [28] | Feature re-fusion extracts details and reduces false detections. | Fast and accurate object detection, providing important data support. |
| Wei et al. [29] | Bilateral attention effectively captures detailed information about small objects in remote sensing images. | Improving small object detection. |
| Ma et al. [30] | Optimizes the network structure and parameters to obtain a lightweight model. | Providing accurate object detection results. |
| Li et al. [31] | Employs data augmentation techniques. | Real-time monitoring of ships, improving collection efficiency and safety. |
| Hong et al. [32] | Multi-scale detection, improved network structure and loss function, etc. | Maritime traffic monitoring, maritime patrol and island defense, marine resource management, and other areas. |
| Yang et al. [33] | Introduces the GIoU metric into the loss function. | Aviation monitoring, military reconnaissance, disaster monitoring, and other fields. |
| Xin et al. [34] | Optimized for remote sensing image characteristics such as high resolution and complex backgrounds. | Large-scale, complex remote sensing image data; can detect multiple objects simultaneously. |
| Wang et al. [35] | A saliency adjustment mechanism weights the input image by saliency. | Efficient and accurate detection of ships in optical satellite images. |
| Zhang et al. [36] | Optimizes the network architecture, designs the loss functions, adjusts the prior boxes, etc. | Processing large-scale image data for real-time ship detection. |
| Zhang et al. [37] | Hybrid attention mechanism based on a similarity mask. | Efficient and accurate detection of small ships in optical remote sensing images. |
| Zhu et al. [38] | Introduces attention mechanisms, adapts feature fusion strategies, and optimizes feature propagation paths. | Object detection in remote sensing images. |
| Zhou et al. [39] | Introduces a multi-scale feature fusion mechanism. | Vehicle detection; combines multi-scale information to improve detection performance. |
| Liu et al. [40] | Training strategies and optimization methods specific to aircraft. | Aircraft detection and aircraft-specific problems. |
| Sharma et al. [41] | Designs a lightweight object detection model. | Real-time object detection; deployable on devices with restricted resources or limited computing power. |
| Wang et al. [42] | Adjusts the network structure and mines and fuses ship characteristics. | Ship detection in remote sensing images with high detection accuracy. |
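
The table above lists GIoU as the loss-function term used by Yang et al. [33]. As a rough illustration only, the following sketch computes the standard Generalized IoU for two axis-aligned boxes. The function names, the `(x1, y1, x2, y2)` box format, and the `1 - GIoU` loss form are assumptions made for this example and are not taken from [33].

```python
def giou(box_a, box_b):
    """Generalized IoU of two axis-aligned boxes given as (x1, y1, x2, y2).

    Assumes both boxes have positive area (x2 > x1 and y2 > y1).
    """
    ax1, ay1, ax2, ay2 = box_a
    bx1, by1, bx2, by2 = box_b

    area_a = (ax2 - ax1) * (ay2 - ay1)
    area_b = (bx2 - bx1) * (by2 - by1)

    # Intersection rectangle (width/height clamp to zero when boxes are disjoint).
    inter_w = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    inter_h = max(0.0, min(ay2, by2) - max(ay1, by1))
    inter = inter_w * inter_h

    union = area_a + area_b - inter
    iou = inter / union

    # Smallest axis-aligned box enclosing both inputs.
    enclose_w = max(ax2, bx2) - min(ax1, bx1)
    enclose_h = max(ay2, by2) - min(ay1, by1)
    enclose = enclose_w * enclose_h

    # GIoU subtracts the fraction of the enclosing box not covered by the union.
    return iou - (enclose - union) / enclose


def giou_loss(box_a, box_b):
    """Common loss form: 0 for identical boxes, approaching 2 for distant boxes."""
    return 1.0 - giou(box_a, box_b)
```

Unlike plain IoU, the value stays informative for non-overlapping boxes (it becomes negative and decreases as the boxes move apart), which is the usual motivation for using it as a regression loss.
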

| Literature | Paper Highlights | Applicability |
|---|---|---|
| Han et al. [85] | Context scale-aware detector; provides a new benchmark data set. | Detection of small, weak objects in UAV remote sensing images. |
| Dong et al. [86] | Emphasizes accurate preservation of building boundaries and uses a multi-scale strategy to improve detection accuracy and robustness. | Building detection in earthquake-disaster and other remote sensing images. |
| Zhang et al. [87] | An efficient multi-scale object detection method for aerial optical sensors. | Remote sensing image, aerial photography, UAV image analysis, and other fields. |
| Song et al. [88] | Innovative design with edge refinement and multi-feature fusion. | Bi-temporal remote sensing image change detection, surface environmental change monitoring, and other tasks. |
| Gao et al. [89] | Innovative design with scale perception and a global-to-local strategy. | Remote sensing object detection and multi-scale object detection tasks. |
| Chen et al. [90] | Innovative design with an information feature pyramid and feature selection based on information gain. | Remote sensing object detection and multi-scale object detection tasks. |
| Su et al. [91] | Multi-scale context-aware few-shot object detection. | Few-shot object detection in remote sensing images. |
| Dong et al. [92] | Uses a multi-scale deformable attention mechanism and multi-level feature aggregation to improve detection accuracy and robustness. | Remote sensing image object detection tasks. |
| Zhang et al. [93] | Introduces a multi-scale structural condition feature transformation and an attention module. | Remote sensing image object detection tasks. |
| Dong et al. [94] | Applies object-scale feature extraction and structural optimization. | High-resolution remote sensing image object detection. |
| Meng et al. [95] | Multivariate feature extraction and characterization optimization. | Multi-scale object detection in remote sensing images. |
| Yao [96] | Multi-scale feature fusion with a convolutional neural network. | Aircraft detection in remote sensing imagery. |
| Zhang [97] | Uses a task-specific detection network structure and algorithm optimization to detect dense small objects accurately. | Detection of dense small objects in remote sensing images. |
| Zhang [98] | Introduces an adaptive point set network and a point set modeling and matching method. | Object detection in optical remote sensing images. |
| Zhou [99] | Autonomous structure pyramid network and parallel space-channel attention mechanism. | Change detection in high-resolution remote sensing images. |

| Data Set | Publisher and Content Description | Number of Object Categories | Number of Images |
|---|---|---|---|
| TAS [116] | Vehicle detection data set published by Stanford University. | 1 | 30 |
| OIRDS [117] | Vehicle detection data set published by Raytheon Corporation. | 5 | 900 |
| SZTAKI [118] | Rotated building data set published by MTA SZTAKI. | 1 | 9 |
| UCAS-AOD [119] | Vehicle and aircraft data set, with background negative samples, published by CAS. | 2 | 976 |
| NWPU VHR-10 [120] | Data set of aircraft, ships, oil tanks, baseball diamonds, tennis courts, basketball courts, and other objects released by Northwestern Polytechnical University. | 10 | 1510 |
| VEDAI [121] | Vehicle detection data set published by the University of Caen. | 9 | 1210 |
| HRSC2016 [122] | Ship detection data set released by Northwestern Polytechnical University. | 1 | 1061 |
| DLR3k [123] | Vehicle data set published by the German Aerospace Center. | 7 | 20 |
| RSOD [124] | Data set of aircraft, oil tanks, stadiums, and overpasses released by Wuhan University. | 4 | 976 |
| TGRS-HRRSD [125] | Data set of ships, bridges, athletic fields, oil tanks, basketball courts, tennis courts, and other objects released by the Chinese Academy of Sciences. | 13 | 21,761 |
| LEVIR [126] | Data set of aircraft, ships, and oil tanks released by Beijing University of Aeronautics and Astronautics. | 3 | 22,000 |
| ITCVD [127] | Vehicle detection data set published by the University of Twente. | 1 | 135 |
| DIOR [128] | Data set of aircraft, airports, basketball courts, bridges, chimneys, dams, and other objects published by Northwestern Polytechnical University. | 20 | 23,463 |
| DOTA [129] | Data set of ships, swimming pools, ground track fields, ports, helicopters, football fields, and other objects released by Wuhan University. | 16 | 2806 |
| FAIR1M [130] | Data set published by the Chinese Academy of Sciences with 5 large categories and 37 fine-grained categories, such as aircraft, ships, vehicles, stadiums, and roads; the world's largest fine-grained object detection and recognition data set for optical remote sensing images. | 37 | 15,000 |

| Data Set | Algorithm | Backbone Network | Literature | Year | mAP/% |
|---|---|---|---|---|---|
| RSOD [124] | FPN-YOLO | DarkNet53 | Sun et al. [28] | 2021 | 87.40 |
| RSOD [124] | DAM-YOLOX | CSPDarkNet | Wei et al. [29] | 2023 | 93.90 |
| RSOD [124] | YOLOv5-DNA | CSPDarkNet | Xin et al. [34] | 2022 | 77.51 |
| RSOD [124] | DF-SSD | ResNet-50 | Qu et al. [45] | 2020 | 51.78 |
| RSOD [124] | SSOD-RS | ResNet-50 | Zhang et al. [73] | 2021 | 90.70 |
| RSOD [124] | RCNN-FCD | ResNet-50 | Su et al. [91] | 2022 | 96.60 |
| RSOD [124] | MLFAM | ResNet-50 | Dong et al. [92] | 2022 | 92.50 |
| RSOD [124] | I-SSD | VGG-16 | Liu et al. [47] | 2022 | 80.53 |
| RSOD [124] | AFF-SSD | VGG-16 | Yin et al. [55] | 2022 | 75.19 |
| DOTA [129] | GSC-YOLO | CSPDarkNet | Ma et al. [30] | 2022 | 93.44 |
| DOTA [129] | YOLOv4-CD | CSPDarkNet | Zhu et al. [38] | 2023 | 90.88 |
| DOTA [129] | SAHR-CapsNet | CSPDarkNet | Yu et al. [50] | 2021 | 93.04 |
| DOTA [129] | AF-SSD | ResNet-50 | Lu et al. [44] | 2021 | 52.60 |
| DOTA [129] | FFC-SSD | ResNet-50 | Xue et al. [52] | 2022 | 74.90 |
| DOTA [129] | SOSA-FCN | ResNet-50 | Hua et al. [80] | 2020 | 95.25 |
| DOTA [129] | ATMTransformer | DETR | Zhang et al. [67] | 2022 | 77.30 |
| DOTA [129] | EMO2-DETR | DETR | Hu et al. [68] | 2023 | 70.91 |
| DOTA [129] | Info-FPN | FPN | Chen et al. [90] | 2023 | 75.84 |
| FAIR1M [130] | YOLM | CSPDarkNet | Liu et al. [40] | 2022 | 88.70 |
| NWPU VHR-10 [120] | AF-SSD | ResNet-50 | Lu et al. [44] | 2021 | 69.80 |
| NWPU VHR-10 [120] | DF-SSD | ResNet-50 | Qu et al. [45] | 2020 | 65.35 |
| NWPU VHR-10 [120] | FESSD | ResNet-50 | Shi et al. [54] | 2020 | 79.36 |
| NWPU VHR-10 [120] | MSCNN | ResNet-50 | Yao et al. [96] | 2019 | 96.00 |
| NWPU VHR-10 [120] | CenterNet | DLA-34 | Liu et al. [49] | 2020 | 95.70 |
| NWPU VHR-10 [120] | GCDN | ResNet-18 | Zhang et al. [82] | 2020 | 97.60 |
| DIOR [128] | RSADet | DLA-34 | Yu et al. [59] | 2021 | 72.20 |
| DIOR [128] | CenterNet | DLA-34 | Yu et al. [59] | 2021 | 69.40 |
| DIOR [128] | MLFAM | ResNet-50 | Dong et al. [92] | 2022 | 73.90 |
| DIOR [128] | MFC | MFE | Meng et al. [95] | 2023 | 70.90 |
| VEDAI [121] | YOLOFusion | CSPDarkNet | Qingyun [79] | 2022 | 78.60 |
| TGRS-HRRSD [125] | SOSA-FCN | ResNet-50 | Hua et al. [80] | 2020 | 97.25 |
| TGRS-HRRSD [125] | MSFT | SCFT | Zhang et al. [93] | 2021 | 86.33 |
| TGRS-HRRSD [125] | MFC | MFE | Meng et al. [95] | 2023 | 90.20 |
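
The mAP column above averages per-class average precision (AP). As a minimal sketch, assuming all-point interpolation of the precision-recall curve and assuming each detection has already been matched against the ground truth (for example with an IoU threshold), AP and mAP can be computed as follows. The function and parameter names are hypothetical, and the individual papers in the table may follow different evaluation protocols, so their values are not necessarily comparable with this exact procedure.

```python
def average_precision(scores, is_true_positive, num_ground_truth):
    """AP for one class using all-point interpolation.

    scores: confidence of each detection.
    is_true_positive: whether each detection matched a ground-truth object
        (the matching step itself is assumed to be done upstream).
    num_ground_truth: number of ground-truth objects of this class (> 0).
    """
    # Rank detections from most to least confident.
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)

    tp = 0
    fp = 0
    recalls = []
    precisions = []
    for i in order:
        if is_true_positive[i]:
            tp += 1
        else:
            fp += 1
        recalls.append(tp / num_ground_truth)
        precisions.append(tp / (tp + fp))

    # Precision envelope: make precision non-increasing as recall grows.
    for k in range(len(precisions) - 2, -1, -1):
        precisions[k] = max(precisions[k], precisions[k + 1])

    # Sum the area under the stepwise precision-recall curve.
    ap = 0.0
    previous_recall = 0.0
    for recall, precision in zip(recalls, precisions):
        ap += (recall - previous_recall) * precision
        previous_recall = recall
    return ap


def mean_average_precision(per_class_ap):
    """mAP in percent from a non-empty dict mapping class name to AP in [0, 1]."""
    return 100.0 * sum(per_class_ap.values()) / len(per_class_ap)
```

For example, three detections ranked true, false, true against two ground-truth objects give precision-recall points (0.5, 1.0), (0.5, 0.5), and (1.0, 2/3), so the interpolated AP is 0.5 × 1.0 + 0.5 × 2/3 ≈ 0.833.
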

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Bai, C.; Bai, X.; Wu, K. A Review: Remote Sensing Image Object Detection Algorithm Based on Deep Learning. Electronics 2023, 12, 4902. https://doi.org/10.3390/electronics12244902