Next Article in Journal
Temporal and Spatial Analysis of Deformation Monitoring of the Ming Great Wall in Shanxi Province through InSAR
Next Article in Special Issue
Self-Learning Robot Autonomous Navigation with Deep Reinforcement Learning Techniques
Previous Article in Journal
A Novel CA-RegNet Model for Macau Wetlands Auto Segmentation Based on GF-2 Remote Sensing Images
Previous Article in Special Issue
Visual Odometry of a Low-Profile Pallet Robot Based on Ortho-Rectified Ground Plane Image from Fisheye Camera
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Advancing Image Object Detection: Enhanced Feature Pyramid Network and Gradient Density Loss for Improved Performance

1
School of Physics and Mechanical and Electrical Engineering, Longyan University, Longyan 364012, China
2
School of Software Engineering, Xi’an Jiaotong University, Xi’an 710049, China
3
Institute of Artificial Intelligence and Robotics, Xi’an Jiaotong University, Xi’an 710049, China
*
Author to whom correspondence should be addressed.
Appl. Sci. 2023, 13(22), 12174; https://doi.org/10.3390/app132212174
Submission received: 24 September 2023 / Revised: 2 November 2023 / Accepted: 3 November 2023 / Published: 9 November 2023
(This article belongs to the Special Issue Autonomous Vehicles and Robotics)

Abstract

In the era of artificial intelligence, the significance of images and videos as intuitive conveyors of information cannot be overstated. Computer vision techniques rooted in deep learning have revolutionized our ability to autonomously and accurately identify objects within visual media, making them a focal point of contemporary research. This study addresses the pivotal role of image object detection, particularly in the contexts of autonomous driving and security surveillance, by presenting an in-depth exploration of this field with a focus on enhancing the feature pyramid network. One of the key challenges in existing object detection methodologies lies in mitigating information loss caused by multi-scale feature fusion. To tackle this issue, we propose the enhanced feature pyramid, which adeptly amalgamates features extracted across different scales. This strategic enhancement effectively curbs information attrition across various layers, thereby strengthening the feature extraction capabilities of the foundational network. Furthermore, we confront the issue of excessive classification loss in image object detection tasks by introducing the gradient density loss function, designed to mitigate classification discrepancies. Empirical results unequivocally demonstrate the efficacy of our approach in enhancing the detection of multi-scale objects within images. When evaluated across benchmark datasets, including MS COCO 2017, MS COCO 2014, Pascal VOC 2007, and Pascal VOC 2012, our method achieves impressive average precision scores of 39.4%, 42.0%, 51.5%, and 49.9%, respectively. This performance clearly outperforms alternative state-of-the-art methods in the field. This research not only contributes to the evolving landscape of computer vision and object detection but also has practical implications for a wide range of applications, aligning with the transformative trends in the automotive industry and security technologies.
Keywords: deep learning; object detection; feature pyramid network; multi-scale features; gradient density loss deep learning; object detection; feature pyramid network; multi-scale features; gradient density loss

Share and Cite

MDPI and ACS Style

Wang, Y.; Wang, Q.; Zou, R.; Wen, F.; Liu, F.; Zhang, Y.; Du, S.; Zeng, W. Advancing Image Object Detection: Enhanced Feature Pyramid Network and Gradient Density Loss for Improved Performance. Appl. Sci. 2023, 13, 12174. https://doi.org/10.3390/app132212174

AMA Style

Wang Y, Wang Q, Zou R, Wen F, Liu F, Zhang Y, Du S, Zeng W. Advancing Image Object Detection: Enhanced Feature Pyramid Network and Gradient Density Loss for Improved Performance. Applied Sciences. 2023; 13(22):12174. https://doi.org/10.3390/app132212174

Chicago/Turabian Style

Wang, Ying, Qinghui Wang, Ruirui Zou, Falin Wen, Fenglin Liu, Yihang Zhang, Shaoyi Du, and Wei Zeng. 2023. "Advancing Image Object Detection: Enhanced Feature Pyramid Network and Gradient Density Loss for Improved Performance" Applied Sciences 13, no. 22: 12174. https://doi.org/10.3390/app132212174

APA Style

Wang, Y., Wang, Q., Zou, R., Wen, F., Liu, F., Zhang, Y., Du, S., & Zeng, W. (2023). Advancing Image Object Detection: Enhanced Feature Pyramid Network and Gradient Density Loss for Improved Performance. Applied Sciences, 13(22), 12174. https://doi.org/10.3390/app132212174

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop