Next Article in Journal
Enhancing Thermal Comfort and Efficiency in Fuel Cell Trucks: A Predictive Control Approach for Cabin Heating
Previous Article in Journal
Comparative Analysis of Efficiency and Harmonic Generation in Multiport Converters: Study of Two Operating Conditions
Previous Article in Special Issue
Research on Multi-Target Detection and Tracking of Intelligent Vehicles in Complex Traffic Environments Based on Deep Learning Theory
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Article

Robust 3D Object Detection in Complex Traffic via Unified Feature Alignment in Bird’s Eye View

1
College of Information Science and Technology, Beijing University of Chemical Technology, Beijing 100029, China
2
Tukrin Technology, Beijing 101300, China
*
Author to whom correspondence should be addressed.
World Electr. Veh. J. 2025, 16(10), 567; https://doi.org/10.3390/wevj16100567
Submission received: 21 August 2025 / Revised: 28 September 2025 / Accepted: 30 September 2025 / Published: 2 October 2025
(This article belongs to the Special Issue Recent Advances in Intelligent Vehicle)

Abstract

Reliable three-dimensional (3D) object detection is critical for intelligent vehicles to ensure safety in complex traffic environments, and recent progress in multi-modal sensor fusion, particularly between LiDAR and camera, has advanced environment perception in urban driving. However, existing approaches remain vulnerable to occlusions and dense traffic, where depth estimation errors, calibration deviations, and cross-modal misalignment are often exacerbated. To overcome these limitations, we propose BEVAlign, a local–global feature alignment framework designed to generate unified BEV representations from heterogeneous sensor modalities. The framework incorporates a Local Alignment (LA) module that enhances camera-to-BEV view transformation through graph-based neighbor modeling and dual-depth encoding, mitigating local misalignment from depth estimation errors. To further address global misalignment in BEV representations, we present the Global Alignment (GA) module comprising a bidirectional deformable cross-attention (BDCA) mechanism and CBR blocks. BDCA employs dual queries from LiDAR and camera to jointly predict spatial sampling offsets and aggregate features, enabling bidirectional alignment within the BEV domain. The stacked CBR blocks then refine and integrate the aligned features into unified BEV representations. Experiment on the nuScenes benchmark highlights the effectiveness of BEVAlign, which achieves 71.7% mAP, outperforming BEVFusion by 1.5%. Notably, it achieves strong performance on small and occluded objects, particularly in dense traffic scenarios. These findings provide a basis for advancing cooperative environment perception in next-generation intelligent vehicle systems.
Keywords: 3D object detection; multi-modal sensor fusion; unified BEV representations; feature alignment; autonomous driving 3D object detection; multi-modal sensor fusion; unified BEV representations; feature alignment; autonomous driving

Share and Cite

MDPI and ACS Style

Liu, A.; Zhang, Y.; Shi, H.; Chen, J. Robust 3D Object Detection in Complex Traffic via Unified Feature Alignment in Bird’s Eye View. World Electr. Veh. J. 2025, 16, 567. https://doi.org/10.3390/wevj16100567

AMA Style

Liu A, Zhang Y, Shi H, Chen J. Robust 3D Object Detection in Complex Traffic via Unified Feature Alignment in Bird’s Eye View. World Electric Vehicle Journal. 2025; 16(10):567. https://doi.org/10.3390/wevj16100567

Chicago/Turabian Style

Liu, Ajian, Yandi Zhang, Huichao Shi, and Juan Chen. 2025. "Robust 3D Object Detection in Complex Traffic via Unified Feature Alignment in Bird’s Eye View" World Electric Vehicle Journal 16, no. 10: 567. https://doi.org/10.3390/wevj16100567

APA Style

Liu, A., Zhang, Y., Shi, H., & Chen, J. (2025). Robust 3D Object Detection in Complex Traffic via Unified Feature Alignment in Bird’s Eye View. World Electric Vehicle Journal, 16(10), 567. https://doi.org/10.3390/wevj16100567

Article Metrics

Article metric data becomes available approximately 24 hours after publication online.
Back to TopTop