MDPI - Publisher of Open Access Journals

20 pages, 2220 KB

Open AccessArticle

R2KAN-U-Net: A Novel Architecture Integrating Kolmogorov–Arnold Networks with Residual U-Net for Robust Traffic Sign Segmentation

by Taha Ben-Abbou, Houda El Omrani, Khalid El Fazazy, Mohamed Adnane Mahraz, Hamid Tairi and Jamal Riffi

Sensors 2026, 26(12), 3797; https://doi.org/10.3390/s26123797 (registering DOI) - 15 Jun 2026

Abstract

Traffic sign segmentation is a fundamental component of intelligent transportation systems and autonomous driving, where reliable pixel-level perception is required under challenging real-world conditions such as illumination variations, occlusion, scale diversity, and complex urban backgrounds. In this work, we propose Residual–Recurrent Kolmogorov–Arnold Network [...] Read more.

Traffic sign segmentation is a fundamental component of intelligent transportation systems and autonomous driving, where reliable pixel-level perception is required under challenging real-world conditions such as illumination variations, occlusion, scale diversity, and complex urban backgrounds. In this work, we propose Residual–Recurrent Kolmogorov–Arnold Network U-Net (R2KAN-U-Net), where “R2” denotes the integration of residual convolutional learning and recurrent KAN-based feature refinement. The proposed architecture combines residual U-Net feature extraction, multi-scale KAN fusion, and recurrent KAN refinement to improve pixel-level traffic sign segmentation under challenging road-scene conditions. The proposed framework integrates three complementary components: (1) residual convolutional blocks for stable feature propagation; (2) a multi-scale KAN fusion bottleneck for capturing contextual information at different receptive fields; and (3) recurrent KAN refinement modules for iterative enhancement of discriminative features. Unlike conventional convolutional architectures, the proposed KAN-based formulation replaces linear transformations with learnable univariate functions, enabling adaptive nonlinear feature modeling. We conduct extensive experiments on a custom dataset containing 9300 annotated urban traffic scene images, as well as on the ADE20K and Cityscapes benchmarks. On the custom dataset, the proposed R2KAN-U-Net achieved a Dice coefficient of 0.92 and an IoU score of 0.89, providing a strong accuracy–efficiency trade-off for traffic-sign foreground segmentation. It achieves competitive segmentation accuracy compared with recent CNN-, transformer-, and state-space-based segmentation models while using fewer parameters and lower computational cost. Additional low-light experiments demonstrate improved segmentation stability, with R2KAN-U-Net achieving the highest low-light Dice score of 0.88 and a competitive low-light IoU of 0.79. Furthermore, the proposed architecture maintains competitive computational efficiency with only 24 M parameters, 44.8 G FLOPs, and near-real-time inference at 13 ms per image. The experimental results demonstrate that integrating KAN-based function-space learning with residual and multi-scale feature refinement provides an effective and computationally efficient solution for robust traffic sign segmentation in complex driving environments. Full article

(This article belongs to the Section Sensors and Robotics)

► Show Figures

Figure 1

21 pages, 1572 KB

Open AccessArticle

Efficient Glare Suppression Network for Nighttime Images with Lightweight Parallel Attention and Ghost Convolution

by Ruoyu Yang, Huaixin Chen, Sijie Luo and Zhixi Wang

Sensors 2026, 26(12), 3773; https://doi.org/10.3390/s26123773 (registering DOI) - 12 Jun 2026

Viewed by 319

Abstract

Aiming at the problems of glare interference, local overexposure and detail loss caused by artificial light sources such as vehicle lamps and street lamps in nighttime road scenes, as well as the challenges of existing glare suppression models with large parameters, high computational [...] Read more.

Aiming at the problems of glare interference, local overexposure and detail loss caused by artificial light sources such as vehicle lamps and street lamps in nighttime road scenes, as well as the challenges of existing glare suppression models with large parameters, high computational complexity and difficulty in deploying on edge devices, this paper proposes a lightweight glare suppression network (LGSNet) based on ghost depthwise separable convolution and Lightweight Parallel Attention. Based on the U-Net architecture, the network introduces ghost depthwise separable convolution blocks (GhostDSC) in the encoder and decoder, which generates ghost features through cheap linear transformations by exploiting feature map redundancy, significantly reducing model parameters and computational costs while maintaining feature representation ability. Meanwhile, a Lightweight Parallel Attention (LPA) module is designed in the decoder stage, which integrates channel attention and pixel attention in parallel, enhancing the network’s attention to glare regions and edge details with extremely low parameter increment to improve detail recovery accuracy. In addition, a joint loss function consisting of background loss, glare loss and reconstruction loss is constructed to collaboratively optimize glare suppression and detail preservation. Experimental results on the public Flare7K++ dataset and the self-built nighttime road glare dataset NRGD show that the proposed method has only 7.45 M parameters, much lower than standard U-Net and Uformer. It achieves competitive results on full-reference metrics such as PSNR, SSIM, LPIPS and no-reference metrics such as NIQE, BRISQUE, PIQE, and can effectively suppress various types of glare interference and restore obscured scene details. It achieves a superior trade-off between model complexity and enhancement performance, significantly reducing the parameter count and computational overhead compared to heavy baselines, thereby offering a highly efficient solution for resource-aware glare suppression tasks. Full article

(This article belongs to the Section Intelligent Sensors)

19 pages, 762 KB

Open AccessArticle

TMRGBT-D2D: A Temporal Misaligned RGB-Thermal Dataset for Drone-to-Drone Target Detection

by Hexiang Hao, Yueping Peng, Zecong Ye, Baixuan Han, Wei Tang, Wenchao Kang, Xuekai Zhang, Qilong Li and Wenchao Liu

Drones 2025, 9(10), 694; https://doi.org/10.3390/drones9100694 - 10 Oct 2025

Cited by 4 | Viewed by 3491

Abstract

In the field of drone-to-drone detection tasks, the issue of fusing temporal information with infrared and visible light data for detection has been rarely studied. This paper presents the first temporal misaligned rgb-thermal dataset for drone-to-drone target detection, named TMRGBT-D2D. The dataset covers [...] Read more.

In the field of drone-to-drone detection tasks, the issue of fusing temporal information with infrared and visible light data for detection has been rarely studied. This paper presents the first temporal misaligned rgb-thermal dataset for drone-to-drone target detection, named TMRGBT-D2D. The dataset covers various lighting conditions (i.e., high-light scenes captured during the day, medium-light and low-light scenes captured at night, with night scenes accounting for 38.8% of all data), different scenes (sky, forests, buildings, construction sites, playgrounds, roads, etc.), different seasons, and different locations, consisting of a total of 42,624 images organized into sequential frames extracted from 19 RGB-T video pairs. Each frame in the dataset has been meticulously annotated, with a total of 94,323 annotations. Except for drones that cannot be identified under extreme conditions, infrared and visible light annotations are one-to-one corresponding. This dataset presents various challenges, including small object detection (the average size of objects in visible light images is approximately 0.02% of the image area), motion blur caused by fast movement, and detection issues arising from imaging differences between different modalities. To our knowledge, this is the first temporal misaligned rgb-thermal dataset for drone-to-drone target detection, providing convenience for research into rgb-thermal image fusion and the development of drone target detection. Full article

(This article belongs to the Special Issue Detection, Identification and Tracking of UAVs and Drones)

► Show Figures

Figure 1

28 pages, 6018 KB

Open AccessArticle

Analysis of Factors Influencing Driving Safety at Typical Curve Sections of Tibet Plateau Mountainous Areas Based on Explainability-Oriented Dynamic Ensemble Learning Strategy

by Xinhang Wu, Fei Chen, Wu Bo, Yicheng Shuai, Xue Zhang, Wa Da, Huijing Liu and Junhao Chen

Sustainability 2025, 17(17), 7820; https://doi.org/10.3390/su17177820 - 30 Aug 2025

Cited by 2 | Viewed by 1742

Abstract

The complex topography of China’s Tibetan Plateau mountainous roads, characterized by diverse curve types and frequent traffic accidents, significantly impacts the safety and sustainability of the transportation system. To enhance driving safety on these mountain roads and promote low-carbon, resilient transportation development, this [...] Read more.

The complex topography of China’s Tibetan Plateau mountainous roads, characterized by diverse curve types and frequent traffic accidents, significantly impacts the safety and sustainability of the transportation system. To enhance driving safety on these mountain roads and promote low-carbon, resilient transportation development, this study investigates the mechanisms through which different curve types affect driving safety and proposes optimization strategies based on interpretable machine learning methods. Focusing on three typical curve types in plateau regions, drone high-altitude photography was employed to capture footage of three specific curves along China’s National Highway G318. Oblique photography was utilized to acquire road environment information, from which 11 data indicators were extracted. Subsequently, 8 indicators, including cornering preference and vehicle type, were designated as explanatory variables, the curve type indicator was set as the dependent variable, and the remaining indicators were established as safety assessment indicators. Linear models (logistic regression, ridge regression) and non-linear models (Random Forest, LightGBM, XGBoost) were used to conduct model comparison and factor analysis. Ultimately, three non-linear models were selected, employing an explainability-oriented dynamic ensemble learning strategy (X-DEL) to evaluate the three curve types. The results indicate that non-linear models outperform linear models in terms of accuracy and scene adaptability. The explainability-oriented dynamic ensemble learning strategy (X-DEL) is beneficial for the construction of driving safety models and factor analysis on Tibetan Plateau mountainous roads. Furthermore, the contribution of indicators to driving safety varies across different curve types. This research not only deepens the scientific understanding of safety issues on plateau mountainous roads but, more importantly, its proposed solutions directly contribute to building safer, more efficient, and environmentally friendly transportation systems, thereby providing crucial impetus for sustainable transportation and high-quality regional development in the Tibetan Plateau. Full article

(This article belongs to the Special Issue Sustainable Transportation: Strategies for Enhancing Safety and Reliability)

► Show Figures

Figure 1

22 pages, 8901 KB

Open AccessArticle

D3Fusion: Decomposition–Disentanglement–Dynamic Compensation Framework for Infrared-Visible Image Fusion in Extreme Low-Light

by Wansi Yang, Yi Liu and Xiaotian Chen

Appl. Sci. 2025, 15(16), 8918; https://doi.org/10.3390/app15168918 - 13 Aug 2025

Cited by 4 | Viewed by 1632

Abstract

Infrared-visible image fusion quality is critical for nighttime perception in autonomous driving and surveillance but suffers severe degradation under extreme low-light conditions, including irreversible texture loss in visible images, thermal boundary diffusion artifacts, and overexposure under dynamic non-uniform illumination. To address these challenges, [...] Read more.

Infrared-visible image fusion quality is critical for nighttime perception in autonomous driving and surveillance but suffers severe degradation under extreme low-light conditions, including irreversible texture loss in visible images, thermal boundary diffusion artifacts, and overexposure under dynamic non-uniform illumination. To address these challenges, a Decomposition–Disentanglement–Dynamic Compensation framework, D3Fusion, is proposed. Firstly, a Retinex-inspired Decomposition Illumination Net (DIN) decomposes inputs into enhanced images and degradative illumination maps for joint low-light recovery. Secondly, an illumination-guided encoder and a multi-scale differential compensation decoder dynamically balance cross-modal features. Finally, a progressive three-stage training paradigm from illumination correction through feature disentanglement to adaptive fusion resolves optimization conflicts. Compared to State-of-the-Art methods, on the LLVIP, TNO, MSRS, and RoadScene datasets, D3Fusion achieves an average improvement of 1.59% in standard deviation (SD), 6.9% in spatial frequency (SF), 2.59% in edge intensity (EI), and 1.99% in visual information fidelity (VIF), demonstrating superior performance in extreme low-light scenarios. The framework effectively suppresses thermal diffusion artifacts while mitigating exposure imbalance, adaptively brightening scenes while preserving texture details in shadowed regions. This significantly improves fusion quality for nighttime images by enhancing salient information, establishing a robust solution for multimodal perception under illumination-critical conditions. Full article

(This article belongs to the Special Issue Big Data and Artificial Intelligence Applications in Infrared Thermography)

► Show Figures

Figure 1

20 pages, 4396 KB

Open AccessArticle

Squeeze-EnGAN: Memory Efficient and Unsupervised Low-Light Image Enhancement for Intelligent Vehicles

by Haegyo In, Juhum Kweon and Changjoo Moon

Sensors 2025, 25(6), 1825; https://doi.org/10.3390/s25061825 - 14 Mar 2025

Cited by 5 | Viewed by 1517

Abstract

Intelligent vehicles, such as autonomous cars, drones, and robots, rely on sensors to gather environmental information and respond accordingly. RGB cameras are commonly used due to their low cost and high resolution but are limited in low-light conditions. While employing LiDAR or specialized [...] Read more.

Intelligent vehicles, such as autonomous cars, drones, and robots, rely on sensors to gather environmental information and respond accordingly. RGB cameras are commonly used due to their low cost and high resolution but are limited in low-light conditions. While employing LiDAR or specialized cameras can address this issue, these solutions often incur high costs. Deep learning-based low-light image enhancement (LLIE) methods offer an alternative, but existing models struggle to adapt to road scenes. Furthermore, most LLIE models rely on supervised training but are heavily constrained by the lack of low-light and normal-light paired datasets. In particular, obtaining paired datasets for driving scenes is extremely challenging. To address these issues, this paper proposes Squeeze-EnGAN, a memory-efficient, GAN-based LLIE method capable of unsupervised learning without paired image datasets. Squeeze-EnGAN incorporates a fire module into a U-net architecture, substantially reducing the number of parameters and Multiply-Accumulate Operations (MACs) compared to its base model, EnlightenGAN. Additionally, Squeeze-EnGAN achieves real-time performance on devices like Jetson Xavier (0.061 s). Significantly, enhanced images improve object detection performance over original images, demonstrating the model’s potential to aid high-level vision tasks in intelligent vehicles. Full article

(This article belongs to the Section Vehicular Sensing)

► Show Figures

Figure 1

25 pages, 6736 KB

Open AccessArticle

LFIR-YOLO: Lightweight Model for Infrared Vehicle and Pedestrian Detection

by Quan Wang, Fengyuan Liu, Yi Cao, Farhan Ullah and Muxiong Zhou

Sensors 2024, 24(20), 6609; https://doi.org/10.3390/s24206609 - 14 Oct 2024

Cited by 26 | Viewed by 6126

Abstract

The complexity of urban road scenes at night and the inadequacy of visible light imaging in such conditions pose significant challenges. To address the issues of insufficient color information, texture detail, and low spatial resolution in infrared imagery, we propose an enhanced infrared [...] Read more.

The complexity of urban road scenes at night and the inadequacy of visible light imaging in such conditions pose significant challenges. To address the issues of insufficient color information, texture detail, and low spatial resolution in infrared imagery, we propose an enhanced infrared detection model called LFIR-YOLO, which is built upon the YOLOv8 architecture. The primary goal is to improve the accuracy of infrared target detection in nighttime traffic scenarios while meeting practical deployment requirements. First, to address challenges such as limited contrast and occlusion noise in infrared images, the C2f module in the high-level backbone network is augmented with a Dilation-wise Residual (DWR) module, incorporating multi-scale infrared contextual information to enhance feature extraction capabilities. Secondly, at the neck of the network, a Content-guided Attention (CGA) mechanism is applied to fuse features and re-modulate both initial and advanced features, catering to the low signal-to-noise ratio and sparse detail features characteristic of infrared images. Third, a shared convolution strategy is employed in the detection head, replacing the decoupled head strategy and utilizing shared Detail Enhancement Convolution (DEConv) and Group Norm (GN) operations to achieve lightweight yet precise improvements. Finally, loss functions, PIoU v2 and Adaptive Threshold Focal Loss (ATFL), are integrated into the model to better decouple infrared targets from the background and to enhance convergence speed. The experimental results on the FLIR and multispectral datasets show that the proposed LFIR-YOLO model achieves an improvement in detection accuracy of 4.3% and 2.6%, respectively, compared to the YOLOv8 model. Furthermore, the model demonstrates a reduction in parameters and computational complexity by 15.5% and 34%, respectively, enhancing its suitability for real-time deployment on resource-constrained edge devices. Full article

(This article belongs to the Section Sensing and Imaging)

► Show Figures

Figure 1

18 pages, 5148 KB

Open AccessArticle

Effects of Titanium Dioxide (TiO₂) on Physico-Chemical Properties of Low-Density Polyethylene

by Peter P. Ndibewu, Tina E. Lefakane and Taki E. Netshiozwi

Polymers 2024, 16(19), 2788; https://doi.org/10.3390/polym16192788 - 1 Oct 2024

Cited by 8 | Viewed by 5471

Abstract

Hazardous chemicals are transported on rail and road networks. In the case of accidental spillage or terror attack, civilian and military first responders must approach the scene equipped with appropriate personal protective equipment. The plausible manufacturing of chemical protective polymer material, from photocatalyst [...] Read more.

Hazardous chemicals are transported on rail and road networks. In the case of accidental spillage or terror attack, civilian and military first responders must approach the scene equipped with appropriate personal protective equipment. The plausible manufacturing of chemical protective polymer material, from photocatalyst anatase titanium dioxide (TiO₂) doped low-density polyethylene (LDPE), for cost-effective durable lightweight protective garments against toxic chemicals such as 2-chloroethyl ethyl sulphide (CEES) was investigated. The photocatalytic effects on the physico-chemical properties, before and after ultraviolet (UV) light exposure were evaluated. TiO₂ (0, 5, 10, 15% wt) doped LDPE films were extruded and characterized by SEM-EDX, TEM, tensile tester, DSC-TGA and permeation studies before and after exposure to UV light, respectively. Results revealed that tensile strength and thermal analysis showed an increasing shift, whilst CEES permeation times responded oppositely with a significant decrease from 127 min to 84 min due to the degradation of the polymer matrix for neat LDPE, before and after UV exposure. The TiO₂-doped films showed an increasing shift in results obtained for physical properties as the doping concentration increased, before and after UV exposure. Relating to chemical properties, the trend was the inverse of the physical properties. The 15% TiO₂-doped film showed improved permeation times only when the photocatalytic TiO₂ was activated. However, 5% TiO₂-doped film exceptionally maintained better permeation times before and after UV exposure demonstrating better resistance against CEES permeation. Full article

(This article belongs to the Topic Advanced Polymeric Composites: Processing, Characterization and Mechanical Behavior)

► Show Figures

Figure 1

19 pages, 17909 KB

Open AccessArticle

Nighttime Pothole Detection: A Benchmark

by Min Ling, Quanjun Shi, Xin Zhao, Wenzheng Chen, Wei Wei, Kai Xiao, Zeyu Yang, Hao Zhang, Shuiwang Li, Chenchen Lu and Yufan Zeng

Electronics 2024, 13(19), 3790; https://doi.org/10.3390/electronics13193790 - 24 Sep 2024

Cited by 9 | Viewed by 5543

Abstract

In the field of computer vision, the detection of road potholes at night represents a critical challenge in enhancing the safety of intelligent transportation systems. Ensuring road safety is of paramount importance, particularly in promptly repairing pothole issues. These abrupt road depressions can [...] Read more.

In the field of computer vision, the detection of road potholes at night represents a critical challenge in enhancing the safety of intelligent transportation systems. Ensuring road safety is of paramount importance, particularly in promptly repairing pothole issues. These abrupt road depressions can easily lead to vehicle skidding, loss of control, and even traffic accidents, especially when water has pooled in or submerged the potholes. Therefore, the detection and recognition of road potholes can significantly reduce vehicle damage and the incidence of safety incidents. However, research on road pothole detection lacks high-quality annotated datasets, particularly under low-light conditions at night. To address this issue, this study introduces a novel Nighttime Pothole Dataset (NPD), independently collected and comprising 3831 images that capture diverse scene variations. The construction of this dataset aims to counteract the insufficiency of existing data resources and strives to provide a richer and more realistic benchmark. Additionally, we develop a baseline detector, termed WT-YOLOv8, for the proposed dataset, based on YOLOv8. We also evaluate the performance of the improved WT-YOLOv8 method and eight state-of-the-art object detection methods on the NPD and the COCO dataset. The experimental results on the NPD demonstrate that WT-YOLOv8 achieves a 2.3% improvement in mean Average Precision (mAP) over YOLOv8. In terms of the key metrics—AP@0.5 and AP@0.75—it shows enhancements of 1.5% and 2.8%, respectively, compared to YOLOv8. The experimental results provide valuable insights into each method’s strengths and weaknesses under low-light conditions. This analysis highlights the importance of a specialized dataset for nighttime pothole detection and shows variations in accuracy and robustness among methods, emphasizing the need for improved nighttime pothole detection techniques. The introduction of the NPD is expected to stimulate further research, encouraging the development of advanced algorithms for nighttime pothole detection, ultimately leading to more flexible and reliable road maintenance and road safety. Full article

(This article belongs to the Special Issue New Trends in AI-Assisted Computer Vision)

► Show Figures

Figure 1

27 pages, 10017 KB

Open AccessArticle

A Self-Adaptive Automatic Incident Detection System for Road Surveillance Based on Deep Learning

by César Bartolomé-Hornillos, Luis M. San-José-Revuelta, Javier M. Aguiar-Pérez, Carlos García-Serrada, Eduardo Vara-Pazos and Pablo Casaseca-de-la-Higuera

Sensors 2024, 24(6), 1822; https://doi.org/10.3390/s24061822 - 12 Mar 2024

Cited by 3 | Viewed by 2894

Abstract

We present an automatic road incident detector characterised by a low computational complexity for easy implementation in affordable devices, automatic adaptability to changes in scenery and road conditions, and automatic detection of the most common incidents (vehicles with abnormal speed, pedestrians or objects [...] Read more.

We present an automatic road incident detector characterised by a low computational complexity for easy implementation in affordable devices, automatic adaptability to changes in scenery and road conditions, and automatic detection of the most common incidents (vehicles with abnormal speed, pedestrians or objects falling on the road, vehicles stopped on the shoulder, and detection of kamikaze vehicles). To achieve these goals, different tasks have been addressed: lane segmentation, identification of traffic directions, and elimination of unnecessary objects in the foreground. The proposed system has been tested on a collection of videos recorded in real scenarios with real traffic, including areas with different lighting. Self-adaptability (plug and play) to different scenarios has been tested using videos with significant scene changes. The achieved system can process a minimum of 80 video frames within the camera’s field of view, covering a distance of 400 m, all within a span of 12 s. This capability ensures that vehicles travelling at speeds of 120 km/h are seamlessly detected with more than enough margin. Additionally, our analysis has revealed a substantial improvement in incident detection with respect to previous approaches. Specifically, an increase in accuracy of 2–5% in automatic mode and 2–7% in semi-automatic mode. The proposed classifier module only needs 2.3 MBytes of GPU to carry out the inference, thus allowing implementation in low-cost devices. Full article

(This article belongs to the Section Vehicular Sensing)

► Show Figures

Graphical abstract

16 pages, 12821 KB

Open AccessCommunication

Fast Automatic Registration of UAV Images via Bidirectional Matching

by Xin Luo, Zuqi Wei, Yuwei Jin, Xiao Wang, Peng Lin, Xufeng Wei and Wenjian Zhou

Sensors 2023, 23(20), 8566; https://doi.org/10.3390/s23208566 - 18 Oct 2023

Cited by 8 | Viewed by 2443

Abstract

Image registration plays a vital role in the mosaic process of multiple UAV (Unmanned Aerial Vehicle) images acquired from different spatial positions of the same scene. Aimed at the problem that many fast registration methods cannot provide both high speed and accuracy simultaneously [...] Read more.

Image registration plays a vital role in the mosaic process of multiple UAV (Unmanned Aerial Vehicle) images acquired from different spatial positions of the same scene. Aimed at the problem that many fast registration methods cannot provide both high speed and accuracy simultaneously for UAV visible light images, this work proposes a novel registration framework based on a popular baseline registration algorithm, ORB—the Oriented FAST (Features from Accelerated Segment Test) and Rotated BRIEF (Binary Robust Independent Elemental Features) algorithm. First, the ORB algorithm is utilized to extract image feature points fast. On this basis, two bidirectional matching strategies are presented to match obtained feature points. Then, the PROSRC (Progressive Sample Consensus) algorithm is applied to remove false matches. Finally, the experiments are carried out on UAV image pairs about different scenes including urban, road, building, farmland, and forest. Compared with the original version and other state-of-the-art registration methods, the bi-matching ORB algorithm exhibits higher accuracy and faster speed without any training or prior knowledge. Meanwhile, its complexity is quite low for on-board realization. Full article

(This article belongs to the Section Sensing and Imaging)

► Show Figures

Figure 1

22 pages, 17114 KB

Open AccessArticle

Radar Timing Range–Doppler Spectral Target Detection Based on Attention ConvLSTM in Traffic Scenes

by Fengde Jia, Jihong Tan, Xiaochen Lu and Junhui Qian

Remote Sens. 2023, 15(17), 4150; https://doi.org/10.3390/rs15174150 - 24 Aug 2023

Cited by 19 | Viewed by 6380

Abstract

With the development of autonomous driving and the emergence of various intelligent traffic scenarios, object detection technology based on deep learning is more and more widely applied to real traffic scenarios. Commonly used detection devices include LiDAR and cameras. Since the implementation of [...] Read more.

With the development of autonomous driving and the emergence of various intelligent traffic scenarios, object detection technology based on deep learning is more and more widely applied to real traffic scenarios. Commonly used detection devices include LiDAR and cameras. Since the implementation of traffic scene target detection technology requires mass production, the advantages of millimeter-wave radar have emerged, such as low cost and no interference from the external environment. The performance of LiDAR and cameras is greatly reduced due to their sensitivity to light, which affects target detection at night and in bad weather. However, millimeter-wave radar can overcome the influence of these harsh environments and has a great auxiliary effect on safe driving on the road. In this work, we propose a deep-learning-based object detection method considering the radar range–Doppler spectrum in traffic scenarios. The algorithm uses YOLOv8 as the basic architecture, makes full use of the time series characteristics of range–Doppler spectrum data in traffic scenarios, introduces the ConvLSTM network, and exerts the ability to process time series data. In order to improve the model’s ability to detect small objects, an efficient and lightweight Efficient Channel Attention (ECA) module is introduced. Through extensive experiments, our model shows better performance on two publicly available radar datasets, CARRADA and RADDet, compared to other state-of-the-art methods. Compared with other mainstream methods that can only achieve 30–60% mAP performance when the IOU is 0.3, our model can achieve 74.51% and 75.62% on the RADDet and CARRADA datasets, respectively, and has better robustness and generalization ability. Full article

(This article belongs to the Topic Information Sensing Technology for Intelligent/Driverless Vehicle, 2nd Edition)

► Show Figures

Graphical abstract

16 pages, 5058 KB

Open AccessArticle

Combining Low-Light Scene Enhancement for Fast and Accurate Lane Detection

by Changshuo Ke, Zhijie Xu, Jianqin Zhang and Dongmei Zhang

Sensors 2023, 23(10), 4917; https://doi.org/10.3390/s23104917 - 19 May 2023

Cited by 8 | Viewed by 3217

Abstract

Lane detection is a crucial task in the field of autonomous driving, as it enables vehicles to safely navigate on the road by interpreting the high-level semantics of traffic signs. Unfortunately, lane detection is a challenging problem due to factors such as low-light [...] Read more.

Lane detection is a crucial task in the field of autonomous driving, as it enables vehicles to safely navigate on the road by interpreting the high-level semantics of traffic signs. Unfortunately, lane detection is a challenging problem due to factors such as low-light conditions, occlusions, and lane line blurring. These factors increase the perplexity and indeterminacy of the lane features, making them hard to distinguish and segment. To tackle these challenges, we propose a method called low-light enhancement fast lane detection (LLFLD) that integrates the automatic low-light scene enhancement network (ALLE) with the lane detection network to improve lane detection performance under low-light conditions. Specifically, we first utilize the ALLE network to enhance the input image’s brightness and contrast while reducing excessive noise and color distortion. Then, we introduce symmetric feature flipping module (SFFM) and channel fusion self-attention mechanism (CFSAT) to the model, which refine the low-level features and utilize more abundant global contextual information, respectively. Moreover, we devise a novel structural loss function that leverages the inherent prior geometric constraints of lanes to optimize the detection results. We evaluate our method on the CULane dataset, a public benchmark for lane detection in various lighting conditions. Our experiments show that our approach surpasses other state of the arts in both daytime and nighttime settings, especially in low-light scenarios. Full article

(This article belongs to the Section Vehicular Sensing)

► Show Figures

Figure 1

21 pages, 13010 KB

Open AccessArticle

Mapping with Monocular Camera Sensor under Adversarial Illumination for Intelligent Vehicles

by Wei Tian, Yongkun Wen and Xinning Chu

Sensors 2023, 23(6), 3296; https://doi.org/10.3390/s23063296 - 21 Mar 2023

Cited by 2 | Viewed by 3177

Abstract

High-precision maps are widely applied in intelligent-driving vehicles for localization and planning tasks. The vision sensor, especially monocular cameras, has become favoured in mapping approaches due to its high flexibility and low cost. However, monocular visual mapping suffers from great performance degradation in [...] Read more.

High-precision maps are widely applied in intelligent-driving vehicles for localization and planning tasks. The vision sensor, especially monocular cameras, has become favoured in mapping approaches due to its high flexibility and low cost. However, monocular visual mapping suffers from great performance degradation in adversarial illumination environments such as on low-light roads or in underground spaces. To address this issue, in this paper, we first introduce an unsupervised learning approach to improve keypoint detection and description on monocular camera images. By emphasizing the consistency between feature points in the learning loss, visual features in dim environment can be better extracted. Second, to suppress the scale drift in monocular visual mapping, a robust loop-closure detection scheme is presented, which integrates both feature-point verification and multi-grained image similarity measurements. With experiments on public benchmarks, our keypoint detection approach is proven robust against varied illumination. With scenario tests including both underground and on-road driving, we demonstrate that our approach is able to reduce the scale drift in reconstructing the scene and achieve a mapping accuracy gain of up to 0.14 m in textureless or low-illumination environments. Full article

(This article belongs to the Section Vehicular Sensing)

► Show Figures

Figure 1

20 pages, 7229 KB

Open AccessArticle

Classification of Multispectral Airborne LiDAR Data Using Geometric and Radiometric Information

by Salem Morsy, Ahmed Shaker and Ahmed El-Rabbany

Geomatics 2022, 2(3), 370-389; https://doi.org/10.3390/geomatics2030021 - 9 Sep 2022

Cited by 8 | Viewed by 3617

Abstract

Classification of airborne light detection and ranging (LiDAR) point cloud is still challenging due to the irregular point cloud distribution, relatively low point density, and the complex urban scenes being observed. The availability of multispectral LiDAR systems allows for acquiring data at different [...] Read more.

Classification of airborne light detection and ranging (LiDAR) point cloud is still challenging due to the irregular point cloud distribution, relatively low point density, and the complex urban scenes being observed. The availability of multispectral LiDAR systems allows for acquiring data at different wavelengths with a variety of spectral information from land objects. In this research, a rule-based point classification method of three levels for multispectral airborne LiDAR data covering urban areas is presented. The first level includes ground filtering, which attempts to distinguish aboveground from ground points. The second level aims to divide the aboveground and ground points into buildings, trees, roads, or grass using three spectral indices, namely normalized difference feature indices (NDFIs). A multivariate Gaussian decomposition is then used to divide the NDFIs’ histograms into the aforementioned four classes. The third level aims to label more classes based on their spectral information such as power lines, types of trees, and swimming pools. Two data subsets were tested, which represent different complexity of urban scenes in Oshawa, Ontario, Canada. It is shown that the proposed method achieved an overall accuracy up to 93%, which is increased to over 98% by considering the spatial coherence of the point cloud. Full article

► Show Figures

Figure 1

Search Results (23)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Saved Queries

Search Filter Reset All

Years

Feature Papers

Subjects

Journals

Article Types

Countries / Regions

Search Results (23)

Further Information

Guidelines

MDPI Initiatives

Follow MDPI