A Reference-Free Lens-Flare-Aware Detector for Autonomous Driving
Abstract
1. Introduction
- We propose a lightweight, reference-free model to estimate the “impact of lens flare” on detected objects, providing an efficient solution suitable for real-world applications;
- To develop the reference-free model, we employ a teacher–student training framework in which a reference-based teacher network guides the student model’s learning process;
- Our proposed method is detector-agnostic and can be seamlessly integrated into existing object detection frameworks to enhance performance in flare-affected scenarios with minimal training overhead;
- Unlike previous work relying on non-learnable pixel-level metrics [4], we introduce an end-to-end approach using a lightweight Convolutional Neural Network (CNN) to quantify the “impact of lens flare” and a three-layer Multi-Layer Perceptron (MLP) to optimize the detection via a Log-Likelihood Ratio (LLR) loss.
2. Background
2.1. Lens Flare and Related Work
2.2. Object Detection in Adverse Conditions
3. Methodology
3.1. Theoretical Foundation
3.2. “Impact of Lens Flare” Estimation
3.3. Loss Function
3.3.1. LLR Loss
3.3.2. Cross-Model Loss
4. Experimental Setup and Implementation
4.1. Datasets
4.1.1. Lens Flare Datasets
4.1.2. Autonomous Driving Datasets
4.2. Implementation Details
4.2.1. Estimation of “Impact of Lens Flare” (Reference-Based)
4.2.2. Estimation of “Impact of Lens Flare” (Reference-Free)
4.2.3. Prediction of LLR
4.3. Training Pipeline
| Algorithm 1: Training Pipeline |
|
5. Result and Discussion
5.1. Average Precision
5.2. Time Efficiency
5.3. Ablation Study
5.4. Generalization Across Diverse Detection Architectures
5.5. Evaluation of Real-World Generalization
5.6. Limitation
6. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Crayton, T.J.; Meier, B.M. Autonomous vehicles: Developing a public health research agenda to frame the future of transportation policy. J. Transp. Health 2017, 6, 245–252. [Google Scholar] [CrossRef]
- Karangwa, J.; Liu, J.; Zeng, Z. Vehicle Detection for Autonomous Driving: A Review of Algorithms and Datasets. IEEE Trans. Intell. Transp. Syst. 2023, 24, 11568–11594. [Google Scholar] [CrossRef]
- IEEE P2020 Working Group. IEEE P2020 Automotive Imaging White Paper; IEEE: New York, NY, USA, 2018. [Google Scholar]
- Ma, S.; Aelterman, J. Lens Flare-Aware Detector in Autonomous Driving. In Proceedings of the VISIGRAPP (2): VISAPP, 2024; SciTePress: Setúbal, Portugal, 2024; pp. 341–348. [Google Scholar]
- Wu, Y.; He, Q.; Xue, T.; Garg, R.; Chen, J.; Veeraraghavan, A.; Barron, J.T. How to train neural networks for flare removal. In Proceedings of the IEEE/CVF International Conference on Computer Vision; IEEE: New York, NY, USA, 2021; pp. 2239–2247. [Google Scholar]
- Talvala, E.V.; Adams, A.; Horowitz, M.; Levoy, M. Veiling glare in high dynamic range imaging. ACM Trans. Graph. (TOG) 2007, 26, 37-es. [Google Scholar] [CrossRef]
- Dai, Y.; Li, C.; Zhou, S.; Feng, R.; Loy, C.C. Flare7k: A phenomenological nighttime flare removal dataset. Adv. Neural Inf. Process. Syst. 2022, 35, 3926–3937. [Google Scholar]
- Dai, Y.; Li, C.; Zhou, S.; Feng, R.; Luo, Y.; Loy, C.C. Flare7k++: Mixing synthetic and real datasets for nighttime flare removal and beyond. IEEE Trans. Pattern Anal. Mach. Intell. 2024, 46, 7041–7055. [Google Scholar] [CrossRef] [PubMed]
- Deng, H.; Li, L.; Zhang, F.; Li, Z.; Xu, B.; Lu, Q.; Gao, C.; Sang, N. Towards Blind Flare Removal Using Knowledge-driven Flare-level Estimator. IEEE Trans. Image Process. 2024, 33, 6114–6128. [Google Scholar] [CrossRef]
- Kotp, Y.; Torki, M. Flare-Free Vision: Empowering Uformer with Depth Insights. In Proceedings of the ICASSP 2024–2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP); IEEE: New York, NY, USA, 2024; pp. 2565–2569. [Google Scholar]
- Jin, Z.; Chen, S.; Feng, H.; Xu, Z.; Chen, Y. Toward Real Flare Removal: A Comprehensive Pipeline and A New Benchmark. arXiv 2023, arXiv:2306.15884. [Google Scholar] [CrossRef]
- Zhou, Y.; Liang, D.; Chen, S.; Huang, S.J.; Yang, S.; Li, C. Improving lens flare removal with general-purpose pipeline and multiple light sources recovery. In Proceedings of the IEEE/CVF International Conference on Computer Vision; IEEE: New York, NY, USA, 2023; pp. 12969–12979. [Google Scholar]
- Dai, Y.; Luo, Y.; Zhou, S.; Li, C.; Loy, C.C. Nighttime smartphone reflective flare removal using optical center symmetry prior. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition; IEEE: New York, NY, USA, 2023; pp. 20783–20791. [Google Scholar]
- Yi, W.; Wang, L. Robust Object Detection for Autonomous Driving in Adverse Weather Conditions With Multi-Scale Feature Enhancement. IEEE Open. J. Intell. Transp. Syst. 2025, 6, 1459–1475. [Google Scholar] [CrossRef]
- Girshick, R.; Donahue, J.; Darrell, T.; Malik, J. Region-based convolutional networks for accurate object detection and segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 2015, 38, 142–158. [Google Scholar] [CrossRef]
- Girshick, R. Fast r-cnn. In Proceedings of the IEEE International Conference on Computer Vision; IEEE: New York, NY, USA, 2015; pp. 1440–1448. [Google Scholar]
- Ren, S.; He, K.; Girshick, R.; Sun, J. Faster r-cnn: Towards real-time object detection with region proposal networks. Adv. Neural Inf. Process. Syst. 2015, 28, 91–99. [Google Scholar] [CrossRef]
- Redmon, J.; Divvala, S.; Girshick, R.; Farhadi, A. You only look once: Unified, real-time object detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition; IEEE: New York, NY, USA, 2016; pp. 779–788. [Google Scholar]
- Jocher, G. Ultralytics YOLOv5. 2020. Available online: https://github.com/ultralytics/yolov5 (accessed on 20 November 2022). Version 7.0, License: AGPL-3.0. [CrossRef]
- Yaseen, M. What is yolov9: An in-depth exploration of the internal features of the next-generation object detector. arXiv 2024, arXiv:2409.07813. [Google Scholar]
- Ge, Z. Yolox: Exceeding yolo series in 2021. arXiv 2021, arXiv:2107.08430. [Google Scholar] [CrossRef]
- Carion, N.; Massa, F.; Synnaeve, G.; Usunier, N.; Kirillov, A.; Zagoruyko, S. End-to-End Object Detection with Transformers. arXiv 2020, arXiv:2005.12872. [Google Scholar]
- Dosovitskiy, A.; Beyer, L.; Kolesnikov, A.; Weissenborn, D.; Zhai, X.; Unterthiner, T.; Dehghani, M.; Minderer, M.; Heigold, G.; Gelly, S.; et al. An image is worth 16 × 16 words: Transformers for image recognition at scale. arXiv 2020, arXiv:2010.11929. [Google Scholar]
- Zhu, X.; Su, W.; Lu, L.; Li, B.; Wang, X.; Dai, J. Deformable detr: Deformable transformers for end-to-end object detection. arXiv 2020, arXiv:2010.04159. [Google Scholar]
- Roh, B.; Shin, J.; Shin, W.; Kim, S. Sparse detr: Efficient end-to-end object detection with learnable sparsity. arXiv 2021, arXiv:2111.14330. [Google Scholar]
- Wang, Y.; Zhang, X.; Yang, T.; Sun, J. Anchor detr: Query design for transformer-based detector. Proc. Proc. AAAI Conf. Artif. Intell. 2022, 36, 2567–2575. [Google Scholar] [CrossRef]
- Russakovsky, O.; Deng, J.; Su, H.; Krause, J.; Satheesh, S.; Ma, S.; Huang, Z.; Karpathy, A.; Khosla, A.; Bernstein, M.; et al. ImageNet Large Scale Visual Recognition Challenge. Int. J. Comput. Vis. (IJCV) 2015, 115, 211–252. [Google Scholar] [CrossRef]
- Lin, T.Y.; Maire, M.; Belongie, S.; Bourdev, L.; Girshick, R.; Hays, J.; Perona, P.; Ramanan, D.; Zitnick, C.L.; Dollár, P. Microsoft COCO: Common Objects in Context. arXiv 2015, arXiv:1405.0312. [Google Scholar] [CrossRef]
- Sun, P.; Kretzschmar, H.; Dotiwalla, X.; Chouard, A.; Patnaik, V.; Tsui, P.; Guo, J.; Zhou, Y.; Chai, Y.; Caine, B.; et al. Scalability in perception for autonomous driving: Waymo open dataset. In Proceedings of the IEEE/CVF conference on Computer Vision and Pattern Recognition; IEEE: New York, NY, USA, 2020; pp. 2446–2454. [Google Scholar]
- Geiger, A.; Lenz, P.; Stiller, C.; Urtasun, R. Vision meets robotics: The kitti dataset. Int. J. Robot. Res. 2013, 32, 1231–1237. [Google Scholar] [CrossRef]
- Hnewa, M.; Radha, H. Object detection under rainy conditions for autonomous vehicles: A review of state-of-the-art and emerging techniques. IEEE Signal Process. Mag. 2020, 38, 53–67. [Google Scholar] [CrossRef]
- Yu, B.; Chen, Y.; Cao, S.Y.; Shen, H.L.; Li, J. Three-channel infrared imaging for object detection in haze. IEEE Trans. Instrum. Meas. 2022, 71, 5008513. [Google Scholar] [CrossRef]
- Ding, Q.; Li, P.; Yan, X.; Shi, D.; Liang, L.; Wang, W.; Xie, H.; Li, J.; Wei, M. Cf-yolo: Cross fusion yolo for object detection in adverse weather with a high-quality real snow dataset. IEEE Trans. Intell. Transp. Syst. 2023, 24, 10749–10759. [Google Scholar] [CrossRef]
- Onzon, E.; Bömer, M.; Mannan, F.; Heide, F. Neural Exposure Fusion for High-Dynamic Range Object Detection. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition; IEEE: New York, NY, USA, 2024; pp. 17564–17573. [Google Scholar]
- Rashed, H.; Ramzy, M.; Vaquero, V.; El Sallab, A.; Sistu, G.; Yogamani, S. Fusemodnet: Real-time camera and lidar based moving object detection for robust low-light autonomous driving. In Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops; IEEE: New York, NY, USA, 2019. [Google Scholar]
- Chen, W.T.; Huang, Z.K.; Tsai, C.C.; Yang, H.H.; Ding, J.J.; Kuo, S.Y. Learning multiple adverse weather removal via two-stage knowledge learning and multi-contrastive regularization: Toward a unified model. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition; IEEE: New York, NY, USA, 2022; pp. 17653–17662. [Google Scholar]
- Moustakides, G.V.; Basioti, K. Training neural networks for likelihood/density ratio estimation. arXiv 2019, arXiv:1911.00405. [Google Scholar] [CrossRef]
- Zhang, R.; Isola, P.; Efros, A.A.; Shechtman, E.; Wang, O. The unreasonable effectiveness of deep features as a perceptual metric. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition; IEEE: New York, NY, USA, 2018; pp. 586–595. [Google Scholar]
- He, K.; Zhang, X.; Ren, S.; Sun, J. Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition; IEEE: New York, NY, USA, 2016; pp. 770–778. [Google Scholar]
- Yu, F.; Chen, H.; Wang, X.; Xian, W.; Chen, Y.; Liu, F.; Madhavan, V.; Darrell, T. Bdd100k: A diverse driving dataset for heterogeneous multitask learning. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition; IEEE: New York, NY, USA, 2020; pp. 2636–2645. [Google Scholar]
- Caesar, H.; Bankiti, V.; Lang, A.H.; Vora, S.; Liong, V.E.; Xu, Q.; Krishnan, A.; Pan, Y.; Baldan, G.; Beijbom, O. nuscenes: A multimodal dataset for autonomous driving. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition; IEEE: New York, NY, USA, 2020; pp. 11621–11631. [Google Scholar]
- Wang, A.; Chen, H.; Liu, L.; Chen, K.; Lin, Z.; Han, J.; Ding, G. Yolov10: Real-time end-to-end object detection. Adv. Neural Inf. Process. Syst. 2024, 37, 107984–108011. [Google Scholar]
- Jocher, G.; Qiu, J. Ultralytics YOLO11. 2024. Version 11.0.0, AGPL-3.0 License. Available online: https://github.com/ultralytics/ultralytics (accessed on 15 August 2025).
- Tian, Y.; Ye, Q.; Doermann, D. YOLO12: Attention-Centric Real-Time Object Detectors. arXiv 2025, arXiv:2502.12524. [Google Scholar]
- Lv, W.; Xu, S.; Zhao, Y.; Wang, G.; Wei, J.; Cui, C.; Du, Y.; Dang, Q.; Liu, Y. DETRs Beat YOLOs on Real-time Object Detection. arXiv 2023, arXiv:2304.08069. [Google Scholar]












| Class | Original Object Detector (YOLOv5m) [19] | Reference-Based Model | Reference-Free Model |
|---|---|---|---|
| car | 0.643 ± 0.004 | 0.659 ± 0.005 | 0.653 ± 0.004 |
| traffic light | 0.519 ± 0.006 | 0.543 ± 0.006 | 0.540 ± 0.006 |
| traffic sign | 0.535 ± 0.005 | 0.549 ± 0.003 | 0.547 ± 0.004 |
| bicycle | 0.357 ± 0.017 | 0.390 ± 0.035 | 0.380 ± 0.028 |
| pedestrian | 0.440 ± 0.004 | 0.457 ± 0.003 | 0.452 ± 0.003 |
| truck | 0.418 ± 0.006 | 0.437 ± 0.011 | 0.434 ± 0.009 |
| bus | 0.430 ± 0.014 | 0.450 ± 0.017 | 0.441 ± 0.014 |
| motorcycle | 0.277 ± 0.010 | 0.282 ± 0.008 | 0.281 ± 0.007 |
| rider | 0.302 ± 0.017 | 0.311 ± 0.017 | 0.308 ± 0.015 |
| Lens-Flare-Corrupted Images | De-Flare Images [8] | |
|---|---|---|
| AP | 0.641 | 0.680 |
| Model | Inference Time (ms/Image) 1 | Params (M) 2 | MACs (M) 2 |
|---|---|---|---|
| De-flare model [8] (Transformer-based) | 191.44 | 20.45 | 322,758.31 3 |
| De-flare model [13] (CNN-based) | 201.23 | 3.64 | 4,391,958.01 3 |
| Reference-free model | 19.09 | 11.51 | 595.76 |
| Reference-based model | 18.73 | 2.47 | 393.02 |
| Lens Flare Perception | LLR Prediction | AP on Cars | ||
|---|---|---|---|---|
| MSD | LPIPS | MLP | KDE | |
| ✔ 2 | ✔ | 0.646 ± 0.003 | ||
| ✔ | ✔ | 0.648 ± 0.003 | ||
| ✔ 3 | ✔ | 0.647 ± 0.003 | ||
| ✔ | ✔ | 0.659 ± 0.005 4 | ||
| Model | Baseline AP | +Ours (Ref-Based) | +Ours (Ref-Free) |
|---|---|---|---|
| YOLOv10-m [42] | 0.456 ± 0.002 | 0.470 ± 0.007 | 0.470 ± 0.007 |
| YOLOv11-m [43] | 0.467 ± 0.002 | 0.489 ± 0.007 | 0.488 ± 0.006 |
| YOLOv12-m [44] | 0.460 ± 0.001 | 0.480 ± 0.005 | 0.479 ± 0.004 |
| RT-DETR-l [45] | 0.468 ± 0.004 | 0.504 ± 0.010 | 0.504 ± 0.009 |
| Model | Baseline AP | +Ours (Ref-Free) |
|---|---|---|
| YOLOv10-m [42] | 0.659 | 0.668 |
| YOLOv11-m [43] | 0.657 | 0.658 |
| YOLOv12-m [44] | 0.663 | 0.674 |
| RT-DETR-l [45] | 0.679 | 0.705 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Ma, S.; Willems, T.; Ma, W.; Yusuf, M.; Hamme, D.V.; Aelterman, J.; Philips, W. A Reference-Free Lens-Flare-Aware Detector for Autonomous Driving. Sensors 2026, 26, 2359. https://doi.org/10.3390/s26082359
Ma S, Willems T, Ma W, Yusuf M, Hamme DV, Aelterman J, Philips W. A Reference-Free Lens-Flare-Aware Detector for Autonomous Driving. Sensors. 2026; 26(8):2359. https://doi.org/10.3390/s26082359
Chicago/Turabian StyleMa, Shanxing, Tim Willems, Wenwen Ma, Marwan Yusuf, David Van Hamme, Jan Aelterman, and Wilfried Philips. 2026. "A Reference-Free Lens-Flare-Aware Detector for Autonomous Driving" Sensors 26, no. 8: 2359. https://doi.org/10.3390/s26082359
APA StyleMa, S., Willems, T., Ma, W., Yusuf, M., Hamme, D. V., Aelterman, J., & Philips, W. (2026). A Reference-Free Lens-Flare-Aware Detector for Autonomous Driving. Sensors, 26(8), 2359. https://doi.org/10.3390/s26082359

