Explicit Illumination Modeling for Object Detection in Low-Light Environments
Abstract
1. Introduction
- We propose a novel object detection network, termed IADNet, which is designed to enhance the illumination perception capability of detection models. Extensive experiments conducted on multiple low-light datasets demonstrate that IADNet consistently outperforms existing state-of-the-art methods.
- We design an Illumination Modeling Subnetwork (IMS) to explicitly model illumination information in images through contrastive learning. The illumination-aware features captured by the IMS are then used to dynamically modulate the semantic features extracted by the detection backbone, thereby enhancing the illumination perception capability of the detection network.
- We propose a Global Feature Enhancement Module (GFEM) to strengthen the global context modeling capability of the detection network. By enhancing long-range dependency modeling and global semantic representation, GFEM further improves feature representation quality and detection stability under complex illumination conditions.
2. Related Work
2.1. Object Detection
2.2. Low-Light Image Object Detection
2.3. Low-Light Image Enhancement
3. Method
3.1. The Architecture of IADNet
3.2. Illumination Modeling Subnetwork
3.3. Adaptive Weighted Downsampling Layer
3.4. Global Feature Enhancement Module
3.5. Loss Function
4. Experiment
4.1. Implementation Details
4.2. Evaluation Metrics
- TP (True Positives): Number of correctly detected objects (correct class and IoU above the threshold);
- FP (False Positives): Number of incorrectly detected objects (wrong class or IoU below the threshold);
- FN (False Negatives): Number of ground-truth objects missed by the detector.
4.3. Dataset
4.4. Ablation Study
4.5. Experiment Result
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Ren, S.; He, K.; Girshick, R.; Sun, J. Faster R-CNN: Towards real-time object detection with region proposal networks. IEEE Trans. Pattern Anal. Mach. Intell. 2016, 39, 1137–1149. [Google Scholar]
- Wang, C.Y.; Bochkovskiy, A.; Liao, H.Y.M. YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Vancouver, BC, Canada, 17–24 June 2023; pp. 7464–7475. [Google Scholar]
- Wang, C.Y.; Yeh, I.H.; Liao, H.Y.M. YOLOv9: Learning what you want to learn using programmable gradient information. In Proceedings of the European Conference on Computer Vision; Springer: Cham, Switzerland, 2024; pp. 1–21. [Google Scholar]
- Wang, A.; Chen, H.; Liu, L.; Chen, K.; Lin, Z.; Han, J.; Ding, G. YOLOv10: Real-time end-to-end object detection. Adv. Neural Inf. Process. Syst. 2024, 37, 107984–108011. [Google Scholar]
- Lin, T.Y.; Goyal, P.; Girshick, R.; He, K.; Dollár, P. Focal loss for dense object detection. In Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy, 22–29 October 2017; pp. 2980–2988. [Google Scholar]
- Carion, N.; Massa, F.; Synnaeve, G.; Usunier, N.; Kirillov, A.; Zagoruyko, S. End-to-end object detection with transformers. In Proceedings of the European Conference on Computer Vision; Springer: Cham, Switzerland, 2020; pp. 213–229. [Google Scholar]
- Zhang, H.; Li, F.; Liu, S.; Zhang, L.; Su, H.; Zhu, J.; Ni, L.M.; Shum, H.-Y. DINO: DETR with improved denoising anchor boxes for end-to-end object detection. arXiv 2022, arXiv:2203.03605. [Google Scholar]
- Lin, T.Y.; Maire, M.; Belongie, S.; Hays, J.; Perona, P.; Ramanan, D.; Dollár, P.; Zitnick, C.L. Microsoft COCO: Common objects in context. In Proceedings of the Computer Vision-ECCV 2014; Springer: Cham, Switzerland, 2014; pp. 740–755. [Google Scholar]
- Everingham, M.; Van Gool, L.; Williams, C.K.I.; Winn, J.; Zisserman, A. The PASCAL visual object classes (VOC) challenge. Int. J. Comput. Vis. 2010, 88, 303–338. [Google Scholar] [CrossRef]
- Zhang, H.; Wang, Y.; Yang, Y. LL-WSOD: Weakly supervised object detection in low-light. J. Vis. Commun. Image Represent. 2024, 98, 104010. [Google Scholar] [CrossRef]
- Hui, Y.; Wang, J.; Li, B. WSA-YOLO: Weak-supervised and adaptive object detection in the low-light environment for YOLOv7. IEEE Trans. Instrum. Meas. 2024, 73, 1–12. [Google Scholar] [CrossRef]
- Liu, W.; Ren, G.; Yu, R.; Guo, S.; Zhu, J.; Zhang, L. Image-adaptive YOLO for object detection in adverse weather conditions. Proc. AAAI Conf. Artif. Intell. 2022, 36, 1792–1800. [Google Scholar] [CrossRef]
- Kalwar, S.; Patel, D.; Aanegola, A.; Konda, K.R.; Garg, S.; Krishna, K.M. GDIP: Gated differentiable image processing for object detection in adverse conditions. In Proceedings of the 2023 IEEE International Conference on Robotics and Automation (ICRA); IEEE: New York, NY, USA, 2023; pp. 7083–7089. [Google Scholar]
- Zhang, H.; Yu, H.; Tao, Y.; Zhu, W.; Zhang, K. Improvement of ship target detection algorithm for YOLOv7-tiny. IET Image Process. 2024, 18, 1710–1718. [Google Scholar] [CrossRef]
- Dong, P.; Wang, Y.; Yu, Q.; Feng, W.; Zong, G. AMC-YOLO: Improved YOLOv8-based defect detection for cigarette packs. IET Image Process. 2024, 18, 4873–4886. [Google Scholar] [CrossRef]
- Zhang, D.; Fang, W.; Liu, Y.; Lyu, Z.; Xiong, C.; Wang, Z. Two-stage video anomaly detection based on dual-stream networks and multi-instance learning. IET Image Process. 2024, 18, 4843–4851. [Google Scholar] [CrossRef]
- Girshick, R. Fast R-CNN. In Proceedings of the IEEE International Conference on Computer Vision, Santiago, Chile, 7–13 December 2015; pp. 1440–1448. [Google Scholar]
- He, K.; Gkioxari, G.; Dollár, P.; Girshick, R. Mask R-CNN. In Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy, 22–29 October 2017; pp. 2961–2969. [Google Scholar]
- Cai, Y.; Bian, H.; Lin, J.; Wang, H.; Timofte, R.; Zhang, Y. Retinexformer: One-stage Retinex-based transformer for low-light image enhancement. In Proceedings of the IEEE/CVF International Conference on Computer Vision, Paris, France, 1–6 October 2023; pp. 12504–12513. [Google Scholar]
- Li, C.; Guo, C.; Chen, C.L. Learning to enhance low-light image via zero-reference deep curve estimation. IEEE Trans. Pattern Anal. Mach. Intell. 2021, 44, 4225–4238. [Google Scholar] [CrossRef]
- Jia, X.; Zhu, C.; Li, M.; Tang, W.; Zhou, W. LLVIP: A visible-infrared paired dataset for low-light vision. In Proceedings of the IEEE/CVF International Conference on Computer Vision, Montreal, QC, Canada, 11–17 October 2021; pp. 3496–3504. [Google Scholar]
- Li, X.; Li, Z.; Zhou, L.; Huang, Z. FOLD: Low-Level Image Enhancement for Low-Light Object Detection Based on FPGA MPSoC. Electronics 2024, 13, 230. [Google Scholar]
- Li, J.; Wang, X.; Chang, Q.; Wang, Y.; Chen, H. Research on low-light environment object detection algorithm based on YOLO_GD. Electronics 2024, 13, 3527. [Google Scholar] [CrossRef]
- Land, E.H. The Retinex theory of color vision. Sci. Am. 1977, 237, 108–129. [Google Scholar] [CrossRef]
- Jobson, D.J.; Rahman, Z.; Woodell, G.A. Properties and performance of a center/surround Retinex. IEEE Trans. Image Process. 1997, 6, 451–462. [Google Scholar] [CrossRef]
- Jobson, D.J.; Rahman, Z.; Woodell, G.A. A multiscale Retinex for bridging the gap between color images and the human observation of scenes. IEEE Trans. Image Process. 1997, 6, 965–976. [Google Scholar] [CrossRef]
- Kimmel, R.; Elad, M.; Shaked, D.; Keshet, R.; Sobel, I. A variational framework for Retinex. Int. J. Comput. Vis. 2003, 52, 7–23. [Google Scholar] [CrossRef]
- Ren, W.; Liu, S.; Ma, L.; Xu, Q.; Xu, X.; Cao, X.; Du, J.; Yang, M.-H. Low-light image enhancement via a deep hybrid network. IEEE Trans. Image Process. 2019, 28, 4364–4375. [Google Scholar] [CrossRef]
- Guo, C.; Li, C.; Guo, J.; Loy, C.C.; Hou, J.; Kwong, S.; Cong, R. Zero-reference deep curve estimation for low-light image enhancement. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA, 13–19 June 2020; pp. 1780–1789. [Google Scholar]
- Xu, X.; Wang, R.; Lu, J. Low-light image enhancement via structure modeling and guidance. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Vancouver, BC, Canada, 17–24 June 2023; pp. 9893–9903. [Google Scholar]
- Khanam, R.; Hussain, M. YOLOv11: An overview of the key architectural enhancements. arXiv 2024, arXiv:2410.17725. [Google Scholar] [CrossRef]
- Chen, L.; Fu, Y.; Wei, K.; Zheng, D.; Heide, F. Instance segmentation in the dark. Int. J. Comput. Vis. 2023, 131, 2198–2218. [Google Scholar] [CrossRef]
- Zamir, S.W.; Arora, A.; Khan, S.; Hayat, M.; Khan, F.S.; Yang, M.-H. Restormer: Efficient transformer for high-resolution image restoration. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, LA, USA, 18–24 June 2022; pp. 5728–5739. [Google Scholar]
- Loh, Y.P.; Chan, C.S. Getting to know low-light images with the exclusively dark dataset. Comput. Vis. Image Underst. 2019, 178, 30–42. [Google Scholar] [CrossRef]
- Yang, W.; Yuan, Y.; Ren, W.; Liu, J.; Scheirer, W.J.; Wang, Z.; Zhang, T.; Zhong, Q.; Xie, D.; Pu, S.; et al. Advancing image understanding in poor visibility environments: A collective benchmark study. IEEE Trans. Image Process. 2020, 29, 5737–5752. [Google Scholar] [CrossRef] [PubMed]
- Qin, Q.; Chang, K.; Huang, M.; Li, G. DENet: Detection-driven enhancement network for object detection under adverse weather conditions. In Proceedings of the Asian Conference on Computer Vision, Macao, China, 4–8 December 2022; pp. 2813–2829. [Google Scholar]
- Cui, Z.; Qi, G.J.; Gu, L.; You, S.; Zhang, Z.; Harada, T. Multitask AET with orthogonal tangent regularity for dark object detection. In Proceedings of the IEEE/CVF International Conference on Computer Vision, Montreal, QC, Canada, 11–17 October 2021; pp. 2553–2562. [Google Scholar]
- Hong, M.; Cheng, S.; Huang, H.; Fan, H.; Liu, S. You only look around: Learning illumination-invariant feature for low-light object detection. Adv. Neural Inf. Process. Syst. 2024, 37, 87136–87158. [Google Scholar]
- Yin, X.; Yu, Z.; Fei, Z.; Lv, W.; Gao, X. PE-YOLO: Pyramid enhancement network for dark object detection. In Proceedings of the International Conference on Artificial Neural Networks; Springer: Cham, Switzerland, 2023; pp. 163–174. [Google Scholar]

















| Experimental Environment and Hyperparameters | Version/Value |
|---|---|
| Operating System | Ubuntu 20.04.6 |
| CPU | 13th Gen Intel® Core™ i9-13900K |
| GPU | NVIDIA GeForce RTX 4090 24 GB |
| Python | 3.9.18 |
| PyTorch | 2.0.1 |
| CUDA | 11.8 |
| Learning Rate Decay Strategy | Cosine Annealing |
| Optimizer | SGD |
| Batch Size | 16 |
| Initial Learning Rate | 0.02 |
| Number of Training Epochs | 100 |
| Baseline | IMS | GFEM | mAP@50/% | mAP@50:95/% | Params/M | GFLOPs | Inference Time (ms) |
|---|---|---|---|---|---|---|---|
| √ | 76.9 | 52.6 | 20.1 | 68.0 | 7.7 | ||
| √ | √ | 79.1 | 54.9 | 20.6 | 72.5 | 8.2 | |
| √ | √ | 78.2 | 53.8 | 21.2 | 68.5 | 8.0 | |
| √ | √ | √ | 80.3 | 56.0 | 21.7 | 73.0 | 8.6 |
| IMS | mAP@50/% | Params/M | GFLOPs | |
|---|---|---|---|---|
| Conv | AWD | |||
| √ | 78.5 | 22.7 | 71.4 | |
| √ | 79.1 | 20.6 | 72.5 | |
| λdet | λctr | mAP@50/% |
|---|---|---|
| 0.2 | 0.1 | 80.3 |
| 0.5 | 0.1 | 79.5 |
| 0.2 | 0.3 | 79.9 |
| 0.1 | 0.1 | 79.4 |
| 0.4 | 0.2 | 80.1 |
| Model | Bicycle | Boat | Bottle | Bus | Car | Cat | Chair | Cup | Dog | Motorbike | People | Table | Total |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| YOLOv9m [3] | 85.6 | 66.8 | 72.0 | 89.7 | 85.8 | 83.0 | 70.1 | 71.5 | 84.7 | 78.6 | 78.3 | 63.7 | 77.5 |
| YOLOv10m [4] | 85.8 | 66.9 | 72.0 | 89.0 | 81.4 | 81.1 | 68.1 | 69.7 | 82.9 | 77.0 | 76.4 | 60.9 | 75.9 |
| YOLOv11m [31] | 80.5 | 68.2 | 72.5 | 88.4 | 82.6 | 80.9 | 70.3 | 75.1 | 79.3 | 77.2 | 80.1 | 67.7 | 76.9 |
| IA-YOLO [12] | 79.1 | 65.8 | 70.4 | 85.6 | 80.2 | 78.3 | 68.5 | 72.6 | 76.9 | 74.8 | 77.5 | 62.9 | 74.3 |
| GDIP [13] | 80.2 | 66.9 | 71.7 | 86.4 | 81.1 | 79.2 | 69.8 | 73.5 | 77.8 | 75.6 | 78.4 | 66.6 | 75.6 |
| DETR [6] | 78.5 | 66.1 | 70.3 | 85.9 | 80.6 | 78.6 | 68.2 | 72.9 | 76.9 | 75.2 | 77.6 | 65.6 | 74.7 |
| DE-YOLO [36] | 80.4 | 79.7 | 77.9 | 91.2 | 82.7 | 72.8 | 69.9 | 80.1 | 77.2 | 76.7 | 82.0 | 57.2 | 77.3 |
| MAET [37] | 83.1 | 78.5 | 75.6 | 92.9 | 83.1 | 73.4 | 71.3 | 79.0 | 79.8 | 77.2 | 81.1 | 57.0 | 77.7 |
| Zero-DCE [29] | 84.1 | 77.6 | 78.3 | 93.1 | 83.7 | 70.3 | 69.8 | 77.6 | 77.4 | 76.3 | 81.0 | 53.6 | 76.9 |
| YOLA [38] | 83.9 | 78.7 | 75.3 | 88.8 | 79.0 | 73.4 | 69.9 | 71.9 | 86.8 | 66.3 | 78.3 | 49.8 | 75.2 |
| PE-YOLO [39] | 84.7 | 79.2 | 79.3 | 92.5 | 83.9 | 71.5 | 71.7 | 79.7 | 79.7 | 77.3 | 81.8 | 55.3 | 78.0 |
| Ours | 87.3 | 73.3 | 73.9 | 91.8 | 83.9 | 84.7 | 73.6 | 76.2 | 87.5 | 79.7 | 82.1 | 71.0 | 80.3 |
| Model | Precision/% | Recall/% | mAP@50/% | mAP@50:95/% |
|---|---|---|---|---|
| YOLOv9m | 88.8 | 75.8 | 85.2 | 53.5 |
| YOLOv10m | 88.1 | 76.0 | 85.1 | 54.2 |
| YOLOv11m | 89.2 | 77.4 | 85.6 | 54.2 |
| IA-YOLO | 87.4 | 75.0 | 84.3 | 52.7 |
| GDIP | 87.8 | 75.5 | 84.8 | 53.1 |
| DETR | 86.9 | 74.6 | 83.9 | 52.4 |
| DE-YOLO | 88.7 | 76.6 | 85.5 | 54.0 |
| MAET | 88.9 | 77.0 | 86.0 | 54.4 |
| Zero-DCE | 87.6 | 76.2 | 84.9 | 52.8 |
| YOLA | 88.4 | 76.4 | 85.0 | 53.2 |
| PE-YOLO | 89.0 | 77.8 | 86.3 | 55.0 |
| Ours | 92.3 | 82.0 | 89.3 | 57.8 |
| Model | Precision/% | Recall/% | mAP@50/% | mAP@50:95/% |
|---|---|---|---|---|
| YOLOv9m | 65.8 | 54.1 | 52.5 | 30.4 |
| YOLOv10m | 66.1 | 54.4 | 52.8 | 30.2 |
| YOLOv11m | 66.7 | 55.0 | 53.2 | 30.8 |
| IA-YOLO | 65.2 | 53.5 | 51.8 | 30.1 |
| GDIP | 65.5 | 53.9 | 52.3 | 31.0 |
| DETR | 65.1 | 53.6 | 51.9 | 29.8 |
| DE-YOLO | 66.0 | 54.2 | 52.6 | 30.2 |
| MAET | 65.9 | 54.6 | 52.6 | 29.5 |
| Zero-DCE | 65.4 | 53.8 | 51.8 | 27.3 |
| YOLA | 66.2 | 54.5 | 52.7 | 30.2 |
| PE-YOLO | 66.4 | 54.8 | 52.9 | 30.5 |
| Ours | 68.3 | 56.5 | 54.6 | 31.2 |
| Model | Person | Car | Bus | Bicycle | Motorbike | Total |
|---|---|---|---|---|---|---|
| YOLOv9m | 89.3 | 91.5 | 88.7 | 90.9 | 90.1 | 90.1 |
| YOLOv10m | 87.3 | 89.5 | 86.8 | 90.2 | 89.2 | 88.6 |
| YOLOv11m | 86.2 | 88.7 | 85.9 | 89.3 | 87.4 | 87.5 |
| IA-YOLO | 84.1 | 86.5 | 83.8 | 87.2 | 84.9 | 85.3 |
| GDIP | 84.8 | 87.1 | 84.6 | 87.9 | 86.1 | 86.1 |
| DE-YOLO | 89.2 | 91.8 | 90.5 | 88.6 | 93.4 | 90.7 |
| MAET | 89.5 | 91.3 | 88.7 | 92.1 | 89.4 | 90.2 |
| Zero-DCE | 88.5 | 90.3 | 87.2 | 92.7 | 89.3 | 89.6 |
| Ours | 90.1 | 93.2 | 89.7 | 92.8 | 92.1 | 91.6 |
| Model | Person | Car | Bus | Bicycle | Motorbike | Total |
|---|---|---|---|---|---|---|
| YOLOv9m | 90.5 | 92.1 | 91.3 | 92.8 | 92.3 | 91.8 |
| YOLOv10m | 90.2 | 92.5 | 89.8 | 93.1 | 91.4 | 91.4 |
| YOLOv11m | 91.3 | 93.7 | 90.8 | 94.2 | 92.5 | 92.5 |
| Ours | 91.9 | 94.8 | 91.9 | 93.9 | 92.6 | 93.0 |
| Model | Params/M | GFLOPs | Inference Time (ms) |
|---|---|---|---|
| YOLOv9m | 20.1 | 76.8 | 9.3 |
| YOLOv10m | 16.5 | 59.1 | 7.8 |
| YOLOv11m | 20.1 | 68.0 | 7.7 |
| Ours | 21.7 | 73.0 | 8.6 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Cao, W.; Yang, P.; Lyu, W. Explicit Illumination Modeling for Object Detection in Low-Light Environments. Electronics 2026, 15, 2057. https://doi.org/10.3390/electronics15102057
Cao W, Yang P, Lyu W. Explicit Illumination Modeling for Object Detection in Low-Light Environments. Electronics. 2026; 15(10):2057. https://doi.org/10.3390/electronics15102057
Chicago/Turabian StyleCao, Wenkang, Peng Yang, and Wensheng Lyu. 2026. "Explicit Illumination Modeling for Object Detection in Low-Light Environments" Electronics 15, no. 10: 2057. https://doi.org/10.3390/electronics15102057
APA StyleCao, W., Yang, P., & Lyu, W. (2026). Explicit Illumination Modeling for Object Detection in Low-Light Environments. Electronics, 15(10), 2057. https://doi.org/10.3390/electronics15102057

