Zero-3DCE: A Low-Light Video Enhancement for More Robust Computer Vision Tasks
Abstract
1. Introduction
- Extend Zero-DCE to multi-frame (video) enhancement via separable 3D convolutions.
- Introduce edge-aware and MS-SSIM loss functions for structure and contrast preservation.
- Demonstrate that zero-reference learning can match supervised models while achieving real-time inference.
- Integrate Zero-3DCE with YOLOv11 to enable robust nighttime object and crime detection.
2. Related Work
2.1. Image Enhancers
2.2. Video Enhancers
3. Materials and Methods
3.1. Low-Light Enhancement
3.1.1. 3DCE Network
3.1.2. Spatial Attention Module
3.1.3. Zero-Reference Loss Functions
Color Constancy Loss
Spatial Consistency Loss
Exposure Control Loss
Illumination Smoothness Loss
Edge Detection Loss
MS-SSIM Loss
3.2. Detection
4. Results
4.1. Training Configuration
4.2. Ablation Study
4.2.1. Edge Detection Loss
4.2.2. MS-SSIM Loss
4.2.3. Spatial Attention
4.2.4. Separable Convolution
4.3. Enhancement Results
| Model | PSNR (↑) | SSIM (↑) | LPIPS (↓) | NIQE (↓) | BRISQUE (↓) |
|---|---|---|---|---|---|
| EnlightenGAN [16] | 17.5557 | 0.66567 | 0.3878 | 4.5814 | 10.3452 |
| RUAS [37] | 9.5754 | 0.3194 | 0.5452 | 6.6551 | 27.4994 |
| Zero-DiDCE [41] | 17.9406 | 0.5841 | 0.3782 | 7.8945 | 28.9683 |
| NightEnhancement [37] | 21.5212 | 0.7629 | 0.3595 | 4.4899 | 27.1202 |
| Zero-DCE [8] | 14.8607 | 0.5624 | 0.3852 | 7.7657 | 27.4029 |
| Zero-3DCE | 18.2723 | 0.6676 | 0.3687 | 3.5057 | 13.6800 |
| Model | Runtimes in Seconds |
|---|---|
| EnlightenGAN | 0.2692 |
| GSAD | 2.6917 |
| RUAS | 0.0838 |
| KinD++ | 3.6136 |
| KANT | 55.7471 |
| Zero-DiDCE | 0.2291 |
| Night Enhancement | 0.4520 |
| Zero-3DCE | 0.0621 |
5. Discussion
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
Abbreviations
| CV | Computer Vision |
| GAN | Generative Adversarial Network |
| LLIE | Low-Light Image Enhancement |
| LLVE | Low-Light Video Enhancement |
References
- Cai, Y.; Bian, H.; Lin, J.; Wang, H.; Timofte, R.; Zhang, Y. Retinexformer: One-stage retinex-based transformer for low-light image enhancement. In Proceedings of the IEEE/CVF International Conference on Computer Vision, Paris, France, 1–6 October 2023; pp. 12504–12513. [Google Scholar]
- Liu, Y.; Huang, T.; Dong, W.; Wu, F.; Li, X.; Shi, G. Low-light image enhancement with multi-stage residue quantization and brightness-aware attention. In Proceedings of the IEEE/CVF International Conference on Computer Vision, Paris, France, 1–6 October 2023; pp. 12140–12149. [Google Scholar]
- Xu, X.; Wang, R.; Lu, J. Low-light image enhancement via structure modeling and guidance. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Vancouver, BC, Canada, 17–24 June 2023; pp. 9893–9903. [Google Scholar]
- Liu, F.; Fan, L. A review of advancements in low-light image enhancement using deep learning. arXiv 2025, arXiv:2505.0575. [Google Scholar] [CrossRef]
- Yu, C.; Han, G.; Pan, M.; Wu, X.; Deng, A. Zero-TCE: Zero Reference Tri-Curve Enhancement for Low-Light Images. Appl. Sci. 2025, 15, 701. [Google Scholar] [CrossRef]
- He, J.; Xue, M.; Ning, A.; Song, C. Zero-reference lighting estimation diffusion model for low-light image enhancement. arXiv 2024, arXiv:2403.02879. [Google Scholar]
- Khanam, R.; Hussain, M. Yolov11: An overview of the key architectural enhancements. arXiv 2024, arXiv:2410.17725. [Google Scholar] [CrossRef]
- Guo, C.; Li, C.; Guo, J.; Loy, C.C.; Hou, J.; Kwong, S.; Cong, R. Zero-reference deep curve estimation for low-light image enhancement. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Washington, DC, USA, 14–19 June 2020; pp. 1780–1789. [Google Scholar]
- Lv, F.; Lu, F.; Wu, J.; Lim, C. MBLLEN: Low-Light Image/Video Enhancement Using CNNs. In Proceedings of the BMVC, Newcastle, UK, 3–6 September 2018; p. 4. [Google Scholar]
- Guo, X.; Hu, Q. Low-light image enhancement via breaking down the darkness. Int. J. Comput. Vis. 2023, 131, 48–66. [Google Scholar] [CrossRef]
- Baidoo, E.; Kwesi Kontoh, A. Implementation of gray level image transformation techniques. Int. J. Mod. Educ. Comput. Sci. 2018, 10, 44. [Google Scholar] [CrossRef][Green Version]
- Jebadass, J.R.; Balasubramaniam, P. Low light enhancement algorithm for color images using intuitionistic fuzzy sets with histogram equalization. Multimed. Tools Appl. 2022, 81, 8093–8106. [Google Scholar] [CrossRef]
- Chen, Y.; Wen, C.; Liu, W.; He, W. A depth iterative illumination estimation network for low-light image enhancement based on retinex theory. Sci. Rep. 2023, 13, 19709. [Google Scholar] [CrossRef]
- Zhang, Y.; Zhang, J.; Guo, X. Kindling the darkness: A practical low-light image enhancer. In Proceedings of the 27th ACM International Conference on Multimedia, Nice, France, 21–25 October 2019; pp. 1632–1640. [Google Scholar]
- Lore, K.G.; Akintayo, A.; Sarkar, S. LLNet: A deep autoencoder approach to natural low-light image enhancement. Pattern Recognit. 2017, 61, 650–662. [Google Scholar] [CrossRef]
- Jiang, Y.; Gong, X.; Liu, D.; Cheng, Y.; Fang, C.; Shen, X.; Yang, J.; Zhou, P.; Wang, Z. Enlightengan: Deep light enhancement without paired supervision. IEEE Trans. Image Process. 2021, 30, 2340–2349. [Google Scholar] [CrossRef]
- Zhang, J.; Li, H.; Huo, Z. Unsupervised Boosted Fusion Network for Single Low-light Image Enhancement; IEEE Access: Piscataway, NJ, USA, 2024. [Google Scholar]
- Ming, F.; Wei, Z.; Zhang, J. Unsupervised low-light image enhancement in the fourier transform domain. Appl. Sci. 2023, 14, 332. [Google Scholar] [CrossRef]
- Chen, J.; Wang, Y.; Han, Y. A semi-supervised network framework for low-light image enhancement. Eng. Appl. Artif. Intell. 2023, 126, 107003. [Google Scholar] [CrossRef]
- Jiang, N.; Cao, Y.; Zhang, X.-Y.; Wang, D.-H.; He, Y.; Wang, C.; Zhu, S. Low-light image enhancement with quality-oriented pseudo labels via semi-supervised contrastive learning. Expert Syst. Appl. 2025, 276, 127106. [Google Scholar] [CrossRef]
- Li, C.; Guo, C.; Loy, C.C. Learning to enhance low-light image via zero-reference deep curve estimation. IEEE Trans. Pattern Anal. Mach. Intell. 2021, 44, 4225–4238. [Google Scholar] [CrossRef]
- Zhang, L.; Zhang, L.; Liu, X.; Shen, Y.; Zhang, S.; Zhao, S. Zero-shot restoration of back-lit images using deep internal learning. In Proceedings of the 27th ACM International Conference on Multimedia, Nice, France, 21–25 October 2019; pp. 1623–1631. [Google Scholar]
- Zhu, L.; Yang, W.; Chen, B.; Zhu, H.; Meng, X.; Wang, S. Temporally consistent enhancement of low-light videos via spatial-temporal compatible learning. Int. J. Comput. Vis. 2024, 132, 4703–4723. [Google Scholar] [CrossRef]
- Liu, S.; Li, X.; Zhou, Z.; Guo, B.; Zhang, M.; Shen, H.; Yu, Z. AdaEnlight: Energy-aware low-light video stream enhancement on mobile devices. Proc. ACM Interact. Mob. Wearable Ubiquitous Technol. 2023, 6, 1–26. [Google Scholar] [CrossRef]
- Zhang, G.; Zhang, Y.; Yuan, X.; Fu, Y. Binarized low-light raw video enhancement. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Washington, DC, USA, 17–21 June 2024; pp. 25753–25762. [Google Scholar]
- Wang, Z.; Simoncelli, E.P.; Bovik, A.C. Multiscale structural similarity for image quality assessment. In The Thrity-Seventh Asilomar Conference on Signals, Systems & Computers; IEEE: Piscataway, NJ, USA, 2003; pp. 1398–1402. [Google Scholar]
- Gandhi, V.; Gandhi, S. Fine-Tuning Without Forgetting: Adaptation of YOLOv8 Preserves COCO Performance. arXiv 2025, arXiv:2505.01016. [Google Scholar] [CrossRef]
- Rafi, A.N.Y.; Yusuf, M. Improving Vehicle Detection in Challenging Datasets: YOLOv5s and Frozen Layers Analysis. Int. J. Inform. Comput. 2023, 5, 31–45. [Google Scholar]
- Lee, C.; Lee, C.; Kim, C.-S. Contrast enhancement based on layered difference representation of 2D histograms. IEEE Trans. Image Process. 2013, 22, 5372–5384. [Google Scholar] [CrossRef]
- Guo, X.; Li, Y.; Ling, H. LIME: Low-light image enhancement via illumination map estimation. IEEE Trans. Image Process. 2016, 26, 982–993. [Google Scholar] [CrossRef]
- Wei, C.; Wang, W.; Yang, W.; Liu, J. Deep retinex decomposition for low-light enhancement. arXiv 2018, arXiv:1808.04560. [Google Scholar] [CrossRef]
- Zheng, S.; Ma, Y.; Pan, J.; Lu, C.; Gupta, G. Low-light image and video enhancement: A comprehensive survey and beyond. arXiv 2022, arXiv:2212.10772. [Google Scholar]
- Li, C.; Guo, C.; Han, L.; Jiang, J.; Cheng, M.-M.; Gu, J.; Loy, C.C. Low-light image and video enhancement using deep learning: A survey. IEEE Trans. Pattern Anal. Mach. Intell. 2021, 44, 9396–9416. [Google Scholar] [CrossRef] [PubMed]
- Anantrasirichai, N.; Lin, R.; Malyugina, A.; Bull, D. BVI-Lowlight: Fully Registered Benchmark Dataset for Low-Light Video Enhancement. arXiv 2024, arXiv:2402.01970. [Google Scholar]
- Tu, Z.; Liu, Y.; Zhang, Y.; Mu, Q.; Yuan, J. DTCM: Joint optimization of dark enhancement and action recognition in videos. IEEE Trans. Image Process. 2023, 32, 3507–3520. [Google Scholar] [CrossRef] [PubMed]
- Yang, W.; Wang, S.; Fang, Y.; Wang, Y.; Liu, J. From fidelity to perceptual quality: A semi-supervised approach for low-light image enhancement. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Washington, DC, USA, 14–19 June 2020; pp. 3063–3072. [Google Scholar]
- Liu, R.; Ma, L.; Zhang, J.; Fan, X.; Luo, Z. Retinex-inspired unrolling with cooperative prior architecture search for low-light image enhancement. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Nashville, TN, USA, 20–25 June 2021; pp. 10561–10570. [Google Scholar]
- Jin, Y.; Yang, W.; Tan, R.T. Unsupervised night image enhancement: When layer decomposition meets light-effects suppression. In Proceedings of the European Conference on Computer Vision, Tel Aviv, Israel, 23–27 October 2022; pp. 404–421. [Google Scholar]
- Hou, J.; Zhu, Z.; Hou, J.; Liu, H.; Zeng, H.; Yuan, H. Global structure-aware diffusion process for low-light image enhancement. Adv. Neural Inf. Process. Syst. 2024, 36, 79734–79747. [Google Scholar]
- Brateanu, A.; Balmez, R.; Orhei, C.; Ancuti, C.; Ancuti, C. Enhancing Low-Light Images with Kolmogorov–Arnold Networks in Transformer Attention. Sensors 2025, 25, 327. [Google Scholar] [CrossRef] [PubMed]
- Mi, A.; Luo, W.; Qiao, Y.; Huo, Z. Rethinking zero-DCE for low-light image enhancement. Neural Process. Lett. 2024, 56, 93. [Google Scholar] [CrossRef]
- Verner, K. Human Detection Dataset CCTV Footage of Humans. Available online: https://www.kaggle.com/datasets/constantinwerner/human-detection-dataset (accessed on 10 September 2024).
- Sharma, A. Weapon Detection Dataset Weapon Detection Including KNIFE, Gun, Pistol etc. Available online: https://www.kaggle.com/datasets/ankan1998/weapon-detection-dataset (accessed on 10 September 2024).
- School. Person, Weapon Datasets Dataset. Available online: https://universe.roboflow.com/school-fin7c/person-weapon-datasets (accessed on 10 September 2024).
- Yang, W.; Yuan, Y.; Ren, W.; Liu, J.; Scheirer, W.J.; Wang, Z.; Zhang, T.; Zhong, Q.; Xie, D.; Pu, S. Advancing image understanding in poor visibility environments: A collective benchmark study. IEEE Trans. Image Process. 2020, 29, 5737–5752. [Google Scholar] [CrossRef]










| Model | Metrics | ||||
|---|---|---|---|---|---|
| Precision | Recall | F1 Score | mAP@0.5 | mAP@0.5:0.95 | |
| Dark | 0.9509 | 0.6301 | 0.7580 | 0.7990 | 0.6778 |
| Zero-3DCE | 0.9560 | 0.8401 | 0.8943 | 0.9078 | 0.7897 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Tatana, M.M.; Maswanganyi, R.C.; Khumalo, P. Zero-3DCE: A Low-Light Video Enhancement for More Robust Computer Vision Tasks. Algorithms 2025, 18, 775. https://doi.org/10.3390/a18120775
Tatana MM, Maswanganyi RC, Khumalo P. Zero-3DCE: A Low-Light Video Enhancement for More Robust Computer Vision Tasks. Algorithms. 2025; 18(12):775. https://doi.org/10.3390/a18120775
Chicago/Turabian StyleTatana, Mpilo Mbulelo, Rito Clifford Maswanganyi, and Philani Khumalo. 2025. "Zero-3DCE: A Low-Light Video Enhancement for More Robust Computer Vision Tasks" Algorithms 18, no. 12: 775. https://doi.org/10.3390/a18120775
APA StyleTatana, M. M., Maswanganyi, R. C., & Khumalo, P. (2025). Zero-3DCE: A Low-Light Video Enhancement for More Robust Computer Vision Tasks. Algorithms, 18(12), 775. https://doi.org/10.3390/a18120775

