A Two-Level Illumination Correction Network for Digital Meter Reading Recognition in Non-Uniform Low-Light Conditions
Abstract
1. Introduction
- We design an Adaptive Illumination Correction Module (AICM) based on Retinex theory to address the issue of reading blind spots caused by non-uniform illumination. Different from conventional image enhancement methods, this module adaptively decouples the illumination and reflectance components, effectively eliminating visual blind spots without introducing artifacts, thereby achieving pixel-level illumination correction for the input data.
- To address the inevitable amplification of background noise during illumination enhancement, we construct an Illumination-invariant Feature Perception Module (IFPM). Leveraging channel attention and spatial attention as feature filters, IFPM dynamically suppresses non-semantic background noise and guides the network to focus on digital semantic features, thereby achieving the synergy between illumination recovery and illumination-invariant feature extraction.
- Building upon these customized modules, we propose a coarse-to-fine cascaded perception framework (CFCP) for DMRR in non-uniform low-light environments. This framework deeply integrates the coarse localization of YOLOv10 with fine-grained text segmentation and recognition. By employing a multi-stage processing strategy, it achieves efficient noise suppression while simultaneously performing illumination recovery.
2. Related Work
2.1. Digital Meter Reading Recognition Methods
2.2. Low-Light Image Enhancement
3. Methodology
3.1. Overview of the CFCP Framework
3.2. Illumination-Aware Cascaded Detection
3.2.1. Coarse Detection
3.2.2. Adaptive Illumination Correction Module
3.2.3. Illumination-Invariant Feature Extraction and Fusion
3.2.4. Prediction Head and Joint Optimization Objective
3.3. Reading Recognition
4. Experiments
4.1. Dataset
4.1.1. Data Collection
4.1.2. Data Augmentation
4.2. Experimental Settings
4.2.1. Evaluation Metrics
4.2.2. Training Details
4.3. Comparisons with the State-of-the-Arts
4.4. Ablation Experiments
4.4.1. Effectiveness of AICM
4.4.2. Effectiveness of IFPM
4.4.3. Effectiveness of Recognition Network
4.4.4. Effectiveness of Parameter
4.5. Convergence and Overfitting Analysis
4.6. Failure Cases
5. Discussion
6. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
Abbreviations
| DMRR | Digital Meter Reading Recognition |
| CFCP | Coarse-to-Fine Cascaded Perception |
| TLICN | Two-Level Illumination Correction Network |
| AICM | Adaptive Illumination Correction Module |
| IFPM | Illumination-invariant Feature Perception Module |
| SHN | Stacked Hourglass Network |
| LLIE | Low-Light Image Enhancement |
| HE | Histogram Equalization |
| RoI | regions of interest |
| GAP | Global Average Pooling |
| CAM | Channel Attention Module |
| SAM | Spatial Attention Module |
| MLP | Multi-Layer Perceptron |
| DB | Differentiable Binarization |
| OHEM | Online Hard Example Mining |
| BCE | Binary Cross-Entropy |
| CTC | Connectionist Temporal Classification |
| SAR | Show, Attend, and Read |
| MGV | Mean Gray Value |
| BiFPN | Bidirectional Feature Pyramid Network |
References
- Barbosa, C.R.H.; Sousa, M.C.; Almeida, M.F.L.; Calili, R.F. Smart manufacturing and digitalization of metrology: A systematic literature review and a research agenda. Sensors 2022, 22, 6114. [Google Scholar] [CrossRef]
- Imran, M.; Anwar, H.; Tufail, M.; Khan, A.; Khan, M.; Ramli, D.A. Image-based automatic energy meter reading using deep learning. Comput. Mater. Contin. 2023, 74, 203–216. [Google Scholar] [CrossRef]
- Zheng, C.; Wang, S.; Zhang, Y.; Zhang, P.; Zhao, Y. A robust and automatic recognition system of analog instruments in power system by using computer vision. Measurement 2016, 92, 413–420. [Google Scholar] [CrossRef]
- Elrefaei, L.A.; Bajaber, A.; Natheir, S.; AbuSanab, N.; Bazi, M. Automatic electricity meter reading based on image processing. In Proceedings of the 2015 IEEE Jordan Conference on Applied Electrical Engineering and Computing Technologies (AEECT); IEEE: Piscataway, NJ, USA, 2015; pp. 1–5. [Google Scholar]
- Rosyady, P.A.; Wijaya, M.A.P. AI-Powered Monitoring System for Analog kWh Meters Using Digital Image Processing and OCR. In Proceedings of the 2025 1st International Conference on Artificial Intelligence Technology (ICoAIT); IEEE: Piscataway, NJ, USA, 2025; pp. 255–258. [Google Scholar]
- Lin, W.; Zhao, Z.; Tao, J.; Lian, C.; Zhang, C. Research on digital meter reading method of inspection robot based on deep learning. Appl. Sci. 2023, 13, 7146. [Google Scholar] [CrossRef]
- Sun, J.; Huang, Z.; Zhang, Y. A novel automatic reading method of pointer meters based on deep learning. Neural Comput. Appl. 2023, 35, 8357–8370. [Google Scholar] [CrossRef]
- Mai, X.; Li, W.; Huang, Y.; Yang, Y. An automatic meter reading method based on one-dimensional measuring curve mapping. In Proceedings of the 2018 IEEE International Conference of Intelligent Robotic and Control Engineering (IRCE); IEEE: Piscataway, NJ, USA, 2018; pp. 69–73. [Google Scholar]
- Salomon, G.; Laroca, R.; Menotti, D. Image-based automatic dial meter reading in unconstrained scenarios. Measurement 2022, 204, 112025. [Google Scholar] [CrossRef]
- Wang, C.Y.; Bochkovskiy, A.; Liao, H.Y.M. YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Vancouver, BC, Canada, 17–24 June 2023; pp. 7464–7475. [Google Scholar]
- Varghese, R.; Sambath, M. Yolov8: A novel object detection algorithm with enhanced performance and robustness. In Proceedings of the 2024 International Conference on Advances in Data Engineering and Intelligent Computing Systems (ADICS); IEEE: Piscataway, NJ, USA, 2024; pp. 1–6. [Google Scholar]
- Majumder, M.; Wilmot, C. Automated vehicle counting from pre-recorded video using you only look once (YOLO) object detection model. J. Imaging 2023, 9, 131. [Google Scholar] [CrossRef]
- Girshick, R. Fast r-cnn. In Proceedings of the IEEE International Conference on Computer Vision, Santiago, Chile, 7–13 December 2015; pp. 1440–1448. [Google Scholar]
- He, K.; Gkioxari, G.; Dollár, P.; Girshick, R. Mask r-cnn. In Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy, 22–29 October 2017; pp. 2961–2969. [Google Scholar]
- Kim, J.H.; Kwon, S.; Fu, J.; Park, J.H. Hair follicle classification and hair loss severity estimation using mask R-CNN. J. Imaging 2022, 8, 283. [Google Scholar] [CrossRef]
- Shi, B.; Bai, X.; Yao, C. An end-to-end trainable neural network for image-based sequence recognition and its application to scene text recognition. IEEE Trans. Pattern Anal. Mach. Intell. 2016, 39, 2298–2304. [Google Scholar] [CrossRef]
- Liao, M.; Wan, Z.; Yao, C.; Chen, K.; Bai, X. Real-time scene text detection with differentiable binarization. In Proceedings of the AAAI Conference on Artificial Intelligence, New York, NY, USA, 9–11 February 2020; Volume 34, pp. 11474–11481. [Google Scholar]
- Liao, M.; Zou, Z.; Wan, Z.; Yao, C.; Bai, X. Real-time scene text detection with differentiable binarization and adaptive scale fusion. IEEE Trans. Pattern Anal. Mach. Intell. 2022, 45, 919–931. [Google Scholar] [CrossRef]
- Laroca, R.; Araujo, A.B.; Zanlorensi, L.A.; De Almeida, E.C.; Menotti, D. Towards image-based automatic meter reading in unconstrained scenarios: A robust and efficient approach. IEEE Access 2021, 9, 67569–67584. [Google Scholar] [CrossRef]
- Wang, A.; Chen, H.; Liu, L.; Chen, K.; Lin, Z.; Han, J.; Ding, G. Yolov10: Real-time end-to-end object detection. Adv. Neural Inf. Process. Syst. 2024, 37, 107984–108011. [Google Scholar]
- Rahman, Z.U.; Jobson, D.J.; Woodell, G.A. Retinex processing for automatic image enhancement. J. Electron. Imaging 2004, 13, 100–110. [Google Scholar] [CrossRef]
- Anis, A.; Khaliluzzaman, M.; Yakub, M.; Chakraborty, N.; Deb, K. Digital electric meter reading recognition based on horizontal and vertical binary pattern. In Proceedings of the 2017 3rd International Conference on Electrical Information and Communication Technology (EICT); IEEE: Piscataway, NJ, USA, 2017; pp. 1–6. [Google Scholar]
- Bai, Q.; Zhao, L.; Zhang, Y.; Qi, Z. Research of automatic recognition of digital meter reading based on intelligent image processing. In Proceedings of the 2010 2nd International Conference on Computer Engineering and Technology; IEEE: Piscataway, NJ, USA, 2010; Volume 5, pp. pp. 619–623. [Google Scholar]
- Shuo, H.; Ximing, Y.; Donghang, L.; Shaoli, L.; Yu, P. Digital recognition of electric meter with deep learning. In Proceedings of the 2019 14th IEEE International Conference on Electronic Measurement & Instruments (ICEMI); IEEE: Piscataway, NJ, USA, 2019; pp. 600–607. [Google Scholar]
- Iqbal, A.; Basit, A.; Ali, I.; Babar, J.; Ullah, I. Automated Meter Reading Detection Using Inception with Single Shot Multi-Box Detector. Intell. Autom. Soft Comput. 2021, 27, 299–309. [Google Scholar] [CrossRef]
- Zhang, J.; Liu, W.; Xu, S.; Zhang, X. Key point localization and recurrent neural network based water meter reading recognition. Displays 2022, 74, 102222. [Google Scholar] [CrossRef]
- Shafiee, M.J.; Chywl, B.; Li, F.; Wong, A. Fast YOLO: A fast you only look once system for real-time embedded object detection in video. arXiv 2017, arXiv:1709.05943. [Google Scholar] [CrossRef]
- Gómez, L.; Rusinol, M.; Karatzas, D. Cutting Sayre’s Knot: Reading scene text without segmentation. application to utility meters. In Proceedings of the 2018 13th IAPR International Workshop on Document Analysis Systems (DAS); IEEE: Piscataway, NJ, USA, 2018; pp. 97–102. [Google Scholar]
- Redmon, J.; Farhadi, A. Yolov3: An incremental improvement. arXiv 2018, arXiv:1804.02767. [Google Scholar] [CrossRef]
- Li, W.; Wang, S.; Ullah, I.; Zhang, X.; Duan, J. Multiple attention-based encoder–decoder networks for gas meter character recognition. Sci. Rep. 2022, 12, 10371. [Google Scholar] [CrossRef]
- Fu, X.; Liao, Y.; Zeng, D.; Huang, Y.; Zhang, X.P.; Ding, X. A probabilistic method for image enhancement with simultaneous illumination and reflectance estimation. IEEE Trans. Image Process. 2015, 24, 4965–4977. [Google Scholar] [CrossRef]
- Fu, X.; Zeng, D.; Huang, Y.; Zhang, X.P.; Ding, X. A weighted variational model for simultaneous reflectance and illumination estimation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 27–30 June 2016; pp. 2782–2790. [Google Scholar]
- Li, M.; Liu, J.; Yang, W.; Sun, X.; Guo, Z. Structure-revealing low-light image enhancement via robust retinex model. IEEE Trans. Image Process. 2018, 27, 2828–2841. [Google Scholar] [CrossRef] [PubMed]
- Lore, K.G.; Akintayo, A.; Sarkar, S. LLNet: A deep autoencoder approach to natural low-light image enhancement. Pattern Recognit. 2017, 61, 650–662. [Google Scholar] [CrossRef]
- Wei, C.; Wang, W.; Yang, W.; Liu, J. Deep retinex decomposition for low-light enhancement. arXiv 2018, arXiv:1808.04560. [Google Scholar] [CrossRef]
- Guo, C.; Li, C.; Guo, J.; Loy, C.C.; Hou, J.; Kwong, S.; Cong, R. Zero-reference deep curve estimation for low-light image enhancement. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA, 13–19 June 2020; pp. 1780–1789. [Google Scholar]
- Jiang, Y.; Gong, X.; Liu, D.; Cheng, Y.; Fang, C.; Shen, X.; Yang, J.; Zhou, P.; Wang, Z. Enlightengan: Deep light enhancement without paired supervision. IEEE Trans. Image Process. 2021, 30, 2340–2349. [Google Scholar] [CrossRef]
- Cui, Z.; Li, K.; Gu, L.; Su, S.; Gao, P.; Jiang, Z.; Qiao, Y.; Harada, T. You only need 90k parameters to adapt light: A light weight transformer for image enhancement and exposure correction. arXiv 2022, arXiv:2205.14871. [Google Scholar] [CrossRef]
- Wang, C.Y.; Yeh, I.H.; Mark Liao, H.Y. Yolov9: Learning what you want to learn using programmable gradient information. In Proceedings of the European Conference on Computer Vision; Springer: Berlin/Heidelberg, Germany, 2024; pp. 1–21. [Google Scholar]
- Lin, Z.; Wu, Y.; Ma, Y.; Chen, J.; Zhang, R.; Wu, J.; Yin, G.; Lin, L. YOLO-LLTS: Real-Time Low-Light Traffic Sign Detection via Prior-Guided Enhancement and Multi-Branch Feature Interaction. arXiv 2025, arXiv:2503.13883. [Google Scholar] [CrossRef]
- Howard, A.; Sandler, M.; Chu, G.; Chen, L.C.; Chen, B.; Tan, M.; Wang, W.; Zhu, Y.; Pang, R.; Vasudevan, V.; et al. Searching for mobilenetv3. In Proceedings of the IEEE/CVF International Conference on Computer Vision, Seoul, Republic of Korea, 27 October–2 November 2019; pp. 1314–1324. [Google Scholar]
- Chen, Y.; Wang, Z.; Peng, Y.; Zhang, Z.; Yu, G.; Sun, J. Cascaded pyramid network for multi-person pose estimation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, 18–23 June 2018; pp. 7103–7112. [Google Scholar]
- Cui, C.; Sun, T.; Lin, M.; Gao, T.; Zhang, Y.; Liu, J.; Wang, X.; Zhang, Z.; Zhou, C.; Liu, H.; et al. Paddleocr 3.0 technical report. arXiv 2025, arXiv:2507.05595. [Google Scholar] [CrossRef]
- Du, Y.; Chen, Z.; Jia, C.; Yin, X.; Zheng, T.; Li, C.; Du, Y.; Jiang, Y.G. Svtr: Scene text recognition with a single visual model. arXiv 2022, arXiv:2205.00159. [Google Scholar] [CrossRef]
- Graves, A.; Fernández, S.; Gomez, F.; Schmidhuber, J. Connectionist temporal classification: Labelling unsegmented sequence data with recurrent neural networks. In Proceedings of the 23rd International Conference on Machine Learning, Pittsburgh, PA, USA, 25–29 June 2006; pp. 369–376. [Google Scholar]
- Li, H.; Wang, P.; Shen, C.; Zhang, G. Show, attend and read: A simple and strong baseline for irregular text recognition. In Proceedings of the AAAI Conference on Artificial Intelligence, Honolulu, HI, USA, 27 January–1 February 2019; Volume 33, pp. 8610–8617. [Google Scholar]
- Zhou, P.; Xie, X.; Lin, Z.; Yan, S. Towards understanding convergence and generalization of AdamW. IEEE Trans. Pattern Anal. Mach. Intell. 2024, 46, 6486–6493. [Google Scholar] [CrossRef]
- Ren, S.; He, K.; Girshick, R.; Sun, J. Faster r-cnn: Towards real-time object detection with region proposal networks. Adv. Neural Inf. Process. Syst. 2015, 28, 1–9. [Google Scholar] [CrossRef]
- Tan, M.; Pang, R.; Le, Q.V. Efficientdet: Scalable and efficient object detection. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA, 13–19 June 2020; pp. 10781–10790. [Google Scholar]
- Borisyuk, F.; Gordo, A.; Sivakumar, V. Rosetta: Large scale system for text detection and recognition in images. In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, London, UK, 19–23 August 2018; pp. 71–79. [Google Scholar]
- Howard, A.G. Mobilenets: Efficient convolutional neural networks for mobile vision applications. arXiv 2017, arXiv:1704.04861. [Google Scholar] [CrossRef]














| Methods | P | R | F1 | mAP50 | Acc | Params (M) | Time (ms) |
|---|---|---|---|---|---|---|---|
| CRNN | - | - | - | - | 91.8 ± 0.3 | 8.3 | 20.3 |
| EfficientDet-D2 | 94.1 | 95.5 | 94.8 | 95.3 | 91.5 ± 0.2 | 8.1 | 18.4 |
| Faster R-CNN | 96.5 | 95.8 | 96.1 | 96.4 | 92.3 ± 0.3 | 28.4 | 46.7 |
| YOLO-CPDM+EERRM | 95.2 | 96.8 | 96.0 | 96.8 | 91.6 ± 0.4 | 12.8 | 30.2 |
| CFCP | 96.3 | 98.0 | 97.1 | 98.4 | 93.4 ± 0.1 | 7.9 | 17.1 |
| Methods | P | R | F1 | mAP50 | Acc |
|---|---|---|---|---|---|
| CRNN | - | - | - | - | 78.2 ± 0.5 |
| EfficientDet-D2 | 84.1 | 85.2 | 84.6 | 86.3 | 74.1 ± 0.4 |
| Faster R-CNN | 88.5 | 87.2 | 87.8 | 87.1 | 80.3 ± 0.5 |
| YOLO-CPDM+EERRM | 86.4 | 88.1 | 87.2 | 85.3 | 81.9 ± 0.3 |
| CFCP | 92.4 | 93.1 | 92.7 | 91.7 | 83.2 ± 0.2 |
| Methods | P | R | F1 | mAP50 | Acc |
|---|---|---|---|---|---|
| CRNN | - | - | - | - | |
| EfficientDet-D2 | 95.4 | 96.2 | 95.8 | 96.3 | |
| Faster R-CNN | 97.8 | 96.6 | 97.2 | 97.5 | |
| YOLO-CPDM+EERRM | 96.5 | 97.5 | 97 | 98.6 | |
| CFCP | 97.5 | 98.8 | 98.1 | 99.1 | 96.2 ± 0.1 |
| Model | AICM | IFPM | Precision | Recall | F1-Scores | Acc |
|---|---|---|---|---|---|---|
| Baseline | × | × | 69.3 | 70.2 | 69.7 | 60.3 |
| Baseline+AICM | ✓ | × | 89.5 | 88.7 | 89.1 | 85.0 |
| Baseline +IFPM | × | ✓ | 77.2 | 79.8 | 78.5 | 65.4 |
| CFCP | ✓ | ✓ | 96.3 | 98.0 | 97.1 | 93.2 |
| Recognition Model | Acc | Params | Time |
|---|---|---|---|
| CRNN | 91.2 | 8.3 M | 20.3 ms |
| Rosetta | 90.6 | 44.3 M | 67.5 ms |
| SVTR-Base | 95.1 | 24.6 M | 43.8 ms |
| CFCP | 93.2 | 4.9 M | 11.5 ms |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Fu, H.; Xie, Z.; Jiang, W.; Ma, X.; Yang, D. A Two-Level Illumination Correction Network for Digital Meter Reading Recognition in Non-Uniform Low-Light Conditions. J. Imaging 2026, 12, 146. https://doi.org/10.3390/jimaging12040146
Fu H, Xie Z, Jiang W, Ma X, Yang D. A Two-Level Illumination Correction Network for Digital Meter Reading Recognition in Non-Uniform Low-Light Conditions. Journal of Imaging. 2026; 12(4):146. https://doi.org/10.3390/jimaging12040146
Chicago/Turabian StyleFu, Haoning, Zhiwei Xie, Wenzhu Jiang, Xingjiang Ma, and Dongying Yang. 2026. "A Two-Level Illumination Correction Network for Digital Meter Reading Recognition in Non-Uniform Low-Light Conditions" Journal of Imaging 12, no. 4: 146. https://doi.org/10.3390/jimaging12040146
APA StyleFu, H., Xie, Z., Jiang, W., Ma, X., & Yang, D. (2026). A Two-Level Illumination Correction Network for Digital Meter Reading Recognition in Non-Uniform Low-Light Conditions. Journal of Imaging, 12(4), 146. https://doi.org/10.3390/jimaging12040146

