Towards Reliable Evaluation of Underwater Image Enhancement Using Subjective and Objective Analysis
Abstract
1. Introduction
- We conduct a systematic multi-dataset subjective and objective evaluation of four state-of-the-art deep learning-based UIE models in laboratory conditions according to ITU-R BT.500-15 [12] using images from the UIEBD and EUVP datasets.
- We demonstrate that the perceived performance of UIE models is strongly dataset-dependent, with methods such as UDNet and GuidedHybSensUIR achieving subjective quality comparable to or exceeding that of raw images in specific scenarios across the considered datasets.
- We show that recent general-purpose no-reference IQA measures, such as no-reference top-down image quality assessment (TOPIQ_NR) [13] and the learning-based image quality evaluator (LIQE) [14], achieve higher correlation with subjective MOS values than traditional underwater-specific measures such as the underwater image quality measure (UIQM) [15] and underwater color image quality evaluation (UCIQE) [16].
2. Related Work
2.1. Underwater Image Enhancement Algorithms
2.1.1. Traditional and Prior-Driven Methods
2.1.2. Deep Learning-Based Methods
2.2. Objective Image Quality Assessment Measures
2.2.1. General-Purpose Image Quality Assessment
2.2.2. Underwater Image Quality Assessment
3. Dataset Construction and Subjective Evaluation
- A total of 12 images from the subset of 890 images with quasi-reference—specifically, images “3650”, “3728”, “3925”, “3947”, “9547”, “9554”, “9557”, “12290”, “12299”, “12324”, “12336”, and “15113”;
- A total of 12 images from the subset of 60 images without quasi-reference (UIEBD-Challenging)—namely, images “52”, “102”, “432”, “579”, “605”, “616”, “627”, “770”, “866”, “880”, “2575”, and “2856”.
3.1. Subjective Quality Assessment on the UIEBD Dataset
3.2. Subjective Quality Assessment on the EUVP Dataset
4. Objective Quality Assessment
4.1. No-Reference Image Quality Assessment
- General-purpose NR-IQA methods (22): ARNIQA, BRISQUE, CLIP-IQA+, CLIP-IQA, CNNIQA, DBCNN, HyperIQA, ILNIQE, LIQE, MANIQA, MUSIQ, NIQE, NRQM, PAQ-2-PIQ, PI, PIQE, Q-Align, QualiCLIP+, QualiCLIP, TOPIQ_NR, TReS, and WADIQAM_NR.
- Underwater NR-IQA methods (11): URanker, CCF, CSN_uwiqa, CSN_uid2021, UIF, UIQI, UIQM, UICM, UISM, UIConM, and UCIQE.
4.2. Full-Reference Image Quality Assessment
5. Discussion
5.1. Subjective Evaluation and Statistical Analysis
5.2. Implications for Objective Quality Assessment and Evaluation Protocols
6. Conclusions and Future Research
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
Abbreviations
| IQA | Image quality assessment |
| UIE | Underwater image enhancement |
| MOS | Mean opinion score |
| NR-IQA | No-reference image quality assessment |
| UIQA | Underwater image quality assessment |
| DCP | Dark-channel prior |
| UDCP | Underwater dark-channel prior |
| UIFM | Underwater image formation model |
| ICSP | Illumination-channel sparsity prior |
| ACCC | Attenuated color-channel correction |
| MSPE | Multi-interval sub-histogram perspective equalization |
| UVIC | Underwater vignetting image correction |
| CCMF | Color correction with multi-scale fusion |
| UCCNet | Underwater color correction network |
| UCCNet-KT | Underwater color correction network with knowledge transfer |
| CVE-Net | Cross-view enhancement network |
| GuidedHybSensUIR | Prior-guided hybrid-sense underwater image restoration |
| HCLR-Net | Hybrid contrastive learning regularization network |
| CCL-Net | Cascaded contrastive learning network |
| TAFormer | Transmission-aware Swin Transformer |
| HUPE | Heuristic underwater perceptual enhancement |
| UDNet | Uncertainty distribution network |
| FR | Full reference |
| NR | No reference |
| FR-IQA | Full-reference image quality assessment |
| SSIM | Structural similarity index |
| MS-SSIM | Multi-scale structural similarity index |
| IQM2 | Image quality measure 2 |
| PSNR | Peak signal-to-noise ratio |
| RR-IQA | Reduced-reference image quality assessment |
| NSSs | Natural scene statistics |
| BRISQUE | Blind/Referenceless Image Spatial Quality Evaluator |
| NIQE | Natural Image Quality Evaluator |
| ILNIQE | Integrated Local Natural Image Quality Evaluator |
| NRQM | No-Reference Quality Metric |
| PIQE | Perception-based Image Quality Evaluator |
| PI | Perceptual Index |
| ARNIQA | Adaptive representation-based no-reference image quality assessment method |
| CNNIQA | Convolutional neural network image quality assessment |
| DBCNN | Deep bilinear convolutional neural network |
| HyperIQA | Hyper network-based image quality assessment |
| MANIQA | Multi-dimension attention network for image quality assessment |
| MUSIQ | Multi-scale image quality transformer |
| PAQ-2-PIQ | Patch-to-image quality ranking |
| TOPIQ | Top-down image quality assessment |
| TReS | Transformer-based ranking strategy |
| WADIQAM | Weighted average deep image quality assessment model |
| CLIP-IQA | Contrastive language-image pre-training based image quality assessment |
| QualiCLIP | Quality-aware contrastive language–image pre-training |
| Q-Align | Quality alignment |
| LIQE | Learning-based image quality evaluator |
| UIQM | Underwater image quality measure |
| UICM | Underwater image colorfulness measure |
| UISM | Underwater image sharpness measure |
| UIConM | Underwater image contrast measure |
| UCIQE | Underwater color image quality evaluation |
| CCF | Colorfulness index, contrast index and fog density index |
| AMQI | Attention and mamba-driven quality index |
| ATUIQP | Attention and transformer-driven underwater image quality predictor |
| URanker | Underwater ranker |
| CSN | Contrast index, sharpness index, and naturalness index |
| UIF | Underwater image fidelity |
| UIQI | Underwater image quality index |
| ACR | Absolute category rating |
| CI | Confidence interval |
| ICC | Intraclass correlation coefficient |
| SIFT | Scale-invariant feature transform |
| SURF | Speeded-up robust features |
| mAP | Mean Average Precision |
| IoU | Intersection over Union |
References
- Saad Saoud, L.; Elmezain, M.; Sultan, A.; Heshmat, M.; Seneviratne, L.; Hussain, I. Seeing Through the Haze: A Comprehensive Review of Underwater Image Enhancement Techniques. IEEE Access 2024, 12, 145206–145233. [Google Scholar] [CrossRef]
- Wang, M.; Zhang, K.; Wei, H.; Chen, W.; Zhao, T. Underwater image quality optimization: Researches, challenges, and future trends. Image Vis. Comput. 2024, 146, 104995. [Google Scholar] [CrossRef]
- Shuang, X.; Zhang, J.; Tian, Y. Algorithms for improving the quality of underwater optical images: A comprehensive review. Signal Process. 2024, 219, 109408. [Google Scholar] [CrossRef]
- Li, B.; Chen, Z.; Lu, L.; Qi, P.; Zhang, L.; Ma, Q.; Hu, H.; Zhai, J.; Li, X. Cascaded frameworks in underwater optical image restoration. Inf. Fusion 2025, 117, 102809. [Google Scholar] [CrossRef]
- Li, X.; Han, Y.; Wang, H.; Liu, T.; Chen, S.; Hu, H. Polarimetric Imaging Through Scattering Media: A Review. Front. Phys. 2022, 10, 815296. [Google Scholar] [CrossRef]
- Li, C.; Guo, C.; Ren, W.; Cong, R.; Hou, J.; Kwong, S.; Tao, D. An Underwater Image Enhancement Benchmark Dataset and Beyond. IEEE Trans. Image Process. 2019, 29, 4376–4389. [Google Scholar] [CrossRef] [PubMed]
- Islam, M.J.; Xia, Y.; Sattar, J. Fast Underwater Image Enhancement for Improved Visual Perception. IEEE Robot. Autom. Lett. 2020, 5, 3227–3234. [Google Scholar] [CrossRef]
- Liu, Y.; Jiang, Q.; Wang, X.; Luo, T.; Zhou, J. Underwater Image Enhancement with Cascaded Contrastive Learning. IEEE Trans. Multimed. 2025, 27, 1512–1525. [Google Scholar] [CrossRef]
- Guo, X.; Chen, X.; Wang, S.; Pun, C.-M. Underwater Image Restoration Through a Prior Guided Hybrid Sense Approach and Extensive Benchmark Analysis. IEEE Trans. Circuits Syst. Video Technol. 2025, 35, 4784–4800. [Google Scholar] [CrossRef]
- Zhang, Z.; Jiang, Z.; Ma, L.; Liu, J.; Fan, X.; Liu, R. HUPE: Heuristic Underwater Perceptual Enhancement with Semantic Collaborative Learning. Int. J. Comput. Vis. 2025, 133, 3259–3277. [Google Scholar] [CrossRef]
- Saleh, A.; Sheaves, M.; Jerry, D.; Azghadi, M.R. Adaptive deep learning framework for robust unsupervised underwater image enhancement. Expert Syst. Appl. 2025, 268, 126314. [Google Scholar] [CrossRef]
- ITU-R BT.500-15; Methodology for the Subjective Assessment of the Quality of Television Pictures. International Telecommunications Union: Geneva, Switzerland, 2023.
- Chen, C.; Mo, J.; Hou, J.; Wu, H.; Liao, L.; Sun, W.; Yan, Q.; Lin, W. TOPIQ: A Top-Down Approach From Semantics to Distortions for Image Quality Assessment. IEEE Trans. Image Process. 2024, 33, 2404–2418. [Google Scholar] [CrossRef]
- Zhang, W.; Zhai, G.; Wei, Y.; Yang, X.; Ma, K. Blind Image Quality Assessment via Vision-Language Correspondence: A Multitask Learning Perspective. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Vancouver, BC, Canada, 18–22 June 2023; pp. 14071–14081. [Google Scholar] [CrossRef]
- Panetta, K.; Gao, C.; Agaian, S. Human-Visual-System-Inspired Underwater Image Quality Measures. IEEE J. Ocean. Eng. 2016, 41, 541–551. [Google Scholar] [CrossRef]
- Yang, M.; Sowmya, A. An Underwater Color Image Quality Evaluation Metric. IEEE Trans. Image Process. 2015, 24, 6062–6071. [Google Scholar] [CrossRef]
- He, K.; Sun, J.; Tang, X. Single Image Haze Removal Using Dark Channel Prior. IEEE Trans. Pattern Anal. Mach. Intell. 2011, 33, 2341–2353. [Google Scholar] [CrossRef] [PubMed]
- Drews, P.L.J.; Nascimento, E.R.; Botelho, S.S.C.; Campos, M.F.M. Underwater Depth Estimation and Image Restoration Based on Single Images. IEEE Comput. Graph. Appl. 2016, 36, 24–35. [Google Scholar] [CrossRef]
- Fu, X.; Zhuang, P.; Huang, Y.; Liao, Y.; Zhang, X.-P.; Ding, X. A Retinex-Based Enhancing Approach for Single Underwater Image. In Proceedings of the IEEE International Conference on Image Processing (ICIP), Paris, France, 27–30 October 2014; pp. 4572–4576. [Google Scholar] [CrossRef]
- Peng, Y.-T.; Cao, K.; Cosman, P.C. Generalization of the Dark Channel Prior for Single Image Restoration. IEEE Trans. Image Process. 2018, 27, 2856–2868. [Google Scholar] [CrossRef] [PubMed]
- Galdran, A.; Pardo, D.; Picón, A.; Alvarez-Gila, A. Automatic Red-Channel Underwater Image Restoration. J. Vis. Commun. Image Represent. 2015, 26, 132–145. [Google Scholar] [CrossRef]
- Peng, Y.-T.; Cosman, P.C. Underwater Image Restoration Based on Image Blurriness and Light Absorption. IEEE Trans. Image Process. 2017, 26, 1579–1594. [Google Scholar] [CrossRef]
- Xie, J.; Hou, G.; Wang, G.; Pan, Z. A Variational Framework for Underwater Image Dehazing and Deblurring. IEEE Trans. Circuits Syst. Video Technol. 2022, 32, 3514–3526. [Google Scholar] [CrossRef]
- Hou, G.; Li, N.; Zhuang, P.; Li, K.; Sun, H.; Li, C. Non-Uniform Illumination Underwater Image Restoration via Illumination Channel Sparsity Prior. IEEE Trans. Circuits Syst. Video Technol. 2024, 34, 799–814. [Google Scholar] [CrossRef]
- Zhang, W.; Wang, Y.; Li, C. Underwater Image Enhancement by Attenuated Color Channel Correction and Detail Preserved Contrast Enhancement. IEEE J. Ocean. Eng. 2022, 47, 718–735. [Google Scholar] [CrossRef]
- Zhou, J.; Pang, L.; Zhang, D.; Zhang, W. Underwater Image Enhancement Method via Multi-Interval Subhistogram Perspective Equalization. IEEE J. Ocean. Eng. 2023, 48, 474–488. [Google Scholar] [CrossRef]
- Wang, Y.; Ma, Y.; Li, Y.; Zhang, J.; Mi, Z.; Fu, X. Underwater Vignetting Image Correction Based on Binary Polynomial Regularization and Latent Low-Rank Representation. IEEE Trans. Circuits Syst. Video Technol. 2025, 35, 3410–3425. [Google Scholar] [CrossRef]
- Zhang, D.; He, Z.; Zhang, X.; Wang, Z.; Ge, W.; Shi, T.; Lin, Y. Underwater image enhancement via multi-scale fusion and adaptive color-gamma correction in low-light conditions. Eng. Appl. Artif. Intell. 2023, 126, 106972. [Google Scholar] [CrossRef]
- Schechner, Y.Y.; Karpel, N. Clear underwater vision. In Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR), Washington, DC, USA, 27 June–2 July 2004; Volume 1, pp. 536–543. [Google Scholar] [CrossRef]
- Chen, Z.; Wu, J.; Hu, H.; Li, X. Underwater polarimetric descattering via scene adaptation and multi-parameter optimization. Opt. Lasers Eng. 2026, 196, 109410. [Google Scholar] [CrossRef]
- Li, C.; Anwar, S.; Hou, J.; Cong, R.; Guo, C.; Ren, W. Underwater Image Enhancement via Medium Transmission-Guided Multi-Color Space Embedding. IEEE Trans. Image Process. 2021, 30, 4985–5000. [Google Scholar] [CrossRef]
- Lin, P.; Wang, Y.; Li, Y.; Fan, Z.; Fu, X. Underwater Color Correction Network With Knowledge Transfer. IEEE Trans. Multimed. 2024, 26, 8088–8103. [Google Scholar] [CrossRef]
- Zhou, J.; Zhang, D.; Zhang, W. Cross-view enhancement network for underwater images. Eng. Appl. Artif. Intell. 2023, 121, 105952. [Google Scholar] [CrossRef]
- Zhou, J.; Sun, J.; Li, C.; Jiang, Q.; Zhou, M.; Lam, K.-M.; Zhang, W.; Fu, X. HCLR-Net: Hybrid Contrastive Learning Regularization with Locally Randomized Perturbation for Underwater Image Enhancement. Int. J. Comput. Vis. 2024, 132, 4132–4156. [Google Scholar] [CrossRef]
- Li, Y.; Mi, Z.; Wang, Y.; Jiang, S.; Fu, X. TAFormer: A Transmission-Aware Transformer for Underwater Image Enhancement. IEEE Trans. Circuits Syst. Video Technol. 2025, 35, 601–616. [Google Scholar] [CrossRef]
- Khan, M.R.; Negi, A.; Kulkarni, A.; Phutke, S.S.; Vipparthi, S.K.; Murala, S. Phaseformer: Phase-Based Attention Mechanism for Underwater Image Restoration and Beyond. In Proceedings of the 2025 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), Tucson, AZ, USA, 28 February–4 March 2025; pp. 9618–9629. [Google Scholar] [CrossRef]
- Tang, Y.; Kawasaki, H.; Iwaguchi, T. Underwater Image Enhancement by Transformer-based Diffusion Model with Non-uniform Sampling for Skip Strategy. In Proceedings of the 31st ACM International Conference on Multimedia, Ottawa, ON, Canada, 29 October–3 November 2023; pp. 5419–5427. [Google Scholar] [CrossRef]
- Wang, Z.; Bovik, A.C.; Sheikh, H.R.; Simoncelli, E.P. Image quality assessment: From error visibility to structural similarity. IEEE Trans. Image Process. 2004, 13, 600–612. [Google Scholar] [CrossRef]
- Wang, Z.; Simoncelli, E.P.; Bovik, A.C. Multiscale structural similarity for image quality assessment. In Proceedings of the Thirty-Seventh Asilomar Conference on Signals, Systems & Computers, Pacific Grove, CA, USA, 9–12 November 2003; Volume 2, pp. 1398–1402. [Google Scholar] [CrossRef]
- Dumic, E.; Grgic, S.; Grgic, M. IQM2: New image quality measure based on steerable pyramid wavelet transform and structural similarity index. Signal Image Video Process. 2014, 8, 1159–1168. [Google Scholar] [CrossRef]
- Mittal, A.; Moorthy, A.K.; Bovik, A.C. No-Reference Image Quality Assessment in the Spatial Domain. IEEE Trans. Image Process. 2012, 21, 4695–4708. [Google Scholar] [CrossRef] [PubMed]
- Mittal, A.; Soundararajan, R.; Bovik, A.C. Making a “Completely Blind” Image Quality Analyzer. IEEE Signal Process. Lett. 2013, 20, 209–212. [Google Scholar] [CrossRef]
- Zhang, L.; Zhang, L.; Bovik, A.C. A Feature-Enriched Completely Blind Image Quality Evaluator. IEEE Trans. Image Process. 2015, 24, 2579–2591. [Google Scholar] [CrossRef]
- Ma, C.; Yang, C.-Y.; Yang, X.; Yang, M.-H. Learning a no-reference quality metric for single-image super-resolution. Comput. Vis. Image Underst. 2017, 158, 1–16. [Google Scholar] [CrossRef]
- Venkatanath, N.; Praneeth, D.; Maruthi Chandrasekhar, B.; Channappayya, S.S.; Medasani, S.S. Blind image quality evaluation using perception based features. In Proceedings of the 2015 Twenty First National Conference on Communications (NCC), Mumbai, India, 27 February–1 March 2015; pp. 1–6. [Google Scholar] [CrossRef]
- Blau, Y.; Michaeli, T. The Perception-Distortion Tradeoff. In Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, 18–23 June 2018; pp. 6228–6237. [Google Scholar] [CrossRef]
- Agnolucci, L.; Galteri, L.; Bertini, M.; Del Bimbo, A. ARNIQA: Learning Distortion Manifold for Image Quality Assessment. In Proceedings of the 2024 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), Waikoloa, HI, USA, 1–6 January 2024; pp. 188–197. [Google Scholar]
- Kang, L.; Ye, P.; Li, Y.; Doermann, D. Convolutional Neural Networks for No-Reference Image Quality Assessment. In Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, USA, 23–28 June 2014; pp. 1733–1740. [Google Scholar] [CrossRef]
- Zhang, W.; Ma, K.; Yan, J.; Deng, D.; Wang, Z. Blind Image Quality Assessment Using a Deep Bilinear Convolutional Neural Network. EEE Trans. Circuits Syst. Video Technol. 2020, 30, 36–47. [Google Scholar] [CrossRef]
- Su, S.; Yan, Q.; Zhu, Y.; Zhang, C.; Ge, X.; Sun, J.; Zhang, Y. Blindly Assess Image Quality in the Wild Guided by a Self-Adaptive Hyper Network. In Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA, 13–19 June 2020; pp. 3664–3673. [Google Scholar] [CrossRef]
- Yang, S.; Wu, T.; Shi, S.; Lao, S.; Gong, Y.; Cao, M.; Wang, J.; Yang, Y. MANIQA: Multi-dimension Attention Network for No-Reference Image Quality Assessment. In Proceedings of the 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), New Orleans, LA, USA, 19–20 June 2022; pp. 1190–1199. [Google Scholar] [CrossRef]
- Ke, J.; Wang, Q.; Wang, Y.; Milanfar, P.; Yang, F. MUSIQ: Multi-scale Image Quality Transformer. In Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision (ICCV), Montreal, QC, Canada, 11–17 October 2021; pp. 5128–5137. [Google Scholar] [CrossRef]
- Ying, Z.; Niu, H.; Gupta, P.; Mahajan, D.; Ghadiyaram, D.; Bovik, A.C. From Patches to Pictures (PaQ-2-PiQ): Mapping the Perceptual Space of Picture Quality. In Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA, 14–19 June 2020; pp. 3572–3582. [Google Scholar] [CrossRef]
- Golestaneh, S.A.; Dadsetan, S.; Kitani, K.M. No-Reference Image Quality Assessment via Transformers, Relative Ranking, and Self-Consistency. In Proceedings of the 2022 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), Waikoloa, HI, USA, 3–8 January 2022; pp. 3989–3999. [Google Scholar] [CrossRef]
- Bosse, S.; Maniry, D.; Müller, K.-R.; Wiegand, T.; Samek, W. Deep Neural Networks for No-Reference and Full-Reference Image Quality Assessment. IEEE Trans. Image Process. 2018, 27, 206–219. [Google Scholar] [CrossRef]
- Wang, J.; Chan, K.C.K.; Loy, C.C. Exploring CLIP for assessing the look and feel of images. In Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence and Thirty-Fifth Conference on Innovative Applications of Artificial Intelligence and Thirteenth Symposium on Educational Advances in Artificial Intelligence (AAAI’23/IAAI’23/EAAI’23), Washington, DC, USA, 7–14 February 2023; pp. 2555–2563. [Google Scholar] [CrossRef]
- Agnolucci, L.; Galteri, L.; Bertini, M. Quality-Aware Image-Text Alignment for Opinion-Unaware Image Quality Assessment. arXiv 2024, arXiv:2403.11176. [Google Scholar] [CrossRef]
- Wu, H.; Zhang, Z.; Zhang, W.; Chen, C.; Liao, L.; Li, C.; Gao, Y.; Wang, A.; Zhang, E.; Sun, W.; et al. Q-ALIGN: Teaching LMMs for visual scoring via discrete text-defined levels. In Proceedings of the 41st International Conference on Machine Learning (ICML’24), Vienna, Austria, 21–27 July 2024. [Google Scholar]
- Wang, Y.; Li, N.; Li, Z.; Gu, Z.; Zheng, H.; Zheng, B.; Sun, M. An imaging-inspired no-reference underwater color image quality assessment metric. Comput. Electr. Eng. 2018, 70, 904–913. [Google Scholar] [CrossRef]
- Hou, G.; Zhang, S.; Lu, T.; Li, Y.; Pan, Z.; Huang, B. Underwater Image Quality Assessment via Color Sensitivity Network. Comput. Electr. Eng. 2024, 118, 109293. [Google Scholar] [CrossRef]
- Zheng, Y.; Chen, W.; Lin, R.; Zhao, T.; Le Callet, P. UIF: An Objective Quality Assessment for Underwater Image Enhancement. IEEE Trans. Image Process. 2022, 31, 5456–5468. [Google Scholar] [CrossRef]
- Liu, Y.; Gu, K.; Cao, J.; Wang, S.; Zhai, G.; Dong, J.; Kwong, S. UIQI: A Comprehensive Quality Evaluation Index for Underwater Images. IEEE Trans. Multimed. 2024, 26, 2560–2573. [Google Scholar] [CrossRef]
- Guo, C.; Wu, R.; Jin, X.; Han, L.; Zhang, W.; Chai, Z.; Li, C. Underwater ranker: Learn which is better and how to be better. In Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence and Thirty-Fifth Conference on Innovative Applications of Artificial Intelligence and Thirteenth Symposium on Educational Advances in Artificial Intelligence (AAAI’23/IAAI’23/EAAI’23), Washington, DC, USA, 7–14 February 2023; pp. 702–709. [Google Scholar] [CrossRef]
- Liu, Y.; Zhang, B.; Hu, R.; Gu, K.; Zhai, G.; Dong, J. Underwater Image Quality Assessment: Benchmark Database and Objective Method. IEEE Trans. Multimed. 2024, 26, 7734–7747. [Google Scholar] [CrossRef]
- Cao, J.; Zhang, B.; Liu, Y.; Hu, R.; Gu, K.; Zhai, G.; Dong, J. Attention and Mamba-Driven Quality Assessment for Underwater Images. IEEE Trans. Multimed. 2025, 27, 9761–9775. [Google Scholar] [CrossRef]
- Chen, C.; Mo, J. IQA-PyTorch: PyTorch Toolbox for Image Quality Assessment. Available online: https://github.com/chaofengc/IQA-PyTorch (accessed on 1 October 2025).
- Chen, X. Python Code for Several Metrics: PSNR, SSIM, UCIQE, and UIQM. Available online: https://github.com/xueleichen/PSNR-SSIM-UCIQE-UIQM-Python (accessed on 1 October 2025).
- CCL-Net: Official implementation of the underwater image enhancement with cascaded contrastive learning. Available online: https://github.com/lewis081/CCL-Net (accessed on 1 October 2025).
- GuidedHybSensUIR: Official implementation of the prior-guided hybrid-sense underwater image restoration. Available online: https://github.com/CXH-Research/GuidedHybSensUIR (accessed on 1 October 2025).
- HUPE: Official implementation of the heuristic underwater perceptual enhancement. Available online: https://github.com/ZengxiZhang/HUPE (accessed on 1 October 2025).
- UDNet: Official implementation of the uncertainty distribution network. Available online: https://github.com/alzayats/UDnet (accessed on 1 October 2025).
- Shrout, P.E.; Fleiss, J.L. Intraclass Correlations: Uses in Assessing Rater Reliability. Psychol. Bull. 1979, 86, 420–428. [Google Scholar] [CrossRef]
- Salarian, A. Intraclass Correlation Coefficient (ICC). Available online: https://www.mathworks.com/matlabcentral/fileexchange/22099-intraclass-correlation-coefficient-icc (accessed on 31 January 2026).
- Lowe, D.G. Distinctive Image Features from Scale-Invariant Keypoints. Int. J. Comput. Vis. 2004, 60, 91–110. [Google Scholar] [CrossRef]
- Bay, H.; Ess, A.; Tuytelaars, T.; Van Gool, L. Speeded-Up Robust Features (SURF). Comput. Vis. Image Underst. 2008, 110, 346–359. [Google Scholar] [CrossRef]





| Domain | Type | IQA Measures |
|---|---|---|
| General-purpose | FR | PSNR, SSIM [38], MS-SSIM [39], IQM2 [40], TOPIQ_FR [13], etc. |
| NR, handcrafted | BRISQUE [41], NIQE [42], ILNIQE [43], NRQM [44], PIQE [45], PI [46], etc. | |
| NR, learning-based | ARNIQA [47], CNNIQA [48], DBCNN [49], HyperIQA [50], MANIQA [51], MUSIQ [52], PAQ-2-PIQ [53], TOPIQ_NR [13], TReS [54], WADIQAM [55], CLIP-IQA/CLIP-IQA+ [56], QualiCLIP/QualiCLIP+ [57], Q-Align [58], LIQE [14], etc. | |
| Underwater-specific | NR, handcrafted | UIQM [15] (UICM, UISM, UIConM), UCIQE [16], CCF [59], CSN [60], UIF [61], UIQI [62], etc. |
| NR, learning-based | URanker [63], ATUIQP [64], AMQI [65], etc. |
| UIEBD | EUVP | |
|---|---|---|
| Monitor | Dell 2407WFP-HC | Dell 2407WFP-HC |
| Screen diagonal | 24″ | 24″ |
| Resolution | 1920 × 1200 pixels | 1920 × 1200 pixels |
| Viewing distance | 0.4 m | 0.4 m |
| Male observers | 6 | 16 |
| Female observers | 10 | 1 |
| Overall | 16 | 17 |
| Age range (years) | 22–40 | 20–41 |
| Average age (years) | 25 | 22 |
| Number of outliers | 0 | 0 |
| Objective Measure | PLCC_C1 | PLCC_C2 | PLCC_C3 | SRCC | KRCC | |
|---|---|---|---|---|---|---|
| General NR-IQA methods | ARNIQA | 0.646 | 0.635 | 0.656 | 0.574 | 0.414 |
| BRISQUE | 0.377 | 0.389 | 0.369 | −0.378 | −0.261 | |
| CLIP-IQA+ | 0.686 | 0.683 | 0.685 | 0.676 | 0.488 | |
| CLIP-IQA | 0.465 | 0.462 | 0.380 | 0.436 | 0.316 | |
| CNNIQA | 0.379 | 0.374 | 0.378 | 0.306 | 0.235 | |
| DBCNN | 0.453 | 0.451 | 0.468 | 0.426 | 0.304 | |
| HyperIQA | 0.414 | 0.401 | 0.429 | 0.392 | 0.278 | |
| ILNIQE | 0.384 | 0.379 | 0.372 | −0.345 | −0.244 | |
| LIQE | 0.465 | 0.453 | 0.504 | 0.460 | 0.321 | |
| MANIQA | 0.405 | 0.407 | 0.409 | 0.373 | 0.271 | |
| MUSIQ | 0.720 | 0.715 | 0.714 | 0.682 | 0.506 | |
| NIQE | 0.379 | 0.382 | 0.423 | −0.357 | −0.255 | |
| NRQM | 0.574 | 0.574 | 0.587 | 0.574 | 0.405 | |
| General NR-IQA methods | PAQ-2-PIQ | 0.511 | 0.506 | 0.499 | 0.506 | 0.357 |
| PI | 0.517 | 0.511 | 0.529 | −0.502 | −0.358 | |
| PIQE | 0.411 | 0.408 | 0.407 | −0.382 | −0.265 | |
| Q-Align | 0.740 | 0.731 | 0.747 | 0.666 | 0.502 | |
| QualiCLIP+ | 0.529 | 0.564 | 0.575 | 0.544 | 0.377 | |
| QualiCLIP | 0.550 | 0.550 | 0.552 | 0.543 | 0.381 | |
| TOPIQ_NR | 0.786 | 0.788 | 0.790 | 0.796 | 0.614 | |
| TReS | 0.435 | 0.432 | 0.416 | 0.436 | 0.313 | |
| WADIQAM_NR | 0.106 | 0.222 | 0.239 | 0.093 | 0.062 | |
| Underwater NR-IQA methods | Uranker | 0.356 | 0.305 | 0.370 | 0.038 | 0.028 |
| CCF | 0.227 | 0.321 | 0.333 | −0.025 | −0.021 | |
| CSN_uwiqa | 0.261 | 0.240 | 0.216 | 0.177 | 0.124 | |
| CSN_uid2021 | 0.256 | 0.359 | 0.255 | 0.117 | 0.082 | |
| UIF | 0.564 | 0.564 | 0.577 | 0.567 | 0.400 | |
| UIQI | 0.246 | 0.246 | 0.246 | 0.261 | 0.181 | |
| UIQM | 0.263 | 0.140 | 0.262 | −0.052 | −0.044 | |
| UICM | 0.163 | 0.210 | 0.237 | −0.131 | −0.087 | |
| UISM | 0.240 | 0.264 | 0.265 | 0.115 | 0.076 | |
| UIConM | 0.300 | 0.359 | 0.359 | −0.115 | −0.065 | |
| UCIQE | 0.250 | 0.319 | 0.349 | −0.034 | −0.027 |
| Objective Measure | PLCC_C1 | PLCC_C2 | PLCC_C3 | SRCC | KRCC | |
|---|---|---|---|---|---|---|
| General NR-IQA methods | ARNIQA | 0.768 | 0.771 | 0.771 | 0.751 | 0.540 |
| BRISQUE | 0.599 | 0.608 | 0.632 | −0.529 | −0.382 | |
| CLIP-IQA+ | 0.856 | 0.856 | 0.857 | 0.844 | 0.663 | |
| CLIP-IQA | 0.806 | 0.806 | 0.806 | 0.787 | 0.603 | |
| CNNIQA | 0.775 | 0.774 | 0.775 | 0.758 | 0.553 | |
| DBCNN | 0.800 | 0.797 | 0.805 | 0.804 | 0.613 | |
| HyperIQA | 0.803 | 0.802 | 0.802 | 0.793 | 0.589 | |
| ILNIQE | 0.427 | 0.470 | 0.419 | −0.375 | −0.261 | |
| LIQE | 0.842 | 0.842 | 0.861 | 0.871 | 0.689 | |
| MANIQA | 0.691 | 0.678 | 0.706 | 0.623 | 0.454 | |
| MUSIQ | 0.848 | 0.852 | 0.802 | 0.832 | 0.650 | |
| NIQE | 0.743 | 0.744 | 0.750 | −0.710 | −0.531 | |
| NRQM | 0.607 | 0.683 | 0.685 | 0.597 | 0.418 | |
| PAQ-2-PIQ | 0.593 | 0.593 | 0.564 | 0.590 | 0.413 | |
| PI | 0.696 | 0.694 | 0.718 | −0.685 | −0.502 | |
| PIQE | 0.538 | 0.487 | 0.573 | −0.348 | −0.248 | |
| Q-Align | 0.694 | 0.692 | 0.702 | 0.656 | 0.484 | |
| QualiCLIP+ | 0.777 | 0.779 | 0.792 | 0.744 | 0.557 | |
| QualiCLIP | 0.861 | 0.859 | 0.860 | 0.844 | 0.649 | |
| TOPIQ_NR | 0.819 | 0.819 | 0.819 | 0.804 | 0.608 | |
| TReS | 0.759 | 0.757 | 0.751 | 0.743 | 0.554 | |
| WADIQAM_NR | 0.599 | 0.598 | 0.598 | 0.579 | 0.412 | |
| Underwater NR-IQA methods | Uranker | 0.224 | 0.247 | 0.275 | 0.190 | 0.128 |
| CCF | 0.186 | 0.183 | 0.187 | 0.166 | 0.113 | |
| CSN_uwiqa | 0.467 | 0.466 | 0.489 | 0.461 | 0.316 | |
| CSN_uid2021 | 0.437 | 0.437 | 0.443 | 0.364 | 0.254 | |
| UIF | 0.518 | 0.518 | 0.540 | 0.472 | 0.336 | |
| UIQI | 0.387 | 0.390 | 0.392 | 0.364 | 0.243 | |
| UIQM | 0.355 | 0.354 | 0.387 | 0.329 | 0.227 | |
| UICM | 0.140 | 0.218 | 0.243 | 0.043 | 0.025 | |
| UISM | 0.463 | 0.462 | 0.479 | 0.418 | 0.301 | |
| UIConM | 0.155 | 0.140 | 0.140 | −0.107 | −0.076 | |
| UCIQE | 0.156 | 0.221 | 0.165 | 0.137 | 0.096 |
| PSNR | SSIM | MS-SSIM | IQM2 | |
|---|---|---|---|---|
| CCL-Net | 23.269 | 0.922 | 0.948 | 0.423 |
| GuidedHybSensUIR | 22.329 | 0.936 | 0.959 | 0.497 |
| HUPE | 20.610 | 0.858 | 0.931 | 0.392 |
| UDNet | 18.816 | 0.866 | 0.873 | 0.196 |
| Raw | 15.758 | 0.724 | 0.815 | 0.134 |
| Group | Control Group | UIEBD, p-Value | EUVP, p-Value |
|---|---|---|---|
| CCL-Net | GuidedHybSensUIR | 0.347 | 0.728 |
| CCL-Net | HUPE | 0.983 | 0.001 |
| CCL-Net | UDNet | 0.089 | 0.998 |
| CCL-Net | Raw | 0.006 | 0.993 |
| GuidedHybSensUIR | HUPE | 0.125 | 0.000 |
| GuidedHybSensUIR | UDNet | 0.959 | 0.887 |
| GuidedHybSensUIR | Raw | 0.473 | 0.928 |
| HUPE | UDNet | 0.022 | 0.000 |
| HUPE | Raw | 0.001 | 0.000 |
| UDNet | Raw | 0.873 | 1.000 |
| Metric Category | Strengths | Limitations | Recommended Usage |
|---|---|---|---|
| Underwater-specific measures | Interpretable quality indicators; designed for underwater image characteristics | Weaker correlation with subjective MOS; limited perceptual consistency | Complementary evaluation of underwater-specific degradations |
| Learning-based NR-IQA measures | Stronger agreement with human perceptual judgments; robust perceptual assessment | Possible domain-shift effects; reduced interpretability | Perceptual evaluation; benchmarking and optimization of UIE algorithms |
| FR-IQA measures | Direct comparison against reference or quasi-reference images | Dependence on reference quality; quasi-reference bias | Controlled evaluation scenarios with available reference information |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Palazari, S.; Dumic, E. Towards Reliable Evaluation of Underwater Image Enhancement Using Subjective and Objective Analysis. Electronics 2026, 15, 2412. https://doi.org/10.3390/electronics15112412
Palazari S, Dumic E. Towards Reliable Evaluation of Underwater Image Enhancement Using Subjective and Objective Analysis. Electronics. 2026; 15(11):2412. https://doi.org/10.3390/electronics15112412
Chicago/Turabian StylePalazari, Stella, and Emil Dumic. 2026. "Towards Reliable Evaluation of Underwater Image Enhancement Using Subjective and Objective Analysis" Electronics 15, no. 11: 2412. https://doi.org/10.3390/electronics15112412
APA StylePalazari, S., & Dumic, E. (2026). Towards Reliable Evaluation of Underwater Image Enhancement Using Subjective and Objective Analysis. Electronics, 15(11), 2412. https://doi.org/10.3390/electronics15112412

