Underwater Image Enhancement Using Hyper-Laplacian Reflectance Priors and CNN-Based Classification †
Abstract
:1. Introduction
- Light reflects back from the captured image, but it cannot be scattered.
- Light reflects back from the image and will be scattered at a small angle.
2. Literature Review
- Hyper-Laplacian priors: Many of the researchers stated that due to the water’s ability to absorb and scatter light, underwater images often provide poor visibility. Because of this reason, the final outcome of the image can be foggy, fuzzy, and lacking in contrast. With the help of the image enhancement method, for example, the hyper-Laplacian filter is one way to increase the quality of the images [17].
- The researcher [18] proposed a method with the combination of GHP (Gradient his- togram preservation) and hyper-Laplacian, which is used to increase the performance of denoising and produce the fine output of the image texture. Research on the images that capture real-world scenes used the hyper-Laplacian as the best model for the gradient images. Also, it has been frequently used in a number of image restoration works [19,20].
3. Methodology
- Data Collection and Preparation:- -
- We collected the dataset of real images and obtained the enhanced images from them via HLRP. We ensured that the images are labeled with the appropriate class (real or enhanced). The set of real and enhanced images with label 1 is shown in Figure 1b.
- -
- The photos were preprocessed by shrinking them to a uniform size and normalizing the pixel values.
- -
- We split the dataset into training, validation, and test sets. The training set will be used to train the CNN, the validation set for hyperparameter tuning, and the test set for final evaluation.
- -
- Then, we predicted the model on new predictable images.
 
- Model Architecture:- -
- We designed a CNN architecture suitable for image classification. This typically involves stacking convolutional layers, followed by pooling layers to capture spatial features and fully connected layers for classification.
- -
- We created a custom architecture based on the complexity of our dataset and available resources.
 
- Model Training:- -
- We added a 2D convolutional layer with 32 filters, a filter size of 3 × 3, ReLU activation function, and ‘same’ padding.
- -
- The image was size 300 by 300.
- -
- We added a max pooling layer with a pool size of 2 × 2, which reduces the spatial dimensions of the feature maps.
- -
- We added a dropout layer that randomly drops 70% of the neurons during training, helping to prevent overfitting.
- -
- These steps are repeated twice with increasing filter sizes (64 and 128) to extract more complex features.
- -
- Then, we added a fully connected layer with 256 neurons, ReLU activation function, and L2 regularization with a regularization coefficient of 0.001. Regularization helps prevent overfitting.
- -
- Then, we added the final fully connected layer with a single neuron and sigmoid activation, which is suitable for binary classification tasks.
- -
- The Adam optimizer is used to optimize the model’s weights during training.
- -
- The model’s performance will be evaluated using accuracy as the metric.
 
- Model Evaluation:- -
- We evaluated the trained model on the test set to assess its performance, passed the test images through the trained network, and compared the predicted labels with the ground truth labels.
- -
- We calculated evaluation metrics such as accuracy, precision, recall, and F1-score to quantify the model’s performance.
- -
- We visualized the results using confusion matrices.
 
4. Discussion of Experimental Results
- Network Training: To train our proposed network, we utilized a dataset consisting of 440 raw underwater images and their corresponding enhanced images. All images in the dataset had a resolution of 300 × 300 and were sourced from the dataset provided by [21]. For the deep-learning framework, we employed the Keras framework running on top of TensorFlow. The training process was conducted over 20 epochs for each dataset. It is worth mentioning that our model was trained on the free version of Google Colab, which operated on a Linux system with an x86_64 processor and 12.68 GB of RAM.
- Qualitative Comparison: The qualitative comparison between the real and enhanced images can be seen from the naked eye. In some photographs, the results reveal considerable colour and contrast improvements. Figure 2a depicts the real images to be predicted, whereas Figure 2b depicts the lighted images to be predicted. The proposed approach generated outcomes that were sharper and clearer. Overall, it is obvious from the visual assessment that the proposed approach can give superior outcomes.
- Quantitative Comparison: Figure 3a shows the RGB histogram of real images, which is evenly spread, whereas, Figure 3b shows the RGB histogram of illuminated images, which has more luminance in a white scale. Figure 4a shows the training accuracy of the system, where it achieved its accuracy within a few epochs despite trying to avoid overfitting. Figure 4b shows the training loss that also reaches minimal after a few epochs. Figure 4c shows the confusion matrix. It is conclusive that the proposed method was quite efficient in enhancing the underwater images. The CNN trained on labeled images real and enhanced was able to accurately differentiate between the two different image qualities.
5. Conclusions
- HLRP enhances the outer edges and characteristics of underwater images and restores the real color of nature.
- HLRP is helpful for acquiring smoothness in both spatial and linear illumination environments.
- We used CNN for the classification of underwater images.
- We classified the images as real and enhanced and tested another set of images to their corresponding labels.
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Schettini, R.; Corchs, S. Underwater Image Processing: State of the Art of Restoration and Image Enhancement Methods. Eurasip J. Adv. Signal Process. 2010, 2010, 746052. [Google Scholar] [CrossRef]
- Zhuang, P.; Wu, J.; Porikli, F.; Li, C. Underwater Image Enhancement With Hyper-Laplacian Reflectance Priors. IEEE Trans. Image Process. 2022, 31, 5442–5455. [Google Scholar] [CrossRef] [PubMed]
- Ancuti, C.; Ancuti, C.O.; Haber, T.; Bekaert, P. Enhancing underwater images and videos by fusion. In Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, Providence, RI, USA, 16–21 June 2012. [Google Scholar] [CrossRef]
- Mohan, S.; Simon, P. Underwater Image Enhancement based on Histogram Manipulation and Multiscale Fusion. Procedia Comput. Sci. 2020, 171, 941–950. [Google Scholar] [CrossRef]
- Anwar, S.; Li, C. Diving deeper into underwater image enhancement: A survey. Signal Process. Image Commun. 2020, 89, 115978. [Google Scholar] [CrossRef]
- Li, C.; Anwar, S.; Porikli, F. Underwater scene prior inspired deep underwater image and video enhancement. Pattern Recognit. 2020, 98, 107038. [Google Scholar] [CrossRef]
- LeCun, Y.; Bengio, Y.; Hinton, G. Deep learning. Nature 2015, 521, 436–444. [Google Scholar] [CrossRef] [PubMed]
- Zhang, K.; Zuo, W.; Chen, Y.; Meng, D.; Zhang, L. Beyond a Gaussian Denoiser: Residual Learning of Deep CNN for Image Denoising. IEEE Trans. Image Process. 2017, 26, 3142–3155. [Google Scholar] [CrossRef] [PubMed]
- Dong, C.; Loy, C.C.; He, K.; Tang, X. Image Super-Resolution Using Deep Convolutional Networks. IEEE Trans. Pattern Anal. Mach. Intell. 2016, 38, 295–307. [Google Scholar] [CrossRef]
- Cai, B.; Xu, X.; Jia, K.; Qing, C.; Tao, D. DehazeNet: An End-to-End System for Single Image Haze Removal. IEEE Trans. Image Process. 2016, 25, 5187–5198. [Google Scholar] [CrossRef]
- Lee, H.S.; Moon, S.W.; Eom, I.K. Underwater Image Enhancement Using Successive Color Correction and Superpixel Dark Channel Prior. Symmetry 2020, 12, 1220. [Google Scholar] [CrossRef]
- Li, J.; Skinner, K.A.; Eustice, R.M.; Johnson-Roberson, M. WaterGAN: Unsupervised Generative Network to Enable Real-time Color Correction of Monocular Underwater Images. IEEE Robot. Autom. Lett. 2017, 3, 387–394. [Google Scholar] [CrossRef]
- Goodfellow, I.; Pouget-Abadie, J.; Mirza, M.; Xu, B.; Warde-Farley, D.; Ozair, S.; Courville, A.; Bengio, Y. Generative adversarial networks. Commun. ACM 2020, 63, 139–144. [Google Scholar] [CrossRef]
- Krizhevsky, A.; Sutskever, I.; Hinton, G.E. ImageNet classification with deep convolutional neural networks. Commun. ACM 2017, 60, 84–90. [Google Scholar] [CrossRef]
- Girshick, R. Fast R-CNN. arXiv 2015, arXiv:1504.08083. [Google Scholar] [CrossRef]
- He, K.; Zhang, X.; Ren, S.; Sun, J. Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition. arXiv 2014, arXiv:1406.4729. [Google Scholar] [CrossRef]
- Zhang, W.; Jin, S.; Zhuang, P.; Liang, Z.; Li, C. Underwater Image Enhancement via Piecewise Color Correction and Dual Prior Optimized Contrast Enhancement. IEEE Signal Process. Lett. 2023, 30, 229–233. [Google Scholar] [CrossRef]
- Jia, F.; Zhang, H.; Deng, H.; Li, W.; Zuo, W. Image denoising using hyper-Laplacian priors and gradient histogram preservation model. In Proceedings of the 2014 12th International Conference on Signal Processing (ICSP), Hangzhou, China, 19–23 October 2014. [Google Scholar] [CrossRef]
- Fergus, R.; Singh, B.; Hertzmann, A.; Roweis, S.T.; Freeman, W.T. Removing camera shake from a single photograph. In Proceedings of the ACM SIGGRAPH 2006, Boston, MA, USA, 30 June–3 August 2006. [Google Scholar] [CrossRef]
- Cho, T.S.; Zitnick, C.L.; Joshi, N.; Kang, S.B.; Szeliski, R.; Freeman, W.T. Image Restoration by Matching Gradient Distributions. IEEE Trans. Pattern Anal. Mach. Intell. 2012, 34, 683–694. [Google Scholar] [CrossRef]
- Anwar, S.; Li, C.; Porikli, F. Deep Underwater Image Enhancement. arXiv 2018, arXiv:1807.03528. [Google Scholar] [CrossRef]




| Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. | 
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Minhas, T.; Hassan, F.; Irshad, R. Underwater Image Enhancement Using Hyper-Laplacian Reflectance Priors and CNN-Based Classification. Eng. Proc. 2023, 46, 14. https://doi.org/10.3390/engproc2023046014
Minhas T, Hassan F, Irshad R. Underwater Image Enhancement Using Hyper-Laplacian Reflectance Priors and CNN-Based Classification. Engineering Proceedings. 2023; 46(1):14. https://doi.org/10.3390/engproc2023046014
Chicago/Turabian StyleMinhas, Talha, Faizan Hassan, and Rubia Irshad. 2023. "Underwater Image Enhancement Using Hyper-Laplacian Reflectance Priors and CNN-Based Classification" Engineering Proceedings 46, no. 1: 14. https://doi.org/10.3390/engproc2023046014
APA StyleMinhas, T., Hassan, F., & Irshad, R. (2023). Underwater Image Enhancement Using Hyper-Laplacian Reflectance Priors and CNN-Based Classification. Engineering Proceedings, 46(1), 14. https://doi.org/10.3390/engproc2023046014
 
         
                                                

