DualOadamNet: Dual-Branch Lightweight Network for Underwater Image Processing with Optical-Aware Detail Augmentation
Abstract
1. Introduction
- 1.
- A dual-branch optical sensing network architecture is proposed, with the core being the serial design of the OADAM module, which realizes precise staged repair of multiple types of degradation and ensures independent optimization of color and details through dual-branch decoupling.
- 2.
- A series of specialized function modules are designed, including a Multi-scale Denoising Module and Softplus Adaptive Color Enhancement Module, to solve core defects of existing methods in denoising and color correction.
- 3.
- The image fusion and feature transfer mechanisms are optimized, and the optimized contrast constraint adaptive fusion strategy is used to improve the integrity and naturalness of image enhancement.
- 4.
- The real-time architecture of the model is optimized while maintaining good image optimization quality and quantitative performance, ensuring lightweight real-time processing on 720p and 1080p images.
2. Related Work
2.1. Traditional Non-Deep Learning Methods
2.2. Deep-Learning Methods
- Physical-model-guided enhancement: integrates underwater light attenuation and scattering models to improve repair accuracy, but complex physical modeling increases computational overhead, limiting real-time deployment.
- Data-driven multi-dimensional collaborative optimization: e.g., ASANnet uses fusion channel attention and adaptive normalization modules to optimize color distribution; and DICAM enhances detail extraction through deep Inception structures [15,16]. However, these methods often rely on the parallel processing of multiple degradation dimensions (noise, color, and detail), which can lead to an imbalance in the enhancement effect. Existing detail augmentation modules also rely on complex attention mechanisms or dense connections, improving performance but significantly increasing parameter count and inference delay, which conflicts with real-time deployment requirements.
2.3. Lightweight and Real-Time Optimization
2.4. Feature Fusion and Normalization
2.5. Limitations and Proposed Solution
- 1.
- Lack of multi-branch function or unclear task division between branches, making it difficult to decouple color correction and detail repair, resulting in task conflict.
- 2.
- Optical distortion and detail augmentation are mostly processed in parallel, lacking a natural progressive repair mechanism.
- 3.
- Difficulty in balancing lightweight design and performance: complex architectures improve performance but sacrifice real-time capability, while overly simplified models cannot handle complex degradations.
3. Methods
3.1. Problem Definition
3.2. Overall Model Architecture
- 1.
- Feature Preprocessing: the input image is split by PixelUnshuffle to reduce the amount of calculation while preserving the local feature structure, which lays the foundation for decoupling the “global color” and “local detail” features of the two branches;
- 2.
- Double-Branch Feature Extraction: the global color branch extracts the optical features (brightness/color) of the channel dimension through convolution, and the local branch extracts the detailed features (edge/texture) of the spatial dimension through convolution, so as to realize the decoupling of multi-branch tasks;
- 3.
- Optical-Aware Detail Augmentation Module, OADAM: perform “optical restoration → multi-scale denoising → image restoration → color enhancement” series enhancement on the global color branch features, solve the single degradation problem in stages, and avoid the conflict of image optimization tasks;
- 4.
- Two-Branch Element-Wise Feature Fusion: the enhanced global color channel features and local detail features are fused element by element, and the fused features are restored to the original image size by PixelShuffle;
- 5.
- Global–Local Contrast Constraining Module: the global contrast weight is generated through the global contrast learning module GLC to adaptively enhance the fused features; then, the feature adaptive normalization module FAN performs the final local normalization, constrains the range of pixel values, and outputs the final enhanced image.
3.3. Core Module Design
3.3.1. Image Preprocessing
3.3.2. Channel-Branch Feature Extraction Model
3.3.3. Optical-Aware Detail Augmentation Module (OADAM)
- 1.
- Optical Recovery LayerThe core objective of this layer is to solve the optical distortion caused by the scattering and absorption of underwater light. The module structure is shown in Figure 3.
- 1.
- Multi-Scale Denoising LayerThe core goal of this layer is to remove the underwater noise pollution and solve the contradiction with the blurred edge details. The module structure is shown in Figure 4.
- 1.
- Image Detail Restoration LayerThe core objective of this layer is to solve the blurred details left after noise removal. The module structure is shown in Figure 5.
- 1.
- Color Enhancement LayerThe core goal of this layer is to correct the color offset and solve the unnatural problem of color transition. The module structure is shown in Figure 6.
3.3.4. Two-Branch Element-Wise Feature Fusion Model
3.3.5. Global–Local Contrast Constraining Module
3.4. Loss Function Design
4. Experiment
4.1. Experimental Setup and Dataset
- 1.
- UIEB dataset [19]: UIEB contains 950 real underwater images, 890 of which provide clear visual reference images manually screened, and the remaining 60 are used as challenging test samples. These images cover diverse underwater environments such as oceans, lakes and artificial pools, and show rich target categories such as coral reefs, fish, underwater buildings and aquatic plants.
- 2.
- EUVP dataset [13]: A multi-functional underwater image dataset, which contains about 11,000 composite subset images, converts clear underwater images to underwater degraded versions through physical models, and provides accurate pixel-level-matching GT, with about 1100 real-world collected images. The dataset design is based on underwater optical physical models, and can simulate degradation under different “water types” (clear water, turbid water, and extremely turbid water), shooting distance and lighting conditions.
- 3.
- LSUI dataset [20]: Low-light underwater dataset. LSUI contains 5004 pairs of images (original low-light image + clear reference image), and is the largest of the three datasets. All images are collected under real low-ight conditions, with special attention to weak-light environments such as deep-sea, night and muddy waters.
- GC [9]: A mathematical technique for adjusting image brightness and contrast through nonlinear pixel value transformation. The basic module commonly used in the current underwater image enhancement model can solve the problems of dark detail loss and partial contrast imbalance in underwater images.
- LU2net [21]: A new U-shaped network specially designed for real-time underwater image enhancement. The proposed model combines the axial depth convolution and channel attention module, which can significantly reduce the computational requirements and model parameters, so as to improve the processing speed.
- LiteEnhanceNet [17]: A model for single underwater image enhancement. The network uses deep separable convolution as the main building block to reduce the computational complexity. Single aggregation joining is used to effectively extract the features of the lower and middle layers. In addition, the appropriate activation function and extrusion excitation module are integrated in the appropriate position in the network to reduce the computational complexity.
- FunIE-GAN [13]: An underwater image enhancement technology based on visual perception fusion. The method is divided into three stages: color correction, contrast enhancement and multi-task fusion. In color correction, the relationship between statistical properties and the analyzed color channels is combined to construct an adaptive compensation method to achieve color correction. The advanced multi-scale decomposition method is used to enhance the gray information of the l-channel.
- Shallow-Uwnet [14]: A hybrid architecture model using a full convolution network + dense links + residual mechanism can optimize the efficiency of feature utilization while avoiding redundant computing. In addition, the lightweight design of the mechanism of the model greatly reduces the computational parameters.
- DNnet [7]: A lightweight neural network for real-time enhancement of high-resolution underwater images, which is based on pixel rearrangement, FAN normalization and CDR dynamic coordination to improve the efficiency of image optimization, suppress excessive enhancement and balance the brightness distribution of images, so as to achieve the balance between high-resolution underwater images and real-time enhancement.
4.2. Experimental Evaluation
4.2.1. Comparison of Evaluation Indicators
- 1.
- MSE: The classic full reference index to measure the pixel-level difference of an image is used to quantify the global distortion by calculating the mean square difference of the gray value of the corresponding pixels between the original image and the enhanced image. This index is efficient and differentiable, and is one of the basic optimization objectives of the deep learning model.
- 2.
- PSNR: Based on the full reference quality index derived from MSE, the image fidelity is evaluated by quantifying the ratio of signal peak power to noise power. This index is regarded as the core objective evaluation standard in any resolution image compression and enhancement task, especially suitable for measuring the global brightness and color distortion.
- 3.
- SSIM: Based on the full reference index of the characteristics of the human visual system (HVS), the image similarity is quantified from the three dimensions of brightness, contrast and structure to fit human subjective perception. In recent years, it has been widely used in the field of underwater image enhancement, which is very suitable for evaluating the structure detail retention ability of the enhanced image. Its basic form is the most commonly used standard evaluation index in the paper.
- 4.
- UIQM: Aiming at the no-reference quality index of an underwater image, without the original clear image as a reference, the typical problems of low contrast, blue-green color deviation and blurred details of an underwater image are fully adapted by fusing the three sub-indexes of color richness, sharpness and contrast. Its weight coefficient has been verified by a large number of underwater datasets, which can effectively distinguish the color balance and detail recognizability of enhanced images, and is the core evaluation index in the field of underwater image enhancement in recent years.
4.2.2. Visual Quality Comparison
4.3. Ablation Experiment
4.3.1. Channel-Branch Feature Extraction Model Ablation Experiment
4.3.2. OADAM Model Ablation Experiment
4.3.3. Baseline
4.4. Analysis of Experimental Results
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Xia, H.; Bao, B.; Liao, F.; Chen, J.; Wang, B.; Li, Z. A patch-based method for underwater image enhancement with denoising diffusion models. IEEE Trans. Cybern. 2024, 55, 269–281. [Google Scholar] [CrossRef] [PubMed]
- Shi, X.; Wang, Y.G. CPDM: Content-preserving diffusion model for underwater image enhancement. Sci. Rep. 2024, 14, 31309. [Google Scholar] [CrossRef] [PubMed]
- Khan, R.; Kulkarni, A.; Phutke, S.S.; Murala, S. Underwater image enhancement with phase transfer and attention. In Proceedings of the 2023 International Joint Conference on Neural Networks (IJCNN); IEEE: Piscataway, NJ, USA, 2023; pp. 1–8. [Google Scholar]
- Zhou, J.; Gai, Q.; Zhang, D.; Lam, K.-M.; Zhang, W.; Fu, X. IACC: Cross-illumination awareness and color correction for underwater images under mixed natural and artificial lighting. IEEE Trans. Geosci. Remote Sens. 2024, 62, 1–15. [Google Scholar] [CrossRef]
- Zhang, D.; Zhou, J.; Guo, C.; Zhang, W.; Li, C. Synergistic multiscale detail refinement via intrinsic supervision for underwater image enhancement. Proc. AAAI Conf. Artif. Intell. 2024, 38, 7033–7041. [Google Scholar] [CrossRef]
- Zhou, J.; Liu, Q.; Jiang, Q.; Ren, W.; Lam, K.-M.; Zhang, W. Underwater camera: Improving visual perception via adaptive dark pixel prior and color correction. Int. J. Comput. Vis. 2023, 133, 8215–8233. [Google Scholar] [CrossRef]
- Cao, T.; Yu, Z.; Zheng, B. DNnet: A lightweight network for real-time 4K underwater image enhancement using dynamic range and average normalization. Expert Syst. Appl. 2025, 270, 126561. [Google Scholar] [CrossRef]
- Cong, X.; Zhao, Y.; Gui, J.; Hou, J.; Tao, D. A comprehensive survey on underwater image enhancement based on deep learning. arXiv 2024, arXiv:2405.19684. [Google Scholar] [CrossRef]
- Lai, Y.L.; Ang, T.F.; Bhatti, U.A.; Ku, C.S.; Han, Q.; Por, L.Y. Color correction methods for underwater image enhancement: A systematic literature review. PLoS ONE 2025, 20, e0317306. [Google Scholar] [CrossRef] [PubMed]
- Dhanya, P.R.; Anilkumar, S.; Balakrishnan, A.A.; Supriya, M.H. L-CLAHE intensification filter (L-CIF) algorithm for underwater image enhancement and colour restoration. In Proceedings of the 2019 International Symposium on Ocean Technology (SYMPOL); IEEE: Piscataway, NJ, USA, 2019; pp. 117–128. [Google Scholar]
- Kang, Y.; Jiang, Q.; Li, C.; Ren, W.; Liu, H.; Wang, P. A perception-aware decomposition and fusion framework for underwater image enhancement. IEEE Trans. Circuits Syst. Video Technol. 2022, 33, 988–1002. [Google Scholar] [CrossRef]
- Hou, G.; Zhao, X.; Pan, Z.; Yang, H.; Tan, L.; Li, J. Benchmarking underwater image enhancement and restoration, and beyond. IEEE Access 2020, 8, 122078–122091. [Google Scholar] [CrossRef]
- Islam, M.J.; Xia, Y.; Sattar, J. Fast underwater image enhancement for improved visual perception. IEEE Robot. Autom. Lett. 2020, 5, 3227–3234. [Google Scholar] [CrossRef]
- Naik, A.; Swarnakar, A.; Mittal, K. Shallow-UWnet: Compressed model for underwater image enhancement. Proc. AAAI Conf. Artif. Intell. 2021, 35, 15853–15854. [Google Scholar] [CrossRef]
- Park, C.W.; Eom, I.K.; Kim, J.H.; Lee, S.Y.; Park, J.W.; Choi, Y.S.; Kim, H.J.; Lee, J.H.; Oh, S.H.; Kim, Y.J.; et al. Underwater image enhancement using adaptive standardization and normalization networks. In Engineering Applications of Artificial Intelligence; IEEE: Piscataway, NJ, USA, 2024; pp. 1–6. [Google Scholar]
- Tolie, H.F.; Ren, J.; Elyan, E. DICAM: Deep inception and channel-wise attention modules for underwater image enhancement. Neurocomputing 2024, 584, 127585. [Google Scholar] [CrossRef]
- Zhang, S.; Zhao, S.; An, D.; Li, D.; Zhao, R. LiteEnhanceNet: A lightweight network for real-time single underwater image enhancement. Expert Syst. Appl. 2024, 240, 122546. [Google Scholar] [CrossRef]
- Jiang, J.; Ye, T.; Bai, J.; Chen, S.; Chai, W.; Jun, S.; Liu, Y.; Chen, E. FA+Net: You only need 9K parameters for underwater image enhancement. arXiv 2023, arXiv:2305.08824. [Google Scholar]
- Li, C.; Guo, C.; Ren, W.; Cong, R.; Hou, J.; Kwong, S.; Tao, D. An underwater image enhancement benchmark dataset and beyond. IEEE Trans. Image Process. 2019, 29, 4376–4389. [Google Scholar] [CrossRef] [PubMed]
- Peng, L.; Zhu, C.; Bian, L. U-shape Transformer for underwater image enhancement. IEEE Trans. Image Process. 2023, 32, 3066–3079. [Google Scholar] [CrossRef] [PubMed]
- Yang, H.; Xu, J.; Lin, Z.; He, J. LU2Net: A lightweight network for real-time underwater image enhancement. arXiv 2024, arXiv:2406.14973. [Google Scholar]












| Methods | MSE (×10−3) ↓ | PSNR ↑ | SSIM ↑ | UIQM ↑ |
|---|---|---|---|---|
| GC | 2.3990 | 15.99 | 0.776 | 2.68 |
| LU2net | 0.4410 | 22.99 | 0.889 | 3.11 |
| LiteEnhanceNet | 0.5523 | 22.32 | 0.904 | 3.18 |
| FunIE-GAN | 0.8109 | 19.84 | 0.820 | 3.34 |
| Shallow-UWnet | 1.2561 | 18.31 | 0.778 | 2.91 |
| DNnet | 0.5460 | 23.01 | 0.910 | 2.97 |
| DualOadamNet (m = 1) | 0.4586 | 23.16 | 0.905 | 3.16 |
| DualOadamNet (m = 2) | 0.4418 | 22.26 | 0.910 | 3.26 |
| DualOadamNet (m = 3) | 0.4071 | 22.58 | 0.913 | 3.31 |
| Methods | MSE (×10−3) ↓ | PSNR ↑ | SSIM ↑ | UIQM ↑ |
|---|---|---|---|---|
| GC | 3.309 | 14.51 | 0.584 | 2.619 |
| LU2net | 0.1720 | 26.60 | 0.904 | 2.920 |
| LiteEnhanceNet | 0.2091 | 25.56 | 0.849 | 2.88 |
| FunIE-GAN | 0.2915 | 24.41 | 0.801 | 2.94 |
| Shallow-UWnet | 0.2607 | 24.54 | 0.829 | 2.84 |
| DNnet | 0.5910 | 21.37 | 0.831 | 2.81 |
| DualOadamNet (m = 1) | 0.2831 | 24.58 | 0.847 | 3.06 |
| DualOadamNet (m = 2) | 0.2166 | 25.35 | 0.907 | 3.23 |
| DualOadamNet (m = 3) | 0.1915 | 25.89 | 0.917 | 3.23 |
| Methods | MSE (×10−3) ↓ | PSNR ↑ | SSIM ↑ | UIQM ↑ |
|---|---|---|---|---|
| GC | 1.691 | 16.55 | 0.794 | 2.954 |
| LU2net | 0.2340 | 25.66 | 0.868 | 3.014 |
| LiteEnhanceNet | 0.6821 | 20.46 | 0.811 | 2.88 |
| FunIE-GAN | 0.3910 | 23.57 | 0.823 | 3.00 |
| Shallow-UWnet | 0.9653 | 19.36 | 0.750 | 2.84 |
| DNnet | 0.5280 | 22.25 | 0.867 | 2.89 |
| DualOadamNet (m = 1) | 0.3273 | 24.02 | 0.887 | 3.09 |
| DualOadamNet (m = 2) | 0.2651 | 24.58 | 0.924 | 3.26 |
| DualOadamNet (m = 3) | 0.2505 | 24.84 | 0.927 | 3.26 |
| Methods (Dataset) | MSE (×103) ↓ | PSNR ↑ | SSIM ↑ | UIQM ↑ |
|---|---|---|---|---|
| DualOadamNet (UIEB) | 0.4586 | 23.16 | 0.905 | 3.16 |
| GlobalOADAMNet (UIEB) | 0.4903 | 23.08 | 0.897 | 3.109 |
| DualOadamNet (EUVP) | 0.2831 | 24.58 | 0.847 | 3.06 |
| GlobalOADAMNet (EUVP) | 0.3334 | 23.68 | 0.842 | 2.97 |
| DualOadamNet (LSUI) | 0.3273 | 24.02 | 0.887 | 3.09 |
| GlobalOADAMNet (LSUI) | 0.3698 | 23.61 | 0.876 | 2.99 |
| Methods (Dataset) | MSE (×103) ↓ | PSNR ↑ | SSIM ↑ | UIQM ↑ |
|---|---|---|---|---|
| DualOadamNet (UIEB) | 0.4586 | 23.16 | 0.905 | 3.16 |
| DualNet (UIEB) | 0.4928 | 22.85 | 0.901 | 3.11 |
| DualOadamNet (EUVP) | 0.2831 | 24.58 | 0.847 | 3.06 |
| DualNet (EUVP) | 0.3086 | 24.19 | 0.841 | 3.01 |
| DualOadamNet (LSUI) | 0.3273 | 24.02 | 0.887 | 3.09 |
| DualNet (LSUI) | 0.3547 | 23.54 | 0.885 | 3.07 |
| Methods (Dataset) | MSE (×103) ↓ | PSNR ↑ | SSIM ↑ | UIQM ↑ |
|---|---|---|---|---|
| DualOadamNet (UIEB) | 0.4586 | 23.16 | 0.905 | 3.16 |
| Baseline (UIEB) | 1101.6 | 18.77 | 0.834 | 3.08 |
| DualOadamNet (EUVP) | 0.2831 | 24.58 | 0.847 | 3.06 |
| Baseline (EUVP) | 0.4768 | 23.59 | 0.862 | 3.02 |
| DualOadamNet (LSUI) | 0.3273 | 24.02 | 0.887 | 3.09 |
| Baseline (LSUI) | 0.4662 | 22.82 | 0.878 | 3.01 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Zhan, S.; Yu, J.; Li, D. DualOadamNet: Dual-Branch Lightweight Network for Underwater Image Processing with Optical-Aware Detail Augmentation. AI 2026, 7, 88. https://doi.org/10.3390/ai7030088
Zhan S, Yu J, Li D. DualOadamNet: Dual-Branch Lightweight Network for Underwater Image Processing with Optical-Aware Detail Augmentation. AI. 2026; 7(3):88. https://doi.org/10.3390/ai7030088
Chicago/Turabian StyleZhan, Siyang, Jianyong Yu, and Dong Li. 2026. "DualOadamNet: Dual-Branch Lightweight Network for Underwater Image Processing with Optical-Aware Detail Augmentation" AI 7, no. 3: 88. https://doi.org/10.3390/ai7030088
APA StyleZhan, S., Yu, J., & Li, D. (2026). DualOadamNet: Dual-Branch Lightweight Network for Underwater Image Processing with Optical-Aware Detail Augmentation. AI, 7(3), 88. https://doi.org/10.3390/ai7030088

