Figure 1.
Demonstration of the learning-based image processing operation. The convolutional kernel traverses the entire spatial extent of the feature map, enforcing a rigid, locality-inductive inductive bias, wherein each pixel is constrained to interact exclusively with its nearest neighbors. In self-attention frameworks, pixels are uniformly partitioned into isotropic attention grids, forcing every query to aggregate information from a rigidly predefined neighborhood of constant cardinality, thereby endowing the model with an immutable, size-invariant receptive field. As for our graph-aware model, every pixel is instantiated as a node whose receptive field is dynamically calibrated, and severely degraded regions are automatically allocated a higher node budget, whereas mildly corrupted areas receive a sparser allocation, thereby enabling parsimonious yet effective underwater image enhancement.
Figure 1.
Demonstration of the learning-based image processing operation. The convolutional kernel traverses the entire spatial extent of the feature map, enforcing a rigid, locality-inductive inductive bias, wherein each pixel is constrained to interact exclusively with its nearest neighbors. In self-attention frameworks, pixels are uniformly partitioned into isotropic attention grids, forcing every query to aggregate information from a rigidly predefined neighborhood of constant cardinality, thereby endowing the model with an immutable, size-invariant receptive field. As for our graph-aware model, every pixel is instantiated as a node whose receptive field is dynamically calibrated, and severely degraded regions are automatically allocated a higher node budget, whereas mildly corrupted areas receive a sparser allocation, thereby enabling parsimonious yet effective underwater image enhancement.
Figure 2.
Overview of the proposed GA-UIE, which consists of three stages: graph feature generation stage, graph-aware enhancement stage, graph-aware fusion stage.
Figure 2.
Overview of the proposed GA-UIE, which consists of three stages: graph feature generation stage, graph-aware enhancement stage, graph-aware fusion stage.
Figure 3.
Demonstration of graph-aware color correction module and graph-aware texture enhancement module. The Conv2D_3*3 and Conv2D_1*1 represent the and convolutions, respectively.
Figure 3.
Demonstration of graph-aware color correction module and graph-aware texture enhancement module. The Conv2D_3*3 and Conv2D_1*1 represent the and convolutions, respectively.
Figure 4.
Indication of the proposed graph-aware fusion module.
Figure 4.
Indication of the proposed graph-aware fusion module.
Figure 5.
Visual comparison of traditional methods on UIEB-70 test set. An enlarged view of the key region (yellow box) is provided; the same in the following figures.
Figure 5.
Visual comparison of traditional methods on UIEB-70 test set. An enlarged view of the key region (yellow box) is provided; the same in the following figures.
Figure 6.
Visual comparison of learning-based approaches on UIEB-70 test set.
Figure 6.
Visual comparison of learning-based approaches on UIEB-70 test set.
Figure 7.
Visual comparison of traditional techniques on U45 test set.
Figure 7.
Visual comparison of traditional techniques on U45 test set.
Figure 8.
Visual comparison of learning-based approaches on U45 test set.
Figure 8.
Visual comparison of learning-based approaches on U45 test set.
Figure 9.
Visual comparison of the component ablation study on UIEB-80 test set. w/o denotes without.
Figure 9.
Visual comparison of the component ablation study on UIEB-80 test set. w/o denotes without.
Figure 10.
Visual comparison of the graph-prior ablation study on UIEB-80 test set.
Figure 10.
Visual comparison of the graph-prior ablation study on UIEB-80 test set.
Figure 11.
Visual comparison of the loss function ablation study on UIEB-80 test set. w/o denotes without.
Figure 11.
Visual comparison of the loss function ablation study on UIEB-80 test set. w/o denotes without.
Figure 12.
Failure cases and mitigation on UIEB-80 test set.
Figure 12.
Failure cases and mitigation on UIEB-80 test set.
Table 1.
Various approaches are evaluated at UIEB-80 test set using performance metrics including PSNR (±std), SSIM, UCIQE, UIQM, and LPIPS (↑: higher is better; ↓: lower is better. This convention applies to the following tables).
Table 1.
Various approaches are evaluated at UIEB-80 test set using performance metrics including PSNR (±std), SSIM, UCIQE, UIQM, and LPIPS (↑: higher is better; ↓: lower is better. This convention applies to the following tables).
Method | Metrics |
---|
PSNR ↑ | SSIM ↑ | UCIQE ↑ | UIQM ↑ | LPIPS ↓ |
CBF | 23.42 (±4.69) | 0.9192 | 0.5891 | 1.202 | 0.1532 |
ERH | 22.53 (±4.13) | 0.9321 | 0.6023 | 1.189 | 0.1825 |
IBLA | 21.05 (±6.06) | 0.7832 | 0.5618 | 1.125 | 0.2994 |
MMLE | 20.15 (±4.08) | 0.8176 | 0.5782 | 1.353 | 0.2706 |
Water-Net | 24.79 (±5.03) | 0.9275 | 0.5679 | 1.136 | 0.1435 |
Ucolor | 24.23 (±4.94) | 0.9254 | 0.5593 | 1.142 | 0.1263 |
SGUIE | 24.94 (±4.27) | 0.9348 | 0.5352 | 1.148 | 0.1137 |
PUGAN | 24.62 (±5.10) | 0.9274 | 0.5710 | 1.158 | 0.1249 |
GA-UIE | 25.12(±4.36) | 0.9391 | 0.5732 | 1.171 | 0.1099 |
Table 2.
Evaluations of various techniques at U45 test set are performed using the UCIQE, UIQM, and parameter metrics.
Table 2.
Evaluations of various techniques at U45 test set are performed using the UCIQE, UIQM, and parameter metrics.
Method | Metrics |
---|
UCIQE ↑ | UIQM ↑ | Parameters ↓ |
---|
CBF | 0.5841 | 1.192 | - |
ERH | 0.5813 | 1.180 | - |
IBLA | 0.5630 | 1.125 | - |
MMLE | 0.5797 | 1.198 | - |
Water-Net | 0.5679 | 1.162 | 1.10 M |
Ucolor | 0.5714 | 1.155 | 145.65 M |
SGUIE | 0.5733 | 1.179 | 45.69 M |
PUGAN | 0.5702 | 1.171 | 160.36 M |
GA-UIE | 0.5762 | 1.184 | 10.33 M |
Table 3.
Evaluations of component ablation study on UIEB-80 test set indicated by the PSNR, SSIM, UCIQE, UIQM, and LPIPS metrics. w/o denotes without.
Table 3.
Evaluations of component ablation study on UIEB-80 test set indicated by the PSNR, SSIM, UCIQE, UIQM, and LPIPS metrics. w/o denotes without.
Method | Metrics |
---|
PSNR ↑ | SSIM ↑ | UCIQE ↑ | UIQM ↑ | LPIPS ↓ |
---|
w/o Color | 23.87 | 0.9251 | 0.5521 | 1.097 | 0.1315 |
w/o Texture | 24.63 | 0.9177 | 0.5663 | 1.119 | 0.1207 |
w/o Fusion | 24.99 | 0.9306 | 0.5699 | 1.135 | 0.1134 |
GA-UIE | 25.12 | 0.9391 | 0.5732 | 1.171 | 0.1099 |
Table 4.
Evaluations of component ablation study on UIEB-80 test set indicated by the parameters, FLOPs, running time, and memory. w/o denotes without.
Table 4.
Evaluations of component ablation study on UIEB-80 test set indicated by the parameters, FLOPs, running time, and memory. w/o denotes without.
Method | Metrics |
---|
Parameters ↓ | FLOPs ↓ | Running Time ↓ | Memory ↓ |
---|
w/o Color | 9.57 M | 30.64 G | 8.2 ms | 3211 M |
w/o Texture | 9.83 M | 32.87 G | 8.9 ms | 3798 M |
w/o Fusion | 10.12 M | 33.15 G | 9.7 ms | 4233 M |
GA-UIE | 10.33 M | 33.80 G | 10.2 ms | 4786 M |
Table 5.
Evaluations of graph-prior ablation study on the UIEB-80 test set indicated by the PSNR, SSIM, UCIQE, UIQM, and LPIPS metrics.
Table 5.
Evaluations of graph-prior ablation study on the UIEB-80 test set indicated by the PSNR, SSIM, UCIQE, UIQM, and LPIPS metrics.
Method | Metrics |
---|
PSNR ↑ | SSIM ↑ | UCIQE ↑ | UIQM ↑ | LPIPS ↓ |
---|
KNN-Based Graph Prior | 24.88 | 0.9311 | 0.5609 | 1.120 | 0.1171 |
GA-UIE | 25.12 | 0.9391 | 0.5732 | 1.171 | 0.1099 |
Table 6.
Evaluations of loss function ablation study on UIEB-80 test set indicated by the PSNR, SSIM, UCIQE, UIQM, and LPIPS metrics. w/o denotes without.
Table 6.
Evaluations of loss function ablation study on UIEB-80 test set indicated by the PSNR, SSIM, UCIQE, UIQM, and LPIPS metrics. w/o denotes without.
Method | Metrics |
---|
PSNR ↑ | SSIM ↑ | UCIQE ↑ | UIQM ↑ | LPIPS ↓ |
---|
w/o VGG Loss | 23.99 | 0.9177 | 0.5540 | 1.110 | 0.1198 |
GA-UIE | 25.12 | 0.9391 | 0.5732 | 1.171 | 0.1099 |
Table 7.
Evaluations of complexity ablation study on UIEB-80 test set indicated by the running time, memory, PSNR, SSIM, and UCIQE metrics.
Table 7.
Evaluations of complexity ablation study on UIEB-80 test set indicated by the running time, memory, PSNR, SSIM, and UCIQE metrics.
Method | Metrics |
---|
Running Time (ms) ↓ | Memory (MB) ↓ | PSNR ↑ | SSIM ↑ | UCIQE ↑ |
---|
Patchsize () | 6.8 ms | 1280 M | 24.35 | 0.9133 | 0.5512 |
Patchsize () | 9.1 ms | 2680 M | 24.78 | 0.9219 | 0.5575 |
| 8.5 ms | 3850 M | 24.60 | 0.9182 | 0.5557 |
| 11.4 ms | 4320 M | 24.95 | 0.9255 | 0.5604 |
Degree budget (1/16 pixels) | 9.8 ms | 2100 M | 24.50 | 0.9150 | 0.5532 |
Degree budget (1/4 pixels) | 10.6 ms | 3550 M | 24.85 | 0.9231 | 0.5580 |
GA-UIE | 10.2 ms | 4786 M | 25.12 | 0.9391 | 0.5732 |
Table 8.
Comparative test of underwater grasping success rates: Task 1 is conducted in a tank, whereas Task 2 is performed in a real-world underwater environment.
Table 8.
Comparative test of underwater grasping success rates: Task 1 is conducted in a tank, whereas Task 2 is performed in a real-world underwater environment.
Method | Metrics |
---|
Task 1 ↑ | Task 2 ↑ |
---|
Degraded image | 0.52 | 0.34 |
Image enhanced by GA-UIE | 0.71 | 0.63 |