RetinalCoNet: Underwater Fish Segmentation Network Based on Bionic Retina Dual-Channel and Multi-Module Cooperation
Abstract
1. Introduction
- (1)
- The underwater lighting conditions are complex and changeable, and different depths and shooting times lead to great differences in light intensity and spectral distribution. To address critical challenges in underwater imagery—including unstable contrast, reduced definition, blurred boundaries, and noise-obscured semi-transparent tissues—this paper proposes a novel bionic retina dual-channel computational module. By simulating biological vision to separate light and dark signals, it can adapt to the changes in underwater lighting and enhance the perception of subtle features of fish.
- (2)
- To mitigate edge blurring induced by light attenuation and particle scattering while resolving occlusion-induced contour fragmentation, we implement a prompt learning module. This approach amplifies feature responses at indistinct boundaries and occluded regions while adaptively guiding model attention toward discriminative features, thereby enhancing target saliency perception and improving fish segmentation integrity in complex underwater environments.
- (3)
- The underwater optical characteristics lead to the attenuation and scattering of light, which weakens the gradient information of the fish boundary, and the low contrast between target and background further aggravates the difficulty of boundary discrimination. In order to meet this challenge, this paper integrates the concept of edge prior guidance and designs an edge enhancement module to solve the problems of the blurred boundary and insufficient fusion of multi-scale features in underwater fish segmentation, and realize accurate pixel-level segmentation.
2. Related Works
2.1. Underwater Image Segmentation
2.2. Bionic Neural Network
2.3. Prior Information
3. Research Method
3.1. Methods for Incorporating Prior Information
- (1)
- Prior information adding method based on transfer learning and pre-training: refers to using the general knowledge contained in the model pre-trained on large-scale general data as prior information, and transferring this knowledge to the target task through transfer learning technology (such as feature extraction or fine-tuning). This approach circumvents full model training from scratch, substantially enhancing efficacy in data-scarce scenarios while accelerating convergence and strengthening generalization capabilities. Fundamentally, it leverages transferable feature representations acquired during pre-training as robust initialization points, enabling rapid adaptation to novel tasks.
- (2)
- The method of adding prior information based on the data level: this refers to adding prior information such as domain knowledge or expert experience to the model by transforming the training data itself, such as using knowledge to guide data enhancement, screening high-quality samples, embedding structured knowledge representation, or generating synthetic data in line with the prior, so the data more directly and explicitly reflect the known rules or relationships, thus guiding the model to learn this information more efficiently and accurately.
- (3)
- Structural integration of prior information: This approach embeds domain knowledge and inductive biases by designing inherent architectural characteristics within the network. Through such structural constraints, models are guided to autonomously learn feature representations aligned with prior assumptions. This kind of prior does not depend on the data preprocessing or training strategy, but gives the model “built-in sensitivity” to specific types of data patterns through the connection mode, parameter-sharing mechanism, or constraints of the network layers.
- (4)
- The method of adding prior information based on the loss function: This refers to embedding prior information such as domain knowledge, problem characteristics, or expected behavior into the model optimization process by designing the form, structure, or parameters of loss functions, and guiding the model to learn features or prediction results that conform to the prior law.
3.2. RetinalCoNet Model Structure
3.2.1. Interactive Encoder Block
3.2.2. Dynamic Prompt Block
3.2.3. Decoder Block
4. Collection and Construction of Dataset
5. Results and Discussion
5.1. Experimental Environment and Evaluation Metrics
5.2. Comparative Experiments with Different Models
5.3. Comparative Experiments in Different Scenes
5.4. Ablation Experiments
6. Conclusions
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Chuang, M.C.; Hwang, J.N.; Williams, K.; Towler, R. Automatic fish segmentation via double local thresholding for trawl-based underwater camera systems. In Proceedings of the 18th IEEE International Conference on Image Processing, Brussels, Belgium, 11–14 September 2011; pp. 3145–3148. [Google Scholar]
- Baloch, A.; Ali, M.; Gul, F.; Basir, S.; Afzal, I. Fish Image Segmentation Algorithm (FISA) for improving the performance of image retrieval system. Int. J. Adv. Comput. Sci. Appl. 2017, 8, 396–403. [Google Scholar] [CrossRef]
- Spampinato, C.; Giordano, D.; Di Salvo, R.; Chen-Burger, Y.H.J.; Fisher, R.B.; Nadarajan, G. Automatic fish classification for underwater species behavior understanding. In Proceedings of the First ACM International Workshop on Analysis and Retrieval of Tracked Events and Motion in Imagery Streams, New York, NY, USA, 29 October 2010; pp. 45–50. [Google Scholar]
- Liu, R.; Jiang, Z.; Yang, S.; Fan, X. Twin adversarial contrastive learning for underwater image enhancement and beyond. IEEE Trans. Image Process. 2022, 31, 4922–4936. [Google Scholar] [CrossRef]
- Kim, Y.H.; Park, K.R. PSS-net: Parallel semantic segmentation network for detecting marine animals in underwater scene. Front. Mar. Sci. 2022, 9, 1003568. [Google Scholar] [CrossRef]
- Liu, L.; Yu, W. Underwater image saliency detection via attention-based mechanism. J. Phys. Conf. Ser. 2022, 2189, 012012. [Google Scholar] [CrossRef]
- Liu, F.; Fang, M. Semantic segmentation of underwater images based on improved Deeplab. J. Mar. Sci. Eng. 2020, 8, 188. [Google Scholar] [CrossRef]
- Hambarde, P.; Murala, S.; Dhall, A. UW-GAN: Single-image depth estimation and image enhancement for underwater images. IEEE Trans. Instrum. Meas. 2021, 70, 5018412. [Google Scholar] [CrossRef]
- Dudhane, A.; Hambarde, P.; Patil, P.; Murala, S. Deep underwater image restoration and beyond. IEEE Signal Process. Lett. 2020, 27, 675–679. [Google Scholar] [CrossRef]
- Fu, Z.; Chen, R.; Huang, Y.; Cheng, E.; Ding, X.; Ma, K.-K. MASNet: A robust deep marine animal segmentation network. IEEE J. Ocean. Eng. 2023, 49, 1104–1115. [Google Scholar] [CrossRef]
- Chen, I.-H.; Belbachir, N. Using Mask R-CNN for underwater fish instance segmentation as novel objects: A proof of concept. In Proceedings of the Northern Lights Deep Learning Workshop, Tromsø, Norway, 10–12 January 2023; Volume 4. [Google Scholar]
- Yang, Y.; Li, D.; Zhao, S. A novel approach for underwater fish segmentation in complex scenes based on multi-levels triangular atrous convolution. Aquac. Int. 2024, 32, 5215–5240. [Google Scholar] [CrossRef]
- Chicchon, M.; Bedon, H.; Del-Blanco, C.R.; Sipiran, I. Semantic segmentation of fish and underwater environments using deep convolutional neural networks and learned active contours. IEEE Access 2023, 11, 33652–33665. [Google Scholar] [CrossRef]
- Shen, C.; Wu, Y.; Qian, G.; Wu, X.; Cao, H.; Wang, C.; Tang, J.; Liu, J. Intelligent bionic polarization orientation method using biological neuron model for harsh conditions. IEEE Trans. Pattern Anal. Mach. Intell. 2025, 47, 789–806. [Google Scholar] [CrossRef]
- Pu, Y.; Hang, Z.; Wang, G.; Hu, H. Bionic artificial lateral line underwater localization based on the neural network method. Appl. Sci. 2022, 12, 7241. [Google Scholar] [CrossRef]
- Li, H.; Tian, F.; Deng, S.; Wu, Z.; Zhao, L. Mammalian olfaction-inspired spike-coded bionic neural network. IEEE Trans. Instrum. Meas. 2024, 73, 6505411. [Google Scholar] [CrossRef]
- Hu, Y.; Li, Z.; Lu, Z.; Jia, X.; Wang, P.; Liu, X. Identification method of crop aphids based on bionic attention. Agronomy 2024, 14, 1093. [Google Scholar] [CrossRef]
- Gu, T.; Liu, S.; Pu, Q.; Wang, J.; Wang, B.; Hu, X.; Sun, P.; Li, Q.; Zhu, L.; Lu, G. A visual-olfactory bionic sensing system bioinspired from zebrafish for confusable liquid localization and recognition. Sens. Actuators B Chem. 2025, 441, 138053. [Google Scholar] [CrossRef]
- Liang, Z.; Lin, Z.; Li, X.; Zou, X. A bionic vision method for extracting motion information of small-target in cotton field backgrounds. In Proceedings of the International Conference on Optical and Photonic Engineering, Bellingham, WA, USA, 14–16 April 2025; Volume 13509. [Google Scholar]
- Xia, B.; Zhan, B.; Shen, M.; Yang, H. Explicit-implicit prior knowledge-based diffusion model for generative medical image segmentation. Knowl. Based Syst. 2024, 303, 112426. [Google Scholar] [CrossRef]
- Jiang, W.; Zhang, D.; Hui, G. A dual-branch fracture attribute fusion network based on prior knowledge. Eng. Appl. Artif. Intell. 2023, 127, 107383. [Google Scholar] [CrossRef]
- Zhang, Y.; Meng, N.; Zhao, M.; Zhang, T. RASpine: Regional attention lateral spinal segmentation based on anatomical prior knowledge. In Proceedings of the 46th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), Orlando, FL, USA, 15–19 July 2024; pp. 1–4. [Google Scholar]
- Ding, D.; Li, J.; Wang, H.; Wang, K.; Feng, J.; Xiao, M. ApplianceFilter: Targeted electrical appliance disaggregation with prior knowledge fusion. Appl. Energy 2024, 365, 123157. [Google Scholar] [CrossRef]
- Yan, J.; Zhang, Y.; Hu, J.; Cui, H.; Chi, J.; Yang, G.; Chen, C.; Yu, T. Prior-based bi-encoder transformer for underwater image enhancement. Multimed. Syst. 2025, 31, 3. [Google Scholar] [CrossRef]
- Choi, M.; Han, K.; Wang, X.; Zhang, Y.; Liu, Z. A dual-stream neural network explains the functional segregation of dorsal and ventral visual pathways in human brains. In Proceedings of the 37th International Conference on Neural Information Processing Systems (NIPS ‘23), New York, NY, USA, 10–16 December 2023; pp. 50408–50428. [Google Scholar]
- Potlapalli, V.; Zamir, S.W.; Khan, S.; Khan, F.S. PromptIR: Prompting for all-in-one blind image restoration. In Proceedings of the 37th International Conference on Neural Information Processing Systems, New York, NY, USA, 10 December 2023; pp. 71275–71293. [Google Scholar]
- Ronneberger, O.; Fischer, P.; Brox, T. U-Net: Convolutional networks for biomedical image segmentation. In Medical Image Computing and Computer-Assisted Intervention; Springer International Publishing: Berlin/Heidelberg, Germany, 2015. [Google Scholar]
- Zheng, D.; Zheng, X.; Yang, L.; Gao, C.; Zhu, Y.; Ruan, M. MFFN: Multi-view feature fusion network for camouflaged object detection. In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), Hawaii, HI, USA, 2–7 January 2023; pp. 6221–6231. [Google Scholar]
- Hu, X.; Wang, S.; Qin, X.; Dai, H.; Ren, W.; Luo, D.; Tai, Y.; Shao, L. High-resolution iterative feedback network for camouflaged object detection. In Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, Washington, DC, USA, 7–14 February 2023; pp. 881–889. [Google Scholar]
- Ge, Y.; Ren, J.; Zhang, C.; He, M.; Bi, H.; Zhang, Q. Feature-aware and iterative refinement network for camouflaged object detection. Vis. Comput. 2024, 41, 4741–4758. [Google Scholar] [CrossRef]
- Liu, C.; Yao, H.; Qiu, W.; Cui, H.; Fang, Y.; Xu, A. Multi-scale feature map fusion encoding for underwater object segmentation. Appl. Intell. 2025, 55, 163. [Google Scholar] [CrossRef]
Positive Category | Negative Category | |
---|---|---|
Positive category | (True Positive) TP | (False Negative) FN |
Negative category | (False Positive) FP | (True Negative) TN |
Model | mDice | mIou | MAE | |||
---|---|---|---|---|---|---|
Unet | 0.768 | 0.856 | 0.828 | 0.784 | 0.810 | 0.029 |
MFFN | 0.566 | 0.465 | 0.709 | 0.589 | 0.757 | 0.072 |
HitNet | 0.510 | 0.420 | 0.677 | 0.548 | 0.752 | 0.070 |
FIRNet | 0.532 | 0.425 | 0.673 | 0.524 | 0.781 | 0.090 |
MASNet | 0.630 | 0.544 | 0.752 | 0.833 | 0.772 | 0.047 |
Deeplab-FusionNet | 0.475 | 0.812 | 0.725 | 0.752 | 0.643 | 0.069 |
Ours | 0.823 | 0.892 | 0.856 | 0.834 | 0.924 | 0.023 |
Experiment Number | Dynamic Prompt Block | Boundary Enhance Block | Bionic Retinal Dual-Channel Block | mDice | mIou | MAE | |||
---|---|---|---|---|---|---|---|---|---|
A | 0.768 | 0.856 | 0.828 | 0.784 | 0.810 | 0.029 | |||
B | √ | 0.817 | 0.876 | 0.854 | 0.826 | 0.915 | 0.023 | ||
C | √ | √ | 0.818 | 0.882 | 0.857 | 0.828 | 0.918 | 0.023 | |
D | √ | √ | √ | 0.823 | 0.892 | 0.856 | 0.834 | 0.924 | 0.023 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Zheng, J.; Fu, Y.; Lu, J.; Liu, J.; Luo, Z.; Zhang, S. RetinalCoNet: Underwater Fish Segmentation Network Based on Bionic Retina Dual-Channel and Multi-Module Cooperation. Fishes 2025, 10, 424. https://doi.org/10.3390/fishes10090424
Zheng J, Fu Y, Lu J, Liu J, Luo Z, Zhang S. RetinalCoNet: Underwater Fish Segmentation Network Based on Bionic Retina Dual-Channel and Multi-Module Cooperation. Fishes. 2025; 10(9):424. https://doi.org/10.3390/fishes10090424
Chicago/Turabian StyleZheng, Jianhua, Yusha Fu, Junde Lu, Jinfang Liu, Zhaoxi Luo, and Shiyu Zhang. 2025. "RetinalCoNet: Underwater Fish Segmentation Network Based on Bionic Retina Dual-Channel and Multi-Module Cooperation" Fishes 10, no. 9: 424. https://doi.org/10.3390/fishes10090424
APA StyleZheng, J., Fu, Y., Lu, J., Liu, J., Luo, Z., & Zhang, S. (2025). RetinalCoNet: Underwater Fish Segmentation Network Based on Bionic Retina Dual-Channel and Multi-Module Cooperation. Fishes, 10(9), 424. https://doi.org/10.3390/fishes10090424