Weakly Supervised Deep Learning for Ocular Image Segmentation: A Systematic Review of Fundus and OCT Methods
Abstract
1. Introduction
2. Materials and Methods
2.1. Related Work
2.2. Research Questions
- RQ1: Which weakly supervised deep learning approaches have been proposed for ocular image segmentation in fundus and OCT images?
- RQ2: For optic disc and cup segmentation in fundus images, how do weakly supervised deep learning methods compare to fully supervised baselines in terms of segmentation performance and annotation effort?
- RQ3: For retinal layer and vessel segmentation in OCT, how do 2D, 3D, and hybrid 2D + 3D deep learning designs trained under weakly supervised regimes perform compared with fully supervised baselines?
2.3. Search Strategy
2.4. Data Extraction
2.5. Methodological Quality Assessment of Included Reviews
2.6. Synthesis and Quality Appraisal
3. Results
4. Discussion
5. Conclusions
Supplementary Materials
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
Abbreviations
| CL | Contrastive Learning |
| CNN | Convolutional Neural Network |
| DL | Deep Learning |
| FA | Fluorescein Angiography |
| FAF | Fundus Autofluorescence |
| FS | Fully Supervised |
| MIL | Multiple-Instance Learning |
| OCT | Optical Coherence Tomography |
| OCTA | Optical Coherence Tomography Angiography |
| OD/OC | Optic Disc/Optic Cup |
| PoP | Publish or Perish |
| PRISMA | Preferred Reporting Items for Systematic Reviews and Meta-Analyses |
| RoB | Risk of Bias |
| SSL | Self-Supervised Learning |
| ViT | Vision Transformer |
| WSL | Weakly Supervised Learning |
| WSSS | Weakly Supervised Semantic Segmentation |
Appendix A
Full Search Strategies and Search Yields
| Source | Interface | Years | Language | Fields Searched | Key Settings |
|---|---|---|---|---|---|
| Google Scholar | Publish or Perish (PoP) v8.19 | 2020–2026 | English | Title + keywords | Max results per query: 200; include citations: No; include patents: No; only review articles: Yes |
| PubMed | PubMed (web) | 2020–2026 | English | Title/ Abstract (tiab) | Filters: year range; English |
| Scopus | Scopus (Advanced Search) | 2020–2026 | English | TITLE-ABS-KEY | Filters: year range; English |
| Google Scholar (Publish or Perish v8.19) Settings: Years 2020–2026; Language English; Fields searched Title + Keywords; Max results 200; exclude patents; exclude citations. Query 1 (weak-supervision surveys): Title: (“review” OR “survey”) AND (“weakly supervised” OR “weak supervision”) Hits: 14 Query 2 (ocular modalities + weak supervision): Title: (“review” OR “survey”) AND “deep learning” AND (“optical coherence tomography” OR “OCT” OR “fundus”); Keywords: (“weak supervision” OR “weakly supervised”) Hits: 6 Query 3 (segmentation + ocular + weak supervision): Title: (“review” OR “survey”) AND (“deep learning segmentation”); Keywords: (“optical coherence tomography” OR “OCT” OR “fundus images”) AND (“weak supervision” OR “weakly supervised”) Hits: 8 PubMed (executed: 2020–2026) Query: ((“weak supervision”[tiab] OR “weakly supervised”[tiab] OR “weakly-supervised”[tiab] OR “weak label*”[tiab] OR “inexact label*”[tiab] OR “sparse annotation*”[tiab] OR scribble*[tiab] OR “point annotation*”[tiab] OR “bounding box*”[tiab] OR “box annotation*”[tiab] OR “pseudo-label*”[tiab] OR “pseudo label*”[tiab] OR “multiple instance”[tiab] OR MIL[tiab]) AND (fundus[tiab] OR OCT[tiab] OR “optical coherence tomography”[tiab] OR retina*[tiab] OR ocular[tiab] OR ophthalm*[tiab]) AND (review[tiab] OR survey[tiab] OR “systematic review”[tiab] OR “scoping review”[tiab] OR overview[tiab] OR tutorial[tiab])) Hits: 4 Scopus (executed: 2020–2026) Query: TITLE-ABS-KEY((“weakly supervised” OR “weak supervision” OR “weakly-supervised” OR “weak label*” OR “inexact label*” OR “sparse annotation*” OR scribble* OR “point annotation*” OR “bounding box*” OR “box annotation*” OR “pseudo-label*” OR “pseudo label*” OR “multiple instance” OR MIL) AND (segment* OR segmentation OR “image segmentation” OR “semantic segmentation”) AND (fundus OR OCT OR “optical coherence tomography” OR retina* OR retinal OR ocular OR ophthalm* OR “optic disc” OR “optic nerve head”) AND (review OR survey OR “systematic review” OR “scoping review” OR overview OR tutorial)) Hits: 6 |
References
- Alawad, M.; Aljouie, A.; Alamri, S.; Alghamdi, M.; Alabdulkader, B.; Alkanhal, N.; Almazroa, A. Machine Learning and Deep Learning Techniques for Optic Disc and Cup Segmentation—A Review. Clin. Ophthalmol. 2022, 2022, 747–764. [Google Scholar] [CrossRef]
- Goutam, B.; Hashmi, M.F.; Geem, Z.W.; Bokde, N.D. A Comprehensive Review of Deep Learning Strategies in Retinal Disease Diagnosis Using Fundus Images. IEEE Access 2022, 10, 57796–57823. [Google Scholar] [CrossRef]
- Anusuya, S.; Masoodhu Banu, N.M. A Comprehensive Review of Glaucoma Detection from Fundus Images Using Deep Learning. In Proceedings of the 2023 Second International Conference on Augmented Intelligence and Sustainable Systems (ICAISS), Trichy, India, 23–25 August 2023; pp. 1–6. [Google Scholar]
- Zedan, M.J.M.; Zulkifley, M.A.; Ibrahim, A.A.; Moubark, A.M.; Kamari, N.A.M.; Abdani, S.R. Automated Glaucoma Screening and Diagnosis Based on Retinal Fundus Images Using Deep Learning Approaches: A Comprehensive Review. Diagnostics 2023, 13, 2180. [Google Scholar] [CrossRef]
- Rizvana, M.; Narayanan, S. Deep Learning of Fundus Images and Optical Coherence Tomography Images for Ocular Disease Detection—A Review. Multimed. Tools Appl. 2024, 83, 88745–88789. [Google Scholar]
- Chen, K.; Gao, G.; Yang, X.; Wang, W.; Na, J. Denoising, Segmentation and Volumetric Rendering of Optical Coherence Tomography Angiography (OCTA) Images Using Deep Learning Techniques: A Comprehensive Review. arXiv 2025, arXiv:2502.14935. [Google Scholar]
- Xue, H.; Hamed, H.N.B.A.; Isyaku, B.; Qichen, S.; Xin, D. Analysis of Atherosclerotic Plaques Using OCT Images Based on Deep Learning: A Comprehensive Review. KSII Trans. Internet Inf. Syst. TIIS 2024, 18, 3256–3277. [Google Scholar]
- Zhang, M.; Zhou, Y.; Zhao, J.; Man, Y.; Liu, B.; Yao, R. A Survey of Semi- and Weakly Supervised Semantic Segmentation of Images. Artif. Intell. Rev. 2020, 53, 4259–4288. [Google Scholar] [CrossRef]
- Bohlender, S.; Oksuz, I.; Mukhopadhyay, A. A Survey on Shape-Constraint Deep Learning for Medical Image Segmentation. IEEE Rev. Biomed. Eng. 2021, 16, 225–240. [Google Scholar] [CrossRef]
- Gao, Y.; Jiang, Y.; Peng, Y.; Yuan, F.; Zhang, X.; Wang, J. Medical Image Segmentation: A Comprehensive Review of Deep Learning-Based Methods. Tomography 2025, 11, 52. [Google Scholar] [CrossRef] [PubMed]
- Ilesanmi, A.E.; Ilesanmi, T.; Gbotoso, G.A. A systematic review of retinal fundus image segmentation and classification methods using convolutional neural networks. Healthc. Anal. 2023, 4, 100261. [Google Scholar] [CrossRef]
- Qin, Q.; Chen, Y. A review of retinal vessel segmentation for fundus image analysis. Eng. Appl. Artif. Intell. 2024, 128, 107454. [Google Scholar] [CrossRef]
- Abdulsahib, A.A.; Mahmoud, M.A.; Mohammed, M.A.; Rasheed, H.H.; Mostafa, S.A.; Maashi, M.S. Comprehensive review of retinal blood vessel segmentation and classification techniques: Intelligent solutions for green computing in medical images. Netw. Model. Anal. Health Inform. Bioinform. 2021, 10, 20. [Google Scholar]
- Abdi, S.; Abdulazeez, A.M. A comprehensive review of deep learning in OCT image segmentation and classification. Med. Novel Technol. Devices 2025, 28, 100396. [Google Scholar]
- Rajarajeshwari, G.; Selvi, G.C. Application of artificial intelligence for classification, segmentation, early detection, early diagnosis, and grading of diabetic retinopathy from fundus retinal images: A comprehensive review. IEEE Access 2024, 12, 172499–172536. [Google Scholar] [CrossRef]
- Zazueta, L.J.G.; Covarrubias, B.L.L.; Cota, C.X.N.; Briseño, M.V.; Hipólito, J.I.N.; Rodríguez, G.J.A. Segmentation Algorithms in Fundus Images: A Review of Digital Image Analysis Techniques. Appl. Sci. 2025, 15, 11324. [Google Scholar] [CrossRef]
- Ma, X.; Cao, G.; Chen, Y. A review of optic disc and optic cup segmentation based on fundus images. IET Image Process. 2024, 18, 2521–2539. [Google Scholar] [CrossRef]
- Veena, H.N.; Muruganandham, A.; Kumaran, T.S. A review on the optic disc and optic cup segmentation and classification approaches over retinal fundus images for detection of glaucoma. SN Appl. Sci. 2020, 2, 1476. [Google Scholar] [CrossRef]
- Gautam, A.; Shanker, R. Diabetic retinopathy detection from fundus images: A wide survey from grading to segmentation of lesions. Comput. Biol. Med. 2025, 196, 110715. [Google Scholar] [CrossRef]
- Ilesanmi, A.; Ilesanmi, T.; Idowu, O.P.; Torigian, D.A.; Udupa, J.K. Segmentation and Classification of Retinal Fundus Images using Convolutional Neural Networks: A systematic review of methods and latest trends. Res. Sq. 2022. [Google Scholar] [CrossRef]
- Zhang, H.; Yang, B.; Li, S.; Zhang, X.; Li, X.; Liu, T.; Higashita, R.; Liu, J. Retinal OCT image segmentation with deep learning: A review of advances, datasets, and evaluation metrics. Comput. Med. Imaging Graph. 2025, 123, 102539. [Google Scholar] [CrossRef] [PubMed]
- Zhou, Z.-H. A Brief Introduction to Weakly Supervised Learning. Natl. Sci. Rev. 2018, 5, 44–53. [Google Scholar]
- Cheplygina, V.; de Bruijne, M.; Pluim, J.P.W. Not-so-Supervised: A Survey of Semi-Supervised, Multiple-Instance, and Transfer Learning in Medical Image Analysis. Med. Image Anal. 2019, 54, 280–296. [Google Scholar]
- Shen, W.; Peng, Z.; Wang, X.; Wang, H.; Cen, J.; Jiang, D.; Xie, L.; Yang, X.; Tian, Q. A Survey on Label-Efficient Deep Image Segmentation: Bridging the Gap Between Weak Supervision and Dense Prediction. IEEE Trans. Pattern Anal. Mach. Intell. 2023, 45, 9284–9305. [Google Scholar] [CrossRef] [PubMed]
- Qu, L.; Liu, S.; Liu, X.; Wang, M.; Song, Z. Towards Label-Efficient Automatic Diagnosis and Analysis: A Comprehensive Survey of Advanced Deep Learning-Based Weakly Supervised, Semi-Supervised and Self-Supervised Techniques. Med. Image Anal. 2022, 79, 102475. [Google Scholar]
- Green, R. Learning with Imperfect Labels and Incomplete Views: A Review of Representation Methods for Weakly Supervised Perception. TechRxiv 2025. [Google Scholar] [CrossRef]
- Zhang, X.; Wang, J.; Wei, J.; Yuan, X.; Wu, M. A Review of Non-Fully Supervised Deep Learning for Medical Image Segmentation. Information 2025, 16, 433. [Google Scholar] [CrossRef]
- Fasana, C.; Pasini, S.; Milani, F.; Fraternali, P. Weakly Supervised Object Detection for Remote Sensing Images: A Survey. Remote. Sens. 2022, 14, 5362. [Google Scholar] [CrossRef]
- Martínez-Heredia, A.M.; Ventura, S. Weak Supervision: A Survey on Predictive Maintenance. WIREs Data Min. Knowl. Discov. 2025, 15, e70022. [Google Scholar] [CrossRef]
- Zhao, B.; Cen, J.; Si, W.; Chen, H. A Review of Bearing Fault Diagnosis Based on Weakly Supervised Learning. Machines 2023, 11, 112. [Google Scholar]
- Hassan, H.; Ren, Z.; Zhou, C.; Khan, M.A.; Pan, Y.; Zhao, J.; Huang, B. Supervised and Weakly Supervised Deep Learning Models for COVID-19 CT Diagnosis: A Systematic Review. Comput. Methods Programs Biomed. 2022, 218, 106731. [Google Scholar] [CrossRef]
- Wan, X.C.; Chen, L. Is Denoising Necessary for Ultrasound Image Segmentation Deep Learning: Review and Benchmark. Learning 2023, 25, 45. [Google Scholar]
- Mehrnia, S.S.; Safahi, Z.; Mousavi, A.; Panahandeh, F.; Farmani, A.; Yuan, R.; Rahmim, A.; Salmanpour, M.R. Landscape of 2D Deep Learning Segmentation Networks Applied to CT Scan from Lung Cancer Patients: A Systematic Review. J. Imaging Inform. Med. 2025, 38, 3711–3740. [Google Scholar] [CrossRef]
- Ouassit, Y.; Ghoumid, K.; Damoiseaux, J.-L.; Benmoussa, N. A Brief Survey on Weakly Supervised Semantic Segmentation. IEEE Access 2022, 10, 81268–81289. [Google Scholar] [CrossRef]
- Page, M.J.; McKenzie, J.E.; Bossuyt, P.M.; Boutron, I.; Hoffmann, T.C.; Mulrow, C.D.; Shamseer, L.; Tetzlaff, J.M.; Akl, E.A.; Brennan, S.E.; et al. The PRISMA 2020 statement: An updated guideline for reporting systematic reviews. BMJ 2021, 372, n71. [Google Scholar] [CrossRef]
- PROSPERO. International Prospective Register of Systematic Reviews. Centre for Reviews and Dissemination, University of York. Available online: https://www.crd.york.ac.uk/prospero/ (accessed on 11 January 2026).
- Staal, J.; Abramoff, M.D.; Niemeijer, M.; Viergever, M.A.; van Ginneken, B. Ridge-Based Vessel Segmentation in Color Images of the Retina. IEEE Trans. Med. Imaging 2004, 23, 501–509. [Google Scholar] [CrossRef] [PubMed]
- Hoover, A.; Kouznetsova, V.; Goldbaum, M. Locating Blood Vessels in Retinal Images by Piecewise Threshold Probing of a Matched Filter Response. IEEE Trans. Med. Imaging 2000, 19, 203–210. [Google Scholar] [CrossRef]
- Fraz, M.M.; Remagnino, P.; Hoppe, A.; Uyyanonvara, B.; Rudnicka, A.R.; Owen, C.G.; Barman, S.A. An Ensemble Classification-Based Approach Applied to Retinal Blood Vessel Segmentation. IEEE Trans. Biomed. Eng. 2012, 59, 2538–2548. [Google Scholar] [CrossRef]
- Odstrcilik, J.; Kolar, R.; Budai, A.; Hornegger, J.; Jan, J.; Gazarek, J.; Kubena, T.; Cernosek, P.; Svoboda, O.; Angelopoulou, E. Retinal Vessel Segmentation by Improved Matched Filtering: Evaluation on a New High-Resolution Fundus Image Database. IET Image Process. 2013, 7, 373–383. [Google Scholar] [CrossRef]
- Holm, S.; Russell, G.; Nourrit, V.; McLoughlin, N. DR HAGIS—A Fundus Image Database for the Automatic Extraction of Retinal Surface Vessels from Diabetic Patients. J. Med. Imaging 2017, 4, 014503. [Google Scholar] [CrossRef] [PubMed]
- Hu, Q.; Abràmoff, M.D.; Garvin, M.K. Retinal Images vessel Tree Extraction (RITE) Dataset. Med. Image Comput. Comput. Assist. Interv. 2013, 16, 436–443. [Google Scholar] [PubMed]
- Zhang, J.; Dashtbozorg, B.; Bekkers, E.; Pluim, J.P.W.; Duits, R.; ter Haar Romeny, B.M. Robust Retinal Vessel Segmentation via Locally Adaptive Derivative Frames in Orientation Scores. IEEE Trans. Med. Imaging 2016, 35, 2631–2644. [Google Scholar] [CrossRef]
- Zhang, Z.; Yin, F.S.; Liu, J.; Wong, W.K.; Tan, N.M.; Lee, B.H.; Cheng, J.; Wong, T.Y. ORIGA(-light): An Online Retinal Fundus Image Database for Glaucoma Analysis and Research. In Proceedings of the 2010 Annual International Conference of the IEEE Engineering in Medicine and Biology, Buenos Aires, Argentina, 31 August–4 September 2010; pp. 3065–3068. [Google Scholar]
- Sivaswamy, J.; Krishnadas, S.R.; Joshi, G.D.; Jain, M.; Tabish, A.U. Drishti-GS: Retinal Image Dataset for Optic Nerve Head (ONH) Segmentation. In Proceedings of the 2014 IEEE 11th International Symposium on Biomedical Imaging (ISBI), Beijing, China, 29 April–2 May 2014; pp. 53–56. [Google Scholar]
- Carmona, E.J.; Rincón, M.; García-Feijoó, J.; Martínez-de-la-Casa, J.M. Identification of the Optic Nerve Head with Genetic Algorithms. Artif. Intell. Med. 2008, 43, 243–259. [Google Scholar] [CrossRef]
- Batista, A.F.; Costa, A.E.; Bernardino, A. RIM-ONE DL: A Unified Retinal Image Database for Assessing Glaucoma Using Deep Learning. Image Anal. Stereol. 2020, 39, 161–167. [Google Scholar]
- Almazroa, A.A.; Alodhayb, S.; Osman, E.; Ramadan, E.; Hummadi, M.; Dlaim, M.; Alkatee, M.; Raahemifar, K.; Lakshminarayanan, V. Retinal Fundus Images for Glaucoma Analysis: The RIGA Dataset. In Proceedings of the SPIE MEDICAL IMAGING, Houston, TX, USA, 10–15 February 2018; Volume 10579, pp. 55–62. [Google Scholar]
- Orlando, J.I.; Fu, H.; Breda, J.B.; van Keer, K.; Bathula, D.R.; Diaz-Pinto, A.; Fang, R.; Heng, P.-A.; Kim, J.; Lee, J.; et al. REFUGE Challenge: A Unified Framework for Evaluating Automated Methods for Glaucoma Assessment from Fundus Photographs. Med. Image Anal. 2020, 59, 101570. [Google Scholar] [CrossRef]
- Fang, H.; Li, F.; Wu, J.; Fu, H.; Sun, X.; Son, J.; Yu, S.; Zhang, M.; Yuan, C.; Bian, C.; et al. REFUGE2 Challenge: A Treasure Trove for Multi-Dimension Analysis and Evaluation in Glaucoma Screening. arXiv 2022, arXiv:2202.08994. [Google Scholar] [CrossRef]
- Bajwa, M.N.; Singh, G.A.P.; Neumeier, W.; Malik, M.I.; Dengel, A.; Ahmed, S. G1020: A Benchmark Retinal Fundus Image Dataset for Computer-Aided Glaucoma Detection. In Proceedings of the 2020 International Joint Conference on Neural Networks (IJCNN), Glasgow, UK, 19–24 July 2020. [Google Scholar]
- Porwal, P.; Pachade, S.; Kamble, R.; Kokare, M.; Deshmukh, G.; Sahasrabuddhe, V.; Meriaudeau, F. Indian Diabetic Retinopathy Image Dataset (IDRiD): A Database for Diabetic Retinopathy Screening Research. Data 2018, 3, 25. [Google Scholar] [CrossRef]
- Decencière, E.; Zhang, X.; Cazuguel, G.; Lay, B.; Cochener, B.; Trone, C.; Gain, P.; Ordonez, R.; Massin, P.; Erginay, A.; et al. Feedback on a Publicly Distributed Image Database: The Messidor Database. Image Anal. Ster. 2014, 33, 231. [Google Scholar] [CrossRef]
- Lin, L.; Li, M.; Huang, Y.; Cheng, P.; Xia, H.; Wang, K.; Tang, X. The SUSTech-SYSU Dataset for Automated Exudate Detection and Diabetic Retinopathy Grading. Sci. Data 2020, 7, 409. [Google Scholar] [CrossRef]
- Chiu, S.J.; Li, X.T.; Nicholas, P.; Toth, C.A.; Izatt, J.A.; Farsiu, S. Kernel Regression Based Segmentation of Optical Coherence Tomography Images with Diabetic Macular Edema. Biomed. Opt. Express 2015, 6, 1172–1194. [Google Scholar] [CrossRef] [PubMed]
- Bogunovic, H.; Venhuizen, F.; Klimscha, S.; Apostolopoulos, S.; Bab-Hadiashar, A.; Bagci, U.; Beg, M.F.; Bekalo, L.; Chen, Q.; Ciller, C.; et al. RETOUCH: The Retinal OCT Fluid Detection and Segmentation Benchmark and Challenge. IEEE Trans. Med. Imaging 2019, 38, 1858–1874. [Google Scholar]
- Li, M.; Huang, K.; Xu, Q.; Yang, J.; Zhang, Y.; Ji, Z.; Xie, K.; Yuan, S.; Liu, Q.; Chen, Q. OCTA-500: A Retinal OCTA Dataset for Multi-Scale Vascular Analysis. Med. Image Anal. 2024, 93, 103092. [Google Scholar] [CrossRef] [PubMed]
- Ma, Y.; Hao, H.; Xie, J.; Fu, H.; Zhang, J.; Yang, J.; Liu, J.; Zheng, Y.; Wang, Y. ROSE: A Retinal OCT-Angiography Vessel Segmentation Dataset and Algorithmic Benchmark. Med. Image Anal. 2021, 73, 102162. [Google Scholar]
- Fang, H.; Li, F.; Fu, H.; Wu, J.; Zhang, X.; Xu, Y. Dataset and Evaluation Algorithm Design for GOALS Challenge. arXiv 2022, arXiv:2207.14447. [Google Scholar] [CrossRef]
- MICCAI Registered Challenges. Glaucoma OCT Analysis and Layer Segmentation (GOALS); Zenodo: Geneva, Switzerland, 2022. [Google Scholar]




| Review Articles | Description | Medical Imaging | Weak Labels | Learning Strategies | Backbones | ||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| OCT | Fundus | CAM | MIL | Sparse | Auto– Labels | Anatomy Rules | U-Net Family | DeepLab Family | Transformer/ Hybrid | ||
| Rizvana & Narayanan (2024) [5] | Deep learning of fundus images and optical coherence tomography images for ocular disease detection—A review | ✓ | ✓ | ✓ | |||||||
| Xue et al. (2024) [7] | Analysis of Atherosclerotic Plaques Using OCT Images Based on Deep Learning: A Comprehensive Review | ✓ | ✓ | ✓ | |||||||
| Chen et al. (2025) [6] | Denoising, segmentation and volumetric rendering of optical coherence tomography angiography (OCTA) images using deep learning techniques: A comprehensive review | ✓ | ✓ | ||||||||
| Alawad et al. (2022) [1] | Machine Learning and Deep Learning Techniques for Optic Disc and Cup Segmentation—A Review | ✓ | |||||||||
| Goutam et al. (2022) [2] | A Comprehensive Review of Deep Learning Strategies in Retinal Disease Diagnosis Using Fundus Images | ✓ | ✓ | ✓ | ✓ | ||||||
| Anusuya & Masoodhu Banu (2023) [3] | A Comprehensive Review of Glaucoma Detection from Fundus Images using Deep Learning | ✓ | |||||||||
| Zedan et al. (2023) [4] | Automated Glaucoma Screening and Diagnosis Based on Retinal Fundus Images Using Deep Learning Approaches | ✓ | ✓ | ✓ | ✓ | ||||||
| Zhang et al. (2020) [8] | A survey of semi- and weakly supervised semantic segmentation of images | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ||
| Ouassit et al. (2022) [34] | A Brief Survey on Weakly Supervised Semantic Segmentation | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ||||
| Hassan et al. (2022) [31] | Supervised and weakly supervised deep learning models for COVID-19 CT diagnosis: A systematic review | ✓ | ✓ | ✓ | ✓ | ✓ | |||||
| Qu et al. (2022) [25] | Towards label-efficient automatic diagnosis and analysis: a comprehensive survey of advanced deep learning-based weakly supervised, semi-supervised and self … | ✓ | ✓ | ✓ | ✓ | ||||||
| Shen et al. (2023) [24] | A Survey on Label-Efficient Deep Image Segmentation: Bridging the Gap Between Weak Supervision and Dense Prediction | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ||
| Bohlender et al. (2023) [9] | A Survey on Shape-Constraint Deep Learning for Medical Image Segmentation | ✓ | ✓ | ||||||||
| Zhang et al. (2025) [27] | A Review of Non-Fully Supervised Deep Learning for Medical Image Segmentation | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ||
| Gao et al. (2025) [10] | Medical Image Segmentation: A Comprehensive Review of Deep Learning-Based Methods | ✓ | ✓ | ✓ | ✓ | ||||||
| Green (2025) [26] | Learning with Imperfect Labels and Incomplete Views: A Review of Representation Methods for Weakly Supervised Perception | ✓ | ✓ | ||||||||
| This review (PRISMA) | Weakly Supervised Deep Learning for Ocular Image Segmentation: A Systematic Umbrella Review of Surveys and Reviews on Fundus and OCT Methods | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ |
| Identifier | Inclusion Criteria |
|---|---|
| IC1 | Review-type articles published in journals or conferences; preprints were also eligible. |
| IC2 | Uses deep learning and addresses weak supervision. |
| IC3 | Discusses methods or applications relevant to weak supervision and/or segmentation or related perception tasks (e.g., segmentation, detection/localization, saliency, temporal localization), including medical and ocular imaging. |
| IC4 | The full text is available in English. |
| IC5 | Published between 2020 and 2026. |
| Identifier | Exclusion Criteria |
|---|---|
| EC1 | Not a review-type article. |
| EC2 | Not involving deep learning methods. |
| EC3 | Not about weak supervision and not relevant to segmentation/related perception tasks. |
| EC4 | Full text not obtainable (or not available in English). |
| EC5 | Duplicate record of another included review. |
| ID | Included Review | Review Type | Protocol Registered? | Adequate Search? | Selection/Extraction in Duplicate? | RoB Assessed? | Overall Confidence |
|---|---|---|---|---|---|---|---|
| R1 | Zhang (2020) [8] | Narrative survey | N | N | Y/N | N | Critically low |
| R2 | Ouassit (2022) [34] | Narrative survey | N | N | Y/N | N | Critically low |
| R3 | Shen (2023) [24] | Narrative survey | N | PY | Y/N | N | Critically low |
| R4 | Qu (2022) [25] | Narrative survey | N | PY | Y/N | N | Critically low |
| R5 | Green (2025) [26] | Narrative survey | N | N | N/N | N | Critically low |
| R6 | Zhang (2025) [27] | Narrative survey | N | PY | Y/N | N | Critically low |
| R7 | Bohlender (2023) [9] | Narrative survey | N | PY | Y/N | N | Critically low |
| R8 | Gao (2025) [10] | Narrative survey | N | PY | Y/Y | N | Critically low |
| R9 | Fasana (2022) [28] | Narrative survey | N | PY | N/N | N | Critically low |
| R10 | Martínez-Heredia (2025) [29] | Narrative survey | N | Y | Y/N | Y | Critically low |
| R11 | Zhao (2023) [30] | Narrative survey | N | PY | N/N | N | Critically low |
| R12 | Hassan (2022) [31] | Systematic review | N | Y | Y/N | N | Critically low |
| R13 | Liu (2023) [32] | Narrative review | N | PY | Y/N | N | Critically low |
| R14 | Mehrnia (2025) [33] | Narrative review | N | Y | Y/Y | N | Critically low |
| R15 | Alawad (2022) [1] | Narrative review | N | PY | Y/N | N | Critically low |
| R16 | Goutam (2022) [2] | Narrative review | N | PY | Y/N | N | Critically low |
| R17 | Anusuya (2023) [3] | Narrative review | N | PY | Y/N | N | Critically low |
| R18 | Zedan (2023) [4] | Narrative review | N | Y | Y/N | N | Critically low |
| R19 | Rizvana & Narayanan (2024) [5] | Narrative review | N | N | Y/N | N | Critically low |
| R20 | Xue (2024) [7] | Narrative review | N | PY | Y/N | N | Critically low |
| R21 | Chen (2025) [6] | Narrative review | N | PY | Y/N | N | Critically low |
| ID | First Author (Year) | Domain Focus | Segmentation as a Main Topic? | Ocular Imaging Covered? |
|---|---|---|---|---|
| R1 | Zhang (2020) [8] | Generic WSL/segmentation | Yes | No |
| R2 | Ouassit (2022) [34] | Generic WSL/segmentation | Yes | No |
| R3 | Shen (2023) [24] | Generic WSL/segmentation | Yes | No |
| R4 | Qu (2022) [25] | Generic WSL/segmentation | Partly | No |
| R5 | Green (2025) [26] | Generic WSL | No | No |
| R6 | Zhang (2025) [27] | Medical segmentation | Yes | No |
| R7 | Bohlender (2023) [9] | Medical segmentation | Yes | Occasional |
| R8 | Gao (2025) [10] | Medical segmentation | Yes | Occasional |
| R9 | Fasana (2022) [28] | Other domain | No | No |
| R10 | Martínez-Heredia (2025) [29] | Other domain | No | No |
| R11 | Zhao (2023) [30] | Other domain | No | No |
| R12 | Hassan (2022) [31] | Medical imaging | Partly | No |
| R13 | Liu (2023) [32] | Medical segmentation | Yes | No |
| R14 | Mehrnia (2025) [33] | Medical segmentation | Yes | No |
| R15 | Alawad (2022) [1] | Ocular | Yes | Fundus |
| R16 | Goutam (2022) [2] | Ocular | Partly | Fundus |
| R17 | Anusuya (2023) [3] | Ocular | Partly | Fundus |
| R18 | Zedan (2023) [4] | Ocular | Partly | Fundus |
| R19 | Rizvana & Narayanan (2024) [5] | Ocular | Partly | Fundus + OCT |
| R20 | Xue (2024) [7] | OCT | Yes | OCT |
| R21 | Chen (2025) [6] | Ocular | Yes | OCTA |
| ID | First Author (Year) | Weakly Supervised Learning Coverage | Weakly Supervised Label Types | Label-Efficient Strategies |
|---|---|---|---|---|
| R1 | Zhang (2020) [8] | Detailed | CAM, MIL, sparse (points/scribbles/boxes) | Pseudo-/self-training, consistency, semi-supervision |
| R2 | Ouassit (2022) [34] | Detailed | CAM, seeds, sparse, some MIL | Seed expansion, losses, post-processing, some pseudo-labelling |
| R3 | Shen (2023) [24] | Detailed | Weak labels broadly | Pseudo-label curricula, consistency, SSL/contrastive, semi-/self-supervision |
| R4 | Qu (2022) [25] | Detailed | Weak, semi-, self-supervision | Pseudo-labels, self-training, SSL, multi-task/curriculum |
| R5 | Green (2025) [26] | Conceptual | Inexact/inaccurate labels, partial views | Robust objectives, sample re-weighting, multi-view consistency |
| R6 | Zhang (2025) [27] | Detailed | Weak, semi-, self-supervision; sparse labels | Pseudo-/auto-labelling, SSL, teacher–student, hybrid supervision |
| R7 | Bohlender (2023) [9] | Conceptual | Not main focus; mostly assumes full labels | Shape/topology priors, anatomy-aware losses, CRF |
| R8 | Gao (2025) [10] | Conceptual | Mostly fully supervised; brief WSL/SSL mentions | Transfer learning, occasional semi-/self-supervision |
| R9 | Fasana (2022) [28] | Detailed | Image-level labels, MIL, proposal boxes | MIL training, pseudo-labelling, OICR-style refinement |
| R10 | Martínez-Heredia (2025) [29] | Detailed | Weak labels in time-series (incomplete/inexact) | Self-training, EM-style refinement, heuristics |
| R11 | Zhao (2023) [30] | Detailed | Incomplete/inaccurate labels, weak supervision | Domain adaptation, pseudo-labels, MIL, SSL |
| R12 | Hassan (2022) [31] | Conceptual | Weak labels mainly for diagnosis (COVID-19 CT) | Pseudo-labelling, semi-supervision (classification-heavy) |
| R13 | Liu (2023) [32] | Conceptual | Weak labels mentioned only tangentially | Focus on denoising + supervised segmentation; some transfer learning |
| R14 | Mehrnia (2025) [33] | Conceptual | Non-fully supervised approaches briefly noted | Mainly full supervision; occasional SSL/semi-supervision |
| R15 | Alawad (2022) [1] | Minimal | WSL rarely/only briefly mentioned | Transfer learning, some CAM-based visualization |
| R16 | Goutam (2022) [2] | Conceptual | WSL/label efficiency mentioned at high level | Transfer learning, data augmentation, some CAM |
| R17 | Anusuya (2023) [3] | None | – | Focus on supervised glaucoma classification and features |
| R18 | Zedan (2023) [4] | Minimal | CAM/heatmaps occasionally for explainability | Supervised pipelines; transfer learning |
| R19 | Rizvana & Narayanan (2024) [5] | Conceptual | CAM/attention for interpretability | Transfer learning, some attention mechanisms |
| R20 | Xue (2024) [7] | None | – | Supervised training, transfer learning |
| R21 | Chen (2025) [6] | None | – | Supervised training; some multi-task/denoising |
| Dataset | Modality | Dim. | Primary Segmentation Tasks | Size (Approx.) | Label Type | Weak-Label Availability (Native vs. Constructed) | Public |
|---|---|---|---|---|---|---|---|
| DRIVE [37] | Fundus | 2D | Retinal vessels | 40 images (20 train, 20 test) | Pixel-wise vessel masks (2 annotators for test) | Dense only | Yes |
| STARE [38] | Fundus | 2D | Retinal vessels | 20 images | Pixel-wise vessel masks (2 annotations) | Dense only | Yes |
| CHASE_DB1 [39] | Fundus | 2D | Retinal vessels | 28 images | Pixel-wise vessel masks | Dense only | Yes |
| HRF [40] | Fundus | 2D | Retinal vessels | 45 images (15 + 15 + 15) | Pixel-wise vessel masks | Dense only | Yes |
| DR HAGIS [41] | Fundus | 2D | Retinal vessels | 39 images | Pixel-wise vessel masks | Dense only | Yes |
| RITE [42] | Fundus | 2D | Vessels + artery/vein (A/V) labels | 40 images | Vessel masks + A/V labels | Dense only | Yes |
| IOSTAR [43] | Fundus | 2D | Retinal vessels, OD, A/V ratio | 30 images (1024 × 1024) | Pixel-wise vessels, OD/A/V labels | Dense only | Yes |
| ORIGA(-light) [44] | Fundus | 2D | OD/OC | 650 images (482 healthy, 168 glaucoma) | OD & OC boundaries | Dense only | Partly |
| DRISHTI-GS1 [45] | Fundus | 2D | OD/OC | 101 images (train/test split) | OD & OC masks (multi-expert) | Dense only | Yes |
| DRIONS-DB [46] | Fundus | 2D | OD | 110 images | OD contours (2 annotations) | Dense only | Yes |
| RIM-ONE DL [47] | Fundus | 2D | OD/OC | 485 images (313 normal, 172 glaucoma) | OD/OC masks | Dense only | Yes |
| RIGA [48] | Fundus | 2D | OD/OC | 750 images from 3 sources | OD/OC boundaries (multi-expert) | Dense only | Yes |
| REFUGE/REFUGE2 [49,50] | Fundus | 2D | OD/OC (+glaucoma labels) | 1200 images | OD/OC masks + diagnosis labels | Native image-level + dense | Yes |
| G1020 [51] | Fundus | 2D | OD/OC | 1020 images (724 healthy, 296 glaucoma) | OD/OC masks + labels | Native image-level + dense | Yes |
| IDRiD [52] | Fundus | 2D | Lesion segmentation (MA, EX, HE, soft EX) + OD/FAZ in some splits | 81 images with pixel-wise lesion labels (+extra images for grading) | Pixel-wise lesion masks (+ DR grades) | Native image-level + dense | Yes |
| e-ophtha (MA/EX) [53] | Fundus | 2D | Lesion segmentation (MAs, exudates) | MA: 148 lesion + 233 normal; EX: 47 lesion + 35 normal | Pixel-wise lesion masks | Dense only | Yes |
| SUSTech-SYSU EX [54] | Fundus | 2D | Exudate segmentation | 1400+ images (various subsets) | Pixel-wise exudate masks | Dense only | Yes |
| Duke DME/Duke SD-OCT [55] | OCT | 2D | Fluid regions ± layers | ~110 B-scans from 10 eyes (canonical dataset) | Pixel-wise fluid/layer labels | Dense only | Yes |
| RETOUCH [56] | OCT | 3D | Fluid segmentation (IRF, SRF, PED) | >70 training volumes + test from 3 vendors | Voxel-wise fluid labels | Dense only | Yes |
| OCTA-500 [57] | OCT + OCTA | 3D + 2D | Vessels (large/capillary), arteries/veins, FAZ (2D/3D), retinal layers | 500 subjects, multi-FOV | Multi-label (vessels, FAZ, layers) | Dense only | Yes |
| ROSE [58] | OCTA | 2D | Vessel segmentation, FAZ | 229 images in 3 subsets | Centerline + pixel-wise vessel/FAZ labels | Native sparse + dense | Yes |
| OCT layer challenge datasets [59,60] | OCT | 2D + 3D | Retinal layers | Dozens–hundreds of volumes depending on dataset | Layer boundary/surface annotations | Mixed/unclear | Mixed |
| ID | First Author (Year) | OD/OC Segmentation Role | WSL Methods for OD/OC? | Annotation-Effort Metrics Reported? | WSL vs. FS OD/OC Comparison? | OD/OC Metrics Reported |
|---|---|---|---|---|---|---|
| R15 | Alawad (2022) [1] | Main focus | No systematic WSL; occasional mention of CAM/feature extraction only | No | No | Yes |
| R16 | Goutam (2022) [2] | Component | Only high-level/brief references to weak or label-efficient ideas | No | No | Yes |
| R17 | Anusuya (2023) [3] | Minimal/indirect | None for segmentation; focus on supervised classification and hand-crafted or learned features | No | No | Yes |
| R18 | Zedan (2023) [4] | Component | Minimal; weak supervision not analyzed | No | No | Yes |
| R19 | Rizvana & Narayanan (2024) [5] | Component | Conceptual mentions of CAM/attention as interpretability; no explicit WSL OD/OC segmentation analysis | No | No | Yes |
| ID | First Author (Year) | Modality Focus (Volumetric) | Main Segmentation Tasks | Designs Discussed (2D/3D/Hybrid) | Supervision Regime for These Designs | Quantitative Comparison Between Designs? | Example Datasets Mentioned |
|---|---|---|---|---|---|---|---|
| R6 | Zhang (2025) [27] | Multi-modal medical imaging | Organ/tumour and structure segmentation across CT, MRI, etc. | 2D, 3D, hybrid | Mix of full, weak, semi- and self-supervision; WSL examples mostly non-ocular | Qualitative only | Various CT/MRI datasets |
| R8 | Gao (2025) [10] | Multi-modal medical imaging | General organ and lesion segmentation (CT, MRI, US, OCT as one of many) | 2D, 3D, hybrid | FS training | Qualitative comparison of context vs. computation | Common multi-organ CT/MRI benchmarks; OCT only as minor example |
| R13 | Liu (2023) [32] | Ultrasound | Organ/lesion segmentation with denoising emphasis | 2D, 3D, hybrid | FS training | Qualitative remarks on 2D vs. 3D performance | Several US benchmarks (non-ocular) |
| R14 | Mehrnia (2025) [33] | Lung CT | Lung tumour/lesion segmentation | 2D only | FS training | No | Lung CT datasets |
| R20 | Xue (2024) [7] | Intravascular OCT | Plaque component segmentation | 2D, 3D; hybrid | FS training | Qualitative comparison of 2D vs. 3D | Intravascular OCT datasets from specific centres |
| R21 | Chen (2025) [6] | OCTA | Vessels, plexus segmentation | 2D, 3D, hybrid | FS training | Qualitative discussion of 2D vs. 3D vs. hybrid designs | OCTA-500, ROSE and other OCTA datasets |
| Category | Item to Report | Minimum Details |
|---|---|---|
| Task & data | Modality/task/dimension | Fundus OD/OC (2D); OCT fluid (3D); OCTA vessels + FAZ (2D/3D). |
| Dataset | Dataset name + split | Dataset version; train/val/test split; cross-validation if used. |
| Label source | Label type | Image-level, points, scribbles, boxes, partial masks, temporal/clinical labels. |
| Weak-label availability | Native vs. constructed | Native weak labels or constructed from dense masks. |
| Annotation effort | Cost metric | Images labelled; clicks/scribbles; minutes per image; annotator expertise; tools used. |
| Baselines | Matched fully supervised baseline | Same backbone, same split, same preprocessing; training budget matched. |
| Method | Weak supervision mechanism | CAM/MIL, region growing/propagation, graph constraints, shape priors, pseudo-labelling, SSL/CL pretraining. |
| Training details | Core settings | Input resolution, augmentations, loss terms, optimizer/LR, epochs, early stopping. |
| Evaluation | Metrics + protocol | Dice/IoU, boundary metrics, AUPR (vessels), FAZ error; volume-wise vs. slice-wise rules. |
| Statistics | Uncertainty/testing | Confidence intervals; repeated runs; Wilcoxon/Friedman if applicable. |
| Reproducibility | Code + model availability | Code link, pretrained weights, seed control, hardware. |
| Bias & limitations | Failure modes | Domain shift, class imbalance, label noise, leakage risks. |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Penedo, P.; Machado, J.; Anjos, R.; Marta, A.; Silva, A.C.; Cunha, A. Weakly Supervised Deep Learning for Ocular Image Segmentation: A Systematic Review of Fundus and OCT Methods. Appl. Sci. 2026, 16, 2241. https://doi.org/10.3390/app16052241
Penedo P, Machado J, Anjos R, Marta A, Silva AC, Cunha A. Weakly Supervised Deep Learning for Ocular Image Segmentation: A Systematic Review of Fundus and OCT Methods. Applied Sciences. 2026; 16(5):2241. https://doi.org/10.3390/app16052241
Chicago/Turabian StylePenedo, Pedro, Jorge Machado, Rita Anjos, Ana Marta, Aristófanes Corrêa Silva, and António Cunha. 2026. "Weakly Supervised Deep Learning for Ocular Image Segmentation: A Systematic Review of Fundus and OCT Methods" Applied Sciences 16, no. 5: 2241. https://doi.org/10.3390/app16052241
APA StylePenedo, P., Machado, J., Anjos, R., Marta, A., Silva, A. C., & Cunha, A. (2026). Weakly Supervised Deep Learning for Ocular Image Segmentation: A Systematic Review of Fundus and OCT Methods. Applied Sciences, 16(5), 2241. https://doi.org/10.3390/app16052241

