Multi-View Machine Learning with an Optic Disc Localization for Glaucoma Diagnosis
Abstract
1. Introduction
2. Methods
2.1. Data Description
2.2. Data Preprocessing
2.3. Data Partitioning

2.4. Multi-View Network
2.4.1. Original Fundus Image View
2.4.2. Cropped Optic Disc View
2.4.3. Computing Environment
3. Results
| Views | Loss | Accuracy | AUC | Precision | Recall | F1-Score |
|---|---|---|---|---|---|---|
| Original | 0.2633 ± 0.0084 | 0.8874 ± 0.0029 | 0.9391 ± 0.0027 | 0.7881 ± 0.0234 | 0.7081 ± 0.0312 | 0.7450 ± 0.0097 |
| Cropped | 0.2547 ± 0.0259 | 0.9011 ± 0.0071 | 0.9452 ± 0.0065 | 0.8064 ± 0.0231 | 0.7590 ± 0.0547 | 0.7802 ± 0.023 |
| Multiview | 0.2314 ± 0.0136 | 0.9048 ± 0.0053 | 0.9514 ± 0.0045 | 0.8195 ± 0.0156 | 0.7590 ± 0.0401 | 0.7872 ± 0.0178 |


4. Discussion
4.1. Analysis of PCA
4.2. Data Imbalanced

4.3. Data Augmentation
4.4. Multiview Weight
4.5. Fine-Tuning and Backbone
4.6. Model Training Setup
4.7. Grad-CAM Analysis
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- World Health Organization. World Report on Vision; World Health Organization: Geneva, Switzerland, 2019; Available online: https://www.who.int/publications/i/item/world-report-on-vision (accessed on 16 April 2025).
- Lee, D.A.; Higginbotham, E.J. Glaucoma and its treatment: A review. Am. J. Health-Syst. Pharm. 2005, 62, 691–699. [Google Scholar] [CrossRef] [PubMed]
- Gopi, V.P.; Anjali, M.S.; Niwas, S.I. PCA-based localization approach for segmentation of optic disc. Int. J. Comput. Assist. Radiol. Surg. 2017, 12, 2195–2204. [Google Scholar] [CrossRef] [PubMed]
- Chiang, C.C.; Lin, P.Y.; Lin, Y.C.; Chen, H.Y. Deep learning-based glaucoma detection in highly myopic populations using fundus images. Biomedicines 2024, 12, 1394. [Google Scholar] [CrossRef] [PubMed]
- Saha, S.; Islam, M.S.; Hossain, M.A.; Rahman, M.M. A fully automated deep learning-based system for glaucoma detection using fundus images. Sci. Rep. 2023, 13, 18607. [Google Scholar] [CrossRef] [PubMed]
- Islam, M.M.; Rahman, M.A.; Saha, S.; Aktar, S. Multimodal deep learning framework for glaucoma detection using fundus and OCT images. Sci. Rep. 2025, 15, 12034. [Google Scholar]
- Al-Mahrooqi, A.; Medvedev, D.; Muhtaseb, R.; Yaqub, M. Gardnet: Robust Multi-View Network for Glaucoma Classification in Color Fundus Images. In Proceedings of the International Workshop on Ophthalmic Medical Image Analysis; Springer: Cham, Switzerland, 2022; pp. 152–161. [Google Scholar]
- Issac, A.; Sarathi, M.P.; Dutta, M.K. An Adaptive Threshold Based Image Processing Technique for Improved Glaucoma Detection and Classification. Comput. Methods Programs Biomed. 2015, 122, 229–244. [Google Scholar] [CrossRef] [PubMed]
- Soorya, M.; Issac, A.; Dutta, M.K. An Automated and Robust Image Processing Algorithm for Glaucoma Diagnosis from Fundus Images Using Novel Blood Vessel Tracking and Bend Point Detection. Int. J. Med. Inform. 2018, 110, 52–70. [Google Scholar]
- Shanmugam, P.; Raja, J.; Pitchai, R. An Automatic Recognition of Glaucoma in Fundus Images Using Deep Learning and Random Forest Classifier. Appl. Soft Comput. 2021, 109, 107512. [Google Scholar] [CrossRef]
- Yin, P.; Xu, Y.; Zhu, J.; Liu, J.; Yi, C.A.; Huang, H.; Wu, Q. Deep Level Set Learning for Optic Disc and Cup Segmentation. Neurocomputing 2021, 464, 330–341. [Google Scholar] [CrossRef]
- Hervella, Á.S.; Rouco, J.; Novo, J.; Ortega, M. End-to-end multi-task learning for simultaneous optic disc and cup segmentation and glaucoma classification in eye fundus images. Appl. Soft Comput. 2022, 116, 108347. [Google Scholar] [CrossRef]
- Ronneberger, O.; Fischer, P.; Brox, T. U-Net: Convolutional Networks for Biomedical Image Segmentation. In Proceedings of the 18th International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI 2015); Springer: Cham, Switzerland, 2015; Part III; pp. 234–241. [Google Scholar]
- Mora, A.R. Deep Learning-Based Prediction of Multiple Ocular Diseases Using CLAHE Enhancement. Bacherlor’s Thesis, Tilburg University, Tilburg, The Netherlands, 2024. [Google Scholar]
- Wanling, W.; Mohamed Shah, N. Fundus Image Enhancement Using CLAHE. New Explor. Electr. Eng. 2025, 1, 68–79. [Google Scholar] [CrossRef]
- Mittapalli, P.S.; Kande, G.B. Segmentation of Optic Disk and Optic Cup from Digital Fundus Images for the Assessment of Glaucoma. Biomed. Signal Process. Control 2016, 24, 34–46. [Google Scholar] [CrossRef]
- Berndt-Schreiber, M. Morphological Operations in Fundus Image Analysis. J. Med. Inform. Technol. 2007, 11, 79–86. [Google Scholar]
- Tan, M.; Le, Q. EfficientNetV2: Smaller Models and Faster Training. In Proceedings of the International Conference on Machine Learning, PMLR, Virtual, 18–24 July 2021; pp. 10096–10106. [Google Scholar]
- Tan, M.; Le, Q. EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks. In Proceedings of the International Conference on Machine Learning, Long Beach, CA, USA, 9–15 June 2019; pp. 6105–6114. [Google Scholar]
- Tan, M.; Le, Q.V. MixConv: Mixed Depthwise Convolutional Kernels. arXiv 2019, arXiv:1907.09595. [Google Scholar] [CrossRef]
- Selvaraju, R.R.; Cogswell, M.; Das, A.; Vedantam, R.; Parikh, D.; Batra, D. Grad-CAM: Visual Explanations from Deep Networks via Gradient-Based Localization. In Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy, 22–29 October 2017; IEEE: Piscataway, NJ, USA, 2017; pp. 618–626. [Google Scholar]
- Chakravarty, A.; Sivswamy, J. Joint optic disc and cup segmentation and glaucoma classification using multi-task convolutional neural networks. IEEE Trans. Med. Imaging 2018, 37, 2496–2507. [Google Scholar]
- Hemelings, R.; Elen, B.; Barbosa-Breda, J.; Stalmans, I.; Van Keer, K. Deep learning for glaucoma detection and segmentation beyond the optic disc region. IEEE J. Biomed. Health Inform. 2021, 25, 1419–1427. [Google Scholar]
- Pacal, I.; Celik, O.; Bayram, B.; Cunha, A. Enhancing EfficientNetv2 with global and efficient channel attention mechanisms for accurate MRI-Based brain tumor classification. Clust. Comput. 2024, 27, 11187–11212. [Google Scholar] [CrossRef]
- Gang, S.; Fabrice, N.; Chung, D.; Lee, J. Character recognition of components mounted on printed circuit board using deep learning. Sensors 2021, 21, 2921. [Google Scholar] [CrossRef] [PubMed]










| Views | Fold 1 | Fold 2 | Fold 3 | Fold 4 | Fold 5 | Mean ± SD |
|---|---|---|---|---|---|---|
| Original | 0.9356 | 0.9409 | 0.9377 | 0.9378 | 0.9435 | 0.9391 ± 0.0028 |
| Cropped | 0.9516 | 0.9426 | 0.9341 | 0.9468 | 0.9510 | 0.9452 ± 0.0065 |
| Multiview | 0.9556 | 0.9489 | 0.9447 | 0.9508 | 0.9568 | 0.9514 ± 0.0045 |
| Comparison | Mean AUC Difference | p-Value | Significance |
|---|---|---|---|
| Multiview vs. Original | 0.0123 | 0.0061 | Significant () |
| Multiview vs. Cropped | 0.0061 | 0.0707 | Not significant () |
| Study | Method/Model | Dataset | AUC |
|---|---|---|---|
| Mahrooqi et al. (2022) [7] | Multi-view (GARDNet) | EyePACS/RIM-ONE DL | 0.92–0.93 |
| Chakravarty & Sivswamy (2018) [22] | Joint OD/OC segmentation + CNN | REFUGE | 0.95 |
| Hemelings et al. (2021) [23] | Cropping-based CNN | UZL/REFUGE | 0.94 |
| Chiang et al. (2024) [4] | Deep learning model | 3088 clinical images | 0.894 |
| Proposed (This Work) | Multi-view network | 14,255 clinical dataset | 0.951 |
| Setting | VF | HF | R | B | C | S | H | D | Original AUC | Cropped AUC |
|---|---|---|---|---|---|---|---|---|---|---|
| No Aug | ||||||||||
| 1 | 0.5 | 0.5 | ||||||||
| 2 | ||||||||||
| 3 | 0.5 | 0.5 | ||||||||
| 4 | ||||||||||
| 5 | 0.3 | |||||||||
| 6 | 0.5 | 0.5 | ||||||||
| 7 | 0.5 | 0.5 | 0.3 | 0.9391 ± 0.0027 | 0.9452 ± 0.0065 |
| Multiview Weights | Experimental Evaluation | |||||
|---|---|---|---|---|---|---|
| Original | Cropped | Accuracy | AUC | Precision | Recall | F1-Score |
| 0.9 | 1.1 | 0.9022 ± 0.0046 | 0.9518 ± 0.0042 | 0.8176 ± 0.0188 | 0.7477 ± 0.0402 | 0.7800 ± 0.0164 |
| 0.8 | 1.2 | 0.9032 ± 0.0051 | 0.9519 ± 0.0042 | 0.8186 ± 0.0143 | 0.7512 ± 0.0411 | 0.7825 ± 0.0180 |
| 0.7 | 1.3 | 0.9039 ± 0.0049 | 0.9519 ± 0.0043 | 0.8197 ± 0.0148 | 0.7534 ± 0.0409 | 0.7842 ± 0.0176 |
| 0.6 | 1.4 | 0.9047 ± 0.0043 | 0.9517 ± 0.0044 | 0.8214 ± 0.0147 | 0.7555 ± 0.0396 | 0.7861 ± 0.0163 |
| 0.5 | 1.5 | 0.9048 ± 0.0053 | 0.9514 ± 0.0045 | 0.8195 ± 0.0156 | 0.7590 ± 0.0401 | 0.7872 ± 0.0178 |
| 0.4 | 1.6 | 0.9034 ± 0.0044 | 0.9509 ± 0.0045 | 0.8150 ± 0.0153 | 0.7576 ± 0.0423 | 0.7842 ± 0.0174 |
| 0.3 | 1.7 | 0.9025 ± 0.0048 | 0.9502 ± 0.0046 | 0.8114 ± 0.0148 | 0.7583 ± 0.0434 | 0.7829 ± 0.0182 |
| 0.2 | 1.8 | 0.9021 ± 0.0063 | 0.9493 ± 0.0047 | 0.8099 ± 0.0177 | 0.7583 ± 0.0499 | 0.7818 ± 0.0222 |
| 0.1 | 1.9 | 0.9009 ± 0.0067 | 0.9480 ± 0.0050 | 0.8068 ± 0.0198 | 0.7509 ± 0.0537 | 0.7794 ± 0.0237 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Siying, P.; Muangphara, T.; Photun, A.; Suppalap, S.; Klinsuwan, T.; Phruancharoen, C.; Treeyawedkul, S.; Chira-adisai, T.; Supattanawong, Y.; Wangkeeree, R. Multi-View Machine Learning with an Optic Disc Localization for Glaucoma Diagnosis. Appl. Sci. 2026, 16, 3158. https://doi.org/10.3390/app16073158
Siying P, Muangphara T, Photun A, Suppalap S, Klinsuwan T, Phruancharoen C, Treeyawedkul S, Chira-adisai T, Supattanawong Y, Wangkeeree R. Multi-View Machine Learning with an Optic Disc Localization for Glaucoma Diagnosis. Applied Sciences. 2026; 16(7):3158. https://doi.org/10.3390/app16073158
Chicago/Turabian StyleSiying, Parichat, Thitima Muangphara, Aphinan Photun, Siwakon Suppalap, Thitiphat Klinsuwan, Chatmongkol Phruancharoen, Sirinan Treeyawedkul, Tanate Chira-adisai, Ying Supattanawong, and Rabian Wangkeeree. 2026. "Multi-View Machine Learning with an Optic Disc Localization for Glaucoma Diagnosis" Applied Sciences 16, no. 7: 3158. https://doi.org/10.3390/app16073158
APA StyleSiying, P., Muangphara, T., Photun, A., Suppalap, S., Klinsuwan, T., Phruancharoen, C., Treeyawedkul, S., Chira-adisai, T., Supattanawong, Y., & Wangkeeree, R. (2026). Multi-View Machine Learning with an Optic Disc Localization for Glaucoma Diagnosis. Applied Sciences, 16(7), 3158. https://doi.org/10.3390/app16073158

