Implementation of the VGG19 Model with Transfer Learning for Retinal Disease Diagnosis: A Study on Normal Eyes, Diabetic Retinopathy, Cataract, and Glaucoma Datasets

Kharisma, Ivana Lucia; Susanti,; Rustiani,; Pratama, Riski Abdilah; Kamdan,

doi:10.3390/engproc2025107111

Open AccessProceeding Paper

Implementation of the VGG19 Model with Transfer Learning for Retinal Disease Diagnosis: A Study on Normal Eyes, Diabetic Retinopathy, Cataract, and Glaucoma Datasets^†

by

Ivana Lucia Kharisma

^*

,

Susanti

,

Rustiani

,

Riski Abdilah Pratama

and

Kamdan

Faculty of Engineering, Computer and Design, Nusa Putra University, Sukabumi 43152, Indonesia

^*

Author to whom correspondence should be addressed.

^†

Presented at the 7th International Global Conference Series on ICT Integration in Technical Education & Smart Society, Aizuwakamatsu City, Japan, 20–26 January 2025.

Eng. Proc. 2025, 107(1), 111; https://doi.org/10.3390/engproc2025107111

Published: 25 September 2025

(This article belongs to the Proceedings of The 7th International Global Conference Series on ICT Integration in Technical Education & Smart Society)

Download

Browse Figures

Versions Notes

Abstract

Retinal disorders, such as diabetic retinopathy, cataract, and glaucoma, are among the leading causes of vision loss and blindness worldwide. The use of normal data in diagnostic studies provides a basis for distinguishing between pathological and healthy conditions. Complete and accurate diagnosis of these conditions is essential for effective treatment and prevention of recurrence. This study focuses on the VGG19 model and transfer learning to classify retinal conditions such as normal, diabetic, cataract, and glaucoma. A publicly available dataset from Kaggle consisting of labeled retinal images is used for training and evaluation. The data used in this study consists of 400 retinal images, each consisting of 100 images per class, where there are four classes consisting of normal eyes, cataract, diabetic retinopathy and glaucoma. In 50 epochs of training, Adam optimization and softmax function activation, the modeling performance measured using the confusion matrix, including the accuracy, precision, recall and F1 score, achieves accuracy results of 0.91 for 320 training data and 0.88 for 80 validation data. The loss value is 0.18 for the training data and 0.31 for the validation data. Using the test data, the values of the cataract class are 0.94 for precision, 0.8 for recall, and 0.86 for the F1 score. The values are 0.91 for precision, 1.00 for recall and 0.95 for the F1 score in the diabetic retinopathy class. For glaucoma, the scores are 0.74 for precision, 0.85 for recall, and 0.79 for the F1 score. The normal class has scores of 1.00 for precision, 0.9 for recall and 0.95 for the F1 score. Given the performance test results shown above, VGG19 modeling for diagnosing retinal disease provides quite good results. Future research can expand this research by combining additional datasets and exploring other neural network architectures to improve the diagnostic performance.

Keywords:

eye diseases; diabetic retinopathy; cataracts; glaucoma; VGG19; transfer learning

1. Introduction

Retinal disorders, such as diabetic retinopathy, cataract, and glaucoma, are among the leading causes of vision loss and blindness worldwide [1,2]. These conditions have a significant impact on quality of life and place a burden on the healthcare system [3]. In addition to disease, the use of normal data in diagnostic studies provides a basis for distinguishing between pathological and healthy conditions [4]. Complete and accurate diagnosis of these conditions is essential for effective treatment and prevention of recurrence [5]. However, physical retinal examination requires a lot of time, effort, and sensitivity to individual differences, necessitating the use of automated diagnostic tools [6].

Deep learning, particularly convolutional neural networks (CNNs), has helped the field of medical image analysis by providing powerful solutions for image classification and feature extraction [7]. Among the available CNN architectures, the VGG19 model stands out due to its robustness and effectiveness in capturing complex features in data [8]. The use of transfer learning, which uses a previously trained model to adapt to new tasks, is increasing model efficiency, particularly when dealing with large amounts of data [9]. The goal of this study is to improve the accuracy and efficiency of diagnosing retinal diseases by implementing transfer learning with the VGG19 model [10,11]

This study focuses on the identification of retinal diseases in four categories: normal, diabetic, cataract, and glaucoma [12]. Including normal data allows for clearer distinctions between healthy and unhealthy conditions, which improves the diagnostic accuracy [13]. The collection of retinal data available to the public is used to evaluate the performance of the VGG19 architecture in classifying this condition [14]. This study also aims to investigate the model’s ability to generalize across multiple disease categories and its potential application in clinical practice [15].

With this goal in mind, our research aims to contribute to the rapidly growing field of healthcare solutions provided by AI [16]. The goal of this project is to foster the development of efficient, accurate and reliable diagnostic tools that can reduce the amount of global waste generated and improve the quality of environmental protection [17].

1.1. Previous Related Research

Previous research [18] focused on one type of diabetic retinopathy (DR), which is caused by diabetes and leads to vision loss, using transfer learning and a previously trained VGGNet network to analyze retinal fundus image features, as shown in Figure 1 below.

Likewise, the present research will focus on three types of retinal diseases using the VGG19 model and transfer learning from a dataset containing normal eyes, diabetic retinopathy, cataracts, and glaucoma. The purpose of this study is to improve the classification accuracy by comparing the models for three types of eye conditions.

1.2. VGG19 Architecture

In simple terms, the VGG19 model works with a three-part architecture: backbone, pooling, and head, which is used to classify retinal diseases such as normal, diabetic retinopathy, cataract, and glaucoma using ImageNet transfer learning [20]. The backbone uses a 3 × 3 matrix with filters that gradually increase from 64 to 512, as shown in the Figure 2 below.

Pooling reduces the spatial dimensions and uses maximum pooling to ensure computer efficiency while retaining important information. Finally, the head converts the features into one dimension and optimizes them using a spreadsheet, followed by a softmax spreadsheet to determine the probability of reaching the target class level.

This architecture allows for efficient feature extraction and accurate classification for multi-layer systems.

2. Materials and Methods

This research was carried out in several stages, as shown in Figure 3 below.

In Figure 3, there are learning tasks that are completed sequentially from beginning to end, interconnected, and structured to ensure that all the learning is logical.

2.1. Dataset Retrieval

The data used in this study was obtained from Kaggle, a website that provides high-quality data and information for machine learning. The dataset consists of retinal images divided into four categories: 1038 cataract, 1098 diabetic retinopathy, 1007 glaucoma, and 1074 normal vision images, totaling 4217 images [19]. But to maximize the training time and due to hardware limitations, not all the data is included in the VGG19 model training process. The data used consists of 400 images for 4 classes, with each class consisting of 100 images. This labeled table is essential for model training and evaluation to ensure accurate retinal disease diagnosis, as shown in Figure 4 below.

Figure 4 shows that the dataset consists of four types of retinal diseases, with a total of 400 images used to develop the retinal disease diagnosis model.

2.2. Training Dataset

In this study, the dataset was divided into two parts: 80% for training and 20% for validation, which will be used also as testing images. The total number of images was 400 images, which were classified into four categories: normal, cataract, diabetic retinopathy, and glaucoma. Each class level has approximately 80 images for training and 20 images for validation, as illustrated in Figure 5.

To evaluate the model performance, key metrics such as the accuracy (A), recall (R), F1 score (F1), and precision (P), which are based on standard formulas, are used.

3. Results

The model was previously trained in the Jupyter Notebook version 7.3.2 environment via the Google Colab training process. The number of images was 400, divided into four categories: normal, diabetic retinopathy, cataract, and glaucoma, with 320 images used for training and 80 images for validation, which were also used as test images. The modeling was carried out with the VGG19 transfer learning model, with parameters used to optimize the model performance. The evaluation was carried out using the accuracy, loss, precision, recall and F1 score metrics to assess the effectiveness of the model in diagnosing retinal diseases based on medical images. The model performance against the test data is shown in Table 1 below.

The results of training the VGG19 model with transfer learning show that the model has the best performance in detecting the normal class, with precision of 1.00 and an F1 score of 0.95, while the worst performance is in the glaucoma class, with precision of 0.74 and an F1 score of 0.79. The highest F1 score values are found in the diabetic retinopathy (0.95) and normal classes, which shows that the model is more effective in detecting diabetic retinopathy than other diseases. Therefore, improving the model architecture or using data augmentation techniques may be a solution to improve the accuracy of retinal disease diagnosis.

We can visualize the model performance for the training and validation data using a curve diagram, and for the test data using an ROC curve, as shown in Figure 6 below.

From Figure 6 above, looking at the ROC curve results, VGG19 modeling is able to predict the classes according to the actual class values quite well, as seen from the high true positive rate in each class. In the cataract class, it is 1.00; in diabetic retinopathy, it is 0.99; in glaucoma, it is 0.94; and in normal, it is 0.98.

Next, we will visualize the performance model using the test data using the confusion matrix, as shown in Figure 7 below.

The confusion matrix of the VGG19 model with transfer learning is displayed in the figure above, showing how the model classifies four types of eye disease: normal, diabetic retinopathy, cataract, and glaucoma. Based on the confusion matrix, the model has the best performance in detecting diabetic retinopathy, with a total of 20 predictions, in addition to normal 18 predictions, glaucoma 17 predictions and cataract 16 predictions.

Despite the fact that this model has limitations in differentiating various types of eye diseases, especially cataract, it excels in classifying diabetic retinopathy and normal eyes, as characterized by the high prediction rate. This shows that the model is quite effective in determining the typical patterns of these two diseases. The classification that occurs can be performed based on similar visual features between categories or by synchronizing the amount of data in each class. Therefore, improving the quality of datasets and using data augmentation techniques can help improve the model accuracy, especially in terms of the class categorization.

At the end of testing, the VGG19 model with transfer learning was tested to classify retinal diseases. In the image below, the model successfully identifies images with some predictions based on their category, as stated in Figure 8 below.

The results show that the model can classify all 10 images precisely, indicating its suitability with all the true labels of the data. More extensive evaluation using larger datasets and model refinement techniques could improve the accuracy in terms of detecting different types of retinal disease.

4. Conclusions

The use of the VGG19 model and the transfer learning method for retinal disease classification has been completed. The study concluded that the VGG19 model had the best performance in detecting normal eyes, with a precision of 1.00 and a recall of 0.90, while the worst performance was found in detecting glaucoma, with a precision of 0.74 and a recall of 0.79. The highest F1 score was found in the diabetic retinopathy and normal classes (0.95), which shows that the model is more effective in classifying these types of disease compared to other types of disease.

Evaluation using the confusion matrix showed that the model achieved accuracy results of 0.91 for 320 training data and 0.88 for 80 validation data. The loss value was 0.18 for the training data and 0.31 for the validation data. Although the results are impressive, this model still has challenges in categorizing several types of retinal diseases, including glaucoma. The high probability of classification occurring is caused by visual similarities between categories or inconsistent data distribution in a dataset. Therefore, if this method is to be used consistently in medical practice, it is necessary to consider various other factors, such as clinical need, data availability, and priority of use based on the available resources.

Overall, this research found that the VGG19 model with transfer learning has the potential to be used as a diagnostic tool for retinal diseases based on medical images. However, to improve the model performance, further research can be conducted by expanding the dataset, using data augmentation techniques, and exploring more complex network architectures. With further development, this model will be able to make a significant contribution to the field of ophthalmology and enable more accurate and efficient detection of retinal diseases.

Author Contributions

Conceptualization, S., R., R.A.P. and I.L.K.; Methodology, I.L.K., S. and K.; Software, S., R.A.P., R. and I.L.K.; Validation, I.L.K. and K.; Formal Analysis, S.; Resources, S., R. and R.A.P.; Writing—Original Draft Preparation, S., R., I.L.K. and R.A.P.; Writing—Review and Editing, S., I.L.K. and K. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are available on request from the corresponding author.

Conflicts of Interest

The authors declare no conflict of interest.

References

Kuper, H.; Foster, A. Impact of VISION 2020 on Global Blindness. Can. J. Ophthalmol. 2006, 41, 669–671. [Google Scholar] [CrossRef] [PubMed]
Vardhan, K.B.; Nidhish, M.; Kiran, C.S.; Shameem, D.N.; Charan, V.S.; Bhavadharini, R.M. Eye Disease Detection Using Deep Learning Models with Transfer Learning Techniques. EAI Endorsed Trans. Scalable Inf. Syst. 2024, 11. [Google Scholar] [CrossRef]
Assi, L.; Chamseddine, F.; Ibrahim, P.; Sabbagh, H.; Rosman, L.; Congdon, N.; Evans, J.; Ramke, J.; Kuper, H.; Burton, M.J.; et al. A Global Assessment of Eye Health and Quality of Life: A Systematic Review of Systematic Reviews. JAMA Ophthalmol. 2021. [Google Scholar] [CrossRef]
Aberegg, S.K.; Johnson, S.A. When Alternative Diagnoses Are More Likely Than Pulmonary Embolism: A Paradox. Ann. Am. Thorac. Soc. 2020, 17, 670–672. [Google Scholar] [CrossRef]
de Carlo, T.E.; Romano, A.; Waheed, N.K.; Duker, J.S. A Review of Optical Coherence Tomography Angiography (OCTA). Eye Vis. 2015, 2, 5. [Google Scholar] [CrossRef] [PubMed]
Marrugo, A.G.; Millán, M.S.; Cristóbal, G.; Gabarda, S.; Šorel, M.; Šroubek, F. Toward Computer-Assisted Diagnosis and Telemedicine in Ophthalmology. SPIE Newsroom 2012. [Google Scholar] [CrossRef]
Greenspan, H.; Van Ginneken, B.; Summers, R.M. Deep Learning in Medical Imaging: Overview and Future Promise of an Exciting New Technique. IEEE Trans. Med. Imaging 2016, 35, 1153–1159. [Google Scholar] [CrossRef]
Sharma, A.; Mittal, S. Deep Learning-Based Ocular Disease Screening: Comparative Performance of CNN and VGG19. In Proceedings of the 2024 International Conference on Artificial Intelligence and Emerging Technology (Global AI Summit), Greater Noida, India, 5–7 September 2024; pp. 436–440. [Google Scholar]
Rodman, E.; Williams, N.W. Transfer Learning: A New Paradigm for Big Data Research. Available online: https://ssrn.com/abstract=4555572 (accessed on 20 June 2025).
Proceedings of the 4th International Conference on Communication and Electronics Systems (ICCES 2019), Coimbatore, India, 17–19 July 2019; IEEE: Piscataway, NJ, USA, 2019.
Choudhary, A.; Ahlawat, S.; Urooj, S.; Pathak, N.; Lay-Ekuakille, A.; Sharma, N. A Deep Learning-Based Framework for Retinal Disease Classification. Healthcare 2023, 11, 212. [Google Scholar] [CrossRef] [PubMed]
Kaushik, P.; Sharma, P. Efficient Eye Disease Detection: A Deep Learning Approach with InceptionResNetV2 for Retinal Image Classification. In Proceedings of the 2024 International Conference on Advances in Computing, Communication and Materials (ICACCM), Dehradun, India, 13–14 September 2024; pp. 1–5. [Google Scholar] [CrossRef]
Pepe, M.S.; Thompson, M.L. Combining Diagnostic Test Results to Increase Accuracy. Biostatistics 2000, 1, 123–140. [Google Scholar] [CrossRef] [PubMed]
Singla, S.; Gupta, R. Diabetic Retinopathy Diagnosis Using VGG-19 Deep Learning Model. In Proceedings of the 2024 3rd International Conference for Advancement in Technology (ICONAT), Goa, India, 19–20 January 2024; pp. 1–4. [Google Scholar] [CrossRef]
Kalivaraprasad, B.; Prasad, M.V.D.; Bharathi, H. Enhancing Disease Diagnosis through Transfer Learning: Improving Accuracy and Generalization. J. Electr. Syst. 2024, 20, 1424–1433. [Google Scholar] [CrossRef]
Rastogi, M. The Growth and Potential of AI Applications in Medicine and Healthcare. Indian J. Appl. Res. 2023, 13, 17–18. [Google Scholar] [CrossRef]
Street, A.; Vernooij, E.; Rogers, M.H. Diagnostic Waste: Whose Responsibility? Glob. Health 2022, 18, 23. [Google Scholar] [CrossRef] [PubMed]
Sudha, V.; Ganeshbabu, T.R. A Convolutional Neural Network Classifier VGG-19 Architecture for Lesion Detection and Grading in Diabetic Retinopathy Based on Deep Learning. Comput. Mater. Continua 2021, 66, 827–842. [Google Scholar] [CrossRef]
Doddi, G.V. Eye Diseases Classification (Eye Disease Retinal Images). Available online: https://www.kaggle.com/datasets/gunavenkatdoddi/eye-diseases-classification (accessed on 20 June 2025).
Ogden, J.; Lee, D.M.; Moradi, H. Transfer Learning for Classification of Retinal Disease Using Fundus Imaging. In Proceedings of the 2023 IEEE 11th International Conference on Healthcare Informatics (ICHI), Houston, TX, USA, 26–29 June 2023; pp. 510–512. [Google Scholar] [CrossRef]
Computer Vision 101. Available online: https://media.geeksforgeeks.org/wp-content/uploads/20240712183746/VGG--19-Architecture-.webp (accessed on 20 June 2025).

Figure 1. (a) Normal, (b) diabetic retinopathy, (c) cataract, and (d) glaucoma [19].

Figure 2. VGG19 architecture. Reprinted with permission Ref. [21]. 2025, GeeksforGeeks.

Figure 3. Research flowchart.

Figure 4. A number of images in the retina dataset obtained from Kaggle [19].

Figure 5. Percentage division of the dataset.

Figure 6. Evaluation and performance curve of the VGG19 model with transfer learning.

Figure 7. Confusion matrix detection in the VGG19 model with transfer learning.

Figure 8. Prediction results.

Table 1. Performance of test data.

Class	Precision	Recall	F1 Score
Cataract	0.94	0.80	0.86
Diabetic Retinopathy	0.91	1.00	0.95
Glaucoma	0.74	0.85	0.79
Normal	1.00	0.90	0.95

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Kharisma, I.L.; Susanti; Rustiani; Pratama, R.A.; Kamdan. Implementation of the VGG19 Model with Transfer Learning for Retinal Disease Diagnosis: A Study on Normal Eyes, Diabetic Retinopathy, Cataract, and Glaucoma Datasets. Eng. Proc. 2025, 107, 111. https://doi.org/10.3390/engproc2025107111

AMA Style

Kharisma IL, Susanti, Rustiani, Pratama RA, Kamdan. Implementation of the VGG19 Model with Transfer Learning for Retinal Disease Diagnosis: A Study on Normal Eyes, Diabetic Retinopathy, Cataract, and Glaucoma Datasets. Engineering Proceedings. 2025; 107(1):111. https://doi.org/10.3390/engproc2025107111

Chicago/Turabian Style

Kharisma, Ivana Lucia, Susanti, Rustiani, Riski Abdilah Pratama, and Kamdan. 2025. "Implementation of the VGG19 Model with Transfer Learning for Retinal Disease Diagnosis: A Study on Normal Eyes, Diabetic Retinopathy, Cataract, and Glaucoma Datasets" Engineering Proceedings 107, no. 1: 111. https://doi.org/10.3390/engproc2025107111

APA Style

Kharisma, I. L., Susanti, Rustiani, Pratama, R. A., & Kamdan. (2025). Implementation of the VGG19 Model with Transfer Learning for Retinal Disease Diagnosis: A Study on Normal Eyes, Diabetic Retinopathy, Cataract, and Glaucoma Datasets. Engineering Proceedings, 107(1), 111. https://doi.org/10.3390/engproc2025107111

Article Menu

Implementation of the VGG19 Model with Transfer Learning for Retinal Disease Diagnosis: A Study on Normal Eyes, Diabetic Retinopathy, Cataract, and Glaucoma Datasets^†

Abstract