Refined IDRiD: An Enhanced Dataset for Diabetic Retinopathy Segmentation with Expert-Validated Annotations and Comprehensive Anatomical Context
Abstract
1. Introduction
2. Data Description
2.1. Dataset Overview
2.2. Image Preprocessing
2.3. Annotation Architecture
2.4. Dataset Statistics
2.5. Lesion Prevalence
3. Method
3.1. Original Dataset
3.2. Expert Team and Annotation Workflow
3.3. Proliferative Lesion Annotation
3.4. Anatomical Context Integration
3.5. Data Preprocessing and Format Standardization
4. Technical Validation
4.1. Inter-Rater Reliability
4.2. Reliability Comparison with Original Annotations
4.3. Correlation Analysis: Original vs. Refined Annotations
4.4. Clinical Validation
5. Usage Notes
5.1. Data Loading
5.2. Class Imbalance Handling
5.3. Multi-Task Learning Architecture and Future Directions
5.4. Clinical Application Considerations
5.5. Limitations
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Leley, S.P.; Ciulla, T.A.; Bhatwadekar, A.D. Diabetic Retinopathy in the Aging Population: A Perspective of Pathogenesis and Treatment. Clin. Interv. Aging 2021, 16, 1367–1378. [Google Scholar] [CrossRef] [PubMed]
- Porwal, P.; Pachade, S.; Kamble, R.; Kokare, M.; Deshmukh, G.; Sahasrabuddhe, V.; Meriaudeau, F. Indian Diabetic Retinopathy Image Dataset (IDRiD): A Database for Diabetic Retinopathy Screening Research. Data 2018, 3, 25. [Google Scholar] [CrossRef]
- Li, T.; Gao, Y.; Wang, K.; Guo, S.; Liu, H.; Kang, H. Diagnostic Assessment of Deep Learning Algorithms for Diabetic Retinopathy Screening. Inf. Sci. 2019, 501, 511–522. [Google Scholar] [CrossRef]
- Decencière, E.; Zhang, X.; Cazuguel, G.; Lay, B.; Cochener, B.; Trone, C.; Gain, P.; Ordóñez, R.; Massin, P.; Erginay, A.; et al. TeleOphta: Machine Learning and Image Processing Methods for Teleophthalmology. IRBM 2013, 34, 196–203. [Google Scholar] [CrossRef]
- Decencière, E.; Cazuguel, G.; Zhang, X.; Thibault, G.; Klein, J.C.; Meyer, F.; Marcotegui, B.; Quellec, G.; Lamard, M.; Danno, R.; et al. Feedback on a Publicly Distributed Database: The MESSIDOR Database. Image Anal. Stereol. 2014, 33, 231–234. [Google Scholar] [CrossRef]
- Kauppi, T.; Kalesnykiene, V.; Kamarainen, J.K.; Lensu, L.; Sorri, I.; Raninen, A.; Voutilainen, R.; Uusitalo, H.; Kalviainen, H.; Pietilä, J. DIARETDB1 Diabetic Retinopathy Database and Evaluation Protocol; Technical Report; Department of Ophthalmology, Faculty of Medicine, University of Kuopio: Kuopio, Finland, 2007. [Google Scholar]
- Zhou, Y.; Wang, B.; Huang, L.; Cui, S.; Shao, L. A Benchmark for Studying Diabetic Retinopathy: Segmentation, Grading, and Transferability. IEEE Trans. Med. Imaging 2021, 40, 818–828. [Google Scholar] [CrossRef] [PubMed]
- Chankhachon, S.; Kansomkeat, S.; Bhurayanontachai, P.; Intajag, S. Deep Learning Network with Illuminant Augmentation for Diabetic Retinopathy Segmentation Using Comprehensive Anatomical Context Integration. Diagnostics 2025, 15, 2762. [Google Scholar] [CrossRef] [PubMed]
- Ronneberger, O.; Fischer, P.; Brox, T. U-Net: Convolutional Networks for Biomedical Image Segmentation. In Medical Image Computing and Computer-Assisted Intervention–MICCAI 2015; Springer: Cham, Switzerland, 2015; pp. 234–241. [Google Scholar] [CrossRef]
- Early Treatment Diabetic Retinopathy Study Research Group. Grading Diabetic Retinopathy from Stereoscopic Color Fundus Photographs—An Extension of the Modified Airlie House Classification. ETDRS Report Number 10. Ophthalmology 1991, 98, 786–806. [Google Scholar] [CrossRef]
- Müller, D.; Soto-Rey, I.; Kramer, F. Towards a Guideline for Evaluation Metrics in Medical Image Segmentation. BMC Res. Notes 2022, 15, 210. [Google Scholar] [CrossRef] [PubMed]
- Landis, J.R.; Koch, G.G. The Measurement of Observer Agreement for Categorical Data. Biometrics 1977, 33, 159–174. [Google Scholar] [CrossRef] [PubMed]
- Salehi, S.S.M.; Erdogmus, D.; Gholipour, A. Tversky Loss Function for Image Segmentation Using 3D Fully Convolutional Deep Networks. In Machine Learning in Medical Imaging, MLMI 2017; Wang, Q., Shi, Y., Suk, H.I., Suzuki, K., Eds.; Lecture Notes in Computer Science; Springer: Cham, Switzerland, 2017; Volume 10541, pp. 379–387. [Google Scholar] [CrossRef]
- Lin, T.-Y.; Goyal, P.; Girshick, R.; He, K.; Dollár, P. Focal Loss for Dense Object Detection. In Proceedings of the 2017 IEEE International Conference on Computer Vision (ICCV), Venice, Italy, 22–29 October 2017; pp. 2999–3007. [Google Scholar] [CrossRef]
- Fu, Y.; Liu, M.; Zhang, G.; Peng, J. Lightweight Frequency Recalibration Network for Diabetic Retinopathy Multi-Lesion Segmentation. Appl. Sci. 2024, 14, 6941. [Google Scholar] [CrossRef]
- Fu, Y.; Zhang, G.; Lu, X.; Wu, H.; Zhang, D. RMCA U-net: Hard Exudates Segmentation for Retinal Fundus Images. Expert Syst. Appl. 2023, 234, 120987. [Google Scholar] [CrossRef]
- Fu, Y.; Chen, J.; Li, J.; Pan, D.; Yue, X.; Zhu, Y. Optic Disc Segmentation by U-net and Probability Bubble in Abnormal Fundus Images. Pattern Recognit. 2021, 117, 107971. [Google Scholar] [CrossRef]
- Fu, Y.; Zhang, G.; Li, J.; Pan, D.; Wang, Y.; Zhang, D. Fovea Localization by Blood Vessel Vector in Abnormal Fundus Images. Pattern Recognit. 2022, 129, 108711. [Google Scholar] [CrossRef]


| Dataset | Images | Lesion Types | Annotation Type | PDR Lesions | Anatomical | Public |
|---|---|---|---|---|---|---|
| IDRiD [2] | 81 | MA, HE, EX, CWS | Pixel-level | No | OD only | Yes |
| DDR [3] | 757 | MA, HE, EX, CWS | Pixel-level | No | No | Yes |
| E-ophtha [4] | 463 | MA, EX | Pixel-level | No | No | Yes |
| FGADR [7] | 1842 | MA, HE, EX, CWS, NV, IRMA | Pixel-level + Grade | Yes | No | Yes |
| Refined IDRiD (Ours) | 81 | MA, HE, EX, CWS, NV, VH, IRMA | Pixel-level (unified) | Yes | Yes (5 types) | Yes |
| Class Name | Label ID |
|---|---|
| Background | 0 |
| Retina | 8 |
| Fovea (FV) | 16 |
| Vessel | 24 |
| Optic Disc (OD) | 32 |
| Vitreous Hemorrhage (VH) | 4 |
| Hard Exudate (EX) | 63 |
| Intraretinal Microvascular Abnormality (IRMA) | 96 |
| Cotton-Wool Spot (CWS) | 191 |
| Neovascularization (NV) | 166 |
| Hemorrhage (HE) | 127 |
| Microaneurysm (MA) | 255 |
| Class | Original IDRiD | Enhanced IDRiD | ||
|---|---|---|---|---|
| No. Pixels | Class % | No. Pixels | Class % | |
| Background | 82,151,578 | 98.10 | 23,634,564 | 27.83 |
| Retina | 0 | 0.00 | 51,279,750 | 60.38 |
| Vessel | 0 | 0.00 | 6,503,691 | 7.66 |
| Optic Disc | 0 | 0.00 | 1,096,874 | 1.29 |
| Fovea | 0 | 0.00 | 435,364 | 0.51 |
| Hard Exudates | 529,486 | 0.63 | 800,220 | 0.94 |
| Hemorrhages | 829,435 | 0.99 | 770,067 | 0.91 |
| Cotton-Wool Spots | 164,869 | 0.20 | 153,786 | 0.18 |
| Microaneurysms | 70,210 | 0.08 | 84,895 | 0.10 |
| Neovascularization | 0 | 0.00 | 11,441 | 0.01 |
| IRMA | 0 | 0.00 | 18,592 | 0.02 |
| Vitreous Hemorrhage | 0 | 0.00 | 145,412 | 0.17 |
| Lesion Type | Precision | Recall | F1-Score |
|---|---|---|---|
| Microaneurysm (MA) | 0.8621 | 0.8849 | 0.8734 |
| Hemorrhage (HE) | 0.9078 | 0.9236 | 0.9230 |
| Hard Exudate (EX) | 0.9198 | 0.9378 | 0.9287 |
| Cotton-Wool Spot (CWS) | 0.8867 | 0.9025 | 0.8945 |
| Neovascularization (NV) | 0.8756 | 0.8891 | 0.8823 |
| Vitreous Hemorrhage (VH) | 0.9045 | 0.9205 | 0.9420 |
| IRMA | 0.8534 | 0.8752 | 0.8642 |
| Mean (Overall) | 0.8871 | 0.9048 | 0.9012 |
| Lesion Type | Original IoU | Refined IoU | Improvement | p-Value |
|---|---|---|---|---|
| Microaneurysm (MA) | 0.5734 | 0.6389 | +11.4% | <0.001 |
| Hemorrhage (HE) | 0.6156 | 0.6803 | +10.5% | <0.001 |
| Hard Exudate (EX) | 0.6923 | 0.7359 | +6.3% | 0.002 |
| Cotton-Wool Spot (CWS) | 0.6445 | 0.7025 | +9.0% | <0.001 |
| Mean (Overall) | 0.6315 | 0.6894 | +9.2% | <0.001 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Chankhachon, S.; Kansomkeat, S.; Bhurayanontachai, P.; Intajag, S. Refined IDRiD: An Enhanced Dataset for Diabetic Retinopathy Segmentation with Expert-Validated Annotations and Comprehensive Anatomical Context. Data 2026, 11, 30. https://doi.org/10.3390/data11020030
Chankhachon S, Kansomkeat S, Bhurayanontachai P, Intajag S. Refined IDRiD: An Enhanced Dataset for Diabetic Retinopathy Segmentation with Expert-Validated Annotations and Comprehensive Anatomical Context. Data. 2026; 11(2):30. https://doi.org/10.3390/data11020030
Chicago/Turabian StyleChankhachon, Sakon, Supaporn Kansomkeat, Patama Bhurayanontachai, and Sathit Intajag. 2026. "Refined IDRiD: An Enhanced Dataset for Diabetic Retinopathy Segmentation with Expert-Validated Annotations and Comprehensive Anatomical Context" Data 11, no. 2: 30. https://doi.org/10.3390/data11020030
APA StyleChankhachon, S., Kansomkeat, S., Bhurayanontachai, P., & Intajag, S. (2026). Refined IDRiD: An Enhanced Dataset for Diabetic Retinopathy Segmentation with Expert-Validated Annotations and Comprehensive Anatomical Context. Data, 11(2), 30. https://doi.org/10.3390/data11020030

