No-Reference Quality Assessment of Dermoscopic Images Using Minimal Expert Supervision
Abstract
1. Introduction
1.1. Related Works on No-Reference IQA Metrics
1.2. Proposed Approach and Main Contributions
- Optimize image acquisition protocols
- Provide immediate feedback to clinicians
- Filter suitable images for AI model training or inference
- Ensure consistency across different imaging devices and settings
- We propose a novel data preparation methodology combining controlled synthetic degradation with objective quality scoring to create a comprehensive assessment system that aligns with clinical perception while capturing critical diagnostic patterns.
- We introduce a new training framework using a continuous ranking system that significantly reduces the need for expert annotations.
- Our metric achieves 92% accuracy in distinguishing between high- and low-quality dermoscopic images as classified by expert dermatologists, showing strong alignment with clinical quality perception.
- We provide a transferable framework that can be extended to other medical imaging modalities, offering a blueprint for developing domain-specific quality metrics with limited annotation resources.
2. Materials and Methods
2.1. Dataset
- Complete visualization of the lesion with appropriate margins (minimum 10% border area around the lesion)
- Absence of significant artifacts (less than 5% of the lesion area affected by hair, air bubbles, ruler markings, ink)
- Proper focus throughout the region of interest
- Adequate lighting with balanced exposure
- Substantial artifacts obscuring diagnostically relevant features (>10% of lesion area affected);
- Poor focus rendering critical patterns indiscernible;
- Improper illumination (>15% pixels under- or overexposed);
- Incomplete lesion visualization (lesion extends beyond image boundaries).
- The PH2 dataset [30]: 200 dermoscopic images acquired with professional equipment
- The PAD-UFES-20 dataset [31]: 2044 smartphone-based dermoscopic images (from an original set of 2298)
- A private dataset from Novara Hospital (Novara, Italy): 149 images collected using the Nurugo smartphone dermoscope [13]
2.2. Custom Degradation Pipeline
2.2.1. Motion Blur Degradation
2.2.2. Defocus Blur Degradation
2.2.3. JPEG Compression Artifacts
- Color Space Transformation: The image, originally in RGB format, is first transformed into the YCbCr color space. This separates luminance (Y) from chrominance (Cb and Cr), allowing for more efficient compression since the human visual system is less sensitive to changes in chrominance compared to luminance.
- Block Splitting and Discrete Cosine Transform (DCT): The image is divided into non-overlapping blocks, and each block undergoes the Discrete Cosine Transform (DCT) [35] to convert spatial pixel information into frequency components:
- Quantization: The DCT coefficients are quantized by dividing each coefficient by a corresponding value from a quantization matrix , followed by rounding:Lower quality values of lead to more aggressive quantization, which results in significant information loss and visible compression artifacts.
- Entropy Coding: The quantized coefficients are then compressed through entropy coding techniques. In JPEG terminology, the first coefficient of each block (representing the average value) is called the DC coefficient, while the remaining 63 coefficients (representing increasingly fine details) are called AC coefficients. These coefficients are encoded differently: DC coefficients using Differential Pulse Code Modulation (DPCM) [36], and the AC coefficient using run-length encoding and Huffman coding [37] based on their patterns of zeros.
- Reconstruction: During decompression, this process is reversed. The encoded data is decoded and then dequantized by multiplying with the same quantization matrix:
2.3. Patch Extraction and Characterization
Elo Rating System for Quality Ranking
- Initialization: All image patches were assigned an initial Elo rating of 3000.
- Pairing: Image patches were paired using an adaptive strategy throughout the training process. In early iterations, patches were matched across broader rating differences to establish initial hierarchies. The pairing range then progressively narrowed as training advanced, starting from ±100 points and gradually decreasing to ±50 points in later iterations. This approach helped establish a stable global ranking while preventing local ranking cycles that could emerge from fixed-range comparisons.
- Outcome determination: For each pair, we first compared their degradation category (D0, D1, D2, D3), with the established quality hierarchy D0 > D1 > D2 > D3. When comparing patches within the same degradation level, we employed PSNR as the final quality indicator. If all metrics showed differences below 0.1%, the match was declared a draw.
- Rating update: After each comparison, we updated the Elo ratings using Equation (10).
- Convergence: We repeated steps 2–4 until the maximum absolute change in Elo ratings across all images fell below 10−3, indicating stable relative quality rankings. We repeated steps 2–4 until meeting two criteria: maximum absolute change in Elo ratings across all images below 10−3 and no rating inversions in the last 500 comparisons. This ensured convergence to a stable and reliable ranking system.
2.4. No-Reference Metric Network Training
2.5. Inference Strategy
| Algorithm 1. DermaIQA Inference Strategy. |
| Input: Dermoscopic image I of arbitrary size Output: Quality score Q and optional patch-level analysis
|
2.6. Evaluation Criteria
3. Results
3.1. Benchmark on the ISIC Test Set
3.2. Comparison with IQA Metrics
3.3. Validation on the External Test Sets
4. Discussion
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
Abbreviations
| AI | Artificial Intelligence |
| AUC | Area Under the ROC Curve |
| CNR | Contrast-to-Noise Ratio |
| DCT | Discrete Cosine Transform |
| HQ | High-Quality |
| IQA | Image Quality Assessment |
| LQ | Low-Quality |
| MSE | Mean Squared Error |
| PLCC | Pearson Linear Correlation Coefficient |
| PSNR | Peak Signal-to-Noise Ratio |
| ROI | Region Of Interest |
| SRCC | Spearman Rank Correlation Coefficient |
| SSIM | Structural Similarity Index Metric |
| ViT | Vision Transformer |
References
- Celebi, M.E.; Codella, N.; Halpern, A. Dermoscopy Image Analysis: Overview and Future Directions. IEEE J. Biomed. Health Inform. 2019, 23, 474–478. [Google Scholar] [CrossRef]
- Esteva, A.; Kuprel, B.; Novoa, R.A.; Ko, J.; Swetter, S.M.; Blau, H.M.; Thrun, S. Dermatologist-Level Classification of Skin Cancer with Deep Neural Networks. Nature 2017, 542, 115–118. [Google Scholar] [CrossRef] [PubMed]
- Gessert, N.; Sentker, T.; Madesta, F.; Schmitz, R.; Kniep, H.; Baltruschat, I.; Werner, R.; Schlaefer, A. Skin Lesion Classification Using CNNs with Patch-Based Attention and Diagnosis-Guided Loss Weighting. IEEE Trans. Biomed. Eng. 2020, 67, 495–503. [Google Scholar] [CrossRef] [PubMed]
- Tofighi, N.J.; Elfkir, M.H.; Imamoglu, N.; Ozcinar, C.; Erdem, A.; Erdem, E. Omnidirectional Image Quality Assessment with Local–Global Vision Transformers. Image Vis. Comput. 2024, 148, 105151. [Google Scholar] [CrossRef]
- Seoni, S.; Shahini, A.; Meiburger, K.M.; Marzola, F.; Rotunno, G.; Acharya, U.R.; Molinari, F.; Salvi, M. All You Need Is Data Preparation: A Systematic Review of Image Harmonization Techniques in Multi-Center/Device Studies for Medical Support Systems. Comput. Methods Programs Biomed. 2024, 250, 108200. [Google Scholar] [CrossRef] [PubMed]
- Breger, A.; Karner, C.; Selby, I.; Gröhl, J.; Dittmer, S.; Lilley, E.; Babar, J.; Beckford, J.; Else, T.R.; Sadler, T.J.; et al. A Study on the Adequacy of Common IQA Measures for Medical Images. In Proceedings of 2024 International Conference on Medical Imaging and Computer-Aided Diagnosis (MICAD 2024), Manchester, UK, 19–21 November 2024; Su, R., Frangi, A.F., Zhang, Y., Eds.; Lecture Notes in Electrical Engineering; Springer Nature: Singapore, 2025; Volume 1372, pp. 451–462. [Google Scholar]
- Breger, A.; Biguri, A.; Landman, M.S.; Selby, I.; Amberg, N.; Brunner, E.; Gröhl, J.; Hatamikia, S.; Karner, C.; Ning, L.; et al. A Study of Why We Need to Reassess Full Reference Image Quality Assessment with Medical Images. J. Imaging Inform. Med. 2025, 38, 3444–3469. [Google Scholar] [CrossRef]
- Yu, S.; Dai, G.; Wang, Z.; Li, L.; Wei, X.; Xie, Y. A Consistency Evaluation of Signal-to-Noise Ratio in the Quality Assessment of Human Brain Magnetic Resonance Images. BMC Med. Imaging 2018, 18, 17. [Google Scholar] [CrossRef]
- Fei, X.; Xiao, L.; Sun, Y.; Wei, Z. Perceptual Image Quality Assessment Based on Structural Similarity and Visual Masking. Signal Process. Image Commun. 2012, 27, 772–783. [Google Scholar] [CrossRef]
- Adhikari, A.; Lee, S.-W. AM-BQA: Enhancing Blind Image Quality Assessment Using Attention Retractable Features and Multi-Dimensional Learning. Image Vis. Comput. 2024, 147, 105076. [Google Scholar] [CrossRef]
- Chow, L.S.; Paramesran, R. Review of Medical Image Quality Assessment. Biomed. Signal Process. Control 2016, 27, 145–154. [Google Scholar] [CrossRef]
- Hosny, K.M.; Kassem, M.A.; Foaud, M.M. Skin Cancer Classification Using Deep Learning and Transfer Learning. In Proceedings of the 2018 9th Cairo International Biomedical Engineering Conference (CIBEC), Cairo, Egypt, 20–22 December 2018; IEEE: New York, NY, USA, 2018; pp. 90–93. [Google Scholar]
- Veronese, F.; Tarantino, V.; Zavattaro, E.; Biacchi, F.; Airoldi, C.; Salvi, M.; Seoni, S.; Branciforti, F.; Meiburger, K.M.; Savoia, P. Teledermoscopy in the Diagnosis of Melanocytic and Non-Melanocytic Skin Lesions: NurugoTM Derma Smartphone Microscope as a Possible New Tool in Daily Clinical Practice. Diagnostics 2022, 12, 1371. [Google Scholar] [CrossRef] [PubMed]
- Yang, S.; Wu, T.; Shi, S.; Lao, S.; Gong, Y.; Cao, M.; Wang, J.; Yang, Y. MANIQA: Multi-dimension Attention Network for No-Reference Image Quality Assessment, Github Repository. Available online: https://github.com/IIGROUP/MANIQA (accessed on 3 February 2026).
- Ke, J.; Wang, Q.; Wang, Y.; Milanfar, P.; Yang, F. Musiq: Multi-Scale Image Quality Transformer. In Proceedings of the IEEE/CVF International Conference on Computer Vision, Montreal, QC, Canada, 10–17 October 2021; IEEE: New York, NY, USA, 2021; pp. 5148–5157. [Google Scholar]
- Su, S.; Yan, Q.; Zhu, Y.; Zhang, C.; Ge, X.; Sun, J.; Zhang, Y. Blindly Assess Image Quality in the Wild Guided by a Self-Adaptive Hyper Network. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Waikoloa, HI, USA, 3–8 January 2022; IEEE: New York, NY, USA, 2020; pp. 3667–3676. [Google Scholar]
- Golestaneh, S.A.; Dadsetan, S.; Kitani, K.M. No-Reference Image Quality Assessment via Transformers, Relative Ranking, and Self-Consistency. In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, Waikoloa, HI, USA, 3–8 January 2022; IEEE: New York, NY, USA, 2022; pp. 1220–1230. [Google Scholar]
- Mittal, A.; Soundararajan, R.; Bovik, A.C. Making a “Completely Blind” Image Quality Analyzer. IEEE Signal Process. Lett. 2013, 20, 209–212. [Google Scholar] [CrossRef]
- Zhang, L.; Zhang, L.; Bovik, A.C. A Feature-Enriched Completely Blind Image Quality Evaluator. IEEE Trans. Image Process. 2015, 24, 2579–2591. [Google Scholar] [CrossRef] [PubMed]
- Rodrigues, R.; Lévêque, L.; Gutiérrez, J.; Jebbari, H.; Outtas, M.; Zhang, L.; Chetouani, A.; Al-Juboori, S.; Martini, M.G.; Pinheiro, A.M.G. Objective Quality Assessment of Medical Images and Videos: Review and Challenges. Multimed. Tools Appl. 2024, 84, 29915–29948. [Google Scholar] [CrossRef]
- Chen, B.; Solebo, A.L.; Bao, W.; Taylor, P. Medical Image Quality Assessment Based on Probability of Necessity and Sufficiency. In Proceedings of the 2025 IEEE 22nd International Symposium on Biomedical Imaging (ISBI), Houston, TX, USA, 14–17 April 2025; IEEE: New York, NY, USA, 2025; pp. 1–4. [Google Scholar]
- Chen, J.; Lin, L.; Cheng, P.; Huang, Y.; Tang, X. Uno-Qa: An Unsupervised Anomaly-Aware Framework with Test-Time Clustering for Octa Image Quality Assessment. In Proceedings of the 2023 IEEE 20th International Symposium on Biomedical Imaging (ISBI), Cartagena, Colombia, 18–21 April 2023; IEEE: New York, NY, USA, 2023; pp. 1–5. [Google Scholar]
- Petashvili, D.; Wang, H.; Deslandes, A.; Avery, J.; Condous, G.; Carneiro, G.; Hull, L.; Chen, H.-T. Learning Subjective Image Quality Assessment for Transvaginal Ultrasound Scans from Multi-Annotator Labels. In Proceedings of the 2024 IEEE International Symposium on Biomedical Imaging (ISBI), Athens, Greece, 27–30 May 2024; IEEE: New York, NY, USA, 2024; pp. 1–5. [Google Scholar]
- Maruyama, S.; Watanabe, H.; Shimosegawa, M. An Image Quality Assessment Index Based on Image Features and Keypoints for X-Ray CT Images. PLoS ONE 2024, 19, e0304860. [Google Scholar] [CrossRef]
- Ozer, C.; Guler, A.; Cansever, A.T.; Oksuz, I. Explainable Image Quality Assessment for Medical Imaging. arXiv 2023, arXiv:2303.14479. [Google Scholar] [CrossRef]
- Nikiforaki, K.; Karatzanis, I.; Dovrou, A.; Bobowicz, M.; Gwozdziewicz, K.; Díaz, O.; Tsiknakis, M.; Fotiadis, D.I.; Lekadir, K.; Marias, K. Image Quality Assessment Tool for Conventional and Dynamic Magnetic Resonance Imaging Acquisitions. J. Imaging 2024, 10, 115. [Google Scholar] [CrossRef]
- Elo, A.E. The Rating of Chessplayers, Past and Present, 2nd ed.; Arco Publishing Company, Inc.: New York, NY, USA, 1986. [Google Scholar]
- Tschandl, P.; Rosendahl, C.; Kittler, H. The HAM10000 Dataset, a Large Collection of Multi-Source Dermatoscopic Images of Common Pigmented Skin Lesions. Sci. Data 2018, 5, 180161. [Google Scholar] [CrossRef]
- Codella, N.C.F.; Gutman, D.; Celebi, M.E.; Helba, B.; Marchetti, M.A.; Dusza, S.W.; Kalloo, A.; Liopyris, K.; Mishra, N.; Kittler, H.; et al. Skin Lesion Analysis toward Melanoma Detection: A Challenge at the 2017 International Symposium on Biomedical Imaging (ISBI), Hosted by the International Skin Imaging Collaboration (ISIC). In Proceedings of the 2018 IEEE 15th International Symposium on Biomedical Imaging (ISBI 2018), Washington, DC, USA, 4–7 April 2018; IEEE: New York, NY, USA, 2018; pp. 168–172. [Google Scholar]
- Mendonca, T.; Ferreira, P.M.; Marques, J.S.; Marcal, A.R.S.; Rozeira, J. PH2—A Dermoscopic Image Database for Research and Benchmarking. In Proceedings of the 2013 35th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), Osaka, Japan, 3–7 July 2013; IEEE: New York, NY, USA, 2013; pp. 5437–5440. [Google Scholar]
- Oztel, I.; Yolcu Oztel, G.; Sahin, V.H. Deep Learning-Based Skin Diseases Classification Using Smartphones. Adv. Intell. Syst. 2023, 5, 2300211. [Google Scholar] [CrossRef]
- Buslaev, A.; Iglovikov, V.I.; Khvedchenya, E.; Parinov, A.; Druzhinin, M.; Kalinin, A.A. Albumentations: Fast and Flexible Image Augmentations. Information 2020, 11, 125. [Google Scholar] [CrossRef]
- Levin, A.; Weiss, Y.; Durand, F.; Freeman, W.T. Understanding and Evaluating Blind Deconvolution Algorithms. In Proceedings of the 2009 IEEE Conference on Computer Vision and Pattern Recognition, Miami, FL, USA, 20–25 June 2009; IEEE: New York, NY, USA, 2009; pp. 1964–1971. [Google Scholar]
- Al-Ani, M.S.; Awad, F.H. The JPEG Image Compression Algorithm. Int J Adv Eng Technol 2013, 6, 1055–1062. [Google Scholar]
- Ahmed, N.; Natarajan, T.; Rao, K.R. Discrete Cosine Transform. IEEE Trans. Comput. 1974, C-23, 90–93. [Google Scholar] [CrossRef]
- Tomar, R.R.S.; Jain, K. Lossless Image Compression Using Differential Pulse Code Modulation and Its Application. In Proceedings of the 2015 International Conference on Computational Intelligence and Communication Networks (CICN), Jabalpur, India, 12–14 December 2015; IEEE: New York, NY, USA, 2015; pp. 397–400. [Google Scholar]
- Moffat, A. Huffman Coding. ACM Comput. Surv. 2020, 52, 1–35. [Google Scholar] [CrossRef]
- Weiyuan, W.; Verma, D.; Yang, W. Patchify Github Repository. GitHub Repository. Available online: https://github.com/dovahcrow/patchify.py (accessed on 3 February 2026).
- Huynh-Thu, Q.; Ghanbari, M. Scope of Validity of PSNR in Image/Video Quality Assessment. Electron. Lett. 2008, 44, 800–801. [Google Scholar] [CrossRef]
- Liu, Z.; Lin, Y.; Cao, Y.; Hu, H.; Wei, Y.; Zhang, Z.; Lin, S.; Guo, B. Swin Transformer: Hierarchical Vision Transformer Using Shifted Windows. In Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision (ICCV), Montreal, QC, Canada, 10–17 October 2021; IEEE: New York, NY, USA, 2021; pp. 10012–10022. [Google Scholar]
- Zhang, W.; Ma, K.; Zhai, G.; Yang, X. Uncertainty-Aware Blind Image Quality Assessment in the Laboratory and Wild. IEEE Trans. Image Process. 2021, 30, 3474–3486. [Google Scholar] [CrossRef]
- Chen, C.; Jiadi, M. IQA-PyTorch: PyTorch Toolbox for Image Quality Assessment, GitHub Repository. Available online: https://github.com/chaofengc/IQA-PyTorch (accessed on 3 February 2026).
- Krotkov, E. Focusing. Int. J. Comput. Vis. 1988, 1, 223–237. [Google Scholar] [CrossRef]
- Pech-Pacheco, J.L.; Cristobal, G.; Chamorro-Martinez, J.; Fernandez-Valdivia, J. Diatom Autofocusing in Brightfield Microscopy: A Comparative Study. In Proceedings of the 15th International Conference on Pattern Recognition. ICPR-2000, Barcelona, Spain, 3–7 September 2000; IEEE: New York, NY, USA, 2000; Volume 3, pp. 314–317. [Google Scholar]
- Liu, L.; Liu, B.; Huang, H.; Bovik, A.C. No-Reference Image Quality Assessment Based on Spatial and Spectral Entropies. Signal Process. Image Commun. 2014, 29, 856–863. [Google Scholar] [CrossRef]
- Al-refai, G.; Qadan, S.; Al-refai, M.; Elmoaqet, H. Introducing Edges Density and Texture Contrast Scores as Quality Metrics for Enhanced Low Light Images. In Proceedings of the 2024 22nd International Conference on Research and Education in Mechatronics (REM), Amman, Jordan, 24–26 September 2024; IEEE: New York, NY, USA, 2024; pp. 1–5. [Google Scholar]
- Salvi, M.; Branciforti, F.; Molinari, F.; Meiburger, K.M. Generative Models for Color Normalization in Digital Pathology and Dermatology: Advancing the Learning Paradigm. Expert Syst. Appl. 2024, 245, 123105. [Google Scholar] [CrossRef]
- Salvi, M.; Branciforti, F.; Veronese, F.; Zavattaro, E.; Tarantino, V.; Savoia, P.; Meiburger, K.M. DermoCC-GAN: A New Approach for Standardizing Dermatological Images Using Generative Adversarial Networks. Comput. Methods Programs Biomed. 2022, 225, 107040. [Google Scholar] [CrossRef]
- Yuan, Y.; Chao, M.; Lo, Y.-C. Automatic Skin Lesion Segmentation Using Deep Fully Convolutional Networks With Jaccard Distance. IEEE Trans. Med. Imaging 2017, 36, 1876–1886. [Google Scholar] [CrossRef]









| Subset | Quality Level | Number of Images |
|---|---|---|
| Training images | (original HQ) | 539 |
| (degraded HQ) | 539 | |
| (original LQ) | 273 | |
| (degraded LQ) | 273 | |
| Testing—ISIC Archive | 150 | |
| 150 | ||
| External validation | PH2 (dermoscope) | 200 |
| PAD-UFES-20 (smartphone) | 2044 | |
| Novara (smartphone) | 149 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Ferraris, A.; Branciforti, F.; Meiburger, K.M.; Veronese, F.; Zavattaro, E.; Savoia, P.; Salvi, M. No-Reference Quality Assessment of Dermoscopic Images Using Minimal Expert Supervision. Appl. Sci. 2026, 16, 1682. https://doi.org/10.3390/app16041682
Ferraris A, Branciforti F, Meiburger KM, Veronese F, Zavattaro E, Savoia P, Salvi M. No-Reference Quality Assessment of Dermoscopic Images Using Minimal Expert Supervision. Applied Sciences. 2026; 16(4):1682. https://doi.org/10.3390/app16041682
Chicago/Turabian StyleFerraris, Andrea, Francesco Branciforti, Kristen M. Meiburger, Federica Veronese, Elisa Zavattaro, Paola Savoia, and Massimo Salvi. 2026. "No-Reference Quality Assessment of Dermoscopic Images Using Minimal Expert Supervision" Applied Sciences 16, no. 4: 1682. https://doi.org/10.3390/app16041682
APA StyleFerraris, A., Branciforti, F., Meiburger, K. M., Veronese, F., Zavattaro, E., Savoia, P., & Salvi, M. (2026). No-Reference Quality Assessment of Dermoscopic Images Using Minimal Expert Supervision. Applied Sciences, 16(4), 1682. https://doi.org/10.3390/app16041682

