XAI-MedNet: A Next-Generation Explainable AI Framework for Contrast-Enhanced Skin Lesion Classification via Entropy-Controlled Optimization
Abstract
1. Introduction
- Visual inspection: Healthcare professionals assess suspicious moles or lesions using the naked eye or a dermatoscope, a handheld device that magnifies the skin.
- Biopsy: This procedure involves extracting a small portion of skin tissue for microscopic analysis, typically performed when an abnormal lesion is detected during a visual examination.
- Dermoscopy: A non-invasive technique using a dermatoscope to reveal internal skin structures through magnification and polarized light.
- Reflectance confocal microscopy (RCM): A non-invasive method that captures high-resolution images of the dermis’ internal architecture using a laser.
- Computer-aided diagnosis (CAD): This approach utilizes machine learning algorithms to analyze images and label them as benign or malignant based on their features.
- Beyond improving predictive performance, modern CAD systems must be interpretable and transparent to achieve clinical acceptance. This study addresses these requirements by situating the proposed framework within Explainable AI, using interpretable preprocessing and entropy-driven feature selection.
2. Literature Review
3. Problem Statement and Contributions
- Proposal of a novel contrast stretching framework that combines an extended Bat algorithm with an Artificial bee colony (EBA-ABC) algorithm to improve the lesion visibility and boundary clarity.
- Proposal of a bio-inspired feature selection framework, entropy-controlled whale optimization algorithm, to address challenges related to the “curse of dimensionality” and over-fitting.
- Development of a transparent diagnostic workflow incorporating XAI methodologies to provide clinicians with traceable visual evidence, ensuring that model predictions for lesion malignancy are grounded in interpretable feature analysis.
4. Proposed Framework
4.1. Extended BA-ABC Algorithm
4.1.1. Bat Algorithm
4.1.2. Artificial Bee Colony Algorithm
4.2. Feature Fusion
4.3. Feature Selection
- Encircling prey: Whales search at random, depending on their current position. In order to increase the algorithm’s capacity for exploration, this humpback whale trait is applied in this case. The mathematical formulation of this behavior is as follows:where represents the distance between the current and randomly selected individual of the population, represents the position vector of the randomly generated population, denotes a randomly chosen individual from the population, and operator performs element-wise multiplication. The two functional parameters and are calculated using the following relations:where is the randomly generated number between 0 and 1, and exhibits a linearly decreasing value from 2 to 0 over iterations.
- Exploitation: Bubble-net attacking strategy: The bubble-net attack that humpback whales follow involves them moving in a helix-shaped pattern. The whole strategy is as follows:where represents the current best solution, l is a random number , and b denotes the geometry of the logarithmic spiral.In addition to swimming in a spiral-shaped pattern around the prey, humpback whales also swim in a circle that is gradually getting smaller. The likelihood of selecting the shrinking encircling mechanism or the spiral model that updates the whale positions during optimization is fixed at 50%.Based on the likelihood of choosing either a spiral or a shrinking circle model, is chosen to be 0.5.
| Algorithm 1 Entropy-Controlled Whale Optimization for Feature Selection |
|
- Exploration: Search for prey: A similar methodology, leveraging the fluctuations of the vector, can be employed to locate prey. Accordingly, by assigning random values to that exceed 1 or fall below −1, we induce a deliberate deviation in the trajectory of the search agent, effectively distancing it from the reference whales. In the exploration phase, unlike the exploitation phase, the agent’s search position is adjusted based on a randomly chosen search agent rather than the best one identified so far.This strategy accentuates the importance of exploration, thereby enabling the WOA algorithm to undertake a more exhaustive and comprehensive search throughout the solution space. The following is the mathematical relationship:
5. Results and Discussion
5.1. Segmentation Framework
5.1.1. Parameter Setting and Performance Measure
5.1.2. BAT
5.2. CA-Net
5.3. Classification Results
5.3.1. Parameter Setting and Performance Measure
5.3.2. Statistical Significance
5.3.3. Assessment of Variability and External Validation
5.3.4. Explainable AI (XAI): Evaluation of the Proposed Framework
- Visual and Feature-Level Interpretability: The effectiveness of the EBA-ABC contrast enhancement module is validated visually through network attention mapping. Figure 11 illustrates the qualitative visual comparison of the decision-making processes of the various models in this study through Gradient-weighted Class Activation Mapping (Grad-CAM). Different backbones have different preferences for how they extract features from the input data: for instance, DenseNet-201 (Column 3) localizes internal dense regions of textural variation, while Inception-ResNet v2 (Column 4) captures broader morphological boundaries and uses multi-scale asymmetry as part of its feature extraction strategy. It is also noted that individual maps of the same architecture may exhibit both fragmented and diffuse activations in the lesion’s background skin. Alternatively, the map produced by the proposed concatenated model (Column 5) offers a more spatially coherent representation of activation patterns. For EBA-ABC visually enhanced images, the proposed concatenation model produced a more accurate representation of the core pathological morphology of each lesion, with minimal background noise interference. The activation heatmaps reveal that, while the model’s focus on original images is often dispersed across background noise, the enhanced images force the network’s attention to be highly localized on the actual lesion area. This provides visually interpretable outputs that align directly with dermatological assessment criteria.The proposed framework addresses the lack of interpretability of traditional deep feature fusion methods via its use of an Entropy-controlled whale optimization algorithm (WOA), which allows for quantifying the information contributions made by each individual feature set based on Shannon’s entropy (i.e., a mathematical measure of uncertainty). For each feature in the final predictive output, the framework affords the opportunity to have an objective basis for determining which features to include or exclude, ultimately achieving significant improvements by reducing redundancy and mitigating overfitting by isolating only the most discriminative features from the inputs provided to the predictive model. While Grad-CAM visualizations provide intuitive and clinically relevant insights into model attention, it is important to acknowledge that such techniques are inherently qualitative and post-hoc. They do not necessarily capture causal relationships between input features and model predictions and may exhibit sensitivity to architectural variations and input perturbations. To address this limitation, the proposed framework complements visual explanations with entropy-controlled feature selection, which provides a quantitative measure of feature importance based on information theory. This combination enables a more structured and reliable interpretability mechanism. Nevertheless, the exploration of more causally grounded and robust explainability techniques remains an important direction for future work.
- Clinical Relevance: To bridge the gap between theoretical performance and practical application, the proposed XAI-MedNet framework is designed to function seamlessly as a “human-in-the-loop” Clinical Decision Support System (CDSS). In a real-world dermatological workflow, the system operates as a secondary evaluator rather than an autonomous diagnostician. Upon clinical image acquisition, the framework first presents the dermatologist with the contrast-enhanced image, visually isolating the lesion boundaries. Subsequently, it outputs a malignancy probability alongside feature-level interpretation maps. This dual-level transparency allows the clinician to instantly cross-reference the algorithm’s internal focus with established dermoscopic criteria, empowering them to confidently validate the prediction or dismiss irrelevant artifacts prior to making a final biopsy decision.
6. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Lomas, A.; Leonardi-Bee, J.; Bath-Hextall, F. A systematic review of worldwide incidence of nonmelanoma skin cancer. Br. J. Dermatol. 2012, 166, 1069–1080. [Google Scholar] [CrossRef] [PubMed]
- Ajmal, M.; Khan, M.A.; Akram, T.; Alqahtani, A.; Alhaisoni, M.; Armghan, A.; Althubiti, S.A.; Alenezi, F. BF2SkNet: Best deep learning features fusion-assisted framework for multiclass skin lesion classification. Neural Comput. Appl. 2023, 35, 22115–22131. [Google Scholar] [CrossRef]
- Sanyal, S.; Andrew, R.; Graham-Durand, D.; Kotbagi, S.; Austrie, B. Keratinizing Squamous Cell Carcinoma Masquerading as Basal Cell Carcinoma Constituting a Diagnostic Pitfall: A Case Report with Etiopathogenetic Discourse and Mohs Micrographic Surgical Management. Cureus 2026, 18, e104371. [Google Scholar] [CrossRef] [PubMed]
- Akram, T.; Junejo, R.; Alsuhaibani, A.; Rafiullah, M.; Akram, A.; Almujally, N.A. Precision in Dermatology: Developing an Optimal Feature Selection Framework for Skin Lesion Classification. Diagnostics 2023, 13, 2848. [Google Scholar] [CrossRef]
- Alruwaili, M.; Mohamed, M. An integrated deep learning model with EfficientNet and ResNet for accurate multi-class skin disease classification. Diagnostics 2025, 15, 551. [Google Scholar] [CrossRef] [PubMed]
- Rashad, N.M.; Abdelnapi, N.M.; Seddik, A.F.; Sayedelahl, M. Automating skin cancer screening: A deep learning. J. Eng. Appl. Sci. 2025, 72, 6. [Google Scholar] [CrossRef]
- Gabani, V.; Navamani, T.; Shyamala, K.; Rajpal, V.K.V. Multimodal skin lesion classification for early cancer diagnosis using deep learning. Front. Physiol. 2026, 17, 1717517. [Google Scholar] [CrossRef]
- Rajpara, S.; Botello, A.; Townend, J.; Ormerod, A. Systematic review of dermoscopy and digital dermoscopy/artificial intelligence for the diagnosis of melanoma. Br. J. Dermatol. 2009, 161, 591–604. [Google Scholar] [CrossRef]
- Akram, T.; Alsuhaibani, A.; Khan, M.A.; Khan, S.U.; Naqvi, S.R.; Bilal, M. Dermo-Optimizer: Skin Lesion Classification Using Information-Theoretic Deep Feature Fusion and Entropy-Controlled Binary Bat Optimization. Int. J. Imaging Syst. Technol. 2024, 34, e23172. [Google Scholar] [CrossRef]
- Pixels to Classes: Intelligent Learning Framework for Multiclass Skin Lesion Localization and Classification. Comput. Electr. Eng. 2021, 90, 106956. [CrossRef]
- Khan, M.A.; Sharif, M.; Akram, T.; Bukhari, S.A.C.; Nayak, R.S. Developed Newton-Raphson based deep features selection framework for skin lesion recognition. Pattern Recognit. Lett. 2020, 129, 293–303. [Google Scholar] [CrossRef]
- Khan, M.A.; Muhammad, K.; Sharif, M.; Akram, T.; Albuquerque, V.H.C.d. Multi-Class Skin Lesion Detection and Classification via Teledermatology. IEEE J. Biomed. Health Inform. 2021, 25, 4267–4275. [Google Scholar]
- Malik, S.; Akram, T.; Ashraf, I.; Rafiullah, M.; Ullah, M.; Tanveer, J. A Hybrid Preprocessor DE-ABC for Efficient Skin-Lesion Segmentation with Improved Contrast. Diagnostics 2022, 12, 2625. [Google Scholar]
- Roman, A.; Rahman, M.M.; Haider, S.A.; Akram, T.; Naqvi, S.R. Integrating Feature Selection and Deep Learning: A Hybrid Approach for Smart Agriculture Applications. Algorithms 2025, 18, 222. [Google Scholar] [CrossRef]
- Khan, M.A.; Akram, T.; Sharif, M.; Saba, T.; Javed, K.; Lali, I.U.; Tanik, U.J.; Rehman, A. Construction of saliency map and hybrid set of features for efficient segmentation and classification of skin lesion. Microsc. Res. Tech. 2019, 82, 741–763. [Google Scholar] [CrossRef] [PubMed]
- Khan, M.A.; Sharif, M.; Akram, T.; Damaševičius, R.; Maskeliūnas, R. Skin Lesion Segmentation and Multiclass Classification Using Deep Learning Features and Improved Moth Flame Optimization. Diagnostics 2021, 11, 811. [Google Scholar] [CrossRef] [PubMed]
- Khan, M.A.; Akram, T.; Zhang, Y.D.; Sharif, M. Attributes based skin lesion detection and recognition: A mask RCNN and transfer learning-based deep learning framework. Pattern Recognit. Lett. 2021, 143, 58–66. [Google Scholar] [CrossRef]
- Xie, Y.; Zhang, J.; Xia, Y.; Shen, C. A Mutual Bootstrapping Model for Automated Skin Lesion Segmentation and Classification. IEEE Trans. Med. Imaging 2020, 39, 2482–2493. [Google Scholar] [CrossRef]
- Mehta, D.; Primiero, C.; Betz-Stablein, B.; Nguyen, T.D.; Gal, Y.; Bowling, A.; Haskett, M.; Sashindranath, M.; Bonnington, P.; Mar, V.; et al. Multi-task AI models in dermatology: Overcoming critical clinical translation challenges for enhanced skin lesion diagnosis. J. Eur. Acad. Dermatol. Venereol. 2025, 39, 2121–2133. [Google Scholar] [CrossRef]
- Miglani, V.; Bhatia, M. Skin lesion classification: A transfer learning approach using efficientnets. In Advanced Machine Learning Technologies and Applications: Proceedings of AMLTA 2020; Springer: Singapore, 2020; pp. 315–324. [Google Scholar]
- Cano, E.; Mendoza-Avilés, J.; Areiza, M.; Guerra, N.; Mendoza-Valdés, J.L.; Rovetto, C.A. Multi skin lesions classification using fine-tuning and data-augmentation applying NASNet. PeerJ Comput. Sci. 2021, 7, e371. [Google Scholar] [PubMed]
- Aziz, S.; Bilal, M.; Khan, M.U.; Amjad, F. Deep learning-based automatic morphological classification of leukocytes using blood smears. In Proceedings of the 2020 International Conference on Electrical, Communication, and Computer Engineering (ICECCE), Istanbul, Turkey, 12–13 June 2020; IEEE: New York, NY, USA, 2020; pp. 1–5. [Google Scholar]
- Afza, F.; Sharif, M.; Khan, M.A.; Tariq, U.; Yong, H.S.; Cha, J. Multiclass skin lesion classification using hybrid deep features selection and extreme learning machine. Sensors 2022, 22, 799. [Google Scholar] [CrossRef]
- Aruk, I.; Pacal, I.; Toprak, A.N. A novel hybrid ConvNeXt-based approach for enhanced skin lesion classification. Expert Syst. Appl. 2025, 283, 127721. [Google Scholar] [CrossRef]
- Noman, A.; Beiji, Z.; Zhu, C.; Al-Habib, M.; Al-asri, A. GAMFuse: Graph-based Adaptive Multiscale Feature Fusion for few-shot skin lesion classification. Biomed. Signal Process. Control 2026, 119, 109990. [Google Scholar] [CrossRef]
- Malik, S.; Akram, T.; Awais, M.; Khan, M.A.; Hadjouni, M.; Elmannai, H.; Alasiry, A.; Marzougui, M.; Tariq, U. An Improved Skin Lesion Boundary Estimation for Enhanced-Intensity Images Using Hybrid Metaheuristics. Diagnostics 2023, 13, 1285. [Google Scholar] [CrossRef] [PubMed]
- Malik, S.; Islam, S.R.; Akram, T.; Naqvi, S.R.; Alghamdi, N.S.; Baryannis, G. A novel hybrid meta-heuristic contrast stretching technique for improved skin lesion segmentation. Comput. Biol. Med. 2022, 151, 106222. [Google Scholar] [CrossRef] [PubMed]
- Optimized Binary Bat algorithm for classification of white blood cells. Measurement 2019, 143, 180–190. [CrossRef]
- Ahmed, B.; Akram, T.; Naqvi, S.R.; Alsuhaibani, A.; Altherwy, Y.N.; Masud, U. A Novel Deep Learning Framework with Meta-Heuristic Feature Selection for Enhanced Remote Sensing Image Classification. IEEE Access 2024, 12, 91974–91998. [Google Scholar] [CrossRef]
- Yang, X.S. A New Metaheuristic Bat-Inspired Algorithm. In Nature Inspired Cooperative Strategies for Optimization (NICSO 2010); Springer: Berlin/Heidelberg, Germany, 2010; pp. 65–74. [Google Scholar]
- Szegedy, C.; Ioffe, S.; Vanhoucke, V.; Alemi, A. Inception-v4, inception-resnet and the impact of residual connections on learning. In Proceedings of the AAAI Conference on Artificial Intelligence, San Francisco, CA, USA, 4–9 February 2017; Volume 31. [Google Scholar]
- Iandola, F.; Moskewicz, M.; Karayev, S.; Girshick, R.; Darrell, T.; Keutzer, K. Densenet: Implementing efficient convnet descriptor pyramids. arXiv 2014, arXiv:1404.1869. [Google Scholar]
- Xie, S.; Girshick, R.; Dollár, P.; Tu, Z.; He, K. Aggregated residual transformations for deep neural networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA, 21–26 July 2017; IEEE: New York, NY, USA, 2017; pp. 1492–1500. [Google Scholar]
- Li, Y.; Li, T.; Liu, H. Recent advances in feature selection and its applications. Knowl. Inf. Syst. 2017, 53, 551–577. [Google Scholar] [CrossRef]
- Mirjalili, S.; Lewis, A. The Whale Optimization Algorithm. Adv. Eng. Softw. 2016, 95, 51–67. [Google Scholar] [CrossRef]
- Hasan, M.K.; Elahi, M.T.E.; Alam, M.A.; Jawad, M.T.; Martí, R. DermoExpert: Skin lesion classification using a hybrid convolutional neural network through segmentation, transfer learning, and augmentation. Inform. Med. Unlocked 2022, 28, 100819. [Google Scholar] [CrossRef]
- Alenezi, F.; Armghan, A.; Polat, K. Wavelet transform based deep residual neural network and ReLU based Extreme Learning Machine for skin lesion classification. Expert Syst. Appl. 2023, 213, 119064. [Google Scholar] [CrossRef]
- Akram, T.; Lodhi, H.M.J.; Naqvi, S.R.; Naeem, S.; Alhaisoni, M.; Ali, M.; Haider, S.A.; Qadri, N.N. A multilevel features selection framework for skin lesion classification. Hum.-Centric Comput. Inf. Sci. 2020, 10, 12. [Google Scholar] [CrossRef]
- Haque, M.M.; Akter, R.; Akib, A.; Hasib, A. A Deep Learning Approach for Automated Skin Lesion Diagnosis with Explainable AI. arXiv 2026, arXiv:2601.00964. [Google Scholar] [CrossRef]
- Padhy, S.; Dash, S.; Kumar, N.; Singh, S.P.; Kumar, G.; Moral, P. Temporal integration of ResNet features with LSTM for enhanced skin lesion classification. Results Eng. 2025, 25, 104201. [Google Scholar] [CrossRef]
- Song, L.; Wang, H.; Wang, Z.J. Decoupling multi-task causality for improved skin lesion segmentation and classification. Pattern Recognit. 2024, 133, 108995. [Google Scholar] [CrossRef]
- Alhudhaif, A.; Almaslukh, B.; Aseeri, A.O.; Guler, O.; Polat, K. A novel nonlinear automated multi-class skin lesion detection system using soft-attention based convolutional neural networks. Chaos Solitons Fractals 2023, 170, 113409. [Google Scholar] [CrossRef]
- Benyahia, S.; Meftah, B.; Lézoray, O. Multi-features extraction based on deep learning for skin lesion classification. Tissue Cell 2022, 74, 101701. [Google Scholar] [CrossRef]
- Nakai, K.; Chen, Y.W.; Han, X.H. Enhanced deep bottleneck transformer model for skin lesion classification. Biomed. Signal Process. Control 2022, 78, 103997. [Google Scholar] [CrossRef]
- Ding, S.; Wu, Z.; Zheng, Y.; Liu, Z.; Yang, X.; Yang, X.; Yuan, G.; Xie, J. Deep attention branch networks for skin lesion classification. Comput. Methods Programs Biomed. 2021, 212, 106447. [Google Scholar] [CrossRef] [PubMed]
- Calderón, C.; Sanchez, K.; Castillo, S.; Arguello, H. BILSK: A bilinear convolutional neural network approach for skin lesion classification. Comput. Methods Programs Biomed. Update 2021, 1, 100036. [Google Scholar] [CrossRef]
- Hameed, N.; Shabut, A.M.; Ghosh, M.K.; Hossain, M.A. Multi-class multi-level classification algorithm for skin lesions classification using machine learning techniques. Expert Syst. Appl. 2020, 141, 112961. [Google Scholar] [CrossRef]











| Dataset | Class | Samples | Total | Training | Testing |
|---|---|---|---|---|---|
| PH2 | Benign | 160 | 200 | 160 | 40 |
| Malignant | 40 | ||||
| ISBI-2016 | Benign | 1006 | 1279 | 1023 | 256 |
| Malignant (Melanoma) | 273 | ||||
| ISIC-2017 | Nevus | 1372 | 2000 | 1600 | 400 |
| Melanoma | 374 | ||||
| Seborrheic Keratosis | 254 | ||||
| HAM10000 | Melanocytic Nevi (nv) | 6705 | 10,015 | 8012 | 2003 |
| Melanoma (mel) | 1113 | ||||
| Benign Keratosis-like Lesions (bkl) | 1099 | ||||
| Basal Cell Carcinoma (bcc) | 514 | ||||
| Actinic Keratoses (akiec) | 327 | ||||
| Vascular Lesions (vasc) | 142 | ||||
| Dermatofibroma (df) | 115 |
| Algorithm | Status | Dataset | IoU (%) | F1-Score (%) |
|---|---|---|---|---|
| Bat Framework | Without Preprocessor | PH2 | 85.4 | 93.1 |
| ISBI-2016 | 85.2 | 92.0 | ||
| ISIC-2017 | 86.2 | 92.6 | ||
| With Preprocessor | PH2 | 86.2 | 93.3 | |
| ISBI-2016 | 87.3 | 93.9 | ||
| ISIC-2017 | 87.9 | 93.7 |
| Algorithm | Status | Dataset | IoU (%) | F1-Score (%) |
|---|---|---|---|---|
| Bat Framework | Without Preprocessor | PH2 | 86.2 | 93.8 |
| ISBI-2016 | 85.8 | 92.4 | ||
| ISIC-2017 | 89.2 | 94.3 | ||
| With Preprocessor | PH2 | 87.5 | 94.2 | |
| ISBI-2016 | 87.7 | 93.6 | ||
| ISIC-2017 | 88.9 | 94.1 |
| Parameter | Value | Parameter | Value |
|---|---|---|---|
| Training function | SGD | Stride | 1 |
| Activation Function | ReLu | Execution Environment | Auto |
| Mini Batch Size | 32 | Max. Epochs | 150 |
| Validation Frequency | 15 | Learning parameter | 1 × 10−4 |
| Loss function | Cross Entropy | DropOut Rate | 0.1 |
| Vector Fusion | Input Dimension | Output Dimension | Red. Percentage (%) |
|---|---|---|---|
| PH2 | |||
| 140 × 3456 | 140 × 592 | 83 | |
| 140 × 2592 | 140 × 612 | 77 | |
| 140 × 2976 | 140 × 571 | 81 | |
| 140 × 4512 | 140 × 970 | 79 | |
| ISBI-2016 | |||
| 900 × 3456 | 900 × 795 | 77 | |
| 900 × 2592 | 900 × 804 | 69 | |
| 900 × 2976 | 900 × 803 | 73 | |
| 900 × 4512 | 900 × 948 | 79 | |
| ISIC-2017 | |||
| 1400 × 3456 | 1400 × 1106 | 68 | |
| 1400 × 2592 | 1400 × 1011 | 61 | |
| 1400 × 2976 | 1400 × 1071 | 64 | |
| 1400 × 4512 | 1354 × 925 | 70 | |
| HAM10000 | |||
| 7000 × 3456 | 7000 × 1210 | 65 | |
| 7000 × 2592 | 7000 × 1063 | 59 | |
| 7000 × 2976 | 7000 × 1161 | 61 | |
| 7000 × 4512 | 1400 × 1489 | 67 | |
| Vector Fusion | OA (%) | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Simple Feature Fusion | Proposed Framework | |||||||||
| TL-NN | Q-SVM | M-NN | ESD | W-KNN | TL-NN | Q-SVM | M-NN | ESD | W-KNN | |
| PH2 | ||||||||||
| 95.16 | 96.37 | 91.41 | 92.16 | 93.64 | 96.00 | 97.00 | 90.10 | 91.15 | 96.20 | |
| 89.16 | 90.26 | 87.18 | 94.12 | 93.17 | 95.16 | 93.25 | 91.26 | 91.20 | 93.66 | |
| 81.20 | 88.61 | 82.00 | 87.21 | 83.00 | 91.62 | 89.40 | 87.60 | 93.76 | 92.11 | |
| 93.16 | 92.00 | 94.62 | 97.75 | 93.17 | 98.60 * | 95.20 | 98.05 | 96.60 | 91.38 | |
| ISBI-2016 | ||||||||||
| 87.10 | 81.64 | 90.40 | 93.45 | 86.15 | 90.18 | 88.70 | 90.64 | 88.42 | 92.18 | |
| 90.65 | 87.28 | 90.45 | 86.58 | 89.45 | 94.56 | 92.95 | 88.67 | 91.48 | 82.64 | |
| 83.16 | 78.16 | 81.40 | 83.10 | 84.25 | 84.55 | 89.70 | 88.70 | 84.62 | 84.55 | |
| 91.30 | 94.55 | 90.52 | 92.18 | 88.64 | 92.88 | 96.25 * | 93.78 | 94.17 | 94.85 | |
| ISIC-2017 | ||||||||||
| 87.34 | 81.50 | 88.64 | 84.62 | 83.62 | 87.20 | 93.86 | 87.52 | 88.50 | 88.34 | |
| 88.50 | 83.50 | 83.78 | 87.52 | 88.64 | 95.55 | 91.64 | 90.42 | 89.56 | 92.24 | |
| 71.80 | 67.25 | 77.90 | 79.65 | 83.40 | 83.17 | 79.30 | 80.52 | 78.64 | 83.89 | |
| 88.42 | 85.49 | 93.68 | 90.47 | 91.63 | 93.67 | 91.42 | 95.85 * | 91.88 | 94.50 | |
| HAM10000 | ||||||||||
| 82.10 | 84.34 | 80.62 | 87.91 | 86.42 | 88.76 | 88.59 | 90.13 | 83.64 | 82.33 | |
| 84.88 | 86.48 | 87.98 | 83.14 | 84.48 | 83.64 | 85.56 | 81.46 | 88.59 | 83.64 | |
| 81.33 | 78.64 | 80.64 | 65.47 | 68.55 | 77.48 | 71.66 | 81.69 | 78.41 | 65.23 | |
| 88.56 | 81.42 | 94.16 | 90.16 | 79.47 | 94.01 | 96.03 * | 93.62 | 93.91 | 94.17 | |
| Classifier | Dataset | Performance Measure | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| I | II | III | IV | Accuracy (%) | Sen | Spe | FNR | FPR | F1 | |
| Q-SVM | ✓ | 97.01 | 0.970 | 0.973 | 0.030 | 0.029 | 0.971 | |||
| ✓ | 96.25 | 0.944 | 0.982 | 0.055 | 0.017 | 0.963 | ||||
| ✓ | 93.86 | 0.935 | 0.941 | 0.064 | 0.058 | 0.938 | ||||
| ✓ | 96.03 | 0.960 | 0.993 | - | - | 0.960 | ||||
| TL-NN | ✓ | 98.60 | 1.000 | 0.971 | 0.000 | 0.028 | 0.985 | |||
| ✓ | 94.56 | 0.939 | 0.959 | 0.060 | 0.048 | 0.945 | ||||
| ✓ | 95.55 | 0.957 | 0.953 | 0.042 | 0.046 | 0.955 | ||||
| ✓ | 94.01 | 0.940 | 0.990 | - | - | 0.940 | ||||
| M-NN | ✓ | 98.05 | 0.976 | 0.984 | 0.023 | 0.015 | 0.980 | |||
| ✓ | 93.77 | 0.934 | 0.940 | 0.065 | 0.059 | 0.938 | ||||
| ✓ | 95.85 | 0.956 | 0.960 | 0.043 | 0.039 | 0.958 | ||||
| ✓ | 94.16 | 94.17 | 0.990 | - | - | 0.941 | ||||
| ESD | ✓ | 97.75 | 0.975 | 0.979 | 0.024 | 0.020 | 0.977 | |||
| ✓ | 94.18 | 0.935 | 0.947 | 0.064 | 0.052 | 0.942 | ||||
| ✓ | 93.68 | 0.933 | 0.939 | 0.066 | 0.060 | 0.937 | ||||
| ✓ | 93.91 | 0.927 | 0.989 | - | - | 0.939 | ||||
| W-KNN | ✓ | 96.20 | 0.964 | 0.959 | 0.035 | 0.040 | 0.961 | |||
| ✓ | 94.85 | 0.938 | 0.959 | 0.061 | 0.040 | 0.949 | ||||
| ✓ | 94.50 | 0.949 | 0.940 | 0.050 | 0.059 | 0.944 | ||||
| ✓ | 94.17 | 0.940 | 0.990 | - | - | 0.940 | ||||
| Dataset | Backbone/Configuration | Performance Metrics | |||
|---|---|---|---|---|---|
| Accuracy (%) | Sensitivity | Specificity | F1-Score | ||
| ISBI-2016 | DenseNet-201 | ||||
| Baseline Model | 89.50 | 0.878 | 0.902 | 0.886 | |
| Baseline + Enhancement | 92.40 | 0.910 | 0.931 | 0.918 | |
| Inception-ResNet v2 | |||||
| Baseline Model | 88.80 | 0.869 | 0.895 | 0.879 | |
| Baseline + Enhancement | 91.60 | 0.902 | 0.923 | 0.908 | |
| NASNet-Mobile | |||||
| Baseline Model | 86.90 | 0.845 | 0.878 | 0.855 | |
| Baseline + Enhancement | 89.80 | 0.881 | 0.904 | 0.889 | |
| Proposed Framework | 96.25 | 0.944 | 0.982 | 0.963 | |
| ISIC-2017 | DenseNet-201 | ||||
| Baseline Model | 88.40 | 0.865 | 0.893 | 0.874 | |
| Baseline + Enhancement | 91.20 | 0.898 | 0.920 | 0.906 | |
| Inception-ResNet v2 | |||||
| Baseline Model | 87.70 | 0.858 | 0.886 | 0.868 | |
| Baseline + Enhancement | 90.60 | 0.890 | 0.914 | 0.898 | |
| NASNet-Mobile | |||||
| Baseline Model | 85.50 | 0.834 | 0.866 | 0.845 | |
| Baseline + Enhancement | 88.50 | 0.869 | 0.895 | 0.877 | |
| Proposed Framework | 95.85 | 0.956 | 0.960 | 0.956 | |
| Source | SS | df | MS | F-Statistic | Prob > F |
|---|---|---|---|---|---|
| Between Classifiers | 9.6274 | 2 | 4.8137 | 23.1046 | 0.0001 |
| Within Classifiers (Error) | 2.5001 | 12 | 0.2083 | – | – |
| Total | 12.1275 | 14 | – | – | – |
| Source | SS | df | MS | F-Statistic | Prob > F |
|---|---|---|---|---|---|
| Between Classifiers | 15.6484 | 2 | 7.8242 | 55.8885 | <0.0001 |
| Within Classifiers (Error) | 1.6800 | 12 | 0.1400 | – | – |
| Total | 17.3284 | 14 | – | – | – |
| Source | SS | df | MS | F-Statistic | Prob > F |
|---|---|---|---|---|---|
| Between Classifiers | 39.5290 | 2 | 19.7645 | 42.2048 | <0.0001 |
| Within Classifiers (Error) | 5.6196 | 12 | 0.4683 | – | – |
| Total | 45.1486 | 14 | – | – | – |
| Source | SS | df | MS | F-Statistic | Prob > F |
|---|---|---|---|---|---|
| Between Classifiers | 14.3080 | 2 | 7.1540 | 17.6766 | 0.0003 |
| Within Classifiers (Error) | 4.8566 | 12 | 0.4047 | – | – |
| Total | 19.1646 | 14 | – | – | – |
| Author (Year) | Problem Type | Dataset | Accuracy |
|---|---|---|---|
| Haque et al. [39] (2026) | Classification | HAM10000 | 91.15% |
| Padhy et al. [40] (2025) | - | HAM10000 | 94.23% |
| Aruk et al. [24] (2025) | - | HAM10000 | 94.30% |
| Hu et al. [41] (2024) | - | ISIC-2017 | 94.0% |
| Song et al. [41] (2023) | - | ISIC-2017 | 95.6% |
| Alenezi et al. [37] (2023) | - | HAM10000 | 95.73% |
| Alhudhaif et al. [42] (2023) | - | HAM10000 | 95.94% |
| Benyahia et al. [43] (2022) | - | PH2 | 98.70% |
| Nakai et al. [44] (2022) | - | ISIC-2017, HAM10000 | 92.1%, 95.84% |
| M.K.Hasan et al. [36] (2022) | - | ISBI-2016 & ISIC-2017 | 92% & 93.1% |
| Ding et al. [45] (2021) | - | ISIC-2017 | 92.2 (AUC)% |
| Calderón et al. [46] (2021) | - | HAM10000 | 93.21% |
| Khan et al. [11] (2020) | - | ISBI-2016 & ISIC-2017 | 94.5% & 93.4% |
| Hameed et al. [47] (2020) | - | ISBI-2016 | 96.15% |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Alabduljabbar, A.; Akram, T.; Altherwy, Y.N.; Akram, M.A.; Ashraf, I. XAI-MedNet: A Next-Generation Explainable AI Framework for Contrast-Enhanced Skin Lesion Classification via Entropy-Controlled Optimization. Bioengineering 2026, 13, 506. https://doi.org/10.3390/bioengineering13050506
Alabduljabbar A, Akram T, Altherwy YN, Akram MA, Ashraf I. XAI-MedNet: A Next-Generation Explainable AI Framework for Contrast-Enhanced Skin Lesion Classification via Entropy-Controlled Optimization. Bioengineering. 2026; 13(5):506. https://doi.org/10.3390/bioengineering13050506
Chicago/Turabian StyleAlabduljabbar, Abdulrahman, Tallha Akram, Youssef N. Altherwy, Muhammad Adeel Akram, and Imran Ashraf. 2026. "XAI-MedNet: A Next-Generation Explainable AI Framework for Contrast-Enhanced Skin Lesion Classification via Entropy-Controlled Optimization" Bioengineering 13, no. 5: 506. https://doi.org/10.3390/bioengineering13050506
APA StyleAlabduljabbar, A., Akram, T., Altherwy, Y. N., Akram, M. A., & Ashraf, I. (2026). XAI-MedNet: A Next-Generation Explainable AI Framework for Contrast-Enhanced Skin Lesion Classification via Entropy-Controlled Optimization. Bioengineering, 13(5), 506. https://doi.org/10.3390/bioengineering13050506

