This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Open AccessArticle
An Explainable Artificial Intelligence Framework for the Classification of Pumpkin Seed Varieties (Cucurbita pepo L.) Using Morphological Features
1
Department of Biosystems Engineering, Gorgan University of Agricultural Sciences and Natural Resources, Gorgan 49189-16573, Iran
2
Independent Researcher, Tehran 31976-46813, Iran
3
Department of Biosystems Engineering, University of Mohaghegh Ardabili, Ardabil 56199-11367, Iran
4
Department of Computer Science and Systems, Universidad de Murcia, 30100 Murcia, Spain
5
Agricultural and Marine Engineering Research Group, Polytechnic University of Cartagena, 30203 Cartagena, Spain
*
Author to whom correspondence should be addressed.
Appl. Sci. 2026, 16(12), 5958; https://doi.org/10.3390/app16125958 (registering DOI)
Submission received: 10 May 2026
/
Revised: 8 June 2026
/
Accepted: 10 June 2026
/
Published: 12 June 2026
Abstract
Accurate automatic classification of seed varieties is important for seed sorting, quality assurance, and plant breeding, yet reliable discrimination remains difficult when cultivars exhibit highly similar visual characteristics. This study presents a reproducible and interpretable framework for the binary classification of two Turkish pumpkin seed varieties using tabular morphological descriptors extracted from segmented seed images. Unlike many previous machine learning studies in this domain, which offer limited interpretability and leave model decisions largely as a black box, the proposed approach places Explainable Artificial Intelligence (XAI) at the center of the analysis. The framework combines biologically meaningful feature engineering, Optuna-based hyperparameter optimization, repeated stratified cross-validation, and a comparative evaluation of XGBoost, LightGBM, and CatBoost. Model explainability was investigated using SHapley Additive exPlanations (SHAP) to identify the morphological traits driving both global and instance-level predictions, while corrected repeated k-fold t-tests were used to assess the statistical significance of performance differences, which confirmed comparable accuracy among the three boosting models and a significant advantage over the baseline classifiers. All three boosting ensembles consistently outperformed the baseline classifiers (SVM, Logistic Regression, and Random Forest) on the hold-out test set. CatBoost achieved the best overall results, with an accuracy of 0.888, an F1-score of 0.879, and an MCC of 0.777. SHAP analysis consistently highlighted compactness, roundness, eccentricity, and engineered interaction descriptors as the most influential predictors. Overall, the proposed XAI-driven framework provides an accurate and transparent solution for pumpkin seed classification.
Share and Cite
MDPI and ACS Style
Sabzi, S.; Daliran, O.; Pourdarbani, R.; García-Mateos, G.; Molina-Martínez, J.M.
An Explainable Artificial Intelligence Framework for the Classification of Pumpkin Seed Varieties (Cucurbita pepo L.) Using Morphological Features. Appl. Sci. 2026, 16, 5958.
https://doi.org/10.3390/app16125958
AMA Style
Sabzi S, Daliran O, Pourdarbani R, García-Mateos G, Molina-Martínez JM.
An Explainable Artificial Intelligence Framework for the Classification of Pumpkin Seed Varieties (Cucurbita pepo L.) Using Morphological Features. Applied Sciences. 2026; 16(12):5958.
https://doi.org/10.3390/app16125958
Chicago/Turabian Style
Sabzi, Sajad, Omid Daliran, Raziyeh Pourdarbani, Ginés García-Mateos, and José Miguel Molina-Martínez.
2026. "An Explainable Artificial Intelligence Framework for the Classification of Pumpkin Seed Varieties (Cucurbita pepo L.) Using Morphological Features" Applied Sciences 16, no. 12: 5958.
https://doi.org/10.3390/app16125958
APA Style
Sabzi, S., Daliran, O., Pourdarbani, R., García-Mateos, G., & Molina-Martínez, J. M.
(2026). An Explainable Artificial Intelligence Framework for the Classification of Pumpkin Seed Varieties (Cucurbita pepo L.) Using Morphological Features. Applied Sciences, 16(12), 5958.
https://doi.org/10.3390/app16125958
Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details
here.
Article Metrics
Article Access Statistics
For more information on the journal statistics, click
here.
Multiple requests from the same IP address are counted as one view.