Precise Discrimination Between Rape Honey and Acacia Honey Based on Sugar and Amino Acid Profiles Combined with Machine Learning
Abstract
1. Introduction
2. Materials and Methods
2.1. Chemicals and Reagents
2.2. Sample Preparation and Derivatization
2.3. UPLC-MS/MS Analysis Conditions
2.4. Determination of Sugar Content
2.5. Machine Learning Modeling and Analysis
2.5.1. Data Preprocessing
2.5.2. Unsupervised Dimensionality Reduction and Visualization
2.5.3. Classification Models
2.5.4. Evaluation Criteria
2.5.5. Interpretable Machine Learning
2.6. Statistical Analysis
3. Results and Discussion
3.1. Sample Information and Source Description
3.2. Amino Acid and Sugar Composition Analysis
3.3. Machine Learning Analysis
3.4. Interpretability Analysis
3.5. Web App Development
4. Conclusions
Supplementary Materials
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Zhang, Y.; Zeng, X.J.; Ma, T.C.; Zhang, D.D.; Wu, T.; Zhao, H.A.; Cheng, N.; Cao, W. Identification of unique peptide markers for rape (Brassica napus L.) honey with untargeted and targeted proteomics approaches and its application in honey adulteration analysis. Food Chem. 2025, 483, 144256. [Google Scholar] [CrossRef]
- Mosić, M.D.; Trifković, J.Đ.; Ristivojević, P.M.; Milojković-Opsenica, D.M. Quality Assessment of Bee Pollen-Honey Mixtures Using Thin-Layer Chromatography in Combination with Chemometrics. Chem. Biodivers. 2023, 20, e202201141. [Google Scholar] [CrossRef]
- Ali, H.; Rafique, K.; Ullah, R.; Saleem, M.; Ahmad, I. Classification of Sidr Honey and Detection of Sugar Adulteration Using Right Angle Fluorescence Spectroscopy and Chemometrics. Eur. Food Res. Technol. 2022, 248, 1823–1829. [Google Scholar] [CrossRef]
- Yan, S.; Yuan, Y.; Pan, F.; Mu, G.; Xu, H.; Xue, X. Distinguishing the botanical origins of rare honey through untargeted metabolomics and machine learning interpreting flavonoid profiles. Food Chem. 2025, 470, 142752. [Google Scholar] [CrossRef] [PubMed]
- Hao, Y.C.; Zhang, Z.H.; Luo, E.X.; Yang, J.; Wang, S.C. Plant metabolomics: Applications and challenges in the era of multi-omics big data. Abiotech 2025, 6, 116–132. [Google Scholar] [CrossRef]
- da Silva, P.M.; Gauche, C.; Gonzaga, L.V.; Costa, A.C.O.; Fett, R. Honey: Chemical composition, stability and authenticity. Food Chem. 2016, 196, 309–323. [Google Scholar] [CrossRef] [PubMed]
- Ding, J.; Li, B.; Xu, C.; Qiao, Y.; Zhang, L. Diagnosing Crop Diseases Based on Domain-Adaptive Pre-Training BERT of Electronic Medical Records. Appl. Intell. 2023, 53, 15979–15992. [Google Scholar] [CrossRef]
- Liu, Z.; Xu, T.; Zhou, J.; Chen, L. Application of Stable Isotopic and Elemental Composition Combined with Random Forest Algorithm for the Botanical Classification of Chinese Honey. J. Food Compos. Anal. 2022, 110, 104565. [Google Scholar] [CrossRef]
- Mateo, F.; Tarazona, A.; Mateo, E.M. Comparative Study of Several Machine Learning Algorithms for Classification of Unifloral Honeys. Foods 2021, 10, 1543. [Google Scholar] [CrossRef]
- Geană, E.I.; Isopescu, R.; Ciucure, C.T.; Gîjiu, C.L.; Joșceanu, A.M. Honey Adulteration Detection via Ultraviolet–Visible Spectral Investigation Coupled with Chemometric Analysis. Foods 2024, 13, 3630. [Google Scholar] [CrossRef] [PubMed]
- Mara, A.; Migliorini, M.; Ciulu, M.; Chignola, R.; Egido, C.; Núñez, O.; Sentellas, S.; Saurina, J.; Caredda, M.; Deroma, M.A.; et al. Elemental Fingerprinting Combined with Machine Learning Techniques as a Powerful Tool for Geographical Discrimination of Honeys from Nearby Regions. Foods 2024, 13, 243. [Google Scholar] [CrossRef]
- Yong, C.-H.; Muhammad, S.A.; Abd Aziz, F.; Nasir, F.I.; Mustafa, M.Z.; Ibrahim, B.; Kelly, S.D.; Cannavan, A.; Seow, E.-K. Detecting Adulteration of Stingless Bee Honey Using Untargeted 1H NMR Metabolomics with Chemometrics. Food Chem. 2022, 368, 130808. [Google Scholar] [CrossRef]
- Li, Y.; Jin, Y.; Yang, S.; Zhang, W.; Zhang, J.; Zhao, W.; Chen, L.; Wen, Y.; Zhang, Y.; Lu, K.; et al. Strategy for Comparative Untargeted Metabolomics Reveals Honey Markers of Different Floral and Geographic Origins Using Ultrahigh-Performance Liquid Chromatography-Hybrid Quadrupole-Orbitrap Mass Spectrometry. J. Chromatogr. A 2017, 1499, 78–89. [Google Scholar] [CrossRef]
- Zhou, M.; Feng, H.; Liu, J.; Pi, J.; Wang, H.; Zhou, T.; Peng, Q.; Zhang, L. Identification of the Botanical Source of Honey Based on Optimized SVM Model with Censored Data of ICP-MS. J. Instrum. Anal. 2021, 40, 7. [Google Scholar]
- Yang, J.; Liu, Y.; Cui, Z.; Wang, T.; Liu, T.; Liu, G. Analysis of Free Amino Acid Composition and Honey Plant Species in Seven Honey Species in China. Foods 2024, 13, 1065. [Google Scholar] [CrossRef] [PubMed]
- Cieslak, M.C.; Castelfranco, A.M.; Roncalli, V.; Lenz, P.H.; Hartline, D.K. t-Distributed Stochastic Neighbor Embedding (t-SNE): A Tool for Eco-Physiological Transcriptomic Analysis. Mol. Genom. 2020, 51, 100723. [Google Scholar] [CrossRef] [PubMed]
- Pan, F.; Liu, D.; Tuersuntoheti, T.; Xing, H.; Zhu, Z.; Fang, Y.; Zhao, L.; Zhao, L.; Li, X.; Le, Y.; et al. Mining Anti-Hypertensive Peptides in Animal Food Through Deep Learning: A Case Study of Gastrointestinal Digestive Products of Royal Jelly. Food Sci. Anim. Prod. 2024, 2, 9240053. [Google Scholar] [CrossRef]
- Xu, C.; Zhang, L. Cucumber Diseases Diagnosis Based on Multi-Class SVM and Electronic Medical Record. Neural Comput. Appl. 2023, 36, 4959–4978. [Google Scholar] [CrossRef]
- Ahmadlou, M.; Adeli, H. Enhanced Probabilistic Neural Network with Local Decision Circles: A Robust Classifier. Integr. Comput. Aided Eng. 2010, 17, 197–210. [Google Scholar] [CrossRef]
- Saha, D.; Manickavasagam, A. Machine Learning Techniques for Analysis of Hyperspectral Images to Determine Quality of Food Products: A Review. Curr. Res. Food Sci. 2021, 4, 28–44. [Google Scholar] [CrossRef]
- Dong, L.; Liu, P.; Qi, Z.; Lin, J.; Duan, M. Development and Validation of a Machine-Learning Model for Predicting the Risk of Death in Sepsis Patients with Acute Kidney Injury. Heliyon 2024, 10, e29985. [Google Scholar] [CrossRef] [PubMed]
- Wang, T.; Xu, Y.T.; Liu, X.H. Multi-Task Twin Spheres Support Vector Machine with Maximum Margin for Imbalanced Data Classification. Appl. Intell. 2023, 53, 3318–3335. [Google Scholar] [CrossRef]
- Yang, K.; Zhao, L. A New Optimizing Parameter Approach of LSSVM Multiclass Classification Model. Neural Comput. Appl. 2012, 21, 945–955. [Google Scholar] [CrossRef]
- Tankaria, H.; Sugimoto, S.; Yamashita, N. A Regularized Limited Memory BFGS Method for Large-Scale Unconstrained Optimization and Its Efficient Implementations. Comput. Optim. Appl. 2022, 82, 61–88. [Google Scholar] [CrossRef]
- Esposito, C.; Iachetti, E.; Fabbri, A.; Raggi, L. Data Fusion of FT-NIR Spectroscopy and Vis/NIR Hyperspectral Imaging to Predict Quality Parameters of Yellow Flesh “Jin tao” Kiwifruit. Biosyst. Eng. 2024, 237, 157–169. [Google Scholar]
- Lundberg, S.M.; Lee, S.-I. A Unified Approach to Interpreting Model Predictions. In Proceedings of the 31st International Conference on Neural Information Processing Systems (NIPS’17), Long Beach, CA, USA, 4–9 December 2017; pp. 4768–4778. [Google Scholar]
- Wang, H.R.; Xu, Y.T.; Zhou, Z.J. Ramp Loss KNN-Weighted Multi-Class Twin Support Vector Machine. Soft Comput. 2022, 26, 6591–6618. [Google Scholar] [CrossRef]
- Guo, M.; Wang, K.; Lin, H.; Wang, L.; Cao, L.; Sui, J. Spectral Data Fusion in Nondestructive Detection of Food Products: Strategies, Recent Applications, and Future Perspectives. Compr. Rev. Food Sci. Food Saf. 2024, 23, 13301. [Google Scholar] [CrossRef]
- Guo, T.; Pan, F.; Cui, Z.; Yang, Z.; Chen, Q.; Zhao, L.; Song, H. FAPD: An Astringency Threshold and Astringency Type Prediction Database for Flavonoid Compounds Based on Machine Learning. J. Agric. Food Chem. 2023, 71, 4172–4183. [Google Scholar] [CrossRef]
- Pedregosa, J.; Varoquaux, G.; Gramfort, A.; Michel, V.; Thirion, B.; Grisel, O.; Blondel, M.; Prettenhofer, P.; Weiss, R.; Dubourg, V.; et al. Scikit-Learn: Machine Learning in Python. J. Mach. Learn. Res. 2011, 12, 2825–2830. [Google Scholar]
- Xu, C.; Ding, J.; Qiao, Y.; Zhang, L. Tomato Disease and Pest Diagnosis Method Based on the Stacking of Prescription Data. Comput. Electron. Agric. 2022, 197, 106997. [Google Scholar] [CrossRef]
- Cui, Y.-N.; Yan, S.-J.; Zhang, Y.-N.; Wang, R.; Song, L.-L.; Ma, Y.; Guo, H.; Yang, P.-Z. Physiological, Metabolome and Gene Expression Analyses Reveal the Accumulation and Biosynthesis Pathways of Soluble Sugars and Amino Acids in Sweet Sorghum under Osmotic Stresses. Int. J. Mol. Sci. 2024, 25, 8942. [Google Scholar] [CrossRef]
- Lin, I.W.; Sosso, D.; Chen, L.-Q.; Gase, K.; Kim, S.-G.; Kessler, D.; Klinkenberg, P.M.; Gorder, M.K.; Hou, B.-H.; Qu, X.-Q.; et al. Nectar secretion requires sucrose phosphate synthases and the sugar transporter SWEET9. Nature 2014, 508, 546–549. [Google Scholar] [CrossRef] [PubMed]
- Kalaycıoğlu, Z.; Kaygusuz, H.; Diker, S.; Koçylı, S.; Erim, F.B. Characterization of Turkish Honeybee Pollens by Principal Component Analysis Based on Their Individual Organic Acids, Sugars, Minerals, and Antioxidant Activities. LWT 2017, 84, 402–408. [Google Scholar] [CrossRef]
- da Costa, I.F.; Uria-Toro, M.J. Evaluation of the Antioxidant Capacity of Bioactive Compounds and Determination of Proline in Honeys from Pará. J. Food Sci. Technol. 2021, 58, 1900–1908. [Google Scholar]
- Bertazzini, M.; Forlani, G. Intraspecific Variability of Floral Nectar Volume and Composition in Rape (Brassica napus L. var. oleifera). Front. Plant Sci. 2016, 7, 288. [Google Scholar] [CrossRef]






| Sugar Component | Linear Regression Equation | Correlation Coefficient (R2) |
|---|---|---|
| Fructose | y = 1.29x + 2.40 | 0.9875 |
| Glucose | y = 1.33x + 3.18 | 0.9839 |
| Sucrose | y = 1.53x + 7.59 | 0.9906 |
| Turanose | y = 1.53x + 7.61 | 0.9904 |
| Maltulose | y = 1.69x + 7.69 | 0.9882 |
| Maltose | y = 1.73x + 7.72 | 0.9896 |
| Kojibiose | y = 1.57x + 7.57 | 0.9877 |
| Isomaltose | y = 2.00x + 7.83 | 0.9922 |
| Erlose | y = 1.72x + 7.51 | 0.9890 |
| Melezitose | y = 1.84x + 7.83 | 0.9927 |
| Raffinose | y = 1.63x + 7.48 | 0.9827 |
| Maltotriose | y = 1.69x + 7.51 | 0.9891 |
| Algorithm | Optimal Parameters |
|---|---|
| MLP | activation = sigmoid, Hidden_layer_sizes = (8, 1), solver = lbfgs |
| GaussianNB | var smoothing = 0 |
| KNN | leaf size = 48, metric = manhattan, n neighbors = 6, p = 1, weights = distance |
| Decision Tree | ccp alpha = 0.09, class weight = None, criterion = entropy, max depth = 14, max features = log2, min samples leaf = 1, min samples split = 17, splitter = best |
| LDA | shrinking = True solver = eigen, shrinkage = 0.74, priors = None, alpha = 0.00 |
| 80% Train Set 10cv | MLP | Linear Discriminant | KNeighbors | Decision Tree | GaussianNB |
|---|---|---|---|---|---|
| ACC | 0.998 ± 0.005 | 0.985 ± 0.016 | 0.993 ± 0.008 | 0.982 ± 0.014 | 0.975 ± 0.021 |
| Sn | 0.996 ± 0.010 | 0.976 ± 0.031 | 0.987 ± 0.016 | 0.981 ± 0.022 | 0.987 ± 0.017 |
| Sp | 1 ± 0 | 0.997 ± 0.01 | 1 ± 0 | 0.984 ± 0.017 | 0.966 ± 0.036 |
| MCC | 0.997 ± 0.010 | 0.971 ± 0.031 | 0.987 ± 0.017 | 0.964 ± 0.028 | 0.952 ± 0.04 |
| AUC | 1 ± 0 | 0.997 ± 0.01 | 1 ± 0 | 0.982 ± 0.014 | 0.996 ± 0.01 |
| Precision | 1 ± 0 | 0.997 ± 0.011 | 1 ± 0 | 0.984 ± 0.017 | 0.964 ± 0.039 |
| f1 | 0.999 ± 0.003 | 0.991 ± 0.01 | 0.997 ± 0.004 | 0.983 ± 0.014 | 0.97 ± 0.029 |
| 20% Test Set | MLP | Linear Discriminant | KNeighbors | Decision Tree | GaussianNB |
|---|---|---|---|---|---|
| ACC | 0.991 | 0.991 | 0.982 | 0.945 | 0.991 |
| Sn | 0.976 | 0.976 | 0.953 | 0.907 | 0.976 |
| Sp | 1 | 1 | 1 | 0.97 | 1 |
| MCC | 0.981 | 0.981 | 0.962 | 0.884 | 0.981 |
| AUC | 0.997 | 0.990 | 0.993 | 0.946 | 0.991 |
| Precision | 1 | 1 | 1 | 0.951 | 1 |
| f1 | 0.996 | 0.995 | 0.991 | 0.948 | 0.995 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Sun, C.; Pan, F.; Tian, W.; Cui, Z.; Xue, X.; Xu, Y. Precise Discrimination Between Rape Honey and Acacia Honey Based on Sugar and Amino Acid Profiles Combined with Machine Learning. Foods 2026, 15, 70. https://doi.org/10.3390/foods15010070
Sun C, Pan F, Tian W, Cui Z, Xue X, Xu Y. Precise Discrimination Between Rape Honey and Acacia Honey Based on Sugar and Amino Acid Profiles Combined with Machine Learning. Foods. 2026; 15(1):70. https://doi.org/10.3390/foods15010070
Chicago/Turabian StyleSun, Chenyu, Fei Pan, Wenli Tian, Zongyan Cui, Xiaofeng Xue, and Yitian Xu. 2026. "Precise Discrimination Between Rape Honey and Acacia Honey Based on Sugar and Amino Acid Profiles Combined with Machine Learning" Foods 15, no. 1: 70. https://doi.org/10.3390/foods15010070
APA StyleSun, C., Pan, F., Tian, W., Cui, Z., Xue, X., & Xu, Y. (2026). Precise Discrimination Between Rape Honey and Acacia Honey Based on Sugar and Amino Acid Profiles Combined with Machine Learning. Foods, 15(1), 70. https://doi.org/10.3390/foods15010070

