Prediction of Total Anthocyanin Content in Single-Kernel Maize Using Spectral and Color Space Data Coupled with AutoML
Abstract
1. Introduction
2. Materials and Methods
2.1. Plant Material
2.2. Image Data Extraction and Acquisition of NIR Data
2.3. Total Anthocyanin Content (TAC) Determination
2.4. Datasets Preparation and Data Preprocessing
2.5. Development and Evaluation of Prediction Models
3. Results and Discussion
3.1. Changes in Color Spaces and Spectral Measurements According to Embryo-Up and Embryo-Down Sides
3.2. Evaluation of Prediction Models for Image and Spectral Datasets According to Embryo and Endosperm Side
4. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
Abbreviations
| RGB | Red–Green–Blue |
| HSV | Hue–Saturation–Value |
| NIR | Near-Infrared |
| PLSR | Partial Least Square Regression |
| RMSE | Root Mean Squared Error |
| MAE | Mean Absolute Error |
| MAPE | Mean Absolute Percentage Error |
Appendix A
| No | Kernel | Kernel | Kernel | Kernel | |||
|---|---|---|---|---|---|---|---|
| 1 | ![]() | 11 | ![]() | 21 | ![]() | 31 | ![]() |
| 2 | ![]() | 12 | ![]() | 22 | ![]() | 32 | ![]() |
| 3 | ![]() | 13 | ![]() | 23 | ![]() | 33 | ![]() |
| 4 | ![]() | 14 | ![]() | 24 | ![]() | 34 | ![]() |
| 5 | ![]() | 15 | ![]() | 25 | ![]() | 35 | ![]() |
| 6 | ![]() | 16 | ![]() | 26 | ![]() | 36 | ![]() |
| 7 | ![]() | 17 | ![]() | 27 | ![]() | 37 | ![]() |
| 8 | ![]() | 18 | ![]() | 28 | ![]() | 38 | ![]() |
| 9 | ![]() | 19 | ![]() | 29 | ![]() | 39 | ![]() |
| 10 | ![]() | 20 | ![]() | 30 | ![]() | 40 | ![]() |
References
- Serna-Saldivar, S.O. Corn: Chemistry and Technology; Elsevier: Amsterdam, The Netherlands, 2018; ISBN 0128118865. [Google Scholar]
- Şerment, M.; Kahrıman, F. Mısırda Toplam Fenolik ve Antosiyanin İçeriğinin Belirlenmesi Için Geliştirilmiş NIRS (Yakın Kızıl Ötesi Spektroskopisi) Kalibrasyon Modelleri Üzerine Kemometrik Yöntemlerin Etkisi. J. Adv. Res. Nat. Appl. Sci. 2021, 7, 437–449. [Google Scholar] [CrossRef]
- Suriano, S.; Balconi, C.; Valoti, P.; Redaelli, R. Comparison of Total Polyphenols, Profile Anthocyanins, Color Analysis, Carotenoids and Tocols in Pigmented Maize. LWT 2021, 144, 111257. [Google Scholar] [CrossRef]
- Colombo, R.; Ferron, L.; Papetti, A. Colored Corn: An up-Date on Metabolites Extraction, Health Implication, and Potential Use. Molecules 2021, 26, 199. [Google Scholar] [CrossRef]
- Satué-Gracia, M.T.; Heinonen, M.; Frankel, E.N. Anthocyanins as Antioxidants on Human Low-Density Lipoprotein and Lecithin−Liposome Systems. J. Agric. Food Chem. 1997, 45, 3362–3367. [Google Scholar] [CrossRef]
- Guo, H.; Wu, H.; Sajid, A.; Li, Z. Whole Grain Cereals: The Potential Roles of Functional Components in Human Health. Crit. Rev. Food Sci. Nutr. 2022, 62, 8388–8402. [Google Scholar] [CrossRef] [PubMed]
- Del Pozo-Insfran, D.; Serna Saldivar, S.O.; Brenes, C.H.; Talcott, S.T. Polyphenolics and Antioxidant Capacity of White and Blue Corns Processed into Tortillas and Chips. Cereal Chem. 2007, 84, 162–168. [Google Scholar] [CrossRef]
- Tsuda, T.; Horio, F.; Uchida, K.; Aoki, H.; Osawa, T. Dietary Cyanidin 3-O-Beta-D-Glucoside-Rich Purple Corn Color Prevents Obesity and Ameliorates Hyperglycemia in Mice. J. Nutr. 2003, 133, 2125–2130. [Google Scholar] [CrossRef]
- Li, D.; Zhang, Y.; Liu, Y.; Sun, R.; Xia, M. Purified Anthocyanin Supplementation Reduces Dyslipidemia, Enhances Antioxidant Capacity, and Prevents Insulin Resistance in Diabetic Patients. J. Nutr. 2015, 145, 742–748. [Google Scholar] [CrossRef] [PubMed]
- Mazewski, C.; Liang, K.; Gonzalez de Mejia, E. Inhibitory Potential of Anthocyanin-Rich Purple and Red Corn Extracts on Human Colorectal Cancer Cell Proliferation In Vitro. J. Funct. Foods 2017, 34, 254–265. [Google Scholar] [CrossRef]
- Amanah, H.Z.; Joshi, R.; Masithoh, R.E.; Choung, M.-G.; Kim, K.-H.; Kim, G.; Cho, B.-K. Nondestructive Measurement of Anthocyanin in Intact Soybean Seed Using Fourier Transform Near-Infrared (FT-NIR) and Fourier Transform Infrared (FT-IR) Spectroscopy. Infrared. Phys. Technol. 2020, 111, 103477. [Google Scholar] [CrossRef]
- Mangalvedhe, A.A.; Danao, M.C.; Paulsmeyer, M.; Rausch, K.D.; Singh, V.; Juvik, J.A. Anthocyanin Determination in Different Corn Hybrids Using near Infrared Spectroscopy. In Proceedings of the ASABE Annual International Meeting, New Orleans, LA, USA, 26–29 July 2015; Volume 152181716. [Google Scholar]
- Orman, B.A.; Schumann, R.A. Nondestructive Single-Kernel Oil Determination of Maize by near-Infrared Transmission Spectroscopy. J. Am. Oil. Chem. Soc. 1992, 69, 1036–1038. [Google Scholar] [CrossRef]
- Weinstock, B.A.; Janni, J.; Hagen, L.; Wright, S. Prediction of Oil and Oleic Acid Concentrations in Individual Corn (Zea mays L.) Kernels Using near-Infrared Reflectance Hyperspectral Imaging and Multivariate Analysis. Appl. Spectrosc. 2006, 60, 9–16. [Google Scholar] [CrossRef]
- Jiang, H.Y.; Zhu, Y.J.; Wei, L.M.; Dai, J.R.; Song, T.M.; Yan, Y.L.; Chen, S.J. Analysis of Protein, Starch and Oil Content of Single Intact Kernels by near Infrared Reflectance Spectroscopy (NIRS) in Maize (Zea mays L.). Plant Breed. 2007, 126, 492–497. [Google Scholar] [CrossRef]
- Anirban, A.; Hong, H.T.; O’Hare, T.J. Profiling and Quantification of Anthocyanins in Novel Purple-Pericarp Sweetcorn and Purple-Pericarp Maize. bioRxiv 2022, 2022–2027. [Google Scholar] [CrossRef]
- Baye, T.M.; Pearson, T.C.; Settles, A.M. Development of a Calibration to Predict Maize Seed Composition Using Single Kernel near Infrared Spectroscopy. J. Cereal Sci. 2006, 43, 236–243. [Google Scholar] [CrossRef]
- Kahrıman, F.; Onaç, İ.; Öner, F.; Mert, F.; Egesel, C.Ö. Analysis of Secondary Biochemical Components in Maize Flour Samples by NIR (near Infrared Reflectance) Spectroscopy. J. Food Meas. Charact. 2020, 14, 2320–2332. [Google Scholar] [CrossRef]
- Ajaz, R.H.; Hussain, L. Seed Classification Using Machine Learning Techniques. Seed 2015, 2, 1098–1102. [Google Scholar]
- Kahrıman, F.; Güz, A.M.; Pehlivan, İ. Use of Machine Learning Models-Based Image Analysis for Classification of Haploid and Diploid Maize. Crop Breed. Appl. Biotechnol. 2023, 23, e45322349. [Google Scholar] [CrossRef]
- Duc, N.T.; Ramlal, A.; Rajendran, A.; Raju, D.; Lal, S.K.; Kumar, S.; Sahoo, R.N.; Chinnusamy, V. Image-Based Phenotyping of Seed Architectural Traits and Prediction of Seed Weight Using Machine Learning Models in Soybean. Front. Plant. Sci. 2023, 14, 1206357. [Google Scholar] [CrossRef]
- Barbedo, J.G.A. Plant Disease Identification from Individual Lesions and Spots Using Deep Learning. Biosyst. Eng. 2019, 180, 96–107. [Google Scholar] [CrossRef]
- Piccolo, E.L.; Matteoli, S.; Landi, M.; Guidi, L.; Massai, R.; Remorini, D. Measurements of Anthocyanin Content of Prunus Leaves Using Proximal Sensing Spectroscopy and Statistical Machine Learning. IEEE Trans. Instrum. Meas. 2022, 71, 1–10. [Google Scholar] [CrossRef]
- Miao, H.; Chen, X.; Guo, Y.; Wang, Q.; Zhang, R.; Chang, Q. Estimation of Anthocyanins in Winter Wheat Based on Band Screening Method and Genetic Algorithm Optimization Models. Remote Sens. 2024, 16, 2324. [Google Scholar] [CrossRef]
- Jiang, S.; Chang, Q.; Wang, X.; Zheng, Z.; Zhang, Y.; Wang, Q. Estimation of Anthocyanins in Whole-Fertility Maize Leaves Based on Ground-Based Hyperspectral Measurements. Remote Sens. 2023, 15, 2571. [Google Scholar] [CrossRef]
- Sala, R.; Zambetti, M.G.; Pirola, F.; Pinto, R. How to Select a Suitable Machine Learning Algorithm: A Feature-Based, Scope-Oriented Selection Framework. Summer Sch. Fr. Turco. Proc. 2018, 2018, 87–93. [Google Scholar]
- Ferreira, L.; Pilastri, A.; Martins, C.M.; Pires, P.M.; Cortez, P. A Comparison of AutoML Tools for Machine Learning, Deep Learning and XGBoost. In Proceedings of the 2021 International Joint Conference on Neural Networks (IJCNN), Shenzhen, China, 18–22 July 2021; IEEE: New York, NY, USA, 2021; pp. 1–8. [Google Scholar]
- Feurer, M.; Klein, A.; Eggensperger, K.; Springenberg, J.; Blum, M.; Hutter, F. Efficient and Robust Automated Machine Learning. In Proceedings of the 29th International Conference on Neural Information Processing Systems, Montreal, QC, Canada, 7–12 December 2015. [Google Scholar]
- R Core Team. R: A Language and Environment for Statistical Computing [Software], Version 4.3.1; R foundation for Statistical Computing: Vienna, Austria, 2019.
- Pau, G.; Fuchs, F.; Sklyar, O.; Boutros, M.; Huber, W. EBImage—An R Package for Image Processing with Applications to Cellular Phenotypes. Bioinformatics 2010, 26, 979–981. [Google Scholar] [CrossRef]
- Zeileis, A.; Fisher, J.C.; Hornik, K.; Ihaka, R.; McWhite, C.D.; Murrell, P.; Stauffer, R.; Wilke, C.O. Colorspace: A Toolbox for Manipulating and Assessing Colors and Palettes. J. Stat. Softw. 2020, 96, 1–49. [Google Scholar] [CrossRef]
- Abdel-Aal, E.S.M.; Hucl, P. A Rapid Method for Quantifying Total Anthocyanins in Blue Aleurone and Purple Pericarp Wheats. Cereal Chem. 1999, 76, 350–354. [Google Scholar] [CrossRef]
- Reynkens, T. Rospca: Robust Sparse PCA Using the ROSPCA Algorithm, Version 1.1.1; ROSPCA: Birmingham, UK, 2018; Volume 1.
- Płońska, A.; Płoński, P. Mljar: State-of-the-Art Automated Machine Learning Framework for Tabular Data, Version 0.10; MLJAR, Inc.: Łapy, Poland, 2021; Volume 3.
- Chicco, D.; Warrens, M.J.; Jurman, G. The Coefficient of Determination R-Squared Is More Informative than SMAPE, MAE, MAPE, MSE and RMSE in Regression Analysis Evaluation. PeerJ Comput. Sci. 2021, 7, e623. [Google Scholar] [CrossRef] [PubMed]
- Li, T.; Zhang, W.; Yang, H.; Dong, Q.; Ren, J.; Fan, H.; Zhang, X.; Zhou, Y. Comparative Transcriptome Analysis Reveals Differentially Expressed Genes Related to the Tissue-Specific Accumulation of Anthocyanins in Pericarp and Aleurone Layer for Maize. Sci. Rep. 2019, 9, 2485. [Google Scholar] [CrossRef]
- Van Deynze, A.E.; Pauls, K.P. Seed Colour Assessment in Brassica Napus Using a Near Infrared Reflectance Spectrometer Adapted for Visible Light Measurements. Euphytica 1994, 76, 45–51. [Google Scholar] [CrossRef]
- Abdel-Aal, E.-S.M.; Young, J.C.; Rabalski, I. Anthocyanin Composition in Black, Blue, Pink, Purple, and Red Cereal Grains. J. Agric. Food Chem. 2006, 54, 4696–4704. [Google Scholar] [CrossRef]
- Lopez-Martinez, L.X.; Oliart-Ros, R.M.; Valerio-Alfaro, G.; Lee, C.H.; Parkin, K.L.; Garcia, H.S. Antioxidant Activity, Phenolic Compounds and Anthocyanins Content of Eighteen Strains of Mexican Maize. LWT Food Sci. Technol. 2009, 42, 1187–1192. [Google Scholar] [CrossRef]
- Abdel-Aal, E.M.; Choo, T.; Dhillon, S.; Rabalski, I. Free and Bound Phenolic Acids and Total Phenolics in Black, Blue, and Yellow Barley and Their Contribution to Free Radical Scavenging Capacity. Cereal Chem. 2012, 89, 198–204. [Google Scholar] [CrossRef]
- Žilić, S.; Serpen, A.; Akillioǧlu, G.; Gökmen, V.; Vančetović, J. Phenolic Compounds, Carotenoids, Anthocyanins, and Antioxidant Capacity of Colored Maize (Zea mays L.) Kernels. J. Agric. Food Chem. 2012, 60, 1224–1231. [Google Scholar] [CrossRef]
- Salinas Moreno, Y.; Sánchez, G.S.; Hernández, D.R.; Lobato, N.R. Characterization of Anthocyanin Extracts from Maize Kernels. J. Chromatogr. Sci. 2005, 43, 483–487. [Google Scholar] [CrossRef] [PubMed]
- Breiman, L. Random Forests. Mach. Learn 2001, 45, 5–32. [Google Scholar] [CrossRef]
- Ke, G.; Meng, Q.; Finley, T.; Wang, T.; Chen, W.; Ma, W.; Ye, Q.; Liu, T.-Y. Lightgbm: A Highly Efficient Gradient Boosting Decision Tree. In Proceedings of the 31st International Conference on Neural Information Processing Systems, Long Beach, CA, USA, 4–9 December 2017. [Google Scholar]
- Hjerpe, A. Computing Random Forests Variable Importance Measures (VIM) on Mixed Continuous and Categorical Data. Master’s Thesis, KTH Royal Institute of Technology School of Computer Science and Communication, Stockholm, Sweden, 2016. [Google Scholar]
- Manzoor, M.F.; Hussain, A.; Naumovski, N.; Ranjha, M.M.A.N.; Ahmad, N.; Karrar, E.; Xu, B.; Ibrahim, S.A. A Narrative Review of Recent Advances in Rapid Assessment of Anthocyanins in Agricultural and Food Products. Front. Nutr. 2022, 9, 901342. [Google Scholar] [CrossRef]
- Nankar, A.N.; Scott, M.P.; Pratt, R.C. Compositional Analyses Reveal Relationships among Components of Blue Maize Grains. Plants 2020, 9, 1775. [Google Scholar] [CrossRef] [PubMed]
- Manley, M. Near-Infrared Spectroscopy and Hyperspectral Imaging: Non-Destructive Analysis of Biological Materials. Chem. Soc. Rev. 2014, 43, 8200–8214. [Google Scholar] [CrossRef]
- Morales-Reyes, J.-L.; Aquino-Bolaños, E.-N.; Acosta-Mesa, H.-G.; Márquez-Grajales, A. Estimation of Anthocyanins in Homogeneous Bean Landraces Using Neuroevolution. In Proceedings of the Mexican International Conference on Artificial Intelligence, Merida, Mexico, 13–18 November 2023; Springer: Berlin/Heidelberg, Germany, 2023; pp. 373–384. [Google Scholar]






| The Number of Models | Training Data Size (Row × Column) | ||||
|---|---|---|---|---|---|
| Model | Dataset | Embryo-Up | Embryo-Down | n × p (Embryo-Up) | n × p (Embryo-Down) |
| Classic (PLSR) | HSV | 1 | 1 | 181 × 9 | 185 × 9 |
| LAB | 1 | 1 | 181 × 9 | 185 × 9 | |
| RGB | 1 | 1 | 181 × 9 | 185 × 9 | |
| NIR | 1 | 1 | 181 × 1200 | 185 × 1200 | |
| RGB+HSV+LAB+NIR | 1 | 1 | 181 × 1227 | 185 × 1227 | |
| AutoML | HSV | 193 | 192 | 181 × 9 | 185 × 9 |
| LAB | 196 | 191 | 181 × 9 | 185 × 9 | |
| RGB | 186 | 191 | 181 × 9 | 185 × 9 | |
| NIR | 194 | 203 | 181 × 1200 | 185 × 1200 | |
| RGB+HSV+LAB+NIR | 180 | 182 | 181 × 1227 | 185 × 1227 | |
| Total | 954 | 964 | |||
| Model Dataset | n | Mean | STD 1 | Min | Max |
|---|---|---|---|---|---|
| Embryo-Up | 181 | 29.97 | 34.86 | 5.53 | 184.67 |
| Embryo-Down | 185 | 28.82 | 34.19 | 5.53 | 184.63 |
| Side | Model | Dataset | MAE | RMSE | MSE | R2 | MAPE |
|---|---|---|---|---|---|---|---|
| Embryo-Up | Classic (PLSR) | HSV | 19.00 | 27.15 | 737.24 | 0.39 | 1.15 |
| LAB | 20.09 | 28.79 | 828.79 | 0.31 | 1.20 | ||
| RGB | 20.43 | 28.82 | 830.40 | 0.31 | 1.47 | ||
| NIR | 23.81 | 32.95 | 1086.00 | 0.10 | 1.65 | ||
| RGB+HSV+LAB+NIR | 19.76 | 28.58 | 816.72 | 0.32 | 1.17 | ||
| Mean | 20.62 | 29.26 | 859.83 | 0.29 | 1.33 | ||
| The Best (Ensemble) | HSV | 11.38 | 18.60 | 346.13 | 0.71 | 0.46 | |
| LAB | 11.52 | 20.25 | 410.24 | 0.66 | 0.50 | ||
| RGB | 10.34 | 16.37 | 268.11 | 0.78 | 0.52 | ||
| NIR | 18.35 | 25.37 | 643.80 | 0.47 | 1.26 | ||
| RGB+HSV+LAB+NIR | 10.79 | 18.23 | 332.46 | 0.72 | 0.51 | ||
| Mean | 12.48 | 19.77 | 400.15 | 0.67 | 0.65 | ||
| Embryo-Down | Classic (PLSR) | HSV | 16.01 | 26.24 | 688.76 | 0.41 | 0.77 |
| LAB | 16.45 | 25.84 | 667.61 | 0.43 | 0.85 | ||
| RGB | 17.77 | 27.27 | 743.50 | 0.36 | 1.01 | ||
| NIR | 21.29 | 29.16 | 850.16 | 0.27 | 1.57 | ||
| RGB+HSV+LAB+NIR | 17.03 | 24.27 | 589.00 | 0.49 | 1.08 | ||
| Mean | 17.71 | 26.56 | 707.81 | 0.39 | 1.05 | ||
| The Best (Ensemble) | HSV | 9.24 | 16.86 | 284.18 | 0.76 | 0.35 | |
| LAB | 9.67 | 17.92 | 321.13 | 0.72 | 0.39 | ||
| RGB | 11.45 | 18.75 | 351.74 | 0.70 | 0.54 | ||
| NIR | 15.23 | 21.88 | 478.57 | 0.59 | 0.98 | ||
| RGB+HSV+LAB+NIR | 10.25 | 16.60 | 275.51 | 0.76 | 0.50 | ||
| Mean | 11.17 | 18.40 | 342.22 | 0.71 | 0.55 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Songur, U.; Fidan, S.; Alaca Yıldırım, E.; Kahrıman, F.; Tiryaki, A.M. Prediction of Total Anthocyanin Content in Single-Kernel Maize Using Spectral and Color Space Data Coupled with AutoML. Sensors 2026, 26, 805. https://doi.org/10.3390/s26030805
Songur U, Fidan S, Alaca Yıldırım E, Kahrıman F, Tiryaki AM. Prediction of Total Anthocyanin Content in Single-Kernel Maize Using Spectral and Color Space Data Coupled with AutoML. Sensors. 2026; 26(3):805. https://doi.org/10.3390/s26030805
Chicago/Turabian StyleSongur, Umut, Sertuğ Fidan, Ezgi Alaca Yıldırım, Fatih Kahrıman, and Ali Murat Tiryaki. 2026. "Prediction of Total Anthocyanin Content in Single-Kernel Maize Using Spectral and Color Space Data Coupled with AutoML" Sensors 26, no. 3: 805. https://doi.org/10.3390/s26030805
APA StyleSongur, U., Fidan, S., Alaca Yıldırım, E., Kahrıman, F., & Tiryaki, A. M. (2026). Prediction of Total Anthocyanin Content in Single-Kernel Maize Using Spectral and Color Space Data Coupled with AutoML. Sensors, 26(3), 805. https://doi.org/10.3390/s26030805









































