Next Article in Journal
Impact of Food Waste on Society, Specifically at Retail and Foodservice Levels in Developed and Developing Countries
Next Article in Special Issue
Assessment of ‘Golden Delicious’ Apples Using an Electronic Nose and Machine Learning to Determine Ripening Stages
Previous Article in Journal
Evaluation of Simultaneous Growth of Escherichia coli O157:H7, Salmonella spp., and Listeria monocytogenes in Ground Beef Samples in Different Growth Media
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Detection of Anthocyanins in Potatoes Using Micro-Hyperspectral Images Based on Convolutional Neural Networks

1
College of Mechanical and Electrical Engineering, Inner Mongolia Agricultural University, No.306 Zhaowuda Road, Hohhot 010010, China
2
Inner Mongolia Engineering Research Center of Intelligent Equipment for the Entire Process of Forage and Feed Production, Hohhot 010018, China
*
Author to whom correspondence should be addressed.
Foods 2024, 13(13), 2096; https://doi.org/10.3390/foods13132096
Submission received: 29 May 2024 / Revised: 21 June 2024 / Accepted: 25 June 2024 / Published: 1 July 2024
(This article belongs to the Special Issue Sensors for Food Safety and Quality Assessment (2nd Edition))

Abstract

:
The color potato has the function of both a food and vegetable. The color potato not only contains various amino acids and trace elements needed by the human body but also contains anthocyanins. Anthocyanins have many functions, such as antioxidation, inflammation inhibition, vision improvement, and cancer prevention, so colored potatoes are deeply loved by consumers and have good market prospects. However, at present, the detection of anthocyanin content in color potatoes mainly depends on chemical methods, which are time-consuming and laborious, so it is necessary to study a fast and accurate detection method. In this study, microscopic hyperspectral equipment was used to collect the spectral information of the outer skin and inner skin of potatoes. The original spectrum, pretreatment spectrum, and characteristic spectrum variables of the outer skin and inner skin were predicted by the convolution neural network (CNN) algorithm and partial least squares regression (PLS) algorithm, respectively, and the performance of the model was evaluated by the prediction set correlation coefficient (Rp), prediction set root mean square error (RMSEP), correction set correlation coefficient (Rc), correction set root mean square error (RMSEC), and residual prediction deviation (RPD). The results revealed that the inner skin Raw + CNN model constructed under raw spectral data is optimal with Rc = 0.9508, RMSEC = 0.0374%, Rp = 0.9461, RMSEP = 0.2361% and RPD = 4.4933. The inner skin Savitzky-Golay (SG) + Detrend (DET) + CNN model constructed from pre-processed spectral data is optimal with Rc = 0.9499, RMSEC = 0.0359%, Rp = 0.9439, RMSEP = 0.2384%, RPD = 4.6516. The inner skin DET + competitive adaptive reweighted sampling (CARS) +CNN model constructed from the feature-based spectral data was optimal with Rc = 0.9527, RMSEC = 0.0708%, Rp = 0.9457, RMSEP = 0.2711%, and RPD = 4.1623. It can be seen that the Rp, RMSEP, Rc, RMSEC, and RPD values for modeling the spectral information of the inner skin were higher than those of the outer skin under the three different spectral data. The prediction accuracy of the model built by the CNN algorithm was better than the conventional algorithm PLS, the application of the CNN algorithm in inner skin can achieve accurate prediction of anthocyanin content in potato.

1. Introduction

Colorful potatoes are rich in flavonoids with antioxidant properties such as anthocyanins, which have bright colors and are responsible for the color of fruits, vegetables, flowers, and plants [1]. Studies have shown that anthocyanins not only impart bright colors to plants but also resist ultraviolet rays, diseases, insect pests, and low temperatures [2,3,4]. In terms of nutrition and health care, anthocyanins are natural and powerful free radical scavengers [5] that have various physiological functions such as reducing the growth rate of cancer cells, regulating blood sugar, and enhancing vision as well as anti-aging and anti-tumor properties. Therefore, eating colored potatoes is beneficial to human health [6,7,8]. However, there are few studies on the detection of anthocyanins in potatoes at present. Currently, research on potato anthocyanins has mainly focused on their function, accumulation mode, characteristics, and content using stoichiometric methods. For the detection of anthocyanins, the pH-differential method and different high-performance liquid chromatography (HPLC) methods in combination with a photodiode array detector or mass spectrometry (MS) are most frequently used in the food industry as well as in research [9]. However, the detection process is complicated and redundant, which significantly affects the accurate detection and quality classification of potato anthocyanins.
Some scholars have also conducted some research on the rapid detection of anthocyanins and have made some achievements. Huck C W et al. [9] used near-infrared spectroscopy to quickly detect the total anthocyanin content in Sambucus fruits, used a partial least squares regression (PLSR) model to predict, and found that using near-infrared spectroscopy is a reliable detection method that can realize the prediction of anthocyanin content. U. S. Dinish [10] used a new custom portable handheld Vis-NIR spectrometer that collected the reflectance spectra of red and green lettuce leaves, wirelessly transmitted the spectral data via Bluetooth, and provided the raw spectral data and processed information, after which he predicted the anthocyanin content of red and green lettuce and found a correlation coefficient of 0.84. It can be seen that the handheld spectrometer can accurately detect the content of anthocyanins in plants. However, these studies are all point measurement methods, and the detection area is small.
Only a small area near the probe can be detected, and the overall information of the tested sample cannot be obtained.
Microscopic hyperspectral imaging technology is a combination of atlas technology, integrating microscopic imaging and spectral information, which can not only obtain the microscopic cell morphology information of substances but also the corresponding spectral information. Currently, it is mainly used in the medical field and rarely in agriculture. It can enlarge information that is invisible to the naked eye to the required multiple under a microscope. By directly projecting the beam to the target area and narrowing the measurement range, spectral and spatial information of tissue slices can be obtained [11,12,13], thereby fundamentally improving the detection accuracy. Minghua [14] used microscopic hyperspectral imaging technology to detect POD, CAT, and SOD activity indices of tomato leaves under salt stress and integrated them with spectral image information to build a quantitative model of antioxidant enzymes in tomato leaves under salt stress, realizing the spectral stripping and visual distribution of single antioxidant enzymes in cells. Xuan et al. [15] studied the detection of chlamydia spores in contaminated soil, obtained the microscopic hyperspectral image of perispora chlamydia in contaminated soil, and established the detection model. Jianhong [16] studied the changes of antioxidant active molecules and malondialdehyde (MDA) in mutton at different storage periods at 4 °C and combined with 400–1000 nm micro-hyperspectral technology, conducted rapid detection and analysis of total superoxide dismutase (T-SOD) activity, catalase (CAT) activity and malondialdehyde content. The feasibility of simultaneous determination of T-SOD, CAT, and MDA in mutton by micro-hyperspectral imaging was verified by Jinping [17]. Relying on the early detection method involving the use of micro-high spectral imaging technology to detect mushroom wolf molds, a combination of the BS-NET-FC band selection algorithm and MTCEM to high-spectrum image detection thickness of the thick spore micro-spectrum image was proposed. Great spores reduce redundant band images while effectively detecting the spore target. Lu [18] used the spectral information of microscopic hyperspectral imaging technology to conduct rapid non-destructive testing research on soluble protein content, glutathione (GSH) content, peroxidase (POD) activity and catalase (CAT) activity of mutton and established models, and these models generated good results. Huan [19] used visible micro-hyperspectral technology to detect the activity of superoxide dismutase (SOD) in mutton muscle cells. The above-mentioned studies confirm that micro-hyperspectroscopy is feasible for internal quality detection.
The traditional regression detection algorithm used currently in the prediction process of potato internal components is not suitable for the detection of large sample sizes, whereas the deep learning algorithm can achieve linear and nonlinear calculations through convolution and activation functions to extract the linear and nonlinear features of samples. In addition, the deep learning model is highly robust and can withstand the continuous detection of a large sample size. Therefore, deep learning has gradually become widely used in the regression analysis of spectral data [20,21,22]. Convolutional neural networks (CNN), as classic deep learning neural networks, have also been successfully applied to one-, two-, and three-dimensional data analysis [23,24,25,26,27].
Therefore, a convolutional neural network model based on deep learning of an artificial intelligence algorithm was used in this study to predict anthocyanin content. The specific research objectives were as follows:
(1)
To obtain spectral information on the microstructures of different parts of potatoes.
(2)
To build a model based on the CNN, conforming to the characteristics of the spectral data.
(3)
To construct a convolutional neural network and partial least regression prediction model of potato anthocyanins based on the original spectrum, pre-process the spectrum and characteristic spectrum variables and obtain the optimal prediction model through comparative analysis.
At present, no scholars have used micro-hyperspectral equipment and deep learning algorithms to detect anthocyanin content in potatoes. This study provides a reference for future research in related industries.

2. Materials and Methods

2.1. Preparation of Experimental Samples

Potatoes were purchased from a farmers’ market in Hohhot, Inner Mongolia Autonomous Region. Fresh round or oval potatoes, which are intact and with no decay or mechanical damage, little difference in shape, and belonging to red and black diamond varieties were selected. The potatoes were cleaned using water before the experiment and placed at the laboratory temperature for 8 h. In this study, freehand sections were selected, and rectangular blocks of 8 mm × 6 mm × 15 mm of uniform size were cut with a single knife blade from the outer skin and inner skin of potatoes, respectively. They were cut into uniform slices with a knife blade and then rinsed briefly under distilled water using tweezers. After rinsing, the surface moisture was absorbed with absorbent paper and placed on the slide, and the cover slide was used for simple sealing. The outer skin and inner skin of the potatoes constituted 576 samples, respectively.

2.2. Hyperspectral Image Acquisition

2.2.1. Microscopic Hyperspectral Imaging System

In this study, a microscopic hyperspectral system was established by Wuling optics (Taiwan Province, China), as shown in Figure 1, which was composed of an optical microsystem, a spectral scanning imaging instrument, and a data acquisition system. An optical microscopy system primarily comprises a microscope and an electric carrier table. Multiples of 5, 10, 20, 50, and 100 times were obtainable for the objective mirrors, and a multiple of 10 was available for the eyepiece.
The interior of the spectral scanning imager is mainly composed of acousto-optic tunable filters (AOTF), their drivers, and CCD imaging equipment. Among them, the spectral scanner has a spectral range of 365–1025 nm, a total of 616 spectra, and a spectral resolution of 2.8 nm. The camera resolution of the imaging device is 1.4 million pixels, and the slit size is 30 µm.

2.2.2. Micro-Hyperspectral Data Acquisition

Sample collection process:
(1)
Before collecting the micro-high-resolution spectral images, the instrument and equipment were maintained open for 30 min to ensure that the light source irradiation intensity was stable. This study used the transmitted light source.
(2)
The potato slices were fixed on a microscope carrier table.
(3)
The image collection software Hyperspec was opened; the carrier table was initialized, and preliminary focusing was attained; the eye mirror adjustment focal length was observed to make the image clear; the image of the image was avoided; and the strength of the light source was adjusted until the sample reached a reasonable light source exposure value.
(4)
The collection parameters were set in Hyperspec software. The specific parameters are listed in Table 1.
(5)
The software was then executed, and the instrument was scanned until the image collection was completed. This process was repeated to collect all samples.

2.2.3. Micro-Hyperspectral Image Correction

Owing to the inherent instability of the light source of the hyperspectral imaging system and the influence of external noise and the transmission process operation, the acquired hyperspectral images contain noise in certain bands. Therefore, before acquiring the micro-hyperspectral images, a lens cover and standard whiteboard were used to obtain all white and black images, respectively. Subsequently, in the acquisition software of the imaging system, the original image is corrected to black and white according to Equation (1) [19]:
C = R B W B × 100 %
where C is the corrected image, R the original image, B the black reference image obtained by completely covering the camera lens with a lens cap (approximately 0% reflectivity), and W the white-calibrated image (approximately 100% reflectivity).

2.3. Experimental Methods

2.3.1. Determination of Anthocyanin Content

The measurement of anthocyanins in potatoes was based on the Chinese agricultural industry standard “NY/T 2640-2014 Plant-Dedicated Flower Circin Primrose Method” [28]. The main principle was to extract anthocyanins from potatoes using ethanol and water and hydrolyze anthocyanins into anthocyanins by boiling water bath. The anthocyanins were determined using high-performance liquid chromatography, the retention time was qualitative, and the external standard method was quantitative.

2.3.2. Extraction of Microscopic Hyperspectral Data

The corrected potato micro-hyperspectral images were imported into ENVI 5.3 software (ITT Visual Information Solutions, Boulder, CO, USA), and a Region of interest (ROI) area of 100 × 100 was selected on the complete cell structure with color using the square tool. The average spectral transmittance of the ROI was extracted from the spectral data of each sample. Using this method, the spectral information of the samples was successively extracted according to the sequence of labels, and the spectral data matrix of the outer and inner skin was established as 576 × 616 pixels.

2.3.3. Micro-Hyperspectral Data Pre-Processing

Owing to the existence of mechanical noise and baseline drift in the original spectrum, it was necessary to conduct pre-processing to eliminate unnecessary information. In this study, the established data matrix was pre-processed, and the spectral pre-processing algorithms adopted were the standard normal variable transformation (SNV), Detrending method (DET), convolution smoothing (Savitzk-Golay), and their combinations, SG-SNV, SG-DET, and SNV-DET. SNV is the most widely used method in spectral data pre-processing, which is conducive to correcting changes in path length and spectral intensity and eliminating the interference of the nonlinear light scattering effect [29]. The SG can improve the smoothness of the spectrum and reduce noise interference. DET is a polynomial baseline correction method, which is used to eliminate baseline offset and curvature in spectral signals. It highlights the absorption peak of the spectral curve by subtracting an optimal trend line fitted by a polynomial from the original spectral curve [30]. DET has great advantages in eliminating the interference of spectral data by multicollinearity, baseline drift, and curvature. In this study, spectral data were pre-processed using the Unscrambler X 10.1 (CAMO AS, Oslo, Norway) software.

2.3.4. Feature Wavelength Selection

High-spectrum data contain hundreds of thousands of continuous wavelengths with redundant and multi-common linearity. In this study, improved competitive adaptive weighted (CARS) and continuous projection algorithms (SPA) were used to eliminate redundant wavelengths and select the variables for the best performance.
The competitive adaptive reweighting algorithm (CARS) is an algorithm that Li et al. simplified and improved on the original CARS algorithm [31]. This method is based on Darwin’s evolutionary theory. In sample selection, adaptive reweighting sampling technology is used to select wavelength points with large absolute values of regression coefficients in the PLS model. Wavelength points with small weights are eliminated, and the subset with the smallest root-mean-square error RMSECV is selected by means of cross-validation, which is the optimal variable subset. In this study, 50 Monte Carlo samples and 10 cross-validations were used.
The SPA method is a forward variable-selection method that uses a simple projection operation to obtain a collinear minimum subset of variables. Therefore, the characteristic wavelength is extracted from the entire band, most of the redundant information in the original spectral matrix is eliminated, and the modeling conditions are improved. The basic principle of SPA is to project a set of wavelength subsets into a vector space and select the wavelength subset with the least redundancy [32]. The number of characteristic wavelengths was set in advance, and the parameters for the minimum and maximum numbers of variables selected in the SPA program were 1 and 30, respectively.

2.3.5. Establishment of Regression Prediction Model

To compare and analyze the differences between the traditional regression algorithm and the popular deep learning algorithm in spectral information processing, the traditional linear partial least squares regression algorithm and convolutional neural network algorithm were selected in this study.

Partial Least Squares Regression Algorithm

Partial least squares regression (PLSR) [33,34] is the most widely used linear regression algorithm at present. Owing to its modeling principles and structure, this model is often the first choice for building predictive models. It has the advantage of considering both matrices X (spectral data) and Y (starch content). This can solve the problem of having a large number of variables or collinear variables in the original data. Partial least squares regression analysis was used to convert the original data into a limited number of independent latent variables (LVs). The optimal number of potential variables was determined by minimizing the sum of the squares of error (RMSE) to prevent model over-fitting or under-fitting. This is typically performed through cross-validation. In this study, the number of LVs was optimized using a 10-fold cross-validation method, the maximum number of LVs was set to 15, and the LVs value with a smaller error was selected according to the modeling effect. The ratio of the PLS model training set and test set was 2:1.

Convolutional Neural Network Regression Algorithm

As the convolutional neural network model can use a hierarchical structure, it has the characteristics of translation and other changes [35,36]. Thus, even if the appearance of the data changes, the original information of the data will not be lost, and effective information can still be identified and extracted. The convolutional neural network also has the feature of parameter sharing. The convolutional kernel moves in the same layer by means of translation and shares a set of convolutional kernel parameters when feature extraction is performed at each position. Therefore, the convolutional neural network significantly reduces the number of parameters, accelerates the learning rate, and prevents overfitting.
Combined with the characteristics of spectral data, this study proposes to further optimize the design of the basic convolutional neural network.
  • Combined with the spectral data and several experiments, three convolution layers were built, and the number of convolution nuclei in each layer was 16 and the sample ratio of the model training set and the test set is 4:1.
  • This study proposes introducing batch normalization (BN) after the convolutional layer, standardizing the data after convolution, and then inputting it into the next layer [37], which can significantly simplify the data after the convolutional layer and improve the speed after extracting useful information.
  • In this study, the rectified linear unit (ReLU) activation function was introduced after normalization.

Data Processing Software

The pre-processing method in this study used the Unscrambler software, and the PLS algorithm was implemented in MATLAB (R2019). The CNN program in this study is executed on the computer with the following specifications: Windows 10 system, Intel(R) Core (TM) i7-9700 CPU, 3.00 GHz, and memory 32 GB (Intel, Santa Clara, CA, USA). The CNN algorithm program is based on the Python language, using the PyTorch (Professional 2021) development environment. The learning rate was set to 0.001, and the number of training rounds was 100. The training hardware was an NVIDIA 2070 GPU for parallel computing.

3. Results and Discussion

3.1. Characteristic Analysis of Spectral Information

In this study, hyperspectral image information under the microscope of the outer and inner skin of two kinds of potatoes, Red Mei and Black Jingang, was collected, as shown in Figure 2.
As shown in Figure 3, the spectral information of the outer and inner skin shows that the trend of the raw spectral information of the two varieties is consistent, indicating that the spectral information under the microscope is not significantly affected by the variety. Further, by comparing the outer skin and the inner skin part, it was also found that the spectral trend of the inner and outer skin was consistent, which also showed that the spectral difference between the inner skin part and the outer skin was not large; however, whether there was a difference after the final application of algorithm modeling needs to be further studied.
Visible/near-infrared spectroscopy is based on the vibration of molecular bonds such as C-H, O-H, and N-H. Anthocyanins are flavonoid compounds with a C-skeleton of C6-C3-C6 as the basic structure, containing C-H, O-H, C-O, and other chemical bonds [38]. Therefore, visible/near-infrared spectroscopy can be used to predict anthocyanin content [39]. It can also be observed from the original spectrum in Figure 3 that there are evident trappings in the spectrum at approximately 530, 700, and 850 nm. The main pigment information was observed at approximately 530 and 700 nm, and anthocyanins were soluble pigments, which is consistent with previous studies [40,41]. The absorption peak at approximately 850 nm was due to the third overtone of C-H, which represents the absorption band of glucose and is related to the hydrocarbon group of C-H [42,43].
It can be observed from the raw spectral data in Figure 3 that there is a certain raw edge, that is, noise, in the spectral information. Hence, it is necessary to pre-process the original data. SG smoothing, SNV, DET, SG + SNV, SG + DET, and SNV + DET were used to pre-process the original spectra of the inner skin part and the outer skin part, respectively, and the results are shown in Figure 4 and Figure 5. It can also be observed from the image that SG smoothing cannot generate evident changes from the original spectrum that are distinguishable with the naked eye. However, the spectral data value has changed, the other pre-processing methods generate clear changes, and the spectrum becomes more clustered and has fewer burrs.

3.2. Characteristic Wavelenght Extraction Analysis

In this study, feature wavelength extraction was performed on pre-processed data using CARS and SPA. The partial least squares regression and convolutional neural network algorithms were used to predict the internal anthocyanin content of potatoes in the spectra after feature extraction, and the results of the two algorithms were compared and analyzed. The characteristic wavelength variables extracted from the outer skin are shown in Table 2, and the characteristic wavelength variables extracted from the inner skin are shown in Table 3.
As shown in Table 2 and Table 3 for potato skin, the SG + SPA and SNV + DET + SPA algorithms extracted the least number of characteristic variables (12). Compared with the original spectral data, the number of input spectral variables was reduced by 98.05%. In the potato endothelium, the SNV + DET + SPA algorithm extracted the least number of characteristic variables (11), and the input spectral variables were reduced by 98.2% compared with the original spectral data. Studies have shown that the SPA algorithm extracts relatively few variables in the feature extraction of skin and inner skin parts, and this algorithm is more dominant in feature variable extraction because it identifies the least useful data, simplifies the input of prediction model data, avoids the input of redundant data, and plays a significant role in reducing the algorithm’s operation time.

3.3. Different Algorithms Predict Anthocyanin Content

3.3.1. PLS Method Was Used to Predict Anthocyanin Content

In this study, the PLS method was used to predict the anthocyanin content in the original spectrum, pretreatment spectrum, and characteristic wavelength variable spectrum of two types of red, black, and gold potatoes. The results are presented in Table 4 and Table 5.
It can be observed from Table 4 that the PLS anthocyanin predictive model was established based on the original spectrum, RC = 0.7874, RMSEC = 0.429%, RPD = 2.30, and RPD value greater than 2, indicating that this model can predict the content of anthocyanins. It can also be noted that the PLS model based on the original spectrum-based construction is better than using some pre-processing and feature wavelength variables. Pre-processing and feature wavelength extraction methods cannot accurately remove and extract the main variables from the original spectral data. However, there are too many models based on the original spectrum construction, and redundant information exists between the spectrum bands. The performance of the PLS predictive model built using the original spectrum was low. Therefore, it is necessary to find the best pre-processing and feature variable method to reduce complexity and simplify the model.
According to the results in Table 4, for the PLS anthocyanin prediction model built based on the pretreatment method, the model with the best effect was SNV + PLS (Rp = 0.7496, RMSEP = 0.435%, Rc = 0.7439, RMSEC = 0.449%, RPD = 2.25, and RPD value greater than 2). This model can be used to predict potato anthocyanins quantitatively. The optimal PLS anthocyanin prediction model based on the characteristic wavelength was DET + CARS + PLS, with Rp = 0.8593, RMSEP = 0.37%, Rc = 0.8158, RMSEC = 0.334%, RPD = 2.64, and an RPD value greater than 2.5. This shows that the model can predict the anthocyanin content in a stable and accurate manner.
Table 5 presents the results of the PLS model constructed for inner skin cells. The results showed that the PLS anthocyanin prediction model established based on the original spectral information was ideal (RPD = 2.45), indicating that the model can accurately predict anthocyanins. However, owing to excessive input data and the addition of redundant information between the spectral data, the operational efficiency of the model is low. Therefore, it is necessary to optimize the pre-processing and feature wavelength extraction algorithms to reduce the complexity and simplify the model.
As shown in Table 5, the optimal PLS anthocyanin prediction model built based on the pretreatment method was SNV + PLS, with Rp = 0.839, RMSEP = 0.395%, Rc = 0.7707, RMSEC = 0.446%, and RPD = 2.50. According to the model evaluation criteria, this model could completely predict the anthocyanin content of potatoes. The optimal PLS anthocyanin prediction model based on the characteristic wavelength was the DET + CARS + PLS model, with Rp = 0.9287, RMSEP = 0.325%, Rc = 0.9227, RMSEC = 0.329%, RPD = 3.03, and an RPD value greater than 2.5. This shows that this model has strong robustness and stability and can accurately predict anthocyanins.
From Table 4 and Table 5, it can be observed that SNV + PLS is the ideal model for pre-processing, DET + CARS + PLS is the optimal model for feature variables, and DET + CARS + PLS is the best model among all models, indicating that the DET pre-processing algorithm can effectively reduce noise and remove unwanted information. The CARS algorithm can extract more effective feature variables, and the PLS algorithm can accurately predict the anthocyanin content in potatoes.
It can be noted from the results in Table 4 and Table 5 that under the premise of using the same algorithm, the models established for the outer and inner skin have different effects, indicating that the sampling location still affects the prediction accuracy. Further, the prediction accuracy of the models constructed for the inner skin was better than that of the outer skin, indicating that the spectrum of the inner skin is most closely related to the anthocyanin content. If a conventional algorithm is used to predict the anthocyanin content of potatoes, then the endothelium is the preferred location.

3.3.2. Prediction of Anthocyanin Content by CNN Method

In this study, a CNN prediction model for anthocyanin content in potatoes was constructed based on the original full spectrum, pretreatment spectrum, and characteristic wavelength variable spectrum of potato skin and inner skin cells. The dataset was randomly divided into a training set (80%) and a test set (20%). The training set is used to train the model so that the model can learn features, and the test set is used to check the model effect.
The results of the CNN model constructed on the outer skin are listed in Table 6.
As shown in Table 6, the CNN model was built based on the original full spectrum, Rc = 0.9486, RMSEC = 0.0312%, Rp = 0.9446, RMSEP = 0.2291%, and RPD = 4.41, and the RPD value was much higher than 2.5, indicating that the CNN model built based on the original full spectrum was very robust. It can accurately predict anthocyanin levels.
SG, SNV, DET, and their combinations were used to preprocess the original spectrum of the outer skin to establish a CNN prediction model. The DET + CNN model had the best accuracy, with Rc = 0.9499, RMSEC = 0.0235%, Rp = 0.9457, and RMSEP = 0.2234%. RPD = 4.4893, and the RPD value of the model was much higher than 2.5, indicating that this model could accurately predict the level of anthocyanins.
CARS and SPA feature extraction algorithms were used to extract the feature wavelength variables. The DET + CARS + CNN model exhibited the best prediction accuracy: Rc = 0.9524, RMSEC = 0.0657%, Rp = 0.9468, RMSEP = 0.2651%, and RPD = 4.0942. The RPD value of the model is still greater than 2.5, and this model can achieve accurate prediction of anthocyanins. As shown in the above table, the accuracy of the CNN model built based on SPA feature extraction is relatively general. The feature information extracted by SPA and input spectral data are limited; therefore, the CNN model learns less effective information during the training process, whereas the characteristic wavelength information variables of anthocyanins extracted by SPA in this study are not sufficiently accurate resulting in average model accuracy.
The training and testing processes of the Raw + CNN, DET + CNN, and DET + CARS + CNN models are shown in Figure 6.
The specific results of the CNN model constructed for inner skin cells are listed in Table 7.
As shown in Table 7, the Raw + CNN model built based on the original full spectrum had a high prediction accuracy: Rc = 0.9508, RMSEC = 0.0374%, Rp = 0.9461, RMSEP = 0.2361%, and RPD = 4.4933. The RPD value of this model was very high. This indicates that the anthocyanin content can be predicted accurately.
SG, SNV, DET, and their combinations were used to pretreat the original spectrum of the endothelium. The SG + DET + CNN model exhibited the best prediction performance, with Rc = 0.9499, RMSEC = 0.0359%, Rp = 0.9439, RMSEP = 0.2384%, and RPD = 4.6516. The RPD value of this model was also very high, indicating that it is very robust and can accurately predict anthocyanin content.
The CARS and SPA algorithms were used to extract feature wavelength variables. The DET + CARS + CNN model exhibited the best prediction performance: Rc = 0.9527, RMSEC = 0.0708%, Rp = 0.9457, RMSEP = 0.2711%, and RPD = 4.1623. The RPD value of this model is also very high. However, the prediction accuracy of this model was worse than that of the Raw + CNN and SG + DET + CNN models, indicating that the extracted feature variable information was incomplete and inaccurate. The research also found that the prediction accuracy of the CNN model based on SPA is worse than that of the CNN model based on CARS, owing to the small input variable information of SPA and the small amount of feature information learned in the training process of the convolutional neural network. In addition, the characteristic variable information extracted by the SPA in this study was not accurate, which led to a decline in the model prediction accuracy.
In the CNN model constructed by the inner skin part, the training and testing processes of Raw + CNN, SG + DET + CNN, DET + CARS + CNN are shown in Figure 7.
By comparing Table 6 and Table 7, the CNN prediction model of potato anthocyanin content built using different pretreatment methods and feature extraction algorithms for both the skin and endothelium showed little change in the correlation coefficient and root-mean-square error of the model. Compared with the CNN prediction model based on pre-processing and feature extraction algorithms, the CNN model based on the original full spectrum has a better effect, which is basically consistent with the conclusions of previous studies. Zhang Xiaolei [23] used a convolutional neural network to carry out end-to-end qualitative analysis of the spectral data of grape varieties and found that the classification and recognition effect without pretreatment was better. At the same time, the protein content of corn, the active substance content of tablets, the protein content of wheat, and the organic carbon content of soil are also quantitatively analyzed, and it is concluded that the convolutional neural network can learn from the original data without pretreatment, thus improving the accuracy.
LeCun Y [23] uses traditional algorithms and deep learning convolutional neural network algorithms to identify the feature information in the image. It is found that convolutional neural network algorithms realize linear and nonlinear expression, and identify image features adaptively under the action of different network layers and inverse functions, which reduces the pre-processing links of conventional algorithms.
Xin Wang [44] proposed an end-to-end deep learning method called the residual spectrum, which combined residual modules to learn features from original data to improve model performance and compared this algorithm with the classical convolutional neural network. The results show that the residual spectrum is better than other CNN models and traditional machine learning models in the original data.
To a certain extent, deep learning has the ability to automatically extract feature information, and extracting spectral features does not require manual pretreatment. This also shows that the CNN algorithm can simplify several operation processes of traditional machine learning algorithms and effectively improve the efficiency of model prediction.
In addition, the results showed that the accuracy of the CNN prediction model established by the inner skin part was better than that established by the outer skin part, which is consistent with the prediction model established by the previous PLS algorithm.

4. Conclusions

Based on a basic convolutional neural network, this study proposes the use of normalization and activation functions to build a network framework for processing spectral data information. The optimized CNN network structure and traditional machine learning PLS algorithm were used to construct potato internal anthocyanin level prediction models, and the best algorithm was selected for comparative analysis.
CNN and PLS models for anthocyanin content prediction were constructed based on the original spectrum. Rp = 0.7765, RMSEP = 0.426%, Rc = 0.7874, RMSEC = 0.429%, RPD = 2.30 for the Raw + PLS prediction model constructed in the outer skin. Rp = 0.8429, RMSEP = 0.403%, Rc = 0.8159, RMSEC = 0.422%, RPD = 2.45 for the Raw + PLS prediction model constructed in the inner skin part. Rc = 0.9486, RMSEC = 0.0312%, Rp = 0.9446, RMSEP = 0.2291%, RPD = 4.41 for the Raw + CNN prediction model constructed in the outer skin. Rc = 0.9508, RMSEC = 0.0374%, Rp = 0.9461, RMSEP = 0.2361%, and RPD = 4.4933 for the Raw + CNN prediction models constructed for the inner skin part. The results showed that the accuracy of the prediction model established by the CNN algorithm was better than that of the conventional machine learning algorithm PLS model, and the prediction accuracy of the model established using the spectral information of the endothelium was better than that of the model established using the spectral information of the outer skin.
SG, SNV, SET, SG + SNV, SG + DET, SNV + DET, and other algorithms were used for the pre-processing of the original spectrum. After pre-processing, the PLS prediction model and CNN prediction models for anthocyanin content were constructed. The specific results are as follows: for the prediction model built based on PLS, the SNV + PLS algorithm was the best model for the potato skin (Rp = 0.7496, RMSEP = 0.435%, Rc = 0.7439, RMSEC = 0.449%, and RPD = 2.25). For the inner skin part, the SNV + PLS algorithm established the best model, with Rp = 0.839, RMSEP = 0.395%, Rc = 0.7707, RMSEC = 0.446%, and RPD = 2.50. In the model constructed based on the CNN, the DET + CNN model was the best for the potato skin, with Rc = 0.9499, RMSEC = 0.0235%, Rp = 0.9457, RMSEP = 0.2234%, and RPD = 4.4893; in the inner skin part, the SG + DET + CNN model was the best, with Rc = 0.9499 and RPD = 4.4893, RMSEC = 0.0359%, Rp = 0.9439, RMSEP = 0.2384%, and RPD = 4.6516. From the above data, it can be concluded that the CNN algorithm established using deep learning is superior to the conventional PLS algorithm and it shows that the prediction model established by the inner skin part is better than that by the outer skin part of the potato.
For the pre-processed spectral data, the CARS and SPA algorithms were used to select the variables most related to the anthocyanin content, and then the PLS and CNN prediction models were established. The specific results were as follows: for the model built based on PLS, DET + CARS + PLS was the best model for the skin (Rp = 0.8593, RMSEP = 0.37%, Rc = 0.8158, RMSEC = 0.334%, and RPD = 2.64). For inner skin cells, DET + CARS + PLS was the best model (Rp = 0.9287, RMSEP = 0.325%, Rc = 0.9227, RMSEC = 0.329%, and RPD = 3.03). In the model constructed based on the CNN, DET + CARS + CNN was the best model for the skin (Rc = 0.9524, RMSEC = 0.0657%, Rp = 0.9468, RMSEP = 0.2651%, RPD = 4.0942), and DET + CARS + CNN was the best model (Rc = 0.9527, RMSEC = 0.0708%, Rp = 0.9457, RMSEP = 0.2711%, and RPD = 4.1623). According to the above data, the CNN neural network algorithm is more ideal than the PLS algorithm for the model established by the feature variable extraction method, and the model built in the inner skin part is better than the model built in the outer skin part.
According to the data, the models established based on the feature variables were worse than those established based on the original spectrum and pre-processing. This was because the data extracted from the feature variables contained less information than the original and pre-processed spectral information. Compared with the CNN algorithm, the models constructed using pre-processing and feature variable extraction showed little difference in correlation coefficients and RMS errors. These results provide meaningful directions for building deep learning models using spectral information in the future. The results also showed that the sampling position affected the prediction accuracy of the model, providing a reference for improving the prediction effect of the model in the future.

Author Contributions

Methodology and writing—original draft preparation, F.W.; data curation, Q.L.; methodology, C.W.; funding acquisition, W.D.; project administration, L.H. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by National Natural Science Foundation of China, grant number 32060415. Inner Mongolia double first-class construction funds, grant number (RK2300003375). Program for improving the Scientific Research Ability of Youth Teachers of Inner Mongolia Agricultural University (BR220127). The central government guides the local science and technology development plan, grant number 2022ZY0205.

Data Availability Statement

The data presented in this study are available on request from the corresponding author.

Acknowledgments

The authors would like to thank the technical editor and anonymous reviewers for their constructive comments and suggestions on this study.

Conflicts of Interest

The author declares no conflicts of interest.

References

  1. Yi, K.; Fei, G.; Tao, J.; Zheng, L. Research progress on effects of temperature on anthocyanin synthesis in plants. Chin. Agric. Sci. Bull. 2015, 31, 101–105. [Google Scholar]
  2. Shaochuan, S.; Yike, G.; Xiuhai, Z.; Wang, Y. Study and application of genes related to anthocyanin biosynthesis in plants. Plant Res. 2011, 31, 633–664. [Google Scholar]
  3. Karadeniz, F.; Burdurlu, H.S.; Koca, N.; Soyer, Y. Antioxidant activity of selected fruits and vegetables grown in Turkey. Turk. J. Agric. For. 2005, 89, 297–303. [Google Scholar]
  4. Zarzecka, K.; Gugala, M. The effect of herbicides and soil tillage systems on the content of polyphenols in potato tubers. Pol. J. Environ. Stud. 2011, 20, 513–517. [Google Scholar]
  5. Liming, H.; Dan, T. Research status and prospect of anthocyanins in “Black beauty” potato. J. Chin. Vet. Med. 2019, 39, 95–98. (In Chinese) [Google Scholar]
  6. Stintzing, F.C.; Carle, R. Functional properties of anthocyanins and betalains in plants, food, and in human nutrition. Trends Food Sci. Technol. 2004, 15, 19–38. [Google Scholar] [CrossRef]
  7. Dianxin, L.; Yuanwei, D.; Wei, C.; Liu, H. Research progress on the nutritional value and health function of Blueberry. Agric. Prod. Process. 2018, 47, 69–70. [Google Scholar]
  8. Hamouz, K.; Lachman, J.; Pazderu, K. Differences in anthocyanin content and antioxidant activity of potato tubers with different flesh colour. Plant Soil Environ. 2011, 57, 478–485. [Google Scholar] [CrossRef]
  9. Huck, C.W. Near-Infrared Spectroscopy as a Rapid Screening Method for the Determination of Total Anthocyanin Content in Sambucus Fructus. Sensors 2020, 20, 4983. [Google Scholar] [CrossRef]
  10. Dinish, U.S.; Teng, M.T.J.; Xinhui, V.T.; Dev, K.; Tan, J.J.; Koh, S.S.; Urano, D.; Olivo, M. Miniaturized Vis–NIR handheld spectrometer for non-invasive pigment quantification in agritech applications. Sci. Rep. 2023, 13, 9524. [Google Scholar] [CrossRef]
  11. Zhu, Y.; Zhang, J.; Li, M.; Zhao, L.; Ren, H.; Yan, L.; Zhao, G.; Zhu, C. Rapid determination of spore germinability of Clostridium perfringens based on microscopic hyperspectral imaging technology and chemometrics. J. Food Eng. 2020, 280, 109896. [Google Scholar] [CrossRef]
  12. Li, Q.L.; Xue, Y.Q.; Xiao, G.H. Study on microscope hyperspectral medical imaging method for biomedical quantitative analysis. Sci. Bull. 2008, 53, 1431–1434. [Google Scholar] [CrossRef]
  13. Monma, C.; Hatakeyama, K.; Obata, H. Four Foodborne Disease Outbreaks Caused by a New Type of Enterotoxin-Producing Clostridium perfringens. J. Clin. Microbiol. 2015, 53, 859–867. [Google Scholar] [CrossRef]
  14. Du, M.H. Detection of Antioxidant Oxidase Activity in Tomato Leaves Based on Microscopic Hyperspectral Imaging; Ningxia University: Ningxia, China, 2023. [Google Scholar]
  15. Wei, X.; Liu, Y.; Song, Q.; Zou, J.; Wen, Z.; Li, J.; Jie, D. Microscopic hyperspectral imaging and an improved detection model based detection of Mycogone perniciosa chlamydospore in soil. Eur. J. Agron. 2023, 152, 1–9. [Google Scholar] [CrossRef]
  16. Guo, J.H. Changes of Antioxidant Enzymes and their Molecular Hyperspectral Detection during Cold Storage of Tan Sheep Meat; Ningxia University: Ningxia, China, 2022. [Google Scholar]
  17. Jinping, Z.; Xuan, W. Target detection of thick-wall spores of Agaricus bisporus based on microscopic hyperspectral images. Fujian Agric. Sci. Technol. 2022, 53, 1–8. [Google Scholar]
  18. Lu, Q. Comparative Study on Rapid Detection of Antioxidants in Mutton by Hyperspectral Method with Different Spatial Resolution; Ningxia University: Ningxia, China, 2022. [Google Scholar]
  19. Huan, D.; Longguo, W.; Xiaoguang, H.; Songlei, W. Study on the method of detecting superoxide dismutase activity in muscle cells by microhyperspectral technique. Sci. Technol. Food Ind. 2019, 40, 225–230. [Google Scholar]
  20. Xiaoli, L.; Dongyi, Z.; Yulun, D.; Juanjuan, J.; Yong, H. Rapid spectral detection method of main inclusions in fresh tea leaves based on Convolutional neural network. J. China Agric. Univ. 2021, 26, 113–122. [Google Scholar]
  21. Ahn, D.H.; Choi, J.Y.; Kim, H.C. Estimating the Composition of Food Nutrients from Hyperspectral Signals Based on Deep Neural Networks. Sensors 2019, 19, 1560. [Google Scholar] [CrossRef]
  22. Yu, X.; Wang, J.; Wen, S. A deep learning based feature extraction method on hyperspectral images for nondestructive prediction of TVB-N content in Pacific white shrimp (Litopenaeus vannamei). Biosyst. Eng. 2019, 178, 244–255. [Google Scholar] [CrossRef]
  23. Xiao-Lei, Z. Convolutional Neural Network Spectral Analysis Method and Its Application in Quality Detection of Agricultural Products; Zhejiang University: Hangzhou, China, 2021. [Google Scholar]
  24. Segler, M.; Preuss, M.; Waller, M.P. Planning chemical syntheses with deep neural networks and symbolic AI. Nature 2018, 555, 604. [Google Scholar] [CrossRef]
  25. Silver, D.; Huang, A.; Maddison, C.J. Mastering the game of Go with deep neural networks and tree search. Nature 2016, 529, 484–489. [Google Scholar] [CrossRef]
  26. Chen, Y.; Wang, Z. Quantitative analysis modeling of infrared spectroscopy based on ensemble convolutional neural networks. Chemom. Intell. Lab. Syst. 2018, 181, 1–10. [Google Scholar]
  27. Ju, L.; Lyu, A.; Hao, H. Deep learning-assisted three-dimensional fluorescence difference spectroscopy for identification and semiquantification of illicit drugs in biofluids. Anal. Chem. 2019, 91, 9343–9347. [Google Scholar] [CrossRef] [PubMed]
  28. Nyman, N.A.; Kumpulainen, J.T. Determination of anthocyanins in plant-derived foods by high performance liquid chromatography. J. Agric. Food Chem. 2001, 49, 4183–4187. [Google Scholar] [CrossRef] [PubMed]
  29. Kamruzzaman, M.; Makino, Y.; Oshita, S. Hyperspectral imaging for real-time monitoring of water holding capacity in red meat. LWT—Food Sci. Technol. 2016, 66, 685–691. [Google Scholar] [CrossRef]
  30. Luypaert, J.; Zhang, M.H.; Massart, D.L. Feasibility study for the use of near infrared spectroscopy in the qualitative and quantitative analysis of green tea, Camellia sinensis (L.). Anal. Chim. Acta 2003, 478, 303–312. [Google Scholar] [CrossRef]
  31. Li, H.; Liang, Y.; Xu, Q.; Cao, D. Key wavelengths screening using competitive adaptive reweighted sampling method for multivariate calibration. Anal. Chim. Acta 2009, 648, 77–84. [Google Scholar] [CrossRef]
  32. Araujo, M.C.U.; Saldanha, T.C.B.; Galvao, R.K.H.; Yoneyama, T.; Chame, H.C.; Visani, V. The successive projections algorithm for variable selection in spectroscopic multicomponent analysis. Chemometr. Intell. Lab. 2001, 57, 65–73. [Google Scholar] [CrossRef]
  33. Kutsanedzie, F.Y.H.; Agyekum, A.A.; Annavaram, V.; Chen, Q.S. Signal-enhanced SERS sensors of CAR-PLS and GA-PLS coupled AgNPs for ochratoxin a and aflatoxin B1 detection. Food Chem. 2020, 315, 126231. [Google Scholar] [CrossRef]
  34. Xu, Y.; Kutsanedzie, F.Y.H.; Hassan, M.; Zhu, J.; Ahmad, W.; Li, H.H.; Chen, Q.S. Mesoporous silica supported orderly-spaced gold nanoparticles SERS-based sensor for pesticides detection in food. Food Chem. 2020, 315, 126300. [Google Scholar] [CrossRef]
  35. Jiao, L.; Zhang, F.; Liu, F. A Survey of Deep Learning-based Object Detection. IEEE Access 2019, 7, 128837–128868. [Google Scholar] [CrossRef]
  36. Lecun, Y.; Bottou, L. Gradient-based learning applied to document recognition. Proc. IEEE 1998, 86, 2278–2324. [Google Scholar] [CrossRef]
  37. Ioffe, S.; Szegedy, C. Batch normalization: Accelerating deep network training by reducing internal covariate shift. In Proceedings of the International Conference on Machine Learning PMLR, Lille, France, 7–9 July 2015; pp. 448–456. [Google Scholar]
  38. Wei, L.; Miaomiao, C.; Qiang, Z.; Shan, L. Rapid classification method of black wolfberry by hyperspectral and ensemble learning. Spectrosc. Spectr. Anal. 2021, 41, 2196–2204. (In Chinese) [Google Scholar]
  39. Xiong, Z.; Sun, D.W.; Xie, A. Quantitative determination of total pigments in red meats using hyperspectral imaging and multivariate analysis. Food Chem. 2015, 178, 339–345. [Google Scholar] [CrossRef] [PubMed]
  40. Haixia, Z.; Ye, Z. Comparison of anthocyanin contents in two kinds of purple potato. Agric. Process. 2020, 10, 52–53. [Google Scholar]
  41. Weidong, W.; Qingrui, C.; Yuna, W. Hyperspectral monitoring of relative content of anthocyanins in winter wheat leaves. J. Triticum Crops 2020, 40, 754–761. [Google Scholar]
  42. Rady, A.; Guyer, D.; Lu, R. Evaluation of Sugar Content of Potatoes using Hyperspectral Imaging. Food Bioprocess Technol. 2015, 8, 995–1010. [Google Scholar] [CrossRef]
  43. Chen, J.Y.; Zhang, H.; Yelian, M.; Mitsunaka, A. Nondestructive determination of sugar content in potato tubers using visible and near infrared spectroscopy. J. Food Eng. 2010, 11, 59–64. [Google Scholar] [CrossRef]
  44. Wang, X.; Yu, L.; Tian, S.; Lv, X.; Meng, X. End-to-end analysis modeling of vibrational spectroscopy based on deep learning approach. J. Chemom. 2020, 34, e3291. [Google Scholar] [CrossRef]
Figure 1. Micro-hyperspectral equipment.
Figure 1. Micro-hyperspectral equipment.
Foods 13 02096 g001
Figure 2. (a) Red Mei; (b) Black Jingang.
Figure 2. (a) Red Mei; (b) Black Jingang.
Foods 13 02096 g002
Figure 3. Raw spectrum: (a) Spectral information of the outer skin; (b) Spectral information of the inner skin.
Figure 3. Raw spectrum: (a) Spectral information of the outer skin; (b) Spectral information of the inner skin.
Foods 13 02096 g003
Figure 4. After the pre−processing of the outer skin.
Figure 4. After the pre−processing of the outer skin.
Foods 13 02096 g004
Figure 5. After the pre−processing of the inner skin.
Figure 5. After the pre−processing of the inner skin.
Foods 13 02096 g005
Figure 6. Results of the optimal outer skin model.
Figure 6. Results of the optimal outer skin model.
Foods 13 02096 g006
Figure 7. Results of the optimal inner skin model.
Figure 7. Results of the optimal inner skin model.
Foods 13 02096 g007
Table 1. Parameters of potato micro-hyperspectral image acquisition.
Table 1. Parameters of potato micro-hyperspectral image acquisition.
System ParameterParameter Value
Wavelength range365–1025 nm
Number of bands616
Image space size816 × 616
Objective factor20
Moving speed0.2 mm/s
Exposure time100 ms
Instrument parameters are default values.
Table 2. Characteristic wavelength information of the outer skin.
Table 2. Characteristic wavelength information of the outer skin.
MethodAll-Band NumberNumber of Characteristic VariablesSpecific Band
SG + CARS61647370,377,378,378,535,560,579,606,636,661,
763,770,781,782,783,814,819,821,827,842,
845,856,866,880,889,893,905,907,916,921,
927,947,949,969,973,978,987,993,1003,
1004,1008,1019,1020,1021,1023,1024
SNV + CARS61659365,372,373,378,383,392,396,398,419,425,
457,494,534,535,574,596,648,654,661,688,
690,700,745,750,784,813,814,818,820,821,
824,825,827,828,837,843,849,850,879,886,
889,892,894,900,905,906,916,935,942,950,
951,957,964,970,977,979,989,1008,1023
DET + CARS61675365,370,373,378,378,383,395,398,409,418,
446,473,494,541,542,552,566,574,648,649,
660,688,699,736,780,783,796,807,809,814,
818,819,827,831,837,841,843,847,849,864,
874,879,881,886,889,894,897,904,906,907,
915,916,917,922,928,946,949,950,951,953,
957,966,967,969,977,997,1001,1003,1004,
1005,1008,1010,1020,1023,1024
SG + SNV + CARS61642365,374,378,379,384,408,419,438,479,493,
534,535,551,552,560,662,689,699,750,783,
813,814,819,821,825,827,836,845,848,866,
872,880,893,906,923,947,970,988,1004,1019,1020,1023
SG + DET + CARS61667370,373,375,376,378,378,379,409,417,495,
534,535,560,561,565,592,602,660,690,764,
781,782,792,806,813,814,819,821,827,836,
842,843,845,848,864,866,879,880,889,890,
893,905,906,907,914,922,923,931,932,947,
948,949,965,968,969,978,988,1003,1004,
1005,1008,1019,1020,1021,1023,1024
SNV + DET + CARS61653373,378,391,420,489,494,535,560,575,662,
691,699,712,750,781,784,807,813,814,816,
818,820,824,825,827,837,843,857,867,873,
874,883,886,889,892,894,897,901,906,907,
920,922,928,932,938,946,947,950,951,953,977,1005,1020
SG + SPA61612379,381,696,820,845,857,880,931,947,1003,1021,1023
SNV + SPA61629383,388,390,418,465,489,541,646,663,685,
706,712,763,802,809,813,816,818,819,823,
824,825,826,846,849,861,867,879,1017
DET + SPA61619480,624,637,664,686,706,827,890,951,953,
957,964,965,970,976,978,988,1003,1010
SG + SNV + SPA61653365,372,378,382,408,423,486,573,577,596,
612,621,624,641,653,662,665,688,697,706,
712,729,734,739,742,750,778,794,798,814,
816,818,820,827,837,845,853,866,869,872,
875,888,893,907,926,929,940,947,958,964,
967,994,1021
SG + DET + SPA61663408,423,451,492,585,617,619,625,629,633,
635,659,670,674,677,693,704,710,715,730,
818,827,831,835,838,848,878,899,907,911,
916,918,921,923,925,931,936,944,946,948,
949,951,954,955,956,966,967,968,970,978,
980,981,989,992,994,996,1002,1004,1008,
1019,1021,1023
SNV + DET + SPA61612636,670,677,708,852,881,951,957,964,968,
989,990
The algorithms that are shown individually in bold in the table are the ones that are the focus of the following analysis.
Table 3. Characteristic wavelength information of the inner skin.
Table 3. Characteristic wavelength information of the inner skin.
MethodAll-Band NumberNumber of Characteristic VariablesSpecific Band
SG + CARS61659378,484,524,525,546,549,558,562,570,
582,585,590,608,625,632,635,637,683,
748,756,772,775,792,793,796,814,827,
828,835,854,857,862,866,873,874,889,
893,896,902,905,907,912,914,915,921,
935,940,956,962,970,971,988,1004,
1014,1016,1021,1023,1024
SNV + CARS61653368,380,389,392,409,422,448,507,570,
571,626,646,655,685,755,766,773,777,
782,789,792,793,798,812,814,818,827,
832,837,851,853,855,889,891,897,903,
905,907,926,934,951,957,967,972,974,
987,989,994,1010,1020,1022,1023,1025
DET + CARS61684369,370,378,388,389,422,426,519,548,
553,570,571,582,583,586,590,625,626,
627,630,632,635,637,668,683,693,739,
753,754,755,765,766,773,782,784,789,
793,796,798,810,814,819,827,837,855,
856,859,865,874,883,889,891,895,897,
902,905,906,907,916,921,922,926,927,
934,935,940,942,949,953,957,962,967,
970,971,974,975,989,994,1007,1014,
1017,1022,1023,1025
SG + SNV + CARS61659378,379,390,397,405,507,512,558,563,
570,571,643,655,685,752,766,771,773,
776,777,792,793,797,812,813,814,826,
827,829,856,862,866,873,875,887,889,
894,898,906,915,923,930,941,946,956,
959,971,972,980,988,989,993,994,
1005,1014,1017,1019,1021,1025
SG + DET + CARS61642507,518,520,526,548,549,570,590,594,
599,625,630,740,754,767,782,788,793,
797,805,813,827,856,865,875,889,894,
902,907,912,914,922,940,956,959,961,
970,988,1004, 1021,1023,1024
SNV + DET + CARS61637408,505,559,562,565,571,590,630,752,
764,771,792,794,795,815,827,830,832,
835,840,850,859,860,889,891,892,904,
906,957,966,967,985,989,993,1022,1023,1024
SG + SPA61620367,537,577,654,681,716,830,856,859,
907,914,921,934,957,959,960,987,988,
1020,1021
SNV + SPA6169391,588,636,704,793,813,826,867,873
DET + SPA61612625,685,821,854,891,946,947,971,976,
989,993,1016
SG + SNV + SPA61633379,382,400,577,590,593,618,636,639,
643,646,651,654,678,681,729,734,751,
760,802,814,820,824,827,835,868,871,
874,923,929,947,992,1014
SG + DET + SPA61631365,389,460,470,611,619,625,629,636,
638,670,683,710,814,824,828,874,907,
922,929,940,949,956,959,960,969,971,
981,1002,1004,1020
SNV + DET + SPA61611533,854,939,946,950,957,961,965,989,
993,998
The algorithms that are shown individually in bold in the table are the ones that are the focus of the following analysis.
Table 4. Regression prediction of PLS in outer skin region.
Table 4. Regression prediction of PLS in outer skin region.
Treatment MethodRpRMSEPRcRMSECRPD
Primary spectrum (Raw)0.77650.4260.78740.4292.30
SG0.71610.4480.72160.4592.18
SNV0.74960.4350.74390.4492.25
DET0.74460.4440.79280.4252.20
SG + SNV0.72370.4510.72490.4622.17
SG + DET0.71300.4570.72270.4692.14
SNV + DET0.74190.4370.74900.4442.24
SG + CARS0.82670.3870.85090.3802.53
SNV + CARS0.84620.3730.84100.3862.62
DET + CARS0.85930.3700.81580.3342.64
SG + SNV + CARS0.83820.3780.81470.4032.59
SG + DET + CARS0.83980.3780.84290.3862.59
SNV + DET + CARS0.84090.3780.83590.3892.59
SG + SPA0.74640.4280.78370.4662.29
SNV + SPA0.71230.4450.72740.4482.20
DET + SPA0.63160.4770.62330.4912.05
SG + SNV + SPA0.73470.4330.70930.4552.26
SG + DET + SPA0.72630.4540.78970.4182.15
SNV + DET + SPA0.62810.4770.60460.4952.05
The algorithms that are shown individually in bold in the table are the ones that are the focus of the following analysis.
Table 5. Regression prediction of PLS in inner skin region.
Table 5. Regression prediction of PLS in inner skin region.
Treatment MethodRpRMSEPRcRMSECRPD
Primary spectrum (Raw)0.84290.4030.81590.4222.45
SG0.82050.4210.78770.4412.35
SNV0.83900.3950.77070.4462.50
DET0.83340.4060.79850.4312.43
SG + SNV0.83970.3960.75090.4522.50
SG + DET0.78700.4290.78140.4392.30
SNV + DET0.83700.3970.75660.4492.49
SG + CARS0.90860.3460.90190.3462.85
SNV + CARS0.88680.3600.85240.3812.74
DET + CARS0.92870.3250.92270.3293.03
SG + SNV + CARS0.88320.3620.85770.3832.73
SG + DET + CARS0.91780.3320.87380.3712.97
SNV + DET + CARS0.90570.3450.85610.3782.86
SG + SPA0.86680.3760.81340.4132.63
SNV + SPA0.78290.4320.71450.4672.29
DET + SPA0.81230.4140.75010.4502.38
SG + SNV + SPA0.80870.4170.78700.4342.37
SG + DET + SPA0.81390.4140.80010.4222.38
SNV + DET + SPA0.82200.4100.77080.4392.41
The algorithms that are shown individually in bold in the table are the ones that are the focus of the following analysis.
Table 6. CNN-anthocyanin prediction model in outer skin.
Table 6. CNN-anthocyanin prediction model in outer skin.
Treatment MethodInput VariableRcRMSECRpRMSEPRPD
Primary spectrum (Raw)6160.94860.03120.94460.22914.41
SG6160.95060.03120.94400.23394.3728
SNV6160.95030.02740.94460.22734.4556
DET6160.94990.02350.94570.22344.4893
SG + SNV6160.94870.03180.94520.23094.4367
SG + DET6160.95180.02780.94450.22824.4782
SNV + DET6160.95040.02250.94560.22374.3895
SG + CARS470.95390.11030.94940.31373.7346
SNV + CARS590.95270.08150.95010.28433.9260
DET + CARS750.95240.06570.94680.26514.0942
SG + SNV + CARS420.95360.10280.94830.3003.8091
SG + DET + CARS670.95090.06730.94740.26874.0421
SNV + DET + CARS530.95490.09090.94840.29113.9047
SG + SPA120.96000.20190.95430.40123.3207
SNV + SPA290.95650.13750.95190.33793.6080
DET + SPA190.95730.18230.95260.38393.4089
SG + SNV + SPA530.95240.07760.94920.27673.9991
SG + DET + SPA630.95360.08570.94790.28603.9538
SNV + DET + SPA120.95920.22050.95410.42323.2656
The algorithms that are shown individually in bold in the table are the ones that are the focus of the following analysis.
Table 7. CNN-anthocyanin prediction model in inner skin.
Table 7. CNN-anthocyanin prediction model in inner skin.
Treatment MethodInput VariableRcRMSECRpRMSEPRPD
Primary spectrum (Raw)6160.95080.03740.94610.23614.4933
SG6160.95180.03560.94440.23644.5894
SNV6160.94840.03800.94500.23904.5236
DET6160.95120.03540.94500.23484.5869
SG + SNV6160.95010.03920.94580.23564.5635
SG + DET6160.94990.03590.94390.23844.6516
SNV + DET6160.95090.03640.94530.24024.4951
SG + CARS590.95260.08910.94890.28764.1448
SNV + CARS530.95400.09980.94870.30014.0022
DET + CARS840.95270.07080.94570.27114.1623
SG + SNV + CARS590.95300.09880.94760.29974.0345
SG + DET + CARS420.95420.12010.95120.31183.8870
SNV + DET + CARS370.95420.11680.94730.31943.9253
SG + SPA200.95650.14190.95100.34093.8222
SNV + SPA90.95960.20920.95590.41033.4881
DET + SPA120.95610.16490.95110.36723.6420
SG + SNV + SPA330.95210.09950.94790.30104.0386
SG + DET + SPA310.95420.12100.94920.32213.8943
SNV + DET + SPA110.95920.21840.95340.42013.3842
The algorithms that are shown individually in bold in the table are the ones that are the focus of the following analysis.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Wang, F.; Li, Q.; Deng, W.; Wang, C.; Han, L. Detection of Anthocyanins in Potatoes Using Micro-Hyperspectral Images Based on Convolutional Neural Networks. Foods 2024, 13, 2096. https://doi.org/10.3390/foods13132096

AMA Style

Wang F, Li Q, Deng W, Wang C, Han L. Detection of Anthocyanins in Potatoes Using Micro-Hyperspectral Images Based on Convolutional Neural Networks. Foods. 2024; 13(13):2096. https://doi.org/10.3390/foods13132096

Chicago/Turabian Style

Wang, Fuxiang, Qiying Li, Weigang Deng, Chunguang Wang, and Lei Han. 2024. "Detection of Anthocyanins in Potatoes Using Micro-Hyperspectral Images Based on Convolutional Neural Networks" Foods 13, no. 13: 2096. https://doi.org/10.3390/foods13132096

APA Style

Wang, F., Li, Q., Deng, W., Wang, C., & Han, L. (2024). Detection of Anthocyanins in Potatoes Using Micro-Hyperspectral Images Based on Convolutional Neural Networks. Foods, 13(13), 2096. https://doi.org/10.3390/foods13132096

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop