Next Article in Journal
The Influence of Temperature Changes on the Rice Starch Structure and Digestive Characteristics: One and Two-Step Annealing
Previous Article in Journal
Food Consumption Characteristics and Influencing Factors in a Grassland Transect of Inner Mongolia Based on the Emergy Method
Previous Article in Special Issue
Prediction of Anthocyanin Color Stability against Iron Co-Pigmentation by Surface-Enhanced Raman Spectroscopy
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Prediction Models for the Content of Calcium, Boron and Potassium in the Fruit of ‘Huangguan’ Pears Established by Using Near-Infrared Spectroscopy

1
School of Horticulture, Anhui Agricultural University, 130 Changjiang West Road, Hefei 230036, China
2
School of Information and Computer Science, Anhui Agriculture University, 130 Changjiang West Road, Hefei 230036, China
*
Author to whom correspondence should be addressed.
Foods 2022, 11(22), 3642; https://doi.org/10.3390/foods11223642
Submission received: 6 September 2022 / Revised: 21 October 2022 / Accepted: 7 November 2022 / Published: 14 November 2022
(This article belongs to the Special Issue Spectroscopy Analysis for Foods)

Abstract

:
It has been proved that the imbalance of the proportion of elements of ‘Huangguan’ pears in the pulp and peel, especially calcium, boron and potassium, may be important factors that can seriously affect the pears’ appearance quality and economic benefits. The objective of this study was to predict the content of calcium, boron and potassium in the pulp and peel of ‘Huangguan’ pears nondestructively and conveniently by using near-infrared spectroscopy (900–1700 nm) technology. Firstly, 12 algorithms were used to preprocess the original spectral data. Then, based on the original and preprocessed spectral data, full-band prediction models were established by using Partial Least Squares Regression and Gradient Boosting Regression Tree. Finally, the characteristic wavelengths were extracted by Genetic Algorithms to establish the characteristic wavelength prediction models. According to the prediction results, the value of the determination coefficient of the prediction sets of the best prediction models for the three elements all reached ideal levels, and the values of their Relative analysis error also showed high levels. Therefore, the micro near-infrared spectrometer based on machine learning can predict the content of calcium, boron and potassium in the pulp and peel of ‘Huangguan’ pears accurately and quickly. The results also provide an important scientific theoretical basis for further research on the degradation of the quality of ‘Huangguan’ pears caused by a lack of nutrients.

1. Introduction

‘Huangguan’ pears (Pyrus bretschneideri Rehd cv.Huangguan), which are famous for their excellent quality in Chinese and even the world’s pear market, have been introduced and planted in many places with good performance and have a good development prospect [1,2]. In actual production, ‘Huangguan’ pears are often bagged to prevent pests and diseases. However, brown spots that often appear on the surface of bagged ‘Huangguan’ pears have been seriously affecting the appearance quality and economic benefits of the fruits [3,4]. A large number of studies have proved that the content of calcium, boron and potassium may be important factors affecting the appearance quality of bagged ‘Huangguan’ pears [5,6,7,8,9]. The traditional detection of the mineral nutrient content is mostly based on laboratory physical and chemical analysis, including inductively coupled plasma mass spectrometry (ICP-MS), atomic absorption spectrometry, UV-visible spectrophotometer measurement and so on [10,11,12]. Although the results are accurate, destructive sampling, high time consumption, high labor intensity, high costs and some other disadvantages have brought many limitations to the study of pear fruit mineral nutrition. If the mineral nutrient content in the fruit of pears can be conveniently and nondestructively detected by modern technology during the critical period of the growth and development of ‘Huangguan’ pears, then the fertilization and other measures can be adjusted accordingly, and it will be of great significance to improve the quality of the fruit of ‘Huangguan’ pears.
Up to now, some methods are often used for the nondestructive testing of fruits, such as hyperspectral imaging technology, near-infrared spectroscopy, X-ray testing, etc. [13]. Among them, the near-infrared spectroscopy detection method is favored by many researchers because of its advantages of nondestructiveness, convenience, environmental protection and safety. The absorption of molecules within the coverage of the near-infrared spectrum consists of the frequency combination and double frequency absorption of hydrogen-containing groups or other chemical bonds in various organic substances, mainly including C-H, N-H, S-H, O-H, etc. Therefore, the spectral curve obtained by using near-infrared spectroscopy technology can better reflect the information of organic substances containing hydrogen groups in the tested samples and composition information in various other biochemical structures [14].
At present, near-infrared spectroscopy is applied to the detection and research of fruit quality, for example, the internal dry matter content (DMC) and soluble solid concentration (SSC) of peach fruit, and the prediction of the titratable acidity, pH value and maturity index in citrus [15,16,17,18,19,20,21,22]. Many investigators have also used near-infrared spectroscopy to predict the mineral content of different trees or crops. Wiebke et al. [23] quickly measured the content of calcium and boron in Hass avocado fruit with near-infrared hyperspectral imaging, greatly improving the previous time-consuming and destructive laboratory method for evaluating fruit quality. Cozzolino et al. [24] explored the use of near-infrared spectroscopy to predict the electrical conductivity and elements Ca, Mg, K, etc. in grapes as a fast method to monitor such component parameters. Wang et al. [25] used the visible-near-infrared spectroscopy technique combined with the Partial Least Squares Regression method to establish a full-band (350–2500 nm) pear leaf nitrogen content prediction model more accurately, and based on this, timely estimated the nitrogen content in supplementary topdressing; Damián [26] et al. used near-infrared spectroscopy combined with MPLS to determine the content of calcium, potassium, magnesium and other mineral elements in zucchini, and promoted summer squash varieties rich in micronutrients; Rossa et al. [27] used near-infrared spectroscopy to determine some mineral nutrients in mate tea leaves and found that the NIRS had a good effect on predicting the contents of phosphorus, copper and iron in leaves, but it was not effective in the prediction of sodium, zinc and manganese and potassium content. It can be seen that NIRS technology has great potential in the determination of the mineral nutrient content of pear fruit. Although near-infrared spectroscopy is more suitable for detecting organic compounds, it can still reflect the information of inorganic parts by indirectly measuring other compounds that can be directly measured by near-infrared spectroscopy, or by combining inorganic compounds with organic compound functional groups or directly with organic matrix.
Therefore, ‘Huangguan’ pears were taken as the research object to explore the effect of using near-infrared spectroscopy (900–1700 nm) technology to predict the content of calcium, boron and potassium in pear fruit in this paper. Firstly, based on the original spectral data and the spectral data obtained after pretreatment, and successive uses of PLSR and GBRT to establish full-band models for predicting the content of the mineral elements calcium, boron and potassium in the pulp and peel of ‘Huangguan’ pears, we used a genetic algorithm to extract the characteristic wavelengths and establish characteristic wavelength models. According to the model evaluation index-determination coefficient and Relative analysis error, the best models for predicting the content of mineral elements were selected to provide some theoretical basis for the nondestructive detection of the mineral nutrient content of pear fruits.

2. Materials and Methods

2.1. Samples

The tested pear fruits were from a horticultural farm located in Dangshan County, Anhui Province, which is known as the ‘pear capital’ of China. In mid-August 2021, at 26 °C, the fruits were picked and transported back to the laboratory by cartons with good air permeability. A total of 65 ‘Huangguan’ pears with uniform appearance and no damage were strictly selected as the test samples. The surface of the pears was washed, then wiped clean and numbered for further use.

2.2. Near-Infrared Spectrum Data Acquisition

The near-infrared spectrum acquisition instrument used was a reflective handheld near-infrared spectrometer with the model ‘NIR-S-G1′ produced by the Shenzhen Ugreen Group, Ltd. (Figure 1) (Shenzhen, China). The detected spectral wavelength range was from 900 nm to 1700 nm with 228 bands of spectrum acquisition points on it. The spectral resolution was 3.89 nm, and the signal-to-noise ratio (SNR) was 5000:1.
Before each measurement, the spectrometer was calibrated with a black and white plate. The material used for the white board calibration was polytetrafluoroethylene. The instrument was closely attached to the calibration white board. The light emitted by the instrument shined on the white board and reflected. This reflected light was captured and recorded as the brightness value (W) of the white board; for the blackboard correction, we turned off the emission light source of the instrument and recorded the brightness value captured by the instrument when there was no light source, which was recorded as the blackboard brightness value (B). After calibration, the instrument was used to measure the spectral data of the pear surface, the light source window on the instrument was close to the pear sample and the reflected light obtained was recorded as the brightness value (R) of the pear surface. The spectral reflectance of the sample can be calculated by Formula (1). The app ‘Instagram’ on a mobile phone was connected with the instrument by Bluetooth before use, and then the spectral data was collected.
R = (I − B)/(W − B)
In the spectral measurement process, it is inevitable that the results will be affected by environmental factors. If the data are collected only once, the error will be too large. Therefore, in order to make the measurement results as accurate as possible, we took five points on the surface of each pear sample and used each point to collect spectral data once, and we repeated this five times. The specific operations were as follows: As shown in Figure 2, before collecting spectral data, an ellipse with a short axis length of about 5 cm and a long axis length of about 8 cm was delimited near the equator of the surface of each pear, and the area within the ellipse was used as the scope of subsequent spectral data collection and sampling. Then, we closed the scanning window at the front end of the spectrometer to the designated range on the surface of pears, scanned once at each spectral acquisition point and scanned 5 times for each pear sample in total. Finally, the average of the reflectance spectral data of the 5 scans of each pear sample was taken as the original modeling spectral data.

2.3. Detection of Mineral Elements in the Fruit of Pears

After the spectral data collection was completed, the pulp and peel of each pear were sampled within the designated range, packed in self-sealing bags and marked with numbers. The content of the mineral elements calcium, boron and potassium in the samples were detected by the instruments in the Anhui Institute of geological experiments. The detection basis was GB 5009.268-2016, and the main instrument used was an inductively coupled plasma emission spectrometer/B1497.

2.4. Near-Infrared Spectrum Data Information Preprocessing and Characteristic Wavelength Extraction

Since there is no literature indicating which preprocessing method is the best, trial and error is usually used to determine the appropriate spectral data preprocessing method [28]. Therefore, after obtaining the original spectral curve based on the algorithms first derivative (FD), second derivative (SD), Multiple Scattering Correction (MSC), Standard Normal Variate (SNV), Savitzky Golay Convolution Smoothing (SG) and Logarithmic Transformation (LG), we used algorithms combined by two single preprocessed methods—SG+MSC and SG+SNV—and algorithms combined by three single preprocessed methods—SG+MSC+SD, SG+MSC+FD, SG+SNV+FD and SG+SNV+SD—to preprocess the original spectral data and compare the impact of different preprocessed methods on the later modeling accuracy.
After preprocessing based on Genetic Algorithms (GA), we extracted the characteristic wavelengths of the full-band models that met the prediction standard, and then established the characteristic wavelength models to further optimize the models [29,30].

2.5. Establishment of Prediction Models for the Content of Mineral Elements in the Fruit of ‘Huangguan’ Pears

Partial Least Square Regression (PLSR) is a classical statistical method; it combines the advantages of principal component analysis, multiple linear regression, and canonical correlation analysis, and can solve the problems of spectral data redundancy and collinearity [31,32]. It can be seen that PLSR is superior to the general least squares method, so we selected PLSR as one of the modeling methods. Gradient Boosting Regression Tree (GBRT) uses the boosting integration strategy to integrate and improve the traditional algorithms. It combines the advantages of multiple algorithms and has great advantages in nonlinear fitting. However, in the process of model training, it is necessary to continuously adjust the parameters to achieve a more reasonable training effect [33]. Therefore, we also selected GBRT as another modeling method.
Additionally, the model evaluation indicators in this paper are the determination coefficient (R2) and Relative Percent Deviation (RPD). The value range of the determination coefficient (R2) is (0, 1). The closer R2 is to 1, the smaller the error is, the more stable the model is, and the better the prediction effect is. Relative analysis error (RPD) is an important indicator to evaluate the quality of the model. This paper evaluates the prediction abilities of the models according to the RPD evaluation grade proposed by Chang et al. [34]: when ERPD ≥ 2, it indicates that the prediction effect of the model is very good, and the model belongs to level A, which can be applied to the corresponding quantitative prediction; when 1.4 ≤ ERPD < 2, it indicates that the model has the moderate predictive ability and belongs to level B; when ERPD < 1.4, it indicates that the prediction effect of the model is relatively poor, and the model belongs to level C and cannot be used for quantitative prediction.

3. Results

3.1. Analysis of Original Spectral Curve

It can be seen from Figure 3 that the overall shape of the spectral curves of all ‘Huangguan’ pear fruit samples was similar, with clear peaks at wavelengths of 1050 nm, 1260 nm and 1680 nm, and clear troughs at 980 nm, 1200 nm, 1450 nm and 1660 nm. According to the literature, 980 nm and 1450 nm are the main water absorption peaks, corresponding to the first frequency doubling information of the O-H bond stretching vibration of related carbohydrates, and about 1660 nm corresponds to the first frequency doubling information of the C-H bond stretching vibration in methyl [35]. The main chemical components in pears include flavones, triterpenes, phenolic acids, luminol esters, polysaccharides, etc., of which polyphenols are the most abundant component, which are also very valuable secondary metabolites in pears, including chlorogenic acid, arbutin, coumaric acid, etc. [36]. At the same time, phenolic acids, flavones and triterpenes are usually more abundant in pear peels than that in pear pulp [37]. The presentation of the spectral curve, which corresponds to the frequency doubling and frequency information of different chemical bond vibrations accordingly, was complicated because of the complex compounds in pear fruits.

3.2. Analysis of Spectral Data Preprocessing Results

The spectral curves obtained by twelve pretreatment methods are shown in Figure 4. The different pretreatment methods had different effects. Firstly, it can be seen from Figure 4a,b that the baseline offset phenomenon was improved to a certain extent after the derivative methods, and the signal-to-noise ratio also decreased, but the noise also increased accordingly. The processing of the second derivative algorithm had an even greater noise impact than the first derivative algorithm. Secondly, the spectral curves after being processed by MSC and SNV are shown in Figure 4c,d. MSC and SNV had no obvious tendency to change the original spectral curve. MSC transformation mainly eliminated the scattering effect caused by uneven particle distribution and different particle sizes on the surface of the samples, and SNV was mainly used to eliminate the influence of scattering and optical path changes on the spectrum. Thirdly, it can be seen from Figure 4e that the overall shape of the spectral curve preprocessed by SG was similar to the original spectral curve. SG removed some irrelevant noise based on the original spectral data, making the spectral curve smoother. Fourthly, LG reduced the absolute value of the spectral data to a certain extent, which was conducive to later calculation. In general, the curves processed by MSC, SNV, SG, SG+MSC and SG+SNV were similar to the original spectral curves. In addition, the curves pretreated by FD, SG+MSC+FD and SG+SNV+FD were similar, and the curves pretreated by SD, SG+MSC+SD and SG+MSC+SD were also similar.

3.3. Characteristic Analysis of Sample Detection Data

From the data shown in Table 1, the contents of calcium, boron and potassium in the peel were higher than those in the pulp, and the difference was obvious. In the pulp, the content of potassium was much higher than that of calcium and boron, and it was higher in the peel as well.
It can be seen from Figure 5 that except for a few samples that were far away from the reference line, other samples were closely distributed around the reference line, where the x-axis value is the actual detection value and the corresponding y-axis value is the expected normal value.

3.4. Modelling and Prediction Analysis of the Content of Mineral Elements in Pear Pulp

Table 2 shows the comparison of parameters R2 and RPD of each prediction model for calcium, boron and potassium content in pear pulp based on full-wave band.

3.4.1. Modelling and Prediction Analysis of the Content of Calcium in Pear Pulp

(1)
Full-band based modelling
Based on the original spectral data and the spectral data after 12 kinds of pretreatment, 13 PLSR and GBRT models were established to predict the content of calcium in pear pulp. Among the 13 PLSR models, Raw-PLSR (prediction set R2 = 0.716, RPD = 1.431) and SG-PLSR (prediction set R2 = 0.717, RPD = 1.433) met level B. However, the predictive effects of the GBRT models for predicting the content of calcium in the pear pulp based on the full-wave band were not ideal, and they could not be used for the prediction of the content of calcium in pear pulp.
(2)
Characteristic wavelength extraction
Based on GA, 125 and 119 characteristic wavelengths were extracted from the original spectral curves of RAW-PLSR and SG-PLSR, respectively. As shown in Figure 6, the characteristic wavelengths extracted from the original spectral curve points of Raw-GA-PLSR were concentrated at 1050–1300 nm and 1400–1680 nm, and at the same time, the characteristic wavelengths extracted from the original spectral curve points of SG-GA-PLSR were concentrated at 950–1300 nm and 1400–1680 nm. Although the specific values of the characteristic wavelengths extracted from the spectral curves of Raw-GA-PLSR and SG-GA-PLSR were not the same, their distribution ranges on their original spectral curves were similar, which indicates that the characteristic wavelength points corresponding to the compounds related to the content of calcium in pear pulp were roughly concentrated in the range above.
(3)
Modelling of characteristic wavelengths
According to the modelling results presented in Table 3 and Figure 7, after GA treatment, the two models for predicting the content of calcium in pear pulp—Raw-PLSR and SG-PLSR—showed a slight ‘underfitting’ state (the effect of the prediction set was slightly better than that of the modelling set), which may have been due to the sample size, but the prediction performance was improved to level A. Furthermore, the prediction performance of the two models was better than before, which further shows that the feature extraction process can remove wavelengths that are not relevant to later modelling to a certain extent, and keep the features with rich information as much as possible to improve the predictive effect of the models.
(4)
Modelling effect evaluation
After comparative analysis, among the established models for predicting the content of calcium in the pulp of ‘Huangguan’ pears, Raw-GA-PLSR showed the best prediction effect and met level A, whose modelling sets R2 and RPD were 0.987 and 6.110, respectively.

3.4.2. Modelling and Prediction Analysis of the Content of Boron in Pear Pulp

(1)
Full-band based modelling
A total of 65 samples of pear pulp boron content were randomly divided into a modeling set at 52 and a prediction set at 13, and then subsequent modeling analyses were carried out. Among the 13 PLSR models established, only RAW-PLSR and SG-PLSR met the prediction standards; moreover, the R2 of the modelling sets and prediction sets of SG-PLSR increased slightly and reached level B, and at the same time, the R2 of the modelling sets and prediction sets of the remaining 11 models decreased to varying degrees, among which SD-PLSR, SG+MSC+SD-PLSR and SG+SNV+SD-PLSR failed. Among the 13 GBRT models established, except LG-GBRT, the R2 and RPD of the modelling sets of the other 11 models based on pretreatment spectra were improved compared with RAW-PLSR. Although the R2 and RPD of the prediction sets of SNV-GBRT, MSC-GBRT, SD-GBRT, SG+MSC-GBRT, SG+SNV-GBRT and SG+MSC+FD-GBRT were improved compared with RAW-GBRT, they still did not meet the prediction standard.
(2)
Characteristic wavelength extraction
Among the 26 models for predicting the content of boron in pear pulp based on the full-wave band, RAW-PLSR and SG-PLSR, which initially met the prediction standards, were extracted with characteristic wavelengths. The genetic algorithm extracted 116 and 118 characteristic wavelengths from the spectral curves of RAW-PLSR and SG-PLSR, respectively. As shown in Figure 8, in the original spectral curve points of Raw-GA-PLSR, the extracted characteristic wavelengths were concentrated at 900–1100 nm, 1200–1400 and 1500–1700 nm; meanwhile, in the original spectral curve points of SG-GA-PLSR, the extracted characteristic wavelength points were mainly distributed at 950–1100 nm and 1200–1680 nm. The dense distribution areas of the characteristic wavelengths of the two models were similar, indicating that the compounds related to the content of boron in pear pulp mainly responded in the range of 950–1100 nm, 1200–1400 nm and 1500–1680 nm.
(3)
Modelling of characteristic wavelengths
After GA treatment, the predictive effects of RAW-PLSR and SG-PLSR for predicting the content of boron in pear pulp was significantly improved as the results shown in Table 4. The scatter plots of the prediction results shown in Figure 9 verify the good prediction effects of RAW-GA-PLSR and SG-GA-PLSR. The R2 of the two model prediction sets exceeded 0.9, reaching a relatively accurate prediction level, while the R2 of the modelling set was slightly lower than that of the respective prediction sets. Although it showed a slight ‘underfitting’ state, it did not affect the overall predictive effects.
(4)
Modelling effect evaluation
After analysis and discussion, SG-GA-PLSR had the best effect on predicting the content of boron in pear pulp, whose R2 = 0.846 and RPD = 1.878 in the modelling set; R2 = 0.986 and RPD = 6.033 in the prediction set; and the effect of the prediction set had reached level A.

3.4.3. Modelling and Prediction Analysis of the Content of Potassium in Pear Pulp

(1)
Full-band based modelling
Among the 13 established PLSR models, Raw-PLSR and SG-PLSR met the predicting standards, while the fitting results of the remaining 11 PLSR models did not meet the expectations. While among the 13 established GBRT models, RAW-GBRT, SNV-GBRT, FD-GBRT, MSC-GBRT, SG+MSC-GBRT, SG+SNV-GBRT, SG+MSC+FD-GBRT and SG+SNV+FD-GBRT were invalid, SD-GBRT, SG-GBRT, LG-GBRT, SG+MSC+SD-GBRT and SG+SNV+SD-GBRT did not meet the prediction standards. On the whole, the established GBRT models were not suitable for the prediction of the content of potassium in ‘Huangguan’ pear pulp, and the reasons may be as follows: On one hand, GBRT is an additive model, and the next round is the residual between the predicted value and the actual value of the previous round, which is used as a label to continue fitting, and finally, the results are added, which in the end may lead to cases where R2 is negative or the prediction is invalid. On the other hand, the number of samples, data detection errors and so on may also be other reasons.
(2)
Characteristic wavelength extraction
The spectral curves of Raw-PLSR and SG-PLSR, whose full-wave band models met the prediction standards for predicting the content of potassium in pear pulp, were optimized to extract the characteristic wavelengths. Then, GA extracted 118 and 114 characteristic band points from the original spectral curve and the spectral curve preprocessed by SG, respectively. As shown in Figure 10, on the original spectral curve, the extracted characteristic wavelengths were densely distributed in the range of 900–1100 nm, 1200–1300 nm and 1500–1680 nm. Meanwhile, on the spectral curve after SG pretreatment, the extracted characteristic wavelengths were most densely distributed in the range of 1020–1150 nm and 1430–1650 nm. It can be inferred that the response of the compounds related to the content of potassium in pear pulp on the spectral curve was mostly concentrated in the range of 1000–1100 nm and 1500–1650 nm.
(3)
Modelling of characteristic wavelengths
According to the results shown in Table 5, after GA treatment, the prediction abilities of RAW-PLSR and SG-PLSR were both greatly improved, and the model levels also rose from level B to level A. The scatter plots of the prediction results are shown in Figure 11, and they more intuitively show that both Raw-GA-PLSR and SG-GA-PLSR can more accurately predict the content of potassium in the pulp of ‘Huangguan’ pears.
(4)
Modelling effect evaluation
After discussion and analysis, SG-GA-PLSR had the best performance in predicting the content of potassium in pear pulp, with R2 = 0.834 and RPD = 1.810 for the modelling set and R2 = 0.985 and RPD = 5.833 for the prediction set.

3.5. Modelling and Prediction Analysis of the Content of Mineral Elements in Pear Peel

Table 6 shows the comparison of parameters R2 and RPD of each prediction model for calcium, boron and potassium content in pear peel based on a full-wave band.

3.5.1. Modelling and Prediction Analysis of the Content of Calcium in Pear Peel

(1)
Full-band based modelling
Among the 13 PLSR models established, the predicting level of RAW-PLSR, SNV-PLSR, MSC-PLSR, SG-PLSR, LG-PLSR and SG+SNV-PLSR were all B. Meanwhile, although the R2 and RPD of the modelling sets of SNV-PLSR, MSC-PLSR and SG+SNV-PLSR decreased slightly compared with the original models, the R2 and RPD of the prediction sets of these three models were improved, which means that they may have better abilities to predict the content of calcium in the peel of ‘Huangguan’ pears. Furthermore, among the 13 established GBRT models, although the predictive effects of MSC-GBRT and SG+MSC-GBRT were outstanding at the modelling sets, their R2 and RPD of the prediction sets did not meet the predictive standards.
(2)
Characteristic wavelength extraction
As shown in Figure 12, GA extracted 113, 109, 118, 136, 105 and 104 characteristic wavelengths from the spectral curves processed by SNV, MSC, SG, LG, SG+MSC and SG+SNV, respectively. Firstly, the characteristic wavelengths extracted from the spectral curve processed by SNV were roughly concentrated at 1000–1100 nm, 1200–1350 nm and 1550–1680 nm; secondly, the characteristic wavelengths extracted from the spectral curve points processed by MSC were concentrated at 900–1100 nm, 1200–1300 nm and 1500–1650 nm; thirdly, the characteristic wavelengths extracted from the spectral curve points processed by SG were concentrated at 900–1080 nm, 1150–1380 nm and 1500–1650 nm; fourthly, the characteristic wavelengths extracted from the spectral curve points processed by LG were concentrated at 900–1350 nm and 1500–1600 nm; fifthly, the characteristic wavelengths extracted from the spectral curve points processed by SG+MSC were concentrated at 1000–1100 nm and 1450–1650 nm; and finally, the characteristic wavelengths extracted from the spectral curve points processed by SG+SNV were concentrated at 1000–1300 nm and 1400–1680 nm. Overall, the characteristic wavelengths extracted from the six spectral curves were mostly concentrated in 1000–1100 nm, 1200–1300 nm and 1500–1600 nm, indicating that the information reflected by the organic matters related to the content of calcium in the peel was mostly concentrated on these three bands.
(3)
Modelling of characteristic wavelengths
After the GA treatment, the prediction abilities of the six models for predicting the content of calcium in pear peel were improved. As the results shown in Table 7 and Figure 13, the prediction effects of the modelling set and prediction set of MSC-GA-PLSR and SG+MSC-GA-PLSR all reached level A. Although the prediction sets of SNV-GA-PLSR, SG-GA-PLSR, LG-GA-PLSR and SG+SNV-GA-PLSR met level A, the level of the modelling sets remained at level B; moreover, the R2 and RPD of the modelling sets were slightly lower than those of the model without feature processing. Firstly, this may have been because, in the process of feature wavelength extraction, the eliminated feature wavelengths may have contained useful information that was closely related to later modelling, and the lack of this part of the information had a certain impact on the accuracy of the models. Secondly, it may have also been related to the data structure characteristics of the original modelling sets.
(4)
Modelling effect evaluation
Through comparative analysis, LG-GA-PLSR was the best model for predicting the content of calcium in pear peels, with R2 = 0.93 and RPD = 2.671 in the modelling set and R2 = 0.989 and RPD = 6.951 in the prediction set, which also met level A.

3.5.2. Modelling and Prediction Analysis of the Content of Boron in Pear Peel

(1)
Full-band based modelling
Among the 12 PLSR models based on the spectrum after pretreatment, SNV-PLSR, MSC-PLSR, SG+MSC-PLSR and SG+SNV-PLSR significantly improved the R2 and RPD of the prediction sets compared with RAW-PLSR, and reached level B. Among the 13 GBRT models for predicting the content of boron in pear peels, although the modelling sets of FD-GBRT, SD-GBRT, LG-GBRT, SG+MSC+FD-GBRT, SG+MSC+SD-GBRT and SG+SNV performed well, with R2 exceeding 0.8 and RPD exceeding 1.4, their prediction sets did not meet the basic prediction criteria.
(2)
Characteristic wavelength extraction
Among the 26 established full-wave band models, SNV-PLSR, MSC-PLSR, SG+MSC-PLSR and SG+SNV-PLSR, which initially met the prediction standards, were further extracted with characteristic wavelengths. GA extracted 116, 109, 108 and 124 characteristic wavelengths from the spectral curves preprocessed by SNV, MSC, SG+MSC and SG+SNV, respectively. It can be seen from Figure 14 that on the spectral curve after SNV, the extracted characteristic wavelengths were most densely distributed in the range of 900–1070 nm, 1200–1350 nm and 1450–1650 nm; on the spectral curve after MSC, the extracted characteristic wavelengths were most densely distributed in the range of 910–1090 nm, 1200–1300 nm and 1400–1680 nm; on the spectral curve after SG+MSC, the extracted characteristic wavelengths were most densely distributed in the range of 940–1060 nm, 1130–1260 nm and 1490–1680 nm; on the spectral curve after SG+SNV, the extracted characteristic wavelengths were most densely distributed in the range of 960–1090 nm, 1150–1300 nm and 1450–1680 nm. The dense distribution areas of the extracted characteristic wavelengths were similar, which indicates that the compounds with a great correlation with the content of boron in the pear peels mainly responded in the range of 900–1050 nm, 1200–1300 nm and 1500–1650 nm.
(3)
Modelling of characteristic wavelengths
As the results shown in Table 8 and Figure 15, after the GA treatment, the performances of SNV-GA-PLSR, MSC-GA-PLSR, SG+MSC-GA-PLSR and SG+SNV-GA-PLSR for predicting the content of boron in the pear peel were improved to varying degrees. The scatter plots of the prediction results shown in Figure 15 further intuitively show the predictive effects of the models.
(4)
Modelling effect evaluation
Through comparative analysis, SG+MSC-GA-PLSR had the best effect on predicting the content of boron in pear peels, with the R2 of the modelling set = 0.870 and RPD = 2.027, and the R2 of the prediction set = 0.984 and RPD = 5.598.

3.5.3. Modelling and Prediction Analysis of the Content of Potassium in Pear Peel

(1)
Full-band based modelling
As shown in Table 5, among the 13 PLSR models established for predicting the content of potassium in pear peels, SNV-PLSR and SG+SNV-PLSR met the predictive standards, and the R2 and RPD of the SG+SNV-PLSR prediction set were slightly higher than those of the SNV-PLSR prediction set. This may be because the SG+SNV pretreatment further smoothed the curve and improved the modelling accuracy based on the SNV. At the same time, among the 13 GBRT models, SNV-GBRT, MSC-GBRT, SD-GBRT, SG+MSC-GBRT, SG+SNV-GBRT, SG+MSC+SD-GBRT and SG+SNV+SD-GBRT failed. The remaining six GBRT models belonged to level C and could not be used to predict the content of potassium in the peel of ‘Huangguan’ pears.
(2)
Characteristic wavelength extraction
SNV-PLSR and SG+SNV-PLSR, which preliminarily met the prediction standards, were processed by extracting the characteristic wavelengths on the spectral curves. GA extracted 122 and 133 characteristic wavelengths from the spectral curves pretreated by SNV and SG+SNV, respectively. As shown in Figure 16, on the spectral curve pretreated by SNV, the extracted characteristic wavelengths were mainly concentrated in 1160–1280 nm and 1400–1650 nm; on the spectral curve pretreated by SG+SNV, the extracted characteristic wavelengths were mainly concentrated at 1050–1110 nm, 1200–1300 nm and 1450–1680 nm. Therefore, the response intervals of the compounds related to the content of potassium in the pear peel on the spectral curve were mainly 1200–1300 nm and 1450–1650 nm.
(3)
Modelling of characteristic wavelengths
It can be seen from Table 9 and Figure 17 that the prediction ability of the model built after GA processing was improved. Although the R2 and RPD of the SNV-GA-PLSR and SG+SNV-GA-PLSR modelling sets were slightly decreased, the R2 and RPD of their prediction sets were higher than those before feature extraction processing, and the prediction level of the model increased to level A.
(4)
Modelling effect evaluation
From the discussion and comparison above, we can see that SG+SNV-GA-PLSR had the best ability to predict the content of potassium in pear pulp with an R2 = 0.888 and RPD = 2.175 for the modelling set, and an R2 = 0.973 and RPD = 4.294 for the prediction set.

4. Discussion

Previous studies have pointed out that the brown spots that seriously affect the appearance quality of ‘Huangguan’ pears are caused by physiological disease, [38] and are largely related to the content and proportion of mineral elements in the pulp and peel of pears. In previous studies, the mineral elements calcium, boron and potassium were proved to be important factors affecting the appearance quality of ‘Huangguan’ pears, and the content of mineral elements in the pulp and peel of diseased pears may have a certain degree of correlation, which will show a different appearance with the severity of the disease [6]. The timely detection of the content of calcium, boron and potassium in fruits and the adjustment of fertilization and bagging based on the detected results are of great significance to the improvement of fruit quality in the later stage. Compared with the traditional laboratorial and expensive physical and chemical detection methods, near-infrared spectroscopy has been applied to the study of soluble solids, titratable acids and the mineral element content of pears for its several potential advantages [20,21,22,23]. Therefore, in this paper, which is based on the visible-near-infrared (900–1700 nm) spectroscopy technology, the ‘NIR-S-G1′ micro spectrometer was used to collect spectral data, and combined with 12 pretreatment methods, the content of calcium, boron and potassium in ‘Huangguan’ pears’ fruit (including peel and pulp) were predicted by PLSR and GBRT. For the models that met the prediction standard, GA was used to extract the characteristic wavelengths of its original spectral curve to further optimize the models.
The full-wave band modelling results showed that the overall prediction abilities of the PLSR models were stronger than those of the GBRT models. The GBRT integration algorithm combines the advantages of many algorithms, but its performance in processing high-dimensional data with small samples was not as good as that of the traditional single regression model PLSR. This is roughly the same as what was found in previous studies [39,40,41], but there are also differences. Relatively speaking, the overall prediction effects of the models with feature extraction by the genetic algorithm were better than those of the full-band models without feature extraction. Although the specific characteristic wavelengths extracted from each model were not the same, the distribution ranges of the extracted characteristic wavelengths on the original spectral curve were similar, which may have been due to the similarity of the compounds corresponding to the corresponding mineral elements, resulting in the similar chemical bond vibration laws contained in the corresponding compounds. Previous studies have shown that the chemical components in the pulp and peel of pears are complex, and the main chemical components in pears include flavonoids, triterpenes, phenolic acids, etc. Generally, the peel of a pear contains more phenolic acids, flavonoids and triterpenes than the pulp. The presentation of the spectral curves also corresponds to the frequency doubling and frequency combining information of different chemical bond vibrations. In the range of 900–1700 nm, some chemical bonds with a strong saturation activity, especially the X-H bond, and some other chemical bonds are also active in this range, such as the C=O bond of ester and the C=N bond of amine. This paper may provide some theoretical basis for the study of pear fruit characteristic wavelength modelling.
The results of this study showed that the best models for predicting the content of calcium in the pulp and peel of ‘Huangguan’ pears were RAW-GA-PLSR (the R2 and RPD of the modelling set were 0.898 and 2.271, respectively, and the R2 and RPD of the prediction set were 0.987 and 6.110, respectively) and LG-GA-PLSR (the R2 and RPD of the modelling set were 0.93 and 2.671, respectively, and the R2 and RPD of the prediction set were 0.989 and 6.951, respectively). The prediction ability of LG-GA-PLSR was slightly stronger than RAW-GA-PLSR, which may have occurred because the compounds related to the content of calcium in pear peels are more sensitive in the range of 900–1700 nm. Secondly, the best models for predicting the content of boron in the pulp and peel of ‘Huangguan’ pears were SG-GA-PLSR (the R2 and RPD of the modelling set were 0.846 and 1.878, respectively, and the R2 and RPD of the prediction set were 0.986 and 6.033, respectively) and SG+MSC-GA-PLSR (the R2 and RPD of the modelling set were 0.870 and 2.027, respectively, and the R2 and RPD of the prediction set were 0.984 and 5.598, respectively). The prediction effect of SG-GA-PLSR was slightly stronger than that of SG+MSC-GA-PLSR, which may have occurred due to the pretreatment method, or because the compounds related to the content of boron in the pulp have a stronger response within the spectral range determined in this paper. Thirdly, the best models for predicting the content of potassium in the pulp and peel of ‘Huangguan’ pears were SG-GA-PLSR (the R2 and RPD of the modelling set were 0.834 and 1.810, respectively, and the R2 and RPD of the prediction set were 0.985 and 5.833, respectively) and SG+SNV-GA-PLSR (the R2 and RPD of the modelling set were 0.888 and 2.175, respectively, and the R2 and RPD of the prediction set were 0.973 and 4.294, respectively).

5. Conclusions

From the discussion above, it can be concluded that near-infrared spectroscopy can be used in the effectively and accurately nondestructive prediction of calcium, boron and potassium in the pulp and peel of ‘Huangguan’ pears. Furthermore, the precision of the mineral element content in the pulp was slightly higher than that in the peel, so the occurrence of the skin browning of ‘Huangguan’ pears can be more accurately predicted. The research in this paper provides a theoretical basis for the nondestructive prediction of the element content affecting the appearance quality of ‘Huangguan’ pears. The next stage is the validation of methods, which refers to the demonstration of the suitability of the method by analyzing random samples, to further enrich the modeling samples from different dimensions and to optimize the model, by combining it with artificial intelligence to implement the application of the research results in the field.

Author Contributions

Data curation, J.F. and L.L.; Formal analysis, X.J. and W.H.; Investigation, L.W. and Y.Z.; Methodology, X.J.; Project administration, B.J., W.H. and L.L.; Resources, Z.Y.; Supervision, B.J. and Z.Y.; Validation, L.W. and Y.Z.; Writing—original draft, J.F.; Writing—review & editing, X.J. and L.L. J.F. and X.J. contributed equally to this paper. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by the China Agriculture Research System of MOF and MARA, the Key research project of Natural Science in Colleges and Universities of the Anhui Provincial Department of Education in 2021 (KJ2021A0188), and the Anhui Province Fruit-Tree Industry Technology System (AHCYTX-10).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The datasets generated for this study are available on request to the corresponding author.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Mou, D.; Wang, X.; Guo, Y.; Dong, C. Introduction and labor-saving cultivation techniques of “Huangguan Pear” in Wuwei Oasis. J. North. Hortic. 2020, 7, 171–175. (In Chinese) [Google Scholar]
  2. Li, R.; Ma, L.; Yang, X.; Hu, H. Introduction and cultivation of Huangguan pear in Jingshan County, Hubei Province. J. Fruit Trees South. China 2017, 46, 159–160. (In Chinese) [Google Scholar]
  3. Wang, Q.; Wu, X.; Liu, L.; Yao, D.; Li, J.; Fang, J.; Chen, X.; Zhu, L.; Ye, Z.; Jia, B.; et al. Transcriptome and metabolomic analysis to reveal the browning spot formation of ‘Huangguan’ pear. BMC Plant Biol. 2021, 21, 1–18. [Google Scholar] [CrossRef] [PubMed]
  4. Sun, R. Causes of severe chicken feet disease in ‘Huangguan’ pear in Hebei Xinji in 2017 and suggestions for prevention and treatment. J. China Fruit Tree 2017, S1, 60–61. (In Chinese) [Google Scholar]
  5. Dong, Y.; Guan, J.; Ma, S.; Liu, L.; Feng, Y.; Cheng, Y. Calcium content and its correlated distribution with skin browning spot in bagged Huangguan pear. Protoplasma 2015, 252, 165–171. [Google Scholar] [CrossRef]
  6. Zhang, Y.; Wang, X.; Sun, L.; Xu, T.; Pang, H.; Zhai, Z.; Zhao, Y.; Xu, J.; Zhang, Y. Analysis of mineral elements in fruits of ‘Huangguan’ pear with chicken feet disease. J. Fruit Trees 2018, A01, 1. (In Chinese) [Google Scholar]
  7. Kou, X.; Wu, M.; Li, L.; Wang, S.; Xue, Z.; Liu, B.; Fei, Y. Effects of CaCl2 dipping and pullulan coating on the development of brown spot on ‘Huangguan’ pears during cold storage. Postharvest Biol. Technol. 2015, 99, 63–72. [Google Scholar] [CrossRef]
  8. Liu, Y.; Wang, Y.; Li, J.; Li, L. Responses of intracellular Ca2+ and its sensors to Venturia nashicola infection in pear (Pyrus bretschneideri) with differing resistance. Sci. Hortic. 2019, 243, 552–558. [Google Scholar] [CrossRef]
  9. Shen, C.; Li, Y.; Wang, J.; Shoffe, Y.A.; Dong, C.; Shen, Q.; Xu, Y. Potassium Influences Expression of Key Genes Involved in Sorbitol Metabolism and Its Assimilation in Pear Leaf and Fruit. J. Plant Growth Regul. 2018, 37, 883–895. [Google Scholar] [CrossRef]
  10. Hamida, S.; Ouabdesslam, L.; Ladjel, A.F.; Escudero, M.; Anzano, J. Determination of Cadmium, Copper, Lead, and Zinc in Pilchard Sardines from the Bay of Boumerdés by Atomic Absorption Spectrometry. Anal. Lett. 2018, 51, 2501–2508. [Google Scholar] [CrossRef]
  11. Rodríguez-Bermúdez, R.; Herrero-Latorre, C.; López-Alonso, M.; Losada, D.E.; Iglesias, R.; Miranda, M. Organic cattle products: Authenticating production origin by analysis of serum mineral content. Food Chem. 2018, 264, 210–217. [Google Scholar] [CrossRef] [PubMed]
  12. Zhang, J.; Yang, R.; Li, Y.; Wen, X.; Peng, Y.; Ni, X. Use of mineral multi--elemental analysis to authenticate geographical origin of different cultivars of tea in Guizhou, China. J. Sci. Food Agric. 2020, 100, 3046–3055. [Google Scholar] [CrossRef] [PubMed]
  13. Jamila, N.; Khan, N.; Hwang, I.; Nho, E.; Choi, J.; Atlas, A.; Khan, S.; Amin, F.; Javed, F.; Kim, K. Application of Phytochemical and Elemental Profiling, Chemometric Multivariate Analyses, and Biological Activities for Characterization and Discrimination of Fruits of Four Garcinia Species. Anal. Lett. 2020, 53, 122–139. [Google Scholar] [CrossRef]
  14. Shao, X.; Ning, Y.; Liu, F.; Li, J.; Cai, W. Application of Near-infrared Spectroscopy in Micro Inorganic Analysis. Acta Chim. Sin. 2012, 70, 2109–2114. [Google Scholar] [CrossRef]
  15. Minas, I.S.; Blanco-Cipollone, F.; Sterle, D. Accurate non-destructive prediction of peach fruit internal quality and physiological maturity with a single scan using near infrared spectroscopy. Food Chem. 2021, 335, 127626. [Google Scholar] [CrossRef]
  16. Torres, I.; Sánchez, M.-T.; De La Haba, M.-J.; Pérez-Marín, D. LOCAL regression applied to a citrus multispecies library to assess chemical quality parameters using near infrared spectroscopy. Spectrochim. Acta Part A Mol. Biomol. Spectrosc. 2019, 217, 206–214. [Google Scholar] [CrossRef]
  17. Alamar, P.D.; Caramês, E.T.S.; Poppi, R.J.; Pallone, J.A.L. Quality evaluation of frozen guava and yellow passion fruit pulps by NIR spectroscopy and chemometrics. Food Res. Int. 2016, 85, 209–214. [Google Scholar] [CrossRef]
  18. Li, J.; Sun, D.; Cheng, J. Recent advances in nondestructive analytical techniques for determining the total soluble solids in fruits: A review. Compr. Rev. Food Sci. Food Saf. 2016, 15, 897–911. [Google Scholar] [CrossRef]
  19. Zhang, B.; Zhang, M.; Shen, M.; Li, H.; Zhang, Z.; Zhang, H.; Zhou, Z.; Ren, X.; Ding, Y.; Xing, L.; et al. Quality monitoring method for apples of different maturity under long-term cold storage. Infrared Phys. Technol. 2021, 112, 103580. [Google Scholar] [CrossRef]
  20. Li, J.; Chen, L.; Huang, W.; Wang, Q.; Zhang, B.; Tian, X.; Fan, S.; Li, B. Multispectral detection of skin defects of bi-colored peaches based on vis–NIR hyperspectral imaging. Postharvest Biol. Technol. 2016, 112, 121–133. [Google Scholar] [CrossRef]
  21. Lu, Z.; Lu, R.; Chen, Y.; Fu, K.; Song, J.; Xie, L.; Zhai, R.; Wang, Z.; Yang, C.; Xu, L. Nondestructive Testing of Pear Based on Fourier Near-Infrared Spectroscopy. Foods 2022, 11, 1076. [Google Scholar] [CrossRef] [PubMed]
  22. Li, J.; Huang, W.; Chen, L.; Fan, S.; Zhang, B.; Guo, Z.; Zhao, C. Variable Selection in Visible and Near-Infrared Spectral Analysis for Noninvasive Determination of Soluble Solids Content of ‘Ya’ Pear. Food Anal. Methods 2014, 7, 1891–1902. [Google Scholar] [CrossRef]
  23. Kämper, W.; Trueman, S.; Tahmasbian, I.; Bai, S. Rapid Determination of Nutrient Concentrations in Hass Avocado Fruit by Vis/NIR Hyperspectral Imaging of Flesh or Skin. Remote Sens. 2020, 12, 3409. [Google Scholar] [CrossRef]
  24. Cozzolino, D.; Cynkar, W.; Shah, N.; Smith, P. Quantitative analysis of minerals and electric conductivity of red grape homogenates by near infrared reflectance spectros-copy. J. Comput. Electron. Agric. 2011, 77, 81–85. [Google Scholar] [CrossRef]
  25. Wang, J.; Shi, X.; Xu, Y.; Dong, C. Nitrogen Management Based on Visible/Near Infrared Spectroscopy in Pear Orchards. Remote Sens. 2021, 13, 927. [Google Scholar] [CrossRef]
  26. Martínez-Valdivieso, D.; Font, R.; Gómez, P.; Blanco-Díaz, T.; Del Río-Celestino, M. Determining the mineral composition in Cucurbita pepo fruit using near infrared reflectance spectroscopy. J. Sci. Food Agric. 2014, 94, 3171–3180. [Google Scholar] [CrossRef]
  27. Rossa, Ü.B.; Angelo, A.C.; Nisgoski, S.; Westphalen, D.J.; Frizon, C.N.T.; Hoffmann-Ribani, R. Application of the NIR method to determine nutrients in yerba mate (Ilex paraguariensis A. St.-Hill) leaves. J. Commun. Soil Sci. Plant Anal. 2015, 46, 2323–2331. [Google Scholar] [CrossRef]
  28. Xu, L.; Zhou, Y.-P.; Tang, L.-J.; Wu, H.-L.; Jiang, J.-H.; Shen, G.-L.; Yu, R.-Q. Ensemble preprocessing of near-infrared (NIR) spectra for multivariate calibration. Anal. Chim. Acta 2008, 616, 138–143. [Google Scholar] [CrossRef]
  29. Holland, J.H. Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence; MIT Press: Cambridge, MA, USA, 1992. [Google Scholar]
  30. Shinzawa, H.; Li, B.Y.; Nakagawa, T.; Takehiro, N.; Yukihiro, O. Multi-objective genetic algorithm-based sample selection for partial least squares model building with applications to near-infrared spectroscopic data. Appl. Spectrosc. 2006, 60, 631–640. [Google Scholar] [CrossRef]
  31. Filzmoser, P.; Gschwandtner, M.; Todorov, V. Review of sparse methods in regression and classification application to chemometrics. J. Chemom. 2012, 26, 42–51. [Google Scholar] [CrossRef]
  32. Hair, J.F.; Ringle, C.M.; Sarstedt, M. Partial Least Squares Structural Equation Modeling: Rigorous Applications, Better Results and Higher Acceptance. Long Range Plan. 2013, 46, 1–12. [Google Scholar] [CrossRef]
  33. Friedman, J.H. Greedy function approximation: A gradient boosting machine. Ann. Stat. 2001, 29, 1189–1232. [Google Scholar] [CrossRef]
  34. Chang, C.-W.; Laird, D.A.; Mausbach, M.J.; Hurburgh, C.R. Near-infrared reflectance spectroscopy–principal components regression analyses of soil properties. Soil Sci. Soc. Am. J. 2001, 65, 480. [Google Scholar] [CrossRef] [Green Version]
  35. Qi, S.; Oshita, S.; Makino, Y.; Han, D. Influence of Sampling Component on Determination of Soluble Solids Content of Fuji Apple Using Near-Infrared Spectroscopy. Appl. Spectrosc. 2016, 71, 856–865. [Google Scholar] [CrossRef] [PubMed]
  36. Li, X.; Wang, T.; Zhou, B.; Gao, W.; Cao, J.; Huang, L. Chemical composition and antioxidant and anti-inflammatory potential of peels and flesh from 10 different pear varieties (Pyrus spp.). Food Chem. 2013, 152, 531–538. [Google Scholar] [CrossRef]
  37. Lin, L.-Z.; Harnly, J.M. Phenolic compounds and chromatographic profiles of pear skins (Pyrus spp.). J. Agric. Food Chem. 2008, 56, 9094–9101. [Google Scholar] [CrossRef]
  38. Zhu, X.; Wei, J.; Qi, N. Research status and prospect of Jizhua disease in Huangguan pear. Hebei Agric. Sci. 2009, 13, 26–28. [Google Scholar]
  39. Ren, G.X.; Wei, Z.Q.; Fan, P.P.; Wang, X.Y. Visible/near infrared spectroscopy method applied research in wetland soil nutrients rapid test. IOP Conf. Ser. Earth Environ. Sci. 2019, 344, 012123. [Google Scholar] [CrossRef]
  40. Shu, X.M.; Shi, T.Z.; Liu, Y.; Peng, H.W.; Gao, W.X.; Zhang, C. Estimating soil total nitrogen contents from visible/near-infrared reflectance based on successive projections algorithm. IOP Conf. Ser. Earth Environ. Sci. 2014, 17, 012046. [Google Scholar] [CrossRef] [Green Version]
  41. Demir, S.; Sahin, E.K. An investigation of feature selection methods for soil liquefaction prediction based on tree-based ensemble algorithms using AdaBoost, gradient boosting, and XGBoost. Neural Comput. Appl. 2022, 154, 107–130. [Google Scholar] [CrossRef]
Figure 1. ‘NIR-S-G1′ miniature spectrum acquisition instrument.
Figure 1. ‘NIR-S-G1′ miniature spectrum acquisition instrument.
Foods 11 03642 g001
Figure 2. Schematic diagram showing the acquisition points for spectral sampling.
Figure 2. Schematic diagram showing the acquisition points for spectral sampling.
Foods 11 03642 g002
Figure 3. Original spectral reflectance map (a) and average spectral reflectance map (b) of all tested ‘Huangguan’ pears.
Figure 3. Original spectral reflectance map (a) and average spectral reflectance map (b) of all tested ‘Huangguan’ pears.
Foods 11 03642 g003
Figure 4. Spectral graphs after transformation based on different preprocessing methods. (a) FD, (b) SD, (c) MSC, (d) SNV, (e) SG, (f) LG, (g) SG+MSC, (h) SG+SNV, (i) SG+MSC+FD, (j) SG+MSC+SD, (k) SG+SNV+FD, (l) SG+SNV+SD.
Figure 4. Spectral graphs after transformation based on different preprocessing methods. (a) FD, (b) SD, (c) MSC, (d) SNV, (e) SG, (f) LG, (g) SG+MSC, (h) SG+SNV, (i) SG+MSC+FD, (j) SG+MSC+SD, (k) SG+SNV+FD, (l) SG+SNV+SD.
Foods 11 03642 g004
Figure 5. Curves fitted for prediction of mineral elements in pear pulp and peel. (a) Distribution of calcium content in pear pulp; (b) distribution of boron content in pear pulp; (c) distribution of potassium content in pear pulp; (d) distribution of calcium content in pear peel; (e) distribution of boron content in pear peel; (f) distribution of potassium content in pear peel.
Figure 5. Curves fitted for prediction of mineral elements in pear pulp and peel. (a) Distribution of calcium content in pear pulp; (b) distribution of boron content in pear pulp; (c) distribution of potassium content in pear pulp; (d) distribution of calcium content in pear peel; (e) distribution of boron content in pear peel; (f) distribution of potassium content in pear peel.
Foods 11 03642 g005
Figure 6. Characteristic wavelengths’ distribution of prediction models for the content of calcium in pear pulp. (a) Raw-GA-PLSR; (b) SG-GA-PLSR.
Figure 6. Characteristic wavelengths’ distribution of prediction models for the content of calcium in pear pulp. (a) Raw-GA-PLSR; (b) SG-GA-PLSR.
Foods 11 03642 g006
Figure 7. Scatter plots of models’ prediction results of the content of calcium in pear pulp samples. (a) Raw-GA-PLSR; (b) SG-GA-PLSR.
Figure 7. Scatter plots of models’ prediction results of the content of calcium in pear pulp samples. (a) Raw-GA-PLSR; (b) SG-GA-PLSR.
Foods 11 03642 g007
Figure 8. Characteristic wavelengths’ distribution of prediction models for the content of boron in pear pulp. (a) Raw-GA-PLSR; (b) SG-GA-PLSR.
Figure 8. Characteristic wavelengths’ distribution of prediction models for the content of boron in pear pulp. (a) Raw-GA-PLSR; (b) SG-GA-PLSR.
Foods 11 03642 g008
Figure 9. Scatter plots of models’ prediction results of the content of boron in pear pulp samples. (a) Raw-GA-PLSR; (b) SG-GA-PLSR.
Figure 9. Scatter plots of models’ prediction results of the content of boron in pear pulp samples. (a) Raw-GA-PLSR; (b) SG-GA-PLSR.
Foods 11 03642 g009
Figure 10. Characteristic wavelengths’ distribution of prediction models for the content of potassium in pear pulp. (a) Raw-GA-PLSR; (b) SG-GA-PLSR.
Figure 10. Characteristic wavelengths’ distribution of prediction models for the content of potassium in pear pulp. (a) Raw-GA-PLSR; (b) SG-GA-PLSR.
Foods 11 03642 g010
Figure 11. Scatter plots of models’ prediction results of the content of potassium in pear pulp samples. (a) Raw-GA-PLSR; (b) SG-GA-PLSR.
Figure 11. Scatter plots of models’ prediction results of the content of potassium in pear pulp samples. (a) Raw-GA-PLSR; (b) SG-GA-PLSR.
Foods 11 03642 g011
Figure 12. Characteristic wavelengths’ distribution of prediction models for the content of calcium in pear peel. (a) SNV-GA-PLSR; (b) MSC-GA-PLSR; (c) SG-GA-PLSR; (d) LG-GA-PLSR; (e) SG+MSC-GA-PLSR; (f) SG+SNV-GA-PLSR.
Figure 12. Characteristic wavelengths’ distribution of prediction models for the content of calcium in pear peel. (a) SNV-GA-PLSR; (b) MSC-GA-PLSR; (c) SG-GA-PLSR; (d) LG-GA-PLSR; (e) SG+MSC-GA-PLSR; (f) SG+SNV-GA-PLSR.
Foods 11 03642 g012
Figure 13. Scatter plots of models’ prediction results of the content of calcium in pear peel samples. (a) SNV-GA-PLSR; (b) MSC-GA-PLSR; (c) SG-GA-PLSR; (d) LG-GA-PLSR; (e) SG+MSC-GA-PLSR; (f) SG+SNV-GA-PLSR.
Figure 13. Scatter plots of models’ prediction results of the content of calcium in pear peel samples. (a) SNV-GA-PLSR; (b) MSC-GA-PLSR; (c) SG-GA-PLSR; (d) LG-GA-PLSR; (e) SG+MSC-GA-PLSR; (f) SG+SNV-GA-PLSR.
Foods 11 03642 g013
Figure 14. Characteristic wavelengths’ distribution of prediction models for the content of boron in pear peel. (a) SNV-GA-PLSR; (b) MSC-GA-PLSR; (c) SG+MSC-GA-PLSR; (d) SG+SNV-GA-PLSR.
Figure 14. Characteristic wavelengths’ distribution of prediction models for the content of boron in pear peel. (a) SNV-GA-PLSR; (b) MSC-GA-PLSR; (c) SG+MSC-GA-PLSR; (d) SG+SNV-GA-PLSR.
Foods 11 03642 g014
Figure 15. Scatter plots of models’ prediction results of the content of boron in pear peel samples. (a) SNV-GA-PLSR; (b) MSC-GA-PLSR; (c) SG+MSC-GA-PLSR; (d) SG+SNV-GA-PLSR.
Figure 15. Scatter plots of models’ prediction results of the content of boron in pear peel samples. (a) SNV-GA-PLSR; (b) MSC-GA-PLSR; (c) SG+MSC-GA-PLSR; (d) SG+SNV-GA-PLSR.
Foods 11 03642 g015
Figure 16. Characteristic wavelengths’ distribution of prediction models for the content of potassium in pear peel. (a) SNV-GA-PLSR; (b) SG+SNV-GA-PLSR.
Figure 16. Characteristic wavelengths’ distribution of prediction models for the content of potassium in pear peel. (a) SNV-GA-PLSR; (b) SG+SNV-GA-PLSR.
Foods 11 03642 g016
Figure 17. Scatter plots of models’ prediction results of the content of potassium in pear peel samples. (a) SNV-GA-PLSR; (b) SG+SNV-GA-PLSR.
Figure 17. Scatter plots of models’ prediction results of the content of potassium in pear peel samples. (a) SNV-GA-PLSR; (b) SG+SNV-GA-PLSR.
Foods 11 03642 g017
Table 1. Overview of mineral elements’ content detection.
Table 1. Overview of mineral elements’ content detection.
Calcium in PulpBoron in PulpPotassium in PulpCalcium in PeelBoron in PeelPotassium in Peel
Maximum (mg/kg)40.57.86523081324.8521.67510,073
Minimum (mg/kg)142.1151365.8281.156.1752691.85
Average (mg/kg)26.6873.9951783.7653.4410.6465323.39
Table 2. Comparison of the prediction and modelling sets for the content of calcium, boron and potassium in pear pulp based on full-wave band.
Table 2. Comparison of the prediction and modelling sets for the content of calcium, boron and potassium in pear pulp based on full-wave band.
ModelsPrediction Results for the Content of Calcium in Pear PulpPrediction Results for the Content of Boron in Pear PulpPrediction Results for the Content of Potassium in Pear Pulp
R2 of Modelling SetRPD of Modelling SetR2 of Prediction SetRPD of Modelling SetR2 of Modelling SetRPD of Modelling SetR2 of Prediction SetRPD of Prediction SetR2 of Modelling SetRPD of Modelling SetR2 of Prediction SetRPD of Prediction Set
Raw-PLSR0.922.5740.7161.4310.9042.3440.7341.4720.861.970.7671.557
SNV-PLSR0.9322.7680.6841.3710.5161.1670.4951.1510.974.4520.3821.082
FD-PLSR0.761.5350.3481.0660.2071.0220.2121.0230.997.0890.1411.01
MSC-PLSR0.9362.8320.6511.3170.5681.2150.4921.1490.997.0890.1411.01
SD-PLSR0.6941.3880.3041.050.7121.424--0.974.4520.3821.082
SG-PLSR0.9212.5740.7171.4330.9042.3440.7351.4750.861.9730.7671.558
LG-PLSR0.9917.3760.5821.230.8712.0360.4971.1520.861.9750.6581.328
SG+MSC-PLSR0.9362.8320.6511.3170.5681.2150.4921.1490.997.0890.1411.01
SG+SNV-PLSR0.9322.7680.6841.3710.5161.1670.4951.1511-0.311.053
SG+MSC+FD-PLSR0.7051.4090.3381.0630.1871.0180.2161.0240.984.9460.621.275
SG+MSC+SD-PLSR0.6761.3560.3011.0480.7051.41--0.741.4870.4321.109
SG+SNV+FD-PLSR0.7071.4150.3461.0660.1911.0190.2181.0250.984.9610.6251.282
SG+SNV+SD-PLSR0.6781.360.2991.0480.7051.411--0.984.9610.6251.282
Raw-GBRT0.9825.3180.5111.1630.5621.2090.2781.0380.0121--
SNV-GBRT0.9794.938-00.9754.4770.4481.1190.0141--
FD-GBRT0.997.0890.4111.0970.9897.0630.1311.0090.0171--
MSC-GBRT0.4191.101-00.9082.3920.5391.1870.0151--
SD-GBRT0.9613.617-00.99510.280.4831.1420.9997.0910.051.001
SG-GBRT0.9825.3180.5121.1640.5621.2090.2681.0380.99815.8190.6091.261
LG-GBRT0.9563.4260.4411.1140.4321.1090.2641.0370.9794.850.5471.195
SG+MSC-GBRT0.0161-00.9082.3920.5391.1870.0151--
SG+SNV-GBRT0.9794.938-00.9754.4770.4481.1190.0141--
SG+MSC+FD-GBRT0.9835.3850.61.2440.9493.1650.4361.1110.0161--
SG+MSC+SD-GBRT0.9997.089-00.8581.9480.2761.040.9896.7050.2931.046
SG+SNV+FD-GBRT0.9997.090.3621.0730.9876.1930.2221.0260.0161--
SG+SNV+SD-GBRT0.99610.659-00.8581.9480.2761.040.9896.7050.2931.046
Note: “-” means the value here is invalid, the same below.
Table 3. Modelling prediction results of the content of calcium in pear pulp based on the selected characteristic wavelengths by using GA.
Table 3. Modelling prediction results of the content of calcium in pear pulp based on the selected characteristic wavelengths by using GA.
ModelsModelling SetPrediction SetLevel of Models
R2RPDR2RPD
Raw-GA-PLSR0.8982.2710.9876.110A
SG-GA-PLSR0.8952.2460.9734.325A
Table 4. Modelling prediction results of the content of boron in pear pulp based on the selected characteristic wavelengths.
Table 4. Modelling prediction results of the content of boron in pear pulp based on the selected characteristic wavelengths.
Methods of Extracting Characteristic WavelengthsModelsModelling SetPrediction SetLevel of Models
R2RPDR2RPD
GARaw-GA-PLSR0.7681.5600.9845.560A
SG-GA-PLSR0.8461.8780.9866.033A
Table 5. Modelling prediction results of the content of potassium in pear pulp based on the selected characteristic wavelengths.
Table 5. Modelling prediction results of the content of potassium in pear pulp based on the selected characteristic wavelengths.
Methods of Extracting Characteristic WavelengthsModelsModelling SetPrediction SetLevel of Models
R2RPDR2RPD
GARaw-GA-PLSR0.8231.7600.9784.756A
SG-GA-PLSR0.8341.8100.9855.833A
Table 6. Comparison of the prediction and modelling sets for the content of calcium, boron and potassium in pear peel based on full-wave band.
Table 6. Comparison of the prediction and modelling sets for the content of calcium, boron and potassium in pear peel based on full-wave band.
ModelsPrediction Results for the Content of Calcium in Pear PeelPrediction Results for the Content of Boron in Pear PeelPrediction Results for the Content of Potassium in Pear Peel
R2 of Modelling SetRPD of Modelling SetR2 of Prediction SetRPD of Modelling SetR2 of Modelling SetRPD of Modelling SetR2 of Prediction SetRPD of Prediction SetR2 of Modelling SetRPD of Modelling SetR2 of Prediction SetRPD of Prediction Set
Raw-PLSR0.811.7050.7081.4160.8291.790.1861.0180.0971.0050.121.007
SNV-PLSR0.761.530.8111.7110.8972.2570.7071.4140.8952.2410.7561.529
FD-PLSR0.151.0110.4561.1240.99611.1540.1551.0120.5121.1640.2741.04
MSC-PLSR0.781.6030.781.5990.9422.9890.7011.4020.7181.4360.4721.135
SD-PLSR0.611.2580.121.0070.6651.3380.0381.0010.6691.3450.0951.005
SG-PLSR0.912.4580.8081.6960.8291.790.1861.0180.0971.0050.121.007
LG-PLSR0.963.4370.7961.6510.8571.940.2211.0250.5211.1710.1171.007
SG+MSC-PLSR0.781.6030.781.5990.9422.9890.7011.4020.7181.4360.4721.135
SG+SNV-PLSR0.761.530.8121.7130.8972.2570.7091.4150.8952.2410.7571.531
SG+MSC+FD-PLSR0.141.010.4211.1030.8331.8080.1451.0110.8251.7710.0361.001
SG+MSC+SD-PLSR0.61.2480.1041.0050.8481.8850.00610.9997.0890.0641.002
SG+SNV+FD-PLSR0.141.010.4211.1020.8491.8950.1481.0110.5821.230.0411.001
SG+SNV+SD-PLSR0.61.2480.1041.0050.8491.8950.1481.0110.9997.0890.0641.002
Raw-GBRT0.997.160.4231.1040.541.1880.0751.0030.8842.1430.241.03
SNV-GBRT0.9915.8020.4311.2950.1031.005−0.05-0.2581.035−0.007-
FD-GBRT0.997.0890.4991.1540.8271.7770.3451.0650.7111.4220.0651.002
MSC-GBRT0.997.0890.5121.1640.0141−0.007-0.4111.097−0.057-
SD-GBRT0.932.6910.4111.0970.9917.5320.2361.0290.0631.002−0.037-
SG-GBRT0.997.0890.4231.1040.541.1880.0751.0030.8842.1430.241.03
LG-GBRT0.997.0890.481.140.8261.7760.0391.0010.8842.1430.2451.031
SG+MSC-GBRT0.997.0890.5121.1640.0141−0.007-0.0731.003−0.023-
SG+SNV-GBRT0.997.0890.4311.1080.1031.005−0.05-0.2581.035−0.007-
SG+MSC+FD-GBRT0.9918.1010.5091.1620.9312.7470.0711.0030.9132.4470.2631.037
SG+MSC+SD-GBRT0.997.0890.3511.0680.9714.1790.2321.0280.9292.693−0.006-
SG+SNV+FD-GBRT0.997.0890.4981.1530.99815.8190.1351.0090.6811.3660.1681.014
SG+SNV+SD-GBRT0.997.0890.3671.1130.9152.4830.1671.0140.2051.022−0.011-
Table 7. Modelling prediction results of the content of calcium in pear peel based on the selected characteristic wavelengths.
Table 7. Modelling prediction results of the content of calcium in pear peel based on the selected characteristic wavelengths.
Methods of Extracting Characteristic WavelengthsModelsModelling SetPredicting SetLevel of Models
R2RPDR2RPD
GASNV-GA-PLSR0.731.4540.9896.837A
MSC-GA-PLSR0.801.6750.9928.119A
SG-GA-PLSR0.872.0540.9896.672A
LG-GA-PLSR0.932.6710.9896.951A
SG+MSC-GA-PLSR0.791.6540.9917.282A
SG+SNV-GA-PLSR0.741.4810.9896.858A
Table 8. Modelling prediction results of the content of boron in pear peel based on the selected characteristic wavelengths.
Table 8. Modelling prediction results of the content of boron in pear peel based on the selected characteristic wavelengths.
Methods of Extracting Characteristic WavelengthsModelsModelling SetPredicting SetLevel of Models
R2RPDR2RPD
GASNV-GA-PLSR0.8551.9260.9734.301A
MSC-GA-PLSR0.8601.9620.9734.344A
SG+MSC-GA-PLSR0.8702.0270.9845.598A
SG+SNV-GA-PLSR0.8591.9530.9866.077A
Table 9. Modelling prediction results of the content of potassium in pear peel based on the selected characteristic wavelengths.
Table 9. Modelling prediction results of the content of potassium in pear peel based on the selected characteristic wavelengths.
Methods of Extracting Characteristic WavelengthsModelsModelling SetPrediction SetLevel of Models
R2RPDR2RPD
GASNV-GA-PLSR0.8732.0520.9714.148A
SG+SNV-GA-PLSR0.8882.1750.9734.294A
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Fang, J.; Jin, X.; Wu, L.; Zhang, Y.; Jia, B.; Ye, Z.; Heng, W.; Liu, L. Prediction Models for the Content of Calcium, Boron and Potassium in the Fruit of ‘Huangguan’ Pears Established by Using Near-Infrared Spectroscopy. Foods 2022, 11, 3642. https://doi.org/10.3390/foods11223642

AMA Style

Fang J, Jin X, Wu L, Zhang Y, Jia B, Ye Z, Heng W, Liu L. Prediction Models for the Content of Calcium, Boron and Potassium in the Fruit of ‘Huangguan’ Pears Established by Using Near-Infrared Spectroscopy. Foods. 2022; 11(22):3642. https://doi.org/10.3390/foods11223642

Chicago/Turabian Style

Fang, Jing, Xiu Jin, Lin Wu, Yuxin Zhang, Bing Jia, Zhenfeng Ye, Wei Heng, and Li Liu. 2022. "Prediction Models for the Content of Calcium, Boron and Potassium in the Fruit of ‘Huangguan’ Pears Established by Using Near-Infrared Spectroscopy" Foods 11, no. 22: 3642. https://doi.org/10.3390/foods11223642

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop