Next Article in Journal
Production Capacity Evaluation of Farmland Using Long Time Series of Remote Sensing Images
Previous Article in Journal
Impact of Off-Farm Employment on Farmland Transfer: Insight on the Mediating Role of Agricultural Production Service Outsourcing
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Near-Infrared Model and Its Robustness as Affected by Fruit Origin for ‘Dangshan’ Pear Soluble Solids Content and pH Measurement

1
National Engineering Research Center for Agro-Ecological Big Data Analysis & Application, Anhui University, Hefei 230601, China
2
School of Informatics and Engineering, Suzhou University, Suzhou 234000, China
3
Beijing Research Center of Intelligent Equipment for Agriculture, Beijing 100097, China
4
Anhui Rural Comprehensive Economic Information Center, Hefei 230031, China
5
Texas A&M AgriLife Research Center, Beaumont, TX 77713, USA
*
Author to whom correspondence should be addressed.
Agriculture 2022, 12(10), 1618; https://doi.org/10.3390/agriculture12101618
Submission received: 24 August 2022 / Revised: 29 September 2022 / Accepted: 30 September 2022 / Published: 5 October 2022
(This article belongs to the Section Agricultural Product Quality and Safety)

Abstract

:
Soluble solid content (SSC) and acidity (pH) are two important factors indicating the fruit quality of pears and can be measured by near-infrared spectroscopy (NIRS). However, the robustness of these measurements as affected by different origins of pears remains largely unknown. In this study, we developed an NIRS method to measure ‘Dangshan’ pear (Pyrus spp.) SSC and pH and evaluated the robustness of this non-destructive detection method by examining the effects of pears from three different origins in 2019 and 2020. First, the Kennard–Stone method was used to divide the calibration set of the 2020 pear samples from different orchards. The partial least squares (PLS) model was used to establish the local origin and hybrid origin models to predict the pears’ SSC and pH. Second, a combination of competitive adaptive reweighted sampling (CARS), successive projections algorithm (SPA), and uninformative variable elimination (UVE) was implemented to construct spectral prediction models based on effective variables for assessing the pears’ SSC and pH from local and hybrid origins. The results showed that the local origin detection model produced large errors in predicting the SSC and pH of pears from different origins, and the model, established based on the pear samples of three origins, performed better than the local origin and other hybrid origin models. Finally, the model could be effectively simplified using 70 and 52 characteristic variables selected by the CARS method. Pear samples harvested from three different orchards in 2019 were used as an independent set to verify the validity of the selected characteristic variables. The results showed that the predicted R2p for the SSC and pH measurements of pears of three different origins were more than 0.9 and 0.85, respectively. This finding indicates that the difference in the origin of pears has an important influence on the quantitative inversion of pear SSC and pH measurements, and the combination of the hybrid origin model constructed based on the characteristic variables can improve the prediction accuracy. These findings provide an important theoretical basis for the development of rapid detection devices for the measurements of pears’ SSC and pH.

1. Introduction

Pears are one of the most widely consumed fruits in the world [1]. They are crispy and juicy and have a good combination of sweetness and sourness. Pears are rich in malic acid, citric acid, glucose, fructose, calcium, phosphorus, iron, vitamins, and other nutrients. China is ranked number one in global pear production, with approximately 1731 tons of pears harvested in 2019 [2]. However, the technologies used for pear quality testing and grade classification in China are evidently lagging compared to most developed countries. With an increasing demand for high-quality products in the fresh fruit market, one of the main goals of the current fruit industry is to provide the market with products that are not only good-looking but also have a high-quality taste [3]. Fruit acidity (pH) and soluble solids content (SSC) are two important factors that affect the taste and quality of pears and directly influence consumers’ desire to consume pears. Most traditional methods for fruit pH and SCC measurements are destructive, inefficient, and time-consuming [4]. Therefore, developing non-destructive, efficient, and convenient assessment methods for the fruit pH and SSC can help further promote the development of China’s pear fruit industry.
The use of hyperspectral imaging (HSI) to detect the internal quality of fruit has been developed very rapidly in recent years. The HSI has the advantage of combining traditional imaging and spectroscopic techniques. Peng and Lu [5] used hyperspectral scattering for non-destructive detection of various quality attributes of apples. They evaluated and compared different mathematical models to describe hyperspectral scattering profiles in the 450–1000 nm spectral region, and successfully selected the best model to predict apple hardness and SSC. Li and Chen [6] examined hyperspectral imaging in the visible and near-infrared (NIR) spectral range (400–1000 nm) to determine each SSC non-invasively. They developed a more robust and accurate model based on multi-regional information. Fan et al. [7] studied the SSC and hardness of pears using hyperspectral imaging and constructed a partial least squares (PLS) prediction model based on the CARS-SPA algorithm to select the characteristic variables. The correlation coefficient (Rpre) of 0.876 for SSC and 0.867 for hardness demonstrated that the hyperspectral imaging system and the CARS-SPA-PLS model can be used as rapid and potential methods for the SSC and hardness determination of pears. However, hyperspectral imaging techniques contain large hyperspectral data cubes that require much higher hardware speed to accommodate data acquisition and analysis [8]. Relatively, the non-imaging spectral data selected for the current study requires lower CPU, memory, and display functions that facilitate processing [9].
Near-infrared spectroscopy (NIR) is a low-cost optical detection technique with the advantages of non-destructiveness, rapidity, precision, control, accuracy, repeatability, and reproducibility [10]. Its main principle is to analyze the spectral responses of different chemical bonds in organic matter in the near-infrared interval of the electromagnetic spectrum. The organic matter in the sample has different spectral fingerprints, such as C-H, N-H, and especially O-H, due to the different specific vibrational frequencies of the chemical bonds [11]. The wealth of information absorbed by these chemical bond vibrations helps in the construction of multi-variate calibration models for food quality analysis [12]. In recent years, NIR technology has been widely used in the non-destructive testing of pears. Sun et al. [13] established a non-destructive detection method for the determination of the internal blackheart and SSC of pears and demonstrated that the online detection of internal blackheart and SSC of pears using non-imaging NIR spectroscopy was feasible. Wang et al. [14] used a portable non-imaging visual NIR spectrometer (model K-BA100R; Kubota Co., Osaka, Japan) to demonstrate the feasibility of a multi-species model for the in-situ determination of the internal quality of European pears. The MLR SSC prediction model was developed with the best R2 of 0.87 and an RMSEP of 0.45. Sun et al. [15] used non-imaging Vis/NIR spectroscopy to accurately predict the SSC of “Cuiguan” pear online. The developed SSC-PLS calibration model had a coefficient of determination (R2) and a root mean square error of prediction (RMSEP) of 0.916 and 0.530, respectively.
Non-imaging spectrometers can obtain large amounts of spectral data that suffer from severe covariance problems and inevitably contain interference and useless information. If the Partial least squares (PLS) model is built based on the full wavelength, it will not only increase the complexity of the model, but even decrease the prediction performance of the model [16]. Therefore, it becomes necessary to screen the effective feature wavelengths. Travers et al. [17] developed regression models for measuring pre-harvest dry matter and SSC in the spectral ranges of 680–1000 nm and 1100–2350 nm for pears. They demonstrated that both spectra in the two band intervals were good detectors of dry matter and SSC. They also found that the internal quality detection model of pears using the spectral range of 1100–2350 nm performed better. Xu et al. [18] compared four wavelength variable screening methods, and the results showed that the accuracy of vis-NIR spectroscopy for quantitative analysis of pear saccharinity could be improved by selecting the appropriate wavelength. Li et al. [19] used visible-NIR spectroscopy combined with PLS, LV-LS-SVM, and EW-LS-SVM models to successfully determine pear SSC, pH, and hardness; the EW-LS-SVM model has better robustness and reliability. PLS and LS-SVM models constructed in the 400–1800 nm spectral range were found to be more suitable for SSC, pH, and hardness prediction than those in the 400–780 nm and 781–1800 nm spectral ranges.
In these studies, each of the testing models or methods has unique advantages and characteristics and achieves expected testing results. However, there are at least four limitations to these studies. First, the samples used in these studies were mostly purchased from the fruit market, and thus the picking time and storage time are unclear. The storage time affects the internal quality detection results of pears. Second, the origin or source of the fruit samples used in those studies is unclear, and almost none of the studies involve hybrid pear varieties, an important origin of pears. Third, most of the studies examine pear samples collected from a single year, and no research has been conducted on pear samples obtained from multiple years. Fourth, most studies lack independent validation or robustness verification. The introduction of multiple years or batches of data is crucial for validating the model’s robustness [20]. Model verification should be performed to examine the stability and robustness of the models.
The objectives of this research are to (1) test whether NIRS has a good ability to predict the SSC and pH of ‘Dangshan’ pears (cv. Dangshan), (2) identify the effective characteristic wavelength variables for predicting pear SSC and pH, (3) develop models and evaluate their performance for predicting the SSC and pH in local and hybrid pear origins by using full and effective variables, and (4) test the stability and accuracy of the models using independent validation pear sample sets from different years.

2. Materials and Methods

2.1. Experimental Pear Samples

Sample group 1: Production management practice and location significantly affected the yield and quality of pear. To increase the diversity of pear internal quality on pH and SSC, 496 pear samples without visual defects (e.g., scars, cuts, and wrinkles) were collected from three different orchards with different farming management practices (Table 1). The samples were obtained in September 2020 from Tangzhai (TZ), Lishuangwang (LSW), and Yihao (YH) orchards in Dangshan County, Suzhou City, Anhui Province, China (Figure 1).
Sample group 2: A total of 450 pear samples were collected in 2019 from Lishuwang 1 (LSW1), Huanghegudao Farm (HHGD), and Lishuwang 2 (LSW2) orchards. These pear samples were used as an independent validation data set to test the robustness of the proposed model that detected the internal quality (pH and SSC) of pears.
All pear samples were cleaned and placed at a constant temperature (20 °C) for 24 h before they were used to reduce the effect of temperature on the accuracy of the pH and SSC prediction models [21]. Three areas with a diameter of 3 cm were marked as measurement points near the equator of each pear by taking a point at an interval of 120 degrees.

2.2. Spectral Data Acquisition

Spectral reflectance data for all the pear samples in this study were obtained using measurements with a PSR+3500 non-imaging portable spectrometer (Spectral Evolution, Haverhill, MA, USA) equipped with a 25° viewing angle fiber. This spectrometer has a spectral measurement range of 350–2500 nm and a spectral resolution between 3 and 8 nm. All spectral data acquisition was performed in a dark room to reduce the influence of external light on the observed spectra. In this case, the illumination system, which consisted of two 350 W halogen lamps, was placed at an angle of 45° on either side of the console and 40 cm from the carrier table to ensure light coverage of the entire pear. During the spectroscopic measurements, the fiber optic probe was fixed at the appropriate position, and the pear samples were placed 7 cm below the lens to ensure that the measurement environment was consistent for each sample. This distance was determined by Equation (1), as follows:
H = tan α R
where H is the distance from the fiber optic lens to the target nucleus, α is the fiber optic field of view angle, and R is the field of view radius. The spectral acquisition system is shown in Figure 2.
Calibration set and prediction set partitioning using the Kennard–Stone method can remarkably improve the accuracy of model predictions; it is widely used in the field of qualitative analysis of spectral data [22]. In the present study, the Kennard–Stone method was used to divide the calibration and prediction sets of pear samples from different production areas at a ratio of 3 to 1.

2.3. Measurement of pH and SSC

After the spectral data collection was completed, the pH and SSC of ‘Dangshan’ pears were measured using a portable pH tester (Testo 206 PH1) and a digital display saccharimeter (CHT65), respectively. A 10-mm skinned pulp from the equatorially marked part of the collection point was taken and filtered by manually pressing on gauze. Then, half of the pear juice was poured into the sample slot of the digital display saccharimeter, and a measured SSC value was recorded. A portable pH tester probe was inserted into the test tube containing the remaining pear juice, and the measured pH value was recorded.

2.4. Data Analysis

2.4.1. Spectral Pre-Processing

During the spectral data acquisition, disturbances, such as external stray light, sample background, electronic sources, and instrument performance, can cause baseline translation and rotation as well as spectral scattering [23]. The original reflectance spectra should be pre-processed to eliminate or attenuate the effects of non-target factors, and thus build a stable detection model. The original reflectance spectral data were pre-processed using multiple scattering correction (MSC) [24], standard normal variables (SNV) [25], Suavity–Golay smoothing [26], first-order derivatives, second-order derivatives [27], and maximum normalization.

2.4.2. Effective Variables Selection

The full-band spectral data were pre-processed to remove some noise information. However, they still contained excessive wavelength information variables. The PSR+3500 spectrometer contains more than 2000 wavelengths. If the full spectral bands are used for modeling and analysis, then the cost of equipment computing and running time increases. More importantly, the covariance problem of spectral data and the large number of wavelength variables are unrelated to the detected object, thereby affecting the prediction accuracy and stability of the model [16]. Therefore, selecting effective information variables from the full wavelength band is important. Several commonly used methods for effective wavelength selection are described in the following section.
(1)
Competitive adaptive reweighted sampling (CARS)
The CARS algorithm is a wavelength selection method proposed by Li et al. that can select the optimal combination of wavelengths in the full spectrum [28]. The most important evaluation metric for CARS variable selection is the absolute value of the PLS model regression coefficients. CARS uses an iterative competitive approach to sequentially select a subset of N variables from N Monte Carlo sampling runs. The number of wavelengths selected for each sample is then determined by an exponentially decreasing function (EDF) and adaptive reweighted sampling (ARS). Finally, the most critical variable for the prediction target is selected on the basis of the minimum root mean square error of each subset cross-validation (RMSECV). The PLS subset model is selected using 10 cross-validations, and the number of Monte Carlo samples is set to 50. When the RMSECV reaches its minimum value, the corresponding subset is the optimal variable subset [29].
(2)
Successive projections algorithm (SPA)
The SPA algorithm can effectively select spectral feature variables. The smallest combination of redundant information variables is screened in the spectral information by using vector projection analysis to reduce the number of modeled variables by minimizing the covariance between them [30]. The principle of the SPA algorithm for screening variables is to initially create a subset of candidate variables with the smallest covariance as a result of a projection operation on the columns of the spectral matrix that could be used in the training data. Then, the results of the multiple linear regression model are evaluated using cross-validation or by applying the results of a separate validation set. Finally, uninformative variables that contribute to the loss of predictive power are eliminated [31,32].
(3)
Uninformative variable elimination (UVE)
The UVE algorithm is a variable screening method commonly used to process spectral data. The method initially evaluates the reliability of each variable in the model by means of variable selection criteria, it assesses the stability of each variable. Then, variables containing redundant information in the spectral information are eliminated [33]. Modeling with the variables selected by UVE could effectively solve the over-fitting problem and improve the predictive power of the model [34]. UVE could not only eliminate redundant information variables when processing spectral data but also reduce the computing time [35].

2.4.3. Model Building

PLS is a multi-variate statistical method that is sensitive to non-collinear variables. It can handle data sets with a large number of variables and is widely used in regression model construction because of its simplicity of use, stability of performance, and ease of access. PLS was used to identify the fundamental relationship between two matrices (X and Y), the spectral matrix (X) and the target chemical composition matrix (Y), and to search for a set of latent variables (LVs) to perform X and Y decomposition [36]. PLS was used to filter spectral data LVs. The optimal number of LVs was determined by the minimum of the sum of squared errors, thereby eliminating redundant information and random errors in the original data. The ideal model should have a low RMSEC and RMSE of validation or prediction (RMSEP) [37].
The effects of appellation and year differences on the NIR spectral detection of SSC and pH values of ‘Dangshan’ pears were investigated comprehensively. Spectral prediction models for SSC and PH values of pear from local origin models (Case a) and hybrid origin models (Case b) were developed (Figure 3). First, the PLS models for three different origins were established with the calibration set data of pear samples from Pear King Orchard, Pear One Orchard, and Tangzhai Town Farm in 2020, respectively, and the SSC and pH of pears produced by each of the three origins were predicted. Then the PLS models of each origin were validated using the validation set data of the three origins in 2020, respectively (Case a). The calibration set samples from the three production locations were then mixed to create an integrated hybrid origin model to predict the SSC and pH of pears of different origins (Case b I). The model was verified and further observed to see whether it was affected by pear origin and test year. The validity of this model was verified using the spectral information of 450 pear samples obtained in 2019 from Pear King No.1 Orchard (LSW1), Yellow River Gorge Farm (HHGD), and Pear King No.2 Orchard (LSW2); these data were used as the prediction set (Case b II). The performance of the two detection models was compared, and the more accurate and robust model was selected for further analysis.

2.4.4. Model Evaluation

The coefficient of determination (R2c and R2p) and the RMSE (RMSEC and RMSEP) were used to evaluate the performance of the model. In general, a good model has high R2c and R2p values and low RMSEC and RMSEP values [38,39,40]. The R2c and R2p values should be close to 1, and the RMSEC and RMSEP values should be connected to 0. The formulae for calculating R2c and R2p, RMSEC, and RMSEP are as follows:
R c 2 = i = 1 n c y i ^   y   ¯ 2 i = 1 n c y i   y   ¯ 2
R p 2 = i = 1 n p y i ^   y   ¯ 2 i = 1 n p y i   y   ¯ 2
RMSEC = ( 1 N i = 1 N c y i y i ^ 2 ) 1 2
RMSEP = ( 1 N i 1 N p y i y i ^ 2 ) 1 2
where y indicates the data to be fitted with a mean value of y ¯ and predicted value of y ^ ; and N is the total number of predicted samples.

3. Results

3.1. Spectral Characterization of Pear of Different Origins

Figure 4a shows the original reflectance spectral information of 496 pear samples collected in 2020 in the wavelength range of 350–2500 nm. Sample reflectance showed very distinct peaks near 400, 680, 1150, and 1550 nm. The absorption peak at approximately 400 nm was associated with the absorption of carotenoids, such as lutein and β-carotene [41]. The absorption peak near 680 nm might be related to the absorption band of chlorophyll [42]. The absorption peak at approximately 1150 nm was related to the first overtone of the O-H band in water [36]. The absorption peak at approximately 1550 nm was related to the stretching and bending of O-H [30]. The average spectral curves of the pear samples from the three origins are shown in Figure 4b. The figure shows that the spectral curves had similar trends, but the reflectance intensities were different at different wavelengths. This finding indicates that the different pear internal substances were the same, but the content of each compound was different. The difference in spectral reflectance intensities is a prerequisite for building the regression model [43].
After the analysis and comparison of the preprocessing algorithms, the preprocessed spectra by the SNV method were used for subsequent variable selection and modeling in this study. The original spectral profile (Figure 4a) and the SNV preprocessed spectral profile (Figure 5) were observed. The results showed significant bursts in the visible (959–985 nm) and NIR (1842–1859 nm) band ranges. Therefore, only 2004 wavelength points in the 400–958, 986–1841, and 1859–2450 nm spectral ranges were used for the analyses.

3.2. Analysis of SSC and pH of Pear of Different Origins

The test sample calibration set and prediction set were divided into a ratio of three to one, and the results of the division of the actual measured values of pH and SSC of ‘Dangshan’ pear samples from four different locations are shown in Table 2 and Table 3. The maximum value, minimum value, mean value, standard deviation, and range of values of SSC and pH of ‘Dangshan’ pears in different locations were different. The statistical values indicate that the mean pH values of pear in the four different locations were between 5.24 and 5.39, indicating that the pH of pear does not vary remarkably with pear origin and is relatively stable. For SSC, the highest average value (9.89) was found in pears from Yihao Orchard, whereas the lowest average value (8.69) was found in pears from Tangzhai Farm, which is 12% lower than that of the former. This difference could be attributed to the differences in fertilization and cultivation management practices between the two locations. Tangzhai Farm and Lishuwang Orchard were considered to have rough farming practices and rough-to-fine farming practices in terms of fertilization and agronomical management. In these orchards, applications of chemical and organic fertilizers were not based on soil testing recommendations, but on personal experience. In contrast, Yihao Orchard adopted modern production management practices and made balanced applications of chemical, farmyard manure, and biological fertilizers via soil and foliar treatments based on soil testing recommendations to meet the needs of pear plant growth. In addition, other modern techniques, such as proper pruning and effective disease and weed and insect pest management, were adopted. These modern farming practices contributed to the improvement of pear quality [22]. The differences in farming practice also contributed, to some extent, to the significant differences observed in pear spectra among the four different orchards.
The results also showed that the ranges of pH and SSC in the prediction data sets were all within the ranges of pH and SSC in the calibration data sets (Table 2 and Table 3). These findings indicate that the developed pH and SSC prediction models can be applied to predict the pH and SCC in pears. This creates a condition that can help construct a more stable and reliable prediction model.

3.3. Modeling Analysis of pH and SSC of Pear of Different Origins

3.3.1. Local Model

The PLS models for three local pear origins were developed to predict the SSC and pH of pears produced by Lishuwang Orchard, Yihao Orchard, and Tangzhai Farm, and were validated with the pear samples from the calibration sets of different origins. The prediction results of the full spectrum-based PLS model based on the different prediction data sets are shown in Table 4. All the PLS models could effectively predict the pear SSC and pH. The local model based on the calibration set of pear samples of the same origin achieved a better prediction for all the samples in the prediction set of the same origin.
In the actual production situation, if the SSC or pH value models were constructed separately for each pear origin, the workload would be massive even when the detection results were improved. This condition is not favorable for practical application. For SSC and pH prediction of unknown origin samples, initially determining their origin information based on spectral information is necessary. Subsequently, the SSC and pH values are predicted based on the model under the corresponding origin; otherwise, the model prediction accuracy is affected [44]. However, this process is excessively tedious in accordance with the appellation determination results in the first step. Therefore, the established local origin model was applied to the prediction set samples from other pear production locations, and the results are shown in Table 4. The results show that the detection model with a local origin produced large errors when predicting the SSC and pH of pears from different origins. For example, using the proposed models, the RMSEP of the Lishuwang origin was 0.279 and 0.218, which was significantly greater than the 0.201 corresponding to the model built for the Lishuwang calibration set. The prediction coefficients of determination were also reduced to a varying degree. Similar conclusions could be drawn for the predictions of pears from all other locations. All exhibited a decrease in R2p and an increase in RMSEP values. There was a maximum difference of 0.156 for R2p and 0.243 for RMSEP in the SSC prediction, and a maximum difference of 0.187 for R2p and 0.028 for RMSEP in the pH prediction. Therefore, the SSC and pH detection models for a local origin cannot satisfy the actual production requirements, and establishing a comprehensive model for hybrid (multiple) pear origins is needed for practical application.

3.3.2. Integrated Model for Hybrid Pear Origins

The calibration set of pear samples of different origins was combined, two-by-two, to establish a mixed NIR spectral detection model for pH and SSC of pears from two locations to further characterize the effect of pear origin on the pH and SSC spectral detection model. Then, the calibration set of pear samples from three locations was integrated to establish a comprehensive hybrid origin model, and the results are shown in Table 5.
The prediction accuracy of the model continued to improve as the spectral information of pear samples from more origins was combined in the calibration set. The model built by integrating spectral information of pears from three origins was significantly better than that built by combining spectral information of pears from two origins. Moreover, the model built by combining the calibration sets of all three pear origins achieved better detection results for the prediction set of pear samples from the three origins. Therefore, when the calibration set contained more spectral information about pears from different origins, the constructed models achieved better predictions of SSC and pH values for pear samples from different origins. Thus, the influence of the pear origin on the spectral detection models of pH and SSC decreased. However, the full-band NIR spectra had 2004 wavelength variables, and the covariance between wavelengths and redundant information resulted in a more complex model; the PLS modeling time increases linearly with the increase in variables [45]. Therefore, to further simplify the detection model, an attempt was made to preferentially select the characteristic wavelengths used for assessing SSC and pH using CARS, SPA, and UVE methods to select characteristic wavelength-based prediction models for SSC and pH values.

3.3.3. Integrated Models for Different Pear Origins Based on Characteristic Wavelengths

The results of the CARS algorithm in selecting the effective variables of SSC and pH values of pear are shown in Figure 6. The results show that the number of selected variables decreased continuously with the increase in sampling size, and the rate of decrease appeared to change from fast to slow as a result of the exponential decay function (EDP) (Figure 6aI, bI). This finding indicates that the CARS algorithm uses a screening scheme that combines coarse and fine selection, which greatly improves the efficiency of variable selection. Figure 6aII, bII show the variation curves of RMSECV versus the number of variables. The RMSECV showed a slight decrease with the increase in the number of samples. When the number of samples was 25, the RMSECV reached its lowest point and then started to increase. This process was the result of eliminating the wavelengths that carried redundant information. The optimal set of variables was selected when the sampling number was 25 and the RMSECV was minimal. CARS selected 70 and 52 variables as valid variables for predicting SSC and pH values of ‘Dangshan’ pear, respectively.
A clear relationship was found between the number of wavelengths selected by SPA and the RMSEP value. Therefore, when the number of selected wavelengths increased, the RMSEP value decreased and then stabilized when it reached a certain value. The overall trend remained the same despite the small fluctuations in the process. The SPA algorithm selected the pear SSC and pH value feature variables. As shown in Figure 7a,b, the RMSEP value stabilized when the number of selected variables exceeded 18 and 11, but the RMSEP reached its best value at 18 and 11 variables (marked as red rectangles). SPA selected 18 and 11 variables as effective variables for predicting SSC and pH of ‘Dangshan’ pears, respectively.
Valid variables were selected using UVE for 2004 wavelengths of the preprocessed full-spectral variables, where the number of random variables was set to 200. The results of selecting the pear SSC and pH feature variables are shown in Figure 8a,b. The blue vertical line in Figure 8 indicates the demarcation line between the input spectral variables and random variables; the red curve to the left of the demarcation line depicts the distribution of the spectral variable’s T-values; and the random variable’s T-values are distributed to the right of the blue demarcation line, as indicated by the purple curve. The threshold used to filter the valid variables was determined by the random variable, the T-valuemax, of the maximum value of the random variable T-value, and the threshold was triggered when the T-value of the random variable was equal to 0.99 times the T-valuemax. The variables outside the two dashed lines were the valid variables selected, and the remaining variables in between the two lines were excluded. A total of 505 and 557 variables were selected by UVE as valid variables for predicting SSC and pH values of ‘Dangshan’ pears, respectively.
The characteristic variables selected by the three algorithms were used as input variables to construct the PLS prediction models for assessing the SSC and pH of ‘Dangshan’ pears, and the results are shown in Table 6. For the prediction of SSC, the constructed CARS-PLS model performed the best and achieved better prediction results for the three different pear origins, namely, Lishuwang Orchard, Tangzhai Farm, and Yihao Orchard. Among them, the prediction results of R2p = 0.943 and RMSEP = 0.142 for the Yihao Orchard were better than the prediction results of R2p = 0.921 and RMSEP = 0.178 for the full wavelength. The prediction results of the other two locations, Lishuwang Orchard and Tangzhai Farm, were close to the prediction results of the full wavelength. The CARS-PLS model used only 3.5% of the characteristic variables of the full wavelength and achieved prediction results close to or even better than the full-wavelength model. SPA, although it selected the smallest number of variables, performed slightly worse than the other two models, where R2p was above 0.87. The prediction results of the UVE-PLS model were better than those of the SPA-PLS model, where the R2p was greater than 0.88. However, the number of input variables was much greater than those of the CARS-PLS and SPA-PLS models, and it was not the best model. Therefore, the UVE-PLS model outperforms the CARS-PLS and SPA-PLS models, and the UVE-PLS model is the best model in terms of prediction accuracy. In addition, the CARS-PLS model is the second-best model, which is better than the SPA-PLS model. However, considering the number of characteristic variables, the UVE-PLS model had 557 input variables, while the CARS-PLS model had only 52 input variables. Thus, the model based on the 52-feature wavelengths by the CARS was selected as the best model for pear pH detection. The results of the comprehensive analysis indicate that the performance of the CARS-PLS model is ideal in the prediction of pear SSC and pH. Accurate predictions of the SSC and pH of pears from different production areas can be achieved, and the influence of pear origin on the model can be reduced.

3.4. Model Validation

To verify the validity of the CARS-PLS model and to determine whether it was affected by variability in pear origin and test year, its validity was verified using 450 pears harvested in 2019 from Lishuwang 1 Orchard (LSW1), Yellow River Gorge Farm (HHGD), and Lishuwang 2 Orchard (LSW2) (Figure 1).
The pear SSC and pH (Table 7 and Table 8) were measured for the 450 Dangshan pear samples using the same method and instrumentation to collect spectral information (Figure 9).
The average spectral curves of the pear samples for both years (Figure 10) were observed and found that although the general trend of the spectral curves was the same, the spectral features were remarkably different at several characteristic peaks. This finding indicates that the internal substances of the pear were essentially the same in both years, but the content of each compound was different. The comparison of Table 2, Table 3, Table 7 and Table 8 further illustrates the remarkable differences between the data from 2019 and 2020. The SSC value for the 2020 pear samples ranged from 6.57 to 11.97, and the SSC value for the 2019 pear samples ranged from 9.40 to 13.77, which was higher than the 20-year range of approximately 15.03–43%; the mean value reached 11.70, which was much higher than the 2020 value of 9.43. The 2020 pear samples had pH values in the range of 4.77–6.34, with a mean value of 5.305, and the 2019 pH ranged from 5.03 to 5.90, with a mean value of 5.65, which was also significantly different than the mean value of the 2020 pear samples.
The factors contributing to these differences mainly include light, precipitation, fertilization, and others. However, light and precipitation are among the most important factors. The query data in the key growth period of pear in the 3-month period indicates that Dangshan County, where the pear samples were collected from, had significantly more cloudy and rainy days in 2020 than in 2019. Rainfall was also much more in 2020 than in 2019; light intensity was significantly less than the average level in 2019 within the same period of time. More cloudy and rainy days and small temperature differences between day and night could significantly affect pear photosynthesis and weaken their photosynthesis, resulting in less sugar accumulation, which was the most important factor leading to the large difference between 2019 and 2020 [46]. In addition, increased precipitation led to poor soil permeability and reduced soil microbial activity, resulting in poorer nutrient uptake by pear trees, which thereby affected the internal quality of pears [16].
The pear SSC and pH values were predicted directly using the CARS-PLS model, constructed based on the pear samples from three different orchards in 2019 to verify the validity of the model in Section 3.3 and to observe whether it was affected by the variability of pear origin and test year. In addition, the spectral information of 450 pear samples was used as the prediction set after preprocessing.
The predicted results are shown in Table 9. The results show that the predicted R2p for SSC was above 0.9 in the three different orchards, and the predictions for pH were slightly lower. R2p was also above 0.85 for all orchards. Although the 19-year data samples were very different from the 20-year data samples, the model achieved valid predictions for the fruit SSC and pH for the 19-year pear samples.
Considerable progress has been achieved in the non-destructive detection of the internal quality of pears in recent years. Yu et al. [47] developed a handheld spectrometer using a linear variable filter (LVF) and a complementary metal oxide semi-conductor (CMOS) linear detector array, which successfully detected the SSC of pears in the 620–1080 spectral interval. Although this method achieved handheld online detection, only the spectral interval at 620–1080 was used for modeling and analysis, and no comparative study of different spectral intervals was conducted. In their study, there was also a lack of an independent validation process that could demonstrate the universal applicability of handheld spectrometers developed based on the 620–1080 spectral interval. Li et al. [48] investigated the quantitative determination of three pear varieties using a multi-species model. A multi-species model and a local origin model were developed and optimized using different variable selection algorithms. The coefficient of determination of the prediction set (R2) and RMSEP of the model were 0.92 and 0.30, respectively, demonstrating the feasibility of accurately determining the SSC of different pear varieties using the multi-species model. Passos et al. [49], based on short-wave NIR spectroscopy in packaging and sorting facilities, successfully detected the SSC of pear samples from different batches, storage times, and maturity levels. They further found that incorporating temperature and fruit size information into the detection model could improve the predictability of fruit SSC. Li et al. [48] and Passos et al. [49] studied multiple pear varieties, batches, and other factors, such as temperature and fruit size. However, only a single detection of the SSC of pear samples was performed, and fruit pH, another important index affecting the internal quality of pears, was not investigated. The effect of pear origin on the non-destructive detection of the internal quality of pears was not evaluated. There was also a lack of an independent validation for the most important detection model in their study. In this study, local and hybrid models for pears of different origins were developed using three-variable screening methods combined with a PLS algorithm to better predict the SSC and pH of pear samples of different origins. Independent prediction sets from different years with large variability were also used for model validation. The biggest difference between this study and previous studies is that our study fully considered the effects of origin differences, meteorology, time (season), and management methods on pear quality. The model constructed in this study is highly accurate and robust, which can better predict pears of unknown origin and different years and has a scalable application prospect.

4. Conclusions

This study selected ‘Dangshan’ pears of different origins to evaluate the effects of fruit origin on the NIR spectral internal quality (SSC and pH) detection models. It was found that compared with the local origin and the pairwise hybrid origins models, the model established by the calibration set of hybrid pears from three producing origins achieved the most ideal prediction results. Combined with the characteristic wavelength variables screened by CARS, the model was further improved. In order to verify the validity of the selected characteristic wavelengths, the above model was directly used to detect the SSC and PH values of pear fruit samples from three different producing areas in 2019, and good detection results were achieved. The predicted R2p for the pear SSC was above 0.9, and the predicted R2p for the pear pH was also above 0.85, and the value range of RMSEP was 0.167–0.201 and 0.027–0.032. The results showed that the calibration model can achieve better performance in the prediction of the pear SSC and pH if it involves more origins of pears and uses the effective characteristic wavelengths. Variations due to the difference in the origin of pears on the detection of the fruit SSC and pH by NIR spectroscopy can also be decreased. The results of the current study provide not only the theoretical basis but also practical implementation for the accurate and online detection of the SSC and pH of pears using NIR spectroscopy. The model constructed in this study can be used to develop a more adaptable, stable, and affordable portable instrument for the rapid non-destructive detection of soluble solids and acidity in pears.

Author Contributions

Conceptualization, T.C. and D.Z.; methodology, T.C. and D.Z; software, S.J.; validation, T.C., S.G. and X.-G.Z.; formal analysis, T.C.; investigation, T.C., S.G., S.J., Z.X. and F.J.; data curation, D.Z.; writing—original draft preparation, T.C.; writing—review and editing, Z.P., S.F., Z.X., X.-G.Z. and D.Z.; visualization, T.C., S.G. and Z.P.; supervision, D.Z.; project administration, D.Z.; and funding acquisition, D.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Key Research and Technology Development Projects of Anhui Province (grant no. 202004a06020045, grant no.202204c06020012), the Open Research Fund of National Engineering Research Center for Agro-Ecological Big Data Analysis & Application (grant no. AE202211, grant no. AE202201).

Institutional Review Board Statement

Not applicable.

Data Availability Statement

Some or all data that support the finding of this study are available from the corresponding author upon reasonable request.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Reiland, H.; Slavin, J. Systematic Review of Pears and Health. Nutr. Today 2015, 50, 301–305. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  2. Food and Agricultural Organization of the United Nations. Available online: http://www.fao.org/faostat/en/#data (accessed on 20 December 2019).
  3. Hu, M.H.; Dong, Q.L.; Liu, B.L.; Opara, U.L. Prediction of mechanical properties of blueberry using hyperspectral interactance imaging. Postharvest Biol. Technol. 2016, 115, 122–131. [Google Scholar] [CrossRef]
  4. Liu, F.; He, Y.; Wang, L. Comparison of calibrations for the determination of soluble solids content and pH of rice vinegars using visible and short-wave near infrared spectroscopy. Anal. Chim. Acta 2008, 610, 196–204. [Google Scholar] [CrossRef] [PubMed]
  5. Peng, Y.; Lu, R. Analysis of spatially resolved hyperspectral scattering images for assessing apple fruit firmness and soluble solids content. Postharvest Biol. Technol. 2008, 48, 52–62. [Google Scholar] [CrossRef]
  6. Li, J.; Chen, L. Comparative analysis of models for robust and accurate evaluation of soluble solids content in ‘Pinggu’ peaches by hyperspectral imaging. Comput. Electron. Agric. 2017, 142, 524–535. [Google Scholar] [CrossRef]
  7. Fan, S.; Huang, W.; Guo, Z.; Zhang, B.; Zhao, C. Prediction of Soluble Solids Content and Firmness of Pears Using Hyperspectral Reflectance Imaging. Food Anal. Meth. 2015, 8, 1936–1946. [Google Scholar] [CrossRef]
  8. Mahesh, S.; Jayas, D.S.; Paliwal, J.; White, N.D.G. Hyperspectral imaging to classify and monitor quality of agricultural materials. J. Stored Prod. Res. 2015, 61, 17–26. [Google Scholar] [CrossRef]
  9. Zhang, D.; Wang, Q.; Lin, F.; Weng, S.; Lei, Y.; Chen, G.; Gu, C.; Zheng, L. New Spectral Classification Index for Rapid Identification of Fusarium Infection in Wheat Kernel. Food Anal. Meth. 2020, 13, 2165–2175. [Google Scholar] [CrossRef]
  10. Nicolaï, B.M.; Beullens, K.; Bobelyn, E.; Peirs, A.; Saeys, W.; Theron, K.I.; Lammertyn, J. Nondestructive measurement of fruit and vegetable quality by means of NIR spectroscopy: A review. Postharvest Biol. Technol. 2007, 46, 99–118. [Google Scholar] [CrossRef]
  11. Zareef, M.; Chen, Q.; Hassan, M.M.; Arslan, M.; Hashim, M.M.; Ahmad, W.; Kutsanedzie, F.Y.H.; Agyekum, A.A. An Overview on the Applications of Typical Non-linear Algorithms Coupled With NIR Spectroscopy in Food Analysis. Food Eng. Rev. 2020, 12, 173–190. [Google Scholar] [CrossRef]
  12. Qiao, L.; Mu, Y.; Lu, B.; Tang, X. Calibration Maintenance Application of Near-infrared Spectrometric Model in Food Analysis. Food Rev. Int. 2021, 1–17. [Google Scholar] [CrossRef]
  13. Sun, X.; Zhu, K.; Jiang, X.; Liu, Y. Non-destructive detection of blackheart and soluble solids content of intact pear by online NIR spectroscopy. J Supercomput. 2018, 76, 3173–3187. [Google Scholar] [CrossRef]
  14. Wang, J.; Wang, J.; Chen, Z.; Han, D. Development of multi-cultivar models for predicting the soluble solid content and firmness of European pear (Pyrus communis L.) using portable vis–NIR spectroscopy. Postharvest Biol. Technol. 2017, 129, 143–151. [Google Scholar] [CrossRef]
  15. Sun, T.; Lin, H.; Xu, H.; Ying, Y. Effect of fruit moving speed on predicting soluble solids content of ‘Cuiguan’ pears (Pomaceae pyrifolia Nakai cv. Cuiguan) using PLS and LS-SVM regression. Postharvest Biol. Technol. 2009, 51, 86–90. [Google Scholar] [CrossRef]
  16. Zuo, Y.; Zhang, J.; Zhao, R.; Dai, H.; Zhang, Z. Application of vermicompost improves strawberry growth and quality through increased photosynthesis rate, free radical scavenging and soil enzymatic activity. Sci. Hortic. 2018, 233, 132–140. [Google Scholar] [CrossRef]
  17. Travers, S.; Bertelsen, M.G.; Petersen, K.K.; Kucheryavskiy, S.V. Predicting pear (cv. Clara Frijs) dry matter and soluble solids content with near infrared spectroscopy. LWT Food Sci. Technol. 2014, 59, 1107–1113. [Google Scholar] [CrossRef]
  18. Xu, H.; Qi, B.; Sun, T.; Fu, X.; Ying, Y. Variable selection in visible and near-infrared spectra: Application to on-line determination of sugar content in pears. J. Food Eng. 2012, 109, 142–147. [Google Scholar] [CrossRef]
  19. Li, J.; Huang, W.; Zhao, C.; Zhang, B. A comparative study for the quantitative determination of soluble solids content, pH and firmness of pears by Vis/NIR spectroscopy. J. Food Eng. 2013, 116, 324–332. [Google Scholar] [CrossRef]
  20. Mishra, P.; Woltering, E.; Brouwer, B.; Hogeveen-van Echtelt, E. Improving moisture and soluble solids content prediction in pear fruit using near-infrared spectroscopy with variable selection and model updating approach. Postharvest Biol. Technol. 2021, 171, 111348. [Google Scholar] [CrossRef]
  21. Fan, S.; Huang, W.; Li, J.; Zhao, C.; Zhang, B. Characteristic wavelengths selection of soluble solids content of pear based on NIR spectral and LS-SVM. Spectrosc. Spectr. Anal. 2014, 34, 2089–2093. [Google Scholar]
  22. Peck, G.; Andrews, P.; Reganold, J. Apple orchard productivity and fruit quality under organic, conventional, and integrated management. Hortscience 2006, 41, 99–107. [Google Scholar] [CrossRef]
  23. Zhang, B.; Dai, D.; Huang, J.; Zhou, J.; Gui, Q.; Dai, F. Influence of physical and biological variability and solution methods in fruit and vegetable quality nondestructive inspection by using imaging and near-infrared spectroscopy techniques: A review. Crit. Rev. Food Sci. Nutr. 2018, 58, 2099–2118. [Google Scholar] [CrossRef] [PubMed]
  24. Xiaobo, Z.; Jiewen, Z.; Holmes, M.; Hanpin, M.; Jiyong, S.; Xiaopin, Y.; Yanxiao, L. Independent component analysis in information extraction from visible/near-infrared hyperspectral imaging data of cucumber leaves. Chemometrics Intellig. Lab. Syst. 2010, 104, 265–270. [Google Scholar] [CrossRef]
  25. Wu, G.; Wang, C. Investigating the effects of simulated transport vibration on tomato tissue damage based on vis/NIR spectroscopy. Postharvest Biol. Technol. 2014, 98, 41–47. [Google Scholar] [CrossRef]
  26. Savitzky, A.; Golay, M.J.E. Smoothing and differentiation of data by simplified least squares procedures. Anal. Chem. 1964, 36, 1627–1639. [Google Scholar] [CrossRef]
  27. Li, H.; Liang, Y.; Xu, Q.; Cao, D. Key wavelengths screening using competitive adaptive reweighted sampling method for multivariate calibration. Anal. Chim. Acta 2009, 648, 77–84. [Google Scholar] [CrossRef]
  28. Ma, T.; Li, X.; Inagaki, T.; Yang, H.; Tsuchikawa, S. Noncontact evaluation of soluble solids content in apples by near-infrared hyperspectral imaging. J. Food Eng. 2018, 224, 53–61. [Google Scholar] [CrossRef]
  29. Zhan, B.; Ni, J.; Li, J. Hyperspectral technology combined with CARS algorithm to quantitatively determine the SSC in Korla fragrant pear. Spectrosc. Spectr. Anal. 2014, 34, 2752–2757. [Google Scholar]
  30. Zhang, D.; Xu, Y.; Huang, W.; Tian, X.; Xia, Y.; Xu, L.; Fan, S. Nondestructive measurement of soluble solids content in apple using near infrared hyperspectral imaging coupled with wavelength selection algorithm. Infrared Phys. Technol. 2019, 98, 297–304. [Google Scholar] [CrossRef]
  31. Song, D.; Song, L.; Sun, Y.; Hu, P.; Tu, K.; Pan, L.; Yang, H.; Huang, M. Black Heart Detection in White Radish by Hyperspectral Transmittance Imaging Combined with Chemometric Analysis and a Successive Projections Algorithm. Appl. Sci. 2016, 6, 249. [Google Scholar] [CrossRef]
  32. Soares, S.F.C.; Gomes, A.A.; Araujo, M.C.U.; Filho, A.R.G.; Galvão, R.K.H. The successive projections algorithm. TrAC Trends Anal. Chem. 2013, 42, 84–98. [Google Scholar] [CrossRef]
  33. Cai, W.; Li, Y.; Shao, X. A variable selection method based on uninformative variable elimination for multivariate calibration of near-infrared spectra. Chemometrics Intellig. Lab. Syst. 2008, 90, 188–194. [Google Scholar] [CrossRef]
  34. Ye, S.; Wang, D.; Min, S. Successive projections algorithm combined with uninformative variable elimination for spectral variable selection. Chemometrics Intellig. Lab. Syst. 2008, 91, 194–199. [Google Scholar] [CrossRef]
  35. Gottardo, P.; De Marchi, M.; Cassandro, M.; Penasa, M. Technical note: Improving the accuracy of mid-infrared prediction models by selecting the most informative wavelengths. J. Dairy Sci. 2015, 98, 4168–4173. [Google Scholar] [CrossRef] [Green Version]
  36. Magwaza, L.S.; Landahl, S.; Cronje, P.J.; Nieuwoudt, H.H.; Mouazen, A.M.; Nicolai, B.M.; Terry, L.A.; Opara, U.L. The use of Vis/NIRS and chemometric analysis to predict fruit defects and postharvest behaviour of ‘Nules Clementine’ mandarin fruit. Food Chem. 2014, 163, 267–274. [Google Scholar] [CrossRef]
  37. Huang, Y.; Lu, R.; Chen, K. Assessment of tomato soluble solids content and pH by spatially-resolved and conventional Vis/NIR spectroscopy. J. Food Eng. 2018, 236, 19–28. [Google Scholar] [CrossRef]
  38. Fox, R.A.; Steel, R.G.D.; Torrie, J.H. Principles and Procedures of Statistics with Special Reference to the Biological Sciences. Inc. Stat. 1961, 11, 170. [Google Scholar] [CrossRef]
  39. Glantz, S.; Slinker, B.; Neilands, T. Primer of Applied Regression and Analysis of Variance; McGraw-Hill: New York, NY, USA, 1990; Available online: https://lib.ugent.be/catalog/rug01:000258630 (accessed on 5 October 1990).
  40. Draper, N.; Smith, H. Applied Regression Analysis; John Wiley & Sons: Hoboken, NJ, USA, 1998. [Google Scholar] [CrossRef]
  41. Merzlyak, M.N.; Solovchenko, A.E.; Gitelson, A.A. Reflectance spectral features and non-destructive estimation of chlorophyll, carotenoid and anthocyanin content in apple fruit. Postharvest Biol. Technol. 2003, 27, 197–211. [Google Scholar] [CrossRef]
  42. Stchur, P.; Cleveland, D.; Zhou, J.; Michel, R.G. A review of recent applications of near infrared spectroscopy, and of the characteristics of a novel PbS CCD array-based near-infrared spectrometer. Appl. Spectrosc. Rev. 2002, 37, 383–428. [Google Scholar] [CrossRef]
  43. Curran, P.J. Remote sensing of foliar chemistry. Remote Sens. Environ. 1989, 30, 271–278. [Google Scholar] [CrossRef]
  44. Yuan, L.M.; Mao, F.; Chen, X.; Li, L.; Huang, G. Non-invasive measurements of ‘Yunhe’ pears by vis-NIRS technology coupled with deviation fusion modeling approach. Postharvest Biol. Technol. 2020, 160, 111067. [Google Scholar] [CrossRef]
  45. Chauchard, F.; Cogdill, R.; Roussel, S.; Roger, J.M.; Bellon-Maurel, V. Application of LS-SVM to non-linear phenomena in NIR spectroscopy: Development of a robust and portable sensor for acidity prediction in grapes. Chemometrics Intellig. Lab. Syst. 2004, 71, 141–150. [Google Scholar] [CrossRef] [Green Version]
  46. Wang, C.; Wang, H.; Zhao, X.; Chen, B.; Wang, F. Mulching affects photosynthetic and chlorophyll a fluorescence characteristics during stage III of peach fruit growth on the rain-fed semiarid Loess Plateau of China. Sci. Hortic. 2015, 194, 246–254. [Google Scholar] [CrossRef]
  47. Yu, X.; Lu, Q.; Gao, H.; Ding, H. Development of a Handheld Spectrometer Based on a Linear Variable Filter and a Complementary Metal-Oxide-Semiconductor Detector for Measuring the Internal Quality of Fruit. J. Near Infrared Spectrosc. 2016, 24, 69–76. [Google Scholar] [CrossRef]
  48. Li, J.; Zhang, H.; Zhan, B.; Wang, Z.; Jiang, Y. Determination of SSC in pears by establishing the multi-cultivar models based on visible-NIR spectroscopy. Infrared Phys. Technol. 2019, 102, 103066. [Google Scholar] [CrossRef]
  49. Passos, D.; Rodrigues, D.; Cavaco, A.M.; Antunes, M.D.; Guerra, R. Non-Destructive Soluble Solids Content Determination for ‘Rocha’ Pear Based on VIS-SWNIR Spectroscopy under ‘Real World’ Sorting Facility Conditions. Sensors 2019, 19, 5165. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Geographic origins of pear samples used in this study.
Figure 1. Geographic origins of pear samples used in this study.
Agriculture 12 01618 g001
Figure 2. Spectral acquisition system.
Figure 2. Spectral acquisition system.
Agriculture 12 01618 g002
Figure 3. Experimental scheme of the soluble solids content (SSC) and acidity (pH) detection models for local and hybrid origins. Note: Rectangles indicate the calibration set; ovals indicate the prediction set; and blue-filled rectangles indicate the full sample set.
Figure 3. Experimental scheme of the soluble solids content (SSC) and acidity (pH) detection models for local and hybrid origins. Note: Rectangles indicate the calibration set; ovals indicate the prediction set; and blue-filled rectangles indicate the full sample set.
Agriculture 12 01618 g003
Figure 4. Spectral data of pear samples in 2020.(a) Original reflectance spectral profile of pear samples in the wavelength range of 350–2500 nm. (b) Average original reflectance spectral profile of pear samples from three orchards.
Figure 4. Spectral data of pear samples in 2020.(a) Original reflectance spectral profile of pear samples in the wavelength range of 350–2500 nm. (b) Average original reflectance spectral profile of pear samples from three orchards.
Agriculture 12 01618 g004
Figure 5. Spectral profile of pear samples after pretreatment with the SNV method in the wavelength range of 400–2450 nm.
Figure 5. Spectral profile of pear samples after pretreatment with the SNV method in the wavelength range of 400–2450 nm.
Agriculture 12 01618 g005
Figure 6. Trends in the number of characteristic wavelengths (I) and RMSECV values (II) of the soluble solids content (SSC) (a) and acidity (pH) (b) of pear with increasing number of CARS samples.
Figure 6. Trends in the number of characteristic wavelengths (I) and RMSECV values (II) of the soluble solids content (SSC) (a) and acidity (pH) (b) of pear with increasing number of CARS samples.
Agriculture 12 01618 g006
Figure 7. SPA algorithm to select pear soluble solids content (SSC) (a) and acidity (pH) (b) feature variables.
Figure 7. SPA algorithm to select pear soluble solids content (SSC) (a) and acidity (pH) (b) feature variables.
Agriculture 12 01618 g007
Figure 8. UVE-based selection of pear soluble solids content (SSC) (a) and acidity (pH) (b) feature variables.
Figure 8. UVE-based selection of pear soluble solids content (SSC) (a) and acidity (pH) (b) feature variables.
Agriculture 12 01618 g008
Figure 9. Original reflectance spectra profile of the 2019-year pear samples in the wavelength range of 350–2500 nm.
Figure 9. Original reflectance spectra profile of the 2019-year pear samples in the wavelength range of 350–2500 nm.
Agriculture 12 01618 g009
Figure 10. Average spectral profiles of the 2019-year and 2020-year pear samples in the 350–2500 nm range.
Figure 10. Average spectral profiles of the 2019-year and 2020-year pear samples in the 350–2500 nm range.
Agriculture 12 01618 g010
Table 1. Pear production management practices used in three different orchards.
Table 1. Pear production management practices used in three different orchards.
Origin *Production Management Practices
TZRough farming management
LSWRough and fine combined farming management
YHModern management
Note: * Tangzhai (TZ), Lishuangwang (LSW), and Yihao (YH).
Table 2. Measured values of acidity (pH) in ‘Dangshan’ pear in the calibration and prediction data sets of four different origins.
Table 2. Measured values of acidity (pH) in ‘Dangshan’ pear in the calibration and prediction data sets of four different origins.
Origin *Sample SetNo. of SamplesMinMaxMeanStandard Deviation
LSWCalibration set904.906.225.240.17
Prediction set325.165.555.310.07
TZZCalibration set1144.775.645.330.10
Prediction set375.105.555.390.15
YHCalibration set1684.855.535.300.10
Prediction set555.135.475.290.09
hybridCalibration set3724.776.345.310.14
Prediction set1244.855.535.300.10
Note: * Lishuangwang (LSW), Tangzhai (TZ), and Yihao (YH).
Table 3. Measured values of soluble solids content (SSC/%) in ‘Dangshan’ pear in the calibration and prediction data sets of four different origins.
Table 3. Measured values of soluble solids content (SSC/%) in ‘Dangshan’ pear in the calibration and prediction data sets of four different origins.
Origin *Sample SetNo. of SamplesMinMaxMeanStandard Deviation
LSWCalibration set907.2310.639.360.61
Prediction set328.1310.479.150.47
TZZCalibration set1146.5711.208.860.74
Prediction set376.9710.908.690.98
YHCalibration set1687.2311.979.730.90
Prediction set558.2711.339.890.76
hybridCalibration set3726.5711.979.020.78
Prediction set1247.2311.209.840.86
Note: * Lishuangwang (LSW), Tangzhai (TZ), and Yihao (YH).
Table 4. Prediction results of the soluble solids content (SSC) and acidity (pH) detection models for local pear origins.
Table 4. Prediction results of the soluble solids content (SSC) and acidity (pH) detection models for local pear origins.
ParametersCalibration Set *Prediction Set
LSWTZYH
R2pRMSEPR2pRMSEPR2pRMSEP
SSCLSW0.9130.2010.8030.3300.8430.268
TZ0.8310.2790.8960.2180.8800.243
YH0.8990.2180.7740.4180.9300.175
pHLSW0.9140.0260.7900.0460.8360.039
TZ0.6930.0580.8800.0300.7420.056
YH0.8980.0280.7490.0510.9140.025
Note: * Lishuangwang (LSW), Tangzhai (TZ), and Yihao (YH).
Table 5. Predicted results of the soluble solids content (SSC) and acidity (pH) detection model for hybrid (mix) pear origins.
Table 5. Predicted results of the soluble solids content (SSC) and acidity (pH) detection model for hybrid (mix) pear origins.
ParametersCalibration SetPrediction Set
LSWTZYH
R2pRMSEPR2pRMSEPR2pRMSEP
SSCLSW-TZ0.9060.2020.8730.2610.8850.228
LSW-YH0.8900.2240.8020.3320.9130.201
TZ-YH0.8450.2650.9010.2150.9130.201
LSW-TZ-YH0.9190.1800.9060.2020.9210.178
pHLSW-TZ0.9030.0270.8510.0360.8970.030
LSW-YH0.8870.0300.7800.0490.8550.032
TZ-YH0.8100.0440.8240.0440.8380.039
LSW-TZ-YH0.9350.0190.8500.0380.9180.020
Table 6. Prediction results of different PLS models for the prediction set of SSC and pH of ‘Dangshan’ pears from various origins.
Table 6. Prediction results of different PLS models for the prediction set of SSC and pH of ‘Dangshan’ pears from various origins.
ParametersModelVariablesPrediction Set *
LSWTZYH
R2pRMSEPR2pRMSEPR2pRMSEP
SSCPLS20040.9190.1800.9060.2020.9210.178
CARS-PLS700.9030.2100.9040.2060.9430.142
SPA-PLS180.8750.2580.8900.2210.8760.248
UVE-PLS5050.8840.2350.8940.2200.9130.201
pHPLS20040.9350.0190.8500.0320.9180.020
CARS-PLS520.8970.0290.9170.0220.8740.032
SPA-PLS110.8840.0310.8240.0440.8620.033
UVE-PLS5570.9030.0270.9360.0180.9150.024
Note: * Lishuangwang (LSW), Tangzhai (TZ), and Yihao (YH).
Table 7. Statistical results of the measured fruit acidity (pH) values of all ‘Dangshan’ pear samples.
Table 7. Statistical results of the measured fruit acidity (pH) values of all ‘Dangshan’ pear samples.
OriginNo. of SamplesMinMaxMeanStandard DeviationFp
LSW12075.035.875.630.09575.7230.000 **
HHGD1095.435.855.660.085
LSW21345.255.905.6750.100
Note: ** p < 0.01. F is F-ratio and p is probability value.
Table 8. Statistical results of the measured fruit soluble solids content (SSC/%) values for all ‘Dangshan’ pear samples.
Table 8. Statistical results of the measured fruit soluble solids content (SSC/%) values for all ‘Dangshan’ pear samples.
OriginNo. of SamplesMinMaxMeanStandard DeviationFp
LSW12079.9713.7711.670.6205.70.004 **
HHGD10910.513.3712.180.595
LSW21349.4013.3311.270.595
Note: ** p < 0.01. F is F-ratio and p is probability value.
Table 9. Validation results for the 19-year samples.
Table 9. Validation results for the 19-year samples.
ParametersModelVariablesPrediction Set
LSW1HHGDLSW2
R2pRMSEPR2pRMSEPR2pRMSEP
SSCCARS-PLS700.9130.2010.9300.1670.9160.185
PHCARS-PLS520.8990.0280.8530.0320.9060.027
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Cheng, T.; Guo, S.; Pan, Z.; Fan, S.; Ju, S.; Xin, Z.; Zhou, X.-G.; Jiang, F.; Zhang, D. Near-Infrared Model and Its Robustness as Affected by Fruit Origin for ‘Dangshan’ Pear Soluble Solids Content and pH Measurement. Agriculture 2022, 12, 1618. https://doi.org/10.3390/agriculture12101618

AMA Style

Cheng T, Guo S, Pan Z, Fan S, Ju S, Xin Z, Zhou X-G, Jiang F, Zhang D. Near-Infrared Model and Its Robustness as Affected by Fruit Origin for ‘Dangshan’ Pear Soluble Solids Content and pH Measurement. Agriculture. 2022; 12(10):1618. https://doi.org/10.3390/agriculture12101618

Chicago/Turabian Style

Cheng, Tao, Sen Guo, Zhenggao Pan, Shuxiang Fan, Shucun Ju, Zhenghua Xin, Xin-Gen Zhou, Fei Jiang, and Dongyan Zhang. 2022. "Near-Infrared Model and Its Robustness as Affected by Fruit Origin for ‘Dangshan’ Pear Soluble Solids Content and pH Measurement" Agriculture 12, no. 10: 1618. https://doi.org/10.3390/agriculture12101618

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop