Next Article in Journal
NAC and MYB Families and Lignin Biosynthesis-Related Members Identification and Expression Analysis in Melilotus albus
Next Article in Special Issue
Soil and Regulated Deficit Irrigation Affect Growth, Yield and Quality of ‘Nero d’Avola’ Grapes in a Semi-Arid Environment
Previous Article in Journal
Development of Quinoa Value Chain to Improve Food and Nutritional Security in Rural Communities in Rehamna, Morocco: Lessons Learned and Perspectives
Previous Article in Special Issue
Detecting Mild Water Stress in Olive with Multiple Plant-Based Continuous Sensors
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Modelling Soluble Solids Content Accumulation in ‘Braeburn’ Apples

1
Kompetenzzentrum Obstbau Bodensee, Fachgebiet Ertragsphysiologie, 88213 Ravensburg, Germany
2
TUM School of Life Sciences, Technische Universität München, 85354 Freising, Germany
3
Institut für Obstbau, Hochschule Geisenheim University, 65366 Geisenheim, Germany
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Plants 2021, 10(2), 302; https://doi.org/10.3390/plants10020302
Submission received: 31 December 2020 / Revised: 25 January 2021 / Accepted: 1 February 2021 / Published: 5 February 2021

Abstract

:
Optical sensor data can be used to determine changes in anthocyanins, chlorophyll and soluble solids content (SSC) in apple production. In this study, visible and near-infrared spectra (729 to 975 nm) were transformed to SSC values by advanced multivariate calibration models i.e., partial least square regression (PLSR) in order to test the substitution of destructive chemical analyses through non-destructive optical measurements. Spectral field scans were carried out from 2016 to 2018 on marked ‘Braeburn’ apples in Southwest Germany. The study combines an in-depth statistical analyses of longitudinal SSC values with horticultural knowledge to set guidelines for further applied use of SSC predictions in the orchard to gain insights into apple carbohydrate physiology. The PLSR models were investigated with respect to sample size, seasonal variation, laboratory errors and the explanatory power of PLSR models when applied to independent samples. As a result of Monte Carlo simulations, PLSR modelled SSC only depended to a minor extent on the absolute number and accuracy of the wet chemistry laboratory calibration measurements. The comparison between non-destructive SSC determinations in the orchard with standard destructive lab testing at harvest on an independent sample showed mean differences of 0.5% SSC over all study years. SSC modelling with longitudinal linear mixed-effect models linked high crop loads to lower SSC values at harvest and higher SSC values for fruit from the top part of a tree.

1. Introduction

In apple fruit production, tree physiological status, the farmer’s management decisions in the orchard, together with environmental factors influence postharvest fruit quality and storage pack-out. More specifically, factors that affect fruit quality can include crop load [1,2], timing of harvest [3], application of calcium and potassium fertilizer [4,5], light distribution within the orchard and temperature during important growth periods [6,7] as well as single tree or tree sector physiology [3].
Many factors within apple fruit tissues (cells per apple, energy status, cell wall stability [8]) which can determine harvest date and storage pack-out cannot be seen from the outside of the fruit. Depending on the wavelength, optical sensors (visible (Vis) and near-infrared (NIR) point spectroscopy) can help to get a non-destructive view of the fruit from 1–2 cm under the skin [9]. These portable optical sensors are now relatively inexpensive and fast [10]. In addition, data handling and chemometric software are user friendly (own experience). Light reflectance from the fruit can be monitored in the field to give information about plant pigment development such as chlorophyll, anthocyanins and carotinoids in the Vis spectra [11]. In addition, partial least squares regression (PLSR) modelling for the NIR spectra can be used to estimate soluble solids content (SSC) and dry matter content [12,13,14]. Further information on fruit tissues can be obtained from the light scattering of cell walls and other cellular components using more advanced technologies like spatial frequency domain imaging [15,16]. However, the latter laboratory based technology is not available for applied field-sensing to separate the absorbance from the scattering coefficient. One of the advantages of non-destructive sensor technology is the possibility to gain a large data set on a small-orchard scale during fruit maturation and link these spectral data together with other temporal and orchard field data. Biological and spatial variation are typically high, even within a small-scale apple orchard [17,18]. Moreover, fruit should be harvested and managed according to orchard variation such as site and cropping history to maintain the best possible fruit quality after long-term controlled atmosphere storage [19].
The development of SSC in individual apples depends mainly on the light distribution within the planting system and the fruit to leaf ratio per tree [20]. Furthermore, SSC values vary between different fruit and even for the same measurement position [12,21]. However, spectral scanning allows a large sample size to be obtained relatively fast. These measurements could provide a feasible alternative in the practice to labour intensive and costly laboratory analyses to gain a better idea of the distribution in SSC values.
Furthermore, standard ANOVA analyses are used at a particular moment in time (mostly at harvest) to determine the SSC distribution. This approach overlooks SSC developement over the course of time [22]. In the case of repeated measurements in agriculture and horticulture, mixed-effects models show clear advantages with respect to missing or unbalanced observations and different or restricted measurement periods [23]. When modelling is based on repeated measurements during fruit development, the longitudinal structure results in linear mixed-effect (LME) models to describe time-dependent changes linked to treatment effects and physiological influences. This class of LME models is a flexible subset of (generalized) regression models and can be used to model growth patterns in horticulture [24,25] and other research areas such as physical anthropology [26], clinical biometry [27] or ecology [28]. Modelling apple growth with expolinear, Gompertz and logistic [29,30] functions and adapted von Bertalanffy models [31] is common but SSC accumulation has been less frequently modelled. This is gradually changing through the use of in-depth biochemical analyses and the use of optical handheld sensors. Vis/NIR point spectrometers allow for repeated non-destructive spectral scanning on the same fruit. In a further classification process, longitudinal Vis/NIR data can enable modelling and classification of optimal harvest dates [32,33].
The present study focuses on non-destructive and longitudinal SSC accumulation in fruit in the orchard and the practical application of the above outlined methodology to set guidelines for their broader use. The carbohydrate physiology i.e., SSC accumulation during apple ripening was monitored within the experimental field treatments, reviewed from a user perspective and a statistical viewpoint. The results of the study were based on a large data set of Vis/NIR scans obtained over three study years.
This study investigates in detail (1) the number of calibration samples needed for a robust SSC prediction, (2) the effects of laboratory errors in wet chemistry analyses on PLSR model results, (3) the reliability of modelled SSC values in the orchard in comparison to standard laboratory tests of an independent sample and (4) time-dependent treatment effects on longitudinal SSC accumulation.

2. Results

The apple cropping seasons of 2016, 2017 and 2018 in Southwest Germany were distinctively different. A very wet spring with a light frost event during bloom was recorded in 2016. In 2017, severe frosts occurred during bloom in many European horticulture regions. At the Kompetenzzentrum Obstbau Bodensee, the number of trees available for research studies was reduced to those protected with heaters in plastic tents. Furthermore, 2018 was a relatively hot and dry year with 398 mm of precipitation between April to October compared to 956 mm and 730 mm for the same periods of 2016 and 2017, respectively. There were also clear differences between years in plant development for the growth stages (BBCH) flowering and fruit ripe for picking with differences of up to 12 days.
SSC accumulation derived from the yearly calibrated PLSR model (Section 2.1.1) (Figure 1a) and fruit diameter growth (Figure 1b) are plotted as days after full bloom (DAFB) over the three study years. Fruit diameter and SSC were monitored at the same measurement intervals and for the same fruit. Fruit growth will not be further discussed here and serves only as additional background information on orchard data variance. SSC increases over time following a linear trend. In general, these data indicate a continuous mean SSC accumulation until apple maturity which is consistent with biochemical fruit analyses. In 2017 around 120 DAFB, the within-fruit variability during the time-series data acquisition shows either that the same fruit accumulate and degrade SSC between the scanning intervals or that the measurement environment negatively affected data acquisition. In 2016 and 2017, SSC had approximately the same values at around 120 DAFB. For 2018 the highest SSC values for the three study years were observed. In order to obtain a higher time-series resolution and larger time-series data for improved modelling in [33], data acquisition took place on a daily basis for 120-180 DAFB and SSC scanning started at 50 DAFB in 2018.

2.1. PLSR Calibration Models

2.1.1. Multi-Year (2016–2018) Calibrated Model

The multi-year model was built with calibration data from all study years (2016–2018). The multi-year model was subsequently validated with either a multi-year or the respective yearly data set.The multi-year model results in a root mean square error of prediction (RMSEP) of 0.65% SSC (adjusted R 2 of 0.77) in 2016, 0.67% SSC in 2017 (adjusted R 2 of 0.72), and 0.54% SSC in 2018 (adjusted R 2 of 0.89) compared to 0.62% SSC (adjusted R 2 of 0.81) over all three years combined (Figure 2). A look at the RMSEP and the adjusted prediction R 2 is usually not sufficient to determine the presence of systematic errors in the PLSR model. Fitting a linear regression to the prediction values resulted in a slight deviation from a diagonal line indicating the presence of small but negligible systematic errors (bias). More specifically, the multi-year PLSR model possibly underestimates or overestimates the frequency of particularly low or high SSC values in 2017 and 2018. Residual plots (data not shown) suggest a slightly heteroscedastic structure. In terms of model performance over all years, the multi-year PLSR model appears to yield reasonable predictions with minor restrictions.

2.1.2. Year-Dependent Calibration Model Transfer to Other Years

Independent PLSR calibration models for each study year were calculated to check the transferability and adequacy of year-dependent PLSR model to other study years (500 observations in the calibration data set, 100 observations in the validation data set). Results suggest that yearly calibrated models perform best for scans taken within the same year (Table 1).
The multi-year PLSR model predictions (calibration data set) based on reference samples measured from all years in equal parts show a slight increase in the mean RMSEP values compared to the yearly calibrated prediction models. The calibration model based on 2016 data gives a RMSEP and standard deviation (sd) in bracket value of 0.61 (+/−0.05) % SSC for 2016 validation data compared to a RMSEP value of 0.68 (+/−0.05) % SSC for the multi-year model (2016–2018). In 2017 and 2018 similar results were obtained. Yet, all yearly calibrated models performed poorly in other years. Standard deviations were considerably higher when using validation data from years that were not part of the calibration data sets. This suggests two conclusions: first, yearly calibrated models tend to overfit the data and can hardly be used as general PLSR models. Second, as the range in SSC values in 2018 was wider than in 2016 or 2017, this increased range led to poor model performance for the 2016 and 2017 models which were not trained for particularly low or high SSC values.

2.2. Evaluation of the Training Data

2.2.1. Effect of Sample Sizes

To assess the effects of different calibration sample sizes, 500 repeated Monte Carlo simulation runs were performed. For each simulation run a stratified random sample with n = 20, 30, 40, 50, 75, 100, 125, 150, 175, 200, 300, 400 or 500 calibration measurements per year (corresponding to 60 to 1500 calibration measurements in total) was used as a calibration data set to calibrate a multi-year (2016–2018) PLSR model. The RMSEP was determined using a validation set with n = 200 observations for each year. The mean RMSEP of these Monte Carlo simulation runs and the standard deviations thereof are shown in Figure 3. PLSR models based only on 20 calibration scans per year show a mean RMSEP of 0.93 (+/−0.18 sd) % SSC in 2016, 0.91 (+/−0.17 sd) % SSC in 2017 and 0.95 (+/−0.28 sd) % SSC in 2018. PLSR models based on 100 calibration measurements per year result in a mean RMSEP of 0.70 (+/−0.06 sd) % SSC in 2016, 0.70 (+/−0.06 sd) % SSC in 2017 and 0.60 (+/−0.07 sd) % SSC in 2018. Based on 500 calibration measurements per year, a mean RMSEP of 0.67 (+/−0.05 sd) % SSC in 2016, 0.67 (+/−0.04 sd) % SSC in 2017 and 0.56 (+/−0.04 sd) % SSC in 2018 are obtained. As the 2018 calibration measurements include a higher proportion of scans taken during early fruit development, the larger range of SSC values available results in a supposedly lower RMSEP value. This fact, to a large extent, explains the apparent model improvement in 2018. Besides, the accuracy of laboratory calibration work may also have improved in the third year of the study. No differences in mean RMSEP values were detected for scans conducted at different temperatures in the laboratory with Monte Carlo simulation (∼10, 20 or 30 C).

2.2.2. Effect of the Data Range

To assess the prediction quality at certain SSC values the RMSEP was also determined with 500 repeated Monte Carlo simulation (1200 observations in the calibration data, 300 observations in the validation data set). The multi-year model (2016–2018) was split into values of <9, 9–10, 10–11, 11–12, 12–13 and >13% SSC and resulted in a RMSEP of 0.59 (+/−0.08 sd), 0.55 (+/−0.05 sd), 0.57 (+/−0.04 sd), 0.65 (+/−0.06 sd), 0.73 (+/−0.08 sd) and 0.82 (+/−0.09 sd) % SSC, respectively. The analysis of the mean RMSEP shows signs of heteroscedasticity with a worse PLSR prediction for lower and especially higher SSC values. However, there was a smaller calibration data set at the beginning and end of fruit ripening.
In all years, mean RMSEP values were very high for a low number of calibration values and decrease rapidly up to n = 100 reference values per year with only a slight additional improvement in model adequacy as shown in the RMSEP for n > 100 calibration values per year. Standard deviations for the RMSEP values are rather high for small calibration sets and decrease with increasing sample size. This suggests that model accuracy might appear high in some cases “by chance”. This has two implications: first, the number of calibration measurements can be limited to a rather small number of observations per year and a reduced number of calibration measurements can be used in future experiments. Second, a certain prediction error seems to be inevitable with a given PLSR model no matter how many calibration measurements are available.

2.2.3. Effect of Refractometer Errors

In this simulation procedure, additional normally distributed noise with a mean value of m = 0% SSC and standard deviations of s = 0, 0.1, 0.2, 0.3, 0.4, 0.5, 0.75, 1.0, 2.0% SSC was added to the laboratory reference values for each simulation run. The standard deviation of s = 0% SSC corresponds to the standard PLSR calibration model. All the simulation runs show highly robust PLSR models. Even moderate and substantial laboratory errors only increase the RMSEP values slightly (Figure 4). The RMSEP in 2016 increased from 0.67 (+/−0.05 sd) % SSC to 0.74 (+/−0.06 sd) % SSC with an additional laboratory error and a standard deviation of 2.0% SSC. The simulations show similar results for 2017 and 2018.

2.3. Use of LME Models to Describe SSC Accumulation

At any given time, the SSC values follow a linear trend with a normal distribution and a slight increase in variance during fruit development and maturation (Figure 1b). A breakdown by different orchard factors/experimental management treatments for tree sector, crop load, cell division temperature and calcium treatment for the 2018 season mainly suggests a clear effect of sector position, lower effects of crop load and temperature and no effect of calcium treatment (data not shown). These trends are consistent with the observations made for the 2016 and 2017 seasons. Three different models are considered to evaluate whether the specification of random effects and interaction terms yields a substantial improvement in model quality.
Model 1 is a fully-specified LME model with tree-specific and fruit-specific random intercepts, year, weeks after full bloom (WAFB), sector position, crop load, cell division temperature and calcium treatment as fixed effects and interactions between time and the above listed main effects. Model 1 is a fully-specified LME:
y i j k = β 0 + u j , 0 + u i , 0 + β 1 t k + u j , 1 t k + u i , 1 t k + β X X i + β Y X i t k + e i j k
with a population intercept β 0 , a population parameter β 1 for modelling a time-dependent linear accumulation trend, a time-independent population parameter β X for any other fixed effects such as treatment effects and sector position, a time-dependent parameter β Y for these fixed effects, a random tree-specific intercept u j , 0 with u j , 0 N ( 0 , σ u j , 0 2 ) , a random tree-specific slope u j , 1 with u j , 1 N ( 0 , σ u j , 1 2 ) , a random fruit-specific intercept u i , 0 N ( 0 , σ u i , 0 2 ) , a random fruit-specific slope u i , 1 N ( 0 , σ u i , 1 2 ) and a random error term e i j k N ( 0 , σ e 2 ) . Model 2 is nested in Model 1 without interaction terms ( β Y = 0 ) :
y i j k = β 0 + u j , 0 + u i , 0 + β 1 t k + u j , 1 t k + u i , 1 t k + β X X i + e i j k
Model 3 corresponds to a fully-specified linear regression model without random effects (with u j , 0 = u i , 0 = u j , 1 = u i , 1 = 0 ) and serves as a baseline model to compare the effects of the LME modelling. Model 3 is a standard linear regression model without random effects:
y i j k = β 0 + β 1 t k + β X X i + β Y X i t k + e i j k
The fixed time and treatment effects for any given fruit i from tree j were calculated as:
E ( y i j k | i , j ) = β 0 + β 1 t k + β X X i + β Y X i t k
Using Akaike information criterion (AIC) and Bayesian information criterion (BIC) as indicators of goodness of fit, both LME models (Models 1–2) are clearly favourable over the standard linear regression (Model 3). Based on 17,004 observations, Model 1, 2 and 3 result in an AIC of 27,896, 29,154 and 35,708 and a BIC of 28,074, 29,255 and 35,871, respectively. The root mean square error (RMSE) was 0.48, 0.50 and 0.69% SSC for Model 1, 2 and 3, respectively. Further information about regression coefficients and time-dependent treatment effect of Model 1 can be obtained from Table 2 and Table 3. Of these two mixed-effects models, Model 1 seems to give the best fit which suggests that the specification of interaction terms is appropriate to reflect the spatial and temporal dependencies between observations. Model 1 shows highly significant effects for year, WAFB, sector crop load and cell division temperature and their interactions. No significant effect is observed for the calcium treatments.
The time-dependent effects of different treatment levels are displayed in Table 2. Fruit from the light crop load treatment show increasing SSC values throughout fruit development. At the end of cell division (∼40 DAFB), only small differences between different crop loads (range of 0.09% SSC between light and high crop load) and cell division temperature regimes (0.16% SSC between cold and warm temperatures) can be observed. Close to harvest (140 DAFB), relatively large differences in SSC can be seen between different levels of tree sector (range of 0.89% SSC between bottom and top sector) compared to only minor differences in SSC for different crop loads (range of 0.42% SSC between light and high crop load) and only negligible effects for different temperature regimes (range of 0.12% SSC).

2.4. Sensitivity Analysis of the Experimental Setup

Data collection in large orchard trial designs is labour and cost intensive. Therefore, it is of interest to investigate whether reduced sample sizes lead to different results. SSC values were derived from the multi-year PLSR calibration model based on laboratory reference measurements. The influence of reduced sample sizes and unbiased laboratory measurement errors of 1.0% SSC on research results was investigated with LME Model 1. Different simulation settings are presented in Table 3.
Three settings are discussed and the standard Model respectively altered: Model A shows the effects of a reduced number of trees (100 trees within three years, same number of fruit per tree). Model B is based on a reduced number of fruit (500 fruit within three years, same number of trees). While the first setting with less experimental trees makes if possible to have additional experiments in the same orchard block, the second setting reduces the number of working hours per tree. Additionally, Model C shows the effect of an increased but unbiased measurement error (additional white noise of 1.0% SSC) in the SSC measurements. As expected, Models A–C show increased standard deviations of all estimates compared to the multi-year SSC model due to a reduced sample size (for Models A and B) or an increased measurement error (Model C). In Model C, the cell division temperature and calcium treatment are not marked as “statistically significant” due to increased measurement errors. In most cases the effect of sample size remains comparable to the multi-year SSC model.
In summary, these simulations show the possibility of reduced sample sizes when the focus is on treatments with large effects. In order to detect small differences between different treatments, large sample sizes are still required, especially in the presence of measurement errors due to increased t-values of the estimates.

2.5. A Practical Comparison of Spectral and Conventional Laboratory Methods to Determine SSC at Harvest

Traditional destructive laboratory samples for SSC were taken at harvest from eight apples per tree sector, treatment (2 or 3× levels) and repetition (3×). At the same time, the last non-destructive scans in the orchard were taken from an independent batch of approx. seven apples, scanned and postprocessed with the yearly calibrated PLSR model. Mean destructive laboratory values for 2016, 2017, 2018 and all study years from 2016 to 2018 were 11.3, 11.1, 12.3 and 11.7% SSC, respectively (2016: +/−0.52 sd, n = 63; 2017: +/−0.55 sd, n = 9; 2018: +/−0.53 sd, n = 50). The non-destructive samples were 11.8, 10.7, 12.0 and also 11.7% SSC, respectively (2016: +/−0.85 sd, n = 63; 2017: +/−0.70 sd, n = 27; 2018: +/−0.56 sd, n = 68). There is a higher variance for the PLSR modelled SSC values as compared to the laboratory values. The mean difference of each treatment level for the two methods is 0.5% SSC for all study years. The obtained values from the two approaches were not biased.

3. Discussion

Varying weather conditions during the three study years resulted in different SSC values at harvest which is in accordance with the literature [34,35,36]. Using time-series data in the orchard offers the possibility to see a linear carbohydrate development in the form of SSC accumulation over time. For 2017, the severe frost year, the SSC increase showed a larger variance (Figure 1a), as was also seen for fruit growth (Figure 1b). Non-destructive technologies can provide researchers with new tools to study fruit physiology or offer the possibility to use these values in digital orchard management information systems to predict and manage fruit quality, as seen for fruit diameter [37]. The effects of field treatments and physiological differences were directly related to the developmental stage of the fruit. Differences in tree sector position and crop load [1,38] caused increasingly large differences in SSC during fruit development and negligible differences due to early season temperature ( Δ 2 C to ambient). Differences in sector position influence SSC early in the season whereas crop load effects increase steadily during fruit development.
Up until now the practical application of non-destructive scanning in apple research experiments has been restricted due to the intensive amount of laboratory work necessary to obtain reference samples and to the unknown precision of PLSR calibration models in the orchard. The results of the PLSR calibration only partially depend on the number and precision of the reference laboratory measurements. The results suggest that special emphasis should be placed on scanning fruit at low and high SSC values at the beginning and end of each season to cover a wider range of possible SSC values within a particular growing season. These results have some practical implications and suggest that even a considerably reduced sample size (100 samples) leads to comparable results, although the standard deviations of the estimates increase with reduced sample size. It suggests that repeated laboratory reference measurements of the same fruits to increase the accuracy of reference values lead to almost negligible improvements of the PLSR calibration models. These simulation results are consistent with standard results from statistical measurement error theory for response variables [39]. Moreover, in future experimental designs a reduced number of field scans would be sufficient to detect SSC differences between the treatments. A classical experimental field design with blocks and repetitions did not play a role in the LME modelling, which relaxes some limitations of the classical variance analysis framework and provides a more flexible way to adapt to temporal, spatial and tree-specific dependencies. A precision horticulture approach beyond research trials to monitor fruit SSC development on large sample numbers aligned to orchard structure should be possible.
The accuracy and robustness of the PLSR models was examined in great detail and only showed minor limitations to their broader use for our purposes. Yearly calibrated models cannot be generalised to other years, but multi-year models can be used for the same orchard and cultivar as was also seen in Peirs et al. (2003) [21]. The practical comparison between all laboratory based destructive measurements and the non-destructive orchard SSC data collection showed that the independent apple selection was unbiased and for the apple cultivar ‘Braeburn’ there was no difference between the two methods for determining SSC values at harvest in the orchard. In the future, however, new developments with model transfer methodology [40] together with neural networks or other ‘big data’ applications may facilitate the wider use of non-destructive sensor based SSC predictions for apples.
Our results may not be generalised to other apple cultivars or fruit species and to other sites or other climate regimes. However, since all effects are comparatively large and consistent with a literature review, additional measurements would probably confirm the overall effects. Analyses of dry matter content which can also be obtained by PLSR models were not considered in this study. As the number of non-destructive sensors available for horticultural practice and research is expected to increase in the coming years, longitudinal data will be available in ever greater quantities. The collaboration of horticultural science, computer science and statistics will avoid the collection of data as an end in itself and allow for new insights into currently hidden patterns of fruit physiology and development.

4. Materials and Methods

4.1. Experimental Setup

This research took place at the Kompetenzzentrum Obstbau-Bodensee (47 46′01.9″ N 9 33′23.3″ E) in the Lake Constance region of Southwest Germany using the apple cultivar ‘Braeburn’ Malus domestica. A randomised field design with treatments of crop load (light, standard, heavy), calcium spraying (with, without) and cell division temperature (ambient, Δ + 2 and Δ − 2 C) were used. Each tree was divided into three sectors of ∼1.25 m height each for the bottom, middle and top. Apple phenological growth stages were recorded following the BBCH code scheme [41]. The experimental design (treatments and scanning number/frequency) varied during the different study years. For a detailed description of the field experiments see [33,42].

4.2. SSC Sampling

Around June drop, one representative fruit per tree and sector was selected, marked and repeatedly measured (scanned) until harvest. Fruit were scanned on the equatorial and sun side with a handheld portable Vis/NIR device (F-750, Felix Instruments, Camas, WA, USA). The device had a 3 nm spectral sampling over a 310–1100 nm spectral window, a spectral resolution of 8–13 nm and corrected each scan for background daylight. The spectral range of 729–975 nm was used in the PLSR models to predict SSC. Fruit were replaced by a similar nearby fruit, if the fruit was lost or was not representative.
Orchard sampling was performed weekly in 2016 between 15 August and 16 October for n = 198 fruit from n = 33 trees. A total of n = 3994 scans were performed. In 2017, sampling took place weekly between 3 August and 30 October for n = 603 fruit from n = 96 trees. A total of n = 5957 scans were made. In 2018, n = 473 fruit from n = 146 trees were measured biweekly between 6 June and 25 October. In total, n = 7087 scans were recorded. In 2018, data acquisition took place on a daily basis for 120–180 DAFB and SSC scanning started at 50 DAFB.

4.3. PLSR Models

Reference measurements combined both destructive wet chemistry results and non-destructive spectral scans. A sample of n = 30 reference fruit were taken regularly over the fruit development and maturation periods from nearby trees in the same block and around the same field at scanning time to ensure the transferability of the calibration model to the SSC of sampled fruit. In total, n = 599 fruit in 2016, n = 211 fruit in 2017 and n = 333 fruit in 2018 were selected. Non-destructive spectral reference measurements were performed at different temperatures (∼10, 20, 30 C) to help adjust for temperature induced changes in hydrogen bonding [43]. The number of reference measurements is given as a total of n = 1639 observations in 2016 (529 observations at 10 C, n = 583 observations at 20 C, n = 527 observations at 30 C), in 2017 n = 631 observations (210 observations at 10 C, n = 211 observations at 20 C, n = 210 observations at 30 C) and in 2018 n = 984 observations (n = 328 observations at each temperature level). Destructive wet chemistry SSC measurements were obtained with a refractometer (Atago, Tokyo, Japan). PLSR models were postprocessed on a year- and site-specific basis.
The original models were built with the Felix model builder software (v1.3.0.177). Additional PLSR models were fitted using the R package pls [44]. Spectral data was transformed using second derivative spectra from 729 nm to 975 nm. The maximum number of principle components was set to 7 and the models were validated with leave-one-out cross validation methods. The reference data set was split into a calibration data set to train the PLSR model and a validation data set which was only used to test the prediction quality. A stratified random sample was drawn for each year to generate equal parts for all years. In total, n = 1200 observations were used as calibration data and n = 300 for validation data, if not stated otherwise. The RMSEP and adjusted prediction R 2 were used to describe the model performance and goodness of prediction. Reference measurements were taken as the longitudinal observations took place in the orchard. Therefore, we assume that the RMSEP for the validation data corresponds to the RMSEP of the SSC sampled fruit which could not be chemically analysed destructively due to the longitudinal structure of the study.

4.4. Monte Carlo Simulations

Monte Carlo simulations were used to assess PLSR model sensitivity to changes in input parameters and effects of sample size [45]. Measurement accuracy using standard laboratory analyses was simulated with repeated random samples. For simulations a modified and randomly sampled calibration set was generated without replacement. Reference measurements were split into a calibration data set with 1200 observations and a validation data set with 300 observations. The modified calibration sets were used in an automatic Monte Carlo simulation process to calculate the RMSEP and adjusted R 2 values for each setting. Each setting was repeated n = 100 times to calculate mean RMSEP values and standard deviations.
For sample size analyses, calibration sets with a reduced sample size were sampled for each year and all years combined. For laboratory errors, calibration sets were sampled for each year and all years combined. Laboratory errors were assumed to be unbiased and normally distributed. Additional normally distributed error terms (white noise) with different magnitudes were added afterwards.

4.5. Longitudinal LME Models

LME models (hierarchical regression models, nested linear models, multi-level regression models) are a subset of generalized regression methods to analyse repeated time-correlated and cluster-correlated observations [46]. DAFB and WAFB were used for time-dependent analyses in each year.
All time-correlated observations on a single fruit were part of a natural cluster of observations which shared the same fruit-specific and tree-specific characteristics. A hierarchical (nested) data structure was therefore applied. The LME model combined population-specific and subject-specific (spatial variation in the orchard) random effects.
The specification of the random effects needs special consideration as the longitudinal data structure makes two adaptions necessary: first, a random effect is given by a fruit-specific dependency as these observations are correlated over time (fruit-specific intercept and slope). Second, a random effect is necessary due to tree-specific dependency for all fruit from the same tree. Therefore, with respect to the natural dependency of observations from the same tree, a tree-specific intercept and slope were specified. Two random effects were modelled in addition to fixed effects which affect all fruit simultaneously. Details are specified in the previous sections. Modelling was done with the R package lme4 which provided various functions for fitting, analysing and evaluating mixed-effects models in a linear, generalised linear and nonlinear framework [47,48]. The restricted maximum likelihood method and full maximum likelihood method were used to estimate parameters. The R package lmerTest was used to approximate the degrees of freedom and calculate p-values for mixed-effects models using Satterthwaite’s method [49]. Yet no emphasis is placed on the interpretation of these p-values, as there is an unresolved statistical discussion about their theoretical applicability [50]. Coefficients of fixed effects with a t-value (ratio of estimate and its standard deviation) of less than −2 or greater than 2 were considered statistically significant. Model choice was based on the AIC and the BIC both of which use the log-likelihood ratio and describe model quality by adjusting the goodness of fit with a penalization term for model complexity [51,52]. RMSEP was used to compare model predictions and observations.

4.6. Mann–Whitney–Wilcoxon Test

A Mann–Whitney–Wilcoxon test was conducted in R to compare the modelled SSC values from the field scans based on the PLSR modelsto destructively measured fruit in the laboratory. Refractometer values showed a normal distribution, whereas PLSR modelled SSC were not normally distributed and Mann–Whitney–Wilcoxon test was used. For the laboratory samples the top half of a fruit batch of eight apples was mixed in the laboratory with a conventional fruit blender.

5. Conclusions

In summary, the non-destructive temporal development of SSC accumulation could contribute new insights into apple fruit carbohydrate physiology. The present study linked an in-depth statistical analysis of large data sets with horticultural knowledge in order to test the application of ‘Braeburn’ SSC prediction models with a special focus on model transferability and accuracy.
  • In terms of model performance over all years, the multi-year PLSR model appeared to be reasonable with minor restrictions for especially low and high SSC predictions. However, independent yearly calibration models performed best for the same year.
  • A sample size of n = 100 fruit for a yearly PLSR model with a wide range of SSC values seems to be sufficient.
  • Differences in sector position and crop load resulted in increasingly large differences in SSC during fruit development and offer the possibility for further physiological studies.

Author Contributions

Conceptualisation, K.B. and R.J.M.; methodology, R.J.M. and K.B. and D.S.; software, D.S. and R.J.M. and K.B.; validation, K.B. and D.S. and R.J.M.; formal analysis, D.S.; investigation, K.B. and R.J.M.; data curation, K.B. and R.J.M.; writing—original draft preparation, K.B. and D.S.; writing—review and editing, R.J.M. and P.B.; visualisation, K.B.; supervision, R.J.M. and P.B.; project administration, K.B.; funding acquisition, R.J.M. All authors have read and agreed to the published version of the manuscript.

Funding

This project was supported by the Federal Ministry of Food and Agriculture (BMEL) based on a decision of the Parliament of the Federal Republic of Germany via the Federal Office for Agriculture and Food (BLE) under the innovation support programme.

Data Availability Statement

All data reported here is available from the authors upon request.

Acknowledgments

We would like to thank the Kompetenzzentrum Obstbau-Bodensee orchard and lab staff and the yearly interns for extensive help in conducting this study and the ‘BigApple’ project team.

Conflicts of Interest

The authors declare no conflict of interest. The funder had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.

Abbreviations

The following abbreviations are used in this manuscript:
AICAkaike information criterion
ANOVAAnalysis of variance
BICBayesian information criterion
DAFBDays after full bloom
LME modelLinear mixed-effects model
NIRNear-infrared
PLSRPartial least squares regression
RMSEPRoot-mean-square error of prediction
sdStandard deviation
SSCSoluble solids content
VisVisible
WAFBWeeks after full bloom

References

  1. Meland, M. Effects of different crop loads and thinning times on yield, fruit quality, and return bloom in Malus x domestica Borkh. ‘Elstar’. J. Hortic. Sci. Biotechnol. 2009, 84, 117–121. [Google Scholar] [CrossRef]
  2. Yuri, J.; Talice, J.G.; Verdugo, J.; del Pozo, A. Responses of fruit growth, quality, and productivity to crop load in apple cv. Ultra Red Gala/MM111. Sci. Hortic. 2011, 127, 305–312. [Google Scholar] [CrossRef]
  3. Ferguson, I.; Volz, R.; Woolf, A. Preharvest factors affecting physiological disorders of fruit. Postharvest Biol. Technol. 1999, 15, 255–262. [Google Scholar] [CrossRef]
  4. Rabus, C.; Streif, J. Effect of various preharvest treatments on the development of internal browning in ‘Braeburn’ apples. Acta Hortic. 2000, 518, 151–157. [Google Scholar] [CrossRef]
  5. Hatoum, D.; Buts, K.; Hertog, M.L.A.T.M.; Geeraerd, A.H.; Schenk, A.; Vercammen, J.; Nicolaï, B.M. Effects of pre- and postharvest factors on browning in Braeburn. Hortic. Sci. 2014, 41, 19–26. [Google Scholar] [CrossRef] [Green Version]
  6. Ford, E.M. Effect of post-blossom environmental conditions on fruit composition and quality of apple. Commun. Soil Sci. Plant Anal. 1979, 10, 337–348. [Google Scholar] [CrossRef]
  7. Warrington, I.; Fulton, T.; Halligan, E.; de Silva, H. Apple Fruit Growth and Maturity are Affected by Early Season Temperatures. J. Am. Soc. Hortic. Sci. 1999, 124, 468–477. [Google Scholar] [CrossRef] [Green Version]
  8. Hatoum, D.; Hertog, M.L.A.T.M.; Geeraerd, A.H.; Nicolai, B.M. Effect of browning related pre- and postharvest factors on the ‘Braeburn’ apple metabolome during CA storage. Postharvest Biol. Technol. 2016, 111, 106–116. [Google Scholar] [CrossRef]
  9. Seifert, B.; Zude, M.; Spinelli, L.; Torricelli, A. Optical properties of developing pip and stone fruit reveal underlying structural changes. Physiol. Plant. 2014, 153, 327–336. [Google Scholar] [CrossRef] [Green Version]
  10. Walsh, K.B.; McGlone, V.A.; Han, D.H. The uses of near infra-red spectroscopy in postharvest decision support: A review. Postharvest Biol. Technol. 2020, 163, 111139. [Google Scholar] [CrossRef]
  11. Beers, R.V.; Aernouts, B.; Watté, R.; Schenk, A.; Nicolaï, B.; Saeys, W. Effect of maturation on the bulk optical properties of apple skin and cortex in the 500–1850 nm wavelength range. J. Food Eng. 2017, 214, 79–89. [Google Scholar] [CrossRef]
  12. Fan, S.; Zhang, B.; Li, J.; Huang, W.; Wang, C. Effect of spectrum measurement position variation on the robustness of NIR spectroscopy models for soluble solids content of apple. Biosyst. Eng. 2016, 143, 9–19. [Google Scholar] [CrossRef]
  13. Abasi, S.; Minaei, S.; Jamshidi, B.; Fathi, D.; Khoshtaghaza, M.H. Rapid measurement of apple quality parameters using wavelet de-noising transform with Vis/NIR analysis. Sci. Hortic. 2019, 252, 7–13. [Google Scholar] [CrossRef]
  14. Cozzolino, D.; Cynkar, W.U.; Shah, N.; Smith, P. Multivariate data analysis applied to spectroscopy: Potential application to juice and fruit quality. Food Res. Int. 2011, 44, 1888–1896. [Google Scholar] [CrossRef]
  15. Zude-Sasse, M.; Hashim, N.; Hass, R.; Polley, N.; Regen, C. Validation study for measuring absorption and reduced scattering coefficients by means of laser-induced backscattering imaging. Postharvest Biol. Technol. 2019, 153, 161–168. [Google Scholar] [CrossRef]
  16. Lohner, S.; Biegert, K.; Hohmann, A.; McCormick, R.; Kienle, A. Determining the optical properties of apple tissue and their dependence on physiological and morphological characteristics during maturation. Part 1: Spatial frequency domain imaging. Postharvest Biol. Technol. in press.
  17. Aggelopoulou, K.D.; Wulfsohn, D.; Fountas, S.; Gemtos, T.A.; Nanos, G.D.; Blackmore, S. Spatial variation in yield and quality in a small apple orchard. Precis. Agric. 2009, 11, 538–556. [Google Scholar] [CrossRef]
  18. Manfrini, L.; Grappadelli, L.C.; Morandi, B.; Losciale, P.; Taylor, J.A. Innovative approaches to orchard management: Assessing the variability in yield and maturity in a ‘Gala’ apple orchard using a simple management unit modeling approach. Eur. J. Hortic. Sci. 2020, 85, 211–218. [Google Scholar] [CrossRef]
  19. Elgar, H.J.; Lallu, N.; Watkins, C.B. Harvest Date and Crop Load Effects on a Carbon Dioxide–related Storage Injury of ‘Braeburn’ Apple. HortScience 1999, 34, 305–309. [Google Scholar] [CrossRef] [Green Version]
  20. Lordan, J.; Francescatto, P.; Dominguez, L.I.; Robinson, T.L. Long-term effects of tree density and tree shape on apple orchard performance, a 20 year study—Part 1, agronomic analysis. Sci. Hortic. 2018, 238, 303–317. [Google Scholar] [CrossRef]
  21. Peirs, A.; Tirry, J.; Verlinden, B.; Darius, P.; Nicolai, B.M. Effect of biological variability on the robustness of NIR models for soluble solids content of apples. Postharvest Biol. Technol. 2003, 28, 269–280. [Google Scholar] [CrossRef]
  22. Palmer, J.W.; Giuliani, R.; Adams, H.M. Effect of Crop Load on Fruiting and Leaf Photosynthesis of ‘Braeburn’/M.26 Apple Trees. Tree Physiol. 1997, 17, 741–746. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  23. Gezan, S.A.; Carvalho, M. Analysis of Repeated Measures for the Biological and Agricultural Sciences. In Applied Statistics in Agricultural, Biological, and Environmental Sciences; American Society of Agronomy, Crop Science Society of America, and Soil Science Society of America, Inc.: Madison, WI, USA, 2018; pp. 279–297. [Google Scholar] [CrossRef]
  24. Ketelaere, B.D.; Lammertyn, J.; Molenberghs, G.; Nicolai, B.; Baerdemaeker, J.D. Statistical models for analyzing repeated quality measurements of horticultural products. Math. Biosci. 2003, 185, 169–189. [Google Scholar] [CrossRef]
  25. Godoy, C.; Monterubbianesi, G.; Tognetti, J. Analysis of highbush blueberry (Vaccinium corymbosum L.) fruit growth with exponential mixed models. Sci. Hortic. 2008, 115, 368–376. [Google Scholar] [CrossRef]
  26. Johnson, W.; Balakrishna, N.; Griffiths, P.L. Modeling physical growth using mixed effects models. Am. J. Phys. Anthropol. 2012, 150, 58–67. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  27. Bastogne, T.; Samson, A.; Vallois, P.; Wantz-Mézières, S.; Pinel, S.; Bechet, D.; Barberi-Heyob, M. Phenomenological modeling of tumor diameter growth based on a mixed effects model. J. Theor. Biol. 2010, 262, 544–552. [Google Scholar] [CrossRef] [Green Version]
  28. Weisberg, S.; Spangler, G.; Richmond, L.S. Mixed effects models for fish growth. Can. J. Fish. Aquat. Sci. 2010, 67, 269–277. [Google Scholar] [CrossRef] [Green Version]
  29. Lakso, A.N.; Grappadelli, L.C.; Barnard, J.; Goffinet, M.C. An expolinear model of the growth pattern of the apple fruit. J. Hortic. Sci. 1995, 70, 389–394. [Google Scholar] [CrossRef]
  30. Stanley, C.J.; Tustin, D.S.; Lupton, G.B.; Mcartney, S.; Cashmore, W.M.; Silva, H.N.D. Towards understanding the role of temperature in apple fruit growth responses in three geographical regions within New Zealand. J. Hortic. Sci. Biotechnol. 2000, 75, 413–422. [Google Scholar] [CrossRef]
  31. Tijskens, L.M.M.; Unuk, T.; Okello, R.C.O.; Wubs, A.M.; Šuštar, V.; Šumak, D.; Schouten, R.E. From fruitlet to harvest: Modelling and predicting size and its distributions for tomato, apple and pepper fruit. Sci. Hortic. 2016, 204, 54–64. [Google Scholar] [CrossRef]
  32. Herold, B.; Truppel, I.; Zude, M.; Geyer, M. Spectral Measurements on ‘Elstar’ Apples during Fruit Development on the Tree. Biosyst. Eng. 2005, 91, 173–182. [Google Scholar] [CrossRef]
  33. Osinenko, P.; Biegert, K.; McCormick, R.J.; Göhrt, T.; Devadze, G.; Streif, S. Application of non-destructive sensors and big-data analysis to predict physiological storage disorders and fruit firmness in ‘Braeburn’ apples. Comput. Electron. Agric. 2021, in press. [Google Scholar]
  34. Sugiura, T.; Ogawa, H.; Fukuda, N.; Moriguchi, T. Changes in the taste and textural attributes of apples in response to climate change. Sci. Rep. 2013, 3. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  35. Warmund, M.R.; Starbuck, C.; Kadir, S. Changes in fruit quality parameters of ‘Jonathan Rasa’ and ‘Delicious Flanagan’ apples in response to elevated temperatures. Trans. Kans. Acad. Sci. 2007, 110, 259–267. [Google Scholar] [CrossRef]
  36. Toivonen, P.M.; Lannard, B. Dry matter content association with time of on-tree maturation, quality at harvest, and changes in quality after controlled atmosphere storage for ‘Royal Gala’ apples. Can. J. Plant Sci. 2020, 1–9. [Google Scholar] [CrossRef]
  37. Li, M.; Chen, M.; Zhang, Y.; Fu, C.; Xing, B.; Li, W.; Qian, J.; Li, S.; Wang, H.; Fan, X.; et al. Apple Fruit Diameter and Length Estimation by Using the Thermal and Sunshine Hours Approach and Its Application to the Digital Orchard Management Information System. PLoS ONE 2015, 10, e0120124. [Google Scholar] [CrossRef]
  38. Anthony, B.; Serra, S.; Musacchi, S. Optimizing Crop Load for New Apple Cultivar: “WA38”. Agronomy 2019, 9, 107. [Google Scholar] [CrossRef] [Green Version]
  39. Sepanski, J.H. On a repeated-measurement model with errors in dependent variable. Statistics 2001, 35, 97–112. [Google Scholar] [CrossRef]
  40. Mishra, P.; Roger, J.M.; Rutledge, D.N.; Woltering, E. Two standard-free approaches to correct for external influences on near-infrared spectra to make models widely applicable. Postharvest Biol. Technol. 2020, 170, 111326. [Google Scholar] [CrossRef]
  41. Meier, U.; Graf, H.; Hack, H.; Hess, M.; Kennel, W.; Klose, R.; Mappes, D.; Seipp, D.; Stauss, R.; Streif, J.; et al. Phänologische Entwicklungsstadien des Kernobstes (Malus domestica Borkh. und Pyrus communis L.), des Steinobstes (Prunus-Arten), der Johannisbeere (Ribes-Arten) und der Erdbeere (Fragaria x ananassa Duch.). Nachrichtenbl. Deut. Pflanzenschutzd. 1994, 46, 141–153. [Google Scholar]
  42. McCormick, R.J.; Biegert, K.; Streif, J. Occurrence of physiological browning disorders in stored ‘Braeburn’ apples as influenced by orchard and weather conditions. Postharvest Biol. Technol. 2021, in press. [Google Scholar]
  43. Golic, M.; Walsh, K.B. Robustness of calibration models based on near infrared spectroscopy for the in-line grading of stonefruit for total soluble solids content. Anal. Chim. Acta 2006, 555, 286–291. [Google Scholar] [CrossRef]
  44. Mevik, B.H.; Wehrens, R. The pls Package: Principal Component and Partial Least Squares Regression in R. J. Stat. Softw. 2007, 18. [Google Scholar] [CrossRef] [Green Version]
  45. Brodský, L.; Vašát, R.; Klement, A.; Zádorová, T.; Jakšík, O. Uncertainty propagation in VNIR reflectance spectroscopy soil organic carbon mapping. Geoderma 2013, 199, 54–63. [Google Scholar] [CrossRef]
  46. Fitzmaurice, G.M.; Laird, N.M.; Ware, J.H. Applied Longitudinal Analysis; John Wiley & Sons, Inc.: Hoboken, NJ, USA, 2011. [Google Scholar] [CrossRef]
  47. R Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2019. [Google Scholar]
  48. Bates, D.; Mächler, M.; Bolker, B.; Walker, S. Fitting Linear Mixed-Effects Models Using lme4. J. Stat. Softw. 2015, 67. [Google Scholar] [CrossRef]
  49. Kuznetsova, A.; Brockhoff, P.B.; Christensen, R.H.B. lmerTest Package: Tests in Linear Mixed Effects Models. J. Stat. Softw. 2017, 82, 1–26. [Google Scholar] [CrossRef] [Green Version]
  50. Luke, S.G. Evaluating significance in linear mixed-effects models in R. Behav. Res. Methods 2016, 49, 1494–1502. [Google Scholar] [CrossRef] [PubMed]
  51. Akaike, H. Information Theory and an Extension of the Maximum Likelihood Principle. In Springer Series in Statistics; Springer: New York, NY, USA, 1998; pp. 199–213. [Google Scholar] [CrossRef]
  52. Schwarz, G. Estimating the Dimension of a Model. Ann. Stat. 1978, 6, 461–464. [Google Scholar] [CrossRef]
Figure 1. Soluble solids content (SSC) accumulation derived from the yearly calibrated (Section 2.1.1) partial least squares regression models (a) and fruit diameter growth (b) for the three study years and all treatments are shown. Mean values per measurement day are plotted as solid line, single values as grey dots and +/−standard deviations as black vertical bars.
Figure 1. Soluble solids content (SSC) accumulation derived from the yearly calibrated (Section 2.1.1) partial least squares regression models (a) and fruit diameter growth (b) for the three study years and all treatments are shown. Mean values per measurement day are plotted as solid line, single values as grey dots and +/−standard deviations as black vertical bars.
Plants 10 00302 g001
Figure 2. Regressions between laboratory measured and modelled % SSC. The calibration model was trained on 2016–2018 data. This model was thereafter evaluated with an independent validation data set for all years together and separately. Regression lines are plotted for each validation data set and adjusted prediction R 2 is displayed.Diagnostic plots
Figure 2. Regressions between laboratory measured and modelled % SSC. The calibration model was trained on 2016–2018 data. This model was thereafter evaluated with an independent validation data set for all years together and separately. Regression lines are plotted for each validation data set and adjusted prediction R 2 is displayed.Diagnostic plots
Plants 10 00302 g002
Figure 3. Root mean square error of prediction (RMSEP) in % soluble solids content (SSC) based on 2016, 2017 and 2018 partial least squares regression (PLSR) models to test the effect of reduced calibration sample sets. For each setting, 500 Monte Carlo simulation runs were performed. Mean and standard deviation for each point are shown and 500 observations in each calibration data set and 100 observations in each validation data set used.
Figure 3. Root mean square error of prediction (RMSEP) in % soluble solids content (SSC) based on 2016, 2017 and 2018 partial least squares regression (PLSR) models to test the effect of reduced calibration sample sets. For each setting, 500 Monte Carlo simulation runs were performed. Mean and standard deviation for each point are shown and 500 observations in each calibration data set and 100 observations in each validation data set used.
Plants 10 00302 g003
Figure 4. Root mean square error of prediction (RMSEP) in % soluble solids content (SSC) based on 2016, 2017 and 2018 partial least squares regression (PLSR) models to test for nonsystematic laboratory errors during wet chemistry analyses. For each setting, 500 Monte Carlo simulation runs were performed. Mean and standard deviation for each point are shown and 500 observations in each calibration data set and 100 observations in each validation data set used.
Figure 4. Root mean square error of prediction (RMSEP) in % soluble solids content (SSC) based on 2016, 2017 and 2018 partial least squares regression (PLSR) models to test for nonsystematic laboratory errors during wet chemistry analyses. For each setting, 500 Monte Carlo simulation runs were performed. Mean and standard deviation for each point are shown and 500 observations in each calibration data set and 100 observations in each validation data set used.
Plants 10 00302 g004
Table 1. Root mean square error of prediction (RMSEP) of % soluble solids content (SSC) to test the transferability of calibration models after 500 Monte Carlo simulation runs per point (500 observations in each calibration data set, 100 observations in each validation data set. Mean and standard deviation (in brackets) for each point are shown.
Table 1. Root mean square error of prediction (RMSEP) of % soluble solids content (SSC) to test the transferability of calibration models after 500 Monte Carlo simulation runs per point (500 observations in each calibration data set, 100 observations in each validation data set. Mean and standard deviation (in brackets) for each point are shown.
Training Data SetValidation Data Set of Respective Years
2016–2018201620172018
2016–20180.65 (0.05)0.68 (0.05)0.68 (0.04)0.57 (0.04)
20160.90 (0.16)0.61 (0.05)1.00 (0.30)1.02 (0.21)
20170.83 (0.12)0.95 (0.23)0.57 (0.04)0.90 (0.14)
20180.95 (0.10)1.19 (0.20)1.01 (0.08)0.48 (0.03)
Table 2. Time-dependent treatment effects in 2017 on % soluble solids content (SSC) accumulation at different stages of fruit development given in days after full bloom (DAFB). The baseline configuration (Base) corresponds to fruit from the bottom sector of a tree with medium crop load without alterations of cell division temperature. Displayed effects are bottom, middle, top sectors and light, standard (stand.), heavy crop load and cold, ambient (amb.), warm cell division temperature treatments. SSC values of the respective effects need to be added or subtracted to the base value.
Table 2. Time-dependent treatment effects in 2017 on % soluble solids content (SSC) accumulation at different stages of fruit development given in days after full bloom (DAFB). The baseline configuration (Base) corresponds to fruit from the bottom sector of a tree with medium crop load without alterations of cell division temperature. Displayed effects are bottom, middle, top sectors and light, standard (stand.), heavy crop load and cold, ambient (amb.), warm cell division temperature treatments. SSC values of the respective effects need to be added or subtracted to the base value.
DAFBBase in % SSCSectorCrop LoadTemperature
MiddleTopLightHeavyColdWarm
405.540.210.530.02−0.070.080.16
606.430.250.600.06−0.090.050.13
807.310.300.670.11−0.110.010.11
1008.190.340.750.16−0.13−0.020.08
1209.080.390.820.20−0.15−0.060.06
1409.960.430.890.25−0.17−0.090.03
Table 3. SSC model simulations with a reduction of sample size and different measurement errors. Multi-year model (MYM) corresponds to a linear mixed-effect model with all data from 2016–2018 included (LME Model 1), Model A was reduced to samples from 100 trees of the MYM, Model B was reduced to 500 fruit samples of the MYM and Model C had an unbiased error of 1.0% SSC added to the MYM. The estimate % SSC (standard deviation) is stated with a significance code with *** <0.001, ** <0.01, * <0.05. The number of observations, different fruit and trees is shown.
Table 3. SSC model simulations with a reduction of sample size and different measurement errors. Multi-year model (MYM) corresponds to a linear mixed-effect model with all data from 2016–2018 included (LME Model 1), Model A was reduced to samples from 100 trees of the MYM, Model B was reduced to 500 fruit samples of the MYM and Model C had an unbiased error of 1.0% SSC added to the MYM. The estimate % SSC (standard deviation) is stated with a significance code with *** <0.001, ** <0.01, * <0.05. The number of observations, different fruit and trees is shown.
Multi-Year ModelModel AModel BModel C
Observations (N)17,0047457677717,004
Fruit (N)12745405001274
Trees (N)237100211237
Intercept3.78 *** (0.084)3.72 *** (0.115)3.90 *** (0.127)3.86 *** (0.157)
Year 20171.47 *** (0.096)1.58 *** (0.136)1.45 *** (0.148)1.51 *** (0.183)
Year 20181.85 *** (0.081)2.04 *** (0.114)1.87 *** (0.125)1.85 *** (0.150)
Week0.31 *** (0.003)0.31 *** (0.005)0.31 *** (0.005)0.31 *** (0.007)
Middle sector0.11 * (0.054)0.00 (0.078)−0.03 (0.087)0.01 (0.100)
Top sector0.39 *** (0.054)0.38 *** (0.078)0.35 *** (0.089)0.423 *** (0.101)
Cold temperature0.15 (0.089)−0.02 (0.126)−0.03 (0.131)−0.05 (0.152)
Warm temperature0.21 ** (0.065)0.15 (0.090)0.26 ** (0.095)0.09 (0.107)
Light crop load−0.07 (0.064)−0.05 (0.096)−0.17 (0.094)−0.11 (0.110)
Heavy crop load−0.03 (0.069)−0.11 (0.094)−0.18 (0.101)−0.01 (0.117)
Without calcium0.17 (0.140)0.00 (0.175)0.43 (0.230)0.03 (0.227)
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Biegert, K.; Stöckeler, D.; McCormick, R.J.; Braun, P. Modelling Soluble Solids Content Accumulation in ‘Braeburn’ Apples. Plants 2021, 10, 302. https://doi.org/10.3390/plants10020302

AMA Style

Biegert K, Stöckeler D, McCormick RJ, Braun P. Modelling Soluble Solids Content Accumulation in ‘Braeburn’ Apples. Plants. 2021; 10(2):302. https://doi.org/10.3390/plants10020302

Chicago/Turabian Style

Biegert, Konni, Daniel Stöckeler, Roy J. McCormick, and Peter Braun. 2021. "Modelling Soluble Solids Content Accumulation in ‘Braeburn’ Apples" Plants 10, no. 2: 302. https://doi.org/10.3390/plants10020302

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop