Next Article in Journal
Overcoming Recalcitrance: A Review of Regeneration Methods and Challenges in Roses
Previous Article in Journal
First EST-SSRs of Helichrysum italicum (Roth) G. Don (Asteraceae) Revealed Insights into the Genetic Diversity and Population Structure in Corsica
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Precise Apple Quality Prediction Model Integrating Driving Factor Screening and BP Neural Network

1
College of Horticulture and Forestry Science, Tarim University, Alar 843300, China
2
Southern Xinjiang Special Fruit Trees High-Quality, High-Quality Cultivation and Deep Processing of Fruit Products Processing Technical National Local Joint Engineering Laboratory, Alar 843300, China
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Plants 2025, 14(24), 3795; https://doi.org/10.3390/plants14243795
Submission received: 5 November 2025 / Revised: 2 December 2025 / Accepted: 11 December 2025 / Published: 13 December 2025
(This article belongs to the Special Issue Advanced Remote Sensing and AI Techniques in Agriculture and Forestry)

Abstract

Apple fruit quality is primarily determined by Vitamin C (VC), Soluble Saccharides (SSs), Titratable Acid (TA), and the Soluble Saccharides/Titratable Acid (SSs/TA). This study aims to establish a prediction model based on the Back Propagation (BP) neural network by analyzing the intrinsic relationships between these quality indicators and the photosynthetic physiological characteristics of fruit trees, providing a new method for the precise prediction and regulation of fruit quality. Using ‘Fuji’ apple as the material, fruit quality indicators, leaf photosynthetic parameters, canopy structure indicators, and carbon–water–nitrogen metabolism indicators were systematically measured. Correlation analysis was employed to identify key influencing factors, BP neural network models with different hidden layer structures were constructed, and the optimal feature subset was screened through feature importance analysis, single-factor sensitivity analysis, and ablation experiments, ultimately establishing a simplified and efficient prediction model. Pn, Gs, SPCI, and DUE showed significant positive correlations with VC, SS, and SS/TA, whereas N and NLT were significantly positively correlated with TA content. SUE was identified as a common core driving factor for VC, SS, and SS/TA. The BP neural network demonstrated strong predictive performance for the four quality indicators, with the optimal model achieving validation set R2 values of 0.87, 0.86, 0.86, and 0.89, respectively. The simplified model developed through feature screening exhibited further improved performance: the validation set R2 for the VC prediction model increased to 0.93, while MAE and MAPE decreased by 32% and 35%, respectively. Photosynthetic characteristics and nitrogen metabolism status of the fruit trees serve as key physiological foundations determining apple quality. The quality prediction model based on the BP neural network achieved high accuracy, and its predictive performance was significantly enhanced after feature refinement, providing an effective tool for precise apple quality prediction and smart orchard management.

1. Introduction

Apple (Malus domestica Borkh.), one of the most important temperate fruits globally, has its internal fruit quality—such as vitamin C (VC) content, the composition of soluble saccharides (SSs) and titratable acid (TA), and the sugar-acid ratio (SS/TA)—directly determining the fruit’s nutritional value, flavor, and market value [1]. In recent years, with growing consumer demand for high-quality fruits, how to precisely regulate fruit quality through cultivation has become a core issue facing the apple industry. The formation of fruit quality is a complex physiological process resulting from the combined effects of genetic background, environmental factors, and cultivation management practices. Among these, photosynthesis in fruit trees and the carbon–nitrogen metabolism driven by it serve as the physiological foundation for material accumulation and the synthesis of flavor compounds [2].
Leaves are the primary organs for photosynthesis, and their photosynthetic characteristics, such as net photosynthetic rate (Pn) and stomatal conductance (Gs), directly determine the capacity for photosynthetic product formation. Concurrently, tree architecture parameters, including leaf area index (LAI), fractional interception of radiation (FIR), and the utilization efficiencies of direct and diffuse light (DUE and SUE, respectively), collectively govern the spatial distribution and utilization efficiency of light within the canopy, ultimately influencing the total amount and allocation of photosynthetic assimilates [3]. Furthermore, nitrogen, as a key constituent element of chloroplasts, enzymes, and proteins, influences not only the establishment and function of the photosynthetic apparatus but also directly participates in the anabolic pathways of quality-related components such as organic acids and amino acids. This influence is mediated through its content (N) and metabolic status, including indicators like the nitrogen limitation threshold (NLT) [4,5]. Therefore, systematically elucidating the intrinsic relationships between fruit quality and the aforementioned photosynthetic physiology and canopy structural indicators serves as the theoretical prerequisite for achieving precise quality regulation.
However, the relationships between canopy indicators and quality traits are highly nonlinear and multidimensional, forming a complex system that traditional linear statistical models (such as multiple linear regression and principal component regression) often struggle to accurately capture and interpret [6]. In recent years, artificial intelligence models have been increasingly applied in agricultural prediction, including artificial neural networks (ANN) [7], support vector machines (SVMs) [8], and Random Forest (RF) [9]. While these models demonstrate good performance under certain conditions, they still exhibit limitations when handling complex physiological and ecological data: ANN models are prone to becoming trapped in local minima and suffer from insufficient training stability [10]; SVM incurs high computational costs in high-dimensional, large-data scenarios [11]; and ensemble methods like Random Forest, although capable of handling nonlinearity, offer weak mechanistic interpretability due to their “black-box” nature [12]. These limitations hinder the practical application of existing models in precision orchard management.
In contrast, the Back Propagation (BP) neural network, which adjusts network weights via an error back-propagation algorithm, exhibits stronger nonlinear mapping capabilities and more robust learning characteristics [13,14,15]. It can autonomously extract complex features from high-dimensional data and effectively establish an end-to-end prediction model linking canopy driving factors to fruit quality, offering a novel approach to overcome the limitations of existing methods [16].
Based on this, this study proposes the following hypothesis: Integrating key canopy driving factors with the BP neural network algorithm can construct a highly accurate and strongly generalizable apple quality prediction model, which will outperform traditional linear methods and some existing AI models. To test this hypothesis, this study integrated the measurement of core apple quality indicators and key canopy physiological structure parameters, aiming to achieve the following objectives: (1) systematically analyze the intrinsic relationships between various canopy indicators and VC, SS, TA, and SS/TA; (2) construct and refine a quality prediction model based on a BP neural network, determining the optimal network architecture and training algorithm for each quality parameter; (3) identify the core driving factors influencing different quality components through feature importance analysis and ablation experiments, thereby building a streamlined and efficient refined model; and (4) validate the effectiveness and reliability of the developed models. This research aims to provide new methodological support for elucidating the formation mechanisms of apple fruit quality and for achieving accurate pre-harvest prediction, while also offering practical tools for the precision management of smart orchards.

2. Materials and Methods

2.1. Site Overview and Experimental Materials

This experiment was conducted in an apple orchard located on the campus of Tarim University in Alar, China. The orchard was oriented in an East–West row direction, with a planting spacing of 3.5 m × 1.8 m, and trees were trained to a central leader fruiting form. The test trees were approximately 3.5 m in height, with a canopy spread of 1.25–1.75 m and an operation path width of 3.5 m. The soil type in the orchard area was sandy loam soil. The study employed a field experimental design and was carried out in conjunction with statistical analysis methods.

2.2. Experimental Design and Sample Collection

The experiment employed a completely randomized block design. From October to November 2024, during the fruit ripening period, a total of 45 sample trees were selected for sampling. Representative fruit samples were collected uniformly from each sample tree according to four orientations (east, west, south, and north) and three canopy layers (upper, middle, and lower), yielding 225 samples in total. Concurrently, mature and healthy leaves were collected from vegetative shoots near the corresponding fruit-bearing branches for the determination of photosynthetic parameters and physiological indicators. All measurements were performed with three replicates.

2.3. Items and Methods of Determination

2.3.1. Determination of Fruit Quality Parameters

The vitamin C content was determined using the molybdenum blue colorimetric method [17]. First, a 1 mg/mL ascorbic acid standard stock solution was prepared: 100 mg of ascorbic acid standard was accurately weighed and diluted to 100 mL with a 50 g/L oxalic acid solution. Aliquots of 0, 0.2, 0.4, 0.6, 0.8, and 1.0 mL of the stock solution were separately transferred into 10 mL centrifuge tubes, and each was adjusted to a final volume of 1.0 mL by adding 50 g/L oxalic acid solution. Subsequently, 1 mL of 5% sulfuric acid-ethanol solution and 1 mL of 5% ammonium molybdate solution were added sequentially to each tube. After thorough mixing, the reaction was carried out in a water bath at 30 °C for 30 min. The absorbance was measured at a wavelength of 705 nm, and a standard curve was plotted.
For sample analysis, 2.00 g of fruit homogenate was accurately weighed and mixed with 10 mL of pre-chilled 50 g/L oxalic acid solution. The mixture was homogenized in an ice bath and then centrifuged at 4 °C and 8000 r/min for 15 min. The resulting supernatant was collected as the test solution. A 1.0 mL aliquot of the test solution was taken, and sulfuric acid-ethanol solution and ammonium molybdate solution were added following the same procedure described above. The mixture was reacted in a water bath at 30 °C for 30 min before measurement.
The absorbance of the reaction solution was measured at a wavelength of 705 nm using a UV-Vis spectrophotometer. The vitamin C content in the sample was calculated based on the standard curve, as shown in Formula (1).
Vc = C   ×   V   ×   n m   ×   1000   ×   100
where C is the ascorbic acid concentration (μg/mL) obtained from the standard curve; V is the total volume of the extraction solution (mL); n is the dilution factor; and m is the sample mass (g).
The soluble sugar content was determined by the anthrone colorimetric method [18]. First, a 1 mg/mL glucose standard solution was prepared by accurately weighing 100 mg of anhydrous glucose, dissolving it in distilled water, and diluting to a final volume of 100 mL. Aliquots of 0, 0.1, 0.2, 0.4, 0.6, and 0.8 mL of this solution were separately transferred into stoppered test tubes, and distilled water was added to bring the total volume in each tube to 1.0 mL. Subsequently, 4.0 mL of a freshly prepared anthrone-ethyl acetate solution (0.2 g anthrone dissolved in 100 mL ethyl acetate) and 5.0 mL of concentrated sulfuric acid were added sequentially. After thorough shaking, the mixtures were reacted in a boiling water bath for 10 min, immediately cooled in an ice bath, and the absorbance was measured at a wavelength of 630 nm to construct the standard curve.
For sample processing, 1.00 g of fruit homogenate was accurately weighed, mixed with 10 mL of an 80% ethanol solution, and extracted in an 80 °C water bath for 30 min with intermittent shaking. After cooling, the mixture was centrifuged at 4000 r/min for 10 min, and the supernatant was collected. The residue was extracted once more, and the combined supernatants were decolorized with activated carbon and then diluted to 25 mL with 80% ethanol. A 0.5 mL aliquot of the extract was taken and subjected to the color development procedure described above. The absorbance at 630 nm was measured, and the soluble sugar content was calculated according to Formula (2).
S = C   ×   V   ×   n m   ×   10 6   ×   100
The titratable acidity (TA) content was determined by neutralization titration with NaOH [19]. Precisely 5.00 g of fruit homogenate sample was weighed into a 150 mL conical flask. Then, 50 mL of freshly boiled and cooled distilled water was added, and the mixture was extracted in an 80 °C water bath for 30 min with occasional shaking. After cooling, the mixture was filtered, and the residue was washed with distilled water 3–4 times. The filtrate was collected and diluted to volume in a 100 mL volumetric flask.
Exactly 20.0 mL of the sample extract was pipetted into a 150 mL conical flask, followed by the addition of 2 drops of 1% phenolphthalein indicator. The solution was titrated with a 0.01 mol/L NaOH standard solution until a faint pink color appeared and persisted for 30 s as the endpoint. The volume of NaOH solution consumed was recorded. A blank test was performed simultaneously using distilled water.
The titratable acidity, expressed as malic acid, was calculated according to the following formula (Formula (3)).
TA = V   -   V 0   ×   C   ×   K   ×   V 1 m   ×   V 2   ×   1000
V is the volume of NaOH consumed in sample titration (mL); V0 is the volume of NaOH consumed in blank titration (mL); C is the concentration of the NaOH standard solution (mol/L); K is the conversion factor for malic acid (0.067); V1 is the total volume of the sample extract (mL); V2 is the volume of sample solution used in titration (mL); m is the sample mass (g).
Sugar-acid ratio (SS/TA): Calculated as (SS/TA) [20].

2.3.2. Determination of Photosynthetic and Canopy Structural Indices in Fruit Trees

Photosynthetically active radiation (PAR) was measured using an LAI-2200C Plant Canopy Analyzer (LI-COR Biosciences, Lincoln, NE, USA) or an equivalent model [21]. The instrument was equipped with a 180° fisheye lens and calculated PAR by comparing the radiation difference above and below the canopy. Measurements were conducted under clear, cloudless weather conditions, within the time window from 2 h after sunrise to 2 h before sunset.
Prior to formal measurement, the instrument was preheated for 20 min and uniformly set to the UTC + 8 time zone to automatically record timestamps. Initial calibration was performed at an open, unobstructed area to establish a reference point, and reference PAR data above the canopy were collected. Subsequently, five measurement points were arranged beneath the canopy of each sample tree, located at the four quadrant points and the center point of the crown projection. The detector was placed horizontally at a height of 1.5 m above the ground, with the fisheye lens oriented toward the zenith. At each measurement point, three consecutive readings were recorded, and their average was taken as the PAR value for that point.
During the measurement process, radiation data were simultaneously recorded both above the canopy (in an open area) and below the canopy. The instrument automatically calculated the canopy transmittance and output the PAR value (unit: μmol·m−2·s−1). Finally, the distribution of PAR within the canopy was analyzed based on the instrument’s built-in model, with the specific calculation formula provided in Formula (4).
P A R C a n o p y = P A R A b o v e × T t o t a l
P A R A b o v e is the measurement value at the reference point above the canopy; T t o t a l is the total transmittance of the canopy (calculated by the instrument using the gap fraction model).
Leaf area index (LAI) and direct transmittance coefficient (T) were measured using an LAI-2200C plant canopy analyzer (LI-COR Biosciences, USA) [22]. The instrument is equipped with a 180° fisheye lens and a light sensor, enabling simultaneous acquisition of canopy digital images and radiation data. Measurements were conducted under clear, cloudless weather conditions between sunrise and sunset, specifically during periods when the solar elevation angle exceeded 30° and light intensity was stable (10:00–14:00), with no strong wind interference (wind speed < 3 m/s).
Prior to measurement, the instrument was preheated for 15 min, and the measurement parameters were set to 5 zenith angle rings and 4 azimuth angles. A reference point measurement was performed in an open area above the canopy: the sensor was placed horizontally at a height of 1.8 m above the ground, and three sets of PAR data were recorded consecutively. The average of these readings was taken as the reference value above the canopy (PAR0). For each sample tree, measurement points were arranged using the five-point method, located at the center and the east, west, south, and north directions, respectively. Each point was positioned at a distance of two-thirds of the crown radius from the main trunk. The sensor was kept horizontal with the lens facing the zenith, and three sets of valid data were acquired at each measurement point.
L A I = l n 1 T G Ω
T = P A R ¯ b e l o w P A R 0
In Equation (5), G is the leaf projection function (default value is 0.85); Ω is the aggregation index (default value is 0.95).
In Equation (6), T is the direct transmittance coefficient; PARbelow is the average photosynthetically active radiation below the canopy; PAR0 is the reference value above the canopy.
Net photosynthetic rate (Pn), transpiration rate (Tr), stomatal conductance (Gs), and intercellular CO2 concentration (Ci) were measured using an LI-6800 portable photosynthesis system (LI-COR Biosciences, USA) [23,24,25]. The instrument was equipped with a standard 2 × 3 cm transparent leaf chamber and a red-blue light source. All measurements were conducted between 9:00 and 11:00 AM under clear weather conditions.
Prior to measurement, the instrument was preheated for 30 min. The CO2 absorbent and desiccant were replaced, and the CO2/H2O infrared gas analyzer was calibrated. The LED light source was also checked and preheated. The measurement parameters were set as follows: photosynthetic photon flux density (PPFD) at 1000 μmol·m−2·s−1, reference chamber CO2 concentration maintained at 400 μmol·mol−1, leaf chamber temperature controlled at 25 ± 1 °C, airflow rate at 500 μmol·s−1, and relative humidity kept at 60 ± 5%. During measurement, current-year branches facing east within the canopy were selected. From these, fully expanded, well-illuminated, and disease-free functional leaves were chosen, and the measurement points were marked (avoiding the midrib). The leaf was flattened and clamped into the leaf chamber, ensuring complete coverage of the measurement window. After allowing parameters to stabilize (typically requiring 2–3 min), recording commenced when the rates of change for both the CO2 concentration difference (ΔCO2) and the H2O concentration difference (ΔH2O) fell below 0.5%/min. Three sets of stable data were collected consecutively at 30 S intervals, with the measured values for Pn, Tr, Gs, and Ci automatically output by the instrument.
Water use efficiency (WUE) was calculated using Equation (7) [26].
W U E = P n T r
The total nitrogen content in leaves was determined using the Kjeldahl method [27]. After drying and grinding the leaf samples, 0.2000 g was accurately weighed, mixed with a combined catalyst and concentrated sulfuric acid, and digested at 420 °C for 2 h. The digest was then distilled in a Kjeldahl distillation apparatus, absorbed with boric acid, and titrated with a 0.01 mol/L HCl standard solution. The total nitrogen content was calculated according to Formula (8).
N = V V 0 × C × 0.014 × D m × 100
where V is the volume of hydrochloric acid consumed by the sample (mL); V0 is the volume of hydrochloric acid consumed by the blank (mL); C is the concentration of the hydrochloric acid standard solution (mol/L); D is the dilution factor (10 in this method); m is the sample mass (g); and 0.014 is the millimolar mass of nitrogen (g/mmol).
The stomatal limitation value (Ls) was calculated using Equation (9) [28].
L s = 1 C i C α
C i is the measured intercellular CO2 concentration (μmol·mol−1); C α is the atmospheric CO2 concentration, using the average measured value of 412 μmol·mol−1 above the canopy in this study.
Extinction coefficient (k): This parameter was calculated using the Beer-Lambert law in conjunction with LAI and the light intensity beneath the canopy, as shown in Equation (10) [29].
k = l n I I 0 L A I
where I is the average light intensity below the canopy (μmol·m−2·s−1); I0 is the natural light intensity above the canopy (μmol·m−2·s−1); and LAI is the leaf area index.
Canopy fractional interception of radiation (FIR) [30], direct light use efficiency (DUE) [31], and scattered light use efficiency (SUE) [32] were calculated using Formulas (11), (12), and (13), respectively, based on measurements of photosynthetically active radiation (PAR) inside and outside the canopy.
F I R = 1 P A R b e l o w P A R a b o v e × 100 %
D U E = P A R a b o v e × F I R × η d i r e c t
S U E = P A R a b o v e × 1 F I R × η d i f f u s e
In the formula, PARbelow is the average photosynthetically active radiation below the canopy (μ mol·m−2·s−1); PARabove is the natural photosynthetically active radiation above the canopy (μ mol·m−2·s−1); ηdirect is the direct light conversion efficiency coefficient, with a value of 0.85; ηdiffuse is the diffuse light conversion efficiency coefficient, with a value of 0.92.
Canopy openness (Fb) was measured using a plant canopy analyzer [33]. Multiple measurement points were evenly distributed beneath the canopy of the fruit trees. An LAI-2200C canopy analyzer (LI-COR Biosciences, Lincoln, NE, USA) was used to simultaneously record light data from a reference point above the canopy and from each measurement point below it. The instrument’s dedicated analysis software automatically calculated the canopy openness value based on the canopy porosity in the 0° zenith angle direction.

2.3.3. Calculation of Carbon–Water–Nitrogen Co-Metabolism Indices

Stomatal-Photosynthetic Coupling Index (SPCI) [30,34]: Calculated based on the synergistic variation relationship between Gs and Pn, with the formula shown in Equation (14).
Nitrogen Limitation Threshold (NLT) [31,35]: Computed using a model based on the relationship between leaf nitrogen content and photosynthetic capacity, with the formula shown in Equation (15).
Carbon Cost Index (CCI) [36]: Represents the balance between carbon assimilation and consumption, with the formula shown in Equation (16).
S P C I = G S P n × N × 1000
N L T = P n m a x P n a c t u a l G S × N × 1000
C C I = T r P n × N × 1000

2.4. Construction and Validation of a BP Neural Network Model

Model structure: In terms of model construction, this study adopts a three-layer Back Propagation (BP) neural network architecture [37]. A weight and threshold iterative update mechanism is established, with the training termination conditions set to a maximum of 30 iterations or upon reaching a predefined error threshold [38]. The 18 measured physiological structure indicators (in the feature-optimized model modeling, the input indicators refer to the physiological structure indicators selected based on feature importance evaluation from ablation experiments and feature selection) serve as input variables, while individual fruit quality indicators (VC, TA, SS, and SS/TA) are used as output targets.
Regarding the choice of training function, this study encompasses six algorithms with distinct numerical characteristics: trainlm, based on Levenberg–Marquardt optimization, offers rapid convergence and is suitable for fast convergence on small to medium-sized datasets; trainbfg employs a quasi-Newton algorithm, achieving a balance between memory usage and precision; trainscg, as a scaled conjugate gradient method, is applicable for large-scale parameter optimization; traingdx combines adaptive learning rate and momentum terms, effectively avoiding local minima; traincgb enhances the stability of conjugate gradient training through the Powell–Beale restart strategy; and traincgp employs the Polak–Ribière update rule, demonstrating distinct numerical characteristics in conjugate gradient computation. The systematic comparison of these algorithms enables a comprehensive evaluation of the network’s performance under different optimization strategies [39,40]. Regarding the configuration of the number of nodes in the hidden layer, the test range was set from 5 to 15 nodes. This interval ensures the model possesses sufficient expressive capacity to capture nonlinear relationships while effectively controlling the parameter scale to prevent overfitting caused by an overly complex structure. Through this systematic parameter search, the network architecture most suitable for the complexity of this dataset can be determined [41,42], aiming to identify the optimal model structure. The specific process is illustrated in Figure 1.
In the data preparation stage, all feature data were subjected to min-max normalization (Formula (17)) and then randomly divided into a training set (158 samples) and a validation set (67 samples) at a 7:3 ratio [43].
The selection of the BP neural network as the modeling approach is primarily based on the following considerations: compared to Support Vector Regression (SVR), which faces limitations in kernel function selection for high-dimensional nonlinear systems [44], and ensemble methods such as Random Forest (RF), which exhibit shortcomings in interpretability [45], the BP network can more flexibly capture the complex nonlinear relationships between photosynthetic parameters and quality traits through its multi-layer error backpropagation mechanism. Simultaneously, when contrasted with the broader category of artificial neural networks (ANNs), the BP algorithm offers distinct advantages, including a well-defined structure, stable training processes, and ease of implementation [46], making it more suitable for the modeling requirements of the medium-sized dataset in this study. Furthermore, this algorithm has accumulated numerous successful applications in agricultural physiological prediction problems, providing methodological references for the present research.
Q n = Q Q m i n Q m a x Q m i n
where Q is the raw measurement data, Qnis the standardized value of the raw measurement data, and Qmax and Qmin are the maximum and minimum values of the variable, respectively.
Model Training and Evaluation: The model performance was comprehensively assessed using the coefficient of determination (R2, Equation (18)), mean absolute error (MAE, Equation (19)), mean absolute percentage error (MAPE, Equation (20)), and model bias error (MBE, Equation (21)) [47].
R 2 = 1 i = 1 n y i y i ^ 2 i = 1 n y i y ¯ i 2
M A E = 1 n i = 1 n y i y ^ i
M A P E = 100 % n i = 1 n y i y ^ i y i
M B E = 1 n i = 1 n y ^ i y i

2.5. Feature Importance Analysis and Model Optimization

Feature importance ranking: The importance scores of each input feature in the trained BP network were calculated based on the connection weight method.
Single-factor sensitivity analysis and ablation experiments: By sequentially excluding individual features and re-evaluating the model performance (measured by metrics such as ΔR2), the sensitivity of each feature to the model prediction ability was assessed [48].
Feature selection and model simplification: Based on the feature importance scores and the results of sensitivity analysis, a core subset of features critical to the prediction target was screened. A simplified model was then reconstructed using this subset, and its performance was compared with that of the original model [49].

2.6. Data Analysis

All testing data were subjected to descriptive statistics and Pearson correlation analysis using Origin 2022. Neural network modeling and related computations were completed in the MATLAB R2024b environment.

3. Results and Analysis

3.1. Analysis of Data on Apple Fruit Quality and Physiological Structure Indicators

The quality indices of apple fruits, including Vc, TA, and SS contents as well as SSC/TA ratio, exhibited certain variations among different samples. Specifically, Vc content generally ranged from 0.42 to 0.90 mg/100 g, while TA and SS varied within 0.29–15.41% and 6.32–16.90%, respectively. Regarding photosynthetic characteristics, the average canopy effective radiation was approximately 516.87 μmol·m−2·s−1, and the mean leaf area index was 1.90, indicating a relatively open canopy structure overall. Among leaf photosynthetic parameters, the average net photosynthetic rate was 9.81 μmol·m−2·s−1, while the mean transpiration rate and water use efficiency were 3.72 mmol·m−2·s−1 and 2.89 μmol·mol−1, respectively, demonstrating favorable photosynthetic performance and moisture regulation capacity in the leaves. Parameters related to light energy utilization, such as the average light interception rate, reached 51.61%. The utilization rates of direct and scattered light were 27.70 μmol·m−2·s−1 and 0.03 μmol·m−2·s−1, respectively. Among the indicators of carbon–water–nitrogen synergistic metabolism, the average stomatal-photosynthetic coupling index was 3.27 mmol·μmol−1·g−1 (Table 1), indicating generally favorable overall metabolic coordination. Differences in fruit quality and mineral element content persisted among the orchards, which may be attributed to factors such as regional climate, tree nutritional status, and growth microenvironment.

3.2. Pearson Correlation Analysis

Figure 2 visually illustrates the correlation patterns between 18 plant physiological and structural indicators and key fruit quality components. As “positive regulatory factors,” VC (vitamin C) and SS (soluble saccharides) enhance the net photosynthetic rate (Pn) significantly by promoting canopy development (showing a strong positive correlation with LAI) and optimizing stomatal behavior (positively correlated with Gs and SPCI), thereby improving direct light use efficiency (DUE) and water use efficiency (WUE). Specifically, the correlation coefficient between SS and Pn reaches 0.85. Ample photosynthetic products not only supply carbon skeletons and energy for VC synthesis but also directly promote the accumulation of soluble saccharides. On the other hand, because plants prioritize resource allocation for efficient direct light utilization, VC and SS exhibit a significant negative correlation with scattered light use efficiency (SUE). TA (titratable acid) acts as a “negative regulatory factor.” Its accumulation exhibits an inhibitory association with photosynthetic efficiency (negatively correlated with Pn) and canopy light interception capacity (negatively correlated with LAI and FIR). The core reason lies in the fact that the metabolic process of titratable acid competes for nitrogen resources (correlation coefficient with NLT is 0.75) and simultaneously restricts stomatal opening (positively correlated with Ls, negatively correlated with Gs), thereby indirectly reducing CO2 supply and diminishing photosynthetic intensity. Furthermore, the accumulation of TA likely tends to consume photosynthetic intermediates, creating a metabolic competition with sugars and VC.
SS/TA (sugar-acid ratio), as a key indicator of fruit flavor harmony, plays a dual role as both a “balancing signal” and a “final expression.” Its positive correlations with Pn, Gs, and DUE, and negative correlations with NLT and Ls, essentially reflect the synergistic interplay among photosynthetic efficiency, resource utilization, and sugar-acid metabolism. Specifically, stronger photosynthesis is associated with tighter coupling between stomatal conductance and photosynthesis, greater dominance of sugar accumulation, and a weaker inhibitory effect of titratable acid, ultimately contributing to a more favorable sugar-acid ratio.
In detail, the high correlation (0.85) between soluble saccharides (SS) and Pn indicates both the direct accumulation of soluble saccharides as photosynthetic products and their feedback regulatory role as metabolic signals on the photosynthetic process. The positive correlations of vitamin C (VC) with photosynthetic and canopy metrics confirm the dependence of its synthesis on photosynthetic products and energy. The inhibitory associations exhibited by titratable acid (TA) further clarify the specific pathway through which TA interferes with photosynthesis—by competing for nitrogen resources and imposing stomatal limitations—and ultimately affects fruit quality formation. This study provides a basis for elucidating the synthesis mechanisms of VC, soluble saccharides, and titratable acid in fruits. It also indicates that cultivation practices—such as regulating the canopy light environment and optimizing nitrogen supply—can be employed to directionally improve the fruit sugar-acid ratio, thereby enhancing fruit quality.

3.3. BP Neural Network Model for Predicting Vc

To predict apple Vc content based on photosynthetic data, this study systematically compared various back propagation neural network architectures. The results indicated that the model with five hidden layers and the trainlm training function demonstrated the best performance. This model achieved a coefficient of determination (R2) of 0.87 and a mean absolute percentage error (MAPE) of 0.0123 on the validation set, both being the optimal or near-optimal values among all models.
Notably, the model exhibited comparable performance on the training set and the validation set (R2 values of 0.84 and 0.87, respectively), with all metrics on the validation set slightly outperforming those on the training set. This indicates good generalization ability and stability, with no signs of overfitting. In contrast, models with more hidden layers, specifically the 18-7-1 and 18-9-1 architectures, showed significant performance discrepancies between the training and validation sets, indicating clear overfitting. Although the 18-13-1 model performed stably, its prediction accuracy was limited (see Table 2 and Supplementary Table S1). The relationship between measured and predicted values shown in Figure 3 further verifies the reliability of the preferred model. Both the Training set and Validation set exhibit a high goodness of fitting, with scatter points closely distributed around the diagonal line, indicating that the model can accurately capture the variation pattern of Vc content. In summary, the trainlm model with an 18-5-1 structure, owing to its excellent generalization ability and prediction accuracy, proves to be a suitable choice for predicting apple Vc content.

3.4. BP Neural Network Model for Predicting Titratable Acidity

To screen for a suitable model for predicting apple titratable acid content, this study compared various neural network architectures. Overall, the model with 9 hidden layers and employing the trainbfg training function demonstrated the best comprehensive performance. This model achieved a high coefficient of determination (R2 = 0.86) on the validation set. More importantly, the performance difference between its training set and validation set was the smallest among all models, indicating good generalization ability and predictive stability without significant overfitting (Table 3 and Supplementary Table S2).
Based on these results, the model with an 18-9-1 architecture using the trainbfg training function was selected as the predictive tool for titratable acid content. The fitting between measured and predicted values (Figure 4A,B) shows that data points from both the training and validation sets are closely distributed along the regression line, reflecting consistent predictive performance of the model. Further examination of the sample scatter plot (Figure 4C) reveals that the predicted values for most samples are close to their measured values, providing additional support for the model’s reliable prediction accuracy in practical applications.

3.5. BP Neural Network Model for Predicting Soluble Saccharides

After comprehensively comparing various models, the 18-5-1 architecture paired with the trainlm training function was identified as the optimal model for predicting the soluble sugar content in apples. This model demonstrated high accuracy on the validation set (R2 = 0.86). Its key strength lies in the high consistency of performance metrics between the training set and the validation set, indicating excellent generalization ability and structural robustness. Compared to other candidate models, this network effectively captured the underlying patterns in the data while avoiding overfitting, thereby delivering more reliable predictions when faced with unseen data (Table 4 and Supplementary Table S3).
The fitting results between the measured values and predicted values (Figure 5A,B) further validated the model’s consistency, with data points from both the training set and validation set closely distributed around the regression line. The scatter plot of samples (Figure 5C) shows that the predicted values for most samples are close to the measured values, further corroborating the model’s accuracy and stability in practical application.

3.6. A BP Neural Network Model for Predicting Sugar-Acid Ratio

Following a comprehensive evaluation, the model with an 18-6-1 architecture employing the training function ‘trainbfg’ demonstrated optimal performance in predicting the sugar-acid ratio of apples. This model exhibited a high goodness of fitting (R2 = 0.89) during validation, with both the mean absolute error and mean relative error being the lowest among all models, indicating minimal deviation between the predicted values and the measured values. Notably, the model’s overall performance on the validation set surpassed that on the training set, suggesting it did not overfit to the specific features of the training data but instead effectively captured the general patterns underlying sugar-acid ratio variation. Consequently, it possesses more reliable generalization ability (Table 5 and Supplementary Table S4).
The fitting between the measured values and predicted values (Figure 6A–C) shows that, for both the training set and validation set, the majority of sample points cluster closely around the regression line. This further confirms the model’s favorable accuracy and stability in predicting the sugar-acid ratio of apples.

3.7. Feature Importance Visualization

This study elucidates the underlying physiological mechanisms by analyzing feature importance in four prediction models for apple quality components.
In the prediction of vitamin C, the scattering light use efficiency (SUE) plays a central role, primarily by promoting the formation of precursors for VC synthesis. Photosynthesis-related parameters, such as stomatal conductance (Gs) and net photosynthetic rate (Pn), collectively influence VC accumulation by regulating carbon–nitrogen allocation and material transport processes (Figure 7A).
The titratable acid content is predominantly governed by the total leaf nitrogen content, which directly participates in acid synthesis during nitrogen metabolism. Features including the nitrogen limitation threshold (NLT) and SUE contribute to regulation by influencing enzyme activity and carbon skeleton supply (Figure 7B). In the soluble sugar model, SUE, direct light use efficiency (DUE), and Pn collectively form the core features. These three factors work synergistically within the integrated “light energy–photosynthesis–nitrogen metabolism” pathway to jointly regulate sugar accumulation (Figure 7C).
For the sugar-acid ratio prediction, SUE again emerges as the most important driving factor, influencing the sugar-acid balance by regulating the allocation of photosynthetic products. Indicators such as leaf total nitrogen and DUE represent the nitrogen and light energy utilization pathways, respectively, together constituting a coordinated regulatory network (Figure 7D).
Comprehensive analysis reveals that scattering light use efficiency (SUE) plays a central role in the formation of multiple quality traits; nitrogen-related indicators dominate titratable acid synthesis; while photosynthetic parameters serve as fundamental regulatory factors underlying the formation of all quality components.

3.8. One-Way Sensitivity Analysis

Based on single-factor sensitivity analysis, this study identified the most critical feature factors affecting the performance of the prediction models for each quality indicator and elucidated their underlying physiological significance.
In the vitamin C prediction model (Figure 8A), net photosynthetic rate (Pn), stomatal-photosynthesis coupling index (SPCI), and scattered light utilization efficiency (SUE) exhibited high sensitivity. These three factors correspond, respectively, to carbon skeleton supply, photosynthetic coordination efficiency, and light energy utilization capacity. Their absence leads to a significant decline in model accuracy, indicating their irreplaceable driving roles in VC synthesis. Moderately sensitive factors such as stomatal conductance (Gs) and leaf nitrogen content are primarily involved in regulating substance transport and metabolic balance; their removal has a relatively limited impact on the model. In contrast, low-sensitivity factors such as the stomatal limitation value (Ls) reflect background environmental information; their removal has minimal impact on model performance and may even slightly improve the fitting effect in specific cases by reducing model complexity.
The titratable acid model shows high dependence on nitrogen status (Figure 8B). High-sensitivity factors, including the nitrogen limitation threshold (NLT) and leaf total nitrogen content (N), are directly linked to acid synthesis and metabolism. Medium-sensitivity factors such as SUE and Pn play auxiliary regulatory roles by influencing carbon allocation and the supply of photosynthetic products.
In the models for soluble saccharides and the sugar-acid ratio (Figure 8C,D), parameters related to photosynthesis and light-use efficiency—such as SUE, Pn, and SPCI—generally exhibit high sensitivity, further confirming the central role of photosynthetic assimilation in sugar accumulation and sugar-acid balance. Meanwhile, nitrogen-related parameters also show relatively high sensitivity in the sugar-acid ratio model, indicating that sugar-acid composition is jointly regulated by the coordinated interaction of carbon and nitrogen metabolism. Overall, the sensitivity structure of different models aligns closely with their corresponding physiological mechanisms. Vitamin C synthesis relies more heavily on photosynthetic carbon fixation and light energy conversion; titratable acid content is primarily regulated by nitrogen metabolism; while the sugar-acid balance depends on the synergistic interaction between light energy utilization and nitrogen supply. These findings provide a clear rationale for model simplification: when constructing practical prediction models, medium- and high-sensitivity features can be prioritized, effectively reducing data collection and computational costs while maintaining high accuracy.

3.9. Feature Importance Assessment and Feature Selection Based on Ablation Experiments

Based on single-factor sensitivity analysis and ablation experiments, this study identified the optimal feature subsets for the prediction models of each quality indicator.
For the Vc prediction model (Figure 9A), five key features—Pn, Gs, SUE, SPCI, and NLT—were selected. This combination retains all core driving factors (Pn, SPCI, SUE) while incorporating Gs and NLT to enhance informational completeness. Redundant features were removed, improving model robustness without compromising predictive performance.
The core features of the titratable acid model focus on nitrogen metabolism-related indicators (Figure 9B), with the final subset consisting of NLT, N, SUE, and Pn. Among these, NLT and N were prioritized as primary regulators of acid synthesis, whereas SUE and Pn were included as auxiliary indicators reflecting light energy utilization and carbon supply. The optimal subset for the soluble saccharides prediction model comprised Pn, SUE, SPCI, DUE, and NLT (Figure 9C). These features collectively encompassed multiple key pathways, including photosynthesis, light energy utilization, and nitrogen regulation, thereby forming a systematic representation of sugar synthesis.
For the sugar-acid ratio model, the selected feature collection consisted of SUE, NLT, N, Pn, and SPCI (Figure 9D). This combination simultaneously reflected the integrated effects of light energy allocation and nitrogen regulation on the sugar-acid balance.
By further intersecting the results of sensitivity analysis with the modeling features, the core drivers for each quality indicator were definitively identified: Vc was primarily regulated by Pn, Gs, SUE, and SPCI (Figure 9E); the key influencing factors for titratable acid were Pn, N, SUE, and NLT (Figure 9F); and soluble saccharides and the sugar-acid ratio were both systematically regulated by Pn, N, SUE, SPCI, and NLT (Figure 9G,H). These findings provide a basis for feature selection in constructing concise and efficient prediction models for apple quality.

3.10. Feature Verification

To validate the effectiveness of feature screening, this study reconstructed prediction models based on the key feature set obtained from the screening. As shown in Table 6, the refined models demonstrated superior performance in predicting all quality indicators.
In the vitamin C prediction task, the new model maintained its accuracy on the training set while significantly enhancing its generalization ability. The goodness of fitting on the validation set increased to 0.93, and error metrics decreased by over 30%, indicating that removing redundant variables helps the model focus more on key physiological processes.
For the prediction of titratable acid, the new model achieved performance improvements on both the training and validation sets, with errors significantly reduced. Moreover, the prediction bias remained close to zero, suggesting that the model enhanced explanatory power without introducing systematic error. In the prediction of soluble saccharides, the refined model also demonstrated outstanding performance, with the goodness of fitting for both the Training set and the Validation set exceeding 0.90. The validation error was reduced by approximately one-quarter, reflecting the model’s enhanced Generalization ability and stability.
The improvement in sugar-acid ratio prediction was particularly notable. While maintaining a high goodness of fitting, the new model effectively corrected the systematic bias present in the original model, resulting in more accurate and reliable predictions.
Overall, the simplified model constructed through feature screening not only has a more compact structure but also outperforms the original model in terms of generalization ability, error control, and systematic bias correction. This validates the effectiveness and practical value of the feature screening strategy employed in this study.

4. Discussion

4.1. Photosynthetic Physiology and Carbon–Nitrogen Metabolism Basis of Apple Fruit Quality Formation

This study systematically measured the quality indices of apple fruits and the related physiological and structural parameters of the trees. Correlation analysis revealed that photosynthetic characteristics, such as net photosynthetic rate (Pn) and stomatal conductance (Gs), showed significant positive correlations with vitamin C (VC) content, soluble saccharides (SS) content, and the sugar-acid ratio (SS/TA), but a negative correlation with titratable acid (TA) content [50]. This finding underscores that photosynthesis serves as the material and energetic foundation for fruit quality formation. Specifically, a higher Pn directly promotes the accumulation of photosynthetic products, providing sufficient carbon skeletons and ATP for the synthesis of VC and SS [51]. Concurrently, the positive correlation between Gs and SPCI indicates that refined stomatal behavior ensures effective CO2 supply and coordinates with the photosynthetic process, thereby further enhancing this mechanism [52].
Notably, the effects of different light resource utilization modes on quality components vary. Direct light use efficiency (DUE) shows a positive correlation with soluble sugar (SS) content and the sugar-acid ratio, whereas scattered light use efficiency (SUE) exhibits a strong negative correlation with vitamin C (VC) and SS content. This discrepancy may stem from the resource allocation strategy formed by plants under the regulation of canopy structure: in environments with sufficient direct light, photosynthetic products tend to be efficiently converted into substances such as sugars; in environments dominated by scattered light or with a closed canopy, although the SUE value increases, the overall light energy capture and conversion efficiency is limited, potentially leading to a relative insufficiency of resources allocated for the synthesis of VC and SS [53].
On the other hand, nitrogen metabolism plays a central role in regulating fruit acidity. Leaf total nitrogen content (N) and nitrogen limitation threshold (NLT) showed a strong positive correlation with TA content, but a negative correlation with the sugar-acid ratio. This confirms that nitrogen serves as a crucial precursor and driving force for titratable acid synthesis. When nitrogen supply is sufficient, fruit trees preferentially allocate photosynthetic products and nitrogen resources to the synthesis of organic acids (such as malic acid). This process may compete with sugar synthesis for substrates, thereby explaining the negative correlations observed between TA and SS or VC. Such synergy and trade-off between carbon and nitrogen metabolism ultimately determine the sugar-acid balance and flavor quality of the fruit [54].

4.2. Applicability and Refinement of BP Neural Network Model in Fruit Quality Prediction

This study successfully developed BP neural network models for predicting VC, TA, SS, and SS/TA, respectively. Each optimal model demonstrated excellent performance on the validation set (R2 > 0.86), confirming the feasibility of reliably predicting fruit internal quality based on photosynthetic physiology data. Compared with traditional linear regression models, the BP neural network was more effective in capturing the complex nonlinear relationships between various physiological indicators and quality components [55]. Notably, the BP models in this study exhibited considerable competitiveness when compared to other intelligent algorithms applied in recent agricultural research. For instance, in the study by Song et al. (2025), Random Forest models typically reported an R2 of 0.73, while Support Vector Regression models achieved an R2 of 0.65 [56]. The accuracy level achieved by this model indicates that BP neural networks possess unique advantages in handling such complex agricultural biological data.
Through feature importance analysis and single-factor sensitivity analysis, we found that the core driving factors for different quality components vary, which aligns with the aforementioned physiological mechanisms. VC synthesis is highly dependent on SUE, Pn, and SPCI, highlighting the central role of light resource utilization efficiency and photosynthetic synergy [57]; TA accumulation is primarily driven by N and NLT, underscoring the critical function of nitrogen metabolism; SS and SS/TA are jointly regulated by photosynthetic factors such as SUE and Pn, as well as nitrogen-related factors including N and NLT, reflecting the integrated influence of carbon–nitrogen co-metabolism [58].
Based on the aforementioned analysis, feature screening significantly enhanced model efficiency and simplicity. The optimal feature subset identified through ablation experiments, after removing redundant and noisy variables, yielded a refined model that not only exhibited a more streamlined structure but also demonstrated superior prediction accuracy and generalization ability compared to the original model utilizing all features. Taking VC prediction as an example, the coefficient of determination (R2) on the validation set for the refined model increased from 0.87 to 0.93, while the mean absolute error (MAE) decreased by 32%. These results demonstrate that feature screening grounded in physiological mechanisms enables the model to focus more effectively on core patterns, effectively mitigating overfitting and consequently exhibiting greater robustness when confronted with unseen data [59].

4.3. Model Limitations

The prediction model developed in this study demonstrates potential for application in apple quality assessment, yet several limitations warrant further refinement. The model was trained on sampling data from 45 trees and 225 fruits, a sample size and diversity that remain insufficient. This foundational dataset may inadequately capture the comprehensive effects of different apple stocks, growing regions, and interannual climatic variations on fruit quality formation. Consequently, the model requires more extensive external validation before practical deployment [60].
Regarding variable interpretation, although input parameters were optimized through feature screening, the complex intrinsic relationships between photosynthetic indices and quality traits warrant cautious consideration. Taking the scattering light use efficiency (SUE) as an example, its dominant role observed in multiple models may stem from its specific physiological function in light energy utilization under low-light conditions. Alternatively, it could be attributed to coupling effects with unquantified factors such as canopy structure and foliage distribution [61]. This potential multicollinearity issue suggests that when applying the model for production decision-making, it is necessary to further discern the true contribution of variables through field experiments.
It should be noted that the current model has not yet incorporated key environmental factors such as soil physicochemical properties, moisture dynamics, and microclimate data. These elements exert direct regulatory effects on photosynthetic metabolism and substance conversion, and their absence may limit the model’s explanatory power in complex cultivation environments.

4.4. Implications for Cultivation Management and Research Prospects

The findings of this study provide clear guidance for precision cultivation management in orchards. First, by monitoring key photosynthetic parameters such as Pn, Gs, and SUE, early prediction and assessment of fruit VC and sugar content can be achieved [62]. Second, to improve fruit flavor (by increasing the sugar-to-acid ratio), cultivation practices should focus on both “promoting sugar” and “controlling acid”: “promoting sugar” requires improving canopy structure through proper pruning to enhance DUE and Pn; “controlling acid” necessitates precise regulation of nitrogen fertilizer application to avoid excessive nitrogen leading to over-accumulation of organic acids [63].
Future research could be further developed in the following aspects: First, environmental factors such as soil nutrients and moisture could be incorporated into the model to build a more comprehensive quality prediction system. Second, the model established in this study is based on a specific dataset, and its generalizability requires further validation and calibration across different stocks, regions, and climatic conditions. Third, exploring the integration of such prediction models with intelligent orchard management systems could provide decision-making support for targeted regulation of fruit quality and intelligent production.

5. Conclusions

The apple quality prediction model based on photosynthetic physiological parameters developed in this study offers a novel technical pathway for precision management in smart orchards. By measuring key indicators such as net photosynthetic rate and scattered light utilization efficiency, and integrating an optimized BP neural network, the model enables effective prediction of intrinsic fruit quality attributes—including vitamin C content and sugar-acid composition—prior to harvest.
At the practical application level, this model can directly support orchard management decisions. By monitoring core parameters such as Pn and SUE, growers can promptly assess the physiological status of trees, providing a basis for precise fertilization and pruning. For instance, when the model predicts a low sugar-acid ratio, nitrogen fertilizer application can be adjusted to improve the allocation of photosynthetic products. Meanwhile, abnormal fluctuations in SUE can serve as a reference signal for optimizing canopy structure. It is important to acknowledge that the current model still has certain limitations. First, the training data for the model were all sourced from a single production region and have not been validated under different ecological conditions. Its generalization ability requires further confirmation through cross-regional external testing. Second, the model has not yet integrated environmental data such as soil moisture and meteorological factors. The mechanisms by which these factors influence quality formation need to be quantified in subsequent research.
Future research efforts will focus on advancing three key directions: (1) conducting joint trials across multiple stock varieties and production regions to establish a quality prediction standard with broad applicability; (2) exploring the integration of unmanned aerial vehicle (UAV) remote sensing and Internet of Things (IoT) sensor data into the model to extend predictions from the individual tree to the orchard scale; and (3) developing an intelligent decision-making system that integrates quality prediction and cultivation regulation, ultimately forming a scalable smart orchard solution. In summary, the method proposed in this study lays a foundation for the targeted regulation of fruit quality. However, transforming it into a mature application technology still requires continuous advancement in areas such as model generalizability, data integration, and system integration.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/plants14243795/s1, Table S1: Model metric indicators of Vitamin C under different hidden layer numbers; Table S2: Model metric indicators of Titratable Acid under different hidden layer numbers; Table S3: Model metric indicators of soluble sugars under different hidden layer numbers; Table S4: Model metric indicators of. Saccharides/Titratable Acid under different hidden layer numbers.

Author Contributions

J.Z.: conceptualization, methodology, data curation, writing—original draft, writing—review and editing; M.Y.: conceptualization, methodology, data curation, writing—original draft, writing—review and editing; Y.C.: visualization, software; X.L.: validation, investigation; J.B.: validation, investigation; X.P.: resources, writing—review and editing, project administration, funding acquisition. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by the Tarim University President’s Fund through the Populus Euphratica Talent (Doctoral) Program (Grant No. TDZKBS202424).

Data Availability Statement

The data that support the findings of this study are available from the corresponding author upon reasonable request.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
VCVitamin C
SSSoluble Saccharides
SS/TASoluble Saccharides/Titratable Acid
TATitratable Acid
BPBack Propagation
PARPhotosynthetically Active Radiation
TTemperature
PnNet Photosynthetic Rate
TrTranspiration Rate
GsStomatal Conductance
CiIntercellular CO2 Concentration
WUEWater Use Efficiency
LsStomatal Limitation Value
LAILeaf Area Index
FbCanopy Openness
kExtinction Coefficient
FIRFraction of Intercepted Radiation
DUEDirect Light Use Efficiency
SUEScattered Light Use Efficiency
NNitrogen Content
SPCISpecific Leaf Nitrogen Content
NLTNitrogen Limitation Threshold
CCIChlorophyll Content Index
ANNArtificial Neural Network
RFRandom Forest
SVMSupport Vector Machine

References

  1. Sun, M.; Liu, J.; Li, J.; Huang, Y. Endophytic Bacterium Serratia plymuthica from Chinese Leek Suppressed Apple Ring Rot on Postharvest Apple Fruit. Front. Microbiol. 2022, 12, 802887. [Google Scholar] [CrossRef]
  2. Wang, F.; Ge, S.; Xu, X.; Xing, Y.; Du, X.; Zhang, X.; Lv, M.; Liu, J.; Zhu, Z.; Jiang, Y. Multiomics Analysis Reveals New Insights into the Apple Fruit Quality Decline under High Nitrogen Conditions. J. Agric. Food Chem. 2021, 69, 5559–5572. [Google Scholar] [CrossRef]
  3. Kviklys, D.; Viškelis, J.; Liaudanskas, M.; Janulis, V.; Laužikė, K.; Samuolienė, G.; Uselis, N.; Lanauskas, J. Apple Fruit Growth and Quality Depend on the Position in Tree Canopy. Plants 2022, 11, 196. [Google Scholar] [CrossRef]
  4. Zhang, C.; Zhu, X.; Li, M.; Xue, Y.; Qin, A.; Gao, G.; Wang, M.; Jiang, Y. Utilization of the Fusion of Ground-Space Remote Sensing Data for Canopy Nitrogen Content Inversion in Apple Orchards. Horticulturae 2023, 9, 1085. [Google Scholar] [CrossRef]
  5. Li, J.; He, P.; Jin, Q.; Chen, J.; Chen, D.; Dai, X.; Ding, S.; Chu, L. Aeration Alleviated the Adverse Effects of Nitrogen Topdressing Reduction on Tomato Root Vigor, Photosynthetic Performance, and Fruit Development. Plants 2024, 13, 1378. [Google Scholar] [CrossRef]
  6. Cheng, J.; Yang, H.; Qi, J.; Sun, Z.; Han, S.; Feng, H.; Jiang, J.; Xu, W.; Li, Z.; Yang, G.; et al. Estimating canopy-scale chlorophyll content in apple orchards using a 3D radiative transfer model and UAV multispectral imagery. Comput. Electron. Agric. 2022, 202, 107401. [Google Scholar] [CrossRef]
  7. Castillo-Girones, S.; Munera, S.; Martínez-Sober, M.; Blasco, J.; Cubero, S.; Gómez-Sanchis, J. Artificial Neural Networks in Agriculture, the core of artificial intelligence: What, When, and Why. Comput. Electron. Agric. 2025, 230, 109938. [Google Scholar] [CrossRef]
  8. Li, H.; Huang, C.; Zhang, Y.; Li, S.; Liu, Y.; Yang, K.; Lu, J. Estimating Maize Leaf Area Index Using Multi-Source Features Derived from UAV Multispectral Imagery and Machine Learning Models. Plants 2025, 14, 3534. [Google Scholar] [CrossRef] [PubMed]
  9. Zheng, F.; Lu, D.; Ou, W.; Tan, S.; Xu, X.; Zeng, S.; Xian, L. Adaptation Mechanisms of Understory Vegetation in Subtropical Plantations: Synergistic Drivers of Stand Spatial Structure and Soil Fertility. Plants 2025, 14, 3452. [Google Scholar] [CrossRef] [PubMed]
  10. Khateeb, I.; Kim, S.; Tachikawa, Y. Assessing the effectiveness of ANN model in spatial downscaling of d4PDF hourly precipitation data: A case study in Japan. J. Disaster Sci. Manag. 2025, 1, 3. [Google Scholar] [CrossRef]
  11. Zhao, C.; Tang, H. An Abnormal Monitoring Model for Symbiosis Monitoring Data of Ecological Environment Based on Density Clustering. Pol. J. Environ. Stud. 2025, 34, 6483–6498. [Google Scholar] [CrossRef]
  12. Simon, S.M.; Glaum, P.; Valdovinos, F.S. Interpreting random forest analysis of ecological models to move from prediction to explanation. Sci. Rep. 2023, 13, 3881. [Google Scholar] [CrossRef]
  13. Chen, H.; Niu, F.; Su, X.; Geng, T.; Liu, Z.; Li, Q. Initial Results of Modeling and Improvement of BDS-2/GPS Broadcast Ephemeris Satellite Orbit Based on BP and PSO-BP Neural Networks. Remote Sens. 2021, 13, 4801. [Google Scholar] [CrossRef]
  14. Xie, X.; Zhang, X.; Shen, J.; Du, K. Poplar’s Waterlogging Resistance Modeling and Evaluating: Exploring and Perfecting the Feasibility of Machine Learning Methods in Plant Science. Front. Plant Sci. 2022, 13, 821365. [Google Scholar] [CrossRef]
  15. Zhu, F.; Zhang, L.; Hu, X.; Zhao, J.; Meng, Z.; Zheng, Y. Research and Design of Hybrid Optimized Backpropagation (BP) Neural Network PID Algorithm for Integrated Water and Fertilizer Precision Fertilization Control System for Field Crops. Agronomy 2023, 13, 1423. [Google Scholar] [CrossRef]
  16. Gu, C.; Zou, W.; Wang, X.; Chen, L.; Zhai, C. Wind loss model for the thick canopies of orchard trees based on accurate variable spraying. Front. Plant Sci. 2022, 13, 1010540. [Google Scholar] [CrossRef] [PubMed]
  17. Jiang, H.; Li, X.; Tian, J.; Liu, H. Pre-Harvest Supplemental Blue Light Enhanced Antioxidant Activity of Flower Stalk in Chinese Kale during Storage. Plants 2021, 10, 1177. [Google Scholar] [CrossRef] [PubMed]
  18. An, W.; Wang, G.; Dou, J.; Zhang, Y.; Yang, Q.; He, Y.; Tang, Z.; Yu, J. Protective mechanisms of exogenous melatonin on chlorophyll metabolism and photosynthesis in tomato seedlings under heat stress. Front. Plant Sci. 2025, 16, 1519950. [Google Scholar] [CrossRef]
  19. Fakhri, L.A.; Ghanbarzadeh, B.; Falcone, P.M. New Healthy Low-Sugar and Carotenoid-Enriched/High-Antioxidant Beverage: Study of Optimization and Physicochemical Properties. Foods 2023, 12, 3265. [Google Scholar] [CrossRef]
  20. Kunter, B.; Unal, O.B.; Keskin, S.; Hatterman-Valenti, H.; Kaya, O. Comparison of the sugar and organic acid components of seventeen table grape varieties produced in Ankara (Türkiye): A study over two consecutive seasons. Front. Plant Sci. 2024, 15, 1321210. [Google Scholar] [CrossRef]
  21. Li, W.; Fang, H.; Wei, S.; Weiss, M.; Baret, F. Critical analysis of methods to estimate the fraction of absorbed or intercepted photosynthetically active radiation from ground measurements: Application to rice crops. Agric. For. Meteorol. 2021, 297, 108273. [Google Scholar] [CrossRef]
  22. Yan, L.; Liu, X.; Jing, X.; Geng, L.; Che, T.; Liu, L. Enhancing Leaf Area Index Estimation for Maize with Tower-Based Multi-Angular Spectral Observations. Sensors 2023, 23, 9121. [Google Scholar] [CrossRef]
  23. El Horri, H.; Vitiello, M.; Ceccanti, C.; Lo Piccolo, E.; Lauria, G.; De Leo, M.; Braca, A.; Incrocci, L.; Guidi, L.; Massai, R.; et al. Ultraviolet-to-Blue Light Conversion Film Affects Both Leaf Photosynthetic Traits and Fruit Bioactive Compound Accumulation in Fragaria × ananassa. Agronomy 2024, 14, 1491. [Google Scholar] [CrossRef]
  24. Wu, C.; Zhang, L.; Fu, Y.; Fu, G.; Fu, D.; Li, H.; Su, S.; Xie, H.; Tian, H.; Wang, R.; et al. Exogenous Salicylic Acid Alleviates Physiological Stress in Salix matsudana Seedlings and Increases 2,4-Dinitrophenol Removal. Forests 2024, 15, 1392. [Google Scholar] [CrossRef]
  25. Sun, X.; Tan, Y.; Zhang, Y.; Guo, W.; Li, X.; Golub, N.; Zhang, L.; Wang, H. Effects of salinity stress on morphological structure, physiology, and mRNA expression in different wheat (Triticum aestivum L.) cultivars. Front. Genet. 2025, 16, 1535610. [Google Scholar] [CrossRef] [PubMed]
  26. Vadez, V.; Pilloni, R.; Grondin, A.; Hajjarpoor, A.; Belhouchette, H.; Brouziyne, Y.; Chehbouni, G.; Kharrou, M.H.; Zitouna-Chebbi, R.; Mekki, I.; et al. Water use efficiency across scales: From genes to landscapes. J. Exp. Bot. 2023, 74, 4770–4788. [Google Scholar] [CrossRef]
  27. Pourdarbani, R.; Sabzi, S.; Rohban, M.H.; Hernández-Hernández, J.L.; Gallardo-Bernal, I.; Herrera-Miranda, I.; García-Mateos, G. One-Dimensional Convolutional Neural Networks for Hyperspectral Analysis of Nitrogen in Plant Leaves. Appl. Sci. 2021, 11, 11853. [Google Scholar] [CrossRef]
  28. Chen, X.; Zhang, H.; Teng, A.; Zhang, C.; Lei, L.; Ba, Y.; Wang, Z. Photosynthetic characteristics, yield and quality of sunflower response to deficit irrigation in a cold and arid environment. Front. Plant Sci. 2023, 14, 1280347. [Google Scholar] [CrossRef]
  29. Bates, J.S.; Montzka, C.; Schmidt, M.; Jonard, F. Estimating Canopy Density Parameters Time-Series for Winter Wheat Using UAS Mounted LiDAR. Remote Sens. 2021, 13, 710. [Google Scholar] [CrossRef]
  30. Zhang, Y.; Yang, J.; van Haaften, M.; Li, L.; Lu, S.; Wen, W.; Zheng, X.; Pan, J.; Qian, T. Interactions between Diffuse Light and Cucumber (Cucumis sativus L.) Canopy Structure, Simulations of Light Interception in Virtual Canopies. Agronomy 2022, 12, 602. [Google Scholar] [CrossRef]
  31. Bellos, E.; Tzivanidis, C. Concentrating Solar Collectors for a Trigeneration System—A Comparative Study. Appl. Sci. 2020, 10, 4492. [Google Scholar] [CrossRef]
  32. Wang, M.; Sun, R.; Zhu, A.; Xiao, Z. Evaluation and Comparison of Light Use Efficiency and Gross Primary Productivity Using Three Different Approaches. Remote Sens. 2020, 12, 1003. [Google Scholar] [CrossRef]
  33. Liu, Z.; Guo, P.; Liu, H.; Fan, P.; Zeng, P.; Liu, X.; Feng, C.; Wang, W.; Yang, F. Gradient Boosting Estimation of the Leaf Area Index of Apple Orchards in UAV Remote Sensing. Remote Sens. 2021, 13, 3263. [Google Scholar] [CrossRef]
  34. Wang, Y.; Wang, Y.; Tang, Y.; Zhu, X.-G. Stomata conductance as a goalkeeper for increased photosynthetic efficiency. Curr. Opin. Plant Biol. 2022, 70, 102310. [Google Scholar] [CrossRef]
  35. Ciampitti, I.; van Versendaal, E.; Rybecky, J.F.; Lacasa, J.; Fernandez, J.; Makowski, D.; Lemaire, G. A global dataset to parametrize critical nitrogen dilution curves for major crop species. Sci. Data 2022, 9, 277. [Google Scholar] [CrossRef]
  36. Luo, W.; Sandanayake, M.; Zhang, G.; Tan, Y. Construction Cost and Carbon Emission Assessment of a Highway Construction—A Case towards Sustainable Transportation. Sustainability 2021, 13, 7854. [Google Scholar] [CrossRef]
  37. Tan, B.; You, W.; Huang, C.; Xiao, T.; Tian, S.; Luo, L.; Xiong, N. An Intelligent Near-Infrared Diffuse Reflectance Spectroscopy Scheme for the Non-Destructive Testing of the Sugar Content in Cherry Tomato Fruit. Electronics 2022, 11, 3504. [Google Scholar] [CrossRef]
  38. Prono, L.; Bich, P.; Boretti, C.; Mangia, M.; Pareschi, F.; Rovatti, R.; Setti, G. A Multiply-And-Max/Min Neuron Paradigm for Aggressively Prunable Deep Neural Networks. IEEE Trans. Neural Netw. Learn. Syst. 2025, 36, 14414–14427. [Google Scholar] [CrossRef]
  39. Kumar, P.; Almeida, F.; Ar, A.; Al-Mdallal, Q. Artificial neural network model using Levenberg Marquardt algorithm to analyse transient flow and thermal characteristics of micropolar nanofluid in a microchannel. Partial Differ. Equ. Appl. Math. 2025, 13, 101061. [Google Scholar] [CrossRef]
  40. Moghrabi, I.A.R.; Hassan, B.A. An Efficient Limited Memory Multi-Step Quasi-Newton Method. Mathematics 2024, 12, 768. [Google Scholar] [CrossRef]
  41. Nadda, W.; Boonchieng, W.; Boonchieng, E. Influenza, dengue and common cold detection using LSTM with fully connected neural network and keywords selection. BioData Min. 2022, 15, 5. [Google Scholar] [CrossRef] [PubMed]
  42. Gąsiorowski, M.; Szymak, P.; Bychto, L.; Patryn, A. Application of Artificial Neural Networks in Analysis of Time-Variable Optical Reflectance Spectra in Digital Light Projection Spectroscopy. Coatings 2022, 12, 37. [Google Scholar] [CrossRef]
  43. Chen, Y.; Wu, J.; Zhang, Y.; Fu, L.; Luo, Y.; Liu, Y.; Li, L. Research on Hyperparameter Optimization of Concrete Slump Prediction Model Based on Response Surface Method. Materials 2022, 15, 4721. [Google Scholar] [CrossRef] [PubMed]
  44. Liu, W.; Liang, S.; Qin, X. Weighted p-norm distance t kernel SVM classification algorithm based on improved polarization. Sci. Rep. 2022, 12, 6197. [Google Scholar] [CrossRef]
  45. Aria, M.; Cuccurullo, C.; Gnasso, A. A comparison among interpretative proposals for Random Forests. Mach. Learn. Appl. 2021, 6, 100094. [Google Scholar] [CrossRef]
  46. Wright, L.G.; Onodera, T.; Stein, M.M.; Wang, T.; Schachter, D.T.; Hu, Z.; McMahon, P.L. Deep physical neural networks trained with backpropagation. Nature 2022, 601, 549–555. [Google Scholar] [CrossRef]
  47. Bekdaş, G.; Cakiroglu, C.; Islam, K.; Kim, S.; Geem, Z.W. Optimum Design of Cylindrical Walls Using Ensemble Learning Methods. Appl. Sci. 2022, 12, 2165. [Google Scholar] [CrossRef]
  48. Cai, J.; Hu, Y.; Peng, Y.; Guo, F.; Xiong, J.; Zhang, R. A hybrid MCDM approach based on combined weighting method, cloud model and COPRAS for assessing road construction workers’ safety climate. Front. Public Health 2024, 12, 1452964. [Google Scholar] [CrossRef]
  49. Chen, M.; Hernández, A. Towards an Explainable Model for Sepsis Detection Based on Sensitivity Analysis. IRBM 2022, 43, 75–86. [Google Scholar] [CrossRef]
  50. Duan, X.; Wang, Q.; Mu, W.; Wei, X. Optimization of irrigation and fertilization of apples under magnetoelectric water irrigation in extremely arid areas. Front. Plant Sci. 2024, 15, 1356338. [Google Scholar] [CrossRef] [PubMed]
  51. Castro, J.C.; Castro, C.G.; Cobos, M. Genetic and biochemical strategies for regulation of L-ascorbic acid biosynthesis in plants through the L-galactose pathway. Front. Plant Sci. 2023, 14, 1099829. [Google Scholar] [CrossRef] [PubMed]
  52. Santos, M.G.; Davey, P.A.; Hofmann, T.A.; Borland, A.; Hartwell, J.; Lawson, T. Stomatal Responses to Light, CO2, and Mesophyll Tissue in Vicia faba and Kalanchoë fedtschenkoi. Front. Plant Sci. 2021, 12, 740534. [Google Scholar] [CrossRef]
  53. Xu, H.; Watanabe, Y.; Ediger, D.; Yang, X.; Iritani, D. Characteristics of Sunburn Browning Fruit and Rootstock-Dependent Damage-Free Yield of AmbrosiaTM Apple after Sustained Summer Heat Events. Plants 2022, 11, 1201. [Google Scholar] [CrossRef]
  54. Cao, X.; Li, W.; Wang, P.; Ma, Z.; Mao, J.; Chen, B. New Insights into MdSPS4-Mediated Sucrose Accumulation under Different Nitrogen Levels Revealed by Physiological and Transcriptomic Analysis. Int. J. Mol. Sci. 2022, 23, 16073. [Google Scholar] [CrossRef]
  55. Jiang, N.; Yang, Z.; Luo, J.; Wang, C. Quantifying Chilling Injury on the Photosynthesis System of Strawberries: Insights from Photosynthetic Fluorescence Characteristics and Hyperspectral Inversion. Plants 2023, 12, 3138. [Google Scholar] [CrossRef]
  56. Song, T.; Guo, L.; Sun, Q.; Gao, G.; Chen, J.; Zhang, Q. Inversion and validation of soil water-holding capacity in a wild fruit forest, using hyperspectral technology combined with machine learning. Sci. Rep. 2025, 15, 26244. [Google Scholar] [CrossRef] [PubMed]
  57. Knyazeva, I.V.; Panfilova, O.; Vershinina, O.; Smirnov, A.A.; Dorokhov, A.S.; Kahramanoğlu, I. The Effect of Nighttime LED Lighting on Tomato Growth, Yield, and Nutrient Content of Fruits. Horticulturae 2024, 10, 1259. [Google Scholar] [CrossRef]
  58. Burbidge, C.A.; Ford, C.M.; Melino, V.J.; Wong, D.C.J.; Jia, Y.; Jenkins, C.L.D.; Soole, K.L.; Castellarin, S.D.; Darriet, P.; Rienth, M.; et al. Biosynthesis and Cellular Functions of Tartaric Acid in Grapevines. Front. Plant Sci. 2021, 12, 643024. [Google Scholar] [CrossRef]
  59. Dadon, Z.; Rav Acha, M.; Orlev, A.; Carasso, S.; Glikson, M.; Gottlieb, S.; Alpert, E.A. Artificial Intelligence-Based Left Ventricular Ejection Fraction by Medical Students for Mortality and Readmission Prediction. Diagnostics 2024, 14, 767. [Google Scholar] [CrossRef] [PubMed]
  60. Jung, M.; Quesada-Traver, C.; Roth, M.; Aranzana, M.J.; Muranty, H.; Rymenants, M.; Guerra, W.; Holzknecht, E.; Pradas, N.; Lozano, L.; et al. Integrative multi-environmental genomic prediction in apple. Hortic. Res. 2025, 12, uhae319. [Google Scholar] [CrossRef]
  61. Wu, C.; Chen, D.; Xia, G.; Sun, X.; Zhang, S. Response Characteristics of Photosynthetic Productivity to the Canopy Spatial Distribution Pattern of Larix kaempferi. Forests 2023, 14, 1171. [Google Scholar] [CrossRef]
  62. Ouyang, Z.; Tian, J.; Yan, X.; Yang, Z. Micro-nano oxygenated irrigation improves the yield and quality of greenhouse cucumbers under-film drip irrigation. Sci. Rep. 2023, 13, 19453. [Google Scholar] [CrossRef] [PubMed]
  63. Wang, Z.; Yang, H.; Ma, Y.; Jiang, G.; Mei, X.; Li, X.; Yang, Q.; Kan, J.; Xu, Y.; Yang, T.; et al. WGCNA analysis revealing molecular mechanism that bio-organic fertilizer improves pear fruit quality by increasing sucrose accumulation and reducing citric acid metabolism. Front. Plant Sci. 2022, 13, 1039671. [Google Scholar] [CrossRef] [PubMed]
Figure 1. BP Neural Network Flowchart.
Figure 1. BP Neural Network Flowchart.
Plants 14 03795 g001
Figure 2. Correlation between apple fruit quality and physiological structure indicators.
Figure 2. Correlation between apple fruit quality and physiological structure indicators.
Plants 14 03795 g002
Figure 3. Analysis of results from the 18-5-1 structured Back Propagation (BP) neural network model with the trainlm training function: (A) shows the linear fitting analysis between measured values and predicted values in the Training set; (B) presents the linear fitting analysis between measured values and predicted values in the Validation set; (C) displays the visual analysis of measured values versus predicted values for Vc content across various samples.
Figure 3. Analysis of results from the 18-5-1 structured Back Propagation (BP) neural network model with the trainlm training function: (A) shows the linear fitting analysis between measured values and predicted values in the Training set; (B) presents the linear fitting analysis between measured values and predicted values in the Validation set; (C) displays the visual analysis of measured values versus predicted values for Vc content across various samples.
Plants 14 03795 g003
Figure 4. Results analysis of the 18-9-1 structured Back Propagation (BP) neural network model with trainbfg training function: (A) shows the linear fitting analysis between measured values and predicted values in the Training set; (B) presents the linear fitting analysis between measured values and predicted values in the Validation set; (C) displays the visual analysis of measured values versus predicted values for titratable acid content in each sample.
Figure 4. Results analysis of the 18-9-1 structured Back Propagation (BP) neural network model with trainbfg training function: (A) shows the linear fitting analysis between measured values and predicted values in the Training set; (B) presents the linear fitting analysis between measured values and predicted values in the Validation set; (C) displays the visual analysis of measured values versus predicted values for titratable acid content in each sample.
Plants 14 03795 g004
Figure 5. Analysis of results for the 18-5-1 structured Back Propagation (BP) neural network model with trainlm training function: (A) shows the linear fitting analysis between measured values and predicted values in the Training set; (B) presents the linear fitting analysis between measured values and predicted values in the Validation set; (C) displays the visual analysis of measured values versus predicted values for titratable acid content across all samples.
Figure 5. Analysis of results for the 18-5-1 structured Back Propagation (BP) neural network model with trainlm training function: (A) shows the linear fitting analysis between measured values and predicted values in the Training set; (B) presents the linear fitting analysis between measured values and predicted values in the Validation set; (C) displays the visual analysis of measured values versus predicted values for titratable acid content across all samples.
Plants 14 03795 g005
Figure 6. Analysis of results from the Back Propagation (BP) neural network model with 18-6-1 structure using trainbfg training function: (A) shows the linear fitting analysis between measured values and predicted values in the Training set; (B) presents the linear fitting analysis between measured values and predicted values in the Validation set; (C) displays the visual analysis of measured values versus predicted values for titratable acid content across samples.
Figure 6. Analysis of results from the Back Propagation (BP) neural network model with 18-6-1 structure using trainbfg training function: (A) shows the linear fitting analysis between measured values and predicted values in the Training set; (B) presents the linear fitting analysis between measured values and predicted values in the Validation set; (C) displays the visual analysis of measured values versus predicted values for titratable acid content across samples.
Plants 14 03795 g006
Figure 7. Visualization of feature importance: (A) shows the feature visualization for the Vc prediction model; (B) shows the feature visualization for the titratable acid prediction model; (C) shows the feature visualization for the soluble saccharides prediction model; (D) shows the feature visualization for the sugar-acid ratio prediction model.
Figure 7. Visualization of feature importance: (A) shows the feature visualization for the Vc prediction model; (B) shows the feature visualization for the titratable acid prediction model; (C) shows the feature visualization for the soluble saccharides prediction model; (D) shows the feature visualization for the sugar-acid ratio prediction model.
Plants 14 03795 g007
Figure 8. Single-factor sensitivity analysis: (A) shows the single-factor sensitivity analysis of the Vc prediction model; (B) shows the single-factor sensitivity analysis of the titratable acid prediction model; (C) shows the single-factor sensitivity analysis of soluble saccharides; (D) shows the single-factor sensitivity analysis of the sugar-acid ratio prediction model.
Figure 8. Single-factor sensitivity analysis: (A) shows the single-factor sensitivity analysis of the Vc prediction model; (B) shows the single-factor sensitivity analysis of the titratable acid prediction model; (C) shows the single-factor sensitivity analysis of soluble saccharides; (D) shows the single-factor sensitivity analysis of the sugar-acid ratio prediction model.
Plants 14 03795 g008
Figure 9. Feature importance assessment and feature selection based on ablation experiments: (A) shows the single-factor sensitivity analysis and ablation experiment results for the Vc prediction model; (B) shows the single-factor sensitivity analysis and ablation experiment results for the titratable acid prediction model; (C) shows the single-factor sensitivity analysis and ablation experiment results for the soluble saccharides prediction model; (D) shows the single-factor sensitivity analysis and ablation experiment results for the sugar-acid ratio prediction model; (E) presents the intersection analysis between the single-factor sensitivity analysis results and the features used in modeling for the Vc prediction model; (F) presents the intersection analysis between the single-factor sensitivity analysis results and the features used in modeling for the titratable acid prediction model; (G) presents the intersection analysis between the single-factor sensitivity analysis results and the features used in modeling for the soluble saccharides prediction model; (H) presents the intersection analysis between the single-factor sensitivity analysis results and the features used in modeling for the sugar-acid ratio prediction model.
Figure 9. Feature importance assessment and feature selection based on ablation experiments: (A) shows the single-factor sensitivity analysis and ablation experiment results for the Vc prediction model; (B) shows the single-factor sensitivity analysis and ablation experiment results for the titratable acid prediction model; (C) shows the single-factor sensitivity analysis and ablation experiment results for the soluble saccharides prediction model; (D) shows the single-factor sensitivity analysis and ablation experiment results for the sugar-acid ratio prediction model; (E) presents the intersection analysis between the single-factor sensitivity analysis results and the features used in modeling for the Vc prediction model; (F) presents the intersection analysis between the single-factor sensitivity analysis results and the features used in modeling for the titratable acid prediction model; (G) presents the intersection analysis between the single-factor sensitivity analysis results and the features used in modeling for the soluble saccharides prediction model; (H) presents the intersection analysis between the single-factor sensitivity analysis results and the features used in modeling for the sugar-acid ratio prediction model.
Plants 14 03795 g009
Table 1. Data on fruit quality indicators and physiological structure indicators used in the artificial neural network model.
Table 1. Data on fruit quality indicators and physiological structure indicators used in the artificial neural network model.
Parameter MetricsMaximumMinimumAverageSDCV
Vc (mg/100 g)0.900.420.730.0416.75
TA (%)15.410.290.872.230.39
SS (%)16.906.3211.152.215.04
SS/TA51.850.7028.989.922.92
PAR (μmol·m−2·s−1)1135.00100.50516.87243.792.12
T0.930.060.480.212.35
LAI (Leaf Area Index)5.130.281.901.051.82
Fb1.030.060.590.183.37
Pn (μmol·m−2·s−1)15.206.439.811.745.65
Tr (mmol·m−2·s−1)6.230.713.720.745.05
Gs (mmol·m−2·s−1)322.13106.20226.7747.094.82
Ci (μmol·mol−1)379.30252.84308.5619.7015.66
WUE (μmol·mol−1)21.471.652.891.781.62
N (%)2.851.672.220.1713.08
k1.540.060.480.232.06
FIR (%)94.407.3051.6120.592.51
DUE (μmol·m−2·s−1)164.418.8327.7024.311.14
SUE (μmol·m−2·s−1)0.120.010.030.021.39
Ls0.370.050.230.054.64
SPCI (mmol·μmol−1·g−1)6.471.283.270.963.39
NLT (μmol·mmol−1·g−1)880.0370.67130.6177.911.68
CCI (mmol·μmol−1·g−1)1.740.070.670.242.77
Table 2. Best Model Metrics for Different Numbers of Hidden Layers in VC.
Table 2. Best Model Metrics for Different Numbers of Hidden Layers in VC.
ModelTraining FunctionTraining SetValidation Set
R2MAEMBEMAPER2MAEMBEMAPE
18-5-1trainlm0.840.00750.00020.01000.870.00870.00140.0123
18-6-1trainbfg0.830.00820.00050.01120.830.01060.00270.0142
18-7-1trainlm0.890.00730.00070.00990.830.01190.00170.0164
18-8-1trainlm0.840.01040.00380.01430.860.00960.00450.0131
18-9-1trainlm0.870.00640.00280.00870.760.00970.00390.0126
18-10-1traincgb0.850.00980.00020.01320.740.01200.00060.0166
18-11-1trainlm0.830.00880.00410.01210.840.01020.00490.0138
18-12-1trainlm0.820.00980.00140.01330.700.01440.00420.0195
18-13-1trainlm0.820.00870.00160.01180.810.01050.00520.0142
18-14-1trainbfg0.790.01130.00080.01560.730.01120.00170.0147
Table 3. Best Model Metrics for Titratable Acid at Different Numbers of Hidden Layers.
Table 3. Best Model Metrics for Titratable Acid at Different Numbers of Hidden Layers.
ModelTraining FunctionTraining SetValidation Set
R2MAEMBEMAPER2MAEMBEMAPE
18-5-1traingdx0.800.01430.00010.03910.730.01660.00050.0445
18-10-1trainlm0.860.01080.00050.02950.810.01300.00450.0336
18-7-1traincgb0.880.01020.00010.02770.780.01190.00210.0318
18-8-1trainlm0.860.01130.00430.02980.820.01100.00080.0301
18-9-1trainbfg0.850.01100.00040.02990.860.01180.00100.0320
18-10-1trainscg0.850.01170.00010.03150.790.01500.00100.0407
18-11-1trainbfg0.840.01160.00030.03170.790.01440.00410.0368
18-12-1trainlm0.860.01060.00170.02900.860.01280.00450.0340
18-13-1trainlm0.860.01190.00140.03170.790.01410.00010.0388
18-14-1trainlm0.830.01210.00830.03310.710.01510.00170.0422
18-15-1trainlm0.850.01180.00070.03190.820.01360.00110.0365
Table 4. Best model metrics for soluble saccharides under different hidden layer numbers.
Table 4. Best model metrics for soluble saccharides under different hidden layer numbers.
ModelTraining FunctionTraining SetValidation Set
R2MAEMBEMAPER2MAEMBEMAPE
18-5-1trainlm0.860.50410.04900.04550.860.61800.03070.0597
18-3-1traincgb0.840.65010.05060.05830.850.65670.13370.0610
18-7-1trainscg0.880.57030.00720.05020.730.75870.07620.0681
18-8-1trainlm0.870.61310.03720.05590.710.83510.13620.0761
18-9-1trainlm0.850.61430.10110.05420.780.74430.22340.0632
18-10-1trainlm0.830.66290.03190.05890.830.75060.06070.0716
18-11-1trainlm0.830.62070.05730.05490.800.72010.08460.0636
18-12-1trainlm0.830.53230.05170.04720.770.77080.04620.0669
18-13-1trainlm0.850.55090.04840.04810.800.74380.11910.0654
18-14-1trainlm0.820.67660.01730.05950.800.65290.31810.0652
18-15-1trainlm0.870.52560.11290.04760.830.63070.01020.0572
Table 5. Best model metrics for sugar-acid ratio under different numbers of hidden layers.
Table 5. Best model metrics for sugar-acid ratio under different numbers of hidden layers.
ModelTraining FunctionTraining SetValidation Set
R2MAEMBEMAPER2MAEMBEMAPE
18-5-1trainbfg0.821.87730.02080.05940.851.58280.53260.0530
18-6-1trainbfg0.821.77940.06040.05490.891.30470.32200.0462
18-7-1trainlm0.851.66570.21460.05410.742.26260.39760.0727
18-8-1trainbfg0.801.61310.04840.05010.851.87430.06690.0617
18-9-1trainscg0.841.81620.17420.05840.771.93370.05280.0607
18-10-1trainlm0.791.98170.04510.06330.842.27910.46350.0731
18-11-1traincgb0.841.63630.01600.04950.851.53340.22380.0554
18-12-1trainscg0.841.60810.00430.05200.811.71870.03430.0533
18-13-1traincgb0.811.76390.03300.05570.881.78140.42170.0586
18-14-1trainlm0.841.58900.19550.05110.801.98450.55220.0625
18-15-1trainlm0.821.81660.63250.05650.791.89080.15620.0630
Table 6. Comparison between the feature refinement model and the original model.
Table 6. Comparison between the feature refinement model and the original model.
Parameter MetricsModel ConfigurationDatasetR2MAEMBEMAPE
Vc4-Feature Optimization ModelTraining Set0.850.0070.00030.0095
Validation Set0.930.00590.00020.008
8-5-1 Original ModelTraining Set0.840.00750.00020.01
Validation Set0.870.00870.00140.0123
TA4-Feature Optimization ModelTraining Set0.900.00850.00020.022
Validation Set0.910.0090.00030.024
18-9-1 Original ModelTraining Set0.850.0110.00040.0299
Validation Set0.860.01180.0010.032
SS5-Feature Optimization ModelTraining Set0.910.4210.02010.037
Validation Set0.900.46250.0150.048
18-5-1 original modelTraining Set0.860.50410.0490.0455
Validation Set0.860.6180.03070.0597
SS/TA5-Feature Optimization ModelTraining Set0.881.350.0350.045
Validation Set0.900.900.0180.035
18-6-1 Original ModelTraining Set0.821.77940.06040.0549
Validation Set0.891.30470.3220.0462
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Zeng, J.; Yu, M.; Chen, Y.; Li, X.; Bao, J.; Pu, X. A Precise Apple Quality Prediction Model Integrating Driving Factor Screening and BP Neural Network. Plants 2025, 14, 3795. https://doi.org/10.3390/plants14243795

AMA Style

Zeng J, Yu M, Chen Y, Li X, Bao J, Pu X. A Precise Apple Quality Prediction Model Integrating Driving Factor Screening and BP Neural Network. Plants. 2025; 14(24):3795. https://doi.org/10.3390/plants14243795

Chicago/Turabian Style

Zeng, Junkai, Mingyang Yu, Yan Chen, Xin Li, Jianping Bao, and Xiaoqiu Pu. 2025. "A Precise Apple Quality Prediction Model Integrating Driving Factor Screening and BP Neural Network" Plants 14, no. 24: 3795. https://doi.org/10.3390/plants14243795

APA Style

Zeng, J., Yu, M., Chen, Y., Li, X., Bao, J., & Pu, X. (2025). A Precise Apple Quality Prediction Model Integrating Driving Factor Screening and BP Neural Network. Plants, 14(24), 3795. https://doi.org/10.3390/plants14243795

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop