Next Article in Journal
Intraseasonal Oscillation Features of the Two Types of Persistent High Temperature Events over Jiangnan Region
Next Article in Special Issue
Hydrometeorological Hazards on Crop Production in the State of Veracruz, Mexico
Previous Article in Journal
Production of Biocoal from Wastewater Sludge and Sugarcane Bagasse: A Review
Previous Article in Special Issue
Estimation and Assessment of the Root Zone Soil Moisture from Near-Surface Measurements over Huai River Basin
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Effects of Meteorological Factors on Apple Yield Based on Multilinear Regression Analysis: A Case Study of Yantai Area, China

Yantai Institute, China Agricultural University, Yantai 264670, China
*
Author to whom correspondence should be addressed.
Atmosphere 2023, 14(1), 183; https://doi.org/10.3390/atmos14010183
Submission received: 11 November 2022 / Revised: 11 January 2023 / Accepted: 12 January 2023 / Published: 15 January 2023

Abstract

:
Evaluating the impact of different meteorological conditions on apple yield and predicting the future yield in Yantai City is essential for production. Furthermore, it provides a scientific basis for the increase in apple yield. In this study, first, a grey relational analysis (GRA) was used to determine the quantitative relationship between different meteorological factors and meteorological yield which is defined as affected only by meteorological conditions. Then, the comprehensive meteorological factors extracted by a principal component analysis (PCA) were used as inputs for multiple linear regression (MLR). The apple yield accuracy was compared with the lasso regression prediction. Trend analysis showed that the actual apple yield increased annually, but the meteorological yield decreased annually over a long time. Correlation ranking illustrated that the meteorological yield was significantly correlated with the frost-free period, the annual mean temperature, the accumulated temperature above 10 °C, etc. The good consistency between GRA and MLR–PCA showed that the accumulated temperature above 10 °C, the March–October mean temperature, and the June–August mean temperature are key meteorological factors. In addition, it was found that the principal components F 2 ,   F 4 , and F 5 were negatively correlated with meteorological yield, while the principal components F 1 and F 3 were positively correlated with meteorological yield. Moreover, the MLR–PCA model predicted the apple yield in 2020 as 47.256 t·ha−1 with a 7.089% relative error. This work demonstrates that the principal component regression model can effectively extract information about different meteorological factors and improve the model’s accuracy for analyzing key meteorological factors and predicting apple yield.

1. Introduction

Climate change affects fruit production in some regions in China and worldwide by changing agricultural climate conditions [1], directly affecting fruit yields and quality [2]. Meanwhile, China is the largest apple producer in the world with its northern regions contributing more than 70 percent of the country’s apple yield [3,4]. Seasonal climate change is evident and has led to limitations in apple yield in northern China [5]. Some studies have indicated that climate change, including global warming and frost disasters, might induce tremendous decreases in fruit yield. A recent report by Benlloch-González et al. [6] showed that every 4 °C increased in ambient temperature resulted in a reduction in fruit setting and a significant decrease in fruit yield. Zhu et al. [7] discovered that the Loess Plateau area had experienced frost disasters during the apple flowering period, leading to a significant decline and even extinction of apple yield. Cui et al. [8] employed different El Niño southern oscillation patterns to reveal La Niña years promoted apple yield, while El Niño years inhibited apple yield. Many studies have shown that meteorological factors such as accumulated temperature, precipitation, and frost can affect the growth and yield of apples [9,10]. Sen et al. [11] found that minimum temperatures in January, February, and November, rainfall in December, and maximum temperatures in March and October were important factors affecting apple yield. However, studies have shown that appropriate measures can increase fruit production under trends of future climate change [12,13,14], especially at middle and high latitudes [15,16]. Apple yield is affected by a variety of meteorological factors. Therefore, identifying the varied characteristics of the different meteorological factors and their impact on apple yield is of great significance for the scientific development of agricultural management measures to ensure stable and increased apple yield.
At present, scholars in different countries have carried out relevant studies on the influence of climate change on apple yield and quality. However, most of the research focuses on the influence of meteorological factors on apple yield in a certain phenological period, ignoring the influence of meteorological factors throughout the whole growing period of apples. Unterberger et al. [17] applied the combination of the phenological series model and Austrian climate prediction to show that spring blocking would seriously affect apples’ flowering and fruit setting rates. Delgado et al. [18] researched the effect of climate change on apple phenology in northwest Spain and demonstrated that climate change could affect the start date of internal dormancy and offset the advancing effect of phenology. Funes et al. [19] manipulated the dynamic model to study the meteorological factors affecting the flowering period in the downstream of the Fluvia watershed, revealing the cold and heat demand of crops during the flowering period and improving the credibility of the prediction of the flowering period. Che et al. [20] studied the effect of climate change on apple yield in each phenological period in the Longdong region. Jing et al. [21] applied an HP filter combined with a BP neural network to study the effect of meteorological factors on early apple yield in Yuncheng City. The overall increase in research has led to significant achievements in research outcomes. However, most of the previous studies have used meteorological data directly and there is a need for more analysis of the degree of coverage between data. Qu et al. [22] used grey relational analysis to assess the relationship between apple quality and climatic factors in different apple-producing areas in Shaanxi. Li et al. [10] identified the relationship between climate factors, circulation indicators, and apple yield in China’s major apple-producing areas. Demestihas et al. [23] developed a simulation model to study the collaborative services among multiple ecosystems in apple orchards that provide a sustainable system. Yao et al. [24] utilized a sigmoid curve to simulate the relationship between apple quality and yield and meteorological factors.
Currently, simulation models, including the planting density model, crop spatial distribution model, remote sensing simulation model, and evapotranspiration model, are widely applied in the study of crop spatial responses to climate change [25,26,27,28]. However, the relevant models depend highly on data acquisition precision, resolution, experimental area, model parameter adjustment, and their application scope. Therefore, these model algorithms need to be selected according to different regions, and at the same time, the model’s parameters need to be constantly adjusted [29,30,31]. Due to the limitations of the above common models, appropriate indicators should be selected to quantify the climate resources that have eliminated soil and fertilization effects. Additionally, indicators are based on expert opinions and literature reviews in agricultural meteorology and ecology, including temperature, precipitation, accumulated temperature, and humidity. For example, the GLAM model was employed by Challinor et al. to quantify the impact of high-temperature threshold exceedance on crops [32]. Based on factors of temperature, precipitation, frost, and soil humidity, López-Morales et al. [33] presented an Internet of Things (IoT) architecture, and usage of this architecture to improve decision making in the agricultural industry using meteorological information. Furthermore, Rosbakh et al. [34] applied climate data to study how Siberian plants shifted their phenology under climate change. This method is easy to apply in terms of expert evaluation and literature review. However, due to the differences in study areas and the subjectivity of index selection, it is necessary to introduce statistical methods and detect the coverage degree of indicators, making the indicators’ selection more scientific and reliable.
As the main apple-producing area in China, the apple yield in Shandong Province accounted for 21.6% of the total apple yield in 2020 [35]. However, the spatial distribution of Shandong apple production is uneven. The main production areas are mostly concentrated in the eastern coastal areas of Shandong Province, led by Yantai city, which was rated as the third batch of agricultural characteristics of the advantageous areas [36].At the same time, the Yantai apple has been awarded the title of “the best brand in China’s fruit industry” for thirteen consecutive years [37]. Therefore, the sustainable and healthy development of the Yantai apple industry plays an essential role in leading the progress of the Shandong apple industry. In this study, it is hypothesized that:
(i)
For non-stationary yield data, they are thought to be near stationary over an appropriate small interval (5 years is chosen as a small interval in this paper);
(ii)
The collinearity of different meteorological factors requires dimensionality reduction;
(iii)
There is a multivariate linear relationship between the separated obtained meteorological yields and the meteorological factors.
This work aims to investigate the effects of different meteorological factors on apple yield by analyzing meteorological and yield data for thirteen consecutive years in Yantai, Shandong Province. On the basis of clarifying the changing trend of the meteorological yield of apples, the multiple linear regression based on principal component analysis (MLR–PCA) method is used to eliminate the problem of multicollinearity. This work hopes to explore the key meteorological factors affecting meteorological yield and forecast apple yield. The aim of this study is to provide a scientific basis for coping with climate change, improving agricultural production and management patterns, and ensuring increased apple yields.

2. Materials and Methods

2.1. Study Area

Yantai is located in the northeast of Shandong Province, bordering the Bohai Sea and the Yellow Sea, with a typical temperate continental monsoon climate. The terrain is mostly low and hilly [38], with the hilly area accounting for 76.32% of the total area (Figure 1). The mean annual temperature of the study area is 12.9 °C, the annual precipitation is 661 mm, the frost-free period is 227.7 days, the accumulated temperature above 10 °C is 4567.4 °C/ a , and the relative humidity in July is 75.7%. The favorable climate and geographical environment in this area has created the famous “Apple Capital of China” [39].

2.2. Data

Long-term monthly climate data, gathered from eight meteorological stations in Yantai City, Shandong Province, China, from 1999 to 2019, were obtained from the Shandong Bureau of Statistics [40]. These data were organized in a database containing twelve indicators (Table 1) obtained from existing research experience and local conditions [5,41,42]. Statistics including corn yield and acreage data, from 1999 to 2019, were obtained from the Shandong Bureau of Statistics.
Since the statistical yearbook of the Shandong Province only gives the municipal-level yield, it is not realistic to study the eight meteorological stations separately. Therefore, we adopted the equal-weighted average method [43] to convert the original data of eight meteorological stations into meteorological data.

2.3. Research Methods

2.3.1. Yield Separation Model

Social and natural factors mainly influence the apple yield. In the long-term production time series of apples, the influence of social factors is reflected in the improvement of production capacity caused by the strengthening of science and technology. The fluctuation of apple yield caused by the improvement of productivity levels is called the trend yield. The influence of natural factors is mainly manifested in the variation of apple yield caused by the difference in interannual meteorological conditions. The fluctuation of apple yield caused by changes in the meteorological conditions is called meteorological yield. Meanwhile, the variation of apple yield that is caused by other factors is called random yield. Therefore, apple yield can be further classified into trend yield, meteorological yield, and error yield [9]. The following equation can explain yield separation:
y = y t + y c + ε
where y is the actual yield (t·ha−1), y t is trend yield (t·ha−1), y c is the meteorological yield (t·ha−1), and ε is the error yield (t·ha−1) which is usually negligible.
To separate the trend yield sequence from the actual yield, a five-year moving average method, also known as 5a sliding average, was applied to fit the trend yield [44,45]. The calculation of 5a sliding average is shown by the following equation:
y i = a i t + b i   ( i = n K + 1 )  
where i denotes the number of equations, K denotes the moving step length, n denotes the number of years, and t denotes time.
The moving average included the four preceding years and the year of interest, and was calculated as follows:
y ^ t ( t ) = 1 5 i 4 i y i ( t )
where y i ( t ) denotes the value in the year of interest and y ^ t ( t ) denotes the five-year moving average of the year of interest.

2.3.2. Screening Model for Key Meteorological Factors

The meteorological yield of apples was taken as the parent sequence { x 0 ( k ) } . Each meteorological factor is taken as the sub-sequence { x i ( k ) } . k represents the number of samples. Calculating the correlation coefficient of { x i ( k ) } and { x 0 ( k ) } as:
ξ i ( k ) = min i min k Δ i ( k ) + ρ max i max k Δ i ( k ) Δ i ( k ) + ρ max i max k Δ i ( k )
where Δ i ( k ) = | x i ( k ) x 0 ( k ) | is the absolute difference between { x i ( k ) } and { x 0 ( k ) } at the k-th term at i-th point. ρ represents resolution factor, generally ρ = 0.5.

2.3.3. Quantifying the Relationship between Meteorological Factors and Yield

Multiple linear regression based on the principal component analysis model (MLR–PCA) uses indicators obtained from a PCA as the inputs. A PCA adopts the idea of dimensionality reduction to simplify multiple indicators into a few representative comprehensive indicators that can reflect most of the information, so it avoids the possible multicollinearity problem among multiple indicators [46,47].
(1)
To eliminate the effect of dimension of different meteorological factors and apple yield by Z-Score standardization. The following equation can explain the standardized calculation:
D i = x i x ¯ i S i
where x ¯ i is the mean value of the i-th data set, S i is the standard deviation of the i-th data set.
(2)
Correlation analysis is a method used to calculate the degree of correlation between two variables. The correlation between each meteorological factor and yield and each meteorological factor was determined by correlation analysis. The correlation coefficient R was defined as:
R = n i = 1 n x i y i i = 1 n x i i = 1 n y i n i = 1 n x i 2 i = 1 n ( x i ) 2 n i = 1 n y i 2 i = 1 n ( y i ) 2
where n is the number of meteorological factor and yield, and x i , y i are two sets of data to be judged.
(3)
Multicollinearity refers to a linear regression model in which the estimation of the model is distorted or difficult to estimate accurately due to the high correlation between the explanatory variables. To avoid model distortion, variance inflation factor analysis (VIF) is applied to test for multicollinearity through the equation:
V I F m = 1 1 R 1 ~ k \ m 2
where the maximum V I F m value in excess of 10 is often taken as an indication that multicollinearity may be unduly influencing the least square estimate.
(4)
KMO test statistic and Bartlett spherical test are adopted to judge the correlation between indicators and determine whether the variables are suitable for a principal component analysis. Assuming that there are n groups of samples and P indicators, a standard matrix V of size N × P can be formed. Then, the first, second, … i-th (i < n) principal components corresponding to the eigenvalues with a cumulative contribution of more than 85% are generally taken. The expression of the principal component is calculated through Equation (8).
[ F 1 F 2 F i ] = [ c 11 c 12 c 1 n c 21 c 22 c 2 n c i 1 c i 2 c i n ] [ v 1 v 2 v n ]
where F i is the i-th PC of variables, c i n is the loading coefficient that indicates how much the n-th variables participate in defining F i , and v n is the standardized variable.
(5)
Let y be the dependent variable and F 1 ,     F 2 ... F k be the principal component of n independent variables, then the expression is calculated through equation:
y = β 0 + β 1 F 1 + β 2 F 2 + + β n F n
where β n represents the regression coefficient.

2.3.4. The Trend Yield Forecast Model

Harmonic weight prediction is an algorithm for trend yield extension [48,49,50]. The specific steps are as follows:
w t + 1 = y c , t + 1 y c , t
where, y c , t is the trend yield of the t-th year. y c , t + 1 is the trend yield of next year. w t + 1 is the annual increase of the trend yield.
w = t = 1 n 1 c t + 1 · w t + 1
where, c t + 1 is the harmonic weight coefficient, and c t + 1 is calculated according to the following Equations (12) and (13):
c t + 1 = m t + 1 n 1 ( t = 1 , 2 , 3 , , n 1 )
m t + 1 = m t + 1 n t   ( m 1 = 0 )
Then, the prediction of trend yield is calculated by Equation (14).
y ^ c , n + 1 = y c , n + w

2.3.5. Comparison of Prediction Accuracy

To verify the reliability of the model constructed in this paper, the meteorological output in 2021 was predicted and compared with the results predicted by the lasso regression model [51]. Lasso regression avoids the distortion of the regression results caused by multicollinearity and endogeneity between variables in the regression process by rapidly compressing the coefficients of nonimportant explanatory variables to zero. The objective function of lasso regression can be expressed as:
L ( b ) = ( y X b ) 2 + η b 1 = ( y X b ) 2 + η | b |
where η b 1 is the penalty term of the function, η is the penalty coefficient, and b 1 is the regularization of regression coefficient b and represents the sum of the absolute values of all regression coefficients.

3. Results and Analysis

3.1. Meteorological Yield Separation

The 5a sliding average method was selected to calculate the trend yield [7,43,44]. The trend yield of apples from 1999 to 2019 was counted. Then, the meteorological yield was separated from the actual yield. The results are shown in Figure 2.
As shown in Figure 2, the actual yield of apples demonstrates a steady upward trend, in particular, since 2014. The apple yield shows an increasing annual trend, which may be related to the development and progress of field management measures, seed selection and seedling raising methods, etc. The meteorological yield shows a declining annual trend, which may be related to meteorological factors caused by severe global climate change and the increasing greenhouse effect in recent years. Meanwhile, as shown in Figure 2, in the early years (before 2014), the meteorological yield was stable and near the 0 t· ha−1 line. While in the years after 2014, the meteorological yield was significantly less than 0 t· ha−1, which would seriously hinder the growth of apple yield. These data suggest that orchard managers have a low utilization rate of local climate resources and may not consider climate change when making apple production plans.

3.2. Screening of Key Meteorological Factors

The KMO statistic test is required before a principal component analysis is used, and the KMO statistic was 0.422 < 0.600, which is not suitable for variable importance ranking using a principal component analysis [52]. In previous studies by Li et al. [10] and Shen et al. [53], GRA was utilized to screen key meteorological factors. The data of Yantai City from 1999 to 2019 were standardized, and the GRA was used to calculate each index’s correlation coefficients in the sub-sequence and the parent sequence. Combined with Equation (4), the average value of the correlation coefficient of each meteorological factor was taken as the grey relational coefficient between each meteorological factor and the meteorological yield.
r i = 1 n k = 1 n ξ i ( k )   ( k = 1 , 2 , 3 , n )
GRA was conducted on twelve meteorological factors and meteorological yield to obtain the grey correlation coefficient. The results are shown in Figure 3.
The correlation degree of different meteorological factors was sorted, and the ranking order represents the impact degree of different meteorological factors on apple meteorological yield. Figure 3 illustrates that the twelve meteorological factors affecting apple meteorological yield were sorted as follows: the frost-free period, the annual mean temperature, the accumulated temperature above 10 °C, the March–October mean temperature, the June–August mean temperature, the July relative humidity, the annual extreme low temperature, the annual difference in temperature, the annual average precipitation, the June–August precipitation, the mid-January mean temperature, and the coldest month mean temperature.
Grey correlation analysis can be employed to obtain the correlation ranking among variables, but it is not easy to judge the promoting or inhibiting relationship between variables. Therefore, key factors are initially screened by a grey relational analysis. Generally, two to four variables are selected as key variables [54,55]. The low threshold is set by selecting eight variables as candidates for key meteorological factors based on the initial screening. Thus, the meteorological factors with a correlation coefficient greater than 0.62 are defined as the key meteorological factors. The preliminary screening results include the frost-free period, the annual mean temperature, the accumulated temperature above 10 °C, the March–October mean temperature, the June–August mean temperature, the July relative humidity, the annual extreme low temperature, and the annual difference in temperature. Producers should pay more attention to the changes in the key meteorological factors, timely adjust the field’s microclimate, and formulate a reasonable fruit-tree management plan to prevent the dramatic decline of apple yield.

3.3. Construction of Meteorological Yield Forecast Model

SPSS 28.0 was applied for the collinearity test, principal component analysis, and multiple linear regression. The correlation and scatter diagrams were conducted with R version 4.2.2 using the packages dplyr, tidyverse, and ggplot2 for visualization of data in plots and heatmaps [56,57]. The results of the correlation analysis are shown in Figure 4.
Figure 4 demonstrates that, under the two-tailed test conditions, the meteorological yield was significantly correlated with the frost-free period and the accumulated temperature above 10 °C. Additionally, the accumulated temperature above 10 °C had an extremely significant inhibitory effect on the meteorological yield of apples (p < 0.01), and the frost-free period had a significant inhibitory effect on the meteorological yield of apples (p < 0.05). There were certain correlations among different meteorological factors. For example, the annual mean temperature was extremely significantly positively correlated with the March–October mean temperature, the annual extreme low temperature, and the coldest month mean temperature (p < 0.01), while the annual mean temperature was extremely significantly negatively correlated with the annual difference in temperature and the annual average precipitation (p < 0.01). The annual difference in temperature was extremely significantly negatively correlated with the coldest month mean temperature, the mid-January mean temperature, the annual mean temperature, and the annual extreme low temperature (p < 0.01), while the annual difference in temperature was significantly positive correlated with the annual average precipitation, the June–August precipitation, and the June–August mean temperature (p < 0.05). The coldest month mean temperature was extremely significantly positively correlated with the mid-January mean temperature, the annual mean temperature, and the annual extreme low temperature (p < 0.001). The correlation analysis shows that the correlation between meteorological factors was strong, but the correlation between meteorological factors and meteorological yield was not high. The multiple regression equation and variance analysis are shown in Equation (17) and Table 2.
Y c = 0.58 A A T 4.47 A R 2.16 M A T T T 4.48 L A M T 0.01 M A P 0.03 F F P 0.01 A T A 10 0.34 A T M J + 0.54 A M T + 4.09 M A T S E + 0.05 M T P S E + 0.22 R H J + 86.12
It can be remarked from the results of variance analysis in Table 2 that Significance F value is greater than 0.05, which indicates that the multiple regression equation was not significant and could not be used for prediction. This phenomenon is probably caused by the serious collinearity between meteorological factors. Therefore, the collinearity diagnosis of meteorological factors needs to be checked. The collinearity diagnosis result is shown in Table 3.
Table 3 shows the V I F values of different meteorological factors were greater than 10, and the V I F values of the annual difference in temperature indicators were far more than 100, indicating that there is serious multicollinearity among different meteorological factors in Yantai. The meteorological factors in the meteorological system will interact and influence each other, so that one meteorological factor can be calculated from other meteorological factors. For example, the annual difference in temperature is obtained by subtracting the mean temperature of the coldest month from the mean temperature of the hottest month in a year. Interaction may be the reason for the serious collinearity of meteorological factors in Yantai City. The direct establishment of regression models will cause serious information coverage problems making the original data unable to be fully utilized. Because of the serious collinearity between meteorological factors in Yantai City, the established multiple regression model is invalid. A principal component analysis reduces the dimensionality of multivariate factors, extracts most of the information of the original data, and reduces the coverage between the information. Therefore, a principal component analysis can solve the problem.
The principal component analysis model was established, and the Bartlett sphericity test was performed on the standardized correlation coefficient matrix. The significance was 0.00 < 0.05; thus, the original hypothesis of Bartlett’s sphericity test was rejected, and the principal component analysis could be performed. The meteorological yield and twelve meteorological factors were analyzed using a principal component analysis. The results are shown in Table 4.
Table 4 demonstrates that the cumulative contribution of the first five principal components reached 90.076%, which meets the requirement of a cumulative contribution of >85% for a principal component analysis. It indicates that most of the information in the original data has been extracted at this point. For its principal components, only five principal components were needed to represent the twelve indicators, and the contribution of each variable to the principal components constitutes the principal component matrix, as shown in Table 5.
As shown in Table 5, the principal component F 1 is composed of the annual mean temperature, the annual difference in temperature, the coldest month mean temperature, and the annual extreme low temperature. The principal component F 2  is composed of the June–August mean temperature, the March–October mean temperature, and the accumulated temperature above 10 °C. The principal component F 3 is composed of the July relative humidity. The principal component F 4 is composed of the annual average precipitation and the June–August precipitation. The principal component F 5 is composed of the mid-January mean temperature. The principal components F 1 F 5 can be named as temperature factor, growing season heat factor, humidity factor, precipitation factor, and cooling factor, respectively. The principal component F 1 F 5 can be expressed as:
{ F 1 = 1 4.255 ( 0.804 Z A A T 0.818 Z A R + 0.48 Z M A T T T + 0.918 Z L A M T 0.8 Z M A P + 0.638 Z F F P 0.001 Z A T A 10 + 0.684 Z A T M J + 0.799 Z A M T 0.06 Z M A T S E 0.786 Z M T P S E 0.367 Z R H J ) F 2 = 1 2.337 ( 0.381 Z A A T + 0.45 Z A R + 0.776 Z M A T T T 0.163 Z L A M T 0.09 Z M A P + 0.304 Z F F P + 0.69 Z A T A 10 0.279 Z A T M J 0.135 Z A M T + 0.867 Z M A T S E + 0.027 Z M T P S E + 0.149 Z R H J ) F 3 = 1 1.951 ( 0.208 Z A A T + 0.023 Z A R + 0.21 Z M A T T T + 0.104 Z L A M T + 0.01 Z M A P 0.447 Z F F P 0.561 Z A T A 10 + 0.274 Z A T M J + 0.132 Z A M T + 0.3 Z M A T S E + 0.034 Z M T P S E + 0.82 Z R H J ) F 4 = 1 1.608 ( 0.219 Z A A T 0.28 Z A R + 0.169 Z M A T T T + 0.254 Z L A M T + 0.538 Z M A P 0.199 Z F F P + 0.246 Z A T A 10 + 0.018 Z A T M J + 0.284 Z A M T 0.071 Z M A T S E + 0.571 Z M T P S E 0.12 Z R H J ) F 5 = 1 0.659 ( 0.175 Z A A T + 0.003 Z A R 0.099 Z M A T T T + 0.123 Z L A M T + 0.081 Z M A P + 0.189 Z F F P + 0.246 Z A T A 10 + 0.555 Z A T M J + 0.28 Z A M T 0.037 Z M A T S E + 0.089 Z M T P S E + 0.168 Z R H J )
The principal components F 1 F 5 were taken as independent variables, and the method of multiple linear regression was adopted to perform an ordinary least squares (OLS) analysis on meteorological yield after the normalization of dependent variables. The relationships between principal components F 1 F 5 and normalized meteorological yield are shown in Figure 5.
Figure 5 demonstrates that with the increase of the independent variables F 1 and F 3 , the value of Z Y c increases continuously; F 2 ,   F 4 , and F 5 with the opposite trend of change, indicating that Z Y c and F 2 ,   F 4 ,   and   F 5 have a negative correlation coefficient, and Z Y c has a positive correlation coefficient with F 1 and F 3 .
According to the standardized residual analysis (Figure 6), the standardized residual follows the mean value of −1.73 × 10−16 and the normal distribution with a standard deviation of 0.894, which can be approximated following a normal distribution with a mean of zero and a standard deviation of one. The residual distribution satisfies the applicable range of multiple linear regression. Meanwhile, the value of the Durbin–Watson statistic is closer to two and there is less correlation between the residual terms. The value of the Durbin–Watson statistic of the model constructed in this paper is 1.658, which indicates that the residual terms are not correlated.
According to the above analysis, the relationship between the dependent variable Z Y c and the independent variables F 1 F 5 is linear. The independent variables F 1 F 5 are not random. Moreover, there is no exact linear relationship between two or more independent variables due to the extraction of the PCA. The expected value of the residual in terms of the independent variable is 0: E ( ε | F 1 , F 2 ,   F 3 , F 4 , F 5 ) = 0 . The variance of the residual term is the same for all observations: E   ( ε i 2 ) = σ 2 . The residual term is not correlated between the observed values: E   ( ε i ε j ) = 0 ,   j i . The residual term is normally distributed. All the basic assumptions of the multiple linear regression model are consistent. Therefore, the MLR–PCA was established and the backward stepwise regression analysis results are shown in Table 6.
Table 6 shows that the V I F values of meteorological yield after the normalization of the principal components F 1 F 5 and the dependent variable are all less than 10, indicating that the principal component analysis solved the interference of multicollinearity on the prediction model. The principal components F 2 ,   F 4 , and F 5 are negatively correlated with the meteorological yield, which means the higher the growing season heat factor, precipitation factor, and cooling factor are, the lower the meteorological yield is. The principal components F 1 and F 3 are positively correlated with meteorological yield, which means that the higher temperature factor and humidity factor are, the higher the meteorological yield is. The regression coefficient is consistent with the results observed in Figure 5. The expression of multiple linear regression is:
Z Y c = 0.108 F 1 0.354 F 2 + 0.268 F 3 0.597 F 4 0.313 F 5
According to Equations (18) and (19), the standardized linear equation can be expressed as:
Z Y c = 0.068 Z A A T 0.075 Z A R 0.236 Z M A T T T 0.005 Z L A M T 0.398 Z M A P 0.099 Z F F P 0.619 Z A T A 10 + 0.062 Z A T M J + 0.088 Z A M T 0.202 Z M A T S E 0.454 Z M T P S E + 0.146 Z R H J
According to Equation (20), the coefficients of the accumulated temperature above 10 °C, the June–August precipitation, the annual average precipitation, the June–August mean temperature, and the March–October mean temperature were large. Combined with the conclusion in Section 3.2 “Screening of Key Meteorological Factors”, the key meteorological factors were the accumulated temperatures above 10 °C, the March–October mean temperature, and the June–August mean temperature.
The standardized linear equation was destandardized and the expression of the linear equation was obtained as:
Y c = 0.541 A A T 0.172 A R 2.019 M A T T T 0.015 L A M T 0.012 M A P 0.034 F F P 0.015 A T A 10 + 0.126 A T M J + 0.180 A M T 1.440 M A T S E 0.048 M T P S E + 0.145 R H J + 160.088
Using Equation (19), in terms of the temperature factor, the whole period of the increase of the temperature factor on apple meteorological yield does not have a significantly positive effect. This result proves that there is not a simple linear relationship between the temperature factor and the meteorological yield in Yantai, but there may be a complex nonlinear interaction. In terms of the growing season heat factor, there is a significantly negative correlation (p < 0.05) between the growing season heat factor and apple’s meteorological yield. This result proves that the heat supply in the growing period of Yantai may exceed the heat required for the development of apples. High temperatures can easily cause thermal damage, resulting in cell dehydration, affecting the physiological metabolic activities of plants and reducing the yield. In terms of the humidity factor, there is a positive correlation (p < 0.1) between the humidity factor and apple’s meteorological yield. This shows that Yantai summer humidity may not meet the needs of apple growth. In summer, when the humidity is low and the temperature is high, the stomata on the leaves of plants will close to keep the water in the plant. The closure of stomata will make the leaves unable to capture carbon dioxide, leading to starvation. In terms of precipitation factor, the precipitation factor on apple meteorological yield over the whole period has an extremely significantly negative effect (p < 0.001). This means that the precipitation in Yantai exceeds the amount needed for apple growth. Excessive precipitation may cause waterlogging, and the surge in soil water content may cause anoxia, which will cause a series of hazards. In terms of cooling factor, the cooling factor on apple meteorological yield over the whole period has an extremely significantly negative effect (p < 0.01). The result indicates that low value of cooling, especially in winter, significantly reduces apple’s yield. When apple trees cannot meet the cooling requirements for releasing dormancy, forcibly breaking dormancy will greatly reduce their flowering and fruit setting rates and affect the apple yield.

3.4. Establishment and Evaluation of Comparative Models for Forecasting Meteorological Yield

Lasso regression avoids the distortion of the regression results caused by multicollinearity and endogeneity between variables in the regression process by rapidly compressing the coefficients of nonimportant explanatory variables to zero. We chose to compare the fitting effect and prediction results of lasso regression and the MLR–PCA model constructed in this paper and evaluate whether the prediction accuracy was improved by the MLR–PCA model. Lasso regression was conducted with R version 4.2.2 using the packages glmnet for visualization of data (Figure 7).
The lambda for which we choose to cross-validate the model with the smallest mean error is denoted as lambda.min. According to Figure 7b, the value of lambda.min is 1.00, and the corresponding number of important explanatory variables is 3. Combining with Figure 7a, the important explanatory variables were the frost-free period, the accumulated temperature above 10 °C, and the June–August mean temperature. In order to compare the fitting effect and significance of the model [58], the multiple linear regression model was established with important explanatory variables as independent variables and meteorological yield as dependent variables.
The goodness-of-fit and significance of the two models were compared, including MLR–PCA and lasso regression. The results are shown in Table 7. It can be seen from Table 7 that the regression effect of the MLR–PCA model and the lasso regression model is extremely significant (p < 0.01). In addition, the goodness-of-fit of MLR–PCA is higher than that of the lasso regression model.
In order to verify whether MLR–PCA analysis can effectively improve the model accuracy, the relevant data from Yantai City in 2020 were used [40], the meteorological yield forecasts of apple from the MLR–PCA model and the lasso regression model were calculated and combined with the harmonic weight trend yield forecast method [48,49,50]. The predicted value of apple yield was obtained, and the yield prediction effects of the three models are shown in Table 8.
By comparing the methods, the predicted value of the MLR–PCA model was 47.256 t·ha−1, compared with the actual yield of 44.128 t·ha−1. The relative error was 7.089%, which was the smallest among the selected models, indicating that the MLR–PCA model is superior and accurate in forecasting.

4. Conclusions

In this work, the influence of different meteorological factors on the apple yield in Yantai was investigated numerically and experimentally, and the weights of the influence of different meteorological factors were analyzed. The MLR–PCA model and prediction results were validated by yearbook data. The main scientific and valuable conclusions are as follows:
(1)
Apple yield is in a stable growth stage. The trend yield is the decisive factor affecting apple yield, and the increase of trend yield drives the increase of apple yield. Meteorological output has been decreasing annually since 2014, which inhibits the growth of apple yield.
(2)
An accumulated temperature above 10 °C has extremely significant inhibitory effects on the meteorological yield of apples (p < 0.01) and the frost-free period has a significant inhibitory effect on the meteorological yield of apples (p < 0.05). There are certain correlations among different meteorological factors. Among them, the accumulated temperature above 10 °C, the March–October mean temperature, and the June–August mean temperature are the key meteorological factors affecting the meteorological yield of apples.
(3)
The MLR–PCA method for dimension reduction includes 12 indicators from five separate comprehensive meteorological factor indicators which are: temperature factor, growing season heat factor, humidity factor, precipitation factor, and cooling factor. The precipitation factor regression coefficient has the highest absolute value, which proves that precipitation factor is the key indicator that most affects meteorological yield. By parity of reasoning, in the growing season, heat factor is the second, cooling factor is the third, humidity factor is the fourth, and temperature factor is the lowest key indicator affecting meteorological yield. The results demonstrate that precipitation factor, growing season heat factor, and cooling factor are beyond the optimal adaptation range of apples, which results in the decrease of apple yield. For each 1% increase of the three factors apple yield is reduced by 0.597%, 0.354%, and 0.313%, respectively. However, humidity factor and temperature factor do not meet the demand of apples, and each 1% increase of the humidity factor and temperature factor increases apple yield by 0.268% and 0.108%, respectively.
(4)
The MLR–PCA and lasso regression methods were used to make predictions and comparisons. The apple yield predicted by the MLR–PCA model was 47.256 t·ha−1, compared with the actual yield of 44.128 t·ha−1. The relative error was only 7.089%. The prediction effect was better than other models. This work provides theoretical support for the future prediction of apple yields.

Author Contributions

Conceptualization, X.H.; methodology, X.H. and L.C.; validation, X.H. and L.C.; formal analysis, X.H. and N.W.; data curation, X.H. and C.W.; writing—original draft preparation, X.H.; supervision, W.K. and C.W. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Innovation Training Program for College Students (202110019088), the Yantai School-Land Integrated Development Project (2019XDRHXMPT30), and the URP Project of China Agricultural University (U2021057).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The datasets generated and analyzed in the current study are publicly available and are also available from the corresponding author upon reasonable request.

Acknowledgments

The authors would like to thank the Yantai Bureau of Statistics for collecting the apple production dataset. The authors are grateful for the comments from anonymous reviewers and the editors.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Trnka, M.; Olesen, J.E.; Kersebaum, K.C.; Skjelvåg, A.O.; Eitzinger, J.; Seguin, B.; Peltonen-Sainio, P.; Rötter, R.; Iglesias, A.; Orlandini, S.; et al. Agroclimatic Conditions in Europe under Climate Change. Glob. Change Biol. 2011, 17, 2298–2318. [Google Scholar] [CrossRef] [Green Version]
  2. Zhao, J.; Li, K.; Wang, R.; Tong, Z.; Zhang, J. Yield Data Provide New Insight into the Dynamic Evaluation of Maize’s Climate Suitability: A Case Study in Jilin Province, China. Atmosphere 2019, 10, 305. [Google Scholar] [CrossRef] [Green Version]
  3. Ritchie, H.; Roser, M. Agricultural Production. Our World in Data. Available online: https://ourworldindata.org/agricultural-production (accessed on 22 July 2022).
  4. The State Statistical Bureau. China Statistical Yearbook. China Statistics Press. 2021. Available online: http://www.stats.gov.cn/tjsj/ndsj/2021/indexch.htm (accessed on 8 August 2022).
  5. Zhang, Q.; Zhou, B.; Li, M.; Wei, Q.; Han, Z. Multivariate Analysis between Meteorological Factor and Fruit Quality of Fuji Apple at Different Locations in China. J. Integr. Agric. 2018, 17, 1338–1347. [Google Scholar] [CrossRef]
  6. Benlloch-González, M.; Sánchez-Lucas, R.; Bejaoui, M.A.; Benlloch, M.; Fernández-Escobar, R. Global Warming Effects on Yield and Fruit Maturation of Olive Trees Growing under Field Conditions. Sci. Hortic. 2019, 249, 162–167. [Google Scholar] [CrossRef]
  7. Zhu, Y.; Yang, G.; Yang, H.; Zhao, F.; Han, S.; Chen, R.; Zhang, C.; Yang, X.; Liu, M.; Cheng, J.; et al. Estimation of Apple Flowering Frost Loss for Fruit Yield Based on Gridded Meteorological and Remote Sensing Data in Luochuan, Shaanxi Province, China. Remote Sens. 2021, 13, 1630. [Google Scholar] [CrossRef]
  8. Cui, C.; Liu, Y.; Liu, B.; Sun, Y.; Yang, F.; Zhang, X.; Qiu, M. ENSO events impacts on apple production in Shandong I: A study of changes on apple climatic resources and yields under different senarios. Chin. J. Eco-Agric. 2022, 31, 1–16. (In Chinese) [Google Scholar]
  9. Lecerf, R.; Ceglar, A.; López-Lozano, R.; Van Der Velde, M.; Baruth, B. Assessing the Information in Crop Model and Meteorological Indicators to Forecast Crop Yield over Europe. Agr. Syst. 2019, 168, 191–202. [Google Scholar] [CrossRef]
  10. Li, M.; Guo, J.; Xu, C.; Lei, Y.; Li, J. Identifying Climatic Factors and Circulation Indices Related to Apple Yield Variation in Main Production Areas of China. Glob. Ecol. Conserv. 2018, 16, e00478. [Google Scholar] [CrossRef]
  11. Sen, V.; Rana, R.S.; Chauhan, R.C.; Aditya. Impact of Climate Variability on Apple Production and Diversity in Kullu Valley, Himachal Pradesh. Ind. Jour. Hort. 2015, 72, 14. [Google Scholar] [CrossRef] [Green Version]
  12. Seo, S.N.; Mendelsohn, R. An Analysis of Crop Choice: Adapting to Climate Change in South American Farms. Ecol. Econ. 2008, 67, 109–116. [Google Scholar] [CrossRef]
  13. Luedeling, E.; Brown, P.H. A Global Analysis of the Comparability of Winter Chill Models for Fruit and Nut Trees. Int. J. Biometeorol. 2011, 55, 411–421. [Google Scholar] [CrossRef]
  14. Fujisawa, M.; Kobayashi, K. Climate Change Adaptation Practices of Apple Growers in Nagano, Japan. Mitig. Adapt. Strateg. Glob. Change. 2011, 16, 865–877. [Google Scholar] [CrossRef]
  15. Ramirez, F.; Kallarackal, J. Responses of Fruit Trees to Global Climate Change; Springer International Publishing: Cham, Switzerland, 2015; pp. 11–37. ISBN 978-3-319-14199-2. [Google Scholar]
  16. Solomon, S.; Qin, D.; Manning, M.; Averyt, K.; Marquis, M. Climate Change 2007: The Physical Science Basis: Contribution of Working Group I to the Fourth Assessment Report of the Intergovernmental Panel on Climate Change. In Intergovernmental Panel on Climate Change; Cambridge University Press: Cambridge, UK, 2007; pp. 3, 8–52. ISBN 978-0-521-88009-1. [Google Scholar]
  17. Unterberger, C.; Brunner, L.; Nabernegg, S.; Steininger, K.; Steiner, A.; Stabentheiner, E.; Monschein, S.; Truhetz, H. Spring Frost Risk for Regional Apple Production under a Warmer Climate. PLoS ONE 2018, 13, e0200201. [Google Scholar] [CrossRef]
  18. Delgado, A.; Egea, J.A.; Luedeling, E.; Dapena, E. Agroclimatic Requirements and Phenological Responses to Climate Change of Local Apple Cultivars in Northwestern Spain. Sci. Hortic. 2021, 283, 110093. [Google Scholar] [CrossRef]
  19. Funes, I.; Aranda, X.; Biel, C.; Carbó, J.; Camps, F.; Molina, A.J.; de Herralde, F.; Grau, B.; Savé, R. Future Climate Change Impacts on Apple Flowering Date in a Mediterranean Subbasin. Agr. Water Manage. 2016, 164, 19–27. [Google Scholar] [CrossRef]
  20. Che, X.; Zhang, T.; Ren, J.; Lu, X.; Zhang, M.; Zhang, B. Impact of climate resources change on apple production in Longdong area. Jiangsu Agric. Sci. 2020, 48, 128–133. (In Chinese) [Google Scholar] [CrossRef]
  21. Jing, H.; Yang, H.; Zhao, H.; Meng, Y. Early Prediction of Apple Yield Based on Meteorological Data in Young Fruit Stage:An Example of Yuncheng City. Chin. Agric. Sci. Bull. 2021, 37, 132–136. (In Chinese) [Google Scholar]
  22. Qu, Z.; Zheng, X.; Liu, L.; Zhang, Y. Relationship Between Apple Quality and Meteorological Factors in Different Ecological Regions in Shaanxi. Meteorol. Mon. 2017, 43, 872–878. (In Chinese) [Google Scholar]
  23. Demestihas, C.; Plénet, D.; Génard, M.; Raynal, C.; Lescourret, F. A Simulation Study of Synergies and Tradeoffs between Multiple Ecosystem Services in Apple Orchards. J. Environ. Manage. 2019, 236, 1–16. [Google Scholar] [CrossRef] [PubMed]
  24. Yao, X.; Li, T.; Ma, J.; Yuan, B.; Deng, Z. Huaniu Apple in Southeast Gansu:The Correlation Between Yield and Quality and Meteorological Factors. Chin. Agric. Sci. Bull. 2018, 34, 108–112. (In Chinese) [Google Scholar]
  25. Fischer, R.A.; Moreno Ramos, O.H.; Ortiz Monasterio, I.; Sayre, K.D. Yield Response to Plant Density, Row Spacing and Raised Beds in Low Latitude Spring Wheat with Ample Soil Resources: An Update. Field Crop Res. 2019, 232, 95–105. [Google Scholar] [CrossRef]
  26. Leng, G.; Huang, M. Crop Yield Response to Climate Change Varies with Crop Spatial Distribution Pattern. Sci. Rep. 2017, 7, 1463. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  27. Kern, A.; Barcza, Z.; Marjanović, H.; Árendás, T.; Fodor, N.; Bónis, P.; Bognár, P.; Lichtenberger, J. Statistical modelling of crop yield in Central Europe using climate data and remote sensing vegetation indices. Agr. Forest. Meteorol. 2018, 260–261, 300–320. [Google Scholar] [CrossRef]
  28. Ramezani Etedali, H.; Ahmadi, M. Evaluation of Various Meteorological Datasets in Estimation Yield and Actual Evapotranspiration of Wheat and Maize (Case Study: Qazvin Plain). Agr. Water Manage. 2021, 256, 107080. [Google Scholar] [CrossRef]
  29. Rosenzweig, C.; Phillips, J.; Goldberg, R.; Carroll, J.; Hodges, T. Potential Impacts of Climate Change on Citrus and Potato Production in the US. Agr. Syst. 1996, 52, 455–479. [Google Scholar] [CrossRef]
  30. Boote, K.J.; Rybak, M.R.; Scholberg, J.M.S.; Jones, J.W. Improving the CROPGRO-Tomato Model for Predicting Growth and Yield Response to Temperature. Hortscience 2012, 47, 1038–1049. [Google Scholar] [CrossRef]
  31. Feng, P.; Wang, B.; Liu, D.L.; Waters, C.; Yu, Q. Incorporating Machine Learning with Biophysical Model Can Improve the Evaluation of Climate Extremes Impacts on Wheat Yield in South-Eastern Australia. Agr. For. Meteorol. 2019, 275, 100–113. [Google Scholar] [CrossRef]
  32. Challinor, A.J.; Wheeler, T.R.; Craufurd, P.Q.; Ferro, C.A.T.; Stephenson, D.B. Adaptation of Crops to Climate Change through Genotypic Responses to Mean and Extreme Temperatures. Agric. Ecosyst. Environ. 2007, 11, 190–204. [Google Scholar] [CrossRef] [Green Version]
  33. López-Morales, J.A.; Martínez, J.A.; Caro, M.; Erena, M.; Skarmeta, A.F. Climate-Aware and IoT-Enabled Selection of the Most Suitable Stone Fruit Tree Variety. Sensors 2021, 21, 3867. [Google Scholar] [CrossRef]
  34. Rosbakh, S.; Hartig, F.; Sandanov, D.V.; Bukharova, E.V.; Miller, T.K.; Primack, R.B. Siberian Plants Shift Their Phenology in Response to Climate Change. Glob. Change Biol. 2021, 27, 4435–4448. [Google Scholar] [CrossRef]
  35. Shandong Provincial Bureau of Statistics, Shandong Survey Team National Bureau of Statistics. Shandong Statistical Yearbook. China Statistics Press. 2021. Available online: http://tjj.shandong.gov.cn/tjnj/nj2021/zk/indexce.htm (accessed on 8 August 2022).
  36. Notice on the Identification of Agricultural Products with Chinese Characteristics Advantage Zones (the Third Batch). Available online: http://www.moa.gov.cn/govpublic/SCYJJXXS/202002/t20200226_6337781.html (accessed on 8 August 2022).
  37. Jiaodong Online. Brand Value Broke 15 billion! Yantai Apple a Century of the Rise of the Way! Available online: http://www.yantai.gov.cn/art/2022/3/23/art_41950_2972297.html (accessed on 8 August 2022).
  38. Yantai Local History Compilation Committee. Yantai Zhijun. China Popular Science Press. 1994. Available online: https://book.douban.com/subject/26994605/ (accessed on 8 August 2022).
  39. Ge, W. Geochemical Environment of Soil and Suitability Evaluation of High-quality Apple Production in Yantai Area, Shandong Province. PhD Thesis, China University of Geosciences, Wuhan, China, 2013. (In Chinese). [Google Scholar]
  40. Shandong Bureau of Statistics. Shandong Statistical Yearbook; 2000–2021. Available online: https://data.cnki.net/area/Yearbook/Single/N2022010255?dcode=D15 (accessed on 15 December 2022).
  41. Zhang, Q.; Li, M.; Zhou, B.; Zhang, J.; Wei, Q. Analysis and Optimization of the Effects of Meteorological Factors on ‘Fuji’ Fruit Quality in Two Dominant Production Regions of China. HortScience 2021, 56, 71–78. [Google Scholar] [CrossRef]
  42. Wang, B.; Bo, Y.; Jiang, R.; Ma, D.; Wang, N.; Chen, X.; Ding, X. Climatic regionalization of apple cultivation in Jiaodong Peninsula based on cluster analysis. Jiangsu Agric. Sci. 2019, 47, 134–138. [Google Scholar] [CrossRef]
  43. Zhang, L.; Si, J.; Jiapaer, G.; Zhang, T.; Mao, W.; Dong, S. Spatial Homogenization Adjustment and Application of Weather Station Networks in Xinjiang, China. Atmosphere 2022, 13, 1840. [Google Scholar] [CrossRef]
  44. Stewart, B.A.; Thapa, S.; Xue, Q.; Shrestha, R. Climate change effect on winter wheat (Triticum aestivum L.) yields in the US Great Plains. J. Soil Water Conserv. 2018, 73, 601–609. [Google Scholar] [CrossRef]
  45. Sjulgård, H.; Colombi, T.; Keller, T. Spatiotemporal patterns of crop diversity reveal potential for diversification in Swedish agriculture. Agric. Ecosyst. Environ. 2022, 336, 108046. [Google Scholar] [CrossRef]
  46. Granato, D.; Santos, J.S.; Escher, G.B.; Ferreira, B.L.; Maggio, R.M. Use of Principal Component Analysis (PCA) and Hierarchical Cluster Analysis (HCA) for Multivariate Association between Bioactive Compounds and Functional Properties in Foods: A Critical Perspective. Trends Food Sci. Technol. 2018, 72, 83–90. [Google Scholar] [CrossRef]
  47. Jiang, X.; Zhao, Y.; Tong, L.; Wang, R.; Zhao, S. Quantitative Analysis of Tomato Yield and Comprehensive Fruit Quality in Response to Deficit Irrigation at Different Growth Stages. HortScience 2019, 54, 1409–1417. [Google Scholar] [CrossRef] [Green Version]
  48. Wang, S. A Statistical Forecasting Method of Crop Yield. Acta Meteorol. Sin. 1984, 3, 349–355. (In Chinese) [Google Scholar] [CrossRef]
  49. Пoлевoй, А.Н. Динамикo-статистические метoды прoгнoзирoвания урoжайнoсти сельскoхoзяйственных культур. Метеoрoлoгия Гидрoлoгия 1981, 2, 92–102. (In Russian) [Google Scholar]
  50. Kleshenko, A.D.; Lebedeva, V.M.; Naidina, T.A.; Savitskaya, O.V. MODIS satellite data usage in operational agrometeorology. Sovrem. Probl. Distantsionnogo Zondirovaniya Zemli Iz Kosm. 2015, 12, 143–154. (In Russian) [Google Scholar]
  51. Kumar, S.; Attri, S.D.; Singh, K.K. Comparison of Lasso and stepwise regression technique for wheat yield prediction. J. Agrometeorol. 2021, 21, 188–192. [Google Scholar] [CrossRef]
  52. Schreiber, J.B. Issues and Recommendations for Exploratory Factor Analysis and Principal Component Analysis. Res. Soc. Adm. Pharm. 2021, 17, 1004–1011. [Google Scholar] [CrossRef] [PubMed]
  53. Shen, S.; Yang, J.; Gong, Z. Grey correlation analysis of climate factors affecting apple yield and yield prediction in Shanxi. J. South. Agric. 2016, 47, 1146–1154. (In Chinese) [Google Scholar]
  54. Zhou, W.; Yan, F.; Fan, Y.; Zhang, C.; Yang, F.; Liu, Q.; Wang, T.; Li, B.; Chen, Y.; Deng, F.; et al. Accumulated temperature and allocation of photo-thermal resources are key meteorological factors affecting rice cadmium content. Eur. J. Agron. 2023, 142, 126674. [Google Scholar] [CrossRef]
  55. Zhao, L.; Zhao, X.; Pan, X.; Shi, Y.; Qiu, Z.; Li, X.; Xing, X.; Bai, J. Prediction of daily reference crop evapotranspiration in different Chinese climate zones: Combined application of key meteorological factors and Elman algorithm. J. Hydrol. 2022, 610, 127822. [Google Scholar] [CrossRef]
  56. Wickham, H. Data Analysis. In ggplot2: Elegant Graphics for Data Analysis; Wickham, H., Ed.; Springer International Publishing: Cham, Switzerland, 2016; pp. 189–201. ISBN 978-3-319-24277-4. [Google Scholar]
  57. Xie, M.; Li, Z.; Li, X.; Ai, L.; Jin, M.; Jia, N.; Yang, Y.; Li, W.; Xue, F.; Zhang, M.; et al. Identifying crucial biomarkers in peripheral blood of schizophrenia and screening therapeutic agents by comprehensive bioinformatics analysis. J. Psychiatr. Res. 2022, 152, 86–96. [Google Scholar] [CrossRef]
  58. Belloni, A.; Chernozhukov, V. Least squares after model selection in high-dimensional sparse models. Bernoulli 2013, 19, 521–547. [Google Scholar] [CrossRef]
Figure 1. A map showing the location of Yantai city in Shandong Province, China (coordinates: 119°34′–121°57′ E longitude and 36°16′–38°23′ N latitude).
Figure 1. A map showing the location of Yantai city in Shandong Province, China (coordinates: 119°34′–121°57′ E longitude and 36°16′–38°23′ N latitude).
Atmosphere 14 00183 g001
Figure 2. Actual, trend and meteorological yield of Yantai apples from 1999 to 2019 (the orange line in the figure where the meteorological yield is equal to 0 t·ha−1 is the dividing line where the meteorological yield plays a facilitating or inhibiting role in the actual yield. If it is higher than the orange line, it plays a facilitating role in the actual yield; conversely, if it is lower than the orange line it plays an inhibiting role in the actual yield.
Figure 2. Actual, trend and meteorological yield of Yantai apples from 1999 to 2019 (the orange line in the figure where the meteorological yield is equal to 0 t·ha−1 is the dividing line where the meteorological yield plays a facilitating or inhibiting role in the actual yield. If it is higher than the orange line, it plays a facilitating role in the actual yield; conversely, if it is lower than the orange line it plays an inhibiting role in the actual yield.
Atmosphere 14 00183 g002
Figure 3. The grey correlation coefficient between twelve meteorological factors and yield (variables in the figure present twelve meteorological factors after standardized treatment).
Figure 3. The grey correlation coefficient between twelve meteorological factors and yield (variables in the figure present twelve meteorological factors after standardized treatment).
Atmosphere 14 00183 g003
Figure 4. Correlation coefficient matrix heatmap and significance analysis of twelve meteorological factors and yield (*: p < 0.05 **: p < 0.01 ***: p < 0.001).
Figure 4. Correlation coefficient matrix heatmap and significance analysis of twelve meteorological factors and yield (*: p < 0.05 **: p < 0.01 ***: p < 0.001).
Atmosphere 14 00183 g004
Figure 5. Relationships between normalized meteorological yield and principal components F 1 F 5 .
Figure 5. Relationships between normalized meteorological yield and principal components F 1 F 5 .
Atmosphere 14 00183 g005aAtmosphere 14 00183 g005b
Figure 6. The standardized residual analysis.
Figure 6. The standardized residual analysis.
Atmosphere 14 00183 g006
Figure 7. Lasso regression adjustment parameter process: (a) coefficient plot (b) parameter plot.
Figure 7. Lasso regression adjustment parameter process: (a) coefficient plot (b) parameter plot.
Atmosphere 14 00183 g007
Table 1. The description of the meteorological indicators employed in this study.
Table 1. The description of the meteorological indicators employed in this study.
Variable NameUnitNotation
Annual mean temperature°CAAT
The annual difference in temperature°CAR
March–October mean temperature°CMATTT
The coldest month mean temperature°CLAMT
Annual average precipitationmmMAP
The frost-free perioddFFP
The accumulated temperature above 10 °C°C/aATA10
Mid-January mean temperature°CATMJ
Annual extreme low temperature°CAMT
June–August mean temperature°CMATSE
June–August precipitationmmMTPSE
July relative humidity%RHJ
Table 2. Analysis of variance between meteorological yield and 12 meteorological factors.
Table 2. Analysis of variance between meteorological yield and 12 meteorological factors.
Analysis of VariancedfSSMSFSignificance F
Regression Analysis12298.22524.8522.1060.148
Residual894.38811.799
Total20392.613
Table 3. Diagnosis of covariance between different meteorological factors.
Table 3. Diagnosis of covariance between different meteorological factors.
Variables V I F ToleranceVariables V I F   Tolerance
AAT9.0110.111ATA107.8270.128
AR109.2300.009ATMJ3.3600.298
MATTT10.3150.097AMT15.8310.063
LAMT46.5320.021MATSE40.9780.024
MAP21.3300.047MTPSE29.2260.034
FFP2.9750.336RHJ4.9930.200
Table 4. The sum of squared loadings and eigenvalues extracted by the principal component analysis.
Table 4. The sum of squared loadings and eigenvalues extracted by the principal component analysis.
F i EigenvalueExtraction Sums of Squared LoadingsRotation Sums of Squared Loadings
EigenvalueVariance Contribution/%EigenvalueVariance Contribution/%EigenvalueVariance Contribution/%Cumulative Variance Contribution/%
15.29944.1565.29944.1564.25535.45635.456
22.42420.1972.42420.1972.33719.47454.929
31.46912.2411.46912.2411.95116.25671.185
41.0358.6261.0358.6261.60813.40384.587
50.5834.8560.5834.8560.6595.48890.076
60.4713.924
70.3532.944
80.1851.542
90.0850.711
100.0570.472
110.0350.295
120.0040.036
Table 5. Load matrix of different meteorological factors.
Table 5. Load matrix of different meteorological factors.
VariablesComponent
F 1   F 2   F 3   F 4   F 5  
ZAAT0.8040.3810.2080.219−0.175
ZAR−0.8180.450.023−0.280.003
ZMATTT0.480.7760.210.169−0.099
ZLAMT0.918−0.1630.1040.2540.123
ZMAP−0.8−0.090.010.5380.081
ZFFP0.6380.304−0.447−0.1990.189
ZATA10−0.0010.69−0.5610.2460.246
ZATMJ0.684−0.2790.2740.0180.555
ZAMT0.799−0.1350.1320.284−0.28
ZMATSE−0.060.8670.3−0.0710.037
ZMTPSE−0.7860.0270.0340.5710.089
ZRHJ−0.3670.1490.82−0.120.168
Table 6. Multiple linear backward stepwise regression analysis based on a principal component analysis (B and Beta represent the regression coefficient; Std. represents standard error; Tol. represents tolerance; Sig. represents significance. Besides, *: p < 0.05 **: p < 0.01 ***: p < 0.001).
Table 6. Multiple linear backward stepwise regression analysis based on a principal component analysis (B and Beta represent the regression coefficient; Std. represents standard error; Tol. represents tolerance; Sig. represents significance. Besides, *: p < 0.05 **: p < 0.01 ***: p < 0.001).
Regression AnalysisUnstandardized CoefficientStandardized CoefficientTSig.Covariance Statistics
BStd.BetaTol.VIF
Constant−2.56 × 10−170.144 0.0001.000
F 1 0.1080.1500.1080.7220.48111
F 2 −0.3540.148−0.354−2.3960.029 *11
F 3 0.2680.1480.2681.8170.08811
F 4 −0.5970.148−0.597−4.0480.001 ***11
F 5 −0.3130.148−0.313−2.1180.05 **11
Table 7. Comparison of fitting degree and significance test of different models (**: p < 0.01).
Table 7. Comparison of fitting degree and significance test of different models (**: p < 0.01).
ModelR2Sig.
MLR–PCA0.6630.003 **
Lasso Regression0.5340.004 **
Table 8. Effect of yield prediction.
Table 8. Effect of yield prediction.
ModelActual Apple Yield/(t·ha−1)Forecasted Apple Yield/(t ha−1)Trend Yield Forecast/(t·ha−1)Meteorological Yield Forecast/(t·ha−1)Relative Error
MLR–PCA44.12847.25658.481−11.2257.089%
Lasso Regression44.12848.11658.481−10.3659.038%
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Han, X.; Chang, L.; Wang, N.; Kong, W.; Wang, C. Effects of Meteorological Factors on Apple Yield Based on Multilinear Regression Analysis: A Case Study of Yantai Area, China. Atmosphere 2023, 14, 183. https://doi.org/10.3390/atmos14010183

AMA Style

Han X, Chang L, Wang N, Kong W, Wang C. Effects of Meteorological Factors on Apple Yield Based on Multilinear Regression Analysis: A Case Study of Yantai Area, China. Atmosphere. 2023; 14(1):183. https://doi.org/10.3390/atmos14010183

Chicago/Turabian Style

Han, Xirui, Longbo Chang, Nan Wang, Weifu Kong, and Chengguo Wang. 2023. "Effects of Meteorological Factors on Apple Yield Based on Multilinear Regression Analysis: A Case Study of Yantai Area, China" Atmosphere 14, no. 1: 183. https://doi.org/10.3390/atmos14010183

APA Style

Han, X., Chang, L., Wang, N., Kong, W., & Wang, C. (2023). Effects of Meteorological Factors on Apple Yield Based on Multilinear Regression Analysis: A Case Study of Yantai Area, China. Atmosphere, 14(1), 183. https://doi.org/10.3390/atmos14010183

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop