ML-LME: A Plant Growth Situation Analysis Model Using the Hierarchical Effect of Fractal Dimension

: Rice plays an essential role in agricultural production as the most signiﬁcant food crop. Automated supervision in the process of crop growth is the future development direction of agriculture, and it is also a problem that needs to be solved urgently. Productive cultivation, production and research of crops are attributed to increased automation of supervision in the growth. In this article, for the ﬁrst time, we propose the concept of rice fractal dimension heterogeneity and deﬁne it as rice varieties with different fractal dimension values having various correlations between their traits. To make a comprehensive prediction of the rice growth, Machine Learning and Linear Mixed Effect (ML-LME) model is proposed to model and analyze this heterogeneity, which is based on the existing automatic measurement system RAP and introduces statistical characteristics of fractal dimensions as novel features. Machine learning algorithms are applied to distinguish the rice growth stages with a high degree of accuracy and to excavate the heterogeneity of rice fractal dimensions with statistical meaning. According to the information of growth stage and fractal dimension heterogeneity, a precise prediction of key rice phenotype traits can be received by ML-LME using a Linear Mixed Effect model. In this process, the value of the fractal dimension is divided into groups and then rices of different levels are respectively ﬁtted to improve the accuracy of the subsequent prediction, that is, the heterogeneity of the fractal dimension. Afterwards, we apply the model to analyze the rice pot image. The research results show that the ML-LME model, which possesses the hierarchical effect of fractal dimension, performs more excellently in predicting the growth situation of plants than the traditional regression model does. Further comparison conﬁrmed that the model we proposed is the ﬁrst to consider the hierarchy structure of plant fractal dimension, and that consideration obviously strengthens the model on the ability of variation interpretation and prediction precision.


Introduction
The most significant basic aspect of agricultural production and related scientific research is the acquisition of crop traits. So far, the main research on rice is focused on the physical and chemical properties of the growing conditions. Jing et al. studied the physicochemical characters and bacterial community structure of rice root in different fertilizers [1], and Mohammad et al. studied the relationship between the growth cycle of rice and the distribution of weeds [2]. Under normal circumstances, the growth of crops is greatly affected by soil physicochemical properties, light intensity and the proportion of air components with the fluctuation of actual situations. There is no doubt that the robustness of the model will be affected if the above-mentioned indicators are used to supervise or predict rice traits, and establish a simple linear model to analyze may result in a lack of comprehensive information extraction. The same goes for certain correlations among the rice traits, so that the fitted accuracy of linear regression cannot be guaranteed. When the model is applied to the intelligent and automated management of agriculture, these factors will leave some limitations.
More complex and comprehensive properties of an image could be attracted by fractal dimensions. In this paper, given the diversity of the fractal dimension of the sample pictures and its manifold distribution in space, the metrics that are obtained using Locally Linear Embedding (LLE) dimensionality reduction are combined with other phenotypic features as discriminative metrics for subsequent machine learning. Research on the discrimination of crops generally focuses on the variety discrimination, like one study by Zhai et al. who classified crops using an improved non-linear L-ISOMAP dimensionality reduction algorithm [3]; Xue et al. compared the discriminating effects of two artificial neural network algorithms in Chinese herbal medicine. As a result, one of the neural network separators combined from the fractal body characteristics of Chinese herbal medicine [4].
The linear mixed effect model has come into widespread use in agricultural production, animal and plant breeding and botany research, intended mostly for analyzing duplicate data, vertical data, nested data, and cluster data structures. Wang et al. fitted the growth parameters of rice aboveground biomass (AGB) and leaf area index (LAI) by the mixed linear model and multispectral images data from a fixed-wing unmanned aerial vehicle [5]. Li et al., utilized LME to realize the non-destructive estimation of the above-ground biomass on the entire rice growing season with terrestrial laser scanning data [6]. In this study, we tried to take advantage of RAP [7] and fractal dimensions of rice image to estimate rice growth parameters. However, when the linear regression model was used to fit rice traits, the empirical regression function has a tendency of random slope and random intercept, and the residual sequence also exhibited heteroscedasticity (Appendix B). In this case, we tried to stratify the fractal dimension and assume that there was a phenotypic heterogeneity among different fractal dimension groups of rice. After that, a Gaussian Mixture Model and an LME model were introduced to analyze the heterogeneity. The model established in this paper is so updated heterogeneity model in VERBEK [8] that the whole work is more pertinent.
In this study, machine learning discrimination algorithms and an LME model (ML-LME model) were applied to rice growth stage recognition and rice phenotype prediction, and fractal dimensions were introduced as new features to the model as well. The model linked multifarious rice traits that could be automatically measured to identify rice samples at three growth stages (tillering stage, booting and jointing transition period, heading and grain filling transition period), and to predict essential rice growth parameters (fresh weight, dry weight, plant height and green leaf area). It turned out that the heterogeneity of rice fractal dimensions at different growth stages can be modelled by Gaussian Mixture Model which are conducive to the subsequent research on classification, prediction, and clustering of rice species.
More specifically, in the rest of this article, the data set our model performs on will be explained first, then the ML-LME model will be constructed in detail within three separate modules, which are discriminant modules based on machine learning, a GMMbased fractal dimension hierarchical division module and a rice phenotype fitting module based on LME. After that, the performance of module 1 and module 3 on the rice plot data set will be illustrated. During that, module 1 will obtain the optimal machine learning method to discriminant rice growth stage, and module 3 will obtain the optimal fractal dimension method that acquires the most significant heterogeneity and the best ability to make predictions on rice traits. Finally, the model performance and potential contribution will be detailed discussed in the discussion and conclusion sections.
The overall performance of the model was tested. In the process of identifying the growth stage of rice, the model was compared with traditional machine learning methods. By exploring the spatial manifold data features among multiple fractal dimensions of the rice data, the classification precision improved by more than 2%. After that the model was used for excavating and testing the hierarchical effect of the fractal dimension. When fitting the fresh weight and dry weight of plants, the intraclass correlation coefficient was adopted to verify the heterogeneity between two traits in different varieties. Combining the hierarchical effect of the fractal dimension and the mixed linear model to predict plant traits, the prediction of the four rice traits was more accurate than the simple linear regression model, the improvement of RRMSE ranges from 0.03% to 9.5% and R 2 could be endowed an improvement from 0.01 to 0.02.

Data Source and Description
The data in this paper are derived from the measurement results of rice traits grown in the Potted of Huazhong Agricultural University by Yang's RAP (Automatic Rice Phenotyping System, 2014). RAP can measure 28 rice traits and traditional plant height, plant width, plant vertical height, plant height/width, side view projected area of rice plant, rice structural parameters, relative frequency and projected area/length by width are involved in this study. For the abbreviations of all rice phenotypic traits could be found in the Abbreviations of this article. The fresh weight (g), dry weight (g), plant height (cm), number of tillers and green leaf area (mm 2 ) are manually measured. The data set used in this paper contains the results of RAP, measuring the phenotypic traits of 521 rice varieties at three different growth stages [7].

ML-LME Model
As shown in Figure 1, we started with pretreatments of images including background denoising, binarization and grayscale before modelling. The panelists calculated the Boxcounting dimension, the Sandbox dimension and the Random Walk fractal dimension, which were built on the above mentioned images. Taking into account the diversity of the fractal dimension and manifold distribution in space, LLE was applied to reduce dimensionality. Subsequently, the combination of indicators obtained by dimensionality reduction and other easy-to-observe rice phenotypic traits was a new machine learning discriminant indicator. The universal performance of four machine learning algorithms (BP neural network, SVM, KNN, decision tree) is compared when discriminating rice growth stage among all 521 varieties. The model with the highest precision are chosen to make a prediction (Module 1). Taking advantage of this model, the rice growth stage could be precisely divided during the rice tillering stage, the transition stage of booting and jointing and the transition stage of heading and grain filling. Whereafter, the distribution characteristics of the fractal dimension of rice were evacuated and fitted with a Gaussian mixture model (Module 2). The experiment proved that improved prediction accuracy was obtained when classifying the rice samples according to the distribution of fractal dimension-the hierarchical effect of the fractal dimension. Setting the hierarchical effect of the fractal dimension to the random effect of LME and the intra-group correlation coefficient have identified that there was a significant difference among various fractal dimension groups of rice phenotypic trait. All indicated that the fractal dimension contained the heterogeneity information of rice traits. Therefore, we classified fractal dimensions of rice after distinguishing different rice growth stages, and finally established an LME prediction model (Module 3) for each type of fractal dimension and corresponding rice phenotypic traits, which further improved the level of automation of rice supervision. The concept of the fractal dimension was first proposed to describe complex and irregular physical characteristics [9]. The fractal dimension can reflect the space occupation and complexity of an image [10]. This paper mainly uses the fractal dimension to describe the rice image characteristics and then uses its statistical characteristics to provide more information for subsequent discrimination and prediction. There are many specific calculation methods for the fractal dimension; this article includes three fractal dimensions: the Box-counting dimension, the SandBox dimension and the Random Walk fractal dimension [9,11,12], see Appendix A for details.
Considering the variety of fractal dimensions and rice traditional characteristic traits, we use the LLE algorithm and the Principal Component Analysis algorithm (PCA) to reduce the dimensions of these indicators. Among them, the LLE dimensionality reduction algorithm, as a nonlinear dimensionality reduction method, has a good dimensionality reduction effect for the spatially overlapping data of manifold [13,14].
A three-dimensional scatter plot is introduced to speculate the manifold characteristics between the distribution of multiple fractal dimensions of rice, while the LLE algorithm is applied for experiments. At the same time, traditional PCA is used for comparative analysis.

Module 2: GMM-Based Fractal Dimension Hierarchical Division Module
Bayesian analysis models are widely used to detect heterogeneity [15,16]. Combining an LME model with a Gaussian Mixture Model to analyze heterogeneity was first relaized by Geert Verbek (1996) [8]. By setting the distribution of random effects as a mixture of multiple Gaussian distributions, Verbek proved that the LME model could model the heterogeneity of populations. However, in this study, the rice populations are not divided in advance. Thus, GMM is used to cluster the fractal dimensions thus the hierarchical struc-ture of the rice data can be obtained. The clustering results of the fractal dimensions are set as the random effects of the LME model, then the LME model could be used to evaluate the heterogeneity of rice and improve the efficiency of prediction. In each growth stage of rice, the fractal dimension x of a rice image is supposed to be generated from multiple Gaussian populations with different means and variances, and its prior distribution is: Among them, µ is the mean value of each Gaussian distribution, Σ is the variance of each Gaussian distribution, α is the proportion and N (.; µ i , σ i ) is the Gaussian distribution density function. The EM algorithm is used to fit the rice fractal dimension data. By clustering the fractal dimension, the hierarchical effect of the rice fractal dimension is obtained, and the algorithm will learn the parameters (α, µ, Σ) in the prior distribution. For the fractal dimension data within and outside the sample, Bayesian Maximum Posterior Probability Estimation is used to infer the hierarchy of the fractal dimension. Use {x ∈ I i } to indicate that the fractal dimension x belongs to the Gaussian population corresponding to the i-th level. Obtained by the Bayesian formula: Then use: to determine the clusters of fractal dimension. Where p(x ∈ I i |x; α, µ, Σ) indicate the posterior probability of sample x belongs to cluster I i , which is obtained by the GMM.

Module 3: Rice Phenotype Fitting Module Based on LME
The general expression of the mixed linear model is: where Y ∈ R n×1 is the observation value vector. X ∈ R n×p is the matrix for fixed effects. β ∈ R p×1 is a parameter vector of fixed effects, which is not random. Z ∈ R n×q is the design matrix of random effects. µ ∈ R q×1 is the random effect of the model, usually set to obey the normal distribution of 0 mean is a random vector, that is, µ and are mutually exclusive. G and R are both positive definite matrices. It is usually assumed that G = G(λ), R = R(γ). Therefore, the parameter vector of the model to be estimated is Θ = (σ, γ, λ).
The essence of the mixed linear model is to further model the residuals of the linear model [17]. For the selection of random effects, considering that the linear model cannot completely extract the residual information of the fractal dimension and there is a certain heterogeneity in the growth of rice at different growth stages, indicating that rice has a hierarchical effect on these two variables. Therefore, the fractal dimension and growth stage are set as random effects. The Pearson Correlation Coefficient is considered while screening out the phenotypic traits that are significantly related to the predicted traits as a fixed effect. In this module, plant width, plant vertical height (rice is not straightened, the height of the highest point in the natural state), rice plant texture parameters, projected area of rice plant side view, relative frequency and rice structural parameters are the final selected indicators. L ogarithmic transformation has been applied to all indicators to eliminate the influence of dimension.
The construction, data processing and analysis of the mixed linear model are completed in the R language (version: R i386 3.6.0) through the packages lme4 and lmerTest [18,19].
Various models can be obtained by the interaction between the growth stage and different dimensions, and the optimal model is selected by comprehensively considering the minimum criteria of AIC and BIC [18]. Intra-group correlation coefficient analysis is considered to evaluate the heterogeneity of rice traits among different groups. In the linear Mixed Effect Model, the Intra-group correlation coefficient is calculated through the analysis of variance components: where σ 2 µ indicates the variance of random effects in the model, and σ 2 indicates the variance of the residual. ICC was first used to quantify and evaluate the reliability of measurement [20,21], which is generally between 0 and 1, and the thresholds are 0.4 and 0.75, respectively. An ICC less than 0.4 illustrates poor reliability, while higher than 0.75 illustrates good reliability. In this study, ICC could quantify the heterogeneity of rice varieties between different fractal dimension levels. When an ICC higher than 0.75 occurs, it can be trusted that the rice varieties between different levels are heterogeneous, and the rice varieties within the same hierarchy are clustered and show correlations among similar traits.

Comparison of Classification and Discrimination Results Based on Traditional Fractal Dimension
Based on the traditional representation attributes, Module 1 contains four machine learning algorithms (BP neural network, Decision Tree, Support Vector Machine (SVM), K-Nearest Neighbor (KNN) algorithm [22][23][24]) and nine fractal dimensions processing methods. The comparison of the performance of four machine learning algorithms combined with different kinds of fractal dimensions' processing are detailed in Table 1. It can be found that, on average, when the fractal dimension feature is introduced, the discrimination effect of the classifier has been significantly improved. The most obvious improvement in the discrimination effect is that, after the introduction of the DBCG dimension, the overall classification precision has increased by 1.51%, and the Kappa index has increased by 0.03. In terms of precision, the dimensions of Sandbox and RFD have also increased by 1.35% and 1.44%, respectively. Other fractal dimensions have also increased by close to 1%. From the perspective of Kappa coefficient, there is generally an increase of more than 0.02. Therefore, the introduction of fractal dimension is of positive significance for the identification of growth stage. Table 2 discusses the classification effects of different classifiers on rice growth stages after combining the seven traditional representation attributes with five fractal dimensions, the dimensions obtained after dimensionality reduction using PCA, and the dimensions obtained after dimensionality reduction using LLE. At the same time, multiple multicategory evaluation indicators are used for measurement. , projected area/length by width (SA/PH_V*PW); ALL represents the set of five fractal dimensions, including DBCB, DBCG, Sandbox, SFD, RFD. LLE represents the two low-dimensional sets of five fractal dimensions after the LLE nonlinear dimensionality reduction method; PCA represents the two low-dimensional sets of the five fractal dimension results after the PCA linear dimensionality reduction method.

Comparison of the Classification Results of Multiple Fractal Dimensions Using the Dimensionality Reduction Method
From Table 2, it can be seen that, after using the LLE dimensionality reduction method, the classification accuracy of KNN and Decision Tree has been improved the most, being improved by 1.62% and 1.15%, respectively, compared with the traditional model of T+ALL. In terms of classification accuracy, KNN and Decision Tree improved by 2.86% and 3.17%, respectively. At the same time, the classification accuracy under the BP neural network + LLE model is the highest, reaching 93.64% and its Kappa coefficient, Micro-F1 and Macro-F1 (0.83, 0.95, 0.95, respectively) have the highest index levels. Therefore, the BP+LLE model is selected as the first part of the ML-MLE model to improve the effect of rice growth stage discrimination.

The Results of the LME Fitting
The interaction between the growth stage and fractal dimension hierarchy can produce different random effect models. The optimal model is selected by comprehensively considering the minimum AIC and BIC criteria [25,26]. The significance of heterogeneity among rice populations could be evaluated through ICC. Table 3 contains the results of model selection. For fresh weight, the optimal division method is to consider the interaction between DBCB dimension and growth stage. Its intra-group correlation coefficient is 0.802 and the heterogeneity is significant. For dry weight, the optimal division method is to consider the interaction between the DBCB dimension and the growth stage. The intra-group correlation coefficient is 0.881, and the heterogeneity is extremely significant. For plant height, the optimal division method is SFD dimension, but the correlation coefficient within the group is less than 0.4, there is no obvious heterogeneity. For green leaf area, the optimal division is to consider the interaction between SFD dimension and growth stage, the correlation coefficient within the group is less than 0.4, no significant heterogeneity is detected. Note: SR: LME model that takes the interaction between the growth stage and the fractal dimension calculated by the random walk method as a random effect; SS1: LME model that takes the interaction between the growth stage and the fractal dimension calculated by the SandBox method as a random effect; SS2: LME model that takes the interaction between the growth stage and the fractal dimension calculated by the SFD method as a random effect; SD1: LME model that takes the interaction between the growth stage and the fractal dimension calculated by the DBCB method as a random effect; SD2: LME model that takes the interaction between the growth stage and the fractal dimension calculated by the DBCG method as a random effect.
In terms of prediction, for each trait, the model obtained by the interaction between the SandBox dimension and the growth stage achieves the best results on both AIC and BIC. The possible reason is that the SandBox dimension considers the centroid of the image when calculating the position, so more emphasis is placed on the part of the rice plant in the image. Therefore, consider using this model to predict rice traits. In Figure 2, predictions considering hierarchies of fractal dimensions have a certain improvement in accuracy and the variation of the observations comparing those of a simple Linear Regression model. This enhancement obviously demonstrates that the proposed hierarchical approach modeling more information of rice phenotypic traits relationship than classical approaches. Shapiro-Wilk test is performed on the residual distribution of the linear model. The Shapiro-Wilk test can detect whether the sample data set obeys a normal distribution [27]. Table 4 shows the test results of the linear regression model for the prediction of the four rice traits involved in this article. If the residuals of the linear model fail the Shapiro-Wilk test at a higher confidence level (p = 0.01), the assumption that the residuals of the linear model fit are normally distributed can be rejected. This shows that the data set used in this article cannot match the assumptions required by the linear regression model, and the direct use of the linear model for modeling cannot fully extract the correlation information between rice traits.
According to the AIC and BIC model selection result, model SS1, which considers the interaction between rice growth stage and the hierarchy of SandBox dimension as a random effect, are the optimal Linear Mixed Effect model to make prediction. The Shapiro-Wilk test was performed on the residuals of the mixed linear model. Table 5 shows the test results of the residuals of the Linear Mixed Effect model in the four rice traits predicted in this paper. The residuals of this model have passed the normality test on the dry weight and green leaf area, while the normality of the residuals of weight and plant height is rejected. This result shows that, on the data set used in this article, the Linear Mixed Effect model is more suitable than the Linear Regression Model in fitting dry weight and green leaf area.  Note: p-value with * means the statistic is statistically significant at a confidence level of α = 0.01.
A box plot is used to more intuitively describe the distribution of the residuals of different models. A box plot could illustrate the dispersion (spread) and skewness in the data. Figure 3 shows the box plots of residuals of different models while fitting fresh weight, dry weight, plant height and green leaf area. The box plot indicates that, for fresh weight, dry weight and green leaf area, the Linear Mixed Effect model has more advantages than the Linear model. It is mainly reflected in the relatively concentrated residual distribution of the Linear Mixed Effect Model. The Linear Mixed Effect Model extracts more information about the correlation of plant traits than the Linear Model. Meanwhile, the Linear Mixed Effect Model established by the hierarchy of the SandBox dimension tends to have more arrow distribution of the residual compared with other Linear Mixed Effect models, which supports the result of the AIC and BIC model selection. However, when fitting plant height, the performance of the Linear Mixed Effect model has no evident advantage. Detailed results of prediction are shown in Table 6. It is found that the mixed linear model has significant advantages for fresh weight and dry weight. But the mixed linear model and linear regression of plant height and leaf area have no significant advantages. It may be that the difference between these two traits in different categories is not significant, and the mixed linear model cannot extract more information. In terms of prediction accuracy, LME also showed good results, using the generalization ability of the 10-fold cross-validation model. In terms of fresh weight, the RRMSE of the LME is 0.172, while the RRMSE of the linear model is 0.259. In terms of dry weight, the RRMSE of the LME is 0.169, while the RRMSE of the linear model is 0.264. In terms of green leaf area, the RRMSE of the LME is 0.135, while the RRMSE of the linear model is 0.144. In terms of plant height, the RRMSE of the LME model is 0.093, while the RRMSE of the linear model is 0.096. In terms of prediction accuracy, the LME showed a lower root mean square error.

Discussion
This research brings together machine learning methods and the hierarchical effect of fractal dimensions to improve the ability of fitting rice essential phenotypic traits and the precision of rice growth discrimination, during which the potential contribution of fractal dimension theory in rice research is discovered. A reference for related research on smart agriculture and precision agriculture is provided. Moreover, previous research on rice phenotypes only modeled the traits studied and rarely considered the rice growth stage. In this research, the interaction of the rice growth stage and the hierarchical structure of the fractal dimension was set as random effects, while in previous studies, the random effects of mixed linear models were often directly set to physical quantities with practical meaning such as different individuals, different times, and geographical distribution. Before obtaining the hierarchical structure, the locally linear embedding algorithm combined with the machine learning models are used to realize high precision identification of the rice growth stage. Among these models, the BP+LLE model has the highest discriminant precision for the rice growth period, reaching 93.64%.
At each growth stage, the Gaussian Mixture Model was used to model the distribution of the fractal dimension of rice to obtain the hierarchy of the fractal dimension. The significance of the hierarchical effect of the fractal dimension on the fresh weight, dry weight and other traits was verified by intra-class correlation coefficient of mixed linear model. For fresh weight and dry weight, the model produces significant heterogeneity, whose optimal intra-class correlation coefficients are 0.802 and 0.811. The results showed that the correlations between the growth patterns and traits of rice varieties distributed in different fractal dimension levels may be different. This reflects the heterogeneity of different rice varieties, which is of reference significance for the study of the stratification effect in botany and the heterogeneity of rice varieties. In terms of prediction accuracy, both the modified R 2 and RRMSE of the linear mixed model are better than the general linear regression model, proving its potential in predicting key rice traits. The prediction of the four rice traits was more accurate than the simple linear regression model, the improvement of RRMSE variety from 0.03% to 9.5% and R 2 could be endowed an improvement from 0.01 to 0.02.

Conclusions
In this study, we propose a rice growth situation analysis method based on fractal dimension theory, which is the ML-LME model. In both rice growth stage discrimination and rice phenotypic traits, our model achieves significant improvement. After a detailed past reference comparison, it could be confirmed that we are the first to enhance rice traits fitting with the concept of rice fractal dimension heterogeneity. In the rice phenotype fitting module based on LME, for testing sets, the R 2 of four rice traits ranged from 0.90 to 0.97, respectively. Compared with the original precision made by Yang et al., the R 2 of four rice traits ranged from 0.82 to 0.90, respectively [7].
The model proposed in this paper is also applicable to other crop types, which can expand the application range of the hierarchical structure of the fractal dimension in agricultural production. In future research, considering that there is a nonlinear relationship between the traits of certain crops and that the fixed effects in the LME only use linear functions for fitting, using nonlinear functions as the fixed effects in the mixed-effects model is considered, that is, the Generalized Linear Mixed Model (GLME) [17]. Hajjem et al. proposed the Mixed Effects Random Forest Algorithm (MERF), which uses the random forest algorithm as a fixed effect to predict clustering data [28]. Fitting fixed effects by the random forest algorithm can extend the model in this article to more complex situations. Therefore, in future research, we can consider using these algorithms to expand the applicable fields of the model. At the same time, due to the ambiguity in the growth stage of the original rice data, in future long-term experiments, more rigorous and detailed experiments could be carried out and more accurate experimental results can be obtained.

Institutional Review Board Statement:
In this section, please add the Institutional Review Board Statement and approval number for studies involving humans or animals. Please note that the Editorial Office might ask you for further information.
Informed Consent Statement: Any research article describing a study involving humans should contain this statement. Data Availability Statement: Collected data are available from the authors.

Conflicts of Interest:
The authors declare no conflict of interest.

Abbreviations
The

Appendix A. Algorithm for Calculating Fractal Dimension
The calculation of the fractal dimension in the research includes the gray image and the binary image of the rice plant. The grayscale image is obtained by the camera shooting each rice plant from 12 different angles. The binary image is converted from each gray image using MATLAB 2016a software programming. A total of 5 fractal dimension calculation methods are selected, namely box-counting dimension based on binary image (DBCB), box-counting dimension based on gray-scale image (DBCG), sandbox dimension based on gray-scale image (sandbox dimension) Count) the dimension (RWD) measured by the random walk method based on the grayscale image and the box-counting dimension (SFD) based on the minimum bounding rectangle of the binary image.
in the image in each area N(r k ), also using linear fitting to obtain the linear regression equation of log N(r) and log r, the coefficient of the log r term in the equation is the estimated value of the fractal dimension [5,7].
The SandBox method has many methods to determine the location of the coverage area. The method selected in this article is to use the centroid of the original image as the location of the center of all coverage areas, which can simplify the calculation complexity and obtain a higher value. The correlation between log N(r) and log r [4].  Figure A2 presents the scatter plot to speculate the manifold characteristics using LLE algorithm.

Appendix C
(a) (b) Figure A2. scatter plots to speculate the manifold characteristics using LLE algorithm. The round points represent tillering stag, the cross-shaped points represent booting and jointing transition period, and the triangular points represent heading and grain filling transition period. (a) Fractal dimension scatter plot. (b) LLE dimensionality reduction results.