Combining Different Transformations of Ground Hyperspectral Data with Unmanned Aerial Vehicle (UAV) Images for Anthocyanin Estimation in Tree Peony Leaves

: To explore rapid anthocyanin (Anth) detection technology based on remote sensing (RS) in tree peony leaves, we considered 30 species of tree peonies located in Shaanxi Province, China. We used an SVC HR~1024i portable ground object spectrometer and mini-unmanned aerial vehicle (UAV)-borne RS systems to obtain hyperspectral (HS) reﬂectance and images of canopy leaves. First, we performed principal component analysis (PCA), ﬁrst-order differential (FD), and continuum removal (CR) transformations on the original ground-based spectra; commonly used spectral parameters were implemented to estimate Anth content using multiple stepwise regression (MSR), partial least squares (PLS), back-propagation neural network (BPNN), and random forest (RF) models. The spectral transformation highlighted the characteristics of spectral curves and improved the relationship between spectral reﬂectance and Anth, and the RF model based on the FD spectrum portrayed the best estimation accuracy ( R 2c = 0.91; R 2v = 0.51). Then, the RGB (red-green-blue) gray vegetation index (VI) and the texture parameters were constructed using UAV images, and an Anth estimation model was constructed using UAV parameters. Finally, the UAV image was fused with the ground spectral data, and a multisource RS model of Anth estimation was constructed, based on PCA + UAV, FD + UAV, and CR + UAV, using MSR, PLS, BPNN, and RF methods. The RF model based on FD+UAV portrayed the best modeling and veriﬁcation effect ( R 2c = 0.93; R 2v = 0.76); compared with the FD-RF model, R 2c increased only slightly, but R 2v increased greatly from 0.51 to 0.76, indicating improved modeling and testing accuracy. The optimal spectral transformation for the Anth estimation of tree peony leaves was obtained, and a high-precision Anth multisource RS model was constructed. Our results can be used for the selection of ground-based HS transformation in future plant Anth estimation, and as a theoretical basis for plant growth monitoring based on ground and UAV multisource RS.


Introduction
Anthocyanin (Anth) is one of the three main pigments in plants and is responsible for the color of plant petals and fruits [1]. It is generally found in the cytoplasm of plants and is not a photosynthetic pigment, but can protect the plant's photosynthetic system from excessive light radiation, especially excessive ultraviolet radiation [2,3]. At the same time, Anth is also a secondary metabolite in plants subjected to environmental and biological stress (e.g., high temperature, water shortage, high salinity, diseases, insects, and pests), and its content can be used as an index to indirectly reflect stress levels [4]. Tree peony, one of the top ten most famous flowers in China, is reputed to be the king of flowers. Its flowers are ornamental, its roots can be used as medicine, and its seeds can be pressed to

Study Area
The experiment was carried out at the tree peony garden of Northwest A&F University in Shaanxi Province, China (34 • 15 -34 • 20 N, 107 • 56 -108 • 7 E), on 8 April 2018, when the peonies were in full bloom. The average elevation of the garden is 460 m and the climate is warm temperate continental monsoon. There were 32 plots in the study area, among which 2 were mixed variety plots and the remaining 30 were single variety plots. The single variety plots were selected for the study, covering 30 varieties of tree peonies such as Zierqiao, Yinhongqiaopair, and Yaohuang. According to the rule of diagonal sampling, two tree peonies representing the average growth of tree peonies in each plot were selected for spectral and anthocyanin measurement, and a total of 60 peony samples were studied. The location of the study area and the distribution of samples are shown in Figure 1.

Anth Quantification
Leaf Anth was measured with Dualex 4 (FORCE-A, Orsay, France). This measures the leaf epidermal anthocyanin absorbance at 520 nm by means of chlorophyll fluorescence screening, equalizing the chlorophyll fluorescence signal under 520 nm excitation, and that under red excitation at 650 nm, as reported by Goulas et al. [22]. The method also accurately measures the leaf chlorophyll and surface flavonoid contents and nitrogen balance index, and is easy to use for real-time non-destructive measurement. Some scholars have compared Anth measured by traditional chemical methods, such as the Multiplex instrument and UV-A-PAM fluorimeter, with that measured using Dualex, which further proved the reliability of Dualex for Anth measurement [23,24]. To obtain representative Anth, 10 leaves were selected from the tree peony sample, and six Anth measurements were carried out for each leaf, which were then averaged. The measured Anth was subjected to stratified and random sampling at a ratio of 2:1, ignoring the effect of variety; 40 samples were selected for model construction, and the remaining 20 samples were used for model verification. The Anth statistics of the calibration and test sets conducted in our study are shown in Table 1; the data indicated that the Anth content of the calibration set was 0.051-0.171 µg/cm 2 , and that of the test set was 0.056-0.169 µg/cm 2 . The test set was within the calibration set and had similar data distribution characteristics to the calibration set. The coefficients of variance (CVs) of the calibration and test sets were 27.722% and 26.732%, respectively, with moderate spatial variation. Note: Coefficient of variation (CV); standard deviation (SD); number (N).

Hyperspectral Data Acquisition
The reflection spectra of tree peony leaves were measured under good weather conditions and stable solar radiation. The instrument used was an SVC HR~1024i spectrometer produced by Spectra Vista Corporation (SVC) of America, with a band range of 350-2500 nm. Firstly, in the absence of any shielding, the test spear head was vertically aligned with the reference plate in the direction of the sun for spectral measurement, and the obtained spectrum was used as a reference for the reflection spectrum correction of the tree peony leaves. Then, the probe connected to the optical fiber (viewing angle of 8 • ) was placed 30 cm above the tree peony leaves and measured vertically downward. Ten leaves were selected from each tree peony sample, and six spectra were measured for each leaf. Finally, all spectra were averaged as the final spectrum of the tree sample. The spectrometer and tree peony samples are shown in Figure 2.
Remote Sens. 2022, 14, x FOR PEER REVIEW to the calibration set. The coefficients of variance (CVs) of the calibration a were 27.722% and 26.732%, respectively, with moderate spatial variation.

Hyperspectral Data Acquisition
The reflection spectra of tree peony leaves were measured under go conditions and stable solar radiation. The instrument used was an SVC HR~ trometer produced by Spectra Vista Corporation (SVC) of America, with a ba 350-2500 nm. Firstly, in the absence of any shielding, the test spear head wa aligned with the reference plate in the direction of the sun for spectral measur the obtained spectrum was used as a reference for the reflection spectrum c the tree peony leaves. Then, the probe connected to the optical fiber (viewing was placed 30 cm above the tree peony leaves and measured vertically dow leaves were selected from each tree peony sample, and six spectra were m each leaf. Finally, all spectra were averaged as the final spectrum of the tree s spectrometer and tree peony samples are shown in Figure 2.

Airborne Campaigns
The flight experiment was carried out after the spectral measurement on The aircraft was Phantom 4 Pro-DJI UAV (DJ-Innovations, Shenzhen, China) equipped with a GPS/GLONASS dual positioning module with accurate coo formation. The weight of the fuselage was approximately 1.4 kg, and the end 30 min under conditions of no wind and maximum load. The aircraft had a b

Airborne Campaigns
The flight experiment was carried out after the spectral measurement on the ground. The aircraft was Phantom 4 Pro-DJI UAV (DJ-Innovations, Shenzhen, China), which was equipped with a GPS/GLONASS dual positioning module with accurate coordinate information. The weight of the fuselage was approximately 1.4 kg, and the endurance was 30 min under conditions of no wind and maximum load. The aircraft had a built-in dual inertial measurement unit (IMU) and dual compass, which can record its geographical position and three-axis attitude angle in real time during the flight, improving the accuracy of data. The sensor used was the instrument's 1 inch COM lens, with 20 million effective pixels, 35 mm equivalent focal length, and maximum photo resolution of 19.96 million (5472 × 3648). The flight path was designed in Altizure, with a flight altitude of 50 m, and the flight path overlap rate and side overlap rate were both 75%. Finally, 162 effective RGB (red-green-blue) images were obtained in the experiment.

Pretreatment and Spectral Transformation of Ground-Based Spectrum
First, the spectrum was preprocessed using Savitzky-Golay filtering in Unscrambler X 10.4 with a smoothing point of 5, while effectively removing the influence of ambient noise. Then the spectral resolution was resampled to 1 nm. The wavelengths that are strongly correlated with plant pigments are concentrated in the visible and infrared regions, and the constituent wavelengths of vegetation indices commonly used for estimating plant physiological and biochemical parameters are also in this region, therefore we studied only the reflection spectra of tree peony leaves in the region of 400-1500 nm.

Principal Component Analysis (PCA) of Spectra
Principal component analysis (PCA) constructs a new orthogonal feature in the K dimension by mapping the original N-dimension feature to the K-dimension (K < N). Only the features containing the most variance were retained, and the features containing almost zero variance were ignored for data dimension reduction [25]. In the field of RS, PCA can effectively remove the correlation, redundancy and collinearity between bands, and is the most widely used method for dimensionality reduction [26]. Therefore, we used this method to reduce the dimension of the tree peony canopy spectrum.

First-Order Differential (FD) Processing of Spectra
First-order differential (FD) processing of plant spectra can compress the influence of background noise on target signals and to a certain extent enhance the contrast of the spectral absorption characteristics of all biochemical components [27]. In this study, the difference method was used to approximate the first-order differential spectrum of peony leaves. The specific calculation formula can be expressed as follows: where λ i refers to the wavelength of band i, R(λ i ) refers to the original spectral reflectance corresponding to wavelength λ i , and R (λ i ) refers to the reflectance of first-order differential spectrum corresponding to wavelength λ i . In the FD spectrum, "three-edge" parameters were the most commonly used spectral parameters, which can accurately reflect the growth status of plants and are the intuitive expression of plant pigment, cell structure, water content, and dry matter quality in the reflection spectrum [28]. We calculated the "three-edge" parameters (position, amplitude, and area) based on the red, blue, and yellow light regions of tree peony leaf reflection spectrum respectively, and their definitions and formulas are shown in Table 2. Note: first-order differential (FD).
Vegetation indices (VIs) based on the FD spectrum have been commonly used to analyze and detect changes in plant physiology and biochemistry [31,32]. These indices, based on information at specific wavelengths, have been developed to reflect diverse plant parameters, such as pigment content, water content, and leaf area. However, the quantitative analysis of a specific tree peony pigment based on the commonly used vegetation indices is not possible at present due the lack of crop species specificity within the available indices. Therefore, to simplify the RS monitoring of tree peony Anth, we constructed differential vegetation index (DVI), ratio vegetation index (RVI), normalized vegetation index (NDVI), and soil-regulated vegetation index (SAVI) for all possible two-band combinations of 400-1500 nm. The coefficients of determination (R 2 ) between Anth and VI can reflect the predictive power of the two independent band combinations.

Continuum Removal Processing of Spectra
Continuum removal (CR), also known as the envelope removal method, was first proposed by Kokaly and Clark [33]. Continuum removal reflectance is the ratio of the original spectral reflectance to the continuum of the corresponding band. The continuum is approximated by a straight line joining the two local reflectance maxima placed on both shoulders (λ min and λ max ) of the peak absorption wavelength (λ peak ). Continuum removal, CR λ , was thus written as a function of reflectance values R(λ) at wavelength λ, with the constraint that its maximum value could not be above 1.0 (concavity of the reflectance spectra at this location) [34,35].
Continuum removal can effectively remove spectral information noise, eliminate the influence of mesophyll structural parameters, and increase the depth difference of the absorption valley between the spectra of plants with different health statuses. Absorption characteristic parameters based on continuum removal spectrum development can improve the response ability of crop nitrogen and chlorophyll [36,37]. Therefore, in this paper, the seven most commonly used absorption parameters were extracted based on the continuum removal spectrum, and their effect on Anth estimation of tree peonies was investigated. Absorption-band parameters, such as the position, depth, width, and asymmetry of the feature have been used to quantitatively estimate the composition of samples from hyper-spectral fields and laboratory reflectance data. In this study, the total area of absorption peak (TA), left area of absorption peak (LA), right area of absorption peak (RA), degree of symmetry (S), normalized maximum absorption depth (NAD), maximum absorption depth (BD max ), and absorption band wavelength (P) were extracted from the CR spectra of tree peony leaves using ENVI 5.1. The definition and calculation formula of the absorption parameters are shown in Table 3. Table 3. Absorption parameters of continuum removal (CR) spectrum.

TA
Integral of the depth of the band from the beginning to the end of a continuum λ max λ min dR (λ) [38] LA Integral area range from the wavelength corresponding to the maximum absorption depth to the left absorption peak λ peak λ min dR (λ) [38] RA Integral area range from the wavelength corresponding to the maximum absorption depth to the absorption peak on the right λ max λ peak dR (λ) [38] S Ratio of left area of absorption peak to right area of absorption peak LA RA [38] NMAD Ratio of maximum absorption depth to total area of absorption peak AD max Maximum absorption depth 1 − R (λ peak ) [40] P Wavelength corresponding to the maximum absorption depth λ peak [38] Note: R indicates the continuum-removed reflectance value.

RGB (Red-Green-Blue) Gray Vegetation Index Extraction
The UAV images were first aligned using the Structure from Motion (SFM) algorithm in the Agisoft PhotoScan professional software (Agisoft, Saint Petersburg, Russia); then, we generated dense point clouds based on the dense multi-view stereo matching algorithm, followed by mesh and texture generation. Finally, an orthomosaic of the study area with real coordinates and detailed texture information was obtained. The mosaic process of the UAV images is shown in Figure 3.
The orthomosaic contained the gray information of the R, G, and B bands, and its pixel size was 0.027 m 2 (0.18 m × 0.15 m). Region of interest (ROI) was plotted in ENVI 5.1 to extract the average gray values of the R, G, and B bands of the tree peony sample leaves, and the RGB gray VIs were constructed based on the VI construction principle. Although the gray value was different from the reflectivity of the corresponding wavelength, it was also a quantified expression of the reflected light intensity. Vegetation index is currently extensively used in the field of RS, but most VIs are based on the visible and near infrared bands. However, the sensors in our study were only able to obtain spectral information in the R, G and B bands; therefore, we chose the VIs based on visible light to estimate Anth of tree peony leaves. Among them, the visible atmospherically resistant index (VARI) can highlight the spectral reflection in the visible band, and can reduce the influence of light and atmosphere. The red-green ratio Index (RGRI) was calculated using the ratio of the reflectance of green band and red band. The value of RGRI is closely related to the nutritional status of plants, and has achieved good results in the monitoring of pasture quality and soybean biomass. The normalized green index (NGI), normalized blue index (NBI), and normalized red index (NRI) normalized the reflectance of red, green, and blue bands to a unified standard, and had a good effect on crop recognition. The normalized green-red difference index (NGRDI) was constructed based on the principle of NDVI, making full use of the strong reflection of the green band and strong absorption of red light; it can be used as an alternative to NDVI and has a good relationship with plant growth. The dark green color index (DGCI) was constructed based on the color space of HSV (hue, saturation, value) and represented the greening rate of the plant canopy [41]. Therefore, we constructed the VIs based on gray information from the UAV images, as presented in Table 4.

Texture Parameter Extraction of Unmanned Aerial Vehicle (UAV) Images
In addition to the spectral parameters, the texture characteristics of the image were not easily affected by the color and brightness of the ground objects, and thus, well reflected the growth of plants. Image texture is represented by the gray distribution of the pixel and its surrounding spatial neighborhood. Extraction methods can be divided into structure-based and statistics-based methods. The latter approach was used in this study; statistics-based methods can directly and quantitatively describe the statistical properties of texture features and are increasingly used in plant growth monitoring. The orthographic image of the study area was extracted using texture information based on probability and statistical filtering, and the processing window was 3×3. Five texture parameters for the R, G, and B channels were extracted individually, including data range, mean, variance, entropy, and skewness. The UAV image processing flow is shown in Figure 3.

Parameters Formulas References
Visible atmospherically resistant index Note: R, G, and B represent the gray values of red, green, and blue bands respectively.

Texture Parameter Extraction of Unmanned Aerial Vehicle (UAV) Images
In addition to the spectral parameters, the texture characteristics of the image were not easily affected by the color and brightness of the ground objects, and thus, well reflected the growth of plants. Image texture is represented by the gray distribution of the pixel and its surrounding spatial neighborhood. Extraction methods can be divided into structure-based and statistics-based methods. The latter approach was used in this study; statistics-based methods can directly and quantitatively describe the statistical properties of texture features and are increasingly used in plant growth monitoring. The orthographic image of the study area was extracted using texture information based on probability and statistical filtering, and the processing window was 3 × 3. Five texture parameters for the R, G, and B channels Remote Sens. 2022, 14, 2271 9 of 20 were extracted individually, including data range, mean, variance, entropy, and skewness. The UAV image processing flow is shown in Figure 3.

Regression Model Construction
To explore the influence of modeling methods on model accuracy, multiple stepwise regression (MSR), partial least squares (PLS), back-propagation neural network (BPNN), and random forest (RF) were used to construct Anth estimation models of the tree peony leaves using different transformed spectra. In this study, the predicted residual sum of squares (PRESS) and lowest root mean square error of prediction from cross validation (RMSEPCV) were used to determine the optimal number of LVs and to prevent overfitting. The PRESS statistic determines the number of LVs required to achieve minimum root mean square error (RMSE) between modelled and observed leaf traits. The BPNN used in this study was composed of input, hidden, and output layers. The number of hidden layers of the BPNN model was 1. The number of hidden layer neurons was based on (l: hidden layer neuron, m: input layer neuron, n: output layer neuron, a: constant between 0 and 10), which can constantly adjust the neuron number of the hidden layer to find the model with the highest accuracy. The neuron numbers of both input and output layers were determined by the number of independent and dependent variables. Meanwhile, 10-fold cross verification was used to ensure the stability of the model. The MSR, PLS, and BPNN models were constructed in MATLAB R2016a, and the RF model was constructed in R X64 3.3.3.

Evaluation Index
The coefficient of determination (R 2 ), root mean square error (RMSE), and relative error of prediction (REP) obtained by unitary linear regression of predicted Anth to measured Anth, were selected as the evaluation index of model accuracy. The closer R 2 was to 1, the smaller were the RMSE and REP, indicating the higher accuracy of the model. R 2 , RMSE, and REP were determined using the following equations: where y i represents the measured values, y is the average of the measured value,ŷ i is the predicted value, and n is the number of samples.

Characteristics of Spectrum
Based on the three spectra of tree peony leaves in Figure 4, it is evident that the CR method projects the original spectral reflectance in the range of 400-1500 nm to 0-1, such that the spectral reflectance in the ranges of 400-747 nm, 932-1056 nm, 1109-1267 nm, and 1324-1500 nm shows more obvious variance. Hence, it can be concluded that the CR spectrum is sensitive to the variation in spectral reflectance. The FD spectral conversion method not only removes the baseline but also avoids excessive signal-to-noise ratio reduction in the corrected spectrum. In this study, the maximum FD spectral reflectance of tree peony leaves appeared at 723 nm, indicating that the original spectral reflectance increased most rapidly in this band. This is a unique spectral characteristic of green plants. The minimum value of FD spectral reflectance appeared at 1405 nm, showing that the original spectral reflectance decreased most rapidly in this band. Noticeably, the FD spectrum was sensitive to the rate of change of the original spectral reflectance.

Principal Components of PCA Spectrum
According to the principle of initial eigenvalue greater than 1, a total of nine prin cipal component variables were screened out and labeled F1 to F9, respectively; thei cumulative variance contribution rate was 99.8 %. The PCA results of tree peony leaves spectra are shown in Figure 5.

Vegetation Index of Any Two Bands of FD Spectrum
As shown in the contour map of R 2 shown in Figure 6, the R 2 distribution of DVI and SAVI had obvious similarity, and the distribution of the high value area of R 2 was wide than that of RVI and NDVI. The optimal band combinations of RVI, DVI, NDVI and SAV were (D661 nm, D1475 nm), (D666 nm, D1082 nm), (D664 nm, D720 nm) and (D666 nm D1082 nm), respectively, and the corresponding maximum R 2 was 0.57, 0.48, 0.51 and 0.48, respectively.

Principal Components of PCA Spectrum
According to the principle of initial eigenvalue greater than 1, a total of nine principal component variables were screened out and labeled F1 to F9, respectively; their cumulative variance contribution rate was 99.8 %. The PCA results of tree peony leaves spectra are shown in Figure 5.

Principal Components of PCA Spectrum
According to the principle of initial eigenvalue greater than 1, a total of nine prin cipal component variables were screened out and labeled F1 to F9, respectively; their cumulative variance contribution rate was 99.8 %. The PCA results of tree peony leaves spectra are shown in Figure 5.

Vegetation Index of Any Two Bands of FD Spectrum
As shown in the contour map of R 2 shown in Figure 6, the R 2 distribution of DVI and SAVI had obvious similarity, and the distribution of the high value area of R 2 was wider than that of RVI and NDVI. The optimal band combinations of RVI, DVI, NDVI and SAV were (D661 nm, D1475 nm), (D666 nm, D1082 nm), (D664 nm, D720 nm) and (D666 nm D1082 nm), respectively, and the corresponding maximum R 2 was 0.57, 0.48, 0.51 and 0.48, respectively.

Vegetation Index of Any Two Bands of FD Spectrum
As shown in the contour map of R 2 shown in Figure 6, the R 2 distribution of DVI and SAVI had obvious similarity, and the distribution of the high value area of R 2 was wider than that of RVI and NDVI. The optimal band combinations of RVI, DVI, NDVI and SAVI were (D661 nm, D1475 nm), (D666 nm, D1082 nm), (D664 nm, D720 nm) and (D666 nm, D1082 nm), respectively, and the corresponding maximum  Table 5 shows the correlation coefficient between Anth of tree peony leaves and the principal components of the PCA spectrum, "three-edge" parameters of the FD spectrum, and absorption parameters of the CR spectrum. It is evident that principal components F1, F2, and F8 were significantly positively correlated with Anth (p < 0.05) while F4 and F5 were significantly negatively correlated with Anth (p < 0.05). The highest correlation was with F8, with the correlation coefficient being 0.37. Meanwhile, F7 and F9 had the worst correlation with Anth. Among the "three-edge" parameters, λr had the best correlation with Anth, the correlation coefficient reaching −0.5, followed by SDb and λy. Overall, position-based parameters (λr and λy) had a higher correlation with Anth than those based on area (Dr and Dy) and amplitude (SDr and SDb).

Correlation Analysis between Spectral Parameters and Anth
Taking as the boundary the reflection peaks observed at 550 nm of the green light, CR transformation was carried out at 400-550 nm, 550-788 nm and 400-788 nm. The correlation coefficient between absorption parameters and Anth shows that the correlation between Anth and the absorption parameters at 550-788 nm and 400-788 nm was higher than that at 400-550 nm, and most of the absorption parameters were significantly correlated with Anth (p < 0.05). P at 400-550 nm, NAD at 550-788 nm and RA at 400-788 nm were the absorption parameters with the best correlation with Anth in the corresponding band range.  Table 5 shows the correlation coefficient between Anth of tree peony leaves and the principal components of the PCA spectrum, "three-edge" parameters of the FD spectrum, and absorption parameters of the CR spectrum. It is evident that principal components F1, F2, and F8 were significantly positively correlated with Anth (p < 0.05) while F4 and F5 were significantly negatively correlated with Anth (p < 0.05). The highest correlation was with F8, with the correlation coefficient being 0.37. Meanwhile, F7 and F9 had the worst correlation with Anth. Among the "three-edge" parameters, λr had the best correlation with Anth, the correlation coefficient reaching −0.5, followed by SD b and λ y . Overall, position-based parameters (λr and λy) had a higher correlation with Anth than those based on area (D r and D y ) and amplitude (S Dr and S Db ).

Correlation Analysis between Spectral Parameters and Anth
Taking as the boundary the reflection peaks observed at 550 nm of the green light, CR transformation was carried out at 400-550 nm, 550-788 nm and 400-788 nm. The correlation coefficient between absorption parameters and Anth shows that the correlation between Anth and the absorption parameters at 550-788 nm and 400-788 nm was higher than that at 400-550 nm, and most of the absorption parameters were significantly correlated with Anth (p < 0.05). P at 400-550 nm, NAD at 550-788 nm and RA at 400-788 nm were the absorption parameters with the best correlation with Anth in the corresponding band range.
were the principal components extracted by PCA spectra; λ r , D r , S Dr , λ y , D y , S Dy , λ b , D b and S Db are parameters based on position, amplitude and area of R, G, B bands; TA, LA, RA, S, NAD, BD max and P are the absorption parameters extracted by the CR spectrum. ** p < 0.01, * p < 0.05. Table 6 shows the correlation between the RGB gray vegetation index, texture parameters and Anth. Only NRI was positively correlated with Anth, and VARI, RGRI, GRVI, NGRDI, and DGCI were all significantly negatively correlated with Anth, with DGCI portraying the highest correlation; the correlation coefficient was −0.6. The mean of the R band had the greatest correlation with Anth, with a correlation coefficient of 0.52, while the mean of the B band had the lowest correlation. Overall, the correlation between the texture parameters and Anth was low.

Anth Estimation Based on Hyperspectral (HS) of Different Spectral Transformations
The results of the model's calibration and test sets are shown in Table 7. The R 2 c of the model based on the PCA spectrum was between 0.45 and 0.91, and the R 2 v was between 0.21 and 0.58. The R 2 c of the model based on the FD spectrum ranged from 0.53 to 0.91, and the R 2 v ranged from 0.51 to 0.59. The model based on the CR spectrum had an R 2 c range from 0.57 to 0.87, and an R 2 v range from 0.25 to 0.34. Obviously, there was a large difference in the modeling accuracy with respect to different modeling methods, with a small difference in the test accuracy. Meanwhile, considering R 2 c and R 2 v , the accuracy of the FD spectral model was better than that of the PCA and CR spectral models. In the PCA and FD spectral models, compared with MSR and PLS, BPNN and RF were the better modeling methods. The PCA-RF and FD-RF models had the highest R 2 c , and the PCA-BPNN and FD-BPNN models had the highest R 2 v . However, both the PCA-RF and FD-RF models experienced over-fitting; this was because the amount of training data was small in this experiment, and the model overfitted the training data without considering the generalization ability. The PLS model portrayed moderate accuracy, while the MSR model portrayed good performance only in the calibration set of the PCA model (R 2 c = 0.45, R 2 v = 0.21). This was because both MSR and PLS are linear models, and stable and effective regression can be performed in the presence of multicollinearity of independent variables. However, PLS combines the characteristics of multiple linear regression (MLR), canonical correlation analysis (CCA), and principal component analysis (PCA), and the final model contains the information of all the original independent variables, while the MSR model contains only the information of several important variables [47]. Among the CR spectral models, the accuracy of the test set was low, and overfitting occurred in all of them, which may be because the parameters used in the CR models were extracted based only on the measurements acquired at 400-550, 550-778, and 400-778 nm, and the spectra in these band regions were mainly affected by Chl and less affected by Anth. In general, the FD-RF model had the best calibration accuracy (R 2 c = 0.91, RMSE c = 0.01, REP c = 10.45%), and the FD-BPNN model had the best test accuracy (R 2 c = 0.59, RMSE c = 0.04, REP c = 31.23%). Table 7. Anth estimation models based on principal component analysis (PCA), first-order differential (FD), and continuum removal (CR) spectra.

Anth Estimation Based on Unmanned Aerial Vehicle (UAV) Images
Tree peony Anth estimation models based on UAV VIs and texture parameters were structured in Table 8. The R 2 c ranged from 0.49 to 0.71, and the R 2 v ranged from 0.25 to 0.45; the R 2 c of each model was obviously higher than its R 2 v . Among them, the UAV-RF model had the highest calibration and test accuracy, R 2 c and R 2 v were 0.71 and 0.45, respectively. The UAV-BPNN and UAV-MSR models followed, and the UAV-PLS model had the worst accuracy. This was because the RF model performed regression through repeated binary data, and its sampling method and the generation of decision tree features were random; therefore, the prediction accuracy of the model could be improved without significantly increasing the amount of computation [48]. Compared with the ground-based FD-RF model, the accuracy of the UAV-RF model was low in the calibration and test sets; this is mainly because the ground spectrum had higher spatial resolution and rich band information, and the spectral reflectance obtained of ground objects was more precise than that obtained by the UAV sensor.

Anth Estimation Based on Multi-Source Remote Sensing (RS) Data
To make full use of the rich band of HS, as well as the flexible, fast, and wide monitoring range of UAVs, the multi-source RS model was constructed by combining the parameters extracted from different ground-based spectra and UAV images ( Table 9). The R 2 c ranges of the models based on PCA + UAV, FD + UAV, and CR + UAV were 0.73-0.92, 0.75-0.93, and 0.61-0.91, and the R 2 v ranges were 0.34-0.58, 0.65-0.76, and 0.35-0.56, respectively. Obviously, the model based on FD + UAV had the highest accuracy in both the calibration and test sets, and the RF model had the highest accuracy (R 2 c = 0.93, R 2 v = 0.76), followed by the BPNN model (R 2 c = 0.85, R 2 v = 0.69), whereas the MSR and PLS models had relatively poor accuracy. Among the models based on PCA + UAV and CR + UAV, the modeling accuracy of the RF model was the highest in both the calibration and test sets (R 2 c = 0.92 and 0.91, respectively), and the verification accuracy of the BPNN model was the highest in both the calibration and test sets (R 2 v = 0.58 and 0.56, respectively). The accuracies of the MSR and PLS models were obviously lower. Table 8. Anth estimation model based on unmanned aerial vehicle (UAV) parameters. Compared with the PCA, FD, and CR ground-based models, the accuracy of the multi-source RS model improved in both the calibration set and the test set, and the most obvious improvement was in the model based on FD + UAV. With the addition of UAV information, the R 2 c of the optimal ground-based model increased from 0.91 to 0.93, and R 2 v increased from 0.51 to 0.76; thus, the RF model of FD + UAV was the best multi-source RS model for tree peony leaves Anth estimation. Compared with the optimal UAV model (UAV-RF), R 2 c improved from 0.71 to 0.93 and R 2 v improved from 0.45 to 0.76, and the accuracy of the calibration and test sets improved by 30.99 % and 68.89 %, respectively.

Calibration Set Test Set
In multisource RS models, the RF model showed an obvious superiority over the BPNN, PLS and MSR methods; the optimal models based on PCA + UAV, FD + UAV, and CR + UAV were all constructed by the RF model. Figure 7 shows the fitting effect of predicted Anth on measured Anth for the RF models; the predicted value of the calibration set had a good fitting effect (R 2 c is all over 0.9), and the Anth content was evenly distributed on both sides of the 1:1 line, without obvious aggregation. The predicted values of the test set had a slightly poor fitting effect; when the Anth value was near 0.10, the predicted Anth was close to the measured Anth, whereas when the Anth value was far from 0.10, the Anth prediction portrayed a large deviation from the measured Anth. Among all the models, the RF model of FD + UAV had the best fitting relationship between the predicted and measured Anth values in the calibration and test sets. tion set had a good fitting effect (R 2 c is all over 0.9), and the Anth content was evenly distributed on both sides of the 1:1 line, without obvious aggregation. The predicted values of the test set had a slightly poor fitting effect; when the Anth value was near 0.10, the predicted Anth was close to the measured Anth, whereas when the Anth value was far from 0.10, the Anth prediction portrayed a large deviation from the measured Anth. Among all the models, the RF model of FD + UAV had the best fitting relationship between the predicted and measured Anth values in the calibration and test sets.

Application of Spectral Information Extraction from Hyperspectral (HS) Data
HS sensors collect information over a very large number of wavelengths, equivalent to dozens or hundreds of wavebands. However, due to the large amount of HS data, not all acquired bands are highly correlated with target features. HS data compression methods can be divided into lossless compression and lossy compression. Lossless compression methods are based on statistical redundancy of suppressed data, whereas a lossy algorithm minimizes the data by discarding irrelevant parts of the information. These methods all de-correlated the HS data to represent the inherent information content in a low-dimensional domain [49]. However, from the perspective of coding gain, PCA was considered to be the optimal transformation of gaussian sources. In this study, PCA was used to compress the 400-1500 nm band region into nine principal component variables, which maximized the information of the original spectrum, while greatly reducing the spectral dimension. Notably, the cumulative variance contribution rate of all principal components was as high as 99.8%, which almost entirely represented the spectral information of tree peony leaves in the whole band region, and effectively extracted the reflection spectrum of tree peony leaves.
Among the techniques developed in spectroscopy, derivative analysis has been particularly promising in the application of RS data [50]. The derivative of the spectrum, its rate of change with respect to the wave length, overcame many of the problems of quantitative analysis in a more elegant and efficient manner by comparing ratios and differences [51]. In the field of RS, FD spectroscopy has mainly been used to help locate critical wavelengths. Guo et al. constructed a high-precision estimation model of chlorophyll content in tobacco leaves by constructing a normalized variable (SDr − SDy)/(SDr + SDy) based on the FD spectrum [52]. In this study, "three-edge" parameters and VI extracted from the FD spectrum portrayed a good correlation with Anth, and the model accuracy based on FD and FD + UAV was better than that based on PCA and the CR spectrum, which fully reflected the superiority of the FD method in spectral transformation research.
CR analysis removes the uninteresting absorption features by dividing the reflectance value of each point using the reflectance of the continuum line (convex lobe) at the corresponding wavelength, thus, standardizing and enhancing the specific absorption characteristics of foliar biochemical components [53]. Among the absorption characteristic parameters extracted based on the CR spectrum, the NAD of 550-788 nm had the strongest correlation with Anth, and the correlation coefficient reached 0.63. However, the models built based on CR and CR + UAV experienced over-fitting, which was related to the narrow band range of extracting absorption characteristic parameters and the fact that the spectrum of these regions was mainly affected by the Chl content and less affected by the Anth content.

Advantages of Ground-Based Spectrum and Unmanned Aerial Vehicle (UAV) Data
With the rapid development of UAV and lightweight hyperspectral imaging (HSI) sensors, mini-UAV-borne hyperspectral remote sensing systems have been developed, and demonstrate great value and application potential. In this study, UAV multi-spectral information, texture information, and ground HS data were combined to estimate accurately the Anth content of tree peony leaves, by overcoming the saturation problem related to VI in scenarios of dense canopies [54]; variable structural characteristics of the canopy can also be effectively detected using this method [55]. Notably, this method is superior to Anth estimation using ground HS data and UAV images individually. This is consistent with the results of a study conducted by Zheng et al. [56] in which rice nitrogen content was estimated by vegetation index and texture parameters, based on near-ground and UAV platform spectra.

Machine Learning and Plant Growth Monitoring
Compared with physical radiative transfer models, empirical statistical models have been widely used in the study of plant growth due to their stable, easy-input parameters, and simple modeling methods. Among these, machine learning algorithms can deal with regression problems arising from the complex relationships between independent and dependent variables, while achieving reliable estimations of plant pigment content, nitrogen content, LAI, and biomass [57][58][59][60]. Among the MSR, PLS, BPNN, and RF models constructed in this study, the BPNN and RF models demonstrated obvious superiority, which was consistent with the findings of previous researchers that the RF and BPNN methods had obvious advantages for cotton LAI estimation, citrus pest identification, wetland plant total nitrogen inversion, and winter wheat growth monitoring [61][62][63]. This was because the BPNN model is generally a multilayer feedforward neural network, with signal forward propagation and error back propagation. In this model, the input signals are processed step by step, from the input layer to the output layer, through multiple hidden layers. When the output layer is inconsistent with the desired output, it turns to back propagation, apportioning the error to all the cells in each layer. Error signals are used to correct the weights of each unit, so that the predicted output of BPNN is consistently close to the expected output [64]. The BPNN models in this study all set the hidden layer number to 1, according to newff function, and thus we could develop an optimal training model by constantly changing the hidden layer node number. Finally, the hidden layer nodes of the PCA-BPNN, FD-BPNN, and CR-BPNN optimal models were 6, 8, and 10, respectively, and the hidden layer nodes of the BPNN models based on PCA + UAV, FD + UAV, and CR + UAV were 10, 8, and 9, respectively. With the addition of UAV information, the accuracy of the BPNN model based on multi-source RS data greatly improved compared with the model constructed from single-source RS data. The most obvious improvement was in the model based on CR + UAV, where R 2 c increased from 0.59 to 0.73 and R 2 v increased from 0.27 to 0.56. This is consistent with previous findings that the BPNN model has a positive effect on net primary productivity estimation, as shown by Yan et al., and for soil pH study, as reported by Huang et al. [37,65]. The RF model applied an integration algorithm, with high accuracy and generalization ability. It performed regressions through repeated dichotomous data, and the generation of sampling method and decision tree features was random; therefore, we could increase the prediction accuracy significantly, without significantly increasing the amount of computation [66]. In the PCA-RF, FD-RF, and CR-RF models constructed in this study, R 2 c was greater than 0.85, which was higher than the accuracy of other models constructed on the same spectrum, but its R 2 v was slightly less than that of other models. Thus, we could deduce that the addition of UAV information improves the R 2 c and R 2 v of RF models based on PCA + UAV, FD + UAV, and CR + UAV, and to some extent overcomes the over-fitting difficulties of models based on ground spectrum data.

Conclusions
The key to effective Anth estimation based on spectral reflectance is to find the band or spectral parameters closely related to the pigment. In this study, we first analyzed the characteristics of PCA, FD and CR ground-based hyperspectral data of tree peony leaves. Then, the Anth of tree peony leaves was estimated using multiple methods (MSR, PLS, BPNN and RF) based on the common spectral parameters extracted from three kinds of transformed spectra, and the best spectral transformation method and the best precision model were obtained. However, the ground-based hyperspectral model was not sufficient to estimate Anth. Therefore, to improve the Anth estimation accuracy, we added 8 RGB gray vegetation index and texture parameters closely related to Anth based on UAV spectral extraction, in combination with ground hyperspectral to build a multi-source estimation model. In addition, to compare the Anth estimation ability of spectral information using different platforms and the advantage of multi-source remote sensing data compared with single remote sensing data, we also built an Anth estimation model based solely on UAV images. The main conclusions were as follows: 1.
In the HS Anth estimation models constructed based on the three transformed spectra, the RF model based on "three-edge" parameters and VI of any two bands had the highest fitting accuracy, which can provide a reference for the selection of the spectral transformation method and regression model in crop growth monitoring in the future.

2.
Compared with the ground hyperspectral model and the visible UAV model, the accuracy of the multi-source RS models greatly improved. The addition of UAV data enriched the RS information used for near-surface estimation, which improved the accuracy of the model.