Estimation of Forest Aboveground Biomass and Leaf Area Index Based on Digital Aerial Photograph Data in Northeast China

Forest aboveground biomass (AGB) and leaf area index (LAI) are two important parameters for evaluating forest growth and health. It is of great significance to estimate AGB and LAI accurately using remote sensing technology. Considering the temporal resolution and data acquisition costs, digital aerial photographs (DAPs) from a digital camera mounted on an unmanned aerial vehicle or light, small aircraft have been widely used in forest inventory. In this study, the aerial photograph data was acquired on 5 and 9 June, 2017 by a Hasselblad60 digital camera of the CAF-LiCHy system in a Y-5 aircraft in the Mengjiagang forest farm of Northeast China, and the digital orthophoto mosaic (DOM) and photogrammetric point cloud (PPC) were generated from an aerial overlap photograph. Forest red-green-blue (RGB) vegetation indices and textural factors were extracted from the DOM. Forest vertical structure features and canopy cover were extracted from normalized PPC. Regression analysis was carried out considering only DOM data, only PPC data, and a combination of both. A recursive feature elimination (RFE) method using a random forest was used for variable selection. Four different machine-learning (ML) algorithms (random forest, k-nearest neighbor, Cubist and supporting vector machine) were used to build regression models. Experimental results showed that PPC data alone could estimate AGB, and DOM data alone could estimate LAI with relatively high accuracy. The combination of features from DOM and PPC data was the most effective, in all the experiments considered, for the estimation of AGB and LAI. The results showed that the height and coverage variables of PPC, texture mean value, and the visible differential vegetation index (VDVI) of the DOM are significantly related to the estimated AGB (R2 = 0.73, RMSE = 20 t/ha). The results also showed that the canopy cover of PPC and green red ratio index (GRRI) of DOM are the most strongly related to the estimated LAI, and the height and coverage variables of PPC, the texture mean value and visible atmospherically resistant index (VARI), and the VDVI of DOM followed (R2 = 0.79, RMSE = 0.48).


Introduction
Nowadays, global climate change is becoming a major challenge for current and future generations.Forests play crucial roles in adjusting the global and regional carbon cycle and bioenergy consumption.
Forest aboveground biomass (AGB) is a key biophysical parameter.It can provide important vegetation information about growth, health, and productivity [1,2].Also, it is often required by the implementation of effective climate policies [3,4].Additionally, the leaf area index (LAI), another important biophysical parameter, could provide more detailed canopy structure information [5].Therefore, the accurate estimation and prediction of these two forest parameters is of great importance.
Traditional forest investigation relies on manual ground measurements with intensive time and high costs, such as destructive individual tree sampling and non-destructive field measurements.The obvious advantage is that they can provide results with relatively high accuracy [6,7].In an attempt to obtain a more efficient estimation of forest parameters under different scales and in various environments, remote sensing techniques have been commonly applied over the past few decades.Data obtained from different sensors, such as optical cameras [8][9][10], radar [11,12], and terrestrial [13][14][15] or airborne [16][17][18] light detection and ranging (LiDAR), are significantly more efficient and have a lower cost than laborious ground-based estimation methods and have become widely used to characterize forest structure [19].
It is known that AGB and LAI cannot be directly obtained using remote sensing techniques; they are usually estimated by establishing a regression relationship between parameters derived from remote sensing data.However, it is worth noting that one problem when using optical remote sensing or radar data for estimation is saturation.Therefore, under high biomass or canopy density, the estimation accuracy, especially for biomass, is often under estimation [12,20,21].LiDAR can provide quite accurate 3-D information and a reliable estimation, but the cost to obtain data is high; therefore, it is not suitable for continuous monitoring over large areas.
In order to obtain accurate results, many researchers have tried to combine multispectral and textural information from optical remote sensing data and vertical structure information from LiDAR data to estimate forest parameters [22][23][24][25][26].These findings have demonstrated that estimation accuracy could be improved by making most use of the potential by combining these two types of data [27,28].
However, due to the influence of cloud and haze, it is sometimes difficult for satellites to capture high-quality images in the short term.On the other hand, and as mentioned above, LiDAR data acquisition and processing costs are too high to perform monitoring of large forest areas [29].Both satellite remote sensing and LiDAR data have some limitations in high phase and extensive forest inventory applications.
With the recent development of unmanned aerial vehicle (UAV) technology over the past few years, aerial photographs from a digital camera mounted on an UAV or light, small aircraft have been widely used in forest inventory [30][31][32][33][34][35], and they generally have good affordability and availability.One thing should be mentioned here is that most of the relevant research used either digital orthophoto mosaic (DOM) or photogrammetric point cloud (PPC).Dandois et al. [36] used "Structure from Motion (SfM)" computer vision algorithms to extract canopy structure and spectral attributes based on red-green-blue (RGB) aerial images.Understory digital terrain models (DTMs) and canopy height models (CHMs) were generated from leaf-on and leaf-off point clouds using procedures commonly applied to LiDAR point clouds.CHMs were strong predictors of field-measured tree heights (R 2 = 0.63-0.84).Mathews et al. [35] also used SfM computer vision algorithms to obtain high-density point clouds.Points near samples were extracted and input into a stepwise regression model to predict LAI.The final R 2 value was approximately 0.57.Ota et al. [37] investigated the capabilities of CHM derived from aerial photographs using the SfM approach to estimate AGB in a tropical forest and yielded an accurate estimate (R 2 = 0.79).These successful applications show the potential of the SfM algorithm.
In addition, recent advances in commercial software based on computer vision algorithms such as Pix4DMapper (https://pix4d.com/,Pix4D S.A. Lausanne, Switzerland) and Agisoft Photoscan (https: //www.agisoft.com/,Agisoft LLC, St. Petersburg, Russia) have enabled the mass production of digital surface models (DSMs) using the SfM algorithm with a much higher level of automation and much greater ease of use [38].Applying the SfM approach enables us to produce high spatial resolution photogrammetric point clouds with vertical structure features similar to those derived from airborne LiDAR and DOM similar to satellite optical images with horizontal features.
In terms of the regression model used for building the relationship, many researchers have applied machine-learning methods in forest inventory and have obtained better accuracy in the past few years [39,40].It is well known that the near-infrared (NIR) domain is a good indicator of vegetation health; we imitated the structure of the NIR band to construct the vegetation indices based on only red-green-blue three spectral bands in this study.To fully explore the advantages of digital aerial photographs (only RGB bands, not including the NIR band) in forest inventory, we focus on the estimation of AGB and LAI based on only DOM, only PPC, and DOM + PPC using a machine-learning method.This study will achieve the following goals: (1) processing digital aerial photograph (DAP) data based on the SfM approach to generate PPC and DOM and (2) estimating forest AGB and LAI based on only DOM, only PPC, and a combination of both.

Study Area
The study area is located at the Mengjiagang forest farm (46 • 26 N, 130 • 43 E) of Jiamusi city, in Heilongjiang Province of China (Figure 1), which is influenced by a temperate continental climate.Annual precipitation occurs mainly in summer.The study area covers an approximate land area of 260 km 2 , which varies in elevation from about 180 m to 450 m above sea level.The land is relatively flat without an extreme slope.The dominant tree species is Larch (Larix gmelinii (Rupr.)Rupr.), followed by Korean pine (Pinus koraiensis Sieb.et Zucc.),Scots pine (Pinus sylvestris L.var.mongolica Litv.), and Spruce (Picea asperata Mast).

DAP Data Collection
DAP and LiDAR data were acquired at the same time on 5 and 9 June 2017 by the Chinese Academy of Forestry (CAF) using a Hasselblad60 digital camera and Riegl LMS-Q680i of CAF-LiCHy system in a Y-5 aircraft.The average flight altitudes were 950 m and 1250 m above ground level (AGL), and the mean ground sampling distance (GSD) of DAP images was 9 cm and 13 cm, respectively.The average flight speed was 40 m/s, and the mean forward overlap (FO) and the mean overlap between flight lines of DAP images were about 70% and 50%, respectively.In terms of aerial photograph data, the camera focal length, image size, and pixel size inside the camera are 50 mm, 8964 × 6716 pixels, and 6.0 µm respectively.The details of the flight parameters are shown in Table 1.The standard plots were set based on comprehensive field exploration.The shape of the plot was rectangular, and the area of the plot was determined according to factors such as tree age, the density of the forest, and site quality.In this study, we chose three kinds of tree ages: mature forest, middle forest, and young forest.We established three density plot types: dense young tree plots, dense middle tree plots, and sparse mature large tree plots.There should be at least 50 trees in every plot in the mature forest, at least 70 trees in the middle forest, and at least 90 trees in the young forest.
According to this principle, an original area of 400 m 2 could be set in advance, and then the numbers of individual trees in each plot should be counted to determine the area of the plot.There were six different area sizes, which were 400 m 2 (20 m × 20 m), 600 m 2 (20 m × 30 m), 900 m 2 (30 m × 30 m), 1000 m 2 (25 m × 40 m), 1200 m 2 (30 m × 40 m), 1500 m 2 (30 m × 50 m).The numbers of plots of Korean pine, Scots pine, and Larch are 16, 5, and 35 respectively.The field data of 9 Larch plots were collected in July 2016.The field data of 26 Larch plots were collected under leaf-on canopy conditions from 4 to 20 June 2017.The field data of all Korean pine and Scots pine plots were measured from 20 to 30 August 2017.In addition to these three main tree species, there are Elm (Ulmus laciniata (Trautv.)Mayr), Linden (Tilia mandshurica Rupr.et Maxim), Aspen (Populus tomentosa Carr), Oak (Quercus mongolica Fisch.ex Ledeb.),Silver birch (Betula platyphylla Suk.), Maple birch (Betula costata Trautv.),Black birch (Betula davurica Pall.), and Ashtree (Fraxinus mandschurica Rupr.) in the plots.Fifteen Korean pine plots, three Scots pine plots, and nineteen Larch plots cover 600 m 2 .One Scots pine plot covers 400 m 2 .Nine Larch plots and one Scots pine plot cover 900 m 2 .One Scots pine plot and one Larch plot cover 1000 m 2 .Two Larch pine plots cover 1200 m 2 .Four Larch pine plots cover 1500 m 2 .
The quadrangle boundaries of field plots were firstly measured using GPS and tape.Based on the ground base station data, the accurate locations of plots were obtained by a differential global positioning system (DGPS).The differential total accuracy was between 0.5 m and 0.8 m.
Tree species, diameter at breast height (DBH), tree height, and the crown width of all living trees with a DBH greater than 5 cm were measured in each plot using tape or meters.The lengths of the east-west and south-north directions of the tree crown were measured with the tape, and the average value of two lengths was taken as the crown width.Tree species and DBH of all dead trees were measured.The statistical results of living trees are shown in Table 2.We calculated the AGB of each measured living tree using these equations of different tree species based on tree height and DBH in Table 3.

Tree Species AGB Equation Reference
Korean pine WT = 0.027847(D 2 H) 0.956544 [41] Scots pine WT = 0.3364D 2.0067 + 0.2983D We only used the AGB of living trees in this study and did not consider the AGB of dead trees.The AGB of each plot was then calculated by summing up the AGB of each living tree.
We used two methods to calculate the AGB of living trees.One was on the basis of five tree species: Korean pine, Scots pine, Larch, Birch (Sliver birch, Maple birch and Black birch), and Soft broad-leaf trees (Aspen, Elm, Linden, Oak, and Ashtree).Another was on the basis of all 11 tree species.The statistical results of all plots are shown in Table 4.As we can see from Table 4, the results of the two methods are not very different.Thus, we used the AGB results of five tree species to build a model and analyze retrieval results in this study.

LAI Field Measurements
LAI field data were obtained from 5 to 20 June, 2017.There were 192 circular plots at a radius of 15 m.The LAI in each plot was measured using a LAI-2000 Plant Canopy Analyzer (https://www.licor.com/, LI-COR Corporate, Lincoln NE, USA).In total, 12 points in four perpendicular directions were measured in every plot, and the average value of 12 points was calculated as the field measured data.The central positions of these plots were first measured using a GPS instrument.Then, based on the ground base station data, the accurate locations of plots were obtained using DGPS.The differential total accuracy was between 0.5 m and 0.8 m.There are four species in the plots: Spruce (Picea asperata Mast), Korean pine (Pinus koraiensis Sieb.et Zucc.),Scots pine (Pinus sylvestris L. var.mongolica Litv.), and Larch (Larix gmelinii (Rupr.)Rupr.).The statistical results of the LAI field data are shown in Table 5.  1 Training, the number of plots used to train in study result of the paper. 2Validation, the number of plots used to validate in study result of the paper.

Methods
The research method consists of four parts: data preprocessing, features extraction, selection of the variables, and the model and validation (Figure 2).

LiDAR Data Pre-Processing
LiDAR data pre-processing was carried out by the CAF.This included noise removal of the point cloud and point cloud classification, which gives ground and non-ground echoes according to the proprietary algorithm implemented in the TerraScan software (https://www.terrasolid.com,TerraSolid Ltd., Helsinki, Finland).A digital terrain model (DTM) with a 1-m spatial resolution was derived using a triangulated irregular network (TIN) interpolating method from the ground-classified points.

DAP Data Pre-Processing
First, we combined an onboard inertial measurement unit (IMU)/global positioning system (GPS) with camera exposure position information and ground base station data of the closest official reference points of the Heilongjing Bureau of Surveying and Mapping Geoinformation to generate the accurate position and attitude information of every photo taken using DGPS.The overall accuracy was between 10 cm and 15 cm.We used the SfM algorithm to generate a DSM dense point cloud and DOM from an overlapping collection of digital aerial photographs in the proprietary software Pix4DMapper Professional Edition 2.1.0(64 bit) (https://pix4d.com/,Pix4D S.A. Lausanne, Switzerland).The resolution of the DOM is 0.1 m, and the tolerance of data processing is 0.02 m.SfM is the process of estimating the 3D structure of a scene from a set of 2D images.SfM requires point correspondences between images.Corresponding points were identified either by matching features or tracking points from image 1 to image 2 [43].The fundamental matrix describes the epipolar geometry of two images and is computed using the corresponding points of two images.The orientation and location in the specified coordinate system are returned by relative and absolute orientation calculation.The 3D locations of matched points are determined using triangulation.
We compared the DSM point cloud from the DAP data to the DSM point cloud from the LiDAR data (Figure 3).The results showed that the average range deviation between the LiDAR point cloud and the DAP point cloud results was less than 0.5 m.RGB vegetation indices were constructed by imitating the structure of the NIR band from 0.1-m resolution DOM data.We used the digital number (DN) value to take the place of reflectance in this study (Table 6).Six kinds of vegetation indices were created using DN values of the red, green, and blue bands.

Vegetation Indices Name Equations 1 References
Visible differential vegetation index (VDVI) Green red ratio index (GRRI) DN G DN R [46] Green blue ratio index (GBRI) DN G DN B [47] Red blue ratio index (RBRI) DN R DN B Four textural features (mean, homogeneity, dissimilarity, and correlation) were extracted from the first principal component of the 0.1-m DOM in this study (Table 7).Every feature includes four different window sizes of 4.5 m, 6.5 m, 10.1 m and 25.1 m.
The mean values of vegetation indices and textural features were calculated using R packages including caret [48], raster [49] and dplyr [50].

Feature Name
Equations 1   Mean √ variance i variance j 1 i is the line number of the gray level co-occurrence matrix.j is the column number of the gray level co-occurrence matrix.P ij is normalized co-occurrence matrix.ME is the mean value of the gray level co-occurrence matrix.VA is the variance value of the gray level co-occurrence matrix.
CC is the index that describes the degree of canopy connection of trees, and it is the ratio of canopy projected area to woodland area.CC was generated from the normalized point cloud.We used the height threshold to distinguish between ground points and tree points.In the study, three types of data sources were used to estimate forest AGB and LAI.We merged all features from DOM, all structural variables from PPC, and a combination of both.Because there are four different texture window sizes, we need to separately merge all parameters.Because of high correlations among variables, the correlation was statistically calculated using R packages including caret [48] and corrplot [55].We tested for multicollinearity between variables and removed variables with Pearson's r > 0.8 to reduce the redundancy of variables.In AGB retrieval, 10 variables for DOM, 6 variables for PPC, and 16 variables for a combination of both were selected after removing redundant variables.In LAI retrieval, 10 variables for DOM, 10 variables for PPC, and 20 variables for a combination of both were selected after removing redundant variables (Table 9).Random forest (RF) is a natural multiclass algorithm with an internal measure of feature importance.The random forest recursive feature elimination (RF-RFE) selection method is basically a recursive process that ranks features according to some measure of their importance [56].
According to the importance of variables and referring to the model accuracy of cross validation, the influence of each variable on the cross-validation accuracy of the model was considered iteratively from the most important variable to the least important one.Following a machine-learning (ML) regression analysis, the joint RF-RFE algorithm was carried out to estimate AGB and LAI based on DOM variables, PPC variables, and a combination of both.For each of the three data sources, we used four ML model algorithms: random forest (RF), supporting vector machine (SVM), k-nearest neighbor (KNN) and Cubist to build the model.The tuning methods of the four models and RF-RFE operation were achieved using R package, including caret [48], e1071 [57], cubist [58] and randomForest [59].In the Cubist model, the value range of number parameter committees of model trees is from 1 to 50, with a step of 1.The range of the nearest-neighbor sample neighbors is from 0 to 9, with a step of 1.The tuneLength of mtry is 7 and ntree is 1000 in the RF model.The tuneLength of k in the KNN model is 50.The values of kernel function gamma are 0.5, 1, and 2. The values of the punish coefficient cost are 0.1, 1, 10, and 100.
For DOM and PPC alone, we chose the first half of all variables.For DOM + PPC, we chose the first third of total variables based on RFE results to participate in THE ML model.Thus, five variables of DOM, three variables of PPC and six variables of DOM + PPC were used in the estimation of AGB.Five variables of DOM, five variables of PPC, and seven variables of DOM + PPC were chosen in the estimation of LAI.
In total, 70% of samples (approximately 40 plots for AGB and approximately 135 plots for LAI) were randomly selected for training, and 30% of samples (approximately 16 plots for AGB and approximately 57 plots for LAI) were selected for validation in this study.For regression analysis, we ensured the distribution of different densities and different tree species plots in training and validation samples.

Model Accuracy Evaluation
After the regression model was established, the coefficient of determination, R-Square (R 2 ), and root mean square error (RMSE) were used to assess the goodness and accuracy of the established models.The larger the R 2 value, and the stronger correlation.The smaller the RMSE value, the higher the predicted accuracy.R 2 and RMSE were calculated using Equations ( 1) and (2).
where n is the number of plots, y i is the ground field measurement reference value of AGB or LAI for plot i; y i is the average value of y i ; y i is the model estimate value of AGB or LAI.

Results
About 70% of plot measurements were used to build the regression analysis model, and 30% were retained to validate the model.By training and comparing AGB and LAI retrieval results based on textural features of four window sizes using same method, we chose features of 6.5 m window size to retrieve AGB and LAI.The tuning results of four machine-learning models for the estimation of AGB and LAI from the three data sources are shown in Table 10.
Table 10.The best combination of parameters for the four models for the estimation of AGB and LAI.The variables used for final modeling are shown in Table 11, ranked according to the importance.As we can see from Table 11, vegetation indices are the most significant variables for the estimation of AGB in models obtained using DOM data.In addition, the texture mean variable is important.In those models comprising only PPC variables, height percentile and statistic are the most significant variables.Concerning DOM + PPC variables, height and coverage variables are the most significant variables; texture and vegetation index variables followed.

Models
For estimation of LAI, the green red ratio index (GRRI) is the most significant variable in DOM models; the texture mean variable followed.In PPC models, coverage and height percentile are the most significant variables.In the combined DOM + PPC variables, the coverage variable is the most significant, followed by the vegetation index variable.
As we can see from Figure 5, for field data versus estimated data in the estimation of AGB, the optimal values of R 2 and RMSE in DOM alone are 0.65 and 21 t/ha, respectively.In PPC alone, the optimal values are 0.7 and 26 t/ha.In DOM + PPC, the values are 0.73 and 20 t/ha, respectively.The results obtained using a combination of DOM and PPC provide increased R 2 accuracy.DOM results are particularly poor.
It is worth noting that the differences between PPC and DOM + PPC models are less marked.In some cases, PPC variables alone provide better results.It is apparent that R 2 is much smaller for the model based only on DOM variables compared to the two others.PPC and DOM + PPC models provide very similar results.
The field data versus the estimated data in the estimation of LAI are shown in Figure 6.The values of R 2 and RMSE in DOM alone are 0.73 and 0.49, respectively.In PPC alone, the values are 0.65 and 0.57, respectively.In DOM + PPC, the values are 0.79 and 0.48, respectively.The results obtained using the combination of DOM and PPC provide increased R 2 accuracy.PPC results are particularly poor.
The differences between DOM and DOM + PPC models are less marked.DOM variables alone provide better results.It is apparent that R 2 is much smaller for the model based only on PPC variables compared to the two other models.In particular, DOM and DOM + PPC models provide very similar results.

Discussion
As important ecological and biophysical parameters, AGB and LAI were estimated using three data sources (DOM data alone, PPC data alone, and DOM + PPC data) from the DAP data in this study.Our findings showed that combining DOM and PPC data could improve estimation accuracy for AGB and LAI.
Better accuracy (R 2 = 0.73 and RMSE = 20 t/ha) for the AGB estimation based on a Cubist regression model was achieved from the combined model processing of vertical structure, vegetation indices, and textural features from the combination of DOM and PPC data.The retrieval result of AGB from height variables of PPC data was better than the textural feature and vegetation indices from DOM data.Height and coverage variables derived from PPC data were the first three selections.This shows that height is a key parameter in the estimation of AGB.Studies using airborne LiDAR or a combination of LiDAR and aerial photographs have shown how height can be used to estimate AGB [4,37,60].Our study has confirmed that height is particularly important and a suitable index to AGB estimation when we used photogrammetric point cloud from the results of SfM approach processing.Ota et al. [37] obtained aboveground biomass using aerial photographs in a seasonal tropical forest.Canopy height models from aerial photograph DSM and LiDAR DTM yielded higher accuracy (R 2 = 0.93).Hansen et al. [61] estimated forest biomass based on empirical relationships between field-observed biomass and variables derived from LiDAR data, and a relatively lower accuracy was obtained (R 2 = 0.71, RMSE = 158 Mg/ha).Our accuracy of AGB estimation from combined DOM and PPC data is between Ota's and Hansen's results.
The R 2 and RMSE values of LAI estimation based on the SVM regression model from the combined DOM and PPC data were 0.79 and 0.48, respectively.Canopy cover (CC) from PPC data and the green red ratio index (GRRI) from DOM data contributed better results.It is worth noting that the GRRI is the second most important variable after CC.This indicates that GRRI has a strong contribution to LAI estimation.Previous studies using LiDAR, satellite images, or a combination of both showed similar results in estimating LAI.Ma et al. [62] estimated LAI based on full-waveform LiDAR data using a radiative transfer model.The R 2 and RMSE values of estimated LAI were 0.73 and 0.67, respectively.Mathews et al. [35] used a stepwise regression model to predict LAI based on a high-density point cloud using unmanned aerial vehicle (UAV) collection.The final result of the R 2 value was 0.57.Omer et al. [63] used spectral vegetation indices calculated from WorldView-2 data to predict LAI at the tree species level using support vector machines and artificial neural networks machine learning regression algorithms.They obtained better accuracy (R 2 = 0.75, RMSE = 0.05).Ma et al. [23] used the canopy height variable from LiDAR and the BRDF/Albedo variable from MODIS optical data to estimate LAI.The highest R 2 value was 0.73.Our accuracy is slightly higher than previous study results, and our study has confirmed that canopy coverage and vegetation index are particularly important and suitable indices to LAI estimation.
Compared with PPC data, DOM data had lower estimation accuracy (R 2 = 0.65, RMSE = 21 t/ha) of AGB, but the estimation accuracy (R 2 = 0.73, RMSE = 0.49) of LAI was higher.Similar findings were reported [9,63].Summarizing these results above, we believe that the spectral and textural features had a larger contribution to LAI retrieval than vertical height features, and the forest vertical height features had greater effects on AGB retrieval.Our study is slightly different from previous studies.The differences are mainly because of the different data sources, different types of vegetation, biomass abundance, and canopy density.
As we can see in Figures 5 and 6, a combination of DOM and PPC data could improve the estimation accuracy of AGB and LAI compared with either data source alone.The combined DOM and PPC data yielded the highest estimation accuracy for AGB (R 2 = 0.73, RMSE = 20 t/ha) and LAI (R 2 = 0.79, RMSE = 0.48).These estimations were made using machine-learning with the Cubist regression model and SVM regression model, respectively.Our study has demonstrated the ability to estimate AGB and LAI using the combination of DOM and PPC from DAP data with the SfM approach.
The results of our study using only DAP data are consistent with previous studies using LiDAR and other optical remote sensing data.However, the acquisition and processing cost of DAP data is much lower than that of LiDAR, while the spatial resolution is much higher than that of optical remote sensing data.Using DAP data to estimate forest parameters is worthy of further exploration and research.Our study could provide valuable guidance for accurate AGB and LAI estimation using DOM+PPC data from DAP in boreal coniferous forests.
In this study, with the exception of DEM from LiDAR data, we used DAP data alone with only red, green, and blue visible bands to estimate AGB and LAI.In previous studies, multispectral vegetation indices were used [63].The near-infrared band can provide much better information about vegetation health [64].We imitated the principle of multispectral vegetation indices to construct visible light vegetation indices.The results may have been improved if we added the spectral information of the near-infrared band.This work will be performed in a future study.
For a future practical application of the results from this study in other areas, we can consider all variables (vertical features and horizontal features) in this paper as realistic approaches.

Conclusions
The objective of this study is to explore the ability of retrieving forest parameters only using DOM, PPC, and DOM+PPC from RGB-only DAP data.In the paper, three analyses on DOM, PPC, and combined DOM and PPC data for the estimation of AGB and LAI have been presented.The study indicates that a combination of DOM and PPC data is useful as it provides a slight increase in estimation accuracy.Height and coverage variables of PPC, texture mean value, and visible differential vegetation index (VDVI) of DOM are significantly related to the estimation of AGB (R 2 = 0.73, RMSE = 20 t/ha).The canopy cover of PPC and green red ratio index (GRRI) of DOM are the most strongly related to the estimation of LAI, followed by the height and coverage variables of PPC, texture mean value, the visible atmospherically resistant index (VARI), and the VDVI of DOM (R 2 = 0.79, RMSE = 0.48).The model derived from only DOM data provides lower accuracy than only PPC data for the estimation of AGB at the Mengjiagang forest farm.In terms of LAI estimation, the result is different.Variables from either DOM or PPC data alone can provide the majority of the explanatory contribution for LAI estimation.

Figure 1 .
Figure 1.Location of the study area and distribution of field measurements.Red " , , +, " points are the sites of center coordinates of the Scots pine, Korean pine, Larch and Spruce, respectively, for LAI observations.Yellow locations are the sites of Larch plots in 2016.Dark blue locations are the sites of Larch plots in 2017.Light blue locations are the sites of Scots pine plots.Purple locations are the sites of Korean pine plots.

Figure 2 .
Figure 2. Technical flow chart of the process of retrieving AGB and LAI.

Figure 3 .
Figure 3. Matching the figures of the digital aerial photograph (DAP) point cloud and the light detection and ranging (LiDAR) point cloud.Red points are LiDAR point cloud and the green points are the DAP point cloud.

Figure 4 .
Figure 4. Data pre-processing results of digital aerial photographs and LiDAR.

Figure 5 .
Figure 5. Prediction accuracy of the four models for AGB estimation based on the three sources of data.

Figure 6 .
Figure 6.Prediction accuracy of the four models for LAI retrieval based on the three data sources.

Table 1 .
Details of digital aerial photograph and LiDAR data specifications.

Table 2 .
Summary of the field data (living trees).

Table 4 .
Summary of field AGB (living trees).

Table 5 .
Summary of the leaf area index (LAI) field data.

Table 8 .
Forest vertical structure parameters from the normalized point cloud.

Table 9 .
Variables from DOM and PPC after reducing redundancy.

Table 11 .
The variables used in modeling.