Estimation of Forest Structural Diversity Using the Spectral and Textural Information Derived from SPOT-5 Satellite Images

Uneven-aged forest management has received increasing attention in the past few years. Compared with even-aged plantations, the complex structure of uneven-aged forests complicates the formulation of management strategies. Forest structural diversity is expected to provide considerable significant information for uneven-aged forest management planning. In the present study, we investigated the potential of using SPOT-5 satellite images for extracting forest structural diversity. Forest stand variables were calculated from the field plots, whereas spectral and textural measures were derived from the corresponding satellite images. We firstly employed Pearson’s correlation analysis to examine the relationship between the forest stand variables and the image-derived measures. Secondly, we performed all possible subsets multiple linear regression to produce models by including the image-derived measures, which showed significant correlations with the forest stand variables, used as independent variables. The produced models were evaluated with the adjusted coefficient of determination (Radj) and the root mean square error (RMSE). Furthermore, a ten-fold cross-validation approach was used to validate the best-fitting models (Radj > 0.5). The results indicated that basal area, stand volume, the Shannon index, Simpson index, Pielou index, standard deviation of DBHs, diameter differentiation index and species intermingling index could be reliably predicted using the spectral or textural measures extracted from SPOT-5 satellite images.


Introduction
Forests are the largest territorial ecosystems and play a significant role in providing us with economic benefits as well as ecological services [1,2].In the past, commodity production was the most significant objective of forest management and forests were mainly managed for producing wood for timber, pulp, and fuel.In recent decades, however, ecological services provided by forests such as soil and water conservation, combating climate change, biodiversity conservation as well as the recreational values of forest landscapes have been highlighted due to the worsening environmental problems [2][3][4].In this context, multiple-purpose forest management has been proposed as a silvicultural alternative to traditional management regimes specialized for even-aged, mono-specific stands [5][6][7][8].Managing forests as complex systems to achieve multiple objectives is receiving increasing attention.[9,10].System structure determines system function [11].Likewise, forest functions are determined by forest structure.Forest structure, therefore, is a fundamental base to formulate a sound forest management regime aimed at various objectives.Forest structure can be quantitatively represented by different forest stand variables that vary with respect to detail.For the management of even-aged, mono-specific stands, the conventional forest stand variables such as number of trees (NT), stand volume per unit area (SV), basal area (BA), and quadratic mean diameter (QMD) are considered to be sufficient to prescribe management strategies because such types of forests have a simple structure and are easy to manage.However, these variables fail to describe the complex forest structure of uneven-aged, mixed-species, irregular forests managed for multi-purposes and hence more detailed information should be included.Corona [12] documented that forest inventory and mapping are broadening their scope towards multipurpose resources surveys in the context of global change, utilities from ecosystem management and recent change in forest management perspective.
Forest structural diversity provides a more detailed description of forest stands and is a significant component of forest structure.It can be subdivided into three categories: tree species diversity, tree dimension diversity or tree size diversity, and tree position diversity [13,14].Forest structural diversity is expected to provide several potential applications as tools for forest management planning.For instance, Lexerod and Eid [15] and O'Hara [16] argued that selective cuttings are most profitable in stands of high tree size diversity whereas clearcuttings should be suggested if the tree size diversity is low.Based on species intermingling, i.e., an index of species diversity, Bettinger and Tang [17] formulated tree-level harvest optimization for structure-based forest management.
The conventional means for the collection of these forest stand variables is via national or regional forest field inventories.For example, in China, there are three types of forest inventories, namely, the national forest inventory (NFI), the forest management planning inventory (FMPI), and the forest operation design operational inventory [18].In these types of forest inventories, ground plots are installed and forest stand variables are recorded.In the past, the most commonly investigated variables were the conventional forest stand variables as the forest inventory at that time was directly related to timber assessment [19].However, in the context of multiple-purpose forest management, forest inventories are currently evolving towards multipurpose resource surveys and are broadening their scope in several directions [19][20][21].Biodiversity is one of the most popular newly included variables as it is an essential prerequisite to support management decisions to maintain multiple forest ecosystem functions in the long term [19].Forest inventory by field surveys can indeed provide us with the information about conventional forest stand variables and forest biodiversity for forest management planning as well as forest policy formulation, but they are time consuming, expensive and not spatially exhaustive [22,23].Furthermore, these inventories are conducted periodically, e.g., five years for NFI and 10 years for FMPI, and therefore up-to-date information cannot be guaranteed.
Remote sensing is widely used as an effective and supportive tool for extracting forest attributes because of its wide scale, rapid data collection and cost savings [1,24].A large number of studies have been reported with respect to the estimation of forest stand variables or forest mapping using remote sensing data.The most commonly used remote sensing data in forestry include airborne LiDAR data and optical multispectral satellite data, which could be further divided into high spatial resolution satellite data (0.6 m-4 m) and synoptic satellite data of relatively lower resolution [25].The application of airborne LiDAR and high spatial resolution satellite data are limited due to their high cost as well as small coverage (e.g., swath width of Quickbird, Ikonos, Worldview 1, Worldview 2, and Worldview 3 are 16.8 km, 11.3 km, 17.6 km, 16.4 km, and 13.1 km, respectively), though they are promising in certain applications.For instance, LiDAR was reported to provide promising estimates of forest biomass [26][27][28], tree height [29][30][31], and detection of individual tree crowns [32][33][34].In comparison, the other optical multispectral satellite data of lower spatial resolution had a relatively large coverage area (e.g., the swath width at the nadir of RapidyEye, SPOT-5, Landsat and MODIS is 77 km, 60 km, 185 km, and 2330 km, respectively), which reduced the cost per unit area.Amongst these optical sensors, Wolter et al. [25] argued that SPOT-5 represented a reasonable compromise between high and medium spatial resolution and also had a large coverage area compared with the high spatial resolution of other satellite data.
In the present study, we employed SPOT-5 satellite data as well as field survey data to produce regression models for extracting forest structural information.The objectives can be summarized as follows: (i) determine the correlations between the spectral and textural measures extracted from SPOT-5 imagery and forest stand variables; (ii) develop models predicting forest stand variables with image-derived measures as independent variables; and (iii) produce thematic maps, especially for forest structural diversity, using the produced models.

National Forest Inventory Data
The 8th Chinese National Forest Inventory (CNFI) data of Guangxi Zhuang Autonomous Region, collected in 2015, were used.The inventory consists of a systematic sample of permanent square plots with a size of 1 Mu (Chinese unit of area, 0.067 ha) distributed on a square grid of 4 km ˆ6 km (Figure 1).In each plot, all trees were identified to the species level and their diameter (dbh) as well as their spatial location were recorded.
comparison, the other optical multispectral satellite data of lower spatial resolution had a relatively large coverage area (e.g., the swath width at the nadir of RapidyEye, SPOT-5, Landsat and MODIS is 77 km, 60 km, 185 km, and 2330 km, respectively), which reduced the cost per unit area.Amongst these optical sensors, Wolter et al. [25] argued that SPOT-5 represented a reasonable compromise between high and medium spatial resolution and also had a large coverage area compared with the high spatial resolution of other satellite data.
In the present study, we employed SPOT-5 satellite data as well as field survey data to produce regression models for extracting forest structural information.The objectives can be summarized as follows: (i) determine the correlations between the spectral and textural measures extracted from SPOT-5 imagery and forest stand variables; (ii) develop models predicting forest stand variables with image-derived measures as independent variables; and (iii) produce thematic maps, especially for forest structural diversity, using the produced models.

National Forest Inventory Data
The 8th Chinese National Forest Inventory (CNFI) data of Guangxi Zhuang Autonomous Region, collected in 2015, were used.The inventory consists of a systematic sample of permanent square plots with a size of 1 Mu (Chinese unit of area, 0.067 ha) distributed on a square grid of 4 km × 6 km (Figure 1).In each plot, all trees were identified to the species level and their diameter (dbh) as well as their spatial location were recorded.

Remote Sensing Data
Three SPOT-5 images, which had the K-J numbers 275/300, 275/301, and 275/302, taken on 21 September 2010, were used in the present study (Figure 1).The three SPOT-5 images consisted of both multi-spectral and panchromatic images.The multi-spectral images have a resolution of 10 m in the near-infrared (780-890 nm), red (610-680 nm) and green (500-590 nm) bands and 20 m in the shortwave infrared (1580-1750 nm) band.The panchromatic image was recorded at a resolution of 2.5 m.

Remote Sensing Data
Three SPOT-5 images, which had the K-J numbers 275/300, 275/301, and 275/302, taken on 21 September 2010, were used in the present study (Figure 1).The three SPOT-5 images consisted of both multi-spectral and panchromatic images.The multi-spectral images have a resolution of 10 m in the near-infrared (780-890 nm), red (610-680 nm) and green (500-590 nm) bands and 20 m in the shortwave infrared (1580-1750 nm) band.The panchromatic image was recorded at a resolution of 2.5 m.
Geometrical corrections were performed using Ground Control Points (GCP), determined with a differential GPS.Atmospheric correction was carried out using the improved Dark Object Subtraction suggested by Castillo-santiago et al. [35].The processing of these images was performed by the Survey & Planning Institute of State Forestry Administration, China prior to the present study.A total of 233 NFI plots fell into these three satellite images, of which 65 plots were dominated by trees.Amongst these 65 plots, there were 48 plots with canopy cover more than 20%, which were defined as forest stands (Figure 1).These 48 plots were employed to derive forest stand variables.

Conventional Forest Variables
The conventional forest variables, including quadratic mean diameter (QMD), basal area (BA), number of trees (NT), and stand volume (SV), were calculated for each plot.These variables provide a basic description of forest structure and were mostly derived information for forest management decision-making purposes.

Forest Structural Diversity
In comparison to conventional forest variables, forest structural diversity provides more details on forest structure [15].Structural diversity can be subdivided into three categories: tree species diversity, tree dimension diversity, and tree position diversity [13,14].

Species Diversity
We used the Shannon-Wiener index (SHI), Pielou index (PI), Simpson's index (SII) and the species intermingling index to characterize species diversity [36].
Shannon-Wiener index: where p i is the proportion of basal areas in the ith species.Pielou index: where SHI is the Shannon-Wiener index and S is the total number of species in a sample, across all samples in a dataset.Simpson's index: where pi is the proportion of basal areas in the ith species and n is the number of species observed.The species intermingling index of a forest stand (M): where v ij " # 0, i f neighbour j is the same species as re f erence tree i 1, otherwise ; Mi is the species intermingling index for reference tree i. M ranges from zero to one and indicates the degree of mixing in a forest stand.Values close to zero indicate that the forest stand has a low level of species mingling and a high degree of aggregation.High values that are close to one, on the other hand, imply that the forest stand has a high level of species mingling and a low degree of aggregation [37].

Diameter or Tree Size Diversity
Tree size diversity can be measured by the Gini coefficient (GC) and the standard deviation of the DBHs (SDDBH) [13,15].
Gini coefficient: GC " ř n t"1 p2t ´n ´1qba t ř n t"1 ba t pn ´1q where ba t is basal area for tree in rank t (m 2 /ha) and t is the rank of a tree in order from 1, . . ., n. GC ranged from zero to one.The GC has a minimum value of zero, when all trees are of equal size, and a theoretical maximum of one in an infinite population in which all trees except one have a value of zero for basal area.

Tree Position Diversity
Tree position diversity can be represented by the uniform angle index, DBH dominance index and Diameter differentiation index [37][38][39].These indices have been widely employed in analyzing spatial structure and thus support the formulation of management strategy especially for mixed, irregular, uneven-aged forests [17,40].
Uniform angle index of a forest stand (W): where z ij " # 1, i f the jth α ă α 0 0, i f the jth α ą α 0 pα 0 " 72 ˝q; n is the number of reference trees in the forest stand; i is any reference tree; j is the four nearest trees around reference tree i; and Wi is the uniform angle index, describing the uniformity of the distribution of neighboring trees around the reference tree i.
If W falls within [0.475, 0.517], it represents a random distribution; W > 0.517 represents a clumpy distribution, and W < 0.475 represents a uniform distribution [41].
The DBH dominance index of a forest stand (U): where k ij " # 0, i f neighbour j is smaller than re f erence tree i 1, otherwise ; Ui is the DBH dominance index for reference tree i. U explains the tree size differentiation within a forest stand; its value fall between 0 and 1.The higher the value, the greater the tree size differentiation in the forest stand [37].
Diameter differentiation index (DDI): The diameter differentiation T i quantifies diameter heterogeneity in the immediate neighborhood of a tree i.For a central tree i (I = 1, . . ., n) and its nearest neighbors j (j = 1, . . ., m), the diameter differentiation Ti is defined as: with where n is the number of central trees, m is the number of neighor trees (m = 3 in present study) and DBH i and DBH j are the diameter of the central tree and its neighbors, respectively.
In the present study, we calculated the mean DDI (T) within a stand using the following equation: The above tree position diversity indices could be significantly influenced by the edge trees since some of their neighbor trees might fall outside the plot [42].It is therefore necessary to conduct edge correction.In this study we used the reflection method.
GR " SAVI " MSI " where RED, GREEN, NIR, and SWIR are the surface reflectance of the red, green, near-infrared and shortwave infrared bands, respectively, and L was set to 0.5; and

Textural Measures
First-and second-order textural measures were derived for each plot.The first-order textural feature (standard deviation of gray levels, SDGL) was calculated for all multispectral reflectance bands, producing SDGL_green, SDGL_red, SDGL_nir and SDGL_mir.
The panchromatic band is reported to be particularly well suited for the analysis of spatial relationships using image textural measures [22,[50][51][52].As a result, we only extracted the second-order textural measures from the panchromatic band for each plot in comparison to the spectral and first-order textural measures.
The second-order textural feature was calculated based on the grey level co-occurrence matrix (GLCM).Eight GLCMs-Mean, Std.Dev., Correlation, Dissimilarity, Entropy, angular second moment (ASM), Contrast and Homogeneity-were selected for this study as the potential independent variables to establish the predictive models.A more detailed description of these textural measures is provided by Trimble [43].In addition to the spatial resolution, the value of the textural variables depends on window size.To determine the optimum window size, Shaban and Dikshit [53] and Castillo-santiago et al. [35] calculated and compared the Pearson correlation coefficient of texture statistics with the dependent variables (forest stand variables) at different window sizes and concluded that a 9 ˆ9 pixel window represented a trade-off between a desirable high correlation coefficient and a desirable minimum window size.Following them, we calculated the Pearson correlation coefficient of texture statistics with SHI and DDI, at seven window sizes (3 ˆ3, 5 ˆ5, 7 ˆ7, 9 ˆ9, 11 ˆ11, 13 ˆ13 and 15 ˆ15 pixels).

Model Construction and Validation
Prior to producing the predictive models, pairwise correlation analysis was first conducted between the forest stand variables and the image-derived measures.A two-tailed t-test was used to determine whether the correlations were statistically significant.Only the image-derived variables that showed significant correlations with the forest stand variables were included as independent variables for the subsequent multiple-variable regression.In order to correct nonlinearity and non-constant variance, we used Box-Cox transformations of the response variables (forest stand variables).For the determination of the potential subset of independent variables, there are two distinctly different approaches, namely, all possible subsets and stepwise methods [54].In the present study, we used all possible subsets.Following Castillo-Santiago et al. [35], Ozdemir and Karnieli [13] and Wallner et al. [55], we employed a cut-off value for variance inflation factor (VIF) of less than four, and restricted the number of independent variables to four to avoid multicollinearity.
The general fitting statistics including the adjusted coefficient of determination (R 2 adj ), and the root mean square error (RMSE) between observed and predicted forest stand variables was computed to evaluate the overall accuracy of the fitted models.In addition, residual plots were produced to inspect the normal distribution of the residuals.For model validation, a ten-fold cross-validation approach was employed to calculate the cross-validated root mean square error (RMSE cv ).In this approach, all candidate plots for constructing the predictive models were divided into ten folds.In each iteration, one fold was excluded and the remaining folds were retained for regression.The produced models were used to provide predictions pertaining to the excluded fold.The residuals were calculated for each data point in the excluded data and the corresponding RMSE i was derived.This process was carried out for all folds and the RMSE cv was calculated as the mean of all the RMSE i .

Structural Parameters
The descriptive forest stand variables derived from the 48 sampling plots are summarized in Table 1.Although the number of sampling plots is only 48, they represent a wide range of forest structural characteristics.For instance, SV, one of the most important conventional forest stand variables, ranged from 21.02 m 3 /ha to 263.13 m 3 /ha with a mean value of 101.23 m 3 /ha, which is almost the same as the average value of 100.20 m 3 /ha in Guangxi Zhuang Autonomous Region [56].In terms of species diversity, SHI, for example, ranged from 0 to 1.801, which indicates that the sampling plots contained both single-tree species and mixed-tree species stands, representing a wide range of species diversity.GC ranged from 0.062 to 0.362 with an average value of 0.230, implying a relatively low degree of tree size diversity because the theoretical maximum value is 1.With respect to tree position diversity, the range of M from 0 to 0.778 indicated that the stands varied considerably in species intermingling from a very low level to an extremely high level.U ranged from 0.470 to 0.531 with an average value of 0.482, implying that in the stands, tree size was moderately differentiated.W ranged from 0.273 to 0.706 with an average value of 0.498.Eight plots were uniform distribution (W < 0.475), 16 plots were clumpy distribution (W > 0.517) and 24 plots fell within [0.475, 0.517], representing random distribution.

Correlation Analyses
The correlation analyses between the forest stand variables and spectral measures are summarized in Table 2.The average surface reflectance of all bands was significantly negatively correlated with the forest stand variables except W and U. Specifically, the average surface reflectance of the nir, red, green and pan bands showed much higher correlation (|r| ą 0.60) with SHI, SII, PI and M. The vegetation indices except GEMI, MSI, SVR were significantly correlated with some forest stand variables, amongst which VI and SAVI were correlated with almost all stand variables and showed a much higher correlation (|r| ą 0.60) with SHI, SII, PI and M. Regarding the layer value features, Brightness was significantly correlated with all forest stand variables expect W and U and the correlation was extremely high with SHI, SII, PI and M(|r| ą 0.70).Max_diff was only correlated with QMD, BA, NT, SDDBH and SV.
The Pearson correlation coefficient between correlation (a texture statistics) and SHI increased until a window size of 9 ˆ9 pixels was reached and no further significant improvement was observed when continuing increasing the window up to 15 ˆ15 pixels (Figure 2).In comparison, the Pearson correlation coefficient between the other texture statistics did not show any notable change along with the window size.Similar pattern was also observed for DDI (Figure 2).The window size of 9 ˆ9 pixels was therefore determined to be the optimum size to calculate the texture statistics, which was consistent with the findings reported by Shaban and Dikshit [53] and Castillo-santiago et al. [35].
In terms of the relationship between textural measures and forest stand variables, the first-order textural measures did not show significant correlation with many forest stand variables, except SDGL_red.For instance, SDGL_swir and SDGL_green were significantly correlated with only two stand variables and the correlations were also not high (|r| < 0.4).In contrast, the second-order textural measures exhibited significant correlation with much more forest stand variables.For example, Glcm_contrast, Glcm_mean and Glcm_variance, were highly correlated with all the forest stand variables except W and U. Similarly, much higher correlations were observed between these textural measures and SHI, SII, PI and M. For example, the correlation coefficient between Glcm_mean and SHI was ´0.812, which was ranked as the highest value in the present study.

Model Establishment
We first produced the predictive models using both textural and spectral measures as independent variables.Although most of the textural measures and the spectral measures indicated a significant correlation with the forest stand variables (Tables 2 and 3), we excluded some of them to avoid multicollinearity.The produced models are summarized in Table 4.The developed models had at most three independent variables.Brightness was the most commonly used independent variable in the models predicting BA, QMD, SV, NT and SDDBH.The following one was Max_diff, which contributed to the models predicting BA, QMD, SV and NT.VI and mean_red ranked third.VI was an independent variable in the models predicting PI, and DDI, whereas mean_red was involved in the models predicting SHI, SII, and GC.The other independent variables for the models included SDGL_nir, SDGL_green and SDGL_nir.

Model Establishment
We first produced the predictive models using both textural and spectral measures as independent variables.Although most of the textural measures and the spectral measures indicated a significant correlation with the forest stand variables (Tables 2 and 3), we excluded some of them to avoid multicollinearity.The produced models are summarized in Table 4.The developed models had at most three independent variables.Brightness was the most commonly used independent variable in the models predicting BA, QMD, SV, NT and SDDBH.The following one was Max_diff, which contributed to the models predicting BA, QMD, SV and NT.VI and mean_red ranked third.VI was an independent variable in the models predicting PI, M and DDI, whereas mean_red was involved in the models predicting SHI, SII, and GC.The other independent variables for the models included SDGL_nir, SDGL_green and SDGL_nir.
As many studies have demonstrated that forest stand variables can be estimated using only textural measures as independent variables, for comparison purposes, the predictive models were also built using only textural measures as independent variables.All of the produced models except the ones predicting M and DDI, had only one independent variable, which was Glcm_mean.Amongst all the twelve models in Table 5, only models (15, 16, 20, 22 and 23) predicing SHI, SII, PI and M could be trusted as their adjusted correlation coefficients (R 2 adj ) were larger than 0.5.Their RMSEs were 0.321, 0.170, 0.127 and 0.160 and 0.147, respectively (Table 5).The residuals of the models were normally distributed and showed evidence of uniform variance (Figure 5).Their prediction abilities were substantiated by the cross-validation scores (RMSE cv values were 0.342, 0.174, 0.134, 0.166 and 0.155).
The residual plots of the reliable models are presented in Figure 3, and no particular patterns were observed.We therefore concluded that these models had potential to predict and map the forest stand variables.Based on model 5, the thematic map of Simpson's index was produced as an example (Figure 4).
As many studies have demonstrated that forest stand variables can be estimated using only textural measures as independent variables, for comparison purposes, the predictive models were also built using only textural measures as independent variables.All of the produced models except the ones predicting and DDI, had only one independent variable, which was Glcm_mean.Amongst all the twelve models in Table 5, only models (15, 16, 20, 22 and 23) predicing SHI, SII, PI and could be trusted as their adjusted correlation coefficients ( ) were larger than 0.5.Their RMSEs were 0.321, 0.170, 0.127 and 0.160 and 0.147, respectively (Table 5).The residuals of the models were normally distributed and showed evidence of uniform variance (Figure 5).Their prediction abilities were substantiated by the cross-validation scores (RMSEcv values were 0.342, 0.174, 0.134, 0.166 and 0.155).

Discussion
We first built our predictive models using both spectral and textural measures, but only certain spectral measures were retained in the models.This could be attributed to the problems of multicollinearity.The produced models (1, 3, 4, 5, 8, 9, 10, 11 and 12) allowed predictions of the BA, SV, SHI, SII, SDDBH, PI, DDI and ( values were between 0.50 and 0.70, p < 0.01).Vegetation indices were commonly used and promising independent variables in estimation of forest stand variables [55,57].In the present study, only VI was included in models (9)(10)(11)(12), whereas no vegetation indices were included in model (1,3,4,5,8), though eight vegetation indices were involved as potential regressors to establish the predict models.Similar results were reported by Castillo-Santiago et al. [35] who employed four vegetation indices to produce models predicting BA, SV and above ground biomass but found no one was included in the final models.The effectiveness of vegetation indices for predicting forest stand variables was determined by both nature of the forests and the quantity of shadows [35].For instance, Steininger [58] and Castillo-Santiago et al. [35] documented that the best results for spectral information (vegetation indices) to explain variation in forest structure were at lower biomass level.Eckert [59] and Wallner et al. [55] explained the effectiveness of the their vegetation indices (e.g., GR and SR) for estimating forest stand variables as follows: a low value for the vegetation indices implies the presence of stands of coniferous forest with shady areas and relatively low stand density, while higher values for these indices imply broadleaved forest with a closed canopy.In the present study, we obtained very low and non-statistically significant correlation coefficients between the forest stand variables and vegetation indices, e.g., GEMI, GR, MSI, and SVR.This might be attributed to the nature of the 48 plots used to produce the models.These plots were of relatively low density and with shady area.Furthermore, most of them were dominated by the coniferous tree species such as Cunninghamia lanceolata and Pinus massoniana.These features together might result in a very low value of vegetation indices.In addition, multicollinearity might also account for the exclusion of vegetation indices.
None of these models, except model 10, had textural measures as independent variables.In comparison to our results, many studies have demonstrated that the inclusion of textural features, especially second-order textural measures, to spectral measures could improve the estimation of forest stand variables as well as the accuracy of forest classification.For example, Wulder et al. [60] found that with the inclusion of texture, the ability to estimate hardwood forest leaf area index (LAI) from remotely sensed imagery increased by approximately 20%.Kim et al. [61] reported that the classification accuracy using IKONOS imagery was improved by adding the textural features to the spectral properties.Eckert [1] documented that estimation of tropical rainforest biomass/carbon, based on WorldView-2, exhibited an obvious improvement after introducing textural information to spectral information.The reason why our results are not consistent with these studies might be due to the relatively lower spatial resolution of the SPOT-5 imagery compared with the very high resolution (VHR) satellite imagery (e.g., IKONOS and Worldview-2 employed in the above-

Discussion
We first built our predictive models using both spectral and textural measures, but only certain spectral measures were retained in the models.This could be attributed to the problems of multicollinearity.The produced models (1, 3, 4, 5, 8, 9, 10, 11 and 12) allowed predictions of the BA, SV, SHI, SII, SDDBH, PI, DDI and M (R 2 adj values were between 0.50 and 0.70, p < 0.01).Vegetation indices were commonly used and promising independent variables in estimation of forest stand variables [55,57].In the present study, only VI was included in models (9-12), whereas no vegetation indices were included in model (1,3,4,5,8), though eight vegetation indices were involved as potential regressors to establish the predict models.Similar results were reported by Castillo-Santiago et al. [35] who employed four vegetation indices to produce models predicting BA, SV and above ground biomass but found no one was included in the final models.The effectiveness of vegetation indices for predicting forest stand variables was determined by both nature of the forests and the quantity of shadows [35].For instance, Steininger [58] and Castillo-Santiago et al. [35] documented that the best results for spectral information (vegetation indices) to explain variation in forest structure were at lower biomass level.Eckert [59] and Wallner et al. [55] explained the effectiveness of the their vegetation indices (e.g., GR and SR) for estimating forest stand variables as follows: a low value for the vegetation indices implies the presence of stands of coniferous forest with shady areas and relatively low stand density, while higher values for these indices imply broadleaved forest with a closed canopy.In the present study, we obtained very low and non-statistically significant correlation coefficients between the forest stand variables and vegetation indices, e.g., GEMI, GR, MSI, and SVR.This might be attributed to the nature of the 48 plots used to produce the models.These plots were of relatively low density and with shady area.Furthermore, most of them were dominated by the coniferous tree species such as Cunninghamia lanceolata and Pinus massoniana.These features together might result in a very low value of vegetation indices.In addition, multicollinearity might also account for the exclusion of vegetation indices.
None of these models, except model 10, had textural measures as independent variables.In comparison to our results, many studies have demonstrated that the inclusion of textural features, especially second-order textural measures, to spectral measures could improve the estimation of forest stand variables as well as the accuracy of forest classification.For example, Wulder et al. [60] found that with the inclusion of texture, the ability to estimate hardwood forest leaf area index (LAI) from remotely sensed imagery increased by approximately 20%.Kim et al. [61] reported that the classification accuracy using IKONOS imagery was improved by adding the textural features to the spectral properties.Eckert [1] documented that estimation of tropical rainforest biomass/carbon, based on WorldView-2, exhibited an obvious improvement after introducing textural information to spectral information.The reason why our results are not consistent with these studies might be due to the relatively lower spatial resolution of the SPOT-5 imagery compared with the very high resolution (VHR) satellite imagery (e.g., IKONOS and Worldview-2 employed in the above-mentioned studies).Lu and Weng [62] argued that the importance of introducing textural information increases as spatial resolution increases.Franklin et al. [63] also found that the addition of image texture increased the classification accuracy of high spatial detail imagery (pixel size < 1 m) relative to low spatial detail imagery.Furthermore, the poor significance of textural measures could be attribute to the variables to predict such diversity indices.These diversity indices only measure one dimension of diversity [64], whereas the texture measures included many dimensions.For example, mixture of two pine species could have the same species diversity as a mixture of one pine species and one birch species.The two mixtures would have similar diversity indices but much different texture indices.In this case, the correlation between textural measures and diversity indices would be extremely low.
In addition, the importance of introducing textural measures also depends on the research subject.For instance, Ota et al. [65] found that the addition of textural information improved the discrimination of hinoki cypress and cool-temperate mixed forest whereas no improvement for Japanese cedar and a clear cut area was observed.Franklin [66] also documented that the addition of texture generally improved the classification accuracy of hardwood stands, more so than for softwood stands.Using the estimation of structural diversity indices (GC, SDDBH and DDI) for example, Ozdemir and Karnieli [13] explained why the importance of textural measures varied between different research subjects.They stated that the stands in which trees were regularly interspaced and the stem density was high, had lower structural diversity but produced higher textural values.On the contrary, the stands with higher structural diversity produced lower textural values because the large crowns and the gaps in such clumped stands increase the number of adjacent (neighbor) pixels with similar or identical gray levels.In our study, the 48 candidate plots used to build the predictive models varied significantly in terms of species composition and structural characteristics (Table 1).For instance, SHI ranged from 0 to 1.801, indicating the plots consisted of single-tree species plantations and mixed-species forests.SV ranged from 21.02 m 3 /ha to 263.13 m 3 /ha, producing a coefficient of variance (CV) of 57.10%.The high variation of these 48 plots may have changed the relationship between the textural measures and the forest stand variables when we combined them for regression analysis, which might also account for the exclusion of textural measures when developing the predictive models.As a result, prior to model development, it seemed to be quite necessary to stratify the forest inventory plots into sub-categories (e.g., pure plantation and mixed-tree species forests) for which subsequent regression analysis should be done separately.
This assumption was supported by our findings that the R 2 adj would increase from 0.59 to 0.62 (Table 5) if we built the model predicting M using all data excluding those from the pure plantation.Actually, many other studies have already demonstrated the efficiency of classification/stratification in building such predictive models.For example, Eckert [1] reported that the estimation of tropical rainforest biomass/carbon could be improved by developing and applying forest stratum-specific models.Wallner et al. [55] firstly stratified forest inventory plots based on forest types and then produced separate predictive models and found that stratification improved the regression models.Similarly, we also produced the models for the other stand variables using all data excluding those from the pure plantation; however, no significant/obvious improvement was observed for R 2 adj , in contrast to M. Therefore, these are not listed in Tables 4 and 5.We might conclude that amongst the predicted variables in the present study, M was the most sensitive to the image information.This might be attributed to the much more detailed information that M provided compared with the other forest stand variables.For instance, BA and SV contained neither species composition nor tree position information.SHI and SII did not involve tree position information, though they accounted for tree species composition.In addition, although two stands had similar diversity indices, they would differ with respect to image textural measures.For example, mixture of two pine species could have the same species diversity as a mixture of one pine species and one birch species.The two mixtures would have similar diversity indices but much different texture indices.In contrast, M involved both species composition and tree position information and could be better represented by textural measures.
In addition, we also developed the models with only textural measures as independent variables for comparison purposes because some published literature has shown that textural measures alone were also promising for the prediction of forest stand variables.For instance, based on IKONOS satellite data, Kayitakire et al. [67] and Gebreslasie [52] succeeded in developing reliable models predicting conventional forest variables using only textural features.Our results indicated that amongst the 11 candidate forest stand variables (dependent variables), only species diversity represented by SHI, PI and SII could be reliably estimated using only textural measures (R 2 adj values for SHI, PI and SII were 0.62, 0.57 and 0.59, respectively (p < 0.01)).Actually, the relationship between species diversity and textural measures has been explored by many authors in varying research fields.For instance, a similar result was reported by Nagendra et al. [68], who also found that the textural measures were significantly correlated with tree species diversity measured by species richness and the Shannon index.St-Louis et al. [69] and wood et al. [70], who tested image texture as a predictor of bird species richness and density, concluded that textural measures are very promising predictors and even perform better than field-measured vegetation structure.Magurran [36] classified diversity as either species richness measures or heterogeneity measures.Heterogeneity measures such SHI, PI and SII are those that combine the component diversity of the richness and evenness measures and are hence regarded to represent considerably more information [36,71,72].The rich information of the heterogeneity measures (e.g., SHI, PI and SII in the present study) might account for their high sensitivity to remotely sensed image texture, which was regarded as a surrogate for vegetation structure [70].Furthermore, Gallardo-Cruz et al. [73] argued that compared with first-order texture, the second-order texture had greater potential to reflect the heterogeneity of forest stands as it considers pixel-neighbor relationships.Their statement is supported by our t predicting SHI, PI and SII, whose independent variable was Glcm_mean (second-order texture).
In terms of the forest stand variables to be predicted, most studies focused on the estimation of conventional forest stand variables such as SV, NT, BA, and QMD using remotely sensed data [35,55,74,75].Only a few studies have investigated the extraction of the more complex structural variables such as tree size diversity and tree position diversity [13,76,77].However, these complex structural variables are significant in the development of management plans, especially for multipurpose forests, and are usually more expensive and time-consuming to collect in a field survey.In addition, the complexity of such structural variables was further introduced by the spatial and temporal scale at which they should be investigated.For instance, Lamonaca et al. [77] argued that these complex structural variables representing spatial heterogeneity should be detected across scales since it was not possible to infer the multiple-scale structural and dynamical patterns from a system description that spanned only a narrow window of resolution [78].They therefore compared three-level segmentation and demonstrated that multi-resolution segmentation was able to delineate scale-dependent patterns of forest structural heterogeneity, even in an initial stage of old-growth structural differentiation.Their findings have a potential to improve the sampling design of field surveys aimed at characterizing forest structural complexity across multiple spatio-temporal scales.In the present study, in addition to the conventional forest variables, we also succeeded in producing models allowing us to forecast the more complex forest structure, i.e., tree size diversity represented by SDDBH, and tree position diversity represented by M. The tree size diversity affected the economical, ecological as well as social values and hence provided important information for prescribing management regimes [15].The tree position diversity was not only used to infer ecological mechanisms [79,80] but was also of practical importance, e.g., formulation of tree-level harvest optimization [17] and identification of the optimal tree species arrangement for enrichment planting [81].In the present study, the predictive models predicting these complex structural variables was unfortunately built without taking the concept of multiple-scale analysis and hierarchy.Following the findings of Lamonaca et al. [77], it might be necessary to first conduct multi-resolution segmentation and then produce the predictive models for the segments which had the same structural variables if the training data (field plots) was sufficient.In addition, the predictive models were developed using only three SPOT-5 images and hence it was not safe to apply across the entire Guangxi Zhuang Autonomous Region.However, thematic maps could be reliably produced within the research area for which the models were built.Furthermore, non-compatibility of the produced models might be introduced because each forest variables were predicted independently.For instance, BA, QMD and NT were related by BA = QMD ˆQMD ˆNT ˆ0.00007854 (given BA in m 2 /ha, QMD in cm and NT in trees/ha).Because of non-compatibility, the estimated BA might differ from the value produced above.Therefore compatible models should be encouraged to develop.We also could have negative predictions.For example, the first five smallest values for VI were 1.278, 1.307, 1.311, 1.317 and 1.320.If model 11 were employed for prediction, we would get four negative predictions, i.e., ´0.177, ´0.046, ´0.028 and ´0.003, which were close to zero.Actually, the plots with the species intermingling index around zero (no matter positive or negative) were all pure plantation.Therefore, if we get the negative predictions, we could assign them to pure plantation.
Multiple linear regression was commonly employed in forestry researches.For instance, it was widely used to produce forest growth and yield models [82][83][84][85].Also, like this present study, it was normally performed to extract forest stand variables using remotely sensed data [13,35,55,66,86,87].However, this statistical technique was criticized for its limitations.For example, Gebreslasie et al. [52], Dye et al. [88] and Lottering and Mutanga [89] documented that multiple linear regression assumed both linearity and independence between variables, which was seldom observed in forest and remotely sensed data.Furthermore, linear regression also required the absence of collinearity amongst input variables [88,90].VIF was normally employed to analyze multicollinearity and some variables indicating on collinearity (multicollinearity) might be removed, which resulted in model that explained less variance than the best possible full model with more variables.Therefore, more robust statistical methods, which did not need to make any assumptions about the data, such as artificial Neural Networks (ANN) [89][90][91], Classification and Regression Tree Analysis (CART) [22,92,93], and Random forests (RF) [88,94,95] were widely used to investigate complex relationship between forests stand variables and remotely sensed data.These robust statistical techniques should be given first priority in future remote sensing studies as many researches have already demonstrated that nonlinear interactions might exist between the observed data and remotely sensed data [88,90,96].Even within these robust statistical techniques, they presented different performance in producing the predictive models.For instance, Breiman [97] documented that CART was sensitive to small variations in the training dataset, which could cause instability with regard to variable selection and can adversely affect the predictive performance of the final model [98].Correspondingly, Dye et al. [88] recommended RF to reduce the instability of single regression trees and improve the overall predictive performance.Therefore, studies comparing different statistical techniques for predicting forest stand variables using remotely sensed data should be encouraged.Although the R 2 was a frequently employed efficiency criteria to identify the optimum models and the models with R 2 equal to or more than 0.5 were normally regarded to be reliable [13,25], it might still be reputable since in certain cases models with very low R 2 were also useful for prediction.Actually in addition to R 2 , there were also other efficiency criteria such as Nash-Sutcliffe efficiency and Index of agreement, which placed different emphasis on different types of simulated and observed behaviors [99,100].Janssen and Heuberger [101] documented that the selection of the best efficiency measures should reflect the intended use of the model and should concern model quantities which are deemed relevant for the study at hand.Krause et al. [99] recommended a combination of different efficiency criteria for scientific sound model calibration and validation after examining the utility of several efficiency criteria.
Uneven-aged forest management with various objectives has received more attention as a silvicultural alternative in the past few years [2,15,83,102].In this context, much more detailed information concerning complex forests is needed for management decision-making.Corona [12] considered new paradigms in large-scale monitoring and assessment of forest ecosystems under the changing perspectives and made commented discussions with examples from the literature produced in the last decade.Remote sensing techniques with various sensors of different spatial and spectral resolutions provide a promising opportunity to extract such detailed information.Therefore, further investigation exploring the relationship between these complex structural indices and the indices derived from different remotely sensed data should be encouraged.

Conclusions
Forest structural diversity indices were of great importance to the management of uneven-aged forests.However, they were time consuming and expensive to obtain.In the present study, we have successfully built the predictive models predicting forest structural diversity indices, i.e., Shannon-Wiener index, Simpson's index, Standard deviation of DBHs, Pielou index, Diameter differentiation index and Species intermingling index using both spectral and textural measures.In addition, we also produced models estimating basal area and stand volume.The predictive models would contribute to the formulation of forest management strategy, especially for uneven-aged forests in the context of climate change.Although the produced predictive models provided us a quick and economical estimation of forest structural diversity, they should be applied with great care as biased estimation might occur if we employ them beyond the scope that we developed them.It was noteworthy that multiple linear regression assumed both linearity and independence between variables, which was seldom observed in forest and remotely sensed data.The robust statistical methods, e.g., machine learning, need to perform in future research.

Figure 1 .
Figure 1.Overview and zoomed map of the study area.The zoomed maps consist of three SPOT-5 image footprints in which the red circle represents the forest plots.

Figure 1 .
Figure 1.Overview and zoomed map of the study area.The zoomed maps consist of three SPOT-5 image footprints in which the red circle represents the forest plots.

Figure 2 .
Figure 2. Correlation coefficient of the texture statistics with SHI and DDI, as a function of window size.

Figure 2 .
Figure 2. Correlation coefficient of the texture statistics with SHI and DDI, as a function of window size.

Figure 4 .
Figure 4. Thematic map of Simpson's index of a county in Guangxi Autonomous Region.

Figure 4 .
Figure 4. Thematic map of Simpson's index of a county in Guangxi Autonomous Region.

Figure 4 .
Figure 4. Thematic map of Simpson's index of a county in Guangxi Autonomous Region.

Table 1 .
Descriptive statistics of the conventional forest parameters and structural diversity indices of the 48 plots.

Table 2 .
Pearson correlation coefficients between the spectral image measures and the forest stand variables.

Table 3 .
Pearson correlation coefficients between the textural image measures and the forest structural parameters.

Table 4 .
Regression model predicting the forest stand variables using both spectral and textural measures as independent variables.
´6* Species intermingling index calculated from all the plots excluding the plots of pure plantation.

Table 5 .
Regression model predicting the forest stand variables using only textural measures as independent variables.
´6* Species intermingling index calculated from all the plots excluding the plots of pure plantation.