Comparison of Four Machine Learning Methods for Generating the GLASS Fractional Vegetation Cover Product from MODIS Data

Yang, Linqing; Jia, Kun; Liang, Shunlin; Liu, Jingcan; Wang, Xiaoxia

doi:10.3390/rs8080682

¹ State Key Laboratory of Remote Sensing Science, School of Geography, Beijing Normal University, Beijing 100875, China

² Department of Geographical Sciences, University of Maryland, College Park, MD 20742, USA

* Author to whom correspondence should be addressed.

Remote Sens. 2016, 8(8), 682; https://doi.org/10.3390/rs8080682

Submission received: 18 May 2016 / Revised: 15 August 2016 / Accepted: 17 August 2016 / Published: 20 August 2016

Abstract

Long-term global land surface fractional vegetation cover (FVC) products are essential for various applications. Currently, several global FVC products have been generated from medium spatial resolution remote sensing data. However, validation results indicate that there are inconsistencies and spatial and temporal discontinuities in the current FVC products. To address these problems, the Global LAnd Surface Satellite (GLASS) FVC product algorithm was developed using general regression neural networks (GRNNs); it achieves an FVC estimation accuracy comparable to that of the GEOV1 FVC product with much improved spatial and temporal continuities. However, the computational efficiency of the GRNNs method is too low for generating the long-term GLASS FVC product. Therefore, the objective of this study was to find an alternative algorithm for generating the GLASS FVC product that has both an accuracy comparable to that of the GRNNs method and adequate computational efficiency. Four commonly used machine learning methods, back-propagation neural networks (BPNNs), GRNNs, support vector regression (SVR), and multivariate adaptive regression splines (MARS), were evaluated. After its training accuracy and computational efficiency were compared with those of the other three methods, the MARS model was preliminarily selected as the most suitable algorithm for generating the GLASS FVC product. Direct validation results indicated that the performance of the MARS model (R² = 0.836, RMSE = 0.1488) was comparable to that of the GRNNs method (R² = 0.8353, RMSE = 0.1495), and the global land surface FVC generated from the MARS model had good spatial and temporal consistency with that generated from the GRNNs method. Furthermore, the computational efficiency of MARS was much higher than that of the GRNNs method. Therefore, the MARS model is a suitable algorithm for generating the GLASS FVC product from Moderate Resolution Imaging Spectroradiometer (MODIS) data.

Keywords:

fractional vegetation cover (FVC); GLASS FVC product; multivariate adaptive regression splines (MARS); general regression neural networks (GRNNs); MODIS

1. Introduction

Fractional vegetation cover (FVC), generally defined as the fraction of green vegetation as seen from the nadir, is an important variable for describing land surface vegetation [1,2,3]. FVC is also a key biophysical parameter for studying the atmosphere, pedosphere, hydrosphere, and biosphere, as well as their interactions [4], and is widely used in weather prediction, hydrological monitoring, and related research fields [5,6,7,8]. Long-term and global-scale FVC datasets are highly important for land surface process and climate change studies and also for their extensive applications in the monitoring of agriculture, forestry, disaster risk, and drought [5,8,9,10]. Estimation of FVC from remote sensing data is the only effective way to generate FVC products, especially at the regional and global scale.

There are three main types of FVC estimation methods using remotely sensed data: empirical methods, pixel un-mixing modeling, and physical model-based methods. The empirical methods are based on the statistical relationships between FVC and spectral band reflectance or vegetation indices [11,12]. Empirical methods are simple, computationally efficient, and widely used for estimating FVC at the regional scale. However, empirical methods are limited temporally and spatially because their statistical relationships are constructed using data acquired at specific times in distinct regions. Thus, although they are typically applicable to specific research areas and vegetation types, they may become invalid when they are expanded to larger areas. The pixel un-mixing model assumes that each pixel is composed of several components, and considers the fraction of vegetation compositions as the FVC of the pixel [13,14,15]. The dimidiate pixel model, which assumes that the pixels are composed solely of vegetation and non-vegetation components, is the simplest and most widely used pixel un-mixing model [5,14,16]. However, due to the complexity of land surfaces, it is difficult to use the pixel un-mixing model to determine the number of endmembers and the spectral responses of the endmembers over large areas for FVC estimation. The physical model-based methods for FVC estimation are based on the inversion of canopy radiative transfer models that simulate the physical relationships between vegetation canopy spectral reflectance and FVC. Such physical models have clear physical significance and can be adapted to a wide range of scenarios [17]. However, because of the complexity of the physical models, direct inversion is generally complex, and artificial neural networks (ANNs) are usually used for indirect inversion of the physical model by training with a pre-computed reflectance database from the physical models to simplify the inversion [8,18]. 
ANN methods have the advantages of computational efficiency and robustness to noisy data, and can approximate multivariate non-linear relationships, which make them popular choices for large-area FVC estimation from remote sensing data [1,19,20,21].

Furthermore, the current global FVC products were largely generated using medium spatial resolution remotely sensed data. However, specific validation results indicate that there are inconsistencies among the different FVC products as well as spatial and temporal discontinuities within them [22,23,24]. For example, the Spinning Enhanced Visible and Infrared Imager (SEVIRI) and MEdium-Resolution Imaging Spectrometer (MERIS) FVC products were found to have systematic bias, such that the MERIS FVC presents lower values (by approximately 0.1–0.2) [24]. VEGETATION (VGT) FVC was found to be generally higher than SEVIRI FVC by approximately 0.15, and Carbon cYcle and Change in Land Observational Products from an Ensemble of Satellites (CYCLOPES) FVC values were found to be underestimates [25,26]. With the underestimation problem corrected, the GEOV1 FVC product is an improved version of the CYCLOPES FVC product. However, the GEOV1 FVC product suffers from unsatisfactory temporal and spatial continuities [24]. Therefore, a long-term global FVC product with both high accuracy and satisfactory spatial and temporal continuities is urgently needed.

Recently, Jia et al. [27] developed an operational algorithm for estimating FVC from Moderate Resolution Imaging Spectroradiometer (MODIS) surface reflectance data using general regression neural networks (GRNNs). Their method was used to generate the Global LAnd Surface Satellite (GLASS) FVC product, which was supported by China’s National High Technology Research and Development Program. Validation results demonstrated that the accuracy of the GRNNs-generated GLASS FVC product was comparable to that of the GEOV1 FVC product, which was considered to be the best global FVC product available at the time. Moreover, the spatial and temporal continuities of the GLASS FVC were superior to those of the GEOV1 FVC [27]. Balancing the estimation accuracy and the computational efficiency of the estimation algorithm is a key issue for generating long time series of high-resolution, global-scale FVC products. However, the computational efficiency of using the GRNNs method to generate the GLASS FVC product is currently unsatisfactory. It takes more than one hour to generate one tile of data from MODIS data, which severely restricts the production of the GLASS FVC product. Therefore, a highly efficient method with accuracy comparable to that of the GRNNs method is urgently needed for generating the GLASS FVC product.

Machine learning methods have been applied widely to land surface parameter product generation using remotely sensed data due to their capability of nonlinear fitting and computational efficiency. For example, Baret et al. used back-propagation neural networks (BPNNs) to produce the GEOV1 leaf area index (LAI), fraction of absorbed photosynthetically active radiation (FAPAR), and fraction of vegetation cover (FCOVER) products [1,21,27]. Durbha et al. [28] used support vector regression (SVR) to retrieve LAI from Multi-angle Imaging SpectroRadiometer (MISR) data with satisfactory results. Jiang et al. [29] used multivariate adaptive regression splines (MARS) to generate the long-term GLASS Daytime All-Wave Net Radiation Product. To generate the global FVC product, a satisfactory balance between the accuracy and the computational efficiency of the FVC estimation algorithm is needed. Therefore, the objective of this study was to find an alternative algorithm for generating the GLASS FVC product having both accuracy comparable to that of the GRNNs method and relatively high computational efficiency. In this study, four commonly used and effective machine learning methods, BPNNs, GRNNs, SVR, and MARS, were evaluated to identify the most suitable method for generating the GLASS FVC product.

2. Data and Methods

This study was designed to find a suitable GLASS FVC product algorithm having accuracy comparable to that of the GRNNs method and also high computational efficiency (Figure 1). First, four machine learning methods, GRNNs, BPNNs, SVR, and MARS, were trained on identical samples of different sizes, and the performances of the four methods were evaluated. Then, the method with both suitable accuracy and high computational efficiency was used to generate the GLASS FVC product.

2.1. Training Samples

The training samples used to develop the FVC estimation algorithm from MODIS surface reflectance data with GRNNs were used in this study [27]. The sampling locations consisted of the BEnchmark Land Multi-site Analysis and Intercomparison of Products (BELMANIP) sites, specific FLUXNET sites that neither overlapped nor lay close to the BELMANIP sites, and Validation of Land European Remote Sensing Instruments (VALERI) sites with ground-measured FVC data. In total, approximately 500 global sampling locations were selected for use in the present study. The sampling locations are in relatively flat and homogeneous areas, are globally distributed, and cover all types of vegetation. The red and NIR band reflectance in the reprocessed MODIS data of the 25 pixels (5 × 5) surrounding the global sampling locations were extracted, and the corresponding Landsat Thematic Mapper (TM)/Enhanced Thematic Mapper Plus (ETM+) FVC pixels for each extracted MODIS pixel were averaged as the sampling FVC of the MODIS pixel. The Landsat TM/ETM+ FVC data were derived from a dimidiate pixel model based on the terrestrial ecoregions and vegetation types [27]. To remove abnormal samples and guarantee the reliability and stability of the training samples, the samples were refined further. The sampling FVC values were plotted against the normalized difference vegetation index (NDVI) values, which were computed from the MODIS reflectance in the red and NIR bands. Then, for each NDVI value class (20 classes over the [0, 1] domain of variation), the cases with FVC values lower than the 5th percentile or higher than the 95th percentile were removed from the sample datasets. After further optimization, 16,980 cases with consistent MODIS surface reflectance (reflectance in the red and NIR bands) paired with refined sampling FVC values were generated. 
The sampling dataset was split randomly into a training dataset composed of 90% of the available data and an essentially independent validation dataset composed of the remaining 10% of the sampling dataset, which was used to evaluate the theoretical performances of the four models. Further details regarding the training samples can be found in the study of Jia et al. [27].
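The refinement and splitting steps described above can be sketched as follows. This is a minimal illustration, not the procedure of [27]: the function names, the linear-interpolation percentile rule, the handling of NDVI values outside [0, 1] (dropped here), and the random seed are all assumptions made for the example.

```python
import random

def ndvi(red, nir):
    """NDVI from red and NIR reflectance (assumes nir + red > 0)."""
    return (nir - red) / (nir + red)

def percentile(sorted_vals, q):
    """Percentile q (0-100) of a sorted list, using linear interpolation."""
    pos = (len(sorted_vals) - 1) * q / 100.0
    lo = int(pos)
    hi = min(lo + 1, len(sorted_vals) - 1)
    return sorted_vals[lo] + (sorted_vals[hi] - sorted_vals[lo]) * (pos - lo)

def refine_samples(samples, n_classes=20, low=5, high=95):
    """Keep samples whose FVC lies within the [low, high] percentiles
    of their NDVI class. samples: list of (red, nir, fvc) tuples."""
    classes = {}
    for s in samples:
        v = ndvi(s[0], s[1])
        if 0.0 <= v <= 1.0:  # assumption: samples outside [0, 1] are dropped
            k = min(int(v * n_classes), n_classes - 1)
            classes.setdefault(k, []).append(s)
    kept = []
    for members in classes.values():
        fvcs = sorted(m[2] for m in members)
        lo_cut, hi_cut = percentile(fvcs, low), percentile(fvcs, high)
        kept.extend(m for m in members if lo_cut <= m[2] <= hi_cut)
    return kept

def split_samples(samples, train_fraction=0.9, seed=0):
    """Random split into training and validation subsets."""
    shuffled = samples[:]
    random.Random(seed).shuffle(shuffled)
    n_train = round(len(shuffled) * train_fraction)
    return shuffled[:n_train], shuffled[n_train:]
```

With 16,980 refined cases, a 90%/10% split of this kind yields 15,282 training samples and 1698 validation samples, which matches the largest training sample size reported in Section 3.1.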

2.2. Four Machine Learning Methods for FVC Estimation

GRNNs model

GRNNs, a generalization of radial basis function networks and probabilistic neural networks, were developed by Donald Specht [30,31]. Jia et al. [27] applied GRNNs to the estimation of FVC and demonstrated that GRNNs were reliable for FVC estimation. GRNNs contain four layers: the input, pattern, summation, and output layers. The input layer provides all of the measurement variables to all of the neurons in the pattern layer; each pattern neuron represents a training pattern, and its output is a measure of the distance of the input from the stored pattern. The summation layer consists of two types of summation neurons: one type computes the sum of the weighted outputs of the pattern layer, and the other type computes the sum of the unweighted outputs of the pattern neurons. Finally, the output layer performs a normalization step to compute the predicted value of the output variable. In this study, the input variables to the GRNNs for retrieving FVC were the reprocessed MODIS reflectance values in the red and NIR bands, and the output variable was the corresponding FVC. A Gaussian function was used as the kernel function of the GRNNs, and the fundamental formulation was expressed as follows:

Y^{'} (X) = \frac{\sum_{i = 1}^{n} Y^{i} exp (- \frac{D_{i}^{2}}{2 σ^{2}})}{\sum_{i = 1}^{n} exp (- \frac{D_{i}^{2}}{2 σ^{2}})},

(1)

D_{i}^{2} = {(X - X^{i})}^{T} (X - X^{i}),

(2)

where D_{i}^{2} represents the squared Euclidean distance between the input vector X and the i-th training input vector X^{i}, Y^{i} is the output vector corresponding to X^{i}, Y' (X) is the estimation corresponding to X, n is the number of samples, and σ is a smoothing parameter that controls the size of the receptive region. Because the architecture and weights of GRNNs are determined by the input, the training of GRNNs consists essentially of optimizing the smoothing parameter σ [27]. The smoothing parameter significantly affects the prediction accuracy of the GRNNs, and a suitable smoothing parameter was found by using the holdout method. The holdout method for a particular σ consisted of removing one sample from the training data at a time and then constructing GRNNs based on all the other training samples. The GRNNs were then used to estimate Y for the removed sample. By repeating this process for each sample and storing each estimate, the mean squared error between the actual sample values Y^{i} and the estimates could be evaluated. The value of σ giving the smallest error was used in the final GRNNs [30,32]. The training process was completed when the minimum of the cost function of the smoothing parameter was reached as follows:

f (σ) = \frac{1}{n} \sum_{i = 1}^{n} {[\hat{Y_{i}} (X_{i}) - Y_{i}]}^{2},

(3)

where \hat{Y_{i}} (X_{i}) is the estimation corresponding to X_{i} using the GRNNs trained over all of the training samples except the i-th sample. The widely used shuffled complex evolution method developed at the University of Arizona was selected to obtain the optimal smoothing parameter of the GRNNs [33,34]. The GRNNs method was implemented with the Visual Studio 2012 platform (Microsoft Corporation, Redmond, WA, USA).
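Equations (1)–(3) can be illustrated with the short sketch below. It is not the implementation used in this study: the function names are hypothetical, the fallback for a vanishing denominator is an added safeguard, and a simple search over a list of candidate σ values stands in for the shuffled complex evolution optimizer.

```python
import math

def grnn_predict(x, train_x, train_y, sigma):
    """Eq. (1): Gaussian-kernel weighted mean of the training outputs."""
    num = den = 0.0
    for xi, yi in zip(train_x, train_y):
        d2 = sum((a - b) ** 2 for a, b in zip(x, xi))  # Eq. (2)
        w = math.exp(-d2 / (2.0 * sigma ** 2))
        num += w * yi
        den += w
    # Assumption: if every weight underflows to zero, return the plain mean.
    return num / den if den > 0.0 else sum(train_y) / len(train_y)

def holdout_cost(train_x, train_y, sigma):
    """Eq. (3): mean squared error when each sample is held out in turn."""
    n = len(train_x)
    total = 0.0
    for i in range(n):
        rest_x = train_x[:i] + train_x[i + 1:]
        rest_y = train_y[:i] + train_y[i + 1:]
        estimate = grnn_predict(train_x[i], rest_x, rest_y, sigma)
        total += (estimate - train_y[i]) ** 2
    return total / n

def best_sigma(train_x, train_y, candidates):
    """Candidate sigma with the smallest holdout cost (stand-in for SCE-UA)."""
    return min(candidates, key=lambda s: holdout_cost(train_x, train_y, s))
```

Each prediction loops over every stored training sample, and the holdout cost repeats this for each held-out sample, which gives some intuition for why the method becomes slow on large training sets.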

BPNNs

BPNNs, a popular type of neural network, have proven to be effective for estimating land surface vegetation variables, such as LAI and FAPAR [1,35]. Therefore, BPNNs were included in the comparison. The BPNNs training procedure is divided into two parts: a forward propagation of information and a backward propagation of error. The back-propagation algorithm adjusts the weights in each successive layer to reduce the errors at each level. Information is transmitted in one direction only: it enters at the input layer, is processed in the hidden layer, and is finally passed to the output layer. The state of each layer affects only the state of the next layer. If the anticipated outcome is not generated in the output layer, the algorithm switches to back-propagation, and the error between the outcome and the expected value is returned along the original path. In the present study, the BPNNs first learned the relationships between reflectance and FVC from the training dataset; the trained BPNNs could then produce FVC estimates from the actual reflectance of the remotely sensed data. The inputs of the BPNNs were the reflectance of the red and NIR bands, and the output was the corresponding FVC. The number of nodes in the hidden layer was set to four. The activation function in the hidden layer was set to “tansig”, the transfer function for the output layer was set to “purelin”, and the training function was set to “trainlm”. Because of its efficient convergence capacity, the Levenberg–Marquardt minimization algorithm was used to calibrate the synaptic coefficients [36]. The BPNNs modeling was implemented with the Matlab 2014a platform (The MathWorks, Natick, MA, USA).
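The forward pass of such a network (two inputs, four hidden nodes, one output) can be sketched as below. The weights are hypothetical placeholders rather than fitted values, and the Levenberg–Marquardt training step is not shown; the hyperbolic tangent is used because “tansig” is mathematically equivalent to tanh, and “purelin” is the identity.

```python
import math

def bpnn_forward(x, w_hidden, b_hidden, w_out, b_out):
    """Forward pass: tanh hidden layer ("tansig"), linear output ("purelin")."""
    hidden = [math.tanh(sum(w * xi for w, xi in zip(row, x)) + b)
              for row, b in zip(w_hidden, b_hidden)]
    return sum(w * h for w, h in zip(w_out, hidden)) + b_out

# Hypothetical weights for a 2-4-1 network (red, NIR -> FVC); not fitted values.
W_HIDDEN = [[-2.0, 1.5], [0.5, -1.0], [1.0, 2.0], [-0.5, 0.5]]
B_HIDDEN = [0.1, -0.2, 0.0, 0.3]
W_OUT = [0.4, -0.3, 0.5, 0.2]
B_OUT = 0.1

fvc = bpnn_forward([0.05, 0.40], W_HIDDEN, B_HIDDEN, W_OUT, B_OUT)
```

Once trained, a prediction costs only a handful of multiplications per pixel, independent of the training sample size.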

SVR model

SVR is a commonly used machine learning method for solving nonlinear regression estimation problems [37]. Given a set of data points G = \{(x_{i}, d_{i})\}_{i = 1}^{n} (x_{i} is the input vector, d_{i} is the desired value, and n is the total number of data patterns), SVR approximates the function f(x) using the following equation:

f (x) = ω \times τ (x) + b,

(4)

where ω is the weight vector, b is the bias, and τ(x) is the non-linear mapping associated with the kernel function, which transforms the inputs into a high-dimensional feature space in which the non-linear relationship becomes linear. Unlike the traditional regression model, in which coefficients are estimated by minimizing the squared loss, SVR applies the so-called ε-insensitivity loss function (L_{ε}) to estimate its parameters:

L_{ε} = \begin{cases} | f (x) - y | - ε, & \text{if } | f (x) - y | \geq ε \\ 0, & \text{otherwise} \end{cases},

(5)

where y is the desired (target) output and ε is defined as the region of ε-insensitivity. When the predicted value falls into the band area, the loss is zero. In contrast, if the predicted value falls outside the band area, the loss is equal to the difference between the predicted value and the margin.
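The loss in Equation (5) is simple enough to state directly in code. The function name is illustrative; the default ε = 0.1 is the value adopted in this study.

```python
def eps_insensitive_loss(prediction, target, eps=0.1):
    """Eq. (5): zero inside the eps-band, linear in the excess outside it."""
    residual = abs(prediction - target)
    return residual - eps if residual >= eps else 0.0
```

For example, with a target FVC of 0.5, a prediction of 0.55 incurs no loss, whereas a prediction of 0.8 incurs a loss of 0.2 (the residual of 0.3 minus the band half-width of 0.1).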

When empirical risk and structure risk are considered together, the SVR model can be constructed by solving the following quadratic programming problem:

\min : \frac{1}{2} z^{T} z + C \sum_{i} (ξ_{i} + ξ_{i}^{*}), \quad \text{subject to} \quad \begin{cases} y_{i} - z^{T} x_{i} - b \leq ε + ξ_{i}, \\ z^{T} x_{i} + b - y_{i} \leq ε + ξ_{i}^{*}, \\ ξ_{i}, ξ_{i}^{*} \geq 0, \end{cases}

(6)

where z is the weight vector (ω in Equation (4)); i = 1,…,n indexes the training data; (ξ_{i} + ξ_{i}^{*}) is the empirical risk; \frac{1}{2} z^{T} z is the structure risk, which prevents over-learning and poor generalization; and C is the regularization constant, which determines the trade-off between the empirical risk and the regularization term. This study adopted the general form of the SVR-based regression function, defined as follows [38]:

y (x) = \sum_{i = 1}^{n} (α_{i} - {α_{i}}^{*}) K (x, x_{i}) + b,

(7)

where α_{i} and {α_{i}}^{*} are Lagrangian multipliers that satisfy α_{i} \times {α_{i}}^{*} = 0, n is the number of support vectors, and b is the bias. K is the kernel function, which in this study was the radial basis function (RBF) kernel, one of the most widely used SVR kernel functions [37,38,39], defined as K (x, x_{i}) = \exp (- γ ∥ x - x_{i} ∥^{2}), where γ determines the width of the RBF.
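A minimal sketch of Equation (7) with the RBF kernel is given below. The function and argument names are hypothetical, and `dual_coefs[i]` stands for the difference (α_i − α_i*) of a support vector; in practice these values come from the trained model rather than being set by hand.

```python
import math

def rbf_kernel(x, xi, gamma):
    """K(x, x_i) = exp(-gamma * ||x - x_i||^2)."""
    return math.exp(-gamma * sum((a - b) ** 2 for a, b in zip(x, xi)))

def svr_predict(x, support_vectors, dual_coefs, bias, gamma):
    """Eq. (7): kernel expansion over the support vectors plus the bias."""
    return bias + sum(c * rbf_kernel(x, sv, gamma)
                      for c, sv in zip(dual_coefs, support_vectors))
```

The cost of one prediction grows with the number of support vectors, so a model trained on many samples can be slow to apply pixel by pixel.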

Parameters C, ε, and γ must be determined for SVR training. To obtain the optimal SVR model, the training dataset was randomly divided further into 90% and 10% proportions for model building and testing, respectively, and the grid search method [40] was applied to determine the parameter set of C and γ that generated the minimum forecasting mean square error. In the search, C and γ were increased exponentially with base 2, with the exponent ranging over [−8, 8] in steps of 0.8. Finally, ε was determined based on the training data and set to 0.1. In this study, the FVC estimation utilized the Library for Support Vector Machines (LIBSVM) and was implemented with the Matlab 2014a platform.
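The search grid described above can be enumerated as in the following sketch; the function names are illustrative, and the model fitting and error evaluation for each pair are omitted.

```python
def exponent_grid(start=-8.0, stop=8.0, step=0.8):
    """Exponents start, start + step, ..., stop (inclusive)."""
    n = int(round((stop - start) / step))
    return [start + k * step for k in range(n + 1)]

def parameter_grid():
    """All (C, gamma) pairs with C = 2**a and gamma = 2**b."""
    exps = exponent_grid()
    return [(2.0 ** a, 2.0 ** b) for a in exps for b in exps]

grid = parameter_grid()  # 21 exponents per parameter, 441 pairs in total
```

The optimal values reported in Section 3.1 are consistent with this grid: 2^4.8 ≈ 27.8576 for C and 2^0.8 ≈ 1.7411 for γ.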

MARS model

MARS is a nonparametric and multivariate regression analysis model presented by Jerome H. Friedman [39]. MARS essentially builds flexible models by fitting piecewise linear regressions; that is, the non-linearity of a model is approximated through the use of separate linear regression slopes in distinct intervals of the independent variable space. In addition to searching for variables sequentially, MARS also searches for the interactions between variables, allowing any degree of interaction to be considered as long as it provides a better fit to the data.

MARS builds models of the form [39]:

f (x) = \sum_{i = 1}^{k} c_{i} B_{i} (x),

(8)

where B_{i} (x) are piecewise linear basis functions, and each c_{i} is a constant coefficient. Each piecewise linear basis function takes the following form:

\max (0, x - c) \ \text{(nonzero for } x - c > 0\text{)} \quad \text{or} \quad \max (0, c - x) \ \text{(nonzero for } c - x > 0\text{)},

(9)

where c is a constant called the knot.
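Equations (8) and (9) can be sketched as follows. The model below is purely illustrative: the knots, coefficients, and explicit intercept are hypothetical values chosen for the example, not the fitted GLASS FVC model. Products of hinge functions on different variables represent the interaction terms mentioned above.

```python
def hinge(x, knot, direction=+1):
    """Eq. (9): max(0, x - knot) for direction=+1, max(0, knot - x) for -1."""
    return max(0.0, direction * (x - knot))

def mars_predict(x, intercept, terms):
    """Eq. (8) with an explicit intercept.

    terms: list of (coefficient, factors); each factor is
    (variable_index, knot, direction). Factors within a term multiply.
    """
    total = intercept
    for coef, factors in terms:
        basis = 1.0
        for var, knot, direction in factors:
            basis *= hinge(x[var], knot, direction)
        total += coef * basis
    return total

# Hypothetical model on (red, NIR) reflectance; values are illustrative only.
TERMS = [
    (1.2, [(1, 0.20, +1)]),                 # max(0, NIR - 0.20)
    (-2.0, [(0, 0.10, -1)]),                # max(0, 0.10 - red)
    (0.5, [(0, 0.05, +1), (1, 0.30, +1)]),  # red-NIR interaction term
]
```

Evaluating such a model requires only a few comparisons and multiplications per pixel, regardless of how many samples were used for training.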

The inputs of the MARS model were the reflectance of the red and NIR bands, and the output was the corresponding FVC. The optimal MARS model is determined by a two-stage process. First, MARS constructs a very large number of basis functions, in which variables are allowed to interact with each other, to fit the data. Then, the basis functions are deleted in order of least contribution using the generalized cross-validation (GCV) criterion [40]. The importance of a variable can be assessed by observing the decrease in the calculated GCV when it is removed from the model. MARS is capable of reliably tracking the complex data structures hidden in high-dimensional data. Additional details regarding the MARS model building process can be found in [40]. In this study, the best number of basis functions was determined using the GCV method. MARS was implemented with the Matlab 2014a platform.

The four aforementioned methods were trained with identical training samples of varying sample sizes, and their performances were evaluated based on independent validation. The coefficient of determination (R²) and root mean square error (RMSE) were selected to evaluate the comparison results. Moreover, reprocessed MODIS reflectance data for BELMANIP sites containing all kinds of vegetation types in the year 2003 were entered into the four machine learning algorithms, and the outputs of BPNNs, SVR, and MARS were compared with those of GRNNs, which have been proven to be very effective for FVC estimation. In this study, all four methods were run under the Microsoft Windows 7 operating system (Microsoft Corporation, Redmond, WA, USA) on a 3.20 GHz Intel Core PC with 8 GB of memory.
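The two evaluation metrics can be computed as in the sketch below. The text does not state which definition of R² was used; the squared Pearson correlation is assumed here, and the function names are illustrative.

```python
import math

def rmse(estimates, references):
    """Root mean square error between estimates and reference values."""
    n = len(references)
    return math.sqrt(sum((e - r) ** 2 for e, r in zip(estimates, references)) / n)

def r_squared(estimates, references):
    """Squared Pearson correlation (assumed definition of R^2)."""
    n = len(references)
    me, mr = sum(estimates) / n, sum(references) / n
    cov = sum((e - me) * (r - mr) for e, r in zip(estimates, references))
    ve = sum((e - me) ** 2 for e in estimates)
    vr = sum((r - mr) ** 2 for r in references)
    return cov * cov / (ve * vr)
```

Under this definition, R² measures how well the estimates follow the reference values linearly and is insensitive to a constant bias, which is why RMSE is reported alongside it.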

2.3. Spatial-Temporal Comparison and Direct Validation

After training with the optimal training sample size, the four methods were used to generate the global land surface FVC from reprocessed MODIS reflectance data. The MODIS land cover product (MCD12Q1) was used as a priori information: the FVC values in the vegetated regions, including eight main biomes, were estimated from the reprocessed MODIS surface reflectance data using the proposed method, and the FVC values in the non-vegetated regions were set to zero. Because the GRNNs method has accuracy comparable to that of the GEOV1 FVC product and improved spatial and temporal continuities [27], the accuracy and the spatial and temporal consistencies of the machine learning methods were compared with those of the GRNNs method to determine a suitable alternative to the GRNNs method for generating the GLASS FVC product.

Assessment and validation of moderate-resolution FVC products are generally difficult because ground point measurements are not suitable for direct comparisons due to surface heterogeneity. High spatial resolution remote sensing data can be used to scale the ground measurements up to moderate spatial resolution pixels for comparison and evaluation. In this study, high-resolution FVC maps from VALERI (accessed at http://w3.avignon.inra.fr/valeri) [41] were used to validate the machine learning methods and to compare them with the GRNNs method. Following the guidelines defined by the Land Product Validation (LPV) subgroup of the Committee on Earth Observation Satellites Working Group on Calibration and Validation (CEOS WGCV), an empirical transfer function between the high-resolution reflectance data and the FVC ground measurements for a site was established to derive a high-resolution FVC map that was then aggregated to the moderate-resolution products for comparison [42]. Most of these sites are sized at 3 km × 3 km (some are larger), and multi-date measurements are available at specific sites. In total, 44 high-resolution FVC maps over 27 VALERI sites [27] were selected to validate the FVC estimates of the four machine learning methods assessed in this study. Information about these locations, such as validation site positions, land cover types, dates of ground measurement, and mean FVC values from the validation site areas, is given in Table 1. The sites cover various land cover types, including grassland, cropland, shrubs, and forest. Therefore, the validation results in this study are representative.

In this study, to obtain improved spatial matching of the FVC estimates from the MODIS data and high-resolution remote sensing data of the VALERI sites, the FVC data estimated using MODIS data were converted to the same projection as those of the corresponding high-resolution FVC maps. Then, the averaged FVC estimates corresponding to the scope of the high-resolution maps were extracted. Meanwhile, the extracted FVC values were linearly interpolated to the acquisition date of the corresponding high-resolution FVC maps. Finally, the performances of the machine learning methods were directly validated using these extracted FVC samples.
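The spatial averaging and temporal interpolation steps can be sketched as follows. The function names are hypothetical, and reprojection of the MODIS estimates is assumed to have been done beforehand; any dates passed in are simply those of the two FVC estimates bracketing the acquisition date of the high-resolution map.

```python
from datetime import date

def site_mean(fvc_pixels):
    """Average the FVC estimates that fall inside a validation site."""
    return sum(fvc_pixels) / len(fvc_pixels)

def interpolate_fvc(target, date_before, fvc_before, date_after, fvc_after):
    """Linearly interpolate FVC between two product dates to a target date."""
    span = (date_after - date_before).days
    if span == 0:
        return fvc_before
    weight = (target - date_before).days / span
    return fvc_before + weight * (fvc_after - fvc_before)
```

For instance, if the site-mean estimates are 0.40 and 0.60 on two dates eight days apart and the ground campaign falls midway between them, the interpolated value is 0.50.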

3. Results and Discussion

3.1. Training Accuracy and Computational Efficiency

The performances of the four machine learning methods with different training sample sizes were evaluated by R² and RMSE, and the results are shown in Figure 2. The smoothing parameter for the GRNNs method, the C and γ parameters for the SVR method, and the maximum number of model terms for the MARS method are determined in the training process. In this study, only the parameters with training sample size of 15,282, which achieved the best training results, are presented. The determined optimal smoothing parameter for FVC estimation from MODIS data for the GRNNs method was 0.0042, the C and γ for the SVR method were 27.8576 and 1.7411, respectively, and the best number of the basis functions for the MARS method was 21. In addition, the independent validation results from the 15,282 training samples are summarized in Table 2.

Figure 2 shows that R² increases while RMSE decreases with sample size, with some fluctuations. Generally, GRNNs performed slightly better than SVR and slightly worse than MARS and BPNNs. All of the four methods had satisfactory performance (R² > 0.96, RMSE < 0.07) given sufficient training samples. The BPNNs and MARS had lower sensitivity than the GRNNs to training sample size variations, and SVR was very sensitive to training sample size. All of the four methods began to have stable performance when the training sample size was large enough. In addition, a turning point was noted between the sample sizes of 304 and 456. Prior to the sample size of 304, the values of R² and RMSE changed quickly, and the four models were sensitive to sample size variation. Subsequent to the training sample size of 456, the performances of the four methods, particularly the MARS method, began to stabilize, and the R² and RMSE values showed only slight improvement. With a training sample size of 15,282, over-fitting was not observed, and all of the four methods, particularly the MARS method, presented their best performance (R² > 0.96, RMSE < 0.07). Therefore, 15,282 training samples were used as the final training data set.

The computation time required for model training and estimation differed considerably among the four methods (Table 2). The computational efficiency of the GRNNs and SVR methods was very low when large datasets were used; therefore, these two methods were not suitable for generating a long time series FVC product. Additionally, when the training sample size was reduced, their estimation accuracy decreased. Although the four methods were implemented on different platforms, the comparison of computation time still reasonably supports the conclusion that the computational efficiency of MARS and BPNNs was better than that of GRNNs and SVR.

One year of FVC estimates was also derived using the four methods at the BELMANIP sites containing all kinds of vegetation types. Because ground truth data were lacking and GRNNs have proven to be very effective for FVC estimation, the FVC estimates of the other three methods were compared with those of the GRNNs method. Scatterplots between the GRNNs-derived FVC and the BPNNs-, SVR-, and MARS-derived FVCs are shown in Figure 3. The results show that the other three methods performed well in estimating FVC and had good consistency with the GRNNs method (R² > 0.98, RMSE < 0.037). However, the BPNNs method slightly overestimated FVC for values from 0.05 to 0.2. In contrast, the MARS method had better consistency with the GRNNs method than the BPNNs and SVR methods.

Due to its high computational efficiency and estimation accuracy, the MARS method was preliminarily selected for subsequent direct validation and spatial–temporal comparison with the GRNNs method, and the BPNNs and SVR methods were not explored further.

3.2. Spatial-Temporal Comparison and Direct Validation

The MARS method was directly validated, and the spatial and temporal consistencies between the global FVC maps derived from the MARS and the GRNNs methods were compared. Figure 4a,b shows the global FVC maps from the MARS and GRNNs methods, respectively, on day 201 of 2003 (20 July 2003). A difference map between the two global FVC maps is also presented in Figure 4c for comparison. Figure 4 shows that there was adequate spatial agreement between the MARS-derived and GRNNs-derived FVC from the MODIS data, and no missing data were observed. Quantitatively, approximately 99.95% of the difference values were located between −0.1 and 0.1. Furthermore, the distributions of the FVC values of both data sets were consistent with the actual conditions of the land cover distributions. Therefore, the global FVC map generated by the MARS model had sufficient spatial consistency with, and FVC values very similar to, the map generated by the GRNNs method, as well as satisfactory spatial continuity.
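The difference-map statistic quoted above can be computed as in this sketch. The function names are illustrative, the two maps are assumed to be flattened to co-located pixel lists, and whether the bounds −0.1 and 0.1 are inclusive is not stated in the text (they are treated as inclusive here). A small helper also shows the day-of-year convention used for the map date.

```python
from datetime import date, timedelta

def day_of_year_to_date(year, doy):
    """Convert a day-of-year index (1 = 1 January) to a calendar date."""
    return date(year, 1, 1) + timedelta(days=doy - 1)

def fraction_within(map_a, map_b, tolerance=0.1):
    """Share of co-located pixels with |FVC difference| <= tolerance."""
    diffs = [a - b for a, b in zip(map_a, map_b)]
    return sum(1 for d in diffs if abs(d) <= tolerance) / len(diffs)
```

Calling `day_of_year_to_date(2003, 201)` returns 20 July 2003, in agreement with the date given for Figure 4.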

The FVC temporal profiles of the sampling sites in the year of 2003 were also generated to compare the temporal consistency of the MARS derived and GRNNs derived FVC. Several representative FVC temporal profiles for cropland, grassland, shrubland and forestland, are shown in Figure 5. These representative FVC temporal profiles show good temporal consistency between the two data sets. The two data sets also have similar magnitude and dynamic range, and no missing data are observed. Moreover, the temporal profiles extracted from the MARS method-derived FVC are very smooth, whereas those extracted from the GRNNs method show very slight fluctuations. Furthermore, the two data sets exhibit similar seasonal variations, which could reflect actual vegetation growth characteristics. These seasonal vegetation changes and the temporal consistency between the MARS derived FVC and the GRNNs derived FVC indicated that the MARS method for FVC estimation is reliable and capable of revealing actual earth surface variations. In summary, the results of this study indicate that the MARS method-derived FVC were temporally continuous and had good temporal consistency with the GRNNs derived FVC.

To further evaluate the performance of the MARS method, 44 ground measurement-based samples from VALERI sites were used to directly validate the MARS results and compare them with those from the GRNNs method (Figure 6). There was reliable agreement between the ground measurements and the FVC estimated by both the MARS and GRNNs methods. The performance of the MARS method (R² = 0.836, RMSE = 0.1488) was similar to that of the GRNNs method (R² = 0.8353, RMSE = 0.1495), which indicated an acceptable level of consistency between their FVC estimates. The validation results of the GRNNs method in this study also differ slightly from those reported in [27]. The difference arises because the FVC in [27] was first aggregated to the scale of the GEOV1 FVC product (spatial resolution of 1 km) to compare their performances. Therefore, the small differences are reasonable.
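The two validation statistics reported above can be sketched as follows. This is a minimal illustration that assumes R² is the squared Pearson correlation between estimated and measured FVC (the text does not state which definition was used) and that RMSE is the usual root mean squared difference. The function name `r2_and_rmse` and the estimated values are hypothetical; only the three ground values are taken from Table 1.

```python
import math

def r2_and_rmse(estimated, measured):
    """Squared Pearson correlation and RMSE of two equal-length sequences."""
    n = len(measured)
    mean_e = sum(estimated) / n
    mean_m = sum(measured) / n
    cov = sum((e - mean_e) * (m - mean_m) for e, m in zip(estimated, measured))
    var_e = sum((e - mean_e) ** 2 for e in estimated)
    var_m = sum((m - mean_m) ** 2 for m in measured)
    r2 = cov ** 2 / (var_e * var_m)      # assumed definition of R²
    rmse = math.sqrt(
        sum((e - m) ** 2 for e, m in zip(estimated, measured)) / n
    )
    return r2, rmse

# Ground FVC of the first three Table 1 rows, paired with made-up estimates
ground = [0.236, 0.414, 0.647]
estimate = [0.250, 0.400, 0.700]

r2, rmse = r2_and_rmse(estimate, ground)
print(f"R2 = {r2:.4f}, RMSE = {rmse:.4f}")
```

With all 44 samples, the same function would be called once per method to produce a pair of values comparable to those shown in Figure 6.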

The MARS method had both high computational efficiency and estimation accuracy comparable to that of the GRNNs method. The global FVC maps generated from MARS were highly consistent with those generated from GRNNs. Furthermore, the MARS temporal profiles of various types of representative vegetation were smoother than those of GRNNs and capable of reflecting true growth trends. Overall, the MARS model was determined to be the most suitable method for generating the GLASS FVC product.

4. Conclusions

In this study, four commonly used machine learning methods were evaluated for their capability to generate the GLASS FVC product. The four machine learning methods were trained with identical sample data from global sampling locations, and the MARS model was preliminarily selected as the most suitable method after comparing the fitting accuracies and computational efficiencies of the four methods. Furthermore, a direct validation using ground FVC measurements and a comparison of the spatial and temporal consistencies of MARS derived and GRNNs derived FVC indicated that the accuracy of the MARS method was comparable to the accuracy of the GRNNs method for global FVC estimates. In addition, the computational efficiency of the MARS method is superior to that of the GRNNs method. Therefore, the MARS method is a suitable alternative to the GRNNs method for generating the GLASS FVC product from MODIS data. Further work should focus on an extensive assessment of the performance of the MARS method using additional ground measurement data.

Acknowledgments

This study was partially supported by the National Natural Science Foundation of China (No. 41301353 and 41331173) and the National High Technology Research and Development Program of China (No. 2013AA122801).

Author Contributions

K.J., L.Y. and S.L. conceived and designed the experiments; L.Y. performed the experiments; K.J. and L.Y. conducted the analysis of the results; and all of the aforementioned contributed towards writing the manuscript.

Conflicts of Interest

The authors declare no conflict of interest.

References

1. Baret, F.; Weiss, M.; Lacaze, R.; Camacho, F.; Makhmara, H.; Pacholcyzk, P.; Smets, B. GEOV1: LAI and FAPAR essential climate variables and FCOVER global time series capitalizing over existing products. Part 1: Principles of development and production. Remote Sens. Environ. 2013, 137, 299–309.
2. Gitelson, A.A.; Kaufman, Y.J.; Stark, R.; Rundquist, D. Novel algorithms for remote estimation of vegetation fraction. Remote Sens. Environ. 2002, 80, 76–87.
3. Wu, D.; Wu, H.; Zhao, X.; Zhou, T.; Tang, B.; Zhao, W.; Jia, K. Evaluation of spatiotemporal variations of global fractional vegetation cover based on GIMMS NDVI data from 1982 to 2011. Remote Sens. 2014, 6, 4217–4239.
4. Qin, W.; Zhu, Q.K.; Zhang, X.X.; Li, W.H.; Fang, B. Review of vegetation covering and its measuring and calculating method. J. Northwest SCI Tech. Univ. Agric. For. 2006, 34, 164–170.
5. Gutman, G.; Ignatov, A. The derivation of the green vegetation fraction from NOAA/AVHRR data for use in numerical weather prediction models. Int. J. Remote Sens. 1998, 19, 1533–1543.
6. Matsui, T.; Lakshmi, V.; Small, E.E. The effects of satellite-derived vegetation cover variability on simulated land-atmosphere interactions in the NAMS. J. Clim. 2005, 18, 21–40.
7. Zhang, X.; Wu, B.; Ling, F.; Zeng, Y.; Yan, N.; Yuan, C. Identification of priority areas for controlling soil erosion. Catena 2010, 83, 76–86.
8. Roujean, J.-L.; Lacaze, R. Global mapping of vegetation parameters from POLDER multiangular measurements for studies of surface-atmosphere interactions: A pragmatic method and its validation. J. Geophys. Res. 2002, 107.
9. Zeng, X.; Dickinson, R.E.; Walker, A.; Shaikh, M.; DeFries, R.S.; Qi, J. Derivation and evaluation of global 1-km fractional vegetation cover data for land modeling. J. Appl. Meteorol. 2000, 39, 826–839.
10. Godínez-Alvarez, H.; Herrick, J.E.; Mattocks, M.; Toledo, D.; Van Zee, J. Comparison of three vegetation monitoring methods: Their relative utility for ecological assessment and monitoring. Ecol. Indic. 2009, 9, 1001–1008.
11. Xiao, J.; Moody, A. A comparison of methods for estimating fractional green vegetation cover within a desert-to-upland transition zone in central New Mexico, USA. Remote Sens. Environ. 2005, 98, 237–250.
12. Carlson, T.N.; Ripley, D.A. On the relation between NDVI, fractional vegetation cover, and leaf area index. Remote Sens. Environ. 1997, 62, 241–252.
13. Jimenez-Munoz, J.C.; Sobrino, J.A.; Plaza, A.; Guanter, L.; Moreno, J.; Martinez, P. Comparison between fractional vegetation cover retrievals from vegetation indices and spectral mixture analysis: Case study of PROBA/CHRIS data over an agricultural area. Sensors 2009, 9, 768–793.
14. Jiapaer, G.; Chen, X.; Bao, A. A comparison of methods for estimating fractional vegetation cover in arid regions. Agric. For. Meteorol. 2011, 151, 1698–1710.
15. Johnson, B.; Tateishi, R.; Kobayashi, T. Remote sensing of fractional green vegetation cover using spatially-interpolated endmembers. Remote Sens. 2012, 4, 2619–2634.
16. Wu, B.; Li, M.; Yan, C.; Zhou, W.; Yan, C. Developing method of vegetation fraction estimation by remote sensing for soil loss equation: A case in the Upper Basin of Miyun Reservoir. Proc. IEEE IGARSS 2004, 6, 4352–4355.
17. Kimes, D.S.; Knyazikhin, Y.; Privette, J.L.; Abuelgasim, A.A.; Gao, F. Inversion methods for physically-based models. Remote Sens. Rev. 2000, 18, 381–439.
18. Baret, F.; Pavageau, K.; Béal, D.; Weiss, M.; Berthelot, B.; Regner, P. Algorithm Theoretical Basis Document for MERIS Top of Atmosphere Land Products (TOA_VEG); Report of ESA Contract AO/1–4233/02/I-LG; INRA-CSE: Avignon, France, 2006; p. 37.
19. Ahmad, S.; Kalra, A.; Stephen, H. Estimating soil moisture using remote sensing data: A machine learning approach. Adv. Water Res. 2010, 33, 69–80.
20. Verger, A.; Baret, F.; Camacho, F. Optimal modalities for radiative transfer-neural network estimation of canopy biophysical characteristics: Evaluation over an agricultural area with CHRIS/PROBA observations. Remote Sens. Environ. 2011, 115, 415–426.
21. Jia, K.; Liang, S.; Gu, X.; Baret, F.; Wei, X.; Wang, X.; Yao, Y.; Yang, L.; Li, Y. Fractional vegetation cover estimation algorithm for Chinese GF-1 wide field view data. Remote Sens. Environ. 2016, 177, 184–191.
22. Camacho, F.; Cernicharo, J.; Lacaze, R.; Baret, F.; Weiss, M. GEOV1: LAI, FAPAR essential climate variables and FCOVER global time series capitalizing over existing products. Part 2: Validation and intercomparison with reference products. Remote Sens. Environ. 2013, 137, 310–329.
23. Mu, X.; Huang, S.; Ren, H.; Yan, G.; Song, W.; Ruan, G. Validating GEOV1 fractional vegetation cover derived from coarse-resolution remote sensing images over croplands. IEEE. J. Sel. Top. Appl. Earth. Obs. Remote. Sens. 2015, 8, 439–446.
24. García-Haro, F.J.; Camacho-de Coca, F.; Meliá Miralles, J. Inter-comparison of SEVIRI/MSG and MERIS/ENVISAT biophysical products over Europe and Africa. In Proceedings of the 2nd MERIS/(A)ATSR User Workshop, Frascati, Italy, 22–26 September 2008; p. 8.
25. Fillol, E.; Baret, F.; Weiss, M.; Dedieu, G.; Demarez, V.; Gouaux, P.; Ducrot, D. Cover fraction estimation from high resolution SPOT HRV&HRG and medium resolution SPOT-VEGETATION sensors. Validation and comparison over South-West France. In Proceedings of the 2nd Recent Advances in Quantitative Remote Sensing Symposium, Torrent, Spain, 25–29 September 2006; pp. 659–663.
26. Camacho de Coca, F.; Jiménez-Muñoz, J.-C.; Martínez, B.; Bicheron, P.; Lacaze, R.; Leroy, M. Prototyping of the fCover product over Africa based on existing CYCLOPES and JRC products for VGT4 Africa. In Proceedings of the 2nd Recent Advances in Quantitative Remote Sensing Symposium, Torrent, Spain, 25–29 September 2006; pp. 722–727.
27. Jia, K.; Liang, S.; Liu, S.; Li, Y.; Xiao, Z.; Yao, Y.; Jiang, B.; Zhao, X.; Wang, X.; Xu, S.; et al. Global land surface fractional vegetation cover estimation using general regression neural networks from MODIS surface reflectance. IEEE Trans. Geosci. Remote Sens. 2015, 53, 4787–4796.
28. Durbha, S.S.; King, R.L.; Younan, N.H. Support vector machines regression for retrieval of leaf area index from multiangle imaging spectroradiometer. Remote Sens. Environ. 2007, 107, 348–361.
29. Jiang, B.; Liang, S.; Ma, H.; Zhang, X.; Xiao, Z.; Zhao, X.; Jia, K.; Yao, Y.; Jia, A. GLASS daytime all-wave net radiation product: Algorithm development and preliminary validation. Remote Sens. 2016, 8, 222.
30. Specht, D.F. A general regression neural network. IEEE Trans. Neural Netw. 1991, 2, 568–576.
31. Specht, D.F. The general regression neural network—Rediscovered. Neural Netw. 1993, 6, 1033–1034.
32. Xiao, Z.; Liang, S.; Wang, J.; Chen, P.; Yin, X.; Zhang, L.; Song, J. Use of General regression neural networks for generating the GLASS Leaf Area Index Product from time-series MODIS surface reflectance. IEEE Trans. Geosci. Remote Sens. 2014, 52, 209–223.
33. Duan, Q.; Sorooshian, S.; Gupta, V. Effective and efficient global optimization for conceptual rainfall-runoff models. Water Resour. Res. 1992, 28, 1015–1031.
34. Xiao, Z.; Wang, J.; Liang, S.; Zhou, H.; Li, X.; Zhang, L.; Jiao, Z.; Liu, Y.; Fu, Z. Variational retrieval of leaf area index from MODIS time series data: Examples from the Heihe river basin, North-West China. Int. J. Remote Sens. 2012, 33, 730–745.
35. Baret, F.; Hagolle, O.; Geiger, B.; Bicheron, P.; Miras, B.; Huc, M.; Berthelot, B.; Niño, F.; Weiss, M.; Samain, O.; et al. LAI, fAPAR and fCover CYCLOPES global products derived from VEGETATION. Remote Sens. Environ. 2007, 110, 275–286.
36. Ngia, L.S.H.; Sjoberg, J. Efficient training of neural nets for nonlinear adaptive filtering using a recursive Levenberg-Marquardt algorithm. IEEE Trans. Sign. Proc. 2000, 48, 1915–1927.
37. Vapnik, V.N. The Nature of Statistical Learning Theory; Springer: New York, NY, USA, 2000; pp. 988–999.
38. Chang, C.C.; Lin, C.J. LIBSVM: A library for support vector machines. ACM Trans. Intel. Syst. Technol. 2011, 2, 389–396.
39. Barron, A.R.; Xiao, X. Multivariate adaptive regression splines. Ann. Stat. 1991, 19, 1–67.
40. Friedman, J.H. Multivariate adaptive regression splines (with discussion). Ann. Stat. 1991, 19, 1–141.
41. Demarez, V.; Duthoit, S.; Baret, F.; Weiss, M.; Dedieu, G. Estimation of leaf area and clumping indexes of crops with hemispherical photographs. Agric. For. Meteorol. 2008, 148, 644–655.
42. Morisette, J.T.; Baret, F.; Privette, J.L.; Myneni, R.B.; Nickeson, J.E.; Garrigues, S.; Shabanov, N.V.; Weiss, M.; Fernandes, R.A.; Leblanc, S.G.; et al. Validation of global moderate-resolution LAI products: A framework proposed within the CEOS land product validation subgroup. IEEE Trans. Geosci. Remote Sens. 2006, 44, 1804–1817.

Figure 1. Flowchart of this study.

Figure 2. Performance of the four methods with different training sample size. (a) Coefficient of determination (R²); (b) Root mean squared error (RMSE).

Figure 3. Scatterplots of the comparison results of the fractional vegetation cover (FVC) estimates from the general regression neural networks (GRNNs) with the three other methods: (a) back-propagation neural networks (BPNNs); (b) support vector regression (SVR); (c) multivariate adaptive regression splines (MARS).

Figure 4. Global land surface FVC maps generated using the MARS and GRNNs methods for day 201 (20 July) of 2003: (a) the MARS method; (b) the GRNNs method; (c) the global FVC difference map between the MARS and GRNNs methods.

Figure 5. Temporal profiles of the MARS derived FVC and the GRNNs derived FVC over sampling sites for year 2003: (a) Evergreen needle forest; (b) Grassland; (c) Cropland; and (d) Open shrubland.

Figure 6. Scatterplots of FVC estimated using the two methods ((a) MARS method; (b) GRNNs method) against the ground FVC measurements.

Table 1. Characteristics of the sites selected for accuracy assessment.

Site Name	Lat (°)	Lon (°)	Land cover	DOY	Year	FVC
Barrax	39.06	−2.10	Cropland	193	2003	0.236
Camerons	−32.60	116.25	Broadleaf forest	63	2004	0.414
Chilbolton	51.16	−1.43	Crops and forest	166	2006	0.647
Counami	5.35	−53.24	Tropical forest	269	2001	0.838
Counami	5.35	−53.24	Tropical forest	286	2002	0.858
Demmin	53.89	13.21	Crops	164	2004	0.586
Donga	9.77	1.78	Grassland	172	2005	0.420
Fundulea	44.41	26.58	Crops	128	2001	0.341
Fundulea	44.41	26.58	Crops	144	2002	0.374
Fundulea	44.41	26.59	Crops	144	2003	0.319
Gilching	48.08	11.32	Crops and forest	199	2002	0.676
Gnangara	−31.53	115.88	Grassland	61	2004	0.221
Gourma	15.32	−1.55	Grassland	244	2000	0.236
Gourma	15.32	−1.55	Grassland	275	2001	0.126
Haouz	31.66	−7.60	Cropland	71	2003	0.248
Hirsikangas	62.64	27.01	Forest	226	2003	0.644
Hirsikangas	62.64	27.01	Forest	190	2004	0.537
Hirsikangas	62.64	27.01	Forest	159	2005	0.442
Hombori	15.33	−1.48	Grassland	242	2002	0.200
Jarvselja	58.29	27.29	Boreal forest	188	2000	0.705
Jarvselja	58.30	27.26	Boreal forest	165	2001	0.783
Jarvselja	58.30	27.26	Boreal forest	178	2002	0.793
Jarvselja	58.30	27.26	Boreal forest	208	2003	0.803
Jarvselja	58.30	27.26	Boreal forest	180	2005	0.842
Jarvselja	58.30	27.26	Boreal forest	112	2007	0.535
Jarvselja	58.30	27.26	Boreal forest	199	2007	0.731
Laprida	−36.99	−60.55	Grassland	311	2001	0.722
Laprida	−36.99	−60.55	Grassland	292	2002	0.534
Larose	45.38	−75.22	Mixed forest	219	2003	0.847
Le Larzac	43.94	3.12	Grassland	183	2002	0.300
Les Alpilles	43.81	4.71	Crops	204	2002	0.349
Plan-de-Dieu	44.20	4.95	Crops	189	2004	0.172
Puechabon	43.72	3.65	Forest	164	2001	0.540
Rovaniemi	66.46	25.35	Crops	161	2004	0.423
Rovaniemi	66.46	25.35	Crops	166	2005	0.497
Sonian forest	50.77	4.41	Forest	174	2004	0.903
Concepcion	−37.47	−73.47	Mixed forest	9	2003	0.455
Hyytiälä	61.85	24.31	Evergreen forest	188	2008	0.461
Sud_Ouest	43.51	1.24	Crops	189	2002	0.352
Turco	−18.24	−68.18	Shrubs	208	2001	0.106
Turco	−18.24	−68.19	Shrubs	240	2002	0.020
Turco	−18.24	−68.19	Shrubs	105	2003	0.044
Wankama	13.64	2.64	Grassland	174	2005	0.036
Zhang Bei	41.28	114.69	Pastures	221	2002	0.353

* DOY: Day of Year; FVC: Fractional vegetation cover.
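Table 1 records each measurement date as a day of year (DOY) plus a year. As a small worked example, the conversion to a calendar date can be done with the Python standard library. The helper name `doy_to_date` is an illustrative assumption.

```python
from datetime import date, timedelta

def doy_to_date(year, doy):
    """Convert a 1-based day of year to a calendar date."""
    return date(year, 1, 1) + timedelta(days=doy - 1)

# Day 201 of 2003 is the date of the global maps in Figure 4
map_date = doy_to_date(2003, 201)
# First row of Table 1: Barrax, DOY 193, year 2003
barrax_date = doy_to_date(2003, 193)

print(map_date.isoformat(), barrax_date.isoformat())
```

The first result reproduces the 20 July 2003 date quoted for day 201 in the text.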

Table 2. The statistical performances of the four machine learning methods with 15,282 training samples.

Model	R²	RMSE	Training Time	Estimation Time (One Tile)
GRNNs	0.9625	0.0645	572.266 s	4772.493 s
BPNNs	0.9617	0.0666	8.691 s	5.629 s
SVR	0.9627	0.0663	34,576.585 s	271.621 s
MARS	0.9645	0.0645	123.173 s	7.479 s

* R²: Coefficient of determination; RMSE: Root mean squared error.
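The efficiency gap in Table 2 is easier to read as a ratio. The short sketch below copies the timings from Table 2 and divides the GRNNs time by the MARS time for each phase. The dictionary layout and the helper name `speedup` are illustrative assumptions.

```python
# Timings in seconds, copied from Table 2
timings = {
    "GRNNs": {"train": 572.266, "estimate": 4772.493},
    "BPNNs": {"train": 8.691, "estimate": 5.629},
    "SVR": {"train": 34576.585, "estimate": 271.621},
    "MARS": {"train": 123.173, "estimate": 7.479},
}

def speedup(baseline, method, phase):
    """How many times faster `method` is than `baseline` in a given phase."""
    return timings[baseline][phase] / timings[method][phase]

train_speedup = speedup("GRNNs", "MARS", "train")
estimate_speedup = speedup("GRNNs", "MARS", "estimate")

print(f"Training: MARS is about {train_speedup:.1f}x faster than GRNNs")
print(f"Estimation (one tile): about {estimate_speedup:.0f}x faster")
```

By these figures, MARS trains between four and five times faster than GRNNs and estimates one tile several hundred times faster. BPNNs is faster still on both counts, so the ratios describe computational cost only, not the overall choice of method.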

© 2016 by the authors; licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC-BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Yang, L.; Jia, K.; Liang, S.; Liu, J.; Wang, X. Comparison of Four Machine Learning Methods for Generating the GLASS Fractional Vegetation Cover Product from MODIS Data. Remote Sens. 2016, 8, 682. https://doi.org/10.3390/rs8080682


Note that from the first issue of 2016, this journal uses article numbers instead of page numbers.
