1. Introduction
Forests perform various functions, including soil and water conservation, carbon sequestration, air purification, and moderating the global climate. According to the Global Forest Resources Assessment 2015, published by the United Nations Food and Agriculture Organization [
1], the global forest area decreased by approximately 129 million hectares from 1990 to 2015, equivalent to the entire area of South Africa, reaching an annual net loss rate of 0.13% [
1]. The UN formally released 17 sustainable development goals in 2015 [
2]; goal 15 was to ensure the sustainable development of terrestrial ecosystems, including forests—and the response to this requires improvements to the conservation and management of forest resources at an unprecedented level. It is now urgent that timely, effective, and accurate forest monitoring methods are established in order to achieve sustainable forest resources management.
Remote sensing and digital image processing enable the observation, identification, classification, and monitoring of forests at a range of spatial, temporal, and thematic scales [
3]. Liu et al. achieved forest type classification for Wuhan, China, using a machine learning algorithm and spectral and spatial features derived from multi-source remote sensing data [
4]. Zhang et al. (2010) calculated spectral features using NIR and infrared bands and identified shrub forests in higher altitude areas located in Dingri County (Tibet Autonomous Region, China) [
5]. Pasquarella et al. (2018) combined spectral and temporal features obtained from Landsat time series images and were able to extract forest type information for the western portion of Massachusetts [
6]. Grabska et al. (2019) calculated temporal–spectral features and used them to retrieve information on various forest types, based on Sentinel-2 time series remote sensing images [
7], while Cheng and Wang identified temporal patterns of different forest types and then combined them with spectral indexes and bands to identify forest types in Hunan, China, accurately and in detail [
8]. Although these studies achieved forest type classification using spectral or temporal features at a local scale, these methods cannot be extended to large-scale forest type mapping owing to the limitations of remote sensing data acquisition, storage, and analysis capabilities [
9,
10]. Remote sensing cloud computing platforms have powerful geo-big data processing capabilities, which can effectively solve these limitations [
11]. However, there is still an urgent challenge concerning how to use current big data processing platforms (such as Google Earth Engine) to design a general method that combines spectral, spatial, and temporal features suitable for large-scale forest type classification.
Using data from Landsat 7 long time series imagery, Hansen et al. (2013) produced global forest area change maps for 2000–2012 [
12], while the Japanese Aviation Administration released a global forest/non-forest cover product at a spatial resolution of 25 m [
13]; however, neither of these exceptional studies contained information on varying forest types. With the growth of cloud-computing technology, such as the Google Earth Engine (GEE) [
11], and machine learning algorithms, the limitations of intensive data processing present for traditional remote sensing image classification have been effectively resolved [
14]. In addition, publicly available medium- and high-resolution remote sensing data sources, such as Landsat and Sentinel satellite images, are becoming increasingly abundant, providing an unprecedented opportunity for data-intensive approaches to remote sensing forest type classification at larger scales.
Therefore, the overarching goal of this study was to explore forest type discrimination for large-scale mapping using spectral, spatial, and temporal (SST) features derived from earth observation big data. Through our research, we want to provide a general and reliable forest type classification method and reveal relative SST feature contributions to different geographical areas.
3. Results
3.1. SST Features
The optimal number of SST features for different regions was obtained by executing RF-RFE 20 times [
19,
27], as shown in
Figure 3 and
Table 3.
NE (
Figure 3a): the highest accuracy (0.8693) occurred when the number of features was 14, and as the number of features increased beyond this, classification accuracy dropped. Closer examination revealed that in this region (
Table 3), the spectral features mainly included redE1, NDVIre1, and NDVIre2 for the growing season, and redE2, NDVIre2, and EVI for the deciduous season. Only three spatial features remained, namely, red_con and redE1_ent from the growing season and nir_var from the deciduous season. The temporal features remaining were NDVI_amp, NDVI_rmse, EVI_pha, and redE2_rmse, with the elevation indicator derived from the DEM also retained.
SW (
Figure 3b): the best accuracy (0.8905) occurred when the number of features reached 24, and it showed a similar trend to NE, although with different remaining features (
Table 3). Among all the features, the number of spectral features was apparently more than that of the other feature types and included the blue, redE1, and NIR bands, and the NDVIre1, NDVI, and EVI indices, from the growing season, along with the green, redE3, and NIR bands, and the NDVIre3 and EVI indices, from the deciduous season. The spatial features remaining were derived from the red, nir, and redE1 bands and included red_con, red_cor, and nir_ent, derived from the growing season, and redE1_var and nir_con from the deciduous season. Temporal features in the SW for forest type classification included redE1_rmse, redE2_rmse, NDVI_amp, NDVI_rmse, EVI_pha, and NDVI_pha and, except for elevation, the auxiliary features were also retained for this aspect.
SO (
Figure 3c): the best-fit number for forest type classification in the SO was 16, with an accuracy of 0.8632. In the determined feature list (
Table 3), spectral features included the blue and redE1 bands, together with the NDVIre1 and NDVIre2 indices for the growing season, along with the blue and redE3 bands and the NDVIre3 index from the deciduous season. The number of spatial features remained the smallest among the SST features, with only three—nir_con and nir_cor from the growing season and red_var from the deciduous season. The temporal features remaining in this region were redE1_amp, redE1_pha, NDVI_pha, and EVI_pha, while for the auxiliary features, elevation and slope remained, affecting the forest type classification.
SNT (
Figure 3d): the peak classification accuracy (0.8630) corresponded to a feature quantity of 21. The detailed feature list indicated that spectral features constituted the largest number, including green, red, nir, NDVI, and EVI for the growing season and blue, green, red, nir, and NDVI for the deciduous season (
Table 3). The only spatial features retained were nir_con and red_con from the growing season and nir_con and red_cor from the deciduous season. The temporal features were all derived from EVI and NDVI indices and included EVI_pha, NDVI_pha, EVI_amp, NDVI_amp, EVI_rmse, and NDVI_rmse, while only elevation remained in the auxiliary feature set.
3.2. Forest Cover Map
The classification results are shown in
Figure 4. The forest cover rate and areal proportion for each forest type were calculated and are presented in
Figure 5.
NE (
Figure 4a): forest types mainly included MIF, DBF, and DNF, which were mostly distributed around the Daxing’an and Xiaoxing’an Mountains and around Changbai Mountain. It was apparent that the proportion of DNF in NE China was significantly higher than in the other three regions examined, and was distributed in the east and north of the Daxing’an Mountains. This was consistent with the fact that NE China is the largest DNF-producing area nationally [
28]. It could also be seen that forest resources in NE were mainly concentrated in the border areas (adjacent to Russia, North Korea, and Mongolia), which included more forests than other areas. The forest coverage rate in NE was approximately 30.95%, with MIF contributing 56.07% of this, followed by DBF (~30.16%). The results showed that ENF contributed the least, accounting for just 4.86%.
SW (
Figure 4b): the forest types in the SW region included ENF, EBF, DNF, DBF, and MIF. ENF and EBF were found distributed around the Hengduan Mountains, in the center, west, and south of the SW region. The amounts of MIF and DNF were relatively small, and were concentrated in the eastern part of the region, while the DBF was located mainly in the south-central region. Forest coverage rate in the SW was found to be approximately 39.09%, with ENF representing the largest portion, accounting for 54.62%. MIF and EBF followed, with 21.39% and 19.12%, respectively, with DBF having the smallest coverage, at approximately 4.88%.
SO (
Figure 4c): EBF, ENF, MIF, DBF, and DNF forest types were seen to occur in SO, and were mostly concentrated in the southern and eastern areas of the region, with a forest-cover rate of 57.17%. The amount of EBF accounted for 38.85% of the forest area, followed by MIF and ENF (28.57% and 27.95%, respectively), with DBF covering the smallest area, at 4.63%.
SNT (
Figure 4d): the forest resources here were found to be mainly distributed near the Qinling Mountains, in the central and western areas of this region. The forest types were mainly MIF, EBF, and DBF, and were concentrated in mountainous areas at higher elevations, such as Taibai Mountain of the Qinling Mountains area. The forest coverage rate in this zone was approximately 26.08%, with MIF accounting for 63.42% of the total, followed by EBF, at 32.14%, and then DBF and ENF, which covered 4.19% and 0.25% of the area, respectively.
3.3. Accuracy Assessment
In the NE area, 715 field validation samples were used to assess result accuracy and the OA was found to be 91.47% (
Table 4). Among all the forest types, DBF had the highest PA and UA (98.89% and 100%, respectively), which suggested that almost all DBFs in this region had been mapped accurately.
In the SW, 616 field validation samples were employed to assess the accuracy, giving an OA of 86.85%, as shown in
Table 5. Here, EBF acquired relatively high PA (97.44%) and UA (82.61%) results, which suggested that this forest type had been classified with fewer omission and commission errors than the others, and that EBF coverage had been provided at a relatively high accuracy level in this region.
In the SO, 815 field validation samples were used to evaluate classification accuracy, and the confusion matrix (
Table 6) showed that the regional OA was 93.50%. The most accurate classification result here was obtained for EBF, with both PA and UA > 90%.
For the SNT, 302 field validation samples were obtained and, as shown in
Table 7, the OA here was found to be 92.38%. ENF achieved PA and UA values of >90%, showing that the ENF area in the SNT region had been provided with a relatively high level of accuracy.
3.4. Feature Importance Assessment
Feature importance represented differences in contribution when classifying various forest types in the four regions (
Figure 6). Overall, when the classifier was trained on all forest types, it was found that the importance of the SST features was analogous and that the performance of each feature was relatively average, with no apparent mutation. However, when discriminating between any two forest types, the contributions from the features represented significant differences.
NE: temporal features played a vital role in discriminating between evergreen and deciduous forests, which were the main forest types in this region and were significantly seasonal. Spectral features contributed more to the classification of mixed forests with complex forest types, whereas spatial features played a more significant role in the classification of needle-leaved and broad-leaved forests.
SW: spectral and temporal characteristics were relatively more important in this region, with spatial characteristics being slightly less for all forest type divisions. Temporal features played a significant role in the classification of evergreen and deciduous forests, while spectral features were better suited to classifying all forest types and mixed forests. In addition, the auxiliary features of this region also made contributions, particularly with respect to the complex regional topography, as exemplified by Hengduan Mountain.
SO: the prominent feature here was the elevation featureof auxiliary features, which played a significant role in classifying all forest types and specific forest types, indicating that forest type distribution here was largely correlated with elevation. Temporal features played a greater role in the classification of evergreen and deciduous forests, while spectral features were generally more uniform, maintaining a stable role in identifying forest types. The contribution made by spatial features was insignificant compared with that made by other feature types.
SNT: spectral features contributed the most to forest type classifications here, accounting for the largest percentage in most assessment results and having the greatest significance in classifying evergreen and deciduous forests. This was followed by temporal features, particularly in the classification of evergreen and deciduous forests; spectral features had the greatest significance, with spatial features playing a lesser role, affecting only ENF and EBF classification. Auxiliary features were significant in the SNT region, which indicated that the forest types here were greatly affected by elevation—as was expected, given the significant vertical distribution of vegetation types in this area.
4. Discussion
4.1. Image Composition Method
Obtaining high quality and reliable composite images is an important prerequisite for studying the SST feature set and its application in the use of remote sensing data in forest classifications. In this study, we adopted a method for pixel-based, multi-rule compositing of Sentinel-2 archive data to generate cloud-free images for the four studied regions. The adopted image composition method made full use of pixels over the same period for many years and assigned weights according to set rules, making the composition results more reliable. The image composition method presented is flexible and the rules can be adjusted for different information needs.
Four parameters—image composition years, image composition period, distance to clouds and cloud shadows, and quality assessment of pixel reflectance—were used to obtain synthetic images of the growing and deciduous seasons in the four different regions [
25,
26,
29]. The image composition years were set to three in this study, allowing composite images covering the growing and deciduous seasons to be obtained for the four regions. The image composition period was determined using those months with an average NDVI ≥ 95% of the peak greenness value [
22,
23,
26], and the resulting periods for the four regions can be seen in
Table 8. Distances to clouds/cloud shadows were calculated to determine the suitability of each pixel for the composition. The assessment of the pixel reflectance was performed to avoid oversaturation, clouds, or shadowed pixels that were omitted in the cloud masking. These pixels were included in the NIR image composite, as it showed the least noise over forested areas [
22].
Furthermore, in order to assess the composition results, 500 sample points were randomly selected from a single image, withholding the most cloud-free pixels as a reference, to evaluate the consistency of the synthesis results for the four regions on GEE. The assessment result was as shown in
Figure 7. The surface reflectance values in the composite image for the four regions exhibited strong correspondence with the reference images in the blue, green, red, and NIR bands, with R2 values ranging from 0.69–0.93.
4.2. SST Forest Types Classification Framework
Forests can grow in many different climates, from the tropics to cold temperate zones, and forest compositions and types are diverse—which poses many challenges when using remote sensing technology for large-scale forest mapping. In previous studies, single classification methods and coarse resolution remote sensing images were used for forest classification and mapping on a large scale, often ignoring differences in forest distribution in different climate zones [
21].
The SST forest type classification framework applied here comprehensively took account of the distribution characteristics of forests in different climates or latitudes, and registered differences between forest types from spectral, spatial, and temporal perspectives. This was the first study to make full use of the characteristics available in Landsat and Sentinel-2 time series data for application in forest classification, and we collected as much spectral, spatial, and temporal feature data as possible from them. The results showed the SST classification framework to be effective for mapping large-scale forest areas, especially across climate zones.
With regard to spectral features, we considered the fact that Sentinel-2 data include more spectral information than Landsat data and used the Sentinel-2 spectral feature set with the original bands and vegetation indices. Previous studies have shown that these features are all suitable for forest classification [
6,
7,
19]. Considering the unique red-edge bands in Sentinel-2, we replaced the NIR band with the red-edge band, based on the theoretical NDVI, acquiring three red-edge indices (NDVIre1, NDVIre2, and NDVIre3). The evaluation results for feature importance also showed that various types of features related to the red-edge bands showed greater significance, indicating that the red-edge bands could make a relatively important contribution to forest type classifications.
Spatial features were derived from Sentinel-2 images composited using their high spatial resolution and were described using the texture index calculated using the GLCM method. We adopted this approach as the texture features could show basic attributes (size, density, and reflectance) of internal canopy shadows and leaves [
30]. We selected the contrast (con), entropy (ent), correlation (cor), and variance (var) of the texture features for each selected band to represent spatial features, these having been previously shown to be effective in discriminating forest types [
19,
27,
31,
32,
33].
Temporal features were extracted using the HANTS method, using the long NDVI and EVI time series derived from Landsat time series data. NDVI and EVI time series have been shown to reflect forest growth indicators and have been widely used in temporal feature extraction [
19,
34,
35,
36].
4.3. Feature Importance for Forest Type Classification
In the feature importance assessment, the indices ranked highly were mostly from the temporal feature set, suggesting that the time series indices were generally more helpful for forest type classification—which is consistent with the findings of Pasquarella [
6]. Furthermore, our study found that in areas with different geographic characteristics, the features which were most important for forest type classification varied. In the middle- and high-altitude areas of NE China, the indicators in the temporal feature set played a significant role. For complex mountainous areas in SW China, the importance of spectral feature indicators increased, being significantly higher than that in other areas. In terms of cloudy and rainy southern regions, spectral and temporal features played a similar role, while spatial features clearly played a larger role in SW China than in the other studied regions. For the climate transition zone in China, elevation, from the auxiliary features set, played a relatively important role in classifying each forest type. We also saw that the role of time features was most important here, especially for separating evergreen and deciduous forests.
In terms of the general feature importance, spectral feature importance accounted for the majority of the ratio (
Figure 8) in the four regions, suggesting that spectral features derived from remote sensing images remained the most important in forest type classification processes. We saw that spectral features performed particularly well in classifying complex forest types, such as MIF. The importance of temporal features was second only to spectral features in most scenarios (
Figure 8), being particularly evident with regard to the classification of evergreen and deciduous forest types with significant differences in growth patterns, where temporal features were found to have major significance.
Although some features in the spatial features set had higher values—for example, in classifying DBF and DNF in the SW region, the spatial features Growing_nir_ent and Growing_red_cor presented the greatest significance among all the features—the overall effect produced by spatial features was the smallest among the SST features in most schemes, which may be related to their limited number. For some forest types with distinct spatial structural differences, however, such as needle-leaved and broad-leaved forests, spatial features produced optimal effects (
Figure 8b). We found that topography was one of the major factors influencing forest type classification, especially in mountainous areas and areas with large elevation transitions, with forest type often affected by terrain elevation, slope, and aspect, as has been previously reported [
19].
Overall, for forest type classification, although a single feature sometimes played a significant role (e.g., NDVI amplitude in the temporal feature set), the optimal result was often obtained through a combination of SST features. Hence, the power of SST features in our work depended on the combination of features which characterized SST variability in reflectance properties. This was illustrated by the results of the feature importance assessment and suggested that SST features all played significant roles in the forest type classification process.
5. Conclusions
In this study, we conducted a detailed classification of forest types, using characteristics available from multi-source time series remote sensing sources, particularly high spatial resolution image data for large areas. Based on the GEE cloud computing platform and machine learning algorithms, we established an extraction framework for the SST features of forest type classification. Considering four typical Chinese geographic regions (middle and high latitudes, complex mountainous areas, cloudy and rainy areas, and the SNT zone) as examples, the accuracy of the method was verified and a forest type map for 2018 was obtained with a 10 m spatial resolution. The results achieved an overall classification accuracy of > 85%, which indicated that the proposed method was reliable.
The feature indicators that affected the forest type classifications differed with geographical location. We found that when classifying each forest type, spectral features were more important in classifications involving complex compositions, with our feature importance assessment demonstrating that red-edge bands were very useful in forest type classification. Temporal features could be very important in identifying forest types with distinct evergreen and deciduous growth patterns, while spatial features produced better classification results for forest types with different spatial structures, such as needle- and broad-leaved forests. We also saw that topographic factors were indispensable indicators for classifying forest types in mountainous areas or areas with large elevation transitions.