3.1. FVC Extraction Using Different Spatial Resolutions
Four images with similar acquisition dates (Sentinel-2, MOD13Q1, and MOD13A1 images obtained on 28 July 2017 and the Landsat-8 image obtained on 21 July 2017) were used to extract the FVC of the study area.
Figure 2 compares the results within a sample site mainly consisting of woodland (a total of 2100 pixels). For FVCs extracted from the Sentinel-2 and Landsat-8 images, the overall spatial distribution of vegetation is basically consistent. The FVC spatial distribution is clearer with images of higher spatial resolution. When the image resolution is reduced to 250 m and 500 m, the woodland boundaries are blurred, and the spatial distribution information of the FVCs is significantly lost.
According to the statistical results from the sample site (
Table 3), the maximum and minimum values of FVC extracted from 10 m and 30 m resolution images are equal. However, as the image resolution decreases, the range of FVC values indicates a decrease in the maximum values and an increase in the minimum values. The FVC mean values for the four resolutions do not vary substantially, but the standard deviations decrease gradually as the resolution is reduced. The maps of woodland FVC in Liyang at different resolutions (
Figure 3) show that the spatial distributions of FVC
30m and FVC
10m are similar, whereas the spatial distributions of FVC
250m and FVC
500m are similar but with obvious differences relative to FVC
10m. Differences between the FVC distributions are most evident along the edges of woodland patches, as well as around the woodlands near Shahe Reservoir and Daxi Reservoir in Tianmuhu township (see
Figure 1b for location).
To better understand the spatial distribution of differences among the FVCs extracted using different spatial resolutions, deviation of FVC at 30 m, 250 m, and 500 m spatial resolutions from FVC at 10 m spatial resolution were calculated (
Figure 4). We found that the difference between FVC
30m and FVC
10m mainly occurs at lower values, ranging from −0.1 to 0.1. The differences between FVC
250m and FVC
10m and between FVC
500m and FVC
10m mainly occur at higher values, ranging from −0.4 to −0.1. The differences between Sentinel-2 and MOD13Q1/MOD13A FVCs are most obvious in Tianmuhu, especially at the edges of woodland patches. In these areas, the difference can reach −0.4 to −0.1, indicating an underestimation of the FVCs extracted from MOD13Q1 and MOD13A1 images relative to the FVCs extracted from 10 m Sentinel-2 images.
3.2. Comparison of FVC-Level Distribution
FVCs were classified into five levels according to classification standards (
Table 2). The proportion of each FVC level was calculated as shown in
Figure 5. The percentage of each FVC level varies depending on the spatial resolution. Specifically, the percentages of FVC
30m and FVC
10m are similar across all levels. The area of level V at both spatial resolutions accounts for more than 90%. The difference in the area percentage of level II and level IV is quite small for the two spatial resolutions. These results indicate that the ability of the Landsat-8 image to reflect the FVC level in the region is similar to that of the Sentinel-2 image. The percentages of FVC
250m and FVC
500m are quite similar but differ considerably from those of FVC
10m. The percentage of level IV is significantly higher than that of FVC
10m, representing an approximate difference of 11%. The percentage of level V is remarkably lower than that of FVC
10m, with a difference of approximately 12%. However, levels I, II, and III do not differ considerably.
Taking FVC
10m as a reference,
Table 4,
Table 5 and
Table 6 represent the misclassified area for five levels of FVC
30m, FVC
250m, and FVC
500m, respectively (see Equation (3) for more detail).
Table 7 presents the PA and UA for three spatial resolutions (see Equations (5) and (6)). The overall accuracy values of FVC
30m, FVC
250m, and FVC
500m were calculated according to Equation (4), totaling 91.0%, 76.3%, and 76.7%, respectively.
The overall accuracy of FVC
30m is the highest, whereas the overall accuracy of FVC
250m and FVC
500m is not much lower. Regarding the misclassification of each FVC level (
Table 4,
Table 5 and
Table 6), the difference in FVC
30m is mainly caused by misclassification into adjacent higher coverage levels. In comparison, the differences in FVC
250m and FVC
500m are mainly caused by misclassification into higher coverage levels, showing that compared to FVC
10m, the FVCs extracted from all three resolution images are overestimated. However, the overestimation is greater in FVC
250m and FVC
500m than in FVC
30m.
As shown by the extraction accuracy of each coverage level (
Table 7), with increased FVC, the overall PA and UA of FVC
30m, FVC
250m, and FVC
500m improve. In addition, the extraction accuracy of level V is much higher than that of the other levels. The PA and UA at each level of FVC
30m are higher than those of FVC
250m and FVC
500m. Among these, the difference in extraction accuracy of level II is greatest, with a smaller difference in extraction accuracy of level V. This indicates that the magnitude of FVC can have a considerable impact on the accuracy of FVC extraction.
3.3. Comparison of FVC Time Series
Considering that the quality of the Sentinel-2 images is sensitive to weather and the immaturity of the spatiotemporal fusion method for acquiring 10 m resolution time-series images, in this study, time-series images with 30 m spatial resolution were generated using ESTARFM, as FVC30m is most similar to the FVC10m reference data. The time series of FVC30m, FVC250m, and FVC500m were compared in order to study the impact of image resolution on FVC time series.
After screening, nine periods of high-quality (cloud cover less than 10%) raw Landsat-8 data remained (
Table 8). The remaining 14 periods of images in the study year were obtained by fusion (one image every 15 days, for a total of 23 periods per year). For each missing 30 m spatial resolution image within the target period, two pairs of reference images in different periods and low-spatial-resolution images at the predicted time were taken as input data. The closer the acquisition time of images between two resolutions, the better the fusion effect. To verify the accuracy of ESTARFM, 10,000 pixels on the actual image and the fused image were randomly selected for correlation analysis. The correlation coefficient reached 0.9234, indicating that the difference between the actual images and the fused images was quite small and that the spectral information was well preserved. The fused images not only include the spatial information of Landsat-8 NDVI but also the temporal variation characteristics of MOD13A1. Therefore, NDVI time series with 30 m spatial resolution for 23 periods in 2017 were constructed.
The time-series data of FVC30m, FVC250m, and FVC500m in 23 periods were obtained based on the corresponding NDVI time series using the pixel dimidiate model. To facilitate the comparison of the FVC time series, the FVC10m extracted from Sentinel-2 on 28 July as used as the reference to classify the FVC level with an interval of 0.1. Because the area of woodland pixels with an FVC < 0.4 is small, these pixels were ignored.
Figure 6 shows the annual time-series variation in the mean FVC from three different spatial resolutions. The temporal profiles were derived from Landsat-8, MOD13Q1, and MOD13A1 images. All these temporal profiles of varying spatial resolutions present consistent seasonal dynamics and magnitudes. The FVCs began to increase around 6 March as the growing season started, reaching their peaks around 28 July, when the vegetation was at its climax. After September 30, FVCs began to decline sharply. However, the time-series values of FVC
30m are higher than those of FVC
250m/FVC
500m during spring and autumn.
Moreover, the time-series variations in the mean FVC levels from three spatial resolutions were extracted, as shown in
Figure 7. The time-series values of each FVC
30m are generally higher than the values of each level of FVC
10m. The temporal profiles of FVC
30m, FVC
250m, and FVC
500m are the most similar when FVC
10m is 0.6~0.7 (
Figure 7c). When FVC
10m is less than 0.6, the temporal profile of FVC
30m is lower than that of FVC
250m/FVC
500m, and the gap increases as FVC decreases (
Figure 7a,b). On the contrary, when FVC
10m is higher than 0.7, the temporal profile of FVC
30m is higher than that of FVC
250m/FVC
500m, and the gap enlarges as FVC increases.
Finally, the difference rate between FVC
30m and FVC
250m/FVC
500m was analyzed, as shown in
Figure 8. The FVC
30m and FVC
250m/FVC
500m difference rates are almost the same at each FVC level, as the temporal profiles of FVC
250m and FVC
500m are very similar. When FVC
10m is less than 0.6, the difference between FVC
30m and FVC
250m/FVC
500m is less in spring and autumn and greater in summer and winter. However, when FVC
10m is higher than 0.7, the difference between FVC
30m and FVC
250m/FVC
500m is greater in spring and autumn and lower in summer and winter.
The above results show that even the temporal profiles of the mean FVC at each resolution are very similar. The profiles of FVC at different spatial resolutions present with consistent seasonal dynamics and magnitudes. However, the degree of variation in FVC at different spatial resolutions depends on the season and level of FVC.