Next Article in Journal
Tracking of Evasive Objects Using Bistatic Doppler Radar Operating in the Millimeter Wave Regime
Previous Article in Journal
HyperLiteNet: Extremely Lightweight Non-Deep Parallel Network for Hyperspectral Image Classification
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Spatial Stratification Method for the Sampling Design of LULC Classification Accuracy Assessment: A Case Study in Beijing, China

1
Research Center of Information Technology, Beijing Academy of Agriculture and Forestry Sciences, Beijing 100097, China
2
Forestry Experiment Center of North China, Chinese Academy of Forestry, Beijing 102300, China
3
College of Global Change and Earth System Science, Beijing Normal University, Beijing 100875, China
4
College of Land Science and Technology, China Agricultural University, Beijing 100193, China
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Remote Sens. 2022, 14(4), 865; https://doi.org/10.3390/rs14040865
Submission received: 30 December 2021 / Revised: 4 February 2022 / Accepted: 8 February 2022 / Published: 11 February 2022

Abstract

:
Spatial sampling design is important for accurately assessing land use and land cover (LULC) classification results from remote sensing data. Spatial stratification can dramatically improve spatial sampling efficiency by dividing the study area into several strata when classification correctness is spatially stratified heterogeneous. By integrating the LULC classification results from different sources and spatial resolutions, a spatial stratification method for spatial sampling of accuracy assessment is presented in this paper. Its efficiency is demonstrated in the case study using LULC data of Beijing, China, in the following steps. First, we standardized and reclassified multiresolution remote sensing data, including China’s land use/cover datasets (CLUDs) from 2017 (resolution: 30 m), 500 m MCD12Q1, and 10 m FROM-GLC10 data, into six classes. Second, we customized stratification rules, formulated a technical specification to realize 11 strata using CLUDs and MCD12Q1, and employed FROM-GLC10 as the reference data for accuracy assessment. Furthermore, six sample sets with sizes of 16,417; 1821; 652; 337; 198; and 142 were drawn using different methods, and their overall accuracy (OA), deviation accuracy (DA), root-mean-square error (RMSE), and standard deviation (STDEV) values were also evaluated to demonstrate the efficiency brought by spatial stratification. Compared with the spatial even sampling method, the OAs of the stratified even sampling method adopting the proposed stratification method was much closer to the true OA, and the corresponding RMSE and STDEV results decreased from 2.097% and 2.127% to 0.914% and 0.713%, respectively, due to the contribution of spatial stratification in the sampling scheme. The method can be used to distinguish the differences and improve the representativeness of samples, and it can be employed to select validation samples for LULC classification.

Graphical Abstract

1. Introduction

Land use and land cover (LULC) information is fundamental for cropland protection, ecological and environmental change studies, and sustainable development [1,2,3], and LULC has changed markedly due to frequent human activities at global and regional scales [4]. The classification accuracy of the LULC maps is the key to their applications. To assess the accuracy of LULC maps, sampling sites need to be established to obtain the true ground classes and compare with the classified results of LULC maps. Classification accuracy assessment not only describes the quality of a map [5], but also provides a means to enhance its usefulness [6]. However, due to the high cost of field sampling, only a limited number of sites can be sampled. Therefore, the validated sites for LULC classification should be distributed efficiently [7]. Moreover, the representatives of both feature space and geographical space should be considered [8,9]. Geographical space consists of latitude, longitude, and elevation, or plane coordinates after map projection [10]. The feature space, or attribute space, is a virtual space with each attribute as an axis [11,12]. Stratification, which divides the study area into several more homogeneous subregions, is a useful tool to improve the representativeness of samples in feature space and can simplify spatial heterogeneity [13]. Stratification strategies can be grouped into two main types: direct and indirect stratification strategies. The direct stratification strategy directly divides the spatial coverage using existing subregional units, e.g., LULC type, ecological zonation, and administrative unit. For example, the stratified even sampling method utilizes the LULC type of the product, whose classification accuracy needs to be assessed, as stratification and then adopts spatial simulated annealing (SSA) to distribute sampling sites evenly in the strata [14]. Existing subregional units, however, are not sufficiently precise in representing the spatial distribution of misclassification. The indirect stratification strategy employs certain clustering methods [15] to achieve spatial stratification using auxiliary factors [16], e.g., prior knowledge, historical data, and ancillary data.
The target population for the spatial sampling design of LULC classification accuracy assessments is an image consisting of pixels with a value that is true (1 if the classification is correct) or false (0 if the classification is wrong) [14]. The stratification should reflect the spatial heterogeneity of those true/false values. The consistency and inconsistency of LULC classification products from different sources and spatial resolutions can reflect the probability of misclassification to some extent, although they are produced using different spatial information, prior knowledge, and classification methods [17,18,19]. The parts where the classification results of different products with different sources and spatial resolutions are consistent, suggest easy-to-identify and low probability of misclassification. In contrast, the parts where the classification results of different products are inconsistent suggest hard-to-identify and high probability of misclassification. Furthermore, the misclassification probability is the basis of spatial stratification of the target population for accuracy assessments. Therefore, this study proposes a spatial stratification method by integrating the consistency and inconsistency of different products for spatial sampling to evaluate the LULC classification accuracy assessment. The performance of the proposed method is demonstrated using the LULC classification products of Beijing, China, as a case study. In the case study, we first standardized and reclassified multiresolution remote sensing data into six classes, then applied the stratification method to obtain several strata, and finally carried out the stratified even sampling method to assess the classification accuracy and compared its performance with spatial even sampling with different sample sizes.

2. Spatial Stratification Method

The proposed spatial stratification method aims to divide the target population into several homogeneous subregions with a close probability of misclassification. Given that the true misclassification is unknown and needs to be inferenced, the spatial stratification method tries to derive information of spatial heterogeneity from the classifications of other LULC products with different sources and spatial resolutions. It is different from the stratified even sampling method, which directly utilizes the LULC types of the product to be assessed as strata. As the LULC type cannot directly represent the spatial distribution of the misclassification probability, the spatial stratification proposed in this study improves it by integrating the consistency and inconsistency of different data products with different sources and spatial resolutions to represent the misclassification probability. In this new spatial stratification method, by comparing the classification results of target data, whose classification accuracy needs to be assessed, using other ancillary LULC products, both consistent and inconsistent classifications can be obtained. Those pixels that all LULC products give out the same judgment could have low probability of misclassification, while pixels that all products give out different judgment may have high probability of misclassification. According to the consistency and inconsistency of the comparison, e.g., three different LULC classification products shown in Figure 1, the target population can be stratified.
The spatial stratification method developed in this study is composed of three steps, as illustrated in Figure 2.
Step 1: Data standardization and reclassification. Prepare the target data and obtain K different LULC products of the same area and time period. Then standardize and reclassify those LULC products to make them comparable.
Step 2: Define the stratification rules and basic stratification units. The spatial overlay analysis of the K + 1 reclassification results was used for spatial stratification by the customized stratification rules. Based on the results of spatial overlay analysis, basic stratification units of each LULC class were obtained for the study area according to the different attribute values of the pixels.
Step 3: Spatial stratification results. Through a spatial combination of basic stratification units, spatial stratification results of each LULC class were achieved accordingly. The strata of all classes were integrated as the spatial stratification results of the study area.
For K + 1 LULC classification data with different sources and spatial resolutions, the target data whose classification accuracy needs to be assessed and K ancillary data with different resolutions should be standardized and reclassified to make them comparable. After the reclassification, the class of one pixel in the target data and k ancillary data are donated with Po and Pi (i = 1,···, k), respectively.
For each class of the target data, we can stratify it by comparing Po with Pi, i.e., Pi = [P1, P2, ···, Pk], using the spatial overlay analysis. At each pixel, a vector with k binary values can be obtained, for example [1, 0, …, 1], with 1 donating that Po = Pi, and 0 donating PoPi. There are at most 2k possible combinations, and the rules for stratification could be designed based on the vector. In practice there will be less than 2k combinations because some combinations may not exist. When k is small, the vector can be directly used as stratification rules. When k is large, the rules can be designed based on the summary of the vector, for example, using the sum of the vector as stratification rules.
Then, the final spatial stratification results of the study area can be obtained by iterating all classes of the target data and stratifying them as above steps.

3. Case Study

3.1. Data Sources and Experiment Roadmap

CLUDs (China’s land-use/cover datasets) MCD12Q1 and FROM-GLC10 data for Beijing (115°25′ E–117°30′ E, 39°28′ N–41°05′ N) were used in this study. The 30 m CLUDs for 2017 were provided by the Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences [20]. The 500 m MCD12Q1 data for 2017 were developed by Boston University [21], and the 10 m FROM-GLC10 data for 2017 can be freely downloaded from http://data.ess.tsinghua.edu.cn (accessed on 23 September 2019) [22]. The datasets were first converted by file formatting and then mosaicked and reprojected into the UTM zone 50N projection with the WGS 84 datum using nearest-neighbor resampling. The number of classes of MCD12Q1, CLUDs, and FROM-GLC10 data for Beijing was 11, 19, and 8, respectively, and corresponding three datasets are shown in Figure 3a,c,e respectively. To make them comparable, MCD12Q1, CLUDs, and FROM-GLC10 data for Beijing were transformed to have six classes, i.e., cropland, woodland, grassland, water body, built-up land, and unused land. The classification system and corresponding relationships are shown in Table 1, and the LULC reclassification results of the three datasets are shown in Figure 3b,d,f respectively. Data processing was conducted using ENVI 5.1 (ITT Visual Information Solutions, Circle Boulder, CO, USA) and ArcGIS 10.5 (Environmental Systems Research Institute, Inc., Redlands, CA, USA) software.
In this case study, the 2017 30 m CLUDs was treated as target data, and 500 m MCD12Q1 was used as ancillary data to help carry out the spatial stratification. Given the true land class was unknown, the 2017 10 m FROM-GLC10 with the higher spatial resolution was adopted as reference data to act as the true ground land class. The experiment roadmap is illustrated in Figure 4 following the spatial stratification method in the previous section.

3.2. Spatial Stratification

In the experiment, the CLUDs and MCD12Q1 were standardized and reclassified according to Table 1, and overlaid to obtain spatial strata. For each class of CLUDs, two possible strata can be obtained: one includes pixels that CLUDs and MCD12Q1 have the same class, the other includes pixels where CLUDs and MCD12Q1 have different classes. Specifically, the class of one pixel in the CLUDs and MCD12Q1 are donated with Po, and Pl, respectively.
For each class of the CLUDs, we can stratify it by comparing Po with Pl through the stratification rules illustrated in Figure 4. Based upon the comparison results using spatial overlay analysis, two possible stratifications labeled as stratum Ⅰ and stratum Ⅱ can be obtained. Stratum Ⅰ is composed of those pixels s that belong to class Po in the CLUDs and Po(s) = Pl(s), and those with Po(s) ≠ Pl(s) are divided into stratum Ⅱ. For the CLUDs and MCD12Q1, each LULC class can be divided into two strata at most, as depicted in Table 2. By iterating six classes of the CLUDs and stratifying them according to the steps given above, the spatial stratification results were achieved.

3.3. Sampling Optimization

The values of sampling sites were obtained by comparing the land class of CLUDs with that of the reference data. If they are consistent, the value is set to one, otherwise zero. The corresponding estimation methods were then used to assess the overall accuracy (OA) of the whole study area, whose true value was the mean of the target population.
The target population was obtained through a pixel-by-pixel comparison of CLUDs and reference data, and true accuracy OA0 is calculated as:
OA 0 = ( i = 1 n P i i ) / N t o t a l
where Pii refers to the number of sampling sites that were correctly classified, n is the number of LULC types, and Ntotal is the total number of pixels.
To evaluate the efficiency of the stratification, a stratified even sampling method following the divided strata was used to draw samples with different sizes to estimate the OA of CLUDs. The stratified even sampling method included spatial stratification and sample allocation in feature space and sampling optimization in geographical space [14], which was developed based upon compound sampling strategy [23] and polygonal declustering estimation [24]. After the spatial stratification only using the LULC type of the product to be asessed, the SSA and the minimization of the mean of the shortest distances (MMSD) criteria were employed to optimize sampling sites in geographical space [25,26,27]. The OA of the stratified even sampling method is defined as:
OA = i = 1 k ( OA i × W i )
where OAi is the overall accuracy of the ith stratum and the calculation of OAi is specifically introduced in [14]; k is the number of strata; and Wi is the weight of the ith stratum, which is estimated by the ratio of the area of this stratum to the total area.
In addition, the frequently used spatial even sampling method, which was realized by designing a series of kilometer grids to cover the study area and taking the center of a grid as a sampling site [28], was used as a comparison. The OA for this method can be estimated by:
OA = j = 1 n ( V j × w j )  
where Vj refers to the value of the jth sampling site, and Vj is set to 1 if the classification is correct, otherwise zero; n is the total number of sampling sites; and wj is the weight of the area surrounding the jth sampling site by polygonal method.
The total sample size of this case study was determined by Foody [29], and the required sample size needed to estimate the population proportion of correctly allocated cases in classification can be calculated by:
M = z α / 2 2 P ( 1 P ) h 2
where M is the sample size,   z α / 2 is the critical value of the normal distribution for the two-tailed significance level α, P is a planning value for the correctly allocated case population proportion, and h is the half-width of the desired confidence interval.
The area-weighted proportion method was adopted to allocate sampling sites in the case study [30]. The sample size of the kth stratum Nk is defined as:
N k = N × S k S
where Sk is the area of the kth stratum, S is the total area, and N is the total sample.

3.4. Comparative Metrics for Accuracy Assessment

The estimated OA was compared with the true OA and the deviation accuracy (DA), root-mean-square error (RMSE), and standard deviation (STDEV) were adopted to measure the assessment accuracy. The DA and RMSE measure the deviation between the observed and true values, while the STDEV measures the discrete range of a given dataset. The spatial stratification method was more effective when the values of DA, RMSE, and STDEV were low. The DA, RMSE, and STDEV are calculated, respectively, as follows:
DA = | OA j OA 0 |
RMSE = 1 n j = 1 n ( OA j OA 0 ) 2
STDEV = 1 n 1 j = 1 n ( OA j OA ¯ ) 2
where OAj is the overall accuracy estimated by the jth sample, OA0 is the true accuracy of the classification data, OA ¯ is the mean overall accuracy of all the samples, and n is the number of samples.
In addition to OA, producer accuracy (PA) and user accuracy (UA) were also calculated to reveal the classification accuracy in detail.

4. Results

4.1. Spatial Stratification of CLUDs and MCD12Q1 for Beijing

According to the experiment roadmap and spatial stratification rules in Figure 4 and Table 2, the 2017 30 m CLUDs of Beijing was stratified into 11 strata as shown in Figure 5. The unused land Ⅰ is absent in the results because CLUDs and the ancillary data (MCD12Q1) do not match each other at any pixel in judging the unused land.

4.2. Sampling Optimization and Sample Allocation

To obtain a stable performance, we set different sample sizes. The minimum sample size was determined with Equation (4). A 0.05 significance level is adopted by convention and z α / 2 equal to 1.96 [29]. One hundred thirty-nine sampling sites were required to obtain a target accuracy of 90%. The sample size needs to be larger than 139; thus, grids of different sizes were adopted to generate sampling designs for covering the study area. Six sets of grids (in km), i.e., 1 × 1; 3 × 3; 5 × 5; 7 × 7; 9 × 9; and 11 × 11, were designed, and their corresponding sample size was 16,417; 1821; 652; 337; 198; and 142, which all satisfied the minimum requirement of the sample size.
For the spatial even sampling method, the center of a grid was generated as a sampling site. As an example, the configuration of the samples with a size of 142 for the spatial even sampling method in Beijing is shown in Figure 6a.
Although the stratified even sampling method can draw any sample size and not be restricted by the number of grids divided, the same sample sizes were set for it to make the two sampling methods comparable. In applying the stratified even sampling method, 11 strata in Figure 5 were employed, and Equation (5) was used to allocate the sample size, as shown in Table 3. As an example, the configuration of the samples with a size of 142 for the stratified even sampling method in Beijing is shown in Figure 6b.

4.3. Accuracy Assessment of CLUDs Using FROM-GLC10 and Comparative Analysis

Referring to the 10 m FROM-GLC10 data of Beijing, accuracy assessments of the 30 m CLUDs were conducted using samples drawn from the stratified even sampling method and the spatial even sampling method, respectively, to evaluate the contribution of spatial stratification in the sampling scheme. Using Equation (1), we assessed CLUDs in Beijing using FROM-GLC10 data through a pixel-by-pixel comparison, and the wall-to-wall OA result of CLUDs in Beijing was 71.083%, which was the mean of the target population, i.e., the true accuracy.
For the stratified even sampling method proposed in this study, the OAs and DAs were estimated using samples based on Equations (2) and (6), respectively, and the results are shown in Table 4. The results suggest that the OA and DA of the CLUDs data for Beijing based on the stratified even sampling method were 71.110–72.926% and 0.027–1.843%, respectively. The UA and PA of each LULC class were also calculated and the results are shown in Table 5. Specifically, cropland, woodland, and built-up land were well classified with a high accuracy whilst grassland and water body had low accuracy. Misclassification of unused land may be mainly attributed to this class that covered a very limited area in Beijing (area proportion, 0.01%).
For the spatial even sampling method, the OAs and DAs were estimated using samples based on Equations (3) and (6), respectively, and the results are shown in Table 6. The results suggested that the OA and DA of the CLUDs data for Beijing based on the spatial even sampling method were 66.766–73.232% and 0.032–4.317%, respectively. The UA and PA of each LULC class were also calculated and the results are shown in Table 7. Compared with the stratified even sampling method, the estimated UA and PA of CLUDs using the spatial even sampling method were generally classified with a lower accuracy except for unused land. Meanwhile, the accuracy for specific LULC types demonstrated notable differences, as depicted in Table 5 and Table 7. Based on Table 4 and Table 6, the results suggest that the OAs estimated by the stratified even sampling method were much closer to the true OA than those by the spatial even sampling method, as depicted in Figure 7.
The RMSE and STDEV were also selected to evaluate the performance of spatial stratification on the accuracy assessment in this study. Using Equations (7) and (8), the RMSE and STDEV of the stratified even sampling and spatial even sampling methods were computed, respectively, and the calculated results are shown in Figure 8. Compared with the spatial even sampling method, the RMSE and STDEV results decreased from 2.097% and 2.127% to 0.914% and 0.713% for the stratified even sampling method, respectively, due to the contribution of spatial stratification.

5. Discussion

Given the true land class was unknown, we selected 30 m CLUDs as target data, used 500 m MCD12Q1 as ancillary data to carry out the spatial stratification, and employed 10-m FROM-GLC10, which had a higher spatial resolution and more reliable global/regional accuracy [31,32], as reference data to measure the sampling efficiency in the case study. However, in application of the proposed spatial stratification method, products with different resolutions should be used in the stratification as much as possible, and reference data, e.g., FROM-GLC10, can be also used for spatial stratification.
Taking three LULC reclassifications of MCD12Q1, CLUDs, and FROM-GLC10 in Figure 3 for Beijing as an example, CLUDs was treated as target data, and MCD12Q1 and FROM-GLC10 were employed as ancillary data to achieve spatial stratification. The class of one pixel in the CLUDs, FROM-GLC10, and MCD12Q1 is donated with Po, Ph, and Pl, respectively. Based upon the spatial stratification method developed in this study, for each class of CLUDs, the stratification rules of three LULC classification data are illustrated in Figure 9, and four stratification units, labeled as stratum Ⅰ, stratum Ⅱ, stratum Ⅲ, and stratum Ⅳ, were obtained accordingly. Stratum Ⅰ is composed of those pixels s that belong to class Po in the target data and Po(s) = Ph(s) and Po(s) = Pl(s); those with Po(s) = Ph(s) and Po(s) ≠ Pl(s) are divided into stratum Ⅱ; those with Po(s) ≠ Ph(s) and Po(s) = Pl(s) are divided into stratum Ⅲ; and those with Po(s) ≠ Ph(s) and Po(s) ≠ Pl(s) are divided into stratum Ⅳ. In addition, the four stratification units can be grouped, for example, stratum Ⅱ and stratum Ⅲ can be grouped into one stratum to represent partly consistency. Therefore, for three different data, including MCD12Q1, CLUDs, and FROM-GLC10, each LULC class can be divided into one stratum, two strata, three strata and four strata, respectively, as depicted in Figure 9.
Through different combinations of stratum Ⅰ, stratum Ⅱ, stratum Ⅲ, and stratum Ⅳ, the possible strata can be grouped into one stratum, two strata (stratum Ⅰ; stratum Ⅱ/stratum Ⅲ/stratum Ⅳ), three strata (stratum Ⅰ; stratum Ⅳ; stratum Ⅱ/stratum Ⅲ), and four strata (stratum Ⅰ; stratum Ⅱ; stratum Ⅲ; stratum Ⅳ), as shown in Figure 9. We iterated six classes of the CLUDs data and stratified them according to the spatial stratification rules depicted in Figure 9. Accordingly, four spatial stratification results for Beijing were integrated by the strata of six classes, as illustrated in Figure 10. The results suggested that the coverage of Beijing can be divided into 6, 11, 17, and 22 strata using the spatial stratification rules developed in this study. Stratum Ⅰ and stratum Ⅲ for unused land in Beijing did not exist because CLUDs and MCD12Q1 are not consistent in predicting unused land in the whole study area. Additionally, the strata obtained by the spatial stratification method can be used for spatial sampling in the future.
By integrating the LULC classification products of different sources and spatial resolutions, the spatial stratification method developed in this study can achieve spatial stratification and generate better estimations than the commonly used method for accuracy assessment. The main contribution of spatial stratification was to distinguish the differences of the probability of misclassification and improve the representativeness of samples [33]. The spatial stratification can be used in two important scenes: one is to draw a sample with more representativeness from each stratum to ensure that the accuracy assessment results of the LULC classification are much closer to the true accuracy; the other is to select much more representative training samples from different strata to improve the classification accuracy. For future implementation, we formulated a technical specification to describe the appropriate approaches and procedures for design, response, and analysis for data stratification in this study, as depicted in Figure 1, and a case study on accuracy assessment in Beijing, China, as illustrated in Figure 4. It is suggested that spatial stratification should be integrated together with different spatial sampling schemes for LULC classification in the future, for example, the stratified even sampling method in this study.

6. Conclusions

We presented a spatial stratification method to improve the efficiency of spatial sampling for classification accuracy assessment of LULC results of remote sensing data. Its performance was demonstrated in a case study using FROM-GLC10 data as the reference data to evaluate CLUDs for Beijing, and selecting MCD12Q1 as ancillary data for spatial stratification. The results suggested that the coverage of Beijing can be divided into 11 strata using the spatial stratification rules developed in this study. Compared with the spatial even sampling method, the OAs of the stratified even sampling method adopting the proposed spatial stratification method were much closer to the true OA, and the estimated UA and PA of CLUDs were generally classified with a higher accuracy except for unused land. Meanwhile, the corresponding RMSE and STDEV results decreased from 2.097% and 2.127% to 0.914% and 0.713%, respectively, due to the contribution of spatial stratification to sample selection. Therefore, the method proposed has promising performance and great potential to be widely employed to select the samples for the accuracy assessment of LULC classification products.

Author Contributions

Conceptualization, S.D.; methodology, S.D., Y.P. and B.G.; validation, S.D. and H.G.; writing—original draft preparation, S.D.; writing—review and editing, B.G., Y.P. and Z.C.; funding acquisition, S.D. and Y.P. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Key Research and Development Program of China (Grant Number 2021YFD1500104), the National Natural Science Foundation of China (Grant Number 41801276), and the Beijing Natural Science Foundation (Grant Number 8192015).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

We thank Mengmeng Li from Fuzhou University for the helpful suggestions.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Searchinger, T.D.; Wirsenius, S.; Beringer, T.; Dumas, P. Assessing the efficiency of changes in land use for mitigating climate change. Nature 2018, 564, 249–253. [Google Scholar] [CrossRef] [PubMed]
  2. Stehfest, E.; van Zeist, W.J.; Valin, H.; Havlik, P.; Popp, A.; Kyle, P.; Tabeau, A.; Mason-D’Croz, D.; Hasegawa, T.; Bodirsky, B.L.; et al. Key determinants of global land-use projections. Nat. Commun. 2019, 10, 2166. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  3. Zhang, C.; Sargent, I.; Pan, X.; Li, H.; Gardiner, A.; Hare, J.; Atkinson, P.M. Joint deep learning for land cover and land use classification. Remote Sens. Environ. 2019, 221, 173–187. [Google Scholar] [CrossRef] [Green Version]
  4. Song, X.P.; Hansen, M.C.; Stehman, S.V.; Potapov, P.V.; Tyukavina, A.; Vermote, E.F.; Townshend, J.R. Global land change from 1982 to 2016. Nature 2018, 560, 639–643. [Google Scholar] [CrossRef]
  5. Olofsson, P.; Foody, G.M.; Herold, M.; Stehman, S.V.; Woodcock, C.E.; Wulder, M.A. Good practices for estimating area and assessing accuracy of land change. Remote Sens. Environ. 2014, 148, 42–57. [Google Scholar] [CrossRef]
  6. Stehman, S.V.; Foody, G.M. Key issues in rigorous accuracy assessment of land cover products. Remote Sens. Environ. 2019, 231, 111199. [Google Scholar] [CrossRef]
  7. Wagner, J.E.; Stehman, S.V. Optimizing sample size allocation to strata for estimating area and map accuracy. Remote Sens. Environ. 2015, 168, 126–133. [Google Scholar] [CrossRef]
  8. Lyons, M.B.; Keith, D.A.; Phinn, S.R.; Mason, T.J.; Elith, J. A comparison of resampling methods for remote sensing classification and accuracy assessment. Remote Sens. Environ. 2018, 208, 145–153. [Google Scholar] [CrossRef]
  9. Gao, B.; Pan, Y.; Chen, Z.; Wu, F.; Ren, X.; Hu, M. A spatial conditioned Latin hypercube sampling method for mapping using ancillary data. Trans. GIS 2016, 20, 735–754. [Google Scholar] [CrossRef] [Green Version]
  10. Beguin, H.; Thisse, J.-F. An axiomatic approach to geographical space. Geogr. Anal. 1979, 11, 325–341. [Google Scholar] [CrossRef]
  11. Zeng, Y.; Li, J.; Liu, Q.; Qu, Y.; Huete, A.R.; Xu, B.; Yin, G.; Zhao, J. An optimal sampling design for observing and validating long-term leaf area index with temporal variations in spatial heterogeneities. Remote Sens. 2015, 7, 1300–1319. [Google Scholar] [CrossRef] [Green Version]
  12. Hengl, T.; Rossiter, D.G.; Stein, A. Soil sampling strategies for spatial prediction by correlation with auxiliary maps. Aust. J. Soil Res. 2003, 41, 1403–1422. [Google Scholar] [CrossRef]
  13. Ge, Y.; Jin, Y.; Stein, A.; Chen, Y.; Wang, J.; Wang, J.; Cheng, Q.; Bai, H.; Liu, M.; Atkinson, P.M. Principles and methods of scaling geospatial earth science data. Earth-Sci. Rev. 2019, 197, 17. [Google Scholar] [CrossRef]
  14. Dong, S.; Chen, Z.; Gao, B.; Guo, H.; Sun, D.; Pan, Y. Stratified even sampling method for accuracy assessment of land use/land cover classification: A case study of Beijing, China. Int. J. Remote Sens. 2020, 41, 6427–6443. [Google Scholar] [CrossRef]
  15. Leichtle, T.; Geiss, C.; Wurm, M.; Lakes, T.; Taubenbock, H. Unsupervised change detection in VHR remote sensing imagery—An object-based clustering approach in a dynamic urban environment. Int. J. Appl. Earth Obs. 2017, 54, 15–27. [Google Scholar] [CrossRef]
  16. Xu, E.; Zhang, H.; Yao, L. An elevation-based stratification model for simulating land use change. Remote Sens. 2018, 10, 1730. [Google Scholar] [CrossRef] [Green Version]
  17. Dong, S.; Li, H.; Sun, D. Fractal feature analysis and information extraction of woodlands based on MODIS NDVI time series. Sustainability 2017, 9, 1215. [Google Scholar] [CrossRef] [Green Version]
  18. Pflugmacher, D.; Krankina, O.N.; Cohen, W.B.; Friedl, M.A.; Sulla-Menashe, D.; Kennedy, R.E.; Nelson, P.; Loboda, T.V.; Kuemmerle, T.; Dyukarev, E.; et al. Comparison and assessment of coarse resolution land cover maps for northern Eurasia. Remote Sens. Environ. 2011, 115, 3539–3553. [Google Scholar] [CrossRef]
  19. Wang, L.B.; Bartlett, P.; Pouliot, D.; Chan, E.; Lamarche, C.; Wulder, M.A.; Defourny, P.; Brady, M. Comparison and assessment of regional and global land cover datasets for use in class over Canada. Remote Sens. 2019, 11, 2286. [Google Scholar] [CrossRef] [Green Version]
  20. Liu, J.; Liu, M.; Tian, H.; Zhuang, D.; Zhang, Z.; Zhang, W.; Tang, X.; Deng, X. Spatial and temporal patterns of China’s cropland during 1990–2000: An analysis based on Landsat TM data. Remote Sens. Environ. 2005, 98, 442–456. [Google Scholar] [CrossRef]
  21. Friedl, M.A.; Sulla-Menashe, D.; Tan, B.; Schneider, A.; Ramankutty, N.; Sibley, A.; Huang, X. MODIS collection 5 global land cover: Algorithm refinements and characterization of new datasets. Remote Sens. Environ. 2010, 114, 168–182. [Google Scholar] [CrossRef]
  22. Gong, P.; Liu, H.; Zhang, M.; Li, C.; Wang, J.; Huang, H.; Clinton, N.; Ji, L.; Li, W.; Bai, Y.; et al. Stable classification with limited sample: Transferring a 30-m resolution sample set collected in 2015 to mapping 10-m resolution global land cover in 2017. Sci. Bull. 2019, 64, 370–373. [Google Scholar] [CrossRef] [Green Version]
  23. Gruijter, J.J.D.; Bierkens, M.F.P.; Brus, D.J.; Knotters, M. Sampling for Natural Resource Monitoring; Springer: Berlin/Heidelberg, Germany, 2006; pp. 106–110. [Google Scholar]
  24. Isaaks, E.H.; Srivastava, R.M. An Introduction to Applied Geostatistics; Oxford University Press: New York, NY, USA, 1989; pp. 238–247. [Google Scholar]
  25. Van Groenigen, J.; Stein, A. Constrained optimization of spatial sampling using continuous simulated annealing. J. Environ. Qual. 1998, 27, 1078–1086. [Google Scholar] [CrossRef]
  26. Gao, B.; Liu, Y.; Pan, Y.; Gao, Y.; Chen, Z.; Li, X.; Zhou, Y. Error index for additional sampling to map soil contaminant grades. Ecol. Indic. 2017, 77, 129–138. [Google Scholar] [CrossRef]
  27. Van Groenigen, J.; Siderius, W.; Stein, A. Constrained optimisation of soil sampling for minimisation of the Kriging variance. Geoderma 1999, 87, 239–259. [Google Scholar] [CrossRef]
  28. Dong, S.; Gao, B.; Pan, Y.; Li, R.; Chen, Z. Assessing the suitability of FROM-GLC10 data for understanding agricultural ecosystems in China: Beijing as a case study. Remote Sens. Lett. 2020, 11, 11–18. [Google Scholar] [CrossRef]
  29. Foody, G.M. Sample size determination for image classification accuracy assessment and comparison. Int. J. Remote Sens. 2009, 30, 5273–5291. [Google Scholar] [CrossRef]
  30. Heung, B.; Bulmer, C.E.; Schmidt, M.G. Predictive soil parent material mapping at a regional-scale: A random forest approach. Geoderma 2014, 214–215, 141–154. [Google Scholar] [CrossRef]
  31. Tu, Y.; Lang, W.; Yu, L.; Li, Y.; Jiang, J.; Qin, Y.; Wu, J.; Chen, T.; Xu, B. Improved mapping results of 10 m resolution land cover classification in Guangdong, China using multisource remote sensing data with Google Earth Engine. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2020, 13, 5384–5397. [Google Scholar] [CrossRef]
  32. Robinson, C.; Malkin, K.; Jojic, N.; Chen, H.; Qin, R.; Xiao, C.; Schmitt, M.; Ghamisi, P.; Haensch, R.; Yokoya, N. Global land-cover mapping with weak supervision: Outcome of the 2020 IEEE GRSS data fusion contest. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2021, 14, 3185–3199. [Google Scholar] [CrossRef]
  33. Padilla, M.; Olofsson, P.; Stehman, S.V.; Tansey, K.; Chuvieco, E. Stratification and sample allocation for reference burned area data. Remote Sens. Environ. 2017, 203, 240–255. [Google Scholar] [CrossRef]
Figure 1. The consistency and inconsistency of different LULC classification products with 500 m (a), 30 m (b) and 10 m (c) resolutions in subsets as indicated in Figure 3.
Figure 1. The consistency and inconsistency of different LULC classification products with 500 m (a), 30 m (b) and 10 m (c) resolutions in subsets as indicated in Figure 3.
Remotesensing 14 00865 g001
Figure 2. Flowchart of spatial stratification method.
Figure 2. Flowchart of spatial stratification method.
Remotesensing 14 00865 g002
Figure 3. LULC of MCD12Q1 (a), CLUDs (c), FROM-GLC10 (e) and corresponding LULC reclassification of MCD12Q1 (b), CLUDs (d), and FROM-GLC10 (f) for Beijing. The red subsets were used for the detailed exhibition shown in Figure 1.
Figure 3. LULC of MCD12Q1 (a), CLUDs (c), FROM-GLC10 (e) and corresponding LULC reclassification of MCD12Q1 (b), CLUDs (d), and FROM-GLC10 (f) for Beijing. The red subsets were used for the detailed exhibition shown in Figure 1.
Remotesensing 14 00865 g003
Figure 4. The experiment roadmap of the case study. Function T represents the LULC class.
Figure 4. The experiment roadmap of the case study. Function T represents the LULC class.
Remotesensing 14 00865 g004
Figure 5. The 11 strata using CLUDs and MCD12Q1 for Beijing.
Figure 5. The 11 strata using CLUDs and MCD12Q1 for Beijing.
Remotesensing 14 00865 g005
Figure 6. Sampling results of 142 samples for the spatial even sampling method (a) and the stratified even sampling method (b).
Figure 6. Sampling results of 142 samples for the spatial even sampling method (a) and the stratified even sampling method (b).
Remotesensing 14 00865 g006
Figure 7. The DA results from different sample sizes in Beijing.
Figure 7. The DA results from different sample sizes in Beijing.
Remotesensing 14 00865 g007
Figure 8. The RMSE and STDEV results using different methods for Beijing.
Figure 8. The RMSE and STDEV results using different methods for Beijing.
Remotesensing 14 00865 g008
Figure 9. Spatial stratification rules of the CLUDs, FROM-GLC10, and MCD12Q1 data.
Figure 9. Spatial stratification rules of the CLUDs, FROM-GLC10, and MCD12Q1 data.
Remotesensing 14 00865 g009
Figure 10. The 6 (a), 11 (b), 17 (c), and 22 (d) strata using CLUDs, MCD12Q1, and FROM-GLC10 for Beijing.
Figure 10. The 6 (a), 11 (b), 17 (c), and 22 (d) strata using CLUDs, MCD12Q1, and FROM-GLC10 for Beijing.
Remotesensing 14 00865 g010
Table 1. Classification system and corresponding relationships of different datasets in Beijing.
Table 1. Classification system and corresponding relationships of different datasets in Beijing.
ClassesMCD12Q1CLUDsFROM-GLC10
CroplandCroplandsPaddy Land Areas; Dry Land AreasCropland
WoodlandDeciduous Broadleaf Forests; Mixed Forests; Closed Shrublands; Woody Savannas; SavannasForests; Shrublands; Woodlands; Other WoodlandsForest;
Shrubland
GrasslandGrasslandsDense Grasslands; Moderate Grasslands; Sparse GrasslandsGrassland
Water BodyPermanent Wetlands; Water BodiesStreams and Rivers; Lakes; Reservoirs and Ponds; BottomlandsWetland;
Waterbody
Built-up LandUrban and Built-up LandsUrban Built-up Land Areas; Rural Settlements; Other Built-up LandsImpervious Area
Unused LandBarrenSwampland; Bare Soil Areas; Bare Rock AreasBare Land
Table 2. The strata of each LULC class based on the CLUDs and MCD12Q1.
Table 2. The strata of each LULC class based on the CLUDs and MCD12Q1.
IDLULCCLUDs (30 m)MCD12Q1 (500 m)Strata
1CroplandCropland Ⅰ
2×Cropland Ⅱ
3WoodlandWoodland Ⅰ
4×Woodland Ⅱ
5GrasslandGrassland Ⅰ
6×Grassland Ⅱ
7Water BodyWater Body Ⅰ
8×Water Body Ⅱ
9Built-up LandBuilt-up Land Ⅰ
10×Built-up Land Ⅱ
11Unused LandUnused Land Ⅰ
12×Unused Land Ⅱ
Table 3. The area weights and sample sizes of different strata.
Table 3. The area weights and sample sizes of different strata.
StrataArea Weights (%)Sample Size
Cropland Ⅰ12.970182644852362129
Cropland Ⅱ9.774141933641781605
Woodland Ⅰ34.42549681162246275652
Woodland Ⅱ11.257162238732051848
Grassland Ⅰ2.6874591849441
Grassland Ⅱ5.095710173393836
Water Body Ⅰ0.647112412106
Water Body Ⅱ1.7803461232292
Built-up Land Ⅰ14.128202848922572319
Built-up Land Ⅱ6.60891322431201085
Unused Land Ⅱ0.629112411103
Total100142198337652182116,417
Table 4. The OA and DA of the CLUDs data using the stratified even sampling method (%).
Table 4. The OA and DA of the CLUDs data using the stratified even sampling method (%).
Samples
(km × km)
142
(11 × 11)
198
(9 × 9)
337
(7 × 7)
652
(5 × 5)
1821
(3 × 3)
16,417
(1 × 1)
Total
OA71.27871.78372.92672.12371.12771.11071.083
DA0.1950.7001.8431.0400.0440.0270.000
Table 5. The UA and PA of CLUDs using the stratified even sampling method for Beijing (%).
Table 5. The UA and PA of CLUDs using the stratified even sampling method for Beijing (%).
Samples
(km × km)
IndicesCroplandWoodlandGrasslandWater BodyBuilt-Up LandUnused Land
142
(11 × 11)
UA75.00084.61518.18225.00072.4140.000
PA70.58879.71018.182100.00080.7690.000
198
(9 × 9)
UA71.73978.88926.66720.00080.4880.000
PA63.46280.68222.222100.00084.6150.000
337
(7 × 7)
UA75.32584.41626.92337.50068.5710.000
PA68.23583.33328.00060.00073.8460.000
652
(5 × 5)
UA71.14185.52225.49056.25065.1850.000
PA61.27283.00726.53175.00080.7340.000
1821
(3 × 3)
UA67.63385.57724.64850.00065.2520.000
PA61.94781.83925.36275.86276.8750.000
16,417
(1 × 1)
UA68.59585.63719.36745.06764.8030.000
PA61.97081.78620.32471.30878.1880.000
Table 6. The OA and DA of the CLUDs data using the spatial even sampling method (%).
Table 6. The OA and DA of the CLUDs data using the spatial even sampling method (%).
Samples
(km × km)
142
(11 × 11)
198
(9 × 9)
337
(7 × 7)
652
(5 × 5)
1821
(3 × 3)
16,417
(1 × 1)
Total
OA70.42373.23266.76669.47970.73071.11571.083
DA0.6602.1494.3171.6040.3530.0320.000
Table 7. The UA and PA of CLUDs using the spatial even sampling method for Beijing (%).
Table 7. The UA and PA of CLUDs using the spatial even sampling method for Beijing (%).
Samples
(km × km)
IndicesCroplandWoodlandGrasslandWater BodyBuilt-Up LandUnused Land
142
(11 × 11)
UA70.73283.87127.2730.0003.8460.000
PA67.44282.54021.4290.00076.1900.000
198
(9 × 9)
UA63.15891.11111.11157.14376.7440.000
PA61.53879.61220.00057.14384.6150.000
337
(7 × 7)
UA62.66788.1943.84644.44456.0980.000
PA58.02578.3955.00080.00068.6570.000
652
(5 × 5)
UA71.32984.59018.75035.00056.6180.000
PA59.30283.22616.07187.50075.4900.000
1821
(3 × 3)
UA68.12786.61721.81843.59064.8100.000
PA60.87079.70426.47168.00080.0000.000
16,417
(1 × 1)
UA68.59585.61219.36745.06764.8030.000
PA61.97081.77320.32471.30878.1880.000
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Dong, S.; Guo, H.; Chen, Z.; Pan, Y.; Gao, B. Spatial Stratification Method for the Sampling Design of LULC Classification Accuracy Assessment: A Case Study in Beijing, China. Remote Sens. 2022, 14, 865. https://doi.org/10.3390/rs14040865

AMA Style

Dong S, Guo H, Chen Z, Pan Y, Gao B. Spatial Stratification Method for the Sampling Design of LULC Classification Accuracy Assessment: A Case Study in Beijing, China. Remote Sensing. 2022; 14(4):865. https://doi.org/10.3390/rs14040865

Chicago/Turabian Style

Dong, Shiwei, Hui Guo, Ziyue Chen, Yuchun Pan, and Bingbo Gao. 2022. "Spatial Stratification Method for the Sampling Design of LULC Classification Accuracy Assessment: A Case Study in Beijing, China" Remote Sensing 14, no. 4: 865. https://doi.org/10.3390/rs14040865

APA Style

Dong, S., Guo, H., Chen, Z., Pan, Y., & Gao, B. (2022). Spatial Stratification Method for the Sampling Design of LULC Classification Accuracy Assessment: A Case Study in Beijing, China. Remote Sensing, 14(4), 865. https://doi.org/10.3390/rs14040865

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop