1. Introduction
Natural and anthropogenic causes influence land use/cover (LUC) [
1]. It reflects the natural properties of land and the impact of human activities and is of great significance for economic and social development, ecological environment assessment, regional food security, and global and national development strategies [
2,
3]. LUC datasets can provide basic data support for a wide range of research fields [
4,
5].
Many global-scale mid- and high-resolution LUC products are freely available for download and use [
6]. Some well-known products include European Space Agency’s CCI Land Cover series (
https://www.esa-landcover-cci.org/, accessed on 2 April 2022) and GlobCover series (
https://www.esa.int/, accessed on 5 April 2022) [
7,
8]; Copernicus Programme’s CGLS-LC100 series (
https://land.copernicus.eu/, accessed on 5 April 2022) [
9]; National Aeronautics and Space Administration’s MODIS Land Cover series (
https://lpdaac.usgs.gov/, accessed on 5 April 2022) [
10]. Their resolution is typically a few hundred meters to one kilometer. With the rapid expansion of remote sensing (RS) observation platforms, the enhancement of RS cloud computing capability, and the advancement of LUC mapping technology over the last decade, the spatial resolution of LUC datasets has been continuously improved [
11]. The Globeland30 dataset (
http://www.globallandcover.com/, accessed on 5 April 2022), developed by the Ministry of Natural Resources of China, contains global data with a 30 m resolution in 2000, 2010, and 2020 [
12]. GLC_FCS30 dataset (
https://data.casearth.cn/, accessed on 5 April 2022) developed by the Aerospace Information Research Institute, Chinese Academy of Sciences, which is updated every five years, contains global data with a 30 m resolution from 1985 to 2020 [
13,
14]. LSV10 dataset (
https://esa-worldcover.org/, accessed on 5 April 2022), developed by the European Space Agency, contains global data with a 10 m resolution in 2020 [
15]. The ESRI Land Cover dataset (ESRI10,
https://www.arcgis.com/, accessed on 5 April 2022), developed by the Environmental Systems Research Institute and updated yearly, contains global data with a 10 m resolution in 2017–2021 [
16].
These LUC datasets are convenient for researchers worldwide to conduct research in the fields of nature, resources, and ecology [
17,
18,
19]. However, they may differ regarding the type, quantity, accuracy, and spatial consistency because of the different RS images, classification systems, and mapping methods employed [
20]. A single LUC dataset might sometimes fail to fulfill the reliability criteria of research and applications [
11]. In the context of multisource data coexistence and integration research, integrating information from existing data to form new LUC data products is a significant path to overcoming this challenge [
21,
22].
Many researchers have studied LUC data fusion, which has produced a batch of reliable and practical products on a regional or global scale [
23,
24]. See et al. [
25] used a spatial consistency and geographically weighted regression method to determine the consistent location in the source datasets (GLC2000, GlobCover, and MODIS Land Cover) and then assigned the attribute with the highest consistency to the pixels. Select the surrounding LUC samples for the location with no consistency to carry out geographically weighted regression and assign the attribute with the highest probability of accuracy to the pixels. The generated medium-resolution LUC dataset has better accuracy than the source datasets. Fritz et al. [
26] proposed a convergence of evidence approach to rank different products (GLC2000, MODISLC 2005, GlobCover 2005, GEOCOVER, Cropland probability layer) using crowdsourced data from Geo-Wiki. Statistical data is then used for calibration to finally allocate cropland within each country and sub-national unit, resulting in a fusion data product that is superior to the input data. Lu et al. [
27] proposed a hierarchical optimization synergy approach to form the 2010 China cropland dataset by fusing five existing cropland products (GlobeLand30, CCI-LC, GlobCover 2009, MODIS C5, MODIS Cropland). The accuracy evaluation shows that the spatial location accuracy of the result is higher than the existing five datasets, and the consistency with the statistics is better. Bai et al. [
28] combined expert scoring and semantic correlation to build an evidence fusion method and fused five global LUC datasets (GLCC, UMD, GLC2000, MODIS LC, and GlobCover) to form a global 1 km fusion LUC dataset (SYNLCover). SYNLCover includes 8 first-level and 12 fine types, and its consistency is significantly improved. Pérez-Hoyos et al. [
29] used a multi-criteria analysis method that considered five factors, including timeliness, resolution, FAO statistical data, accuracy, and expert assessment, combined with manual fine-tuning. The source datasets (CGLS-LC, GLC2000, GLCNMO, GlobCover, GlobeLand30, CCI-LC, MODIS Land Cover) were fused to form African grassland and cropland datasets with 250 m resolution, providing reliable support for agricultural monitoring in Africa.
These fusion datasets provide high-quality alternatives for both scholars and managers. However, there are problems in existing research cases, such as large year disparities in source datasets, large spatial resolution differences, and uneven data quality, all of which are important factors that affect the reliability of the results. On the other hand, the existing fusion methods also have some problems, such as (1) pixel attributes are selected from the same location of the source datasets, which cannot effectively correct the common errors in the source datasets; (2) Only the land types possessed by all source datasets can be fused, and the fine types information possessed by only a few datasets cannot be fully utilized; (3) Expert scoring and multi-criteria analysis cannot avoid the influence of subjective factors; (4) Most methods cannot effectively consider the spatial relationships that play an important role in land classification. The Bayesian posterior probability principle is similar to LUC dataset error correction and multisource information fusion. Combining the Bayesian probability principle with the spatial neighborhood relation, a new LUC data set fusion method can overcome the above problems. This is a feasible approach worth exploring further.
The Indochina Peninsula is a vital link between East and South Asia. It has an important geopolitical and geo-economics status. Furthermore, it is important in China’s “six corridors and six channels” infrastructure construction planning. The Indochina Peninsula is mostly underdeveloped and lacks regional- or national-scale LUC datasets. The development of fusion LUC products with good timeliness, high resolution, and high reliability in this area will be a vital approach for researchers and developers to understand and enter this area.
Therefore, we intend to develop a fusion method to eliminate the influence of subjective factors and introduce spatial relations and Bayesian probabilities. Four high-resolution LUC products (LSV10, GLC_FCS30, ESRI10, and Globeland30) were selected for fusion research and analysis in five Indochina Peninsula nations (Myanmar, Vietnam, Thailand, Cambodia, and Laos). This study aimed to achieve the following three objectives:
(1) Develop a LUC data-fusion method based on Bayesian fuzzy probability prediction;
(2) Generate a 30 m resolution fusion LUC product for the Indochina Peninsula in 2020 based on the LSV10, GLC_FCS30, ESRI10, and Globeland30 datasets;
(3) Evaluate the overall accuracy and type accuracy of the fusion dataset, provide a reference for researchers on the fusion method and provide a precision reference for selecting and using the fusion dataset.
3. Method
3.1. Overall Technical Route
The realization process of this research can be divided into six steps, as shown in
Figure 2. They are as follows: (1) Build the target LUC classification system and merge the source LUC datasets on a unified basis. (2) The accuracy verification information of the source LUC datasets is used to evaluate the overall misclassification among land types to determine the prior probability. (3) Based on the Bayesian posterior probability formula and the introduction of spatial neighborhood relation, improve the posterior probability calculation method. A posteriori probability is calculated based on the improved method. (4) Considering the accuracy and reliability of the LUC datasets, the weighted average method is used to synthesize the posterior probability results of the source LUC datasets. By assigning the attribute with the highest synthetic posterior probability to the pixel, the fuzzy probability fusion is realized, and the first-level types fusion result is obtained. (5) According to the detailed classification information of GLC_FCS30, the second-level classification of cropland and forest was carried out according to the spatial correlation principle and neighborhood search method. (6) From the two aspects of consistency and validation accuracy of the datasets, the accuracy of fusion results was tested by using six indexes, namely average overall consistency, overall accuracy (OA), Kappa coefficient, user accuracy (UA), producer accuracy (PA) and F1 score.
3.2. Merge LUC Types
Since the source LUC datasets use different land classification systems, they must be merged into a unified system. Drawing on previous research experience [
32,
33], we initially identified nine common first-level types: cropland, forest, grassland, shrubland, wetland, water bodies, built-up land, bare land, and snow and ice, which were coded in multiples of 10 from 10 to 90. Considering the important roles of croplands and forests (diverse ecosystems and important economic sectors), we further subdivided cropland into two second-level types: rainfed cropland and irrigated cropland, coded as 11 and 12, respectively, and subdivided forests into six second-level types: evergreen broadleaved forest, deciduous broadleaved forest, evergreen needle-leaved forest, deciduous needle-leaved forest, mixed leaf forest, and fruit forest (orchard), coded in order from 21 to 26, respectively. Therefore, the land classification system constructed in this study includes 15 types that are both refined and practical.
We then merge the source datasets into the constructed classification system. It can be noted that only GLC_FCS30 has fine second-level type information, and the other three have only first-level type information. The merging process strictly follows the definition and description of the source data type. We have also made the necessary adjustments. For example, GLC_FCS30 classifies the Orchard as a second-level type of cropland, and we merge it into a second-level type of forest. This is more in line with the ecosystem [
6]. For overly detailed types in GLC_FCS30, we merge them into the same type. For example, we combine the Shrubland (120), Evergreen shrubland (121), and Deciduous shrubland (122) in GLC_FCS30 into the Shrubland (40) in the constructed system. The LUC codes and meanings of the source and merged datasets are presented in detail in
Table 2.
3.3. Definition of Prior Probability
The degree of misclassification between land types in the LUC datasets indicates the degree of confusion. From the perspective of probability theory, when the total amount is sufficiently large, the frequency of an event reflects its probability [
34]. The misclassification frequency can be used to characterize the misclassification probability. The formulae are as follows:
where
and
are the the
ith and
jth types of land use/cover in the LUC dataset, respectively;
is the ratio of classifying
as
;
is the area of classifying
as
, in km
2;
is the area of
, in km
2;
is the probability of classifying
as
.
In developing the source LUC datasets, the original authors performed accuracy validation and released the results based on global stratified sampling and cross-validation. Their error matrices reflect the overall misclassification. This information can be used to calculate prior probabilities.
3.4. Posterior Probability Calculation Introducing Spatial Relationships
According to the Bayesian posterior probability formula [
35,
36], if a pixel has been classified as a land type by the LUC dataset, the probability that the pixel belongs to another (or the same) type can be calculated using the following formula:
where
and
are the
ith and
jth types of land use/cover in the LUC dataset, respectively;
is the probability that the pixel is actually
when the LUC dataset has classified it as
, that is, the posterior probability;
is the probability that
is in the dataset, and the value is equal to the area ratio of
(
).
There is a strong geographical correlation in the spatial distribution of land use/cover, and the same type of land often shows obvious spatial aggregation characteristics [
37,
38]. Therefore, continuity and spatial distribution are important for developing LUC datasets. Similarly, considering the spatial relationship in the fusion of LUC datasets is an important direction for improving the accuracy of the results. To this end, this study introduces a neighborhood window and improves the posterior probability calculation formula.
The specific method is to construct a neighborhood window of an appropriate size with the target pixel as the center, count the frequencies of various types of land in the window, and use the window frequency to substitute its frequency in the dataset in the posterior probability calculation. The improved formula is as follows:
where
is the probability that the target pixel is actually
when the LUC dataset has classified it as
;
is the neighborhood window, which is a square of 9 × 9 pixels in this study;
and
are the proportions of
and
in
, respectively.
3.5. Fusion Based on Fuzzy Probability
According to formula 5, the posterior probabilities of the nine first-level types of the four LUC datasets can be obtained pixel-by-pixel. Taking a pixel of LSV10 as an example, we now have the probability that it belongs to cropland, to forest, to grassland… (9 first-level types). The same is true for GLC_FCS30, ESRI10, and Globeland30.
After that, we need to synthesize the posterior probabilities of four LUC datasets into an indicator to obtain the synthetic probability so that it is convenient to assign a land type to the pixel (fusion result). Taking cropland as an example, each of the four LUC datasets has a value at the location of a pixel, indicating the probability that the pixel belongs to cropland. Through the process of probability synthesis, the four values become one value, which indicates the probability that the pixel belongs to the cropland. The same applies to forest, grassland, etc. (the other 8 first-level types).
Considering the differences in accuracy and credibility of different LUC datasets, this study used the weighted average method to synthesize the posterior probability results of the four datasets to form a comprehensive posterior probability (CPP). The formula used is as follows:
where
is the jth type of land use/cover;
is the CPP of
on pixel
c;
is the number of LUC datasets, which is 4 in this paper;
is the posterior probability of
on pixel
c in the Lth LUC dataset;
is the overall accuracy of the Lth LUC dataset.
Then, for each pixel, we compared the CPP of nine first-level types and assigned the land type with the largest CPP to the pixel. Take a pixel, for example. We have now acquired the probability that it belongs to cropland, forest, grassland, etc. (9 first-level types). We select the largest value and assign the corresponding land type to the pixel.
The formula used is as follows:
where
is the finalized land use/cover type on pixel
c;
is the CPP of
on pixel
c;
is the number of first-level types, which is nine in this study,
is a function used to find the maximum value.
3.6. Second-Level Classification
Like the first-level types, the second-level types of cropland and forest have spatial continuity and repetition. The same second-level type tends to cluster spatially and geographically, and two parcels of land adjacent to each other are likely to share the same properties [
18]. Therefore, based on obtaining the first-level type fusion result, this study refers to the detailed classification information of the GLC_FCS30 dataset and conducts the second-level classification of cropland and forest according to the principle of spatial correlation.
First, it needs to be clear that there are two situations: the fusion result is the same as GLC_FCS30, or the fusion result is different from GLC_FCS30. This is the basis for our next steps.
Taking cropland as an example, for each cropland pixel in the fusion result, there are two situations. GLC_FCS30 identifies the pixel as cropland; GLC_FCS30 identifies the pixel as other land types. For the former case, we directly assign the second-level attribute of GLC_FCS30 to this pixel. For the latter case, we use the neighborhood search method to assign the second-level attribute of the nearest GLC_FCS30 cropland pixel to this pixel. So far, we have completed the second-level classification of all cropland pixels in the first-level fusion results.
For forest pixels, the second-level classification strategy was the same as for cropland.
3.7. Accuracy Validation
This study compares and analyzes the overall consistency, overall accuracy, Kappa coefficient, user accuracy, producer accuracy, F1 score of the fusion result, and the source LUC datasets of the first-level type from the aspects of dataset consistency and validation accuracy.
The consistency check can analyze all the pixels in the LUC datasets, revealing their consistent level of quantification. In the specific implementation process, first, the five datasets (fusion result and source datasets) were paired to calculate the overall consistency (OC) of each combination. The overall consistency of the four combinations in which each dataset is located is then arithmetically averaged to obtain the average overall consistency (AOC). The formula used is as follows:
where
and
are the ith and jth LUC datasets, respectively;
is the overall consistency of the
/
combination;
is the consistent area in the
/
combination, in km
2;
is the total area of the region, in km
2;
is the average overall consistency between
and the other datasets;
is the number of combinations containing
, which is four in this study.
The F1 score is an indicator used in statistics to measure the accuracy of dichotomous models. The core idea is that a high score requires not only high precision (producer accuracy) and recall (user accuracy) but also a small difference between the two [
39]. The formulae are as follows:
where
and
are the the
ith and
jth types of land use/cover, respectively;
,
, and
are the precision, recall, and F1 score of
, respectively;
is the number of pixels that misclassify
as
;
is the number of pixels that are correctly classified as
;
is the number of land types, which is nine in this study.
The overall accuracy is a commonly used indicator for validating classification results. It is the ratio of correctly classified pixels to all pixels and reflects the overall credibility [
40]. The Kappa coefficient is also an overall accuracy evaluation index; however, it not only considers the overall accuracy but also the balance of the accuracy of each type, increasing the influence of the type with a small area in the results [
41]. The formulae are as follows:
where
and
are the the
ith and
jth types of land use/cover, respectively;
and
are the overall accuracy and Kappa coefficient of the dataset, respectively;
is the expected agreement rate;
is the number of pixels that misclassify
as
;
is the number of pixels that are correctly classified as
;
is the number of land types, which is nine in this study.
In this study, LUC samples from LACO-Wiki were used for accuracy validation. LACO-Wiki is a web-based validation platform (
https://www.laco-wiki.net, accessed on 25 April 2022) that uses data from high-resolution images from Google Earth, Bing Maps, and OpenStreetMap [
42]. The dataset used in this study contained 1270 sample points, which included 224 croplands, 581 forests, 83 grasslands, 147 shrublands, 45 wetlands, 78 water areas, 58 built-up lands, and 55 bare lands. Because the study area is in the tropics, snow and ice are uncommon; thus, the accuracy validation excludes these phenomena.
4. Results and Analysis
4.1. Fusion Results
Based on the Bayesian-fuzzy probability prediction method, this study obtained 30 m resolution land use/cover fusion datasets in the Indochina Peninsula region in 2020 (BeyFusLUC30), as shown in
Figure 3.
4.2. Consistency Analysis
Consistency checks can put all the pixels of the LUC dataset into the analysis, revealing how consistent they are overall quantitatively. We divided the 5 datasets into two groups and calculated the overall consistency (OC) of each combination. The results are shown in
Figure 4, where darker colors indicate higher overall consistency. The BeyFusLUC30 combination had the highest OC content. Among them, the BeyFusLUC30/Globeland30 combination had the highest OC, 83.34%; the BeyFusLUC30/GLC_FCS30 combination had the lowest OC, 73.76%, but it was also 7–14% higher than the combination of GLC_FCS30 and other datasets.
Further, we arithmetically average the overall consistency of the 4 combinations in which each dataset is located to obtain the average overall consistency (AOC). Comparing the AOC between BeyFusLUC30 and the 4 source datasets, they are ranked from high to low: BeyFusLUC30 (79.57%) > Globeland30 (73.15%) > LSV10 (73.12%) > ESRI10 (69.39%) > GLC_FCS30 (65.96%). The AOC of BeyFusLUC30 was 6–14% higher than the source datasets.
4.3. Overall Accuracy Test
Accuracy validation was performed on five datasets, and the results are listed in
Table 3. Analysis at the dataset level shows that in the study area, the overall accuracy (OA) of the source datasets was between 77.00% and 79.27%. The OA of BeyFusLUC30 was 84.11%, 4.84–7.11% higher than the source datasets. The Kappa coefficient of the source datasets was 75.45–78.07%; the Kappa coefficient of BeyFusLUC30 was 83.05%, which was 4.98–7.60% higher than that of the source datasets.
According to formulas 10 and 11, the difference between user and producer accuracy can reflect the quantitative relationship between the LUC sample and the land in the validated dataset. If the user accuracy is higher, the number it identifies is higher than the number of the land type in the sample. Analysis of land type accuracy in five datasets: for cropland, GLC_FCS30 had the highest user accuracy (92.96%) but the lowest producer accuracy (68.46%), so it contains more cropland than the sample. This also applies to Globeland30 and LSV10. ESRI10, on the other hand, is the opposite. In BeyFusLUC30, the user accuracy and producer accuracy of cropland were balanced at 84.89% and 85.65%, respectively, with a difference of only 0.76%.
For forest, the user accuracies of the source datasets were excellent, ranging from 90.67–97.84, but the producer accuracies were relatively poor, ranging from 82.28–87.76%. In BeyFusLUC30, the forest had the most balanced user and producer accuracy, 93.01% and 92.84%, respectively, with a difference of only 0.17%.
LSV10, GLC_FCS30, and BeyFusLUC30 showed higher accuracy for the water bodies, whereas Globeland30 and ESRI10 had relatively lower accuracy. The five datasets for built-up land showed that the user accuracy was lower than the producer accuracy. Among them, the difference between the two accuracies in LSV10 and BeyFusLUC30 was small, around 8%. The difference between the two accuracies in GLC_FCS30, ESRI10, and Globeland30 was large, exceeding 12% or even more than 24% in all cases.
For wetlands, the source datasets demonstrate that the user accuracy is much lower than the producer accuracy; the two accuracies in BeyFusLUC30 are more balanced. For grassland, shrubland, and bare land, the advantages of BeyFusLUC30 are more obvious. Its user and producer accuracies are often the highest among the five datasets, and the balance is also better.
Furthermore, since the source LUC datasets are made based on remote sensing images of 2020 or nearby years, we remove cloud effects from all landsat8 surface reflectance images (LANDSAT/LC08/C01/T1_SR) of 2020 based on Google Earth Engine and make an average image (Landsat ASR). We visually compared the five datasets with the standard false-color image of the Landsat ASR and selected the main land types (cropland, forest, grassland, shrubland, wetland, water bodies, and built-up land) for a detailed investigation; the results are shown in
Figure 5. In terms of cropland, the area of cropland in GLC_FCS30 was significantly larger than that shown in Landsat ASR; the areas of cropland in LSV10, GLC_FCS30, ESRI10, Globeland30, and BeyFusLUC30 were similar, and the distribution and texture of cropland and built-up land in BeyFusLUC30 had the best matching degree with Landsat ASR.
For forests, the results of LSV10, GLC_FCS30, ESRI10, and BeyFusLUC30 were similar, which can better reflect the interlaced characteristics of the forest and other land types. Globeland30 had the worst accuracy, and many other land types were classified as forests. For grassland and shrubland, the consistency of different datasets varied greatly, and LSV10 and BeyFusLUC30 had the best consistency with Landsat ASR. For wetlands, ESRI10 and BeyFusLUC30 could distinguish the boundaries between the water bodies and wetlands and had better accuracy. For the water bodies, LSV10, GLC_FCS30, ESRI10, and BeyFusLUC30 all showed boundaries between large areas of water and other land types. Among them, GLC_FCS30 and BeyFusLUC30 had the best extraction results for small water areas. Globeland30 showed obvious misclassification, and large land areas were classified as water. For built-up land, except Globeland30, the other four datasets reflected the shape features and texture details of built-up land well.
Based on the above analysis results, it can be concluded that, compared with the source datasets, BeyFusLUC30 has the highest overall accuracy and Kappa coefficient. For different land types, BeyFusLUC30’s user accuracy, producer accuracy, and the balance of the two accuracies, as well as the comparison results with remote sensing images, have all improved.
4.4. F1-Score Test
Based on the above accuracy test results, we further compared and analyzed the F1 scores of various land types in the five datasets; the results are shown in
Figure 6. In general, the F1 scores of the five datasets were higher for cropland, forest, water bodies, and built-up land. The F1 scores for grassland, shrubland, wetland, and bare land were lower, and all types of land in BeyFusLUC30 had the highest F1 scores of their kind.
Detailed analysis: BeyFusLUC30 exhibited less improvement in the F1 score for cropland, forest, water bodies, and built-up land. The F1 scores of these land types in BeyFusLUC30 increased by at least 0.40–2.29% and, at most, by 6.66–9.88% compared to the source datasets. On the one hand, these land cover types have obvious spectral and texture characteristics and good spatial continuity, so remote sensing classification can achieve high accuracy (their original F1 scores generally exceed 80%). From this point of view, they don’t have much room for improvement. On the other hand, because their areas were large, the correction had little effect on the overall results. Therefore, it is normal and easy to understand that their F1 scores improved less in the fusion results [
43,
44]. The F1 scores of BeyFusLUC30 for grassland, shrubland, wetland, and bare land were greatly improved. Compared with the source datasets, the F1 scores of these land types in BeyFusLUC30 increased by at least 4.02–5.82% and, at most, by 14.41–48.35%. The reason is that these land types are ambiguous in definition, their texture and spectral features are similar, and their classification accuracy is low; but, because this fusion method utilizes all prior knowledge, it can effectively improve the F1 score. However, because their areas were small, the correction had a large impact on the overall results.
Based on the above results, it can be concluded that BeyFusLUC30 improved the F1 score of each land type. A fusion method based on Bayesian fuzzy probability prediction can effectively improve the precision and recall of each land type.
5. Discussion
In this study, a LUC data fusion method based on Bayesian-fuzzy probability prediction was developed, and four high-resolution LUC datasets (LSV10, GLC_FCS30, ESRI10, Globaland30) and the Indochina Peninsula fusion LUC dataset (BeyFusLUC30) were developed. According to the test results, the reliability and accuracy of Bey-FusLUC30 were significantly better than those of the four source datasets, with an average overall consistency of 79.57%, an overall accuracy of 84.11%, and a Kappa coefficient of 83.05%. BeyFusLUC30 has a good consistency with the source datasets than the combination of the source datasets (approximately 10%). The balance between the two accuracies of the dataset (user accuracy and producer accuracy) has increased, so the accuracy of BeyFusLUC30 has been improved in terms of both land types and datasets. The F1 score of BeyFusLUC30 increased by 0.40–2.29% for cropland, forest, water bodies, and built-up land, and increased by 4.02–5.82% for grassland, shrubland, wetland, and bare land when compared to the optimal results in the source datasets. These metrics reflect the feasibility and reliability of the fusion methods.
On the other hand, we compared the accuracy of fusion results with the existing methods, as shown in
Table 4. Although the overall accuracy of the fusion results obtained in different studies varies greatly, the overall accuracy improvement in this study is the most obvious when compared to the highest overall accuracy in the input data set. This also shows the reliability of the fusion method proposed in this paper.
This study points out that the overall accuracy of the five LUC datasets in the Indochina Peninsula is ranked from high to low as BeyFusLUC30 (84.11%) > LSV10 (79.27%) > ESRI10 (78.92%) > Globeland30 (77.19%) > GLC_FCS30 (77.00%). Among them, the four source datasets have a deviation of ±7% from the accuracy claimed by the original authors (the overall accuracy of LSV10, ESRI10, Globeland30, and GLC_FCS30 was 74.4%, 85.96%, 85.72%, and 81.4%, respectively) [
12,
13,
14,
15,
16]. Considering that the accuracy assessment of the original authors was conducted on a global scale, this study was conducted in a specific region of the Indochina Peninsula. Therefore, the difference between these two is understandable. This also shows that the selection of validation samples had a major impact on the results. Given the difference between the accuracy of information claimed by global LUC datasets and the actual situation in the region, we suggest that scholars introduce regional samples to test their accuracy before carrying out research in a specific region [
45]. More importantly, if these accuracy indicators are needed as input in the research, then regional accuracy verification is essential.
Using the fusion method, this study introduces a spatial relationship into the Bayesian formula, thereby increasing its practicality in the field of geosciences. Considering the difference in the quality of LUC sample datasets and the uncertainty of the validation process, this study used the results published by the original authors based on global-scale validation to determine prior probability and comprehensive posterior probability. These statistics are reliable, but they differ from reality in the Indochina Peninsula. Therefore, constructing regionalized, high-quality LUC samples or selecting authoritative LUC sample datasets for regional accuracy validation will be an important way to improve the accuracy of fusion results [
46]. However, the accuracy of the source datasets largely determines the quality of fusion results [
47]. Owing to the high accuracy of cropland, forest, water bodies, and built-up land in the source datasets and after the improvement of the fusion process, the user accuracy and producer accuracy in the results were extremely high. The F1 score was generally higher than 85% or even 93%. Although the accuracies and F1 score for grassland, shrubland, wetland, and bare land have improved significantly, they are still generally lower than 70%. On the one hand, introducing hyperspectral and/or multispectral aerial images with higher resolution can more accurately identify the difference between spectral and texture features to improve the classification accuracy of the source datasets. This will be an important basis for improving the accuracy of fusion products. On the other hand, in future research, it will be important to improve their accuracy by introducing image texture, structural features, ecological indicators (normalized difference vegetation index, leaf area index, phenological characteristics), and time-series change information for the fusion process to supplement the shortcomings of source datasets, which will also be of great significance in the field of land use/cover mapping.
6. Conclusions
In this study, we developed a LUC dataset fusion method based on Bayesian-fuzzy probability prediction. We applied it to Indochina Peninsula as a study area, based on four high-resolution LUC datasets (LSV10, GLC_FCS30, ESRI10, and Globeland30, with a spatial resolution of 10 m–30 m) to conduct fusion research. The highest-accuracy LUC dataset (BeyFusLUC30) is currently available for researchers and government agencies in the region.
The method evaluates the spatial relationship and uses accuracy validation information as prior knowledge, which is a maximum probability discriminant model. It is capable of correcting low-reliability pixel attributes and maintaining high-reliability pixel attributes. The fusion results had better average overall consistency (79.57%), overall accuracy (84.11%), and Kappa coefficient (83.05%) than the original dataset. The research also shows that the fusion results show a small improvement in the accuracy of the land types with good original accuracy (cropland, forest, water bodies, and built-up land) but a large improvement in the land types with poor original accuracy (grassland, shrubland, wetland, and bare land). We also propose possible ways to further reduce the uncertainty of the fusion process and improve the quality of the results, including considering a regional LUC accuracy test and providing more refined reference information.