1. Introduction
Coastal wetlands play a pivotal role in providing many ecological services, including storing runoff, reducing seawater erosion, providing food, and sheltering many organisms, including plants and animals [
1]. Most coastal wetlands have a vital carbon sink function, which is crucial to reduce atmospheric carbon dioxide concentration and slow down global climate change [
2,
3]. In addition, the mudflats [
4], mangroves, and vegetation (e.g.,
Tamarix chinensis,
Suaeda salsa, and
Spartina alterniflora) [
5] in coastal wetlands have strong carbon sequestration ability. Therefore, the coastal wetland is called the main body of the blue carbon ecosystem in the coastal zone [
6].
The Yellow River Delta (hereinafter referred to as YRD) has a complete range of estuarine wetland types, including salt marshes, mudflats, and tidal creeks [
7,
8]. However, intense anthropogenic activities in recent decades, such as dam building, agricultural irrigation, groundwater pumping, hydrocarbon extraction, and the artificial diversion of the estuary, have posed serious threats to the coastal wetlands of YRD [
9,
10,
11,
12,
13]. Therefore, it is of great significance to carry out dynamic monitoring and obtain a reliable and up-to-date classification of coastal wetlands over the YRD for studying the impact of human activities on habitat area [
14].
Wetland classification can illustrate the distribution and area of wetlands over geographical regions, which are helpful tools for evaluating the effectiveness of wetland policies [
14]. In the last sixty years, wetland mapping and monitoring methods have been varied, mainly divided into field-based methods and remote sensing (RS) methods. Field-based wetland classification requires field work, which is labor-intensive, high in cost, time-consuming, and usually impractical due to poor accessibility. Therefore, it is only practical for relatively small areas [
15]. In contrast, RS imagery can currently provide spatial coverage and repeatable observations in long-term series from local to regional scales, enabling effective detection and monitoring of different wetlands at a lower cost. However, wetland RS classification needs to be combined with sufficient field observations to train and evaluate the accuracy of classification [
14]. RS has been demonstrated to be the most effective and economical method in wetland classification [
15]. In addition, large-scale coastal wetland mapping is becoming a reality thanks to cloud computing platforms such as Google Earth Engine (GEE) [
16,
17].
However, there are still some problems in the detection and classification of different types of wetland using satellite remote sensing images. The spectral curves of the same vegetation may be different due to the influence of growth environment, diseases, and insect pests. Additionally, two different vegetation may present the same spectral characteristics or mixed spectral phenomenon in a certain spectral segment, which makes it difficult to identify wetland types well by only using spectral response curves. These two phenomena greatly influence the classification algorithm based on spectral information and easily cause misclassification [
18]. The particularity of wetlands makes wetland classification a challenging topic in remote sensing study.
Optical images can classify ground objects according to spectral features and various vegetation indices. Since the launch of the Landsat satellite in the late 1960s, wetland mapping has been an important application of remote sensing [
19,
20,
21,
22]. In the early stages, single data source and classical algorithms were mainly used, but now mapping has gradually started using multisource data fusion and complex algorithms [
23]. With the launch of hyperspectral satellites, hyperspectral remote sensing images are gradually becoming widely used [
24,
25,
26]. Hyperspectral data are sensitive to tiny spectral details and can detect resonance absorption and other spectral features of materials within the wavelength range of the sensor [
27]. Melgani and Bruzzone [
21] introduced support vector machines (SVM) to class hyperspectral images and proved that SVM is an effective alternative to conventional pattern recognition approaches (feature-reduction procedures combined with a classification method) to classify hyperspectral remote sensing data. Xi et al. [
28] utilized Zhuhai-1 Constellation Orbita Hyperspectral Satellite (OHS) hyperspectral images for tree species mapping and indicated that hyperspectral imagery can efficiently improve the accuracy of tree species classification and has great application prospects for the future.
In recent years, the continuous launch of spaceborne synthetic aperture radar (SAR) systems have obtained a large number of on-orbit and historical archived data, providing an excellent opportunity for multi-temporal analysis, especially in coastal and cloudy areas [
29]. Radar reflectivity is usually determined by the complex dielectric constant of the landcover, which in turn is dominated by the water content and geometric detail of the surface, e.g., smoothness or roughness of the surface and the adjacency of reflecting faces [
27]. In the last two decades, many complicated and efficient classifiers and features have been investigated and integrated into the polarimetric SAR (hereinafter referred to as PolSAR) image classification framework to improve classification accuracy [
30,
31,
32,
33]. Li et al. [
34] used Sentinel-1 dual polarization VV and VH data to discriminate treed and non-treed wetlands in boreal ecosystems. Mahdianpari et al. [
35] use multi-temporal RADARSAT-2 fine resolution quad polarization (FQ) data to classify wetlands in Finland. The results show that the covariance matrix is a critical feature set of wetland mapping, and polarization and texture features can improve the overall accuracy. Therefore, the use of multi-temporal PolSAR classification shows considerable potential for wetland mapping. Full-polarization SAR data also have great advantages in wetland classification.
Previous studies have shown that multisensor remote sensing information fusion can improve the final quality of information extraction by relying on the existing sensor data without increasing the cost [
23,
36,
37,
38,
39]. Due to the variety and complexity of coastal wetland types, it is necessary to consider multisource data fusion to improve the accuracy of wetland classification [
7,
40,
41,
42]. One approach is the synergetic classification of optical and SAR images, considered to be an effective way to improve the accuracy of ground object recognition and classification. For example, Li et al. [
43] used GF-3 full-polarization SAR data and Sentinel-2 multispectral data to carry out synergetic classification of YRD wetlands, and the results were significantly superior to that of the single datum. Kpienbaareh et al. [
44] used the dual polarization Sentinel-1, Sentinel-2, and PlanetScope optical data to map crop types. Niculescu et al. [
45] identified an optimal combination of Sentinel-1, Sentinel-2, and Pleiades data using ground-reference data to accurately map wetland macrophytes in the Danube Delta, which suggests that diverse combinations of sensors are valuable for improving the overall classification accuracy of all of the communities of aquatic macrophytes, except
Myriophyllum spicatum L. Thus, the fusion of available SAR and optical remote sensing data provides an opportunity for operational wetland mapping to support decisions such as environmental management.
However, a review of the existing literature yields few studies focused on the synergetic classification of coastal wetlands over the YRD, especially with GaoFen-3 (GF-3) full-polarization SAR and Zhuhai-1 OHS hyperspectral remote sensing in China. Therefore, in this study, we first introduce a combination method for coastal wetland classification over the YRD with both GF-3 and OHS images, and we then evaluate the classification accuracy. Furthermore, we investigate the influence of an optimal feature subset, seasonal change, and tidal height on the final classification.
3. Results
The classification results derived from the ML, MD, and SVM methods for the GF-3, OHS, and synergetic data sets in the YRD are presented in
Figure 8. First, a larger amount of noise deteriorates the quality of GF-3 classification results, and many pixels belonging to the river are misclassified as saltwater (
Figure 8a,d,g), indicating that the GF-3 fails to separate different water bodies (e.g., river and saltwater). Second, the OHS classification results (
Figure 8b,e,h) are more consistent with the actual distribution of wetland types, proving the spectral superiority of OHS. However, there are many river noises in the sea that are probably attributed to the high sediment concentrations in shallow sea areas (see
Figure 1). Third, the complete classification results generated by the synergetic classification are clearer than those of GF-3 and OHS data separately (
Figure 8c,f,i). Similarly, some unreasonable distributions of wetland classes in the OHS classification also exist in the synergetic classification results, which reduces the classification performance. For example, river pixels appear in the saltwater, and
Suaeda salsa and tidal flat exhibit unreasonable mixing. Overall, the ML and SVM methods can produce a more accurate full classification that is closer to the real distribution.
The accuracy results obtained by different classification methods for different data sets are shown in
Table 4,
Table 5,
Table 6, and
Table 7, and
Figure 9,
Figure 10, and
Figure 11. The OAs obtained by the ML, MD, and SVM methods for the GF-3 data are 52.5%, 55.7%, and 80.7%, and the Kappa coefficients are 0.36, 0.41, and 0.70, respectively. The above classification accuracy is the lowest of all classification processes, possibly due to common wetland structural conditions. In contrast, the OAs with the ML, MD, and SVM methods for the OHS data are 96.7%, 87.6%, and 95.6%, and the Kappa coefficients are 0.95, 0.82, and 0.94, respectively. This may be attributed to the increased spectral separation capacity of the biochemical characteristics of wetland types. Subsequently, the classification accuracy after data fusion is improved by approximately 30% compared with the GF-3 data alone. This is mainly due to the consideration of the biophysical and biochemical changes that occur with the change in the phenology of wetland types.
Considering that the synergetic technique essentially combines the structural and dielectric information of wetland types with the scattered power components, GF-3 and OHS data are transformed into more meaningful target information content than the GF-3 or OHS data alone. Therefore, we found that the accuracy metrics of synergetic classification were significantly improved compared with the single data classification in
Table 4 and
Figure 9. Among the three tested classifiers, the MD method provides the lowest synergetic classification accuracy of 89%, and the other two methods (ML and SVM) are relatively close, with an overall accuracy of 97% and a Kappa coefficient of 0.96.
In addition to the OA and Kappa coefficients (
Table 4 and
Figure 9) and corresponding classification images (
Figure 8), the PA, UA, and F1-score were calculated according to the confusion matrix (
Table 5,
Table 6 and
Table 7 and
Figure 10). Concerning the values of PA, UA, and F1-score obtained for each wetland type, the best classified types are saltwater, farmland, river, and tidal flat, with values above 80%. The accuracy of
Suaeda salsa was the lowest, mainly due to the fact that
Suaeda salsa is small in size (approximately 1 m in height and width) and sparsely distributed on the tidal flat, whereas the image resolution of 10 m was used in this study. The PA, UA, and F1-score of saltwater and river for GF-3 data are significantly lower than those of the other two datasets. Since SAR distinguishes objects by different scattering mechanisms and surface roughness, the above two factors are basically the same in saltwater and river, making it difficult to distinguish between them. Therefore, the spectral characteristics of optical images are required to improve the PA, UA, and F1 scores of water bodies.
For synergetic classification, the PA, UA, and F1-score are above 90% as most phenological features are captured by the SAR backscatter coefficients and OHS spectral information. Although there is an overall increase in the Kappa coefficient, OA, UA, PA, and F1-score for different wetlands with synergetic classification, the PA, UA, and F1-score of shrub, grass, and Suaeda salsa are abnormal, respectively. The decrease in the UA, PA, and F1-score could be due to the fact that the sample pixels used for training are insufficient. Considering the complexity of wetlands in the study areas, these levels of accuracy prove the robustness and high performance of the proposed synergetic classification in different study areas with various ecological characteristics.
Misclassification commonly occurs in the process of image classification. The fewer misclassified categories and misclassified pixels, the better the results of the classification.
Figure 11 is a graphical representation of the confusion matrix. Most off-diagonal cells have low values, indicating that most pixels are reasonably well classified. In particular, the results of the ML synergetic classification show that part of the tidal flats were wrongly classified as
Suaeda salsa, grass, river, and farmland. In a few cases, saltwater was also misclassified as shrub and river. The biggest omission was the misclassification of
Suaeda salsa as tidal flat and farmland. In general, there is extensive confusion between adjacent succession groups, such as saltwater vs. river, farmland vs. tidal flat, and shrub vs. grass.
5. Conclusions
Wetland classification is a challenging task for remote sensing research due to the similarity of different wetland types in spectrum and texture, but this challenge could be eased by the use of multi-source satellite data. In this study, a synergetic classification method for GF-3 full-polarization SAR and OHS hyperspectral imagery was proposed in order to offer an updated and reliable spatial distribution map for the entire YRD coastal wetland. Three classical machine learning algorithms (ML, MD, and SVM) were used for the synergetic classification of 18 spectral, index, polarization, and texture features. According to the field investigation and visual interpretation, the overall synergetic classification accuracy of 97% for ML and SVM algorithms is higher than that of single GF-3 or OHS classification, which proves the performance of the fusion of fully polarized SAR data and hyperspectral data in wetland mapping.
The spatial distribution of coastal wetlands affects their ecological functions. Detailed and reliable wetland classification can provide important wetland type information to better understand the habitat range of species, migration corridors, and the consequences of habitat change caused by natural and anthropogenic disturbances. The synergy of PolSAR and hyperspectral imagery enables high-resolution classification of wetlands by capturing images throughout the year, regardless of cloud cover. Therefore, the proposed method has the potential to provide accurate results in different regions.