1. Introduction
Water body extraction plays a significant role in ecological environment monitoring, the information it provides can help in the utilization and protection of water resources, the evaluation and prevention of natural disasters such as floods, and the guidance of agricultural production around the waters [
1]. Since the beginning of remote sensing more than 40 years ago, this technology has become increasingly mature, helping us quickly and comprehensively obtain the distribution of surface water and continuously monitor its dynamic changes. Remote sensing technology has advantages over conventional surface water mapping methods because it is low-cost, source-reliable, has high conducting frequency, and makes repeatable observations. Many studies [
2,
3,
4,
5] have successfully used surface water extraction based on remote sensing images and multi-source remote sensing data such as moderate resolution imaging spectroradiometer (MODIS), small satellite constellation (HJ—1 a/B), multispectral scanner system (MSS), thematic mapper (TM), and enhanced thematic plotter (ETM +) to extract data on epicontinental water bodies. In particular, MODIS images have the advantages of 36 band range channels and a short return cycle, so they are widely used not only for the detection of land surface water but also for the inversion detection of global surface types and vegetation indicators [
2,
3,
4,
5].
With technological development, researchers continue to explore more advanced methods for water body extraction by using remote sensing technology. Improving the accuracy of information extraction and imaging has become the focus of research. In 2018, Wang and Qin pointed out that the combination of remote sensing spectroscopy with the physical and chemical properties of water is an important method currently studied to improve the accuracy of water extraction [
4]. Jiang et al. used a multilayer perceptron (MLP) neural network for surface water accurate identification [
6]; 2019, Hong et al. claimed that automatic sub-pixel coastline extraction method (ASPCE) can accurately distinguish land and water area [
7]; 2017, Yang and Du used principal component analysis (PCA) and modified normalized difference water index (MNDWI) to improve the accuracy of water extraction [
8]; Normalized difference water index (NDWI) and Sentinel-2 Water Index (SWI) method were used by Jiang et al. at 2020 to reach higher spatial and spectral resolution [
9].
The MERSI data of FY-3 is very similar to MODIS data. FY-3/MERIS enhances the ability to observe fine surface features. Based on the MODIS spatial resolution of 250 m data, MERSI data add 3 bands to the resolution of this data, especially the thermal infrared data at 10.5–12.5 μm in the first channel, which enables FY-3/MERSI images to detect the 250 m spatial resolution of some natural disasters such as typhoons, debris flow, and landslide. There are few studies on using FY-3/MERSI to extract surface waters, and the application of FY-3/MERSI data still has significant room for exploration. The parameters comparison between FY-3/MERIS and MODIS is shown in
Table 1.
In the water body extraction method, since the 1970s, increasingly more scholars have engaged in the exploration and analysis of the remote sensing applications of surface waters. The methods for classifying surface waters based on visible remote sensing data include the single-band threshold method, the multi-band method, the conventional land classification method, and the special water extraction method. Among the commonly used methods for extracting land surface water, the single-band threshold method and the multi-band method are simple to calculate and convenient to construct, but they have many topographic restrictions. For images in different regions and at different times, the threshold settings are different, and these two methods’ adaptabilities are relatively poor [
10,
11,
12,
13,
14,
15]. Conventional land classification methods, such as supervised classification, unsupervised classification, and the decision tree method, have a large workload in the process of building a relevant model. At the same time, it is necessary to adjust the conditions in the classification of images for different times and regions. Moreover, their adaptabilities are relatively poor. Surface water extraction methods make full use of the spectral characteristics of land surface water bodies. At present, the particle swarm optimization (PSO) technique is a new kind of evolutionary algorithm based on the observations of bird groups looking for food [
16]. Currently, discrete particle swarm optimization (DPSO) has been fully used to extract land surface information from remote sensing images, in order to improve the integrity of the algorithm for water body extraction. PSO has intelligent characteristics of self-adaptation and self-organization, as well as the advantages of simple calculation, fast convergence, and strong stability [
17,
18,
19]. Thus, as an intelligent optimization algorithm, PSO has a very strong ability to determine an optimal solution, which has been increasingly more introduced for the special classification of surface waters in remote sensing images.
By combining the characteristics of DPSO with spectral information, Jia et al. made full use of water information with fewer parameters to complete the classification process for land surface water and published the SMDPSO [
20]. With the advantages of low parameter demand and strong portability, this method can be easily combined with more surface water features.
Fengyun satellite data has good timeliness, which can help us to evaluate the flood invasion area in time and provide necessary guidance for disaster relief. Yu et al. used the image and spectral characteristics of FY 3 MERSI data from 1 to 5 bands, analyzed the performance characteristics of vegetation, towns, soil, and water on MERSI images, and performed various water recognition models. A water body index expansion model suitable for MERSI data has been put forward by them. Due to the limitations of obtaining the permission of satellite data acquisition, there are relatively few studies on water extraction methods using Fengyun satellite data [
21].
This paper primarily uses FY-3/MERSI images to extract surface water from Poyang Lake. The classification of land surface waters is mainly completed by using the single-band threshold, water index, decision tree classification, supervised classification, unsupervised classification, SMDPSO and SMDPSO+LFE algorithms. After comparing the coefficients of OA, Kappa, f1, and miou of the various models and methods, the best method for the classification of land surface waters is selected. Moreover, the stability and sensitivity analyses are completed using this method.
Selecting an accurate, sensitive, and stable method is essential for water body information extraction in the Poyang Lake region. However, seven potential water extraction methods are known, but it is not clear which one is the most acceptable. The following study will determine the most appropriate method for water extraction in Poyang Lake. The research area and data section will explain the selection of the water body information of the Poyang Lake, and the spectroscopic characteristics of FY-3 data the Landsat 8 operational land imager (OLI) data. The source of these data will be explained at the same time. The methodology part includes the pre-processing of FY-3 and Landsat 8 OLI images, and the algorithms of 5 traditional methods, SMDPSO method, and SMDPSO+LFE method. Methods of validation and accuracy evaluation will also be provided. Then, the results section will give the accuracy comparison of the 7 methods and the sensitivity and stability test results of the most accurate method. The advantages and disadvantages of the 7 methods will be discussed in the discussion section. The conclusion of this research will be provided at the end.
3. Methodology
3.1. Pre-Process
A method that can be used to improve the geometric errors of the FY-3/MERSI image was adopted. According to the image characteristics of the FY-3/MERSI image, four corresponding error categories (90% control point error (CE90), plane geometric error (RMSEH), meridional geometric error (RMSEX), and zonal geometric error (RMSEY)) were determined to calculate the geometric error of the FY-3/MERSI image. Firstly, the image registration in the FY-3/MERSI research area was confirmed via the specific registration points and geometric error analysis, and then the geometric error correction was carried out for the other time-series images in the research area. Lastly, the radiometric calibration was completed.
The research area of Poyang Lake in this paper is relatively small, so the regional shape of the research area was cut out from the FY-3/MERSI image. Meanwhile, the Landsat8 OLI image for accuracy verification is obtained from the same area [
10,
12,
28]. In the stability analysis of the surface water extracted by the relatively most accurate water extraction method, the corresponding study area images were also selected from the FY-3/MERSI images.
The data in this paper used remote sensing image automatic matching technology to perform control point matching processing on the FY-3/MERSI satellite remote sensing data that has been systematically geometrically corrected. Then FY-3/MERSI data were used to determine its geometric positioning error according to the control point of the same name and perform detailed errors on a single image verification and analysis.
In this study, Level 1a data, which is unprocessed instrument data with geo-referenced and time-referenced parameters [
29], was applied, and the geometric correction was performed based on the latitude and longitude data it provides. The geometric correction in this process is a systematic geometric correction. Then the data was mosaicked and trimmed according to the spatial scope of the study area to generate data of spatial extent covered the study area.
In this pre-processing, Proba-V100m image data is selected as the reference image for analyzing the positioning accuracy of FY-MERSI image data. Using computer artificial intelligence processing methods, automatically find the same control points of the reference image and the image to be processed, the most important of which is the corner detection operator. There are many types of image matching operators; image matching based on image features is the most common operator. Among the image matching operators based on point features, the Moravec corner detection algorithm is one of the earliest corner detection algorithms. The algorithm detects each pixel of the image, treats a certain window around the pixel as a block, and detects the correlation between this block and other surrounding blocks. This correlation is measured by the sum of the squared differences between two blocks. The smaller the value, the higher the similarity. The image matching in this paper is realized under the support of ENVI. The ENVI software provides Moravec and Forster operators to find the same control points. In this process, the Moravec operator is selected and the correlation coefficient is set to 0.95 to determine the effective control points [
30].
3.2. Conventional Process
3.2.1. Extraction of Surface Water by Single-Band Threshold Method
According to the reflectance characteristics of the FY-3/MERSI image for different ground object types, the most prominent band was selected for threshold segmentation and land surface water extraction. Taking FY-3/MERSI data as an example, the reflectance values of land surface water, vegetation, and soil vary greatly with the wavelength changes. Land surface water basically absorbs all the spectral values of sunlight in the near-infrared band, and only a very small part of it is reflected, while vegetation and soil have the opposite trend to land surface water. This leads to the soil, building, vegetation, and surface water bodies showing huge differences in imaging. In this experiment, the threshold value was set for the FY-3/MERSI Band 4. In the early stages of establishing the model, to analyze the reflectance of these classified samples in the near-infrared band, samples of the edge of land surface water, small tributary watershed, vegetation, bare land, soil, etc., were selected for study. The water and other surface epicontinental types showed two obvious peaks in the histogram: pure land surface water and vegetation. Sandwiched between these two peaks is an epicontinental fuzzy area of the water body and bare land construction. The gray value corresponding to the position of the trough is selected as the threshold interval of the single-band threshold method. After repeated tests, the threshold value was determined to be 0.1516.
3.2.2. Extraction of Surface Water by Water Index Method
In remote sensing, NDVI = (NIR − R)/(NIR + R), where NIR is the reflection value of the near-infrared band, and R is the reflection value of the red band [
13,
31]. NDWI = (G − NIR)/(G + NIR), where NIR is the reflection value of the near-infrared band, and G is the reflection value of the green band [
14,
15]. In this experiment, NDWI was used to classify the surface water of Poyang Lake. FY-3/MERSI green light and near-infrared band were used to establish ratio relationship, and the characteristics of terrestrial surface water were emphasized from the image. The threshold value of the water index method is 0 to complete the classification and extraction of surface water.
3.2.3. Extraction of Surface Water by Knowledge Decision Tree Classification Method
The four-channel data of the FY-3/MERSI image were selected to calculate the values of the NDVI; NDWI; and the sum of the reflectance values of green light, red light, and the near-infrared band according to their formulas. The slope value was calculated using elevation data [
32,
33,
34]. The above data is used as the basis for decision tree nodes. The knowledge decision tree is shown in
Figure 2, where NDVI represents the normalized vegetation index, NDWI represents the normalized water index, and Sum_grn represents the sum of the reflectance values of the green light, red light, and near-infrared bands [
35]. Using the prior knowledge, the image bands are analyzed according to the samples and then the decision tree is generated [
36,
37].
3.2.4. Extraction of Surface Water by Supervised Classification Method
Supervised classification is mainly separate into three steps. The first is to define the category. In this experiment, FY-3/MERSI images are characterized by visual interpretation. According to FY-3/MERSI image features, surface features are divided into four categories: surface water, bare land, architecture, and vegetation. The second is for sample selection, which in turn evenly sample selection on the remote sensing image classification and then calculate all separability between samples. Sample can be separating degree value between 0 and 2, if the separated degrees of the sample were greater than 1.9, the next classification step will be started. Otherwise, the program will modify the input data or check the classified samples again. The third is classifier selection. This study selects the support vector machine (SVM) method in supervised classification. SVM can construct a variety of vectors according to the characteristics of image data channels and select the support vector with better performance for classification, to maximize the class spacing of all surface types and establish supervisory classification rules based on this. Landsat verification data also adopts the supervision classification method of the above steps. The data collection of supervised classification method is to select water and non-water areas in the study area by visual discrimination, using the digital number (DN) value of each band as the input data. The water data is selected for study as the target data, training it through a supervised training model such as SVM, applying it to the entire region after obtaining the model data.
3.2.5. Extraction of Surface Water by Unsupervised Classification Method
The main parts of unsupervised classification are the following three steps. The first step is image analysis. The number of classifications is determined based on the remote sensing image characteristics. In this paper, based on the image characteristics of the Poyang Lake study area, the image classification categories are defined as vegetation, bare land, buildings, and water bodies. When selecting the number of classifications, the number is roughly 2–3 times the number of land surface classifications, and finally the number of classifications in unsupervised classification is set to 12. The second step is classifier selection. The corresponding unsupervised classifier is selected according to the spectral characteristics of the remote sensing image data, the number of classifications, and the selection rule of the sample. In this paper, the K-Means method is applied, which determines the center of different categories based on the similarity of different pixels in the remote sensing image position and then iteratively calculates the distance between all the pixels to be classified and the center positions of different categories to select the most similar feature type for aggregation. The third step is category definition and consolidation. According to the linear representation of the categories of FY-3/MERSI images, the categories of the unsupervised classification results are determined and merged with the same categories.
3.3. Extraction of Surface Water by the SMDPSO Method
The SMDPSO method extracts water bodies in two main steps. First, the water probability image is obtained from the multispectral data by the proposed spectral matching method. Then, the water pixels are identified from the water probability image by using the DPSO method [
20]. The purpose of surface water bodies extraction based on FY-3/MERSI images is to solve the problem of pixels or discrete image pixels. However, the basic particle swarm optimization (PSO) is mainly used to solve the problem of continuous space, but it cannot solve the problem of discrete space. Therefore, we applied discrete binary PSO [
37,
38]. By selecting standard spectra from the FY-3/MERSI image to calculate the water probability, the water probability image was divided into 4×4 blocks. For each different block, a spectral matching algorithm based on discrete particle swarm optimization was used to extract the land surface water [
18,
19]. The flow chart is shown in
Figure 3.
Because NDWI makes full use of the regularity of surface water reflection spectral curves, it has a good response to water bodies. Thus, the value of the NDWI is introduced into the calculation of water probability. The water probability image of the study area obtained by calculations is shown in
Figure 4.
To make the DPSO algorithm more sensitive to various surface types in the process of extracting land surface water, the image is roughly classified based on the mean value μ and standard deviation σ of the pixel water probability. The judgment conditions and flow charts of the four coarse classifications are shown in
Figure 5.
The high μ/σ value of rough classification 1 indicates that the block has a high probability of water homogeneity or a low standard deviation, and the pixel feature of the block is likely to be a homogeneous land surface water body or a homogeneous non-water surface. The comparatively lower μ/σ values of rough classifications 2, 3, and 4 indicate that the block has a lower water probability mean value or a higher standard deviation, and the pixel features of the block are more likely to be mixed features. In the rough classification of a low μ/σ value, three categories were classified according to the size of the water probability value. Classification 2 has a highest water probability mean value, indicating that the pixel feature of this block is more likely to be the boundary between water and land and also more likely to be the boundary between small water and vast land areas. Classification 3 offers a medium water probability mean value, indicating that the block is more likely to have small tributaries and that more water probability information from other types is mixed in the block. Classification 4 provides the lowest water probability mean value and a relatively high-water probability standard deviation, indicating that it is more likely to be a mixed block of non-water land, such as the boundary between bare land and vegetation, and the boundary between buildings and vegetation. A rough classification diagram of the Poyang Lake research area is shown in
Figure 6.
In the code for DPSO, the position vectors of all the particles form position matrix X. The pixels in the block are divided into two categories: water and non-water. The velocity vectors of all particles form the velocity matrix V. The particle objective function value is then calculated. The optimal particle history and global optimal solution are obtained, and the particle velocity and particle position are updated. Next, the fitness value of the particles is calculated, and the historical optimal and global optimal solutions of the particles are updated. After reaching the maximum iteration, the output global optimal solution is the final classification representation of the land surface water pixel and the non-land surface water pixel in the block.
3.4. Extraction of Surface Water by SMDPSO+LFE Method
For a particular image resolution, a river may appear to be about the width of a single-pixel or more, depending on the actual width of the river [
39].
Figure 7 shows the feature differences between rivers with a single-pixel width and rivers with multiple pixels in the water probability image. In general, the techniques used to extract the feature width of a single pixel cannot be used directly to extract the same features at a higher resolution. A single-pixel width element is mapped by a single line in the generated vector database, while the multiple linear elements of pixel width must be represented as a set of polygons [
40].
SMDPSO+LFE method calculates the probability of the water body through the standard spectrum, and roughly classifies the image through the probability of the water body. The results of the rough classification are divided into two parts. As the flow chart is shown in
Figure 8, the first part is used to extract water pixels via the DPSO algorithm for three categories. The second part is used to enhance the current characteristics of small tributaries and extract the small tributaries.
The first stage involves extracting water pixels from three tributaries via the DPSO algorithm, and the second step is to enhance the current characteristics of small tributaries and extract the small tributaries [
41,
42]. The extraction of small tributaries from FY-3/MERSI images is mainly based on Fischler’s work [
43]. Fischler proposed a Duda Road Operator (DRO) in his study to identify rural roads from low-resolution remote sensing data by locating roads less than or equal to three pixels wide. The DRO was further studied and extended to extract natural features, such as coastlines and valleys. Fischler’s research involves evaluating images pixel by pixel based on a set of operators, each of which measures the local image attributes through a pixel and returns a score indicating the likelihood that the pixel is a road [
44,
45,
46]. The higher the calculated value of the operator, the more likely it is a road pixel.
In the FY-3/MERSI images, small tributaries showed as less than or equal to 3-pixel values, so the DRO operator was introduced to enhance the linear features and extract small tributaries. Rivers vary greatly in morphology due to their different points of origin. Rivers formed on mountains tend to be very tortuous in shape along with the direction of valleys. However, rivers in flat areas and artificially excavated rivers tend to be squarer in shape. Considering the effect of river shape on the enhancement of linear features, only the central pixel of the river and the pixels on both sides of the central pixel in different directions are considered in the process of introducing the DRO operator. Based on an image with an inconsistent width of river water pixels, 3 × 3 and 5 × 5 pixels were used to enhance the linear features of the image. Considering that the water probability value of the surface water pixel is larger than the rest of the pixel, to avoid the influence of other pixels, only the cases where the water probability value of the pixel in the center is larger than the two sides were calculated. The DRO operators include four operators in different directions to calculate the linear enhancement results for operators in different directions and determine the maximum values for 3 × 3 and 5 × 5 pixels in different directions as the final linear feature enhancement result values. The linear feature enhancement results according to the probabilistic image of the water body in the study area are shown in
Figure 9.
Figure 9 shows that the small tributaries on the west bank of Poyang Lake are greatly enhanced and that the small tributaries have a high degree of integrity. For the wider parts of the river in the southwest bank, the width of the river pixel is greater than three, and the enhancement effect of the linear features of the DRO is not obvious. At the same time, the irregular edges of Poyang Lake were completely enhanced at the boundary of water and land. However, for the tributaries that are one-pixel wide, this linear enhancement is less effective due to the small differences between the water probability value of the tributaries and the surrounding pixels.
3.5. Validation and Accuracy Evaluation Methods
In this study, 7 methods were applied, including single-band threshold method, water body index method, knowledge decision tree classification method, supervised classification method, unsupervised classification method, SMDPSO, and SMDPSO+LFE.
A sensitivity analysis was carried out to evaluate the accuracy of the extraction of surface water using the standard water spectrum, block size, and iteration times. The standard water spectrum, block size, and iteration times are input parameters for the selected water extraction method. Three sampling areas were selected to evaluate the sensitivity of the selected water extraction method to the water standard water spectrum. These three sampling areas were identified as Lake Superior, the Congo river, and the Persian Gulf, and their maps are shown in
Figure 10.
For the block size, a larger size covers more adjacent pixels, but the calculation time and distance degree, which means the distance from the center of one block to the neighboring block’s center point, of different block size algorithms are different. In this study, the block size of the research area of Poyang Lake varied from 3 to 10, the land surface water was extracted, and the corresponding OA was calculated according to the sampling results.
For the land surface water obtained by the iterations, the smaller the number of iterations is, the poorer the results are and the less likely they are to achieve the output of the objective function value. As the number of iterations grows larger, the results for the land surface water obtained by the iterations are improved, and the optimal solution becomes more likely to be achieved. However, the longer the number of iterations is, the longer it takes for the algorithm to output results. Different iterations were used to output the image water extraction results from the images in the research area and calculated the OA and Kappa coefficients from the water extraction results to find the most appropriate iteration times.
5. Discussion
The single-band threshold method is a good method to distinguish surface water from land. Through manual visual interpretation, this method could roughly extract the surface water in the study area. However, most of the small tributaries and discrete surface waters on the southwest and southeast shores of the Poyang Lake area were not successfully separated; only some of the rougher rivers were extracted. Thus, there are still some imperfections at the extraction of river’s edge.
The water index extraction method can be used to extract the basic shape of the main body of Poyang Lake. However, the NDWI is sensitive to sediment, and the area of Poyang Lake and a large number of rivers are mixed with a great deal of sediment. Moreover, the water index method only extracts wider rivers.
The decision tree classification method can extract the general water bodies and rivers of Poyang Lake. However, lakes with a large amount of suspended sediment and mixed water cannot be distinguished by decision tree classification. The high concentration suspended sediment water body was incorrectly classified as a non-water body. Moreover, this method could not distinguish the shallow bank at the edge of the lake or extract the small tributary water body. At the same time, the knowledge-based decision tree method requires that researchers have a good understanding of the image features in the early stage when constructing the decision tree. The precision of the classification results of the decision tree was determined by sample selection. Images in different regions are needed to build special decision trees, so their utility is low.
There were many misclassifications in the supervised classifications for the shadowed part of the western mountains. At the same time, the surface water extraction results were also not perfect for the area of the lake edge with a complex shape. In addition, supervisory classification methods require prior field investigations or prior knowledge and are complex to operate.
The unsupervised classification results were not accurately extracted from the shoals on the lake boundary. At the same time, for the junction between the river and the building, part of the building was divided into land surface waters. The unsupervised classification does not need to be supervised during the selection and implementation of classifiers. However, after the implementation of classifiers, it is necessary to judge the results according to the classification characteristics and then merge the classifications, which requires accurate discrimination.
The SMDPO extracted not only the basic surface water of Poyang Lake but also some relatively small tributaries. The general directions of the rivers were also clearly indicated, and some discrete lakes were successfully distinguished. However, the land surface water in the building area was relatively confused. The results of SMDPSO are better than those of the 5 previous methods.
On the basis of SMDPSO, the SMDPO+LFE method extracted some relatively small tributary waters. In addition, some small tributaries and lake water that take up only one pixel were extracted.
According to the stability study of SMDPSO+LFE, the spectral reflectance of vegetation, cloud, and ice is quite different from the spectral value of standard water, so the water probability value is relatively low. This method can accurately distinguish vegetation, clouds, and ice, as well as thin clouds. The SMDPSO+LFE method was used to extract the images of the Poyang Lake research area, the Magdalena River in Colombia, and the coastal lake in Mexico. This method was shown to be highly sensitive to shoal and silt, as it could correctly distinguish between shoal, silt, and surface water. At the same time, this method offers good feedback for small tributaries and lakes with a pixel width of less than two and can effectively extract small tributaries, small lakes, and small puddles. However, the continuity of small tributaries needs to be improved.
Mountain lakes in Thailand, Gandhi lake in India, Nasser reservoir in Egypt, and Congo river in the Congo are characterized by obvious surface waters due to their simple image types.
The OA and Kappa coefficient values of the classification results of surface waters in these areas were both high, with an overall classification accuracy higher than 96.5% and Kappa coefficient values higher than 0.93.
In Kazakhstan’s Aral sea, Colombia’s Magdalena river, Mexico’s beach lakes, and in the Mississippi River, due to the complexity of ground type, shallow water and silt, sea tide, and cloud shadow, epicontinental water mixed with the rest of the class information; water extraction result complete degree is not high; OA is relatively low, at around 95%, and the Kappa coefficient is about 0.89.
In terms of sensitivity, the sizes of the blocks increased from 3 to 5, and the overall classification accuracy gradually increased. With the increase in the block size, the greater distance limit of the block is solved, and the overall classification accuracy gradually decreases. Considering the solution time for land surface water, when the size of the block is 4, there are 65,536 (216) possible solutions for land surface water in any block. When the block size is 5, there are 33,554,432 (225) kinds of surface water in any block. The number of iterations is multiplied. Therefore, to achieve the highest accuracy and meet the low time requirements, the optimal block size chosen in this paper is 4.
Through an approximate curve analysis of the OA and Kappa coefficients and the number of iterations in SMDPSO, the correlation between the fitting relationships of the OA and the Kappa coefficients and the number of iterations and the logarithm function was higher than 0.84. With a gradual increase in the number of iterations, the distribution law of logarithmic growth was realized. With 0–1000 iterations, the overall classification accuracy and Kappa coefficient increased rapidly. The growth slowed in 1000–2000 iterations, and the overall classification accuracy and Kappa coefficient tended to be stable after 2000 iterations. Since a higher number of iterations means a longer surface water extraction time, the number of iterations for all images in this paper was set at 2500.
This research focuses on the comparison of water extraction methods. Five traditional methods are already mature. On the basis of comparing these methods, this paper added SMDPSO and SMDPSO+LFE to conduct experiments on water extraction methods. In the experiment of this method, SMDPSO+LFE is considered to have better accuracy, stability, and sensitivity.
The results show that the accuracies of the seven methods are all greater than 93%. However, from the perspective of overall accuracy, SMDPSO+LFE is 3% to 4% higher than the conventional 5 methods. Kappa coefficient is 0.11 to 0.06 higher. In water extraction, including many other classification models, the process of improving the accuracy is relatively slow, which mainly depends on the characteristics of the data. Even a small increase in accuracy is of great significance, especially in the extraction of river tributaries.
6. Conclusions
Based on FY-3 remote sensing satellite data, seven methods were applied to extract the surface water of the Poyang Lake, China. A confusion matrix analysis on the verification samples selected by Landsat 8 OLI was conducted to compare with the seven classification methods. Compared with other methods, SMDPO+LFE is considered to be the best method for extracting the water of Poyang Lake because of its high accuracy on water extraction, especially on small tributary waters. It also shows good performance in terms of sensitivity and durability.
Among the surface water extraction methods in the Poyang Lake region, the classification accuracy and Kappa coefficient of the seven methods were all higher than 93% and 0.84 respectively. SMDPSO+LFE classification accuracy reached 97.56% and Kappa coefficient reached 0.95. These results show that the SMDPSO+LFE method represents the highest accuracy among the seven methods in this case study. The algorithms of SMDPSO+ LFE determined three parameters of standard water spectrum, block size, and iteration times in the early stage of the surface water body extraction model and have a good extraction effect.
In terms of the adaptability of different regions, eight qualitative regions of different terrestrial surface water types, landscapes, and topographic features are selected globally. SMDPSO+LFE algorithm was used to extract surface water from FY-3 images of eight regions. The results show that water extraction is effective and can be divided into ice, shoal, silt, and small tributaries. Sensitivity analysis was carried out through the standard water spectrum, block size, and iteration times. The results show that the selection of Lake Superior as the standard water spectrum, block size of 4, and iteration times of 2500 have high classification accuracy and Kappa coefficient. The algorithm is also more efficient.
However, compared with other methods, the SMDPSO+LFE method is slightly more complicated in the implementation process. Other conventional methods can be completed with many software. SMDPSO+LFE does not support by any software by now, which means it needs to be programmed. In a follow-up study, intelligent optimization algorithms will be introduced. The surface water body extraction stability of SMDPSO+LFE method under different programming languages will also be explored.