1. Introduction
Aquaculture has become one of the fastest-growing food industries [
1], and the fishery products of China play an important role in the international seafood market [
2], with over 60% of the fish farmed in the world [
3]. However, according to the 2018 State of World Fisheries and Aquaculture report (SOFIA 2018) by the Food and Agriculture Organization of the United Nations (FAO), the proportion of marine fish resources within a biologically sustainable level showed a downward trend in recent years [
4]. In China, many lakes and coastal wetlands were reclaimed in the past few years in order to support the fast development of fisheries [
5], putting tremendous pressures on environments and hampering regional sustainable developments [
6]. Accurately mapping aquaculture areas is an important support to policy development and implementation at regional, national, and global levels, and to measure progress towards sustainable developments [
7].
Traditional field survey for aquaculture mapping suffers from low efficiency, and currently the satellite remote sensing technique is one of the most important methods due to its many advantages, such as low cost, wide monitoring range, high efficiency, and high repeated observations [
8,
9,
10]. Optical and radar remote sensing images have been increasingly utilized to delineate aquaculture areas [
11,
12], and many methods have been developed for local [
13], regional [
14], and national scale [
15] aquaculture mapping. Meanwhile, due to the periodic repeated observations of satellites, remote sensing images were used not only to map aquaculture areas at a single time [
16], but also to map their time series distributions [
17].
Previous studies can be roughly summarized into two categories according to basic mapping units: pixel-based and object-based approaches. The pixel-based methods are widely applied to the images with low and medium spatial resolutions. For example, artificially designed spectral and textural features were computed for each pixel, and a supervised machine learning classifier was used to map large scale aquaculture areas [
18]; Sakamoto et al. [
19] applied a wavelet-based filter for detecting inland-aquaculture areas from MODIS time series images, and deep-learning-based methods were used for aquaculture classification [
20,
21]. For the object-based classification method [
22,
23,
24], images are segmented into many homogeneous segments, which are further classified through machine learning classifiers or classification rules derived from expert rules. For example, Wang et al. [
25] segmented Landsat images into objects using multi-resolution segmentation method to extract raft-type aquaculture areas.
Considering data sources, most studies used only optical images for they are visually intuitive and easy to be understood. For mapping aquaculture facilities over small areas, high spatial resolution remote sensing images are frequently applied [
26,
27], whereas medium resolution images are generally used for mapping aquaculture facilities at a regional or national scale due to their wide coverage and better spectral resolution. For example, Ren et al. [
28] combined Landsat series images and an object-based classification method to map the spatiotemporal distribution of aquaculture ponds in China’s coastal zone. Synthetic aperture radar (SAR) images are also used for aquaculture mapping [
14,
29]. For examples, Hu et al. [
29] detected floating raft aquaculture from SAR image using statistical region merging and contour feature; Ottinger et al. [
14] employed time series Sentinel-1 images and object-based approach to map aquaculture ponds over river basins; and Zhang et al. [
30] mapped marine raft aquaculture areas using a deep learning approach by enhancing the contour and orientation features of Sentinel-1 images.
Most studies applied satellite images at a certain time to map aquaculture areas, which may affect the accurate mapping of aquaculture areas for their dynamics. Although the influence of weather could be suppressed by carefully selecting images, other factors may also affect the accurate mapping of aquaculture areas. For example, some paddy fields are still water-dominated at the early stage of farming, and inland aquaculture ponds are drained during harvest time. Previous studies demonstrated that more accurate aquaculture maps could be obtained by using time series SAR images [
31], and time series optical images was also proved to be effective in improving mapping accuracy [
32,
33]. Therefore, time series images are an ideal and reliable data source for aquaculture mapping.
The Google Earth Engine (GEE) platform provides a series of free remote sensing images, many kinds of image processing algorithms, and high-performance computing capabilities, and it can process huge amount of time series remote sensing images over a large-scale area [
34,
35,
36]. Therefore, GEE has been widely used for mapping wetlands [
37] and agricultural lands [
38], and it also has the potentials to map aquaculture areas [
39,
40,
41,
42]. For examples, Xia et al. [
39] proposed a framework for extracting aquaculture ponds by integrating existing multi-source remote sensing images on the GEE platform; Duan et al. mapped aquaculture ponds over coastal area of China using Landsat-8 images and GEE platform [
41] and further analyzed their dynamic changes from 1990 to 2020 [
42]. Existing studies were mainly focused on designing artificial image features or training deep leaning models to map a specific type of aquaculture area, such as aquaculture ponds. However, many different aquaculture types are found over a large area, and how to simultaneously extract multiple types of aquaculture areas over a large area has not been well studied.
This study proposed a novel approach for mapping aquaculture areas with multiple types over large areas. With the Pearl River Basin (Guangdong) of China as a case study, time series Sentinel images were used as a data source to overcome the accidental factors of single-time observation. The spectral indices (including normalized difference vegetation index (NDVI), normalized water index (NDWI), and normalized built-up index (NDBI)) derived from Sentinel-2A multispectral images, the VV and VH polarized data of Setinel-1 SAR images, and their derived texture features were used to map aquaculture areas using machine learning algorithms implemented in Google Earth Engine. The proposed method holds great potentials in simultaneously mapping different types of aquaculture areas over a large area.
5. Discussion
5.1. Comparison with Other Aquaculture Maps
The proposed approach in this study was compared with the methods proposed by Duan et al. [
42] and Xia et al. [
39]. The corresponding reference images were manually labeled. The maps are presented in
Figure 12, and the confusion matrixes are presented in
Table 5,
Table 6 and
Table 7. Duan et al. [
42] adopted a spectrum, and spatial and morphological features of 30 m resolution Landsat images to build a Random Forest classifier to implement an automatic extraction of large-scale aquacultures. Xia’s et al. [
39] extracted the water surface from Sentinel-2A images using multiple thresholds, described the water patches using geometric and spectral features, and finally applied a Random Forest classifier to classify inland aquaculture ponds.
The map provided by Duan et al. [
42] presented a general distribution of aquaculture ponds; however, almost all the details were missed (
Figure 12b,j), for it was obtained using Landsat TM images with a spatial resolution of 30 m. Moreover, they conducted morphological closing and erosion operations post process, which further smoothed the details.
Xia’s method and the proposed approach in this study were applied to two typical regions, and their derived aquaculture maps were compared. Xia’s [
39] and the proposed approach in this study produced similar aquaculture maps, and most of their details are clearer than Duan’s map (
Figure 12c,d,k,l) for Sentinel images of 10 m spatial resolution were applied and the post processing did not eliminate the details. The proposed method produced similar results to Xia’s method (
Figure 12c,d,k,l); however, less embankment pixels were misclassified. Therefore, the User’s Accuracy of aquaculture areas (92.40%) was higher than that of Xia’s method (77.97%). The main problem of the proposed method is that some isolated ponds were missed, resulting in a lower Producer’s Accuracy (
Table 7). However, the overall accuracy and Kappa coefficient indicated that the proposed method performed better over the tested areas.
The proposed approach in this study achieved better performance than Duan’s and Xia’s methods, which might be explained by that: (1) the texture features from normalized difference spectral index images increased the distinguishability of aquaculture area from other objects; and (2) radar images were integrated with optical ones, as they are sensitive to the texture structure of aquaculture areas, resulting in further improvement for aquaculture mapping. More important, the proposed approach was originally designed to simultaneously map aquaculture ponds and mariculture areas, whereas Xia’s and Duan’s methods can only map aquaculture ponds, and thus the proposed method has a better generalizability.
5.2. Impacts of Mixed-Pixels
The aquaculture maps in the Pear River Basin (Guangdong) were obtained with medium resolution Sentinel-1 SAR and Sentinel-2A multispectral images, and the results showed the advantages of using medium resolution images for large-scale thematic mapping. However, the accuracy and generalization of the proposed approach might be affected by mixed pixels of medium resolution images. The aquaculture pixels are often mixed with embankment among ponds, which often results in ambiguous boundaries and some inevitable errors. Thus, some parts of aquaculture ponds were classified as land, and some embankments were classified as ponds. It is difficult to overcome such errors using medium spatial resolution images, and very high spatial resolution images may easily overcome this problem.
The step of extracting water surface mask in our proposed approach is also affected by mixed pixels. Our method relies on the assumption that pixels covering mariculture facilities are still water-dominated, because the sizes of these facilities are usually relatively smaller than a pixel. Thus, they were first classified as waterbodies, and further been detected at later stages. However, this assumption may be not appropriate when using high spatial resolution images, because many pixels covering mariculture facilities are no longer water-dominated mixed pixels. In such case, these pixels will be segmented into non-aquaculture, affecting the accuracy of final map. Object-based methods might be an optimal solution for high spatial resolution images to overcome this limitation.
The proposed approach was designed for large scale aquaculture mapping using medium spatial resolution images. Although the boundaries are not detected very accurately, the aquaculture map over a large area can be efficiently obtained. In particularly, the medium spatial resolution images acquired by many satellite sensors (such as Landsat TM series and Sentinel-2A series) for several decades provide great convenience to monitor the development of the aquaculture industry.
5.3. Selection of Time Series Images
In this study, time series images for a whole year were used for aquaculture mapping in order to eliminate the influence of accidental factors, such as water-dominated paddy fields and dry ponds during harvest period. Time series SAR images also are effective in suppressing speckle noise of radar image. Although there are many advantages, two problems should be noted.
First, the seasonal characteristics of study areas may affect the selection of time series images. For example, this study area is located in southern China, in which the temperature is usually high, and the water does not freeze; thus, the images with good quality acquired at any time can be used. However, in northern China, the pond water freezes in winter and its optical properties will change. Therefore, the time series images should be selected according to the specific seasonal characteristics of study areas.
Second, the assumption and basis of using time series images are that ground entities are not changed suddenly. If some aquaculture ponds are converted into agricultural land or built-up areas in winter, they may still be recognized as aquaculture areas. To improve the ability to respond to such sudden changes, it is necessary to shorten the time interval of time series images.
5.4. Limits and Future Works
An effective and efficient approach was proposed in this study for aquaculture area mapping over large areas; however, some issues should be further investigated. Firstly, some narrow rivers adjacent to aquaculture ponds were still misclassified. Thus, more accurate classifier and post-processing methods are still worth investigating, and the combination of post-processing and river vector boundaries may be a potential solution. Secondly, the aquaculture areas were roughly classified into two types in this study: aquaculture ponds and mariculture areas, and more specific aquaculture types are needed to be investigated. Finally, only one global threshold was used to extract waterbodies (containing aquaculture areas); it was not always optimal for different and complex image scenes, and thus a locally adaptive thresholding approach is a potential solution to improve the segmentation of the water surface. Only the aquaculture maps in 2020 were obtained in this study, and they are not sufficient for more potential applications. With historic earth observation images, it is necessary to analyze the long-time spatial–temporal changes of aquaculture areas and their impacts on economy and ecological systems and further to provide supports for the sustainable developments of the study area.
6. Conclusions
A novel approach was proposed in this study for simultaneously mapping multi-type aquaculture areas over large scale areas by combining spectral and texture features from optical (Sentinel-2A multispectral) and radar (Sentinel-1) images, and a case study in the Pear l River Basin (Guangdong) showed its efficiency. The main contribution of this work could be summarized as follows:
(1) We analyzed the spectral and textural features of aquaculture areas and demonstrated the effectiveness of fusing multiple image features for aquaculture mapping. We found that the use of textural features derived from the spectral indices can greatly improve the mapping accuracy and the use of textural features derived from SAR images can further improve mapping accuracy, as they are sensitive to marine aquaculture facilities.
(2) The proposed approach could generate a more accurate aquaculture map than previous studies. Moreover, the proposed approach was implemented on the GEE platform, and has great potential for national-scale and long-term aquaculture mapping.