2.2. Satellite Data
In this research, GeoEye-1, Sentinel-2 MSI level 1C, and Landsat-8 OLI satellite images were selected. The GeoEye-1 satellite produces images with a spatial resolution of 2.0 m for multispectral bands. Several spectral bands were applied in this study, including blue (0.45–0.51 μm), green (0.51–0.58 μm), red (0.65–0.69 μm), and NIR (0.78–0.92 μm), at a quantization level of 11 bits per pixel in each band. The acquisition data for GeoEye-1 were recorded on 13 February 2019.
Sentinel-2 Multi Spectral Instrument (MSI) level 1C (L1C) imaging was used in this study. In cartographic geometry, L1C products have been corrected using at the top of atmosphere (ToA) satellite reflectance data. The images have spectral bands. The Sentinel 2 satellite imagery data from 4 February 2019 were selected. This study used images with spatial resolutions of 10 m, which consisted of three visible bands and an NIR band, which was needed for image corrections.
The satellite imagery of Landsat-8 OLI Level 1 used data path/row 114-34 with 30 m resolution for multispectral bands. The spectral bands used in this study included blue (0.452–0.512 μm), green (0.533–0.590 μm), red (0.636–0.673 μm), and NIR (0.851–0.879 μm). The Landsat-8 OLI data were recorded on 22 January 2019.
2.3. Image Processing
Satellite imagery data for identifying benthic habitats should be radiometrically corrected to remove any disturbances generated by the environment. The classification accuracy of satellite images may be greatly improved by utilizing corrected images, which contain atmospheric, sunglint, and water column corrections.
In this study, we used ACOLITE to perform atmospheric corrections on Landsat-8, Sentinel-2, and GeoEye-1 imagery. Each image consists of blue, green, red, and NIR bands, and all of them were atmospherically corrected. The exploration of underwater radiances in the visible region of the ocean requires the use of atmospheric correction methods for multichannel remote sensing imagery. The ACOLITE atmospheric correction technique has the lowest relative and absolute error values, compared to the existing L2-WFR, POLYMER, C2RCC, SeaDAS and SeaDAS-ALT [
36]. Therefore, this atmospheric correction algorithm was used in this study. ACOLITE supports sensors from a variety of satellites, including Landsat 5, 7, 8, Sentinel 2, 3, PlanetScope, and WorldView, with atmospheric correction performed using Dark Spectrum Fitting [
23,
36,
37,
38,
39]. ACOLITE also works on the basis of an input image, and does not require external inputs, such as aerosol optical thickness (
) estimates or measurements (such as FLAASH), subject to meeting two conditions: the atmosphere is constant and homogeneous within a limited space, and; at least one pixel in the scene or subscene has a surface reflectance (
) close to zero, so that the atmospheric path reflectance (
) can be estimated in at least one band [
23]. The parameters used to produce the surface-level reflectance are generated by the composite band. The parameter output used in this study was L2R (Level 2, surface-level reflectances) (
, rhos_*). For the ACOLITE atmospheric corrector method, there are internal parameters, such as minimum gas transmittance for retrieval of aerosol optical thickness, which were all set to default values.
Sunglint is a significantly greater complication for remote sensing of the sea floor and aquatic characteristics via radiometric correction. The sunglint algorithm in shallow waters was developed by Hedley [
25]. Utilizing brightness in a NIR band, this approach is advantageous for removing sunglint from remote sensing imagery [
24]. Maintaining a steady baseline brightness and low water luminosity in the NIR, the image was chosen to allow for a variety of pixel luminosity levels. Using all pixels in the area covered, a linear regression is performed between the NIR radiance and the visible band radiance. This technique performs a regression analysis of the NIR and visible band data, using random samples of pixel data to obtain a set of regression slopes. The slope of least squares regression is then used to determine the correlation between the visible band and NIR, and each pixel is adjusted by subtracting the visible band from the NIR radiance’s estimated lowest value. The formula in Equation (1) used for sun glint is shown as follows:
where
is the sunlight-free reflectance;
is the reflectance from visible band
;
is the product of the regression slope;
is the reflectance from the NIR band, and;
is the minimum NIR band.
Considering that environmental factors, such as the bottom type, water depth, and water attenuation (which may cause scattering and absorption in the water column), can vary widely, water column adjustment is a major challenge. Many water column algorithms were developed, though, in some cases, algorithms were not available due to the requirement for values for bathymetry and the diffuse attenuation coefficient of the water [
27]. Therefore, in this study, the Lyzenga algorithm was selected because it could minimize the water attenuation effect in shallow waters, and did not require additional data [
27,
40]. Spectral characteristics obtained from the ocean’s surface were utilized to recognize the water column depth through the Lyzenga algorithm [
41]. The algorithm was continually enhanced and became extensively adopted as the depth invariant index (
) transformation, which could be employed for conducting ecosystem mapping of shallow waters based on satellite data [
42,
43,
44]. Approaches were established to correct for water column, which stems from absorption and scattering by particles in the water [
45]. The technique for identifying benthic habitats has potential advantages, including the development of more than two spectral bands to boost performance, improve researchers’ ability to distinguish between bottom components with similar object spectral reflectances, and expand the functional capability without considering the same coefficient of water attenuation [
41]. This approach establishes that the bottom type is the primary factor that affects the constant in the linear relationship between the Lyzenga-converted reflectance values of the various bands. The
index is defined as an expression between bands
and
, as follows in Equation (2):
where
is the reflectance value of band
;
is the reflectance value of band
, and;
/
is the following Equation (3) allowed to determine the slope of the interband conversion:
where
/
is the ratio of the attenuation coefficient values of bands
and
,
is the variable defined in Equation (4)
where
is the covariance of bands
and
j;
is the variances of band
; and
is the variance of band
j. The following image shows
corrections applied to the values of the three main band ratios, including blue and green (B1/B2), blue and red (B1/B3), and green and red (B2/B3) data [
42]; these bands are factored into the equations.
One type of supervised learning algorithm, known as a support vector machine (SVM), is a non-parametric classifier [
46]. The objective of support vector machines (SVMs) is to locate a hyperplane that can divide the input dataset into a fixed number of classes in a way that corresponds to the samples used for training [
47]. The elements used to classify the image were divided into four classes: land, seagrass, breaking wave, and others.
Table 1 describes each category. The closest training values in the training datasets, generally referred to as support vectors, were used to increase the margin between the tested point and the ideal hyperplane. When the size of the margins was maximized, there was an improvement in the classification accuracy [
48]. The hyperplane in the decision variables was thus established; the SVM model was then developed for each seagrass using the radial gaussian basis function kernel, with approach C and gamma regarded as the best option due to its greater efficiency [
49]. The radial gaussian basis function kernel has better performance than other kernels, with powerful capabilities in remote sensing data processing; it simply needs a few numbers of the parameters to be defined [
50]. Support vectors selected the best values for the SVM hyperparameters and employed cross validation, with some of the training pixels being retained. The parameters for the classification process using SVM were selected (shown in
Table 2), and included Kernel Type, C and Gamma in Kernel Function. The selected kernel type was a radial basis function. The C and Gamma values were used 5792.61 and 32, respectively.
We used the equalize random sampling schema to classify the seagrass [
51]. In this method, the samples in each class were divided into training and testing steps. The sampled area was segmented into the seagrass, breaking wave, land, and others classes based on two methods.
Table 2 shows the definition of each class, which were used as references for dividing the classes for training and testing purposes. From the entire study, the 3737 pixels were selected as references for training, and 1604 pixels were selected for testing. The testing points were selected according to the random sampling method within the label distribution, and were balanced. Training and testing data for seagrass were generated using research from the Korea Institute of Ocean Science and Technology (KIOST), while research from the Korea Institute of Geoscience and Mineral Resource was used for the land and others classes; the breaking wave class data were generated by satellite images. The classified image was then tested to estimate overall accuracy. The accuracy of classification was determined using the method of the percentage of pixels correctly allocated, which is evaluated using the overall accuracy of the classification. Accuracy for a target class is the percentage appropriately labeled to the total number of pixels in that class. We used the matrix’s column and row allocation to define two types of accuracy; these methods are called user’s and producer’s accuracy. Nevertheless, they do not account for agreements across data sets that could be attributed to random chance. The kappa coefficient approach was used to assess the consistency of the output maps by measuring the agreement, based on the actual agreement in the confusion matrix and the chance agreement.