2.2. Sampling Method
On 25 June 2021, a DJI M300 RTK four-wing multicopter UAV (Dà-Jiāng Innovations Science and Technology Co., Ltd., Shenzhen, China) and MS600pro multispectral camera (Dà-Jiāng Innovations Science and Technology Co., Ltd., Shenzhen, China) (
Figure 3) were used to collect multispectral images at 11:00–13:00 (UTC/GMT +08:00) on a clear and cloudless noon. At the time of sampling, the tide was the lowest of the flood season. Before each flight, the sensors were calibrated by a Ag-60 calibration gray plate (
Figure 2), and six ground control points were set for each route. The flying height for UAV aerial photography was 80 m, the flight speed was 5 m/s, the lateral overlap rate was 70%, the heading overlap rate was 80%, the outward expansion distance of the shore and land was 10 m, and the outward expansion distance of the sea surface was 20 m.
After UAV aerial photography, samples were collected immediately and the biomass values of the target species of seaweed were measured.
Ulva pertusa,
Sargassum thunbergii and
Sargassum fusiforme were the target species used in this investigation (
Figure 4).
According to the marine survey specifications [
36], 0.25 × 0.25 m (length area) quadrats were used for seaweed collection, and the longitude and latitude of the collection points were recorded. Due to the tide, a total of 57 quadrats were collected. In addition, seaweed in different seasons and different dry and wet states in the intertidal zone were measured by an ASD hyperspectral spectrometer (ASD Field Spec Handheld, Analytical Spectral Devices, Inc., Boulder, CO, USA) (
Figure 5).
During hyperspectral measurement, it is important to monitor weather factors such as wind speed, humidity, sun direction and elevation. The test was conducted at noon (about 11:00–13:00), and the relevant personnel wore dark clothes. Before spectral measurement, the lens of the instrument needs to be aligned with a white board to correct the dark current and standard white board. When the spectral reflectance curve of the standard whiteboard is a straight line, the spectral measurement can be carried out. Spectrometer optimization was performed every 10–15 min, and dark currents were collected every 5 min [
37]. After the hyperspectral determination of the three species of target seaweed, the spectral data were analyzed and the appropriate spectral combination was selected.
2.3. Analysis Method
For seaweed biomass data processing, after the collected seaweeds were placed in a dry area for 6 h, the fresh weight of each of the three seaweed species in every quadrat was measured by an electronic scale, and the biomass, longitude and latitude of Ulva pertusa, Sargassum thunbergii and Sargassum fusiforme collected at each sample point were recorded as a label set. For UAV data processing, before each take-off, we took measurements with the aircraft about 1.5 m above the correction plate to obtain the standard reflectance under the conditions at the time. Taking the orthoimagery of the study area as the reference, 30 reference points were selected at different positions of the image for the geometric fine correction of the multispectral image.
We then preprocessed the raw data (
Figure 6) to generate a 6-band image file (
Figure 6). Firstly, the registration parameters were generated according to the pixel size, focal length and flight altitude. Then, the generated registration parameters were used as camera parameters for band registration. The calibration gray board was used for radiometric calibration, and the standard reflectivity of the calibration gray board was used as the input (in this paper, bands 1–6 had values of 0.6, 0.62, 0.62, 0.62 and 0.6, respectively). After that, we performed geometric correction according to the ground control points. Finally, the image was spliced to generate an orthophoto image, which was then verified. After verification, the geometric correction error of the image was less than 0.5 pixels.
We selected the region of interest (ROI) according to the coordinates of the ground sampling points. Taking the average reflectance spectrum value of the ground objects within the ROI range as the seaweed reflectance spectrum of the sampling point, the reflectance spectrum data of various points were obtained. According to Equation (1), the spatial resolution (GSD) of the multispectral images in this survey was about 0.058 m, while the quadrat size used for field acquisition was 0.25 × 0.25 m, so a 4-pixel grid on the image was used as the extraction range of image spectral features corresponding to each sample point. In ArcGIS 10.2 (ESRI, Redlands, California, USA), the vector data of the sampling point distribution are established according to the longitude and latitude of the field sampling points. We loaded the sampling point distribution vector data and the spliced UAV remote sensing image data simultaneously into the ENVI5.1 software (Exelis VIS, USA) and established a grid with the sample point as the center, with a size of 4 × 4 and a total of 57 ROI. The average spectral features corresponding to the UAV images in the range of interest of all 57 sample points were extracted as the characteristic parameter set for the seaweed biomass inversion.
h: flight altitude; f: principal distance of lens; GSD: ground resolution; α: pixel size.
To concisely analyze the spectral data of
Ulva pertusa,
Sargassum thunbergii and
Sargassum fusiforme in different growth states or dry and wet states, principal component analysis (PCA) was used to reduce their dimensions. Information extraction is carried out for multiple variables, and fewer variables are used to replace as much information as possible in the original data, which makes the original problem easier to deal with [
38]. After performing the PCA of the spectral data of different states of each seaweed, the load of their principal components was analyzed, and the principal component parameters of high load were used to represent a certain seaweed. According to the research results of Chen [
37], the PCA shows that the load of the first principal component of the three seaweeds is greater than 90%, that is, the first principal component can be used to represent more than 90% of the spectral information of a certain seaweed (
Figure 7).
Ulva pertusa is similar to terrestrial vegetation. In the visible range, chlorophyll has two strong absorption peaks, a blue band and a red band, centered at 450 nm and 640–680 nm, respectively, and the absorption peak appears at 670 nm. A strong reflection peak appears in the green band of about 550 nm, at which time, the chlorophyll absorption coefficient is the smallest. Near 700 nm, it is also the corresponding valley band of chlorophyll absorption. Therefore, 550 nm and 700 nm accessories are often selected as the characteristic bands of anti-interference to weaken the photosynthetic effective radiation caused by non-photosynthetic substances. The 700–780 nm red-edge range is the climbing ridge of the high-reflection platform formed by the strong absorption of chlorophyll in the red-edge band and multiple scattering in the near-red band. This range is very sensitive to vegetation nutrition, growth and water [
39].
As for the method of screening spectral band combinations, Juan et al. selected spectral bands through a genetic programming (GP) framework, which has a better effect in distinguishing and classifying tropical biological communities [
40]. In addition, different spectral indices will affect each other [
41]. Meanwhile, the 719 nm water vapor absorption band is considered as a potential solar-induced chlorophyll fluorescence (SIF) inversion band [
42]. Joiner et al. successfully retrieved SIF spectra from GOME-2 through the 712–747 nm band [
43]. Although the seaweeds were exposed to air during UAV aerial photography, their surfaces are still in contact with seawater. Therefore, we tried to optimize the vegetation index with the red-edge band. Further, 20 commonly used vegetation biomass inversion spectral parameters (
Table 1) were selected to establish the inversion model and predict the biomass of seaweeds. Combined with the analysis results of the seaweed spectra, 20 spectral parameters were optimized. The specific optimization method is to replace the red band with the red-edge band to adapt to the vegetation index in the seaweed biomass assessment. Taking NDVI as an example, taking the step of a as 0.1 and the value as [0, 1], different vegetation indices are obtained. By observing the correlation coefficient between them and the biomass of each species of seaweed, the maximum value is taken. Through this analysis, it is concluded that the parameters are the best when evaluating seaweed biomass (
Table 1).
NIR: near-infrared band;
Edge: red-edge band;
R: red band;
a: optimization coefficient.
Table 1.
Calculation formulae of spectral index.
Table 1.
Calculation formulae of spectral index.
Spectral Index | Pre-Implementation Formula | Improved Formula |
---|
NDVI | | |
RVI | | |
DVI | | |
PVI | | |
EVI | | |
GNDVI | | |
RDVI | | |
SAVI | | |
OSAVI | | |
NLI | | |
GRVI | | |
GBNDVI | | |
GRNDVI | | |
BNDVI | | |
MSAVI | | |
RBNDVI | | |
Pan NDVI | | |
GCI | | |
CI red edge | | |
VARI | | |
While many new vegetation indices have a good correlation with biomass, if the approximate linear relationship between vegetation indices is ignored, the model may become unstable. Some vegetation indices are significantly hidden from biomass images, and some regression coefficient symbols are inconsistent with the actual significance [
44]. Therefore, it is necessary to calculate the variance inflation factor (VIF) of the vegetation index of the input model (
Appendix A) to analyze the multicollinearity between variables.
VIF: VIF between vegetation indices;
R: Correlation coefficient between vegetation indices.
The 20 groups of UAV spectral characteristic data extracted through the ROI range are read and sorted into separate tables with the biomass data of the seaweeds to analyze the relationship between various parameters and biomass, and to carry out inversion evaluation and analysis of the biomass. The data processing and analysis stages were programmed based on PyCharm2020.1 x64 (JetBrains, Prague, Czech Republic). The significance (
p value) between the biomass and 20 spectral parameters and three seaweeds was calculated by one-way ANOVA. The correlation coefficient between the biomass and the 20 spectral parameters was calculated by Pearson’s correlation analysis to analyze the response of each seaweed to different spectral parameters. At the same time, in order to compare the advantages of machine learning methods in the quantitative inversion of seaweed in the intertidal zone, two machine learning methods, namely, GBDT (
Figure 8) and RF (
Figure 9), were used to compare the biomass estimation results.
Additionally, the determination coefficient (
R2), root mean square error (
RMSE) and mean absolute error (
MAE) were selected as the evaluation indices, and the prediction results of the model were evaluated, as shown in Equations (4)–(6). Finally, the results were evaluated in parallel by Lin’s Concordance Correlation Coefficient (LCCC).