Genus-Level Mapping of Invasive Floating Aquatic Vegetation Using Sentinel-2 Satellite Remote Sensing

: Invasive ﬂoating aquatic vegetation negatively impacts wetland ecosystems and mapping this vegetation through space and time can aid in designing and assessing effective control strategies. Current remote sensing methods for mapping ﬂoating aquatic vegetation at the genus level relies on airborne imaging spectroscopy, resulting in temporal gaps because routine hyperspectral satellite coverage is not yet available. Here we achieved genus level and species level discrimination between two invasive aquatic vegetation species using Sentinel 2 multispectral satellite data and machine-learning classiﬁers in summer and fall. The species of concern were water hyacinth ( Eichornia crassipes ) and water primrose ( Ludwigia spp.). Our classiﬁers also identiﬁed submerged and emergent aquatic vegetation at the community level. Random forest models using Sentinel-2 data achieved an average overall accuracy of 90%, and class accuracies of 79–91% and 85–95% for water hyacinth and water primrose, respectively. To our knowledge, this is the ﬁrst study that has mapped water primrose to the genus level using satellite remote sensing. Sentinel-2 derived maps compared well to those derived from airborne imaging spectroscopy and we also identiﬁed misclassiﬁcations that can be attributed to the coarser Sentinel-2 spectral and spatial resolutions. Our results demonstrate that the intra-annual temporal gaps between airborne imaging spectroscopy observations can be supplemented with Sentinel-2 satellite data and thus, rapidly growing/expanding vegetation can be tracked in real time. Such improvements have potential management beneﬁts by improving the understanding of the phenology, spread, competitive advantages, and vulnerabilities of these aquatic plants.


Introduction
Invasive floating aquatic vegetation (FAV) species are a global concern due to their negative impacts on fragile wetland ecosystem processes such as nutrient cycling, hydrology, and energy budgets [1][2][3][4].Their introduction threatens ecosystem biodiversity and function, and often results in economic impact on fisheries, hydropower generation, and transportation services [5][6][7].Understanding invasion pathways, processes, impacts, and triggers of change is an essential step in effective wetland management and is dependent on accurate mapping of vegetation and species prevalence through time [8].
Satellite remote sensing is a favored tool for monitoring invasive vegetation due to its synoptic views and repeat coverage; however, discrimination of multiple aquatic species in areas of high spatio-temporal complexity is often difficult due to spectral similarity between species and high spectral variation within a class [9][10][11].Imaging spectroscopy has historically been the favored tool for mapping vegetation at the species or genus level because it offers high spectral resolution capable of discriminating between multiple aquatic species within the same functional type [12][13][14][15][16]. Airborne imaging spectroscopy (AIS) commonly has a high spatial resolution, which is important for mapping multiple FAV species that often co-occur within the same patch [13,17,18].However, spaceborne imaging spectrometers with systematic, repeat sampling capabilities are currently rare and commercial AIS campaigns to collect such data are expensive-often resulting in annual and intra-annual data gaps.Such gaps hinder the ability to relate invasive species distribution to environmental and anthropogenic drivers of change during the growing season [19].Newer multispectral satellites such as Sentinel-2A and B (S2) have a repeat period of 5 days at improved spectral and spatial resolutions compared to older multispectral satellites often used in wetland studies, such as Landsat or MODIS.This makes Sentinel-2 an attractive contender for mapping FAV species that are mobile and rapidly expand throughout the growing season.
The Sentinel-2 sensors provide the opportunity to discriminate between multiple FAV at the species and genus level throughout the growing season because of their high revisit frequency and high spatial resolution along with additional bands (relative to Landsat) that are intrinsically linked to vegetation traits [20].Sentinel-2 has been used to distinguish between multiple emergent aquatic vegetation species (e.g., Spartina alterniflora, Phragmites australis) but the potential to spectrally discriminate between different types of FAV has yet to be determined [21,22].Although Sentinel-2 has previously been used to map invasive FAV, such as water hyacinth, most study sites in the literature focus on relatively homogenous environments and community types, or target only one invasive species [23][24][25].These prior studies also do not demonstrate the extent of Sentinel-2's utility across a range of environments, nor do they differentiate floating vegetation at the genus or species-level.
The Sacramento-San Joaquin Delta (henceforth the Delta) is an extensively modified wetland system comprising multiple tidally influenced habitat types, including open lakelike and channelized riverine environments.The spatial heterogeneity and range of wetland environments in the Delta make it a unique setting for testing the potential of Sentinel-2 to distinguish FAV at the species or genus level, while simultaneously mapping submerged and emergent aquatic vegetation at the community level to provide holistic wetland maps that can inform and assess management actions.Currently, aquatic vegetation in the Delta is mapped to the species, genus, and community level using annual AIS acquisitions [14,26].However, the imagery is costly and thus only consists of one annual snapshot per year, which does not allow for intra-annual tracking of aquatic vegetation composition and coverage.At times, lack of funding has resulted in annual data gaps as well, further increasing the temporal gaps.
Here, we demonstrate for the first time that separation between invasive FAV at the genus and species level is possible using Sentinel-2.We compared our Sentinel-2 maps to those derived from AIS; the current near-operational method employed for aquatic mapping in the Delta.Our classifiers distinguish between invasive FAV water hyacinth (Pontederia crassipes; formerly Eichornia crassipes) and water primrose (Ludwigia spp.) and identify submerged and emergent aquatic vegetation at the community level for both summer and fall imagery, demonstrating that the intra-annual temporal gap can be closed, and rapidly expanding vegetation can be tracked throughout the growing season.Such improvements have multiple possible management benefits.For example, the State of California spends millions of dollars each year to control IAV in the Delta and costs are likely to continue to rise [27].Gaining a better understanding of invasive FAV distribution and spread within the year can lead to more informed and effective control strategies.

Experimental Design
Two floating aquatic macrophytes, water hyacinth (Pontederia crassipes) and water primrose (Ludwigia spp.), were targeted in this study.Water primrose, although nominally rooted, develops adventitious roots that can draw nutrients directly from the water.This allows water primrose to form floating canopies that extend several meters into the channel from the shore [28].In contrast, water hyacinth is truly free-floating aquatic macrophyte-often anchoring itself to nearby emergent vegetation [29].Both are among some of the most invasive aquatic macrophytes globally and are frequently referred to as 'ecosystem engineers' because of their ability to alter physical, biological, and chemical processes to their benefit [7,30,31].For these reasons, we focused our extension of mapping capabilities on these two FAV aquatic macrophytes.Overall, our classification scheme comprises 8 classes-water hyacinth, water primrose, emergent aquatic vegetation (EAV), submerged aquatic vegetation (SAV), riparian vegetation, open water, non-photosynthetic vegetation (NPV), and soil (Figure 1 and Table 1).Figure 1 depicts the difference in spectral response between Sentinel-2 and AIS datasets for the target vegetation classes, further descriptions of the AIS instruments used here are in Section 2.3.

Riparian
For example: Willow species (Salix spp.), Oak species (Quercus spp.), and Cottonwood (Populus spp.)This study was conducted for the Delta, which is the hub of California's water system spanning approximately 2220 km 2 in Northern and Central California.It is a heavily engineered system comprising a diverse network of channels and freshwater tidal marshes at the confluence of the Sacramento and the San Joaquin rivers (Figure 2) The Delta is also recognized as one of the most invaded estuaries in the world [33].In the past few decades, water primrose and water hyacinth have negatively impacted water quality, water pumping, and native species, and there are ongoing efforts to control persistence and spread [27,34,35].As part of the control and monitoring programs, AIS efforts to map these invasive floating species have been led by the Center of Spatial Technologies and Remote Sensing (CSTARS) at UC Davis from 2004-2008 and 2014-2021 [13,14,26,36,37].An imaging spectrometer is flown on a low-altitude aircraft collecting high spatial resolution (~1.7-3 m pixels) and high spectral quality data.These data are used to prepare maps for California State Agencies and for assessing annual invasive species spread and community dynamics.
To evaluate the potential of Sentinel-2 for filling intra-annual data gaps, we compared the performance of a machine learning classifier using Sentinel-2 data to the current stateof-the-art approach for FAV mapping in the Delta derived from AIS [37].To compare the two products, we identified the closest Sentinel-2 overpass to AIS acquisition with low cloud cover, we refer to these date pairs as "match up dates" that included fall 2018, fall 2019, and summer 2020 (Table 2).We labeled our mapping efforts FAV genus-level because multiple Ludwigia species are present in the Delta, but the exact number is unknown; they are difficult to visually distinguish in the field let alone from space.While we mapped the entire Delta and made Delta-wide comparisons, for visualization and demonstration, map results are presented for four smaller sub-regions representative of the range of different habitat types in the Delta (Figure 2).Big Break is a brackish, shallow flooded island located downstream of the confluence of the Sacramento and San Joaquin Rivers.Liberty Island, an abandoned, flooded island, is a freshwater site influenced by the Sacramento River.Rhode Island and Ward Cut are islands along the San Joaquin River, that include a diverse mix of upland riparian and freshwater marsh habitats with large areas of deep open water bound by levees.

Acquisition and Pre-Processing of Sentinel-2 Imagery
Level-1C (L1C) top of atmosphere (TOA) reflectance products for Sentinel-2 tiles capturing the legal Delta boundary (10SFH, 10SFG and 10SEH) were downloaded from the Copernicus Open Access Hub for each of the three match-up dates listed in Table 2. Products were atmospherically corrected to Level-2A (L2A) bottom of atmosphere reflectance (BOA) using Sen2Cor version 2A and 2B to collect information in 13 spectral bands at 10, 20, and 60 m resolutions (Table 3).The Sen2r 1.4.0R package was used to resample 20-60 m bands to a 10 m spatial resolution, mosaic tiles, and convert products from their native SAFE file format to ENVI files [38].

Imaging Spectroscopy Acquisitions and Classification Process
In fall 2018 and 2019, HyMap imaging spectroscopy data were collected over the Delta at a 1.7 × 1.7 m ground resolution in 126 bands (400-2500 nm, bandwidth ~15 nm) with a 20% overlap in flightlines by HyVista Corporation (Sydney, Australia).HyVista performed atmospheric calibration and delivered the data to CSTARS at UC Davis with geo-location files (GLT) for further processing.In July of 2020, SpecTIR LLC (Reno, NV, USA), flew their Fenix 1K hyperspectral imager over the Delta.The Fenix sensor measures 323 spectral bands across the visible to shortwave infrared spectrum (397-2450 nm) at a nominal spatial resolution of 2 × 2 m.SpecTIR also provided data to CSTARS after atmospheric calibration and included geographic lookup table files to associate each pixel with a geographic location.All airborne spectroscopy campaigns were conducted during low tide and flown to minimize sun glint.Locations of aquatic vegetation species, riparian vegetation, and water were collected within a two-week window of image acquisition (~800-2000 points per campaign) using handheld high precision (sub-meter accuracy) GeoXT and GeoXH Trimble DGPS units (Trimble Navigation Limited, Sunnyvale, CA, USA) with Wide Area Augmentation System (WAAS) differential correction.Each location marks a dominant species, along with information on patch size, percent cover of each species present, and vegetation condition.Patch width and length are labeled as small (3-5 m), medium (5-10 m), large (10-15 m), and extra-large (>15 m).Classification training and validation polygons are then created through photointerpretation using field collected points and ground reference photos as a reference, in ArcMap (ArcGIS 10, Redlands, CA, USA) [37].
CSTARS classifies the spectroscopy data using a random forest algorithm trained and validated with the field collected reference data.A separate random forest model is developed for each campaign.Input data include a suite of spectral indexes, spectral angle mapper rule images, linear spectral unmixing fractional cover, and continuum removal of absorption features [39].The polygons are randomly split into 50/50 training and validation data.Class accuracies were evaluated using confusion matrices which report an overall accuracy, user's and producer's accuracies, and the Kappa statistic [40,41].The overall accuracy measures the total number of correct class predictions relative to the total number of predictions.Producer's accuracy is comparable to a true positive rate and measures how well a predicted class matches labeled reference data.User's accuracy is comparable to a positive predictive value and measures how often predications are misclassified.The Kappa statistic (K) also indicates the level of overall agreement between the field data and the classification map, but it accounts for the probability of random agreement between the two datasets [40].Kappa values range from 0 to 1 with values greater than 0.5 indicating good agreement and values greater than 0.8 indicating exceptional agreement between the classification map and the validation dataset.
We used these previously generated maps, associated accuracy assessments, and training and validation polygons to conduct a visual and percent coverage comparison between AIS and Sentinel-2 aquatic vegetation maps.AIS maps range from nine to sixteen classes depending on the year and we adjusted the classes to match ours.Tule/cattail and phragmites were grouped together into EAV.Arundo donax (giant cane) was grouped with riparian vegetation because it is generally found on higher ground and levees adjoining channels.Floating vegetation such as duckweed and pennywort were excluded from comparisons because they occupy less than 0.5% of mapped waterways.Non-photosynthetic floating vegetation was grouped with NPV.The summer 2020 classification map identifies areas of water primrose encroachment into emergent marsh as a separate class "emergent primrose" [37], which we did not re-assign as either water primrose or EAV.

Sentinel-2 Image Classification 2.4.1. Training and Validation Data
Training and validation pixels were primarily selected from the same GPS points collected for AIS classification.Point locations of field observations were adjusted to match the courser resolution of Sentinel-2 by eliminating points in patches that were heavily mixed or too small to be detected at a 10 m spatial resolution.Since the boats used for data collection often record GPS locations at the edge of a patch due to potential propeller entanglement, observation points labeled extra-large or large (>10-15 m) were spatially nudged to the nearest vegetation or water patch center visible in Sentinel-2 imagery.Water, soil, and NPV locations were randomly sampled from training and validation polygons created for the AIS classifications.Additional points were added via photo interpretation from careful examination of Google Earth Imagery and 2017 LiDAR digital surface models (https://data.cnra.ca.gov/dataset/delta-lidar-2017 (accessed on 3 March 2020)).A separate adjustment was made for each of the three match-up images listed in Table 2, which resulted in a total of ~2400 points available for selection during the classification experiments.Each class ranged from 80-120 points per date depending on class and not all points were marked as suitable for each match-up date.The water hyacinth class had the least number of points across all dates because it occurs in small patches and occupies the least area of our target classes during the period of study.

Sentinel-2 Random Forest Model Selection
Following the AIS classifications process, we also used random forest to create our classifications.Random forest (RF) models are commonly used for remote sensing image classification across sites and sensors [24,[42][43][44].Such models have been successful for aquatic vegetation mapping in the Delta using AIS data collected from both UAV and manned aircraft [17,39].Random forest is an automated algorithm that builds hundreds of classification tree models and then selects the most frequent solution [45].Although RF models often result in improved classification accuracies over other traditional image classification methods, they are sensitive to training data selection [46].To account for this issue and select the optimal RF model, different sets of labeled training and testing data were examined using a bootstrapped approach.For each match-up date, 100 RF models were built by randomly dividing points into 50% training and 50% independent test data.A 50/50 split was selected to mimic the AIS mapping procedure.The 'best' performing RF model was selected based by first selecting ranking the models on overall accuracy, and then selecting a high overall accuracy model that also had high class-specific accuracies of target FAV classes.The 'best' performing model was then applied to the Sentinel-2 image.We chose to construct a separate model for each match-up date to align our classification process with the AIS classification process, ensuring a fair comparison.
The models were constructed and evaluated using the caret [47] and randomForest [46] R packages.The model was built with default number of trees (ntrees = 500) (R Foundation for Statistical Computing, Vienna, Austria).The random forest model inputs included Sentinel-2 reflectance bands (Table 2) and nine spectral indices (Table 4).We selected these indices due to their correlation with plant biophysical properties and previously documented success in other aquatic vegetation classification models [13,17,25,39].Similar to the AIS process, overall accuracy, user's and producer's accuracy, and a K statistic were calculated for each date.The K was calculated to compare the two datasets but was not used as a metric for best model selection due to its similar functionality to overall accuracy (Foody, 2020).

Model Accuracies
The Sentinel-2 and AIS performance by match-up date is summarized in Table 5. Sentinel-2 FAV genus-level RF models yielded overall accuracies of 87-90%.Water primrose had the highest producer's accuracy from 91-96%, while water hyacinth had the second highest user's accuracy ranging from 85-94%.Producer's and user's accuracies for SAV and water were above 90% for fall 2019 and summer 2020 and were slightly lower for fall 2018, the Sentinel-2 date with the highest tide (Table 2).Riparian vegetation had the lowest producer's accuracy followed by water hyacinth.Riparian and emergent vegetation had the lowest user's accuracies (74-85%).Although not listed in the tables below, Soil and NPV user's and producer's accuracies were between 93-100%.These classes were mostly confused with each other or NPV was confused with vegetation pixels labeled as partially dry in the GPS field data.The primary source of confusion was between class pairs with similar spectral signatures: water primrose and water hyacinth, emergent and riparian vegetation, and water and SAV.Qualitative examination of imagery revealed that misclassification commonly occurred at class boundaries and in smaller vegetation patches where pixels are more likely to be mixed.Sentinel-2's overall accuracy results (87-90%) were just slightly below the overall accuracy reported for AIS maps (90-91%) (Table 5).Sentinel-2 producer's accuracies for water primrose 91-96% were comparable with AIS producer's accuracies of 91-94%, while user's accuracies were lower 80-92% (Sentinel-2) vs. 89-95% (AIS).Sentinel-2 water hyacinth had higher user's accuracy than AIS in Fall 2018 and summer 2020.The EAV class had relatively lower producer's and user's accuracies for both Sentinel-2 and AIS data at 77-68% and 61-88%, respectively.EAV has the lowest class accuracies for AIS, indicating that in some locations this class is difficult to detect even with high spatial and spectral resolution data.The riparian class had the greatest differences in accuracy between the two sets of imagery and Sentinel-2 was always lower.SAV and water class accuracies were comparable between the two datasets.Sentinel-2 had higher SAV and water accuracy in fall 2019, potentially because some AIS flightlines were acquired at higher tidal stages than Sentinel-2 (Table 2).

Sentinel-2 Forest Variable Importance
Figure 3 shows the random forest variable importance for Sentinel-2 models: variables are ordered by the mean decrease accuracy (MDA) considering all models.MDA and mean decrease gini (MDG) are common measures of variable importance for random forest models.Inputs with high MDA or MDG indicate high importance to the model; additional information on their computation and use can be found in [57,58].Spectral indexes were consistently ranked with high importance, along with the green spectral reflectance band followed by SWIR and NIR spectral reflectance bands (Figure 3).High importance of spectral indexes has been previously identified in other aquatic vegetation classification studies targeting floating vegetation [17,25].The NIR band contributes to the separation of different FAV covers and emergent vegetation while the SWIR bands are critical to differentiating emergent and FAV from water [14].Although ranked lower than most other spectral bands in our models, the red edge bands had reasonably high mean average decrease values indicating they were valuable to model performance.The two red edge indexes, NDVIre2 and NDVIre3, were the least important in all models.

Visual Comparison in Two Sites
Classification results between the two sensors were visually similar for all three dates throughout much of the Delta, here we show this by comparing 2019 Fall Sentinel-2 and AIS maps of Ward Cut and Rhode Island in Figure 4. We chose to display these two sites because they contained all study classes and exhibited a range of different vegetation patch sizes and shapes.Generally good agreement between maps was observed at the FAV genus level between locations of larger water primrose (yellow) and water hyacinth patches (purple); however, there were some thinner water hyacinth patches present in AIS maps that were undetected at the coarser Sentinel-2 resolution (Figure 4a).The largest differences between the two maps resulted from EAV and riparian confusion and the detected SAV coverage relative to tidal stage during image acquisition (Table 2).

Percent Coverage Comparison
To characterize over-or underestimation of classes in Sentinel-2 maps relative to AIS maps, we calculated the percent area each class occupied in both maps and subtracted AIS percent area from that of Sentinel-2 in four key study sites and the Delta as a whole (Table 6).Thus, negative values in Table 6 indicate Sentinel-2 underestimation, while positive values indicate Sentinel-2 overestimation.Actual percent coverages are reported in Appendix A. We compared percent coverage rather than individual pixel agreement because the AIS maps have their own classification errors and reports to state agencies characterizing AIS mapping results primarily focus on percent coverage comparison across years.Similar to class accuracy, we only compared the five vegetation classes and open water.Water hyacinth percent coverage was overestimated compared to AIS for all comparisons except Ward Cut in fall 2018 and 2019 and Delta wide in fall 2019; however, the difference for all these sites was quite small (0.1-0.2%).The largest differences in water hyacinth percent coverage were in Rhode Island (0.4-1.8%) and across the Delta in summer 2020 (0.4%).Rhode Island fall 2019 maps are displayed in Figure 4b and show good matchup between medium and large patch locations and extent, despite having the largest percent coverage difference compared to AIS.Orange non-photosynthetic vegetation (NPV) patches on the edge of islands in AIS maps are patches of floating NPV which cannot be detected by Sentinel-2 spatial or spectral resolution.Differences in SAV detection for both areas are likely due to image quality or differences in water height during acquisition, the latter is further discussed in Section 4.3.
Water primrose was underestimated relative to AIS in all locations except Big Break in fall 2019 (no difference), and Big Break and Liberty Island in summer 2020.Water primrose percent differences in summer 2020 were impacted by the additional emergent primrose class in AIS maps which was included to characterize areas of water primrose encroachment on emergent habitat [39].Potential explanations for difference in water hyacinth and water primrose percent coverage are provided in the discussion.
Emergent and riparian vegetation had relatively high percent coverage differences between datasets and exhibited a less distinct pattern of over-or underestimation across years unlike the two FAV classes, although they appeared to be inversely related, likely due to their class confusion.The largest difference between maps was Sentinel-2 SAV underestimation and consequently, water overestimation in fall 2018 for Big Break (Table 6), which may be due to tidal height (Table 2).

Discussion
RS classification of multiple FAV classes is sparse in the published literature, especially in complex aquatic systems such as the Delta.Most previous works target only a single floating invasive species or classify at the community level and are conducted in areas with low spatial variability and large contiguous patches making it easier to separate FAV species from other neighboring aquatic vegetation [59][60][61].Previous studies in the Delta rely on airborne imaging spectroscopy (AIS) to separate different FAV covers [14,17,39].In order to determine if Sentinel-2 could be used to fill in AIS temporal gaps with acceptable detection of FAV at the genus level, we compared classification model accuracies (Table 5) and maps between the two datasets visually (e.g., Figure 4) and by difference percent class coverage (Table 6) and discuss dependence on patch size and within class variability relative to sensor spatial and spectral resolution and the influence of external factors such as tidal stage.We further compared differences for both summer and fall imagery to determine if Sentinel-2 maps could be created for multiple dates throughout the growing season, thereby enabling tracking of patch expansion and movement of these motile aquatic species.

Model Accuracies
Following a similar procedure to the near-operational AIS mapping in the Delta, we were able to achieve similar overall accuracies (OA) of 89-91% with Sentinel-2 compared to 90-92% of AIS while mapping the same classes-water hyacinth (species level), water primrose (genus level), SAV, EAV, and riparian vegetation at the community level, and open water, soil, and non-photosynthetic vegetation.Our overall Sentinel-2 accuracies were also comparable to other multispectral RS studies which target 4 macrophyte classes and open water 90% [61,62] (although neither target multiple FAV classes).
The average Sentinel-2 class accuracy statistics for our target FAV classes-water hyacinth (PA: 79%, UA: 91%) and water primrose (PA: 95%, UA: 85%) were lower or comparable with AIS water hyacinth (PA: 89%, UA: 90%) and water primrose (PA: 93%, UA: 93%).Our performance was also better than, or in line with, other Sentinel-2 studies targeting water hyacinth, which achieved user's accuracies of 75-89% and producer's accuracies 61-94% using random forest models [25,60].To our knowledge, this is the first study to demonstrate water primrose mapping using multispectral satellite remote sensing; therefore, we are unable to compare class specific accuracies to other sites, but generally our accuracies were quite high.
The spatial resolution of the data relative to target patch size influenced the detection capability of the classifiers, and subsequent differences in percent coverage calculated from Sentinel 2. Generally, larger pixel sizes lead to poorer accuracy statistics, though the relationship is not linear; the effect of pixel size on accuracy is dependent on the heterogeneity of the landscape and the design and geolocational precision and accuracy of field data [63,64].Spatial resampling Sentinel-2 bands of varying spatial resolution to 10 m may have further impacted the ability to detect small scale features.Nearest neighbor resampling is recommended to be both computationally efficient while best preserving pixel spectral information [65] and has been shown to be suitable for Sentinel-2 land cover classification [66].More specialized downscaling algorithms could be explored in future research, although the effects of downscaling on classification accuracy are highly variable [67,68].The pixel size influence on accuracy was minimized in this study through the field data collection and curation process, which paid special attention to patch size relative to the spatial resolution of the two RS datasets.Unfortunately, this same data design process may have possibly introduced lower quality training data into the models resulting in confusion between water hyacinth and water primrose and EAV and riparian vegetation.AIS observations indicate that water hyacinth patches are commonly interspersed with or surrounded by water primrose; therefore, patches labeled as majority water hyacinth in the field data may contain enough water primrose coverage to influence the spectral signature at the Sentinel-2 pixel size.EAV commonly forms narrow patches which are difficult to photo-interpret and observe in the field due to logistical challenges associated with access and plant density and height obscuring field observations.This poses an issue particularly for Sentinel-2 discrimination because at coarser spatial resolutions it becomes even more difficult to separate emergent vegetation from the riparian vegetation located directly behind it.It is recommended that future field data collection focus on building training datasets of large and heterogenous aquatic vegetation patches and better characterization of fractional cover and patch sizes relative to Sentinel-2 pixel size.

Percent Coverage of FAV Classes
Our Sentinel-2 maps matched well with AIS maps visually (Figure 4), but our areal comparison analysis indicated that S2 classifiers generally overestimated water hyacinth and underestimated water primrose area relative to AIS (Table 6).Infrequent classes with small patches, such as water hyacinth, are actually expected to be underestimated since small patches are likely to be undetected at coarser spatial resolutions-this was exhibited for certain patches, for example, the long and thin patch in Ward Cut was missed by S2 (Figure 4).However, under-detection of small and narrow water hyacinth patches was outweighed in percent coverage estimates because of S2 pixel size, confusion with riparian vegetation due to similar spectral signatures in multispectral space (Figure 5), and 'sporadic' misclassification likely due to other mixed pixels or within class spectral variability not accounted for in the training dataset.This confusion could be reduced with higher spectral resolution data, as it enables spectral unmixing of spectrally complex and similar classes [13,18].However, even previous AIS studies in the Delta found some confusion between these classes [18].These studies reduced misidentification of riparian vegetation as water hyacinth by creating a riparian mask based on spatial information regarding wetland and channel configuration derived from ancillary GIS layers and LiDAR data, which we recommend for future studies [18].Water primrose coverage was likely underestimated due to Sentinel-2 mixed pixels at class boundaries exemplified in Figure 6, which shows patch edges that are more likely to be mixed with water or SAV classified as emergent or riparian vegetation in Sentinel-2 imagery.Such effects were more pronounced for water primrose patches than water hyacinth because water primrose shares a larger interface with other classes due to its dominance in the system.In recent years, water primrose has rapidly increased in abundance and distribution in the Delta, and several regions across the Delta have experienced marsh encroachment of water primrose, where water primrose has been "terrestrializing" and creeping up on top of emergent reeds [39].The AIS summer 2020 maps labeled this as a separate class ("emergent primrose"), but Sentinel-2 does not have the spectral or spatial resolution required to do so.Instead, Sentinel-2 identifies the dominant class representing whichever vegetation is on top of the heterogeneous 3D structure, thereby resulting in percent coverage differences that are not directly related to misclassification.

Impact of Environmental Conditions: Tidal Stage
Other large mismatches in visual and percent area assessments occurred for the SAV and water classes; these differences are likely explained by differences in tidal stage during image acquisition rather than Sentinel-2 misclassification.Figure 7 explores the impact of tide-stage mismatch between datasets more extensively by comparing a good tidal matchup date (fall 2019) with one of lesser quality (fall 2018).Fall 2018 Sentinel-2 acquisition occurred during a flood tide, which impedes detection of full SAV extent.Fall 2019 Sentinel-2 acquisition occurred at a low tide and showed a much closer match-up between Sentinel-2 and AIS maps.This indicates that SAV is detectable with Sentinel-2, but images should be selected at low tide to reduce the impacts of water height and increased turbidity, which inhibit detection of the full extent for SAV coverage.Such an approach would mirror the current operations for AIS-based SAV mapping, which only acquires data during periods of low tide and during times of the day that avoid effects from sun glint [13,39].Tidal stage did not strongly impact FAV mapping because the leaves are above the water column hence spectral reflectance is not affected by tidal stage.Both water hyacinth and water primrose exhibited a good visual match and smaller quantitative differences, as listed in Table 6.

Sentinel-2 Characteristics That Enable Differentiation between FAV Classes and the Neighboring Vegetation
The detection of water hyacinth and water primrose in this spatially complex system of wetlands was possible due to the 10-20 m pixel size of Sentinel-2 images and the moderate spectral resolution.While some previous multi-spectral RS studies targeting water hyacinth detection have successfully used Landsat (30 m pixels; 7 bands) [24], these studies occurred in simpler systems with fewer target species.Thamaga and Dube (2018b) compared the performance of Landsat and Sentinel-2 and determined that Sentinel-2 resulted in higher class accuracies.In an environment as spatially complex as the Delta, the 10 m spatial resolution of Sentinel-2 supports water primrose and water hyacinth detection because these vegetation patches are generally small (~40-60 m) wide with few large patches (90-100 m).Large patches would be contained within a single 30 m pixel, but smaller patches would not.
Spectral resolution and range also play important roles in FAV separation.Although the three Sentinel-2 red-edge bands were not ranked as high as several spectral indexes or spectral bands in Sentinel-2 RF model variable importance, the mean decrease accuracy values (~20%) suggest that the red edge bands still played an important role in class discrimination.The importance of the red edge bands to aquatic vegetation classification has been demonstrated in other studies [25,69].These works, like ours, determined that spectral indexes and other VSWIR spectral bands are more important to classification accuracy than red edge bands, but this does not negate the usefulness of these additional Sentinel-2 bands.However, red edge indices were consistently among the least important variables, their importance has varied in other aquatic vegetation classification studies [69,70].Additional testing is recommended to determine if these red edge indexes could be excluded in future models.Spectral range is also an important consideration.The SWIR region, and spectral indexes calculated with SWIR bands were important for RF model performance.Previous works have also documented the importance in the SWIR region for discriminating aquatic vegetation from water [18,71].

Conclusions
Analysis of Sentinel-2 classifications maps demonstrated that two types of invasive floating aquatic vegetation, water hyacinth and water primrose, can be distinguished in heterogeneous wetlands during summer and fall.Our results indicated that Sentinel-2 imagery can supplement AIS mapping efforts by filling temporal gaps and enabling studying of annual and intra-annual changes in FAV community composition as a response to environmental or anthropogenic disturbance events.Extensive time series will, in particular, enhance the understanding of invasion processes.Future research should investigate differences in spring and winter and incorporate a riparian vegetation mask to reduce misclassification of floating vegetation.Further, the near-operational pipeline developed during this study will lead to an operational mapping pipeline for generating wetland vegetation maps for California agencies that enable new methods of management and monitoring to identify where aquatic invasive species control is effective.

Figure 1 .
Figure 1.(A) Field photos of vegetation classes and list of species in each vegetation community type.(B) Mean spectral signatures for 2019 Sentinel-2 training data; shading represents 95% confidence interval.(C) Mean spectral signatures for 2019 AIS training data; shading represents 95% confidence interval.Data come from the HyMap sensor described in Section 2.3.See [32] for more in-depth descriptions of targeted invasive aquatic vegetation.

Figure 2 .
Figure 2. Map of the Delta sub-study areas.RGB images were created using imaging spectroscopy acquisitions.

Figure 3 .
Figure 3. Variable importance for Sentinel-2 random forest models for each match-up date, (a) Mean Decrease Accuracy and (b) Mean Decrease Gini.Variables are ordered by average decrease across all three models.Spectral indices consistently rank high in importance across all three model dates.

Figure 4 .
Figure 4. (a) Ward Cut and (b) Rhode Island comparison between 2019 Sentinel-2 and imaging spectrometer maps.The comparison shows good agreement at the FAV genus level between locations of larger water primrose (yellow) and water hyacinth patches (purple).

Figure 5 .
Figure 5. Spectral response of a riparian vegetation pixel that was misclassified as water hyacinth and a water hyacinth pixel.

Figure 6 .
Figure 6.Comparison of the impact of limited spatial resolution in Sentinel-2 data using water primrose patches.Coarse pixel resolution is more likely to cause misclassification at class boundaries due to pixel mixing.

Figure 7 .
Figure 7. Tidal stage comparison for fall match-ups at Big Break: (a) fall 2018 and (b) fall 2019.Fall 2018 Sentinel-2 acquisition occurred during a flood tide which impedes detection of full SAV extent.Fall 2019 Sentinel-2 acquisition occurred at a low tide and showed a much closer match-up between Sentinel-2 and imaging spectrometer maps.

.
Target classes and their descriptions.

Table 2 .
Match-up dates and tidal ranges 1 for corresponding imaging spectrometer acquisitions and Sentinel-2 over passes.

Table 3 .
Sentinel-2 band characteristics.Asterix indicates band was not used during classification.

Table 4 .
Vegetation and water indexes used in the Sentinel-2 classification process.

Table 5 .
Sentinel-2 and airborne imaging spectrometer classification performance by match-up date.Overall accuracy = OA, Producer's accuracy = PA, User's Accuracy = UA.SAV = submerged aquatic vegetation.Soil and non-photosynthetic vegetation (NPV) accuracies are not listed but were 93-100% and primarily confused with each other.

Table 6 .
Difference in percent coverage between Sentinel-2 and airborne imaging spectrometer (AIS) classification maps for 4 sub study areas and Delta wide.AIS percent coverage was subtracted from that of Sentinel-2; therefore, negative values indicate Sentinel-2 underestimation (blue), while positive values indicate Sentinel-2 overestimation (yellow) relative to AIS maps.
* AIS emergent primrose class was not re-grouped as either primrose or emergent in Sentinel-2 maps.