Evaluation of Five Atmospheric Correction Algorithms over French Optically-Complex Waters for the Sentinel-3 A OLCI Ocean Color Sensor

The Sentinel-3A satellite was launched on 16 February 2016 with the Ocean and Land Colour Instrument (OLCI-A) on-board for the study of ocean color. The accuracy of ocean color parameters depends on the atmospheric correction algorithm (AC). This processing consists of removing the contribution of the atmosphere from the total measured signal by the remote sensor at the top of the atmosphere. Five ACs: the baseline AC, the Case 2 regional coast color neural network AC, its alternative version, the Polymer AC, and the standard NASA AC, are inter-compared over two bio-optical contrasted French coastal waters. The retrieved water-leaving reflectances are compared with in situ ocean color radiometric measurements collected using an ASD FielSpec4 spectrometer. Statistical and spectral analysis were performed to assess the best-performing AC through individual (relative error (RE) at 412 nm ranging between 23.43 and 57.31%; root mean squared error (RMSE) at 412 nm ranging between 0.0077 and 0.0188) and common (RE(412 nm) = 24.15–50.07%; RMSE(412 nm) = 0.0081–0.0132) match-ups. The results suggest that the most efficient schemes are the alternative version of the Case 2 regional coast color neural network AC with RE(412 nm) = 33.52% and RMSE(412 nm) = 0.0101 for the individual and Polymer with RE(412 nm) = 24.15% and RMSE(412 nm) = 0.0081 for the common ACs match-ups. Sensitivity studies were performed to assess the limitations of the AC, and the errors of retrievals showed no trends when compared to the turbidity and CDOM.


Introduction
Although they represent only 7% of the total ocean surface, coastal and inland water zones produce up to 40% of the marine and freshwater biomass inventoried today and 85% of the marine and freshwater resources exploited by humans.Moreover, 60% of the world's population lives less than 100 km from the coast, whilst inland waters provide key ecosystem services with direct linkages to human health [1].Therefore, it is vital to study these waters in a systematic way and with a long-term perspective to characterize the variability of bio-optical and biogeochemical properties and to understand their impacts on the water quality [2,3].The only means to get a synoptic view of these zones is to use Earth observations, especially ocean color [4][5][6], which has many applications such as the estimation of biogeochemical parameters or the inherent optical properties (IOPs) of seawater, societal benefit outcomes, fisheries and aquaculture management, ecological provinces partition, phytoplankton functional types retrieval, and water quality monitoring [7][8][9][10][11][12].
Ocean color images have been available since 1978 with the Coastal Zone Color Scanner (CZCS) proof-of-concept mission and continuously since 1997.The European Space Agency (ESA) has developed an ambitious program called "Copernicus" whose aim is to provide continuous observations of the ocean and land for the next twenty years through a series of Sentinel satellites.Among those Sentinel satellites are the Sentinel-3A and Sentinel-3B satellites [13], which were successfully launched on 16 February 2016 and 25 April 2018, respectively.On-board, two sensors are of interest for the monitoring of natural water bio-optical properties and surface temperature [13]: the Ocean and Land Colour Instrument (OLCI-A thereafter) [14] and the Sea and Land Surface Temperature Radiometer (SLSTR) [15].
OLCI is the successor of the Medium Resolution Spectrometer (MERIS).OLCI observes the Earth with a swath width of 1200 km and a spatial resolution of 300 m.The revisit time over sea is 3.8 days with one instrument and 1.9 days with two instruments [14].OLCI has 21 spectral bands [13] compared to 15 for MERIS.These six additional bands are centered at 400 and 674 nm (water constituents retrieval improvement), 761, 764, and 768 nm (O 2 gas absorption correction improvements), and 1020 nm (atmospheric correction improvement).OLCI cameras are tilted to mitigate the sun-glint contamination [14].The OLCI and SLSTR fields of view are co-located, allowing potentially the improvement of atmospheric correction over optically-complex waters [16].
The ocean color remote sensing requires efficient atmospheric correction [17] to remove the contribution of the atmosphere from the total signal measured by the sensor [18].The aim is to only get the light back-scattered by the seawater, the so-called water-leaving reflectance [19].The difficulty of the atmospheric correction is that the atmosphere contributes to 80-90% of the total top of atmosphere signal at the blue-green wavelengths (400-550 nm).Furthermore, the atmospheric path signal significantly varies and cannot be easily approximated [17].Over open ocean waters, the black pixel assumption, i.e., ocean being totally absorbent, is valid in the Near-InfraRed (NIR) [19].Over turbid waters, this assumption is not valid any longer, and this makes atmospheric correction a difficult task for this bio-optical type of water [20,21].
From the five above-cited atmospheric correction approach categories, two atmospheric correction algorithms (ACs) from the fourth AC category and three ACs from the fifth category were selected to evaluate the Sentinel-3A OLCI ocean color retrievals over two contrasted French optically-complex coastal waters: the Eastern English Channel and French Guiana.The validation of these five selected atmospheric correction algorithms provides an assessment of their accuracy and information about the most accurate algorithm over these coastal waters.The validation work exposed here is the first done over French coastal waters for the OLCI sensor.Actually, a similar work was published for OLCI AC development and validation, but over inland waters and not coastal waters [52].
The background about atmospheric correction, the description of the five considered atmospheric correction algorithms, a full description of the study areas, in situ and satellite datasets, and the match-up procedure are provided in Section 2. In the next Section 3, the results of the match-up exercises are presented and discussed in Section 4 with the sensitivity of the results to different parameters where field Ocean Color Radiometry (OCR), match-up number impacts, AC performance impacts, and AC concomitant differences are highlighted.Finally, suggestions and recommendations are given in the conclusion.
BlAC is based on the black pixel assumption in the NIR with the clear water atmospheric correction algorithm [70] taking into account the multiple scattering by air molecules and aerosols [74] and the bright pixel atmospheric correction algorithm [33], which corrects for the contribution of the sediments to ρ w in the NIR.The switch between both algorithms is carried out according to the turbid water flag [33].The BlAC includes the correction of blue absorbing aerosols [75].
C2R-CC is based on artificial neural networks (ANN) [40,76]).The ANN are trained with large datasets of simulated ρ A (λ) and ρ w (λ).Then, the trained neural nets perform the inversion of remotely-sensed ρ w (λ) from the remotely-sensed ρ toa (λ).C2R-CCAltNets is similar to C2R-CC [40,76].Both use ANN.Here, the bio-optical models were revised for the simulation of ρ w (λ), and the ANN training procedure was improved with a new exponent for the detritus absorption coefficient (a d ) for an extended range of neural net training with more coverage of especially high backscatter waters and a larger number of training samples [76].
Polymer is a spectral optimization algorithm, specifically designed for high sun-glint conditions.The atmospheric contribution including the sun-glint is modeled through a polynomial function, and the marine contribution is based on a bio-optical model using the full spectral range [51].
NASA is a combination of the standard NASA Case-I atmospheric correction algorithm based on the black pixel assumption in the NIR [19] and an iterative procedure using a bio-optical model in the NIR to remove the contribution of the ocean in the NIR [37].

Study Area
The validation of OLCI-A ρ w retrievals is performed using field radiometric measurements collected in two contrasted coastal waters [78]: the Eastern English Channel and French Guiana coastal waters.
Figure 1 shows the location of the stations.For the measurements in the Eastern English Channel (left box of Figure 1), most of the stations are located in the northeast of the English Channel, except two stations in Somme Bay and four other stations in Seine Bay.For measurements in French Guiana (right box of Figure 1), the stations are mostly located in the east part of the coastal waters, near the capital Cayenne, except six located in the Mahury River.As shown in Table 1, most of the field campaigns were conducted in French Guiana, leading to almost twice more stations and potential match-up numbers than in the Eastern English Channel.However 11 stations were removed in the absence of OLCI-A images.The potential match-ups were made with no additional selecting criteria than the availability of OLCI data.The Eastern English Channel region has a temperate climate and moderate salinity influenced by the Seine and Escault rivers.This area is influenced by strong tide ranges, river inputs, and low bathymetry, leading to re-suspension of suspended materials, water masses mixing, and relatively intense spring blooms of phytoplankton.These factors make the Eastern English Channel a moderate turbid waters area [79].
The French Guiana region is subject to a tropical climate and to the dominant influence of the Amazon River plus other local rivers such as the Mahury River.This region is influenced by the south-equatorial current.The coastal waters are influenced by a strong fresh water input, very low bathymetry, strong tidal ranges, water masses mixing, and the migration of mud banks [80,81].For these reasons, the French Guiana coastal waters are considered as very turbid (and sometimes extremely turbid) waters [82].
In these regions, the spatio-temporal distribution of the IOPs is highly variable [83].In the coastal area of the Eastern English Channel, the bio-optical properties have a pronounced seasonal variation.Relevant differences are observed between winter and spring-summer periods.The CDOM largely dominates the total absorption in winter.In the spring-summer period, the phytoplankton absorption dominates [83].In French Guiana, the Suspended Particulate Matter (SPM) mainly dominates the IOPs variability, which presents a very high temporal variability with daily high-and seasonal low-frequency variations.High heterogeneity of the optical water types is also observed [84].
The ASD spectro-photometer measures the radiances of the back-scattered light by the water surface L t , the sky L s , and the Spectralon plate L p with a reflecting factor ρ p of about 99% [85][86][87][88], in a similar way as the TriOS-RAMSES above-water system [89,90].For each measurement, 10 or three (after 3 July 2017) scans by parameter were recorded.Knaeps et al. [85] showed an inter-comparison between ASD and above TriOS system measurements, and overall, the spectra match very well with each other.
The initial sequence of measurements is: L p , L s , L t .For measurements collected from November 2017, the measurement order was changed to: L p , L s , L t , L s , and L p to avoid the high variability of sky and illumination conditions by increasing the measurements of L p and L s .
The viewing geometry of L t and L s measurements were defined with a zenith angle of 45 • from the nadir and an azimuth angle of 135 • from the Sun plane, keeping the Sun in the back to minimize the sun-glint [85,88].For L p , the measurement geometry was perpendicular to the Spectralon plate [91].When possible, the wind speed was measured.To generate the radiances, the radiometric calibration was carried out using the dedicated auxiliary data and software delivered with the ASD spectro-photometer.
After performing the radiometric calibration, the first step of the post-processing protocol consisted of double-checking the quality of the measured radiances L p , L s , and L t .For each station, the coefficient of variation (CV Med ), the standard deviation over the median ratio, were computed at each λ.If CV Med (L x (λ)) was below 5%, 10%, and 10% for x = p, s, and t, respectively, the scans were conserved and used to compute the median value.If not, the relative median absolute difference (RMedD) was computed for each scan (L i , Equation ( 3)).The scans with values of RMedD(L i (λ)) equal to or below 10% for L p , L s , and L t were conserved and used to recompute the median Med(L x (λ)).
with ρ p (λ) the wavelength-dependent Spectralon plate reflectance and ρ s the air-sea interface reflection coefficient, which depends on the wind speed W [89] and is wavelength independent [88].The spectral variation of ρ p was delivered with the ASD Spectralon plate.
ρ s is computed as a function of W, following the sky plate Spectralon radiances ratio at 750 nm, dL s/p (750 nm), described in Equation ( 5).If dL s/p (750 nm) is <0.05 for clear sky conditions, ρ p is determined as a function of W, else if dL s/p (750 nm) is ≥0.05 for overcast sky, ρ p received a fixed value.When the wind speed was not available, W was assumed to be moderate and equal to 5 m•s −1 [89].This led to W ranging between 0.37 and 5 m•s −1 with a mean of 3.97 m•s −1 (±1.70 m•s −1 ).
The final step consists of a visual inspection of the spectrum shape.If the spectrum presents any discontinuity [89] or any anomalous shape, the spectrum is rejected.The quality assurance score (QAS) [92] was used for the selection of the spectra having QAS > 0.66.The QAS procedure is a four-step method.In the first step, the target reflectances are matched with the QAS reference reflectances with regards to the wavelength.In the next step, the target reflectances are normalized.Then, an optical water type is assigned to the target spectra by comparison to the QAS reference ones using the spectral similarity based on the spectral angle [93].Finally, the total score is computed following the mean of the sum of the spectral scores, which receives a value of one if the target spectral reflectance is in between the upper and the lower boundaries of the QAS reference reflectance [92].

Field-Measured Biogeochemical Parameters
During the sea campaigns, surface water samples were collected.Biogeochemical and bio-optical parameters were measured such as the turbidity and the absorption coefficient of the colored dissolved organic matter (a CDOM ).
The turbidity was measured using the HACH 2100Qis Portable Turbidimeter (HACH Company, Loveland, CO, USA), which meets the ISO-7027 norm and has a measurement range of 0-1000 Formazin Nephelometric Units (FNU) with a resolution of 0.01 FNU.Before each sea campaign, the turbidimeter was calibrated with the StabCal Formazin Standards.The turbidity was determined by calculating the mean and standard deviation values of ten replica by sample as described in Neukermans et al. [94].
The CDOM spectral absorption coefficient (a CDOM (λ)) was estimated based on the NASA protocol [95].The method consists of filtering the seawater sample under a gentle vacuum (<5 in Hg) using a 0.2-µm polycarbonate membrane.Using a double-beam UV-visible spectrophotometer (Shimadzu UV-2450 (Shimadzu Corporation, Kyoto, Japan)), CDOM absorbance was measured between 250 and 850 nm with 1 nm of spectral resolution.The CDOM absorbance was then used to compute a CDOM .The full details are given in [96].
The OLCI-A L2 was used for the validation of BlAC.L1B OCLI was processed using different software: the C2R-CC Processor Version 1.0 plug-in, under the Sentinels Application Platform (SNAP) v6.0 software [76] for C2R-CC and C2R-CCAltNets; the Polymer v4.10 archived package for Polymer [51]; the SeaWIFS Data Analysis System (SeaDAS) v7.5 for NASA [97].BlAC and Polymer v4.10 were system vicariously calibrated (SVC), while this was not the case for the other ACs.The SVCs of these former AC were performed using Case I water measurements from the MOBY (Marine Optical BuoY) [98] and BOUSSOLE (BOUée pour l'acquiSition d'une Série Optique à Long termE) [99,100].

Match-Ups Exercise
The goal of a match-up exercise is to co-locate satellite images and in situ measurements in time and space [101].To perform this comparison, the following steps have been taken into account.

Recommended Flags
The common and usually-used recommended flags for ρ w are: invalid, land, snow-ice, cloud, suspect, high sun-glint, whitecaps, high zenith angle, saturation, and AC failure pixels [101].Other types of flags were used depending on the AC: the adjacency effect and ρ w negative value between 412 and 665 nm flags for BlAC [36,70,102], out of scope and out of range of atmospheric correction neural nets flags for C2R-CC and C2R-CCAltNets [40,76], negative back-scattering coefficient, out of bounds, exception, thick aerosol, high air mass, and inconsistency flags for Polymer [51,103], and stray light, very low 560-nm water-leaving radiance L w (560 nm) and navigation fail flags for NASA [19,37,77].

Selection Criteria
The match-ups exercise is based on selection criteria [20,21,101,104].The selection criteria included a time window of ±2 h between the satellite overpass and in situ measurements times.In addition, the image pixels were extracted over a 3-by-3-pixel box centered on the in situ measurements.The number of valid pixels had to be at least 6 out of 9.The mean and the standard deviation of the valid pixels, in the 3-by-3-pixel box, were computed and used to determine the coefficient of variation at 560 nm (CV(560)), which has to be below 20% [101].
If a match-up passed the above-cited selection criteria (±2 h, valid pixels ≥6/9 and CV(560) ≤20%), the median of the pixel box was computed to avoid the outliers.

Statistics Analysis
The assessment of each AC was performed through the calculation of six statistical parameters alongside the scatterplots: the slope (α) and the intercept (β) of the regression line, the bias (Equation (6)), the relative error (RE, Equation ( 7)), the root mean squared error (RMSE, Equation ( 8)), and the correlation coefficient (R 2 ).
where N is the number of match-ups, ρ AC w is the water-leaving reflectance retrieved by the atmospheric correction algorithms, and ρ ASD w is the measured water-leaving reflectance using the ASD FieldSpec 4 spectro-photometer.
Full spectral statistical analyses were also performed to assess the full spectral quality of the AC ρ w retrievals: QAS [92], Chi-squared mean [104] (χ 2 , Equation ( 9)), and spectral angle mean [93,105] (SAM, Equation ( 10 where i is the index of the spectrum, j is the index of the λ between 400 and 709 nm excluding the 560-nm wavelength, X AC The number of match-ups provides information about the spatial coverage provided by each AC.QAS gives quantification of the full spectrum quality of ρ w retrievals with reference to an internal dataset [92].QAS values have to be closer to one for the highest quality of retrieved spectra.χ 2 provides information about the full spectrum relative errors of AC retrievals and has to be null.SAM indicates the mean of the full spectrum difference between the AC retrieved and the field measured ρ w spectra and has to be equal to 0 • .

Scoring Scheme
A scoring scheme was adapted from Müller et al. [104] with the aim to rank each AC following its performance in comparison to the other ACs.It was based on the scoring of the α, β, bias, RE, RMSE, R 2 , and N of each algorithm depending on their variation range between the minimum and the maximum values of each statistical parameter score considering all ACs.For example, in Equation (11), the algorithm presenting the closest slope to 1 received the highest score of 1 for a given wavelength λ j .The total score (S tot ) is the sum of the slope score (S α , Equation (11)), the intercept score (S β , Equation ( 12)), the bias score (S Bias , Equation ( 13)), the RE score (S RE , Equation ( 14)), the RMSE score (S RMSE , Equation ( 15)), the R 2 score (S R 2 , Equation ( 16)), and the N score (S N , Equation ( 17)) as a function of wavelength λ j (Equation ( 18)).
As 12 wavelengths and seven statistical parameter scores are taken into account, the maximum value of S tot is 84.Similar to QAS, SAM, and χ 2 , S tot quantifies AC performance over the full spectrum of ρ w .

Field Ocean Color Radiometry
The ASD FieldSpec4 measurement range was between 350 and 2500 nm.We made the choice to show only the 400-750 nm range because it is relevant to commonly-used OLCI spectral bands to access the biogeochemical parameters.Applying all criteria and flags led to obtaining 37 (out of 76) valid match-ups: 18 in the Eastern English Channel and 19 in French Guiana.Figure 2 shows the 37 spectra of ρ ASD w between 400 and 750 nm used for the validation of all ACs.The QAS mean score was calculated to assess the quality of these spectra; its value was 0.95, very close to one, assuring the high quality of our in situ measurements.The 37 in situ spectra are representative of a large range of bio-optical water types from weakly-turbid to very turbid waters, with a large range of magnitudes at 400 nm and a peak magnitude ranging around 490, 555, or 700 nm (or around 820 nm in French Guiana) [106,107].The use of the QAS algorithm allowed us to get 10 out of 23 optical water type clusters following the clustering method developed by Wei et al. [92].The 10 clusters stand for optical water types having their Chl-a between 0.94 and 10.41 mg•m −3 .The obtained clusters were used as a substitute to Chl-a concentration and allowed us to assess the AC accuracy following the optical water type clusters (see Section 4.2).

AC Overall Analysis
The number of match-ups depends on the selected AC: 18, 16, 19, 35, and 17 for BlAC, C2R-CC, C2R-CCAltNets, Polymer, and NASA, respectively, out of 37 valid match-ups.Figure 3 presents the scatterplots between the in situ ρ w (x-axis) and the estimated ρ w for all AC (y-axis) for λ between 400 and 754 nm.A visual inspection of the scatterplots shows that the accuracy of the retrievals was wavelength-dependent.For λ between 400 and 443 nm, the scattering of the retrievals was quite high for all ACs except C2R-CCAltNets.This is particularly true for λ at 400 nm.For λ between 490 and 674 nm, the scattering tended to decrease, and the AC performances seemed to increase, getting close to each other with increasing wavelength.For λ between 681 and 754 nm, the accuracy decreased, in particular for NASA at 681 nm.
The scatterplots analysis is in accordance with the spectral variation of the statistical parameters (Figure 4).The slope of the regression line, α, increased with the wavelength until 560 nm and flattened for λ greater than 560 nm.It was especially pronounced for NASA where a clear inflection was observed at 443 nm.C2R-CCAltNets presented the closest α values to one at 400 and 510 nm.Between 560 and 681 nm, it was C2R-CC, except at 620 nm (BlAC).For the longest wavelengths (≥709 nm), α was the closest to one for NASA.For most of the ACs, α increased from 400 nm-490 nm, except for C2R-CCAltNets, for which α was always close to one and did not seem to be wavelength-dependent.
The variations of R 2 (bottom right of Figure 4) were similar to those of α, but more pronounced, i.e., steeper increased values from 400-665 nm.All ACs had an R 2 value ≤90% between 400 and 560 nm, C2R-CCAltNets showing the highest values for this spectral range.From 560 nm, NASA presented the highest values, close to 100%.
The magnitude of the bias was wavelength-dependent, while its sign was algorithm-dependent.C2R-CCAltNets showed the lowest values at 400, 560, 620, 665, 674, 681, and 709 nm, whereas Polymer showed the lowest values at 412 nm.For 443 and 510 nm, C2R-CC was the least biased algorithm.BlAC was always negatively biased, while C2R-CCAltNets was always positively biased.For the other ACs, the sign of the bias depended on the wavelength.
RE showed the standard "smiley" shape with high values in the blue and in the red and low values in the green.Both Polymer and C2R-CC showed the lowest RE for all wavelengths, and their spectral variation were less pronounced compared to the other algorithms.
The spectral variation of RMSE was different from the other statistical parameters as its values continuously decreased with wavelength.For λ between 400 and 510 nm, C2R-CC showed the lowest values of RMSE, while the highest values were obtained for BlAC.For λ between 560 and 754 nm, C2R-CC and C2R-CCAltNets showed the lowest values, while the highest values were obtained for Polymer.It is worth noting that ρ w retrievals with NASA were affected by smile effect correction failure at 681 nm with peak of values for α, bias, RE, and RMSE.
BlAC and NASA performed better in French Guiana than in the Eastern English Channel waters.The other ACs performed equally well in both regions.
Table 2 gives the values of QAS, χ 2 , SAM, and S tot for each AC for a time window of ±2-h.C2R-CCAltNets had the maximum value of QAS (0.98), while BlAC and NASA had the lowest value (0.82).C2R-CCAltNets and Polymer had the minimum values of χ 2 (2.28 and 2.00, respectively), while BlAC had the highest (3.30).When the accuracy of the ACs decreased, χ 2 increased due to the increase of the relative squared errors of the spectral bands normalized at 560 nm.Polymer presented the lowest value of SAM (7.29) and NASA the maximum value (14.60).SAM had to be 0 • when the AC spectra were retrieved accurately, leading to a null angle to superpose the AC spectrum on the in situ-measured spectrum.For S tot , C2R-CCAltNets was the best-performing AC and had the maximum score of 56.46.In second rank, C2R-CC had a value of 51.99.BlAC showed the lowest value of 36.01.)) are represented in the x-axis and the water-leaving reflectance AC retrieval (ρ AC w )) is represented in the y-axis.The black solid lines represent the 1:1 line.The colored solid lines show the linear regression lines between the field-measured and AC retrievals of ρ w .The color lines code and the number of match-ups are indicated in the text box of the top left scatterplot (λ = 400 nm), where magenta, blue, orange, green, and red colors stand for the baseline atmospheric correction algorithm (BlAC), the Case 2 regional coast color atmospheric correction algorithm (C2R-CC), the alternative neural nets of C2R-CC atmospheric correction algorithm (C2R-CCAltNets), Polymer, and NASA, respectively.

AC Concomitant Analysis
The preceding section presented the general evaluation of the AC for all match-ups.Here, we present the same analysis, but considering only the common match-ups to all AC.The number of match-ups decreased from 37 to 14; 10 in the Eastern English Channel and four in French Guiana.Figures 5 and 6 are similar to Figures 3 and 4. The trends observed in the previous section were similar when taking only the common match-ups.The retrievals were not accurate for λ between 400 and 443 nm and between 665 and 754 nm, while they were more accurate for λ between 490 and 620 nm.
Furthermore, for the spectral statistics variation, the ACs spectral shapes of the AC common match-ups were similar for α, β, R 2 , and RMSE.Moreover, C2R-CC and especially C2R-CCAltNets statistical performances decreased in comparison to the initial performance analyses, while the accuracy for the other ACs (NASA, BlAC, and Polymer) showed a relative increase.Here, Polymer became the most efficient AC when taking only the AC concomitant match-ups.
Table 3 is similar to Table 2. Taking into account only the common match-ups and the ±2-h time window gives quite a similar analysis in comparison to the other ∆t.C2R-CCAltNets had the highest QAS value (0.99), while BlAC had the lowest value (0.82).For χ 2 , NASA had the lowest value (2.18), while BlAC had the highest value (4.19).
In terms of SAM, the minimum value was obtained for Polymer (6.80) and the highest for NASA (15.01).For S tot , Polymer had the maximum value (65.33) followed closely by C2R-CCAltNets (58.05), while BlAC presented the lowest score (36.11).On the common match-ups, the results were slightly different, with Polymer being the most accurate AC on our dataset, closely followed by C2R-CCAltNets.In all cases, the least accurate AC was BlAC.)) are represented in the x-axis and the water-leaving reflectance AC retrieval (ρ AC w )) are represented in the y-axis.The black solid lines represent the 1:1 line.The colored solid lines show the linear regression lines between the field-measured and AC retrievals of ρ w .The color code is similar to that in Figure 3, and the number of match-ups is indicated in the text box of the left-top scatterplot (λ = 400 nm). .Statistical properties of the AC concomitant match-ups evolution as a function of wavelength between 400 and 750 nm.The statistical properties: slope (α), intercept (β), bias, relative error (RE), root mean squared error (RMSE), and correlation coefficient (R 2 ) are represented in the y-axes in the subplots starting from the left-top subplot until the right-bottom subplot, respectively.The x-axes represents the variation of the wavelength between 400 and 750 nm.The color code is similar to that in Figure 3 and given with the match-ups numbers in the top-middle subplot text box.

Impacts on the Number of Match-ups
The number of our field-measured OCR was somewhat limited, but the variability of ρ w was high, so we strongly believe that our in situ dataset was representative of a wide range of situations observed over the Eastern English Channel and French Guiana [21,83,84].This is the reason why we used a ranking scheme and spectral analysis and not only the classic statistical parameters.

Impact of the Time Window
The match-ups exercise was repeated for different time-window values: ±1.5, ±1, and ±0.5 h to assess the ACs behavior in accordance with the time window.Comparing χ 2 , SAM, and S tot values for each AC showed that the full spectrum of statistical parameters of each AC was slightly different, except for the shortest time-window for which the ranking based on S tot changed (Tables 1 and 2).For ∆t = ±0.5 h, C2R-CC was ranked first, followed by C2R-CCAltNets, and BlAC was still ranked at the last position.Moreover, a shorter time window can lead to better results in terms of S tot (BlAC, C2R-CC, and NASA for the individual match-ups).
The number of match-ups depends on the selected AC.Furthermore, the time window highly impacts the number of match-ups.The shorter the time window is, the smaller the number of match-ups is: 21, 28, 33, and 45 stations were rejected due to the ±2-, ±1.5-, ±1-, and ±0.5-h time windows, respectively.Furthermore, for the initial time window (±2 h), only 50 stations were valid for match-ups.Based on the value of CV(560), only two stations were rejected for Polymer.For the other ACs, CV(560) was ≤20%.

Impact of the Recommended Flags
The flags used to get the match-ups can impact the number of valid values and the accuracy of the ACs.For instance, Polymer had 13 (out of 48) match-ups that were rejected.The principal flag was the cloud flag (11) and then the inconsistency flag (2), where t*ρ w or ρ a exceeded ρ toa .Here, the only limitation for Polymer was the cloudy condition.For BlAC, 32 stations were rejected (out of 50) using the recommended flags: cloud (11), cloud margin (10), high solar zenithal angle (4), high glint (13), ρ w negative value between 412 and 665 nm (3), and the 560-nm optical thickness annotation (1).The high glint and clouds were the most limiting flags for BlAC.For NASA, 33 spectra (out of 50) were rejected due to the clouds (24), the high glint (5), the high zenithal angle (5), and the land (5) flags.Concerning C2R-CC and C2R-CCAltNets, both shared the same number of valid and rejected (31) match-ups due to the cloud (22), sun-glint risk (17), and the AC ANN out of scope and range (1) flags.Most of the ACs were highly limited by the high glint, except Polymer.It seems that the cloudy and sun-glinted stations were not flagged in the same way depending on the AC.This led to a different number of valid match-ups.The high solar zenithal angle flag was only limiting for BlAC and NASA.This happened for four stations in the Eastern English Channel.The presence of the high sun-glint situation occurred more often over French Guiana.The cloudy situation was limiting for both areas.

Sensitivity Studies
Even if any atmospheric and bio-optical parameters were measured during our sea cruises, we tried to understand the limitations of each AC as a function of physical and biogeochemical parameters: optical water type cluster indexes retrieved by QAS (QAS Cluster ID#), turbidity, a CDOM (412), and CV(560) for the valid match-ups of each AC (the figure is not shown here).The goal was to observe if any correlation existed between the statistical parameters and those external parameters.
For instance, the distribution of RE at 412 nm as a function of QAS clusters showed a dependence in the ACs.For BlAC, no trends existed.For NASA (except for one station (in Cluster 8)), RE(412) strongly increased with Chl-a (QAS cluster ID) increase.For Polymer, RE(412) was homogeneously distributed and did not exceed 80%.For both C2R-CC and C2R-CCAltNets, no relationship was observed between RE(412) and the optical water type clusters.In the turbidity-RE(412) scatter analyses, six stations were missing due to the absence of measurements, and one station presented a high relative standard deviation.No trends were observed between RE(412) and the turbidity for C2R-CC and C2R-CCAltNets.A positive relationship was observed for NASA: RE(412) increased with the increase of turbidity.For turbidity greater than 2 NTU, a positive RE(412)-turbidity relationship was observed for Polymer and BlAC.The RE(412)-a CDOM scatter analyses (12 stations were missing) also showed a logarithmic trend for NASA.No trends were observed for the other ACs.Concerning CV(560)-RE(412) scatter analyses, no particular trends were observed for all ACs.Polymer showed the lowest CV(560) in comparison to the other ACs.
This additional sensitivity study revealed no strong trends between physical and biogeochemical parameters and RE (at any wavelengths) such as observed for SeaWIFS by [20].This suggests that ACs are less sensitive to the biogeochemical parameters.However, it suggests that Polymer can be affected by high values of turbidity where Chl-a and CDOM do not dominate the marine signal.CDOM, Chl-a, and their resulting turbidity (turbidity indirectly resulting from CDOM) could be limiting for NASA due to the use of Chl-a estimates in the Case-2 iteration scheme.The same could not be observed for the BlAC.The lack of atmosphere and more seawater parameter data limited our discussion.
The AC negative values were taken into account in the match-ups exercise and were included in the scatterplots and the statistics calculations except for BlAC between 412 and 665 nm, as applied by the its recommended flags.BlAC had negative values at 400 nm (1), 709 nm (2), and 754 nm (2), while Polymer and NASA had negative values (six and two, respectively) at 709 nm.The presence of negative values did not impact the statistical parameters of the retrievals.No obvious differences are observed between Figures 3 and 7, wherein the negative values were used to compute the statistical parameters.It is worth noting that the SVC was not applied to all ACs except BlAC and Polymer.
The absence of the SVC can explain the performance and the increase of the error retrieval for NASA, C2R-CC, and C2R-CCAltNets.Statistical properties without negative values between 400 and 754 of the AC match-ups evolution as a function of wavelength between 400 and 750 nm.The statistical properties: slope (α), intercept (β), bias, relative error (RE), root mean squared error (RMSE), and correlation coefficient (R 2 ) are represented in the y-axes in the subplots starting from the left-top subplot until the right-bottom subplot, respectively.The x-axis represents the variation of the wavelength between 400 and 750 nm.The color code is similar to that in Figure 3 and given in the top middle subplot with the match-ups numbers.

ACs Band Ratios Performance Impacts
Figure 8 shows the comparison between the water-leaving reflectances ratios for 443/560 nm, 490/560 nm, and 510/560 nm.The spectral bands ratios of AC retrieved and ASD measured ρ w were used to analyze the AC impact on the estimation of the Chl-a, as its estimation is very often based on combination of bands ratios (OC4v4 [108]).Table 4 shows the computed statistical parameters (RE, RMSE, and R 2 ) of the ACs retrieved ρ w 443/560 nm, 490/560 nm, and 510/560 nm bands ratios.(560 nm)) are represented in the x-axis and the ACs retrieved ratios (ρ AC w (λ)/ρ AC w (560 nm)) in the y-axis.The color code is similar to that in Figure 3, and the number of match-ups is provided in the text box of the middle scatterplot.
Polymer provided the closest one-to-one regression line for all ratios, as well as the highest R 2 (66.39%, 89.14%, and 89.14% for 443/560 nm, 490/560 nm, and 510/560 nm bands ratios, respectively).BlAC estimates were the least accurate band ratios for 443/560 nm (RE = 34.03% and R 2 = 9.21%) and 490/560 nm (RE = 13.51% and R 2 = 75.74),and it was C2R-CCAltNets for the 510/560-nm ratio (RE = 9.53%, RMSE = 0.11 and R 2 = 51.08%).This is in concordance with the ACs performance inter-comparison results exposed in Section 3.2, and it suggests potentially that the estimation of the Chl-a concentration [108] is impacted by the AC ρ w retrievals (except if the ratio was 510/560 nm).
As the SPM concentration estimation is often based on the use of semi-analytical algorithms (SAA) [109][110][111], it is less impacted by the AC performance if the red spectral band is used in the SPM bio-optical model (560 or 665 nm).Furthermore, all ACs showed the best correlations and lower retrieval errors at the 665-nm spectral band instead of longer wavelengths (λ > 709 nm).This suggests that the use of the red bands is recommended to estimate the SPM concentration instead of the NIR bands.

BRDF Effect Issue
The ACs retrieved ρ w were provided without BRDF correction (except for the NASA AC), so the ASD ρ w were not corrected for this effect.Besides, there was still no best solution for this, and it requires, at a given wavelength, a good knowledge of the IOPs, the geometry of observation, the wind speed, and, to a lesser extent, the atmospheric conditions.This correction is proportional to ( Q f ) ( , where the index 0 stands for the Sun at zenith and a viewing angle at nadir.This factor typically varies between 0.6 and 1.2 [112].While R 0 R accounts for all transmission and reflection effects at the air-sea interface, the f Q ratios, at a given wavelength, are driven by IOPs and the geometry of observations [113].Note that the upwelling light fields can only be considered as purely isotropic (which provide a simple solution of the BRDF) for waters with extreme turbidity [114].

Conclusions
Validation of OLCI water-leaving reflectance products has been proposed over two contrasted French coastal waters obtained by five different atmospheric correction algorithms.Sea cruises radiometric measurements were collected using a spectro-photometer instrument.Post-processing was developed to assess the quality of the in situ water-leaving reflectances.These in situ measurements were used to conduct a match-up exercise leading to the estimation of statistical parameters and the ranking scheme used in the validation of the water-leaving reflectance retrievals.
The ranking scheme and statistical analysis used in this study show their usefulness for the assessment of the ACs.It allowed us to handle the spectral variations of the statistical parameters from 400-754 nm, which were presented in the match-up exercise results.Furthermore, the ACs performance was taken into account to give in the end a specific AC score that allowed us to rank the ACs following their efficiency.Such a scoring method is highly recommended for the ACs inter-comparison and can be easily extended to the other ocean color retrieval algorithms.
Over our in situ measurements, high performances were obtained for Polymer and C2R-CCAltNets.However, for all ACs, the performance was not very high at 400 and 443 nm.The ACs performances were higher between 490 and 560 nm for all ACs.A sensitivity study was performed to highlight the AC limitations depending on the turbidity or biogeochemical parameters.The sensitivity analysis showed no limitations and correlation with the chosen in-water parameters.
The ease of using the atmospheric correction processors is a very important criterion for the end-users.In terms of the processing time, Polymer had the fastest processing, followed by NASA and both C2R-CC versions.Polymer is recommended for atmospheric correction facility.The standard OLCI AC, BlAC, still needs improvements.
The common match-ups analyses allowed a better direct comparison of the performances of the AC.This allowed avoiding the impact of the ACs specific recommended flags that were used in the AC individual match-ups results.The match-ups number was highly impacted by the flags used, which indirectly impacted the resulting ACs performance assessments and the ACs score ranking.While C2R-CCAltNets was the most accurate AC over the individual match-up exercise, Polymer was the most efficient algorithm for the common match-ups.We recommend using either Polymer or C2R-CCAltNets to process OLCI images over our regions of interest or over similar regions.

Figure 1 .
Figure 1.Maps of field OCR measurements locations over the Eastern English Channel (left) and French Guiana (right).

j
and X ASD j are ρ AC w (λ j ) and ρ ASD w (λ j ) normalized at 560 nm, respectively, and ρ ASD w , ρ AC w are the dot products of ρ ASD w , ρ AC w , and ρ ASD w .ρ AC w are the Euclidean norms of ρ ASD w and ρ AC w , respectively.

Figure 2 .
Figure 2. Spectral variation of the in situ ρ w obtained with the ASD spectro-photometer between 400 nm and 750 nm.The blue color represents the Eastern English Channel station, whereas the red one represents the French Guiana stations.The total number of spectra is given in the top of the panel in addition to the mean of the quality assurance score (QAS).

Figure 4 .
Figure 4. Spectral variation of the statistical parameters between 440 and 750 nm: α (top-left), β (top-middle), bias (top-right), RE (bottom-left), RMSE (bottom-middle), and R 2 (bottom-right).The color code is similar to Figure 3 and given in the text box of the top middle subplot in addition to the match-up numbers.

Figure 5 .
Figure 5. Scatterplots of the AC concomitant match-ups of the AC retrieval and the field measured water-leaving reflectance for 400, 412, 443, 490, 510, 560, 620, 665, 674, 681, 709, and 754 nm.In the multiple scatterplots figure, the field measured OCR (ρ ASD w Figure 6.Statistical properties of the AC concomitant match-ups evolution as a function of wavelength between 400 and 750 nm.The statistical properties: slope (α), intercept (β), bias, relative error (RE), root mean squared error (RMSE), and correlation coefficient (R 2 ) are represented in the y-axes in the subplots starting from the left-top subplot until the right-bottom subplot, respectively.The x-axes represents the variation of the wavelength between 400 and 750 nm.The color code is similar to that in Figure3and given with the match-ups numbers in the top-middle subplot text box.

Figure 7 .
Figure7.Statistical properties without negative values between 400 and 754 of the AC match-ups evolution as a function of wavelength between 400 and 750 nm.The statistical properties: slope (α), intercept (β), bias, relative error (RE), root mean squared error (RMSE), and correlation coefficient (R 2 ) are represented in the y-axes in the subplots starting from the left-top subplot until the right-bottom subplot, respectively.The x-axis represents the variation of the wavelength between 400 and 750 nm.The color code is similar to that in Figure3and given in the top middle subplot with the match-ups numbers.
)). QAS uses 7 wavelengths: 412, 443, 488, 510, 555, 667, and 687 nm, which are the only QAS reference ρ w wavelengths available and closer to the OLCI spectral bands wavelengths.If a wavelength does not exist for OLCI, the nearest wavelength was taken.

Table 2 .
Values of QAS, χ 2 , SAM and S tot associated with different time windows.N is the number of match-ups and ∆t is the time window.SAM, spectral angle mean.

Table 3 .
Values of QAS, χ 2 , SAM, and S tot associated with different time windows and the AC common match-ups.N is the number of match-ups, and ∆t is the time window.