Assessment of Approximations in Aerosol Optical Properties and Vertical Distribution into FLEX Atmospherically-Corrected Surface Reflectance and Retrieved Sun-Induced Fluorescence

Physically-based atmospheric correction of optical Earth Observation satellite data is used to accurately derive surface biogeophysical parameters free from the atmospheric influence. While water vapor or surface pressure can be univocally characterized, the compensation of aerosol radiometric effects relies on assumptions and parametric approximations of their properties. To determine the validity of these assumptions and approximations in the atmospheric correction of ESA’s FLEX/Sentinel-3 tandem mission, a systematic error analysis of simulated FLEX data within the O2 absorption bands was conducted. This paper presents the impact of key aerosol parameters in atmospherically-corrected FLEX surface reflectance and the subsequent Sun-Induced Fluorescence retrieval (SIF). We observed that: (1) a parametric characterization of aerosol scattering effects increases the accuracy of the atmospheric correction with respect to the commonly implemented discretization of aerosol optical properties by aerosol types and (2) the Ångström exponent and the aerosol vertical distribution have a residual influence in the atmospherically-corrected surface reflectance. In conclusion, a multi-parametric aerosol characterization is sufficient for the atmospheric correction of FLEX data (and SIF retrieval) within the mission requirements in nearly 85% (70%) of the cases with average aerosol load conditions. The future development of the FLEX atmospheric correction algorithm would therefore gain from a multi-parametric aerosol characterization based on the synergy of FLEX and Sentinel-3 data.


Introduction
Atmospheric correction of optical Earth observation data is one of the critical steps in the data processing chain of a satellite mission as it allows deriving surface biogeophysical parameters free from the atmospheric influence [1,2].Generally, atmospheric correction methods can be classified in two main families [3,4]: (1) empirical and (2) physically-based inversion methods.Empirical methods are based on mathematical approximations of the atmospheric radiative transfer equation and the inversion of surface properties directly from sensor data through empirical assumptions [5,6].Physically-based methods [7][8][9][10] typically comprise two main steps: (1) derivation of key atmospheric parameters and (2) decoupling of surface properties from atmospheric effects by inversion of a Radiative Transfer Model (RTM).While empirical methods have a lower computation burden than physically-based inversion methods, their accuracy is also generally lower [1,6].Consequently, when it comes to missions that require an accurate determination of surface properties, then physically-based inversion methods are preferred.
Physically-based atmospheric correction methods either apply an image-based retrieval of key atmospheric parameters, particularly (1) water vapor, (2) surface pressure and (3) aerosol properties, or take those parameters from external reference data [11,12].Water vapor and surface pressure can be univocally determined, e.g., through differential absorption techniques [13] or meteorological pressure fields [11] and digital elevation models.However, the complete characterization of aerosol optical properties (e.g., extinction, absorption and scattering) can only be determined by instruments that provide multi-angular, multi-spectral and polarimetric capabilities such as the Multi-viewing, Multi-channel, Multi-polarization Imager (3MI) on-board of MetOp-Second Generation [14].In addition, retrieval of aerosol vertical distribution is limited to active optical instruments such as the Cloud-Aerosol Lidar and Infrared Pathfinder Satellite Observation (CALIPSO) mission [15].Unfortunately, such capabilities are usually not implemented in most optical satellite missions and thus most atmospheric correction algorithms aim to compensate the net radiometric effects of aerosol properties.In practice, this means that atmospheric correction algorithms typically implement parametric approximations or assume default values for some key aerosol properties.For instance, this is the case for advanced atmospheric correction methods such as applied to MEdium Resolution Imaging Spectrometer (MERIS)/Advanced Along Track Scanning Radiometer (AATSR) [10] where only aerosol optical thickness (AOT) and aerosol type (from a discrete set of standard aerosol types, e.g., [16]) are derived from satellite data by comparison against RTM-simulated Top-of-Atmosphere (TOA) radiances.
The question now arises whether the use of parametric approximations and assumptions about default values are sufficient to describe the impact of aerosol properties at TOA radiance.While approximations are generally implemented in most physically-based atmospheric correction methods [7][8][9][10], some studies remark the importance of typically neglected aerosol properties such as absorption [17] and vertical distribution [18,19] in atmospheric correction and radiative forcing of aerosols.This question is particularly relevant for the atmospheric correction of instruments measuring radiance in deep absorption bands at a high spectral resolution.
The European Space Agency's (ESA) Fluorescence Explorer (FLEX) mission aims to derive global maps of vegetation Sun-Induced Fluorescence (SIF) from measurements in the deep Oxygen absorption bands at 687 nm and 760 nm [20,21].FLEX is equipped with the FLuORescence Imaging Spectrometer (FLORIS) [22] that will measure radiance within these O 2 bands at a spectral sampling of 0.1 nm, a spectral resolution of 0.3 nm and a spatial resolution of 300 m.At TOA level, SIF signal is around two orders of magnitude lower than the surface reflected radiance and thus errors appearing in the atmospheric correction can be largely propagated to errors in the retrieved SIF.FLEX will therefore make use of the synergy with Sentinel-3 [23], acquiring data over the same target with 6-15 s time delay at 300 m spatial resolution, to accurately characterize the atmospheric state.
The main objective of this paper, therefore, is to study the accuracy of using parametric approximations to describe the net radiometric effects of aerosol properties through their impact in atmospherically-corrected FLORIS data (i.e., surface reflectance) and subsequent retrieved SIF.Based on modeled data, we quantified how the net radiometric effects caused by a variability of aerosol types and vertical distributions are compensated by approximations in (1) the spectral behavior of the aerosol extinction, (2) the phase function and (3) the single-scattering albedo.These insights will ultimately contribute to support progress in atmospheric correction algorithms for the FLEX/Sentinel-3 tandem mission, and to reduce errors in the retrieval of SIF.
The remainder of the presented work is organized as follows.Section 2 gives an overview of aerosol optical properties and their common parametric approximations.Section 3 presents the materials and methods, specifically a newly developed software tool used to compute multi-dimensional look-up tables based on the execution of atmospheric RTMs.Section 4 focuses on the comparison of two simulated datasets in order to study how the implemented parametric approximations compensate the net radiometric effect of complex aerosol optical properties within the O 2 bands.The analysis is conducted at three levels: (1) in the atmospherically-corrected FLEX data, (2) in the derived key aerosol parameters and (3) in the subsequent SIF retrieval.Finally, we conclude our work with a discussion of the results and their implications into FLEX atmospheric correction algorithm.

Aerosol Optical and Physical Properties
The propagation of an electromagnetic signal in some medium is described by its reflection, refraction, diffraction, absorption, polarization and scattering properties.In the Earth's atmosphere, suspended matter (i.e., aerosol) is an important contributor to light absorption and scattering; specifically in the visible to short-wave infrared spectral range (from 400 nm to 2500 nm).With respect to their impact in light radiative transfer, aerosols can be described by their optical properties and their particle density vertical distribution in the atmosphere.Aerosol optical properties are typically characterized by their wavelength-dependent extinction coefficient ε k , which represents the loss of light per unit of optical path (usually in units of km −1 ).As shown in Equation (1), two mechanisms are sources of extinction: absorption (ε a ) and scattering (ε s ).
The proportion of scattering with respect the total extinction is represented by the wavelength-dependent single-scattering albedo (SSA), The amount and spectral behavior of the extinction coefficient is respectively driven by the number and proportion of small (0.1-2.5 µm radius) and large (>2.5 µm radius) particles, also known as fine and coarse modes.Each of these two modes can empirically be modeled by the Ångström law [24] in Equation ( 2): where λ 0 is a reference wavelength and α is the so-called Ångström exponent, which is also wavelength-dependent [25].
Additionally, the angular distribution of the scattered light with respect to the incident beam depends on the aerosol's particle shape and size.The aerosol phase function Φ λ (θ, φ) represents, for a given wavelength λ, the fraction of scattered light in the direction represented by the scattering and azimuth angles (θ and φ respectively).Typically, azimuthal symmetry is assumed given the random orientation of aerosol particles.The aerosol phase function can be described by the wavelength-dependent asymmetry parameter g λ , which is the average over all scattering directions of the cosine of the scattering angle as in Equation (3): Given this asymmetry parameter, the phase function can then be computed assuming a mathematical formulation such as the Mie theory for spherical particles [26] or the parametric Henyey-Greenstein (HG) approximation [27] (see Equation ( 4)).
With these aerosol optical properties (extinction, absorption and phase function), the effects of aerosols onto radiance detected by a sensor can be further described by considering the vertical distribution (i.e., height profile) of aerosol particle density.Aerosol particle density typically follows a decreasing vertical distribution with, in normal meteorological conditions, higher densities in the boundary layer (0 km to ∼2 km) and troposphere (∼2 km to 10 km) [16].Among these two layers, variability and mixture of aerosol types is higher in the boundary region [28,29] due to interactions between land surface and atmosphere (e.g., biomass burning, dust storms or urban pollution).The vertical distribution in each layer largely depends on particle density and meteorological conditions, particularly wind, turbulence and vertical profiles of pressure and temperature.The variability of this vertical distribution can be partially calculated with numerical weather prediction models [30].However, these models are neither practical for simulation of large databases nor for near-real time atmospheric correction algorithms.Instead, aerosol vertical distribution is typically modeled with simple empirical approximations such a decreasing exponential [16] (see Equation ( 5)): Here, the extinction coefficient at an altitude h (in km), ε k (h), is driven by the scale height parameter Z (in km).The integral of the extinction coefficient over the entire atmospheric column (0-100 km) is known as Aerosol Optical Thickness (AOT), which is also wavelength dependent.

Materials and Methods
To facilitate evaluating the impact of introduced approximations and assumptions on aerosol properties in the atmospheric correction of FLORIS TOA radiance and the subsequent SIF retrieval, we developed a new software tool (see Section 3.1) to simulate several surface reflectance datasets (see Section 3.2).The implemented method for the analysis and comparison of these datasets is further presented in Section 3.3.

Atmospheric Look-Up Table Generator
With the purpose of automating the generation of the datasets used in this work, we developed a friendly graphical user interface software tool (Atmospheric Look-up table Generator: ALG, v1.0) that enables executing the MODerate resolution atmospheric TRANsmission (MODTRAN) RTM (v5.2.1) [31] (see Figure 1) and generates large multi-dimensional look-up tables.In addition, similarly to AEROgui [32], ALG integrates the Optical Properties of Aerosols and Clouds (OPAC) (v3.1) database [16].When coupled with an atmospheric RTM, OPAC expands the pre-defined RTM aerosol models with a large and comprehensive database of complex aerosol optical properties.ALG also allows users to (see Figure 2 • Define input geometric conditions such as viewing and illumination zenith/azimuth angles, surface height and sensor altitude.• Set the spectral range and resolution of the output data at non-contiguous spectral intervals.As shown in Figure 1, ALG first generates the RTM input files corresponding to: (1) all combinations of input key variables or (2) a near-random sampling of the input variable space based on the Latin Hypercube sampling (LHS) distribution [33].Second, parallel instances of a RTM are run on a process similar to the implemented in [34].Last, the output database is computed.It contains the following atmospheric transfer functions for each combination of key input parameters: • Atmospheric path radiance (L 0 ) • At-surface direct and diffuse solar irradiance (E dir and E di f ).
• Target-to-sensor direct and diffuse transmittance (T dir and T di f ).
The computation of the above atmospheric transfer functions is performed using the MODTRAN5 interrogation technique developed by Guanter et al. (2009) [35].

Description of Simulated Datasets
We used the ALG tool to produce three MODTRAN5-based datasets for the analysis of aerosol properties in the spectral range around the O 2 -B (680-700 nm) and O 2 -A (755-775 nm) bands.MODTRAN5 was chosen to generate these three datasets given that it accurately simulates the coupled effects of absorption and scattering through its correlated-k option [36].
The first dataset (further referred to as analysis) functions to evaluate the relative contribution into TOA radiance of various aerosol optical properties and vertical distribution thanks to the versatility in MODTRAN5 to introduce user-defined aerosol properties.Thus, this dataset (see Table 1) includes typical variability [12,16,17] in: (1) the AOT at 550 nm (AOT 550 ); (2) the spectral dependency of the extinction coefficient, through the Ångström exponent; (3) the phase function, through the HG asymmetry parameter; (4) the single scattering albedo; and (5) the vertical distribution, through the scale height Z and top height of the boundary layer H max .The analysis dataset consists of 2000 samples with a LHS distribution at a spectral resolution of 5 cm −1 (approx.0.3 nm at 760 nm).The second dataset (further referred to as reference) aims to reproduce a realistic set of FLORIS TOA radiances observations.The OPAC aerosol database is used to provide a realistic and varied set of wavelength-dependent aerosol optical properties (e.g., extinction, absorption and asymmetry parameter) in the boundary layer.Furthermore, this dataset varies the aerosol vertical distribution by considering several values of the aerosol scale height and top height in the boundary layer.Table 2 summarizes the values of the key aerosol variables for the reference dataset, with a total of N re f = 2625 combinations.The third dataset (further referred to as retrieval) implements parametric approximations and assumptions on the aerosol optical properties and vertical distribution.Therefore, the retrieval dataset aims to represent the data used in an atmospheric correction algorithm.We consider the following approximations and assumptions on the aerosol optical properties and vertical distribution: • A single Ångström Law with spectrally-invariant Ångström exponent reproduces the effective spectral dependency of the extinction coefficient of both fine and coarse modes in the boundary layer and troposphere.• The aerosol scattering phase function is modeled by the HG approximation with a spectrally-invariant asymmetry parameter.• Different aerosol types have a low variability of values and spectral dependency of the SSA [17] and, therefore, it is considered to be spectrally-invariant in the 680-775 nm region.• For spaceborne measurements, it is considered that the aerosol vertical distribution has a lower impact in TOA radiance than the caused by the variability of AOT 550 , Ångström exponent, asymmetry parameter and SSA.Thus, we assume that a predefined aerosol vertical distribution is representative of the net radiometric effect of most vertical distributions.
In order to sample the full key input parameter space, we used a LHS distribution with a total of N ret = 3500 samples.Table 3 summarizes the minimum/maximum values of each key aerosol variables for the retrieval dataset.Further, in order to evaluate the impact of additional simplifications and assumptions, the retrieval dataset was divided into four subsets as shown in Table 4: Both the reference and the retrieval datasets were generated at high spectral resolution with a spectral sampling of 0.1 cm −1 (approx.0.006 nm at 760 nm).
For each sample in the analysis, reference and retrieval datasets, we calculated the corresponding TOA radiance spectra by coupling the generated atmospheric transfer functions with a reference Lambertian surface reflectance (ρ) and isotropic SIF (F) spectra (see Figure 3) following Equation ( 6): where T = T dir + T di f is the total target-to-sensor transmittance and E tot = E dir cos θ il + E di f is the total at-surface irradiance for a solar zenith angle of θ il (55 • in our case).The Lambertian surface reflectance assumption is considered to be valid due to the coarse resolution of the FLEX data (300 m) [9] and since the near-nadir observation geometry is not in hot spot direction [37,38].We simulated synthetic FLORIS TOA radiance data for each sample of the reference and the retrieval datasets.The high spectral resolution TOA radiance from Equation ( 6) was resampled to a lower spectral resolution data to mimic FLORIS data.This was achieved through convolution with FLORIS spectral response function, modelled with a double error function (see Equation ( 7)): Here er f (λ; s, c ± b) represents an error function (see Equation ( 8)) evaluated in the high resolution wavelength domain λ at the spectral channel centered at the wavelength c: The parameters s = 17.5 and b = 0.3 nm represent respectively the slope of the error function and the full-width at half maximum of FLORIS spectral response function.The synthetic FLORIS TOA radiance, L , was therefore obtained by the convolution in Equation ( 9) for a set of spectral channels c in the range around the O 2 -B and O 2 -A absorptions and with a spectral sampling of 0.1 nm.

Data Analysis
In order to identify the aerosol properties in FLORIS data variables, we first conducted a global sensitivity analysis (GSA) of the TOA radiance simulations from the analysis dataset.GSA explores the full input variable space and evaluates the relative importance of each input variable in a model [39].The method can be used to identify the most influential variables affecting model outputs.In variance-based GSA, the contribution of each input variable to the variation in outputs is averaged over the variation of all input variables, i.e., all input variables are changed together.In a related study [40,41], the method of Satelli et al. (2010) [42] was applied to the SCOPE model [43,44] to identify the driving variables that shape the variability of canopy-leaving SIF spectrum across its full spectral range.The method has been demonstrated to be effective in identifying both the main sensitivity effects (first-order effects, i.e., the contribution to the variance of the model output by each input variables) and total sensitivity effects (the first-order effects plus interactions with other input variables, S Ti ) of input variables.However, as many simulations are required, i.e., typically about 1000 per input variable, GSA is a computationally expensive method and thus limited by the processing speed of the model under study.In order to bypass the computational burden of costly RTMs, a computationally effective technique has recently been proposed by only using a limited number of RTM executions.The technique consists of approximating the RTM by a surrogate model with low computation time, also referred to as meta-model or emulator [45].Once having a successful emulator developed, the idea is that it can replace the original RTM and be readily applied to perform sensitivity analysis.Accordingly, emulators can provide an efficient mean for applying a GSA to large and expensive models, including the computationally expensive MODTRAN model.An earlier study over the full 400-2500 nm MODTRAN spectral range confirms the applicability of the proposed method [46].Here, we extended this method for the GSA of TOA radiance within the O 2 -B (680-700 nm) and O 2 -A (755-775 nm) bands at a spectral resolution of 5 cm −1 (approx.0.3 nm at 760 nm).The residuals of the emulator used for GSA were first analyzed on their accuracy; the residuals are a factor ∼20 lower than variations in TOA radiance caused by errors of 1% in key input aerosol parameters, and can thus be considered as accurate for the GSA.
Secondly, we analyzed the impact of approximations in aerosol properties in FLORIS atmospherically-corrected surface reflectance.To do so, we applied the approach shown in the flow chart illustrated in Figure 4, and described in the following paragraphs.

Definition of error thresholds
Reflectance error thresholds (Figure 4)

Figures 7 and 8
Reference dataset

Atmospheric correction
Eq. ( 11) Transfer functions Retrieved reflectance (r' ret ) The first step aims to reproduce the atmospheric characterization step of a physically-based atmospheric correction algorithm.This step consists on finding the sample (j best ) in the retrieval dataset that best matches each sample (i) in the reference dataset by minimizing the cost function in Equation (10) as proposed in [9,10]: where the subindex re f and ret refer respectively to the reference and the retrieval datasets.The cost function e(i, j) only considers the n spectral channels, c, outside the O 2 absorption bands.
The second step performs the atmospheric correction, inverting the apparent reflectance ( ρ) of each reference dataset spectra from the first order Taylor expansion of Equation ( 6) [20] (see Equation ( 11)).
Here, the apparent reflectance ( ρ) couples the effect of surface emitted SIF and Sun-reflected radiance, i.e., E tot ρ = E tot ρ+πF.The brackets • represent the result of convolving the high spectral resolution atmospheric transfer functions by FLORIS spectral response function following Equations ( 7)- (9).
The atmospheric correction is applied to the reference and retrieval datasets: • In the reference dataset, the retrieved surface apparent reflectance ρre f corresponds to an ideal atmospheric correction algorithm from perfectly known atmospheric conditions.
• In the retrieval dataset, the retrieved surface apparent reflectance ρret corresponds to the actual product of an atmospheric correction algorithm.The best matching combinations (j best ) are taken from the estimated atmospheric conditions from the first step (i.e., atmospheric characterization).
The apparent reflectance will be later used to retrieve SIF emission through spectral decomposition techniques ( [47]).
In the third step, we evaluate the accuracy of the approximations in aerosol properties through the accuracy of the retrieved surface apparent reflectance.To do so, we compute the relative error between ρre f and ρret and its average in each O 2 absorption band (i.e., 686.6-699.5 nm for the O 2 -B and 759.3-767.5 nm for the O 2 -A).The relative error in surface apparent reflectance is compared against the threshold defined as the relative difference between ρre f and ρre f +0.2 (see Figure 5).Here, ρre f +0.2 results of an ideal atmospheric correction of a new dataset based on the key input atmospheric parameters in Table 2 and increasing F in Equation ( 6) by ∆F = 0.2 mW•m −2 •sr −1 •nm −1 .This ∆F is the SIF threshold error as defined by FLEX mission requirements [21,48].Two surface reflectance error assessment were performed to evaluate the accuracy of the atmospheric correction: • We calculated the mean relative error, i.e., the average for all the samples in the reference dataset, after the atmospheric correction and compared it with the relative error threshold shown in Figure 5.This mean relative error is calculated for the surface apparent reflectance retrieved using the complete retrieval dataset and its four subsets (see Table 4).Through this analysis, we evaluated the overall impact of the various aerosol properties on the atmospheric correction.• We filtered the relative error in apparent reflectance, averaged within each O 2 absorption, as a function of the key input variables of the reference dataset.Through this analysis, we determined in more detail the source of errors in the atmospheric correction of the reference dataset using the complete retrieval dataset.
Thirdly, we analyzed the retrieved values (average and standard deviation) of the key aerosol variables against the reference values derived from OPAC aerosols in [16].The reference values of Ångström exponent and HG asymmetry parameter were obtained from the best fitting of the extinction coefficient and the phase function (at 700 nm) respectively with the Ångström law and the HG phase function.The reference value of the SSA was calculated by averaging the SSA in the 650-800 nm spectral range.
Finally, we analyzed the impact of approximations in aerosol properties in the retrieved SIF after the atmospheric correction.For each retrieved apparent reflectance ρi ret (i from 1 to N re f ) and in each O 2 region, SIF i ret was retrieved based on the Spectral Fitting Method described in [47].We selected the wavelength range 686.7-688.3nm for the O 2 -B and 757.3-765.5 nm for the O 2 -A.In order to analyze the error propagation from atmospheric correction to the retrieved SIF, and to minimize the errors associated with the Spectral Fitting Method, we assumed a known real reflectance and applied a linear polynomial for the retrieved SIF.For the complete reference dataset, we then computed the root-mean-square error between the reference and retrieved SIF, filtered by AOT 550 , and calculated the error statistic.The SIF error threshold was set to 0.2 mW•m −2 •sr −1 •nm −1 according to FLEX mission requirements [21,48].

Results
Following the method described in Section 3.3, we now show the results corresponding to: (1) the conducted GSA applied to FLORIS TOA radiance (Section 4.1), (2) the effect of approximated aerosol properties on the inversion of surface apparent reflectance (Section 4.2), (3) the accuracy of retrieved key aerosol optical properties (Section 4.3), and (4) the propagation of atmospheric correction errors on the retrieved SIF (Section 4.4).

Relative Contribution of Aerosol Parameters in FLORIS TOA Radiance
The total sensitivity index (S Ti ) over the analysis dataset allows us studying the relative contribution of each aerosol variable, thus determining which of the key variables should be retrieved in an atmospheric correction algorithm for FLEX. Figure 6 shows this relative contribution evaluated by S Ti for each spectral channel in the O 2 -B (left) and O 2 -A (right).Each aerosol variable is represented with a different color: AOT 550 in dark blue, HG asymmetry parameter (g) in blue, Ångström exponent (α) in cyan, scale height (Z) in yellow, top height of the boundary layer (H max ) in orange and single scattering albedo (SSA) in dark red.In general terms, it can be noted that TOA radiance is mainly driven by atmospheric variables related to scattering processes (i.e., the HG asymmetry parameter (G), the SSA and the AOT 550 ) but some points must be outlined: • The non-isotropy of the aerosol phase function (evaluated through the HG asymmetry parameter) has indeed the largest contribution (50-60%) in the TOA radiance.Particularly, since aerosol scattering is higher at shorter wavelengths, the contribution of the aerosol phase function is higher in all spectral channels within the O 2 -B and at the bottom of the O 2 -A band (∼761 nm).• The AOT 550 is the second driving variable, particularly outside the O 2 -A absorption band and on the secondary absorptions (762-770 nm).• The remaining sensitivity is dominated by the SSA with an influence of 20% in the O 2 -B and 40% in the O 2 -A spectral regions outside the absorption bands.The influence of the SSA is reduced to 10-20% inside the absorption bands.
• The spectral dependency of the extinction coefficient (evaluated through the Ångström exponent) has lower impact than AOT 550 , SSA and g in both O 2 regions, particularly inside the O 2 -B absorption band.• The aerosol vertical distribution, evaluated through the Z and H max parameters, has a residual influence on the TOA radiance.

Apparent Reflectance Error Analysis Within Each O 2 Absorption Band
Next, we evaluate the impact of approximations and assumptions on aerosol optical properties in atmospherically-corrected surface apparent reflectance.After the atmospheric correction using the complete retrieval dataset and its four subsets (see Table 4), we evaluate the mean relative error, averaged for all samples in the reference dataset (see Figure 7).An aerosol characterization based solely on the variable AOT 550 (subset #1, red line) does not compensate the variability in TOA radiance caused by all aerosol optical properties.The errors in surface apparent reflectance occur both inside and outside the O 2 absorptions and particularly in the O 2 -B spectral region, where the scattering processes (and thus the phase function) have a larger influence.In agreement with the above GSA, the importance of ignoring the aerosol phase function can also be observed when using subset #4 (light blue line), which despite of considering 3 key aerosol parameters (i.e., AOT 550 , Ångström exponent and SSA), also results in high errors.
The influence of considering a spectrally-invariant SSA is also observed when comparing the errors obtained in the subset #2 (in dark blue, fixing the SSA) with those lower errors in the subset #3 (in orange, fixing the Ångström exponent).Again, the importance of scattering (through the SSA) is more prominent in shorter wavelengths, i.e., the difference between dark blue and orange lines is higher within the O 2 -B spectral region than within the O 2 -A.
Comparing the complete dataset (in green line) with subset #3 (fixing the Ångström exponent, in orange line), it seems that considering the Ångström exponent for the atmospheric characterization and retrieval hardly reduce the errors in the retrieved apparent reflectance.
The same trend appears when evaluating the number of samples in the reference dataset that are atmospherically-corrected within the error threshold (see Table 5).The impact of ignoring the asymmetry parameter is observed in the lower accuracy of the retrievals in subsets #1 and #4 when compared against the other subsets (i.e., those including the HG parameter).While including the Ångström exponent hardly improves the results within the O 2 -A spectral range, it seems to improve the results within the O 2 -B spectral range, thereby increasing the percentage of samples atmospherically-corrected within the error threshold from 63% (with subset #3) up to 85% (with the complete retrieval dataset).Next, in order to further determine the source of errors in the atmospheric correction, we show the relative errors propagated in apparent reflectance filtered by aerosol type (see Figure 8) and by AOT 550 value (see Figure 9).We have calculated the error statistics for each of the filtered relative error values through box plots.When filtered by aerosol type, it can be observed that the higher errors within the O 2 -B absorption region are given in both OPAC and MODTRAN urban aerosols, i.e., in highly absorbent aerosols with low SSA.However, the errors in this spectral region are on average below the threshold, which implies that the approximations and assumptions considered in the retrieval dataset are suitable for the atmospheric correction within the O 2 -B band.
As for the errors in the O 2 -A absorption region, again MODTRAN's urban aerosol type shows the largest average error and the second largest 75th percentile.However, and opposite to the errors in the O 2 -B absorption, OPAC's urban aerosol does not particularly lead to high error statistics.In addition, MODTRAN's tropospheric and OPAC's continental clean aerosols show high dispersion between low and high errors.
When filtered by AOT 550 , it can be observed that the errors in both O 2 bands increase with higher aerosol loads (see Figure 9).Errors in the characterization and parameterization of key aerosol optical properties start to be propagated to the retrieved apparent reflectance at higher aerosol concentrations.However, nearly 85% of the reference dataset is atmospherically-corrected within the error threshold for typical AOT 550 values of 0.05-0.21[49,50].Similar error statistics have been analyzed (box plots not presented) for the apparent reflectance errors filtered by the vertical aerosol distribution, however, in agreement with the GSA results in Section 4.1, vertical aerosol distribution does not play a significant role.

Retrieved Key Aerosol Optical Properties
Here we assess the accuracy of the implemented approximations on the retrieved key aerosol optical properties.We have first evaluated the errors in the retrieved AOT 550 (see Table 6).These results indicate that, in average and despite the assumptions considered in the retrieval dataset, the AOT 550 can be retrieved with a precision below 0.06 and with an error below 0.03 for all reference AOT 550 values.Secondly, as for the optical properties of OPAC aerosols implemented in the reference dataset, Figures 10 and 11 show their spectral or angular dependency.In Figure 10 (left) we can observe that the extinction coefficients (normalized by their values at 550 nm) (circles) are well fitted (mean R 2 of 0.9973) by the Ångström law.In Figure 10 (right), the SSA shows a nearly constant spectral dependency in the 650-800 nm spectral range with maximum variations of 3% (for urban and desert aerosols).The angular dependency of the various phase functions (at 700 nm) and their best fittings to a HG phase function is displayed in Figure 11 (left).Despite of the good parametric fitting (mean R 2 of 0.9854), the log-log scale shows that at forward scattering angles (0-10 deg), where the phase function is higher, the fitting is poorer, particularly for OPAC's desert aerosol (in orange).Also, the spectral dependency of the asymmetry parameters shows variations below 2.5% in the 650-800 nm range (see Figure 11 (right)), which indicates that the asymmetry parameter can be considered spectrally invariant.The retrieved aerosol optical properties are given in Table 7.Each column displays the average and the standard deviation of the retrieved values for each OPAC aerosol within the reference dataset.The following observations can be made: • Ångström exponent: the poor accuracy (0.6 average error) and the high dispersion (precision between 0.3 and 0.6) of the retrieved Ångström exponent for all aerosol types are noteworthy.This supports the results of the conducted GSA, i.e., that the Ångström exponent has a secondary order influence on the variability of the TOA radiance in the O 2 wavelength range and thus the difficulty to derive its value just using FLORIS instrument.The low precision of the retrieved Ångström exponent indicates that this parameter has been used to compensate for variations on other aerosol optical properties (e.g., phase function and SSA).
• HG asymmetry parameter: we observe that an effective and spectrally-invariant HG asymmetry parameter of 0.68 ± 0.04 reproduces, overall, the main scattering processes related to the phase function.This indicates that the effect of aerosol scattering in the at-sensor TOA radiance is more related to illumination and observation geometry than to the aerosol type.Consequently, the variability of Mie phase functions implemented in the reference dataset can be compensated with an effective value of the HG asymmetry parameter for the given observation/illumination geometry.Though the HG approximation with an effective spectrally-invariant asymmetry parameter might be suitable within the O 2 -B spectral region, it plays a more important role inside the O 2 -A absorption band, which causes higher errors in apparent reflectance.• Single scattering albedo: as with the retrieved AOT 550 , the retrieved values of SSA are in agreement (within the standard deviation) with the reference SSA values.This is true except for OPAC's urban aerosol, which overestimates the SSA with an error of 0.06 ± 0.05, in agreement with the higher apparent reflectance error values within the O 2 -B spectral region seen in Figure 8 (left).Finally, we have evaluated the error propagation of approximations and assumptions on aerosol optical properties into the retrieved SIF. Figure 12 evaluates, with a box plot diagram, the SIF error statistics, filtered by AOT 550 , after using the complete retrieval dataset for the atmospheric correction.It can be observed that SIF can be retrieved with an error below 0.2 mW•m −2 •sr −1 •nm −1 in nearly 70% of the reference dataset for typical AOT 550 values of 0.05-0.21.These results are in agreement with the errors derived in the surface apparent reflectance shown in Figure 9.

Discussion
The most important messages conveyed in this work, and the implications for the ESA's FLEX/Sentinel-3 tandem mission data processing and further studies for atmospheric simulation can be summarized as follows.

Interpreting Atmospheric Correction Results: Implications for FLEX/Sentinel-3 Tandem Mission
A multi-parametric description of aerosol properties was applied in order to study its accuracy in the atmospheric correction of ESA's FLEX/Sentinel-3 tandem mission.Overall, the results from the GSA reveal the importance of the scattering processes, particularly through the larger influence of aerosol phase function and SSA on the TOA radiance.These results are in agreement with our previous analysis in [46], where it was observed the high influence of the aerosol phase function, particularly on the diffuse terms of the TOA radiance in Equation ( 6) (i.e., E di f , T di f and S).Previous studies such as [51,52] also indicate an increase of uncertainty in atmospheric correction due to errors related to a poor characterization of the aerosol phase function and SSA.A possible atmospheric correction strategy, e.g., as used in the Sentinel-3 SYN atmospheric correction algorithm [53], is to include a discrete set of aerosol types with predefined optical properties.However, various studies observed important discrepancies in aerosol optical properties between OPAC database and ground measurements ( [54][55][56]), which indicate that discretization might not be accurate enough for an atmospheric correction.Instead, our atmospheric correction approach considers a multi-parametric approximation of aerosol properties with the assumption that effective aerosol variables can be mutually compensated to match the instrument TOA radiance data.We demonstrated that this approach achieves the required accuracy for FLEX atmospheric correction (SIF retrieval) in 85% (70%) of the considered cases for typical aerosol load conditions [49,50].The analysis of the atmospheric correction results derive the following observations: • Retrieving the aerosol phase function is important for an atmospheric correction scheme to enable compensating the variability of aerosol scattering in at-sensor TOA radiance.However, our results show that the spectrally-invariant HG approximation is adequate for the atmospheric correction of FLEX.Indeed, an effective HG asymmetry parameter describes the main scattering effects in TOA radiance associated to the phase function for aerosols of different type, which are mainly driven by the illumination/observation geometry.• Our results indicate that only high SSA errors in highly absorbent aerosols (low SSA) might lead to an increase of the apparent reflectance error in the O 2 -B spectral region but not in the O 2 -A spectral region.In fact, as observed in the GSA results, the SSA has less influence inside of the O 2 -A absorption band and thus errors in the estimation of SSA will also have less impact in the atmospheric correction and on the SIF retrieval.• The AOT 550 is retrieved with a low error despite the implemented multi-parametric atmospheric correction.This is important in view of the possibility to evaluate the quality of FLEX atmospheric correction through the comparison of retrieved AOT 550 against reference values (e.g., from the Aerosol Robotic Network (AERONET) [1,12]).• When filtered by aerosol type, the errors on the retrieved apparent reflectance indicate that the assumption of spectrally-invariant asymmetry parameter and SSA might not be fully sufficient for all aerosol types.This is particularly relevant at aerosol concentrations with an AOT 550 > 0.2, where errors in the characterization and the parameterization of key aerosol optical properties are propagated to the retrieved apparent reflectance and SIF within the O 2 -A spectral region.• The error of atmospherically-corrected apparent reflectance is less affected by the variability of aerosol vertical profiles than the variability of aerosol optical properties.This was already observed in previous research [57] where, for nadir observations within the O 2 -A absorption region, aerosol vertical distribution was only retrieved over targets with low surface reflectance and at aerosol load conditions of AOT 550 > 0.3.We can therefore conclude that assuming a predefined aerosol vertical distribution in FLEX atmospheric correction will not lead to a large error contribution to the retrieved apparent reflectance and subsequent SIF retrieval.• The low precision of the retrieved Ångström exponent indicates that, on the one hand, this parameter cannot be retrieved from the short spectral range of FLORIS data.It justifies the use of the wider spectral range covered by Sentinel-3 instruments in this part.On the other hand, we observed that fixing the Ångström exponent (e.g., by inversion from Sentinel-3 data) will not have a large impact on the atmospheric correction accuracy within the O 2 absorption region.However, an error in the Ångström exponent may lead to errors in surface reflectance in a wider spectral range that will be propagated to errors in retrieved SIF, especially when using Spectral Fitting Methods [58].
Based on these observations, we are developing a multi-parametric atmospheric characterization step for FLEX atmospheric correction algorithm that exploits the synergy of FLORIS, Ocean and Land Colour Instrument (OLCI) and Sea and Land Surface Temperature Radiometer (SLSTR) data.

New Opportunities for Atmospheric Simulation
To facilitate the simulation of large MODTRAN-based multi-dimensional look-up tables, we developed a software package called Atmospheric Look-up Table Generator (ALG).Although the generation of these multi-dimensional atmospheric look-up tables has been partially described in the past [35,59], tools are missing that allow their generation in a simple manner.For example, existing tools such as ONTAR's PcModWin [60] or the official MODTRAN6 website [61] are limited to the execution of MODTRAN for a single combination of key input parameters.The generation of a multi-dimensional dataset would therefore require the impractical task of manually editing and running hundreds or thousands MODTRAN input files and process their generated output files into the final multi-dimensional dataset.Other commercial tools such as ReSe's MODO [62] offer capabilities for sensitivity analysis through parameter series but the generation of multi-dimensional datasets is not easily achieved through its interface.The introduction of ALG allows users to: (1) define easily a wide variety of multiple combinations of key input parameters for the dataset, (2) run efficiently parallel execution of multiple instances of an atmospheric RTM as a third-party software and (3) process automatically the atmospheric RTM output data files and generating the final look-up table.The broader scientific community can subsequently use ALG (downloaded at http://ipl.uv.es/artmo/index.php/tools/80-tools/24-alg)for a wide variety of applications such as: • Atmospheric correction: ALG facilitates users with the definition of atmospheric datasets that can be used for a physically-based atmospheric correction of satellite passive optical instruments [7][8][9][10].• Cross-comparison of atmospheric RTMs: The modular design of the ALG tool allows expanding with additional atmospheric RTMs such as Second Simulation of a Satellite Signal in the Solar Spectrum-Vector (6SV) [63,64], which facilitates the cross-comparison of various atmospheric RTMs [65][66][67].• Forward simulation: The generation of synthetic satellite data has been an active research field since the late 80 s [68].As shown in recent satellite mission simulators ( [69][70][71]), the generation of synthetic TOA radiance scenes play an important role to assess a mission performance.Through its datasets, ALG can extend the current simulation capabilities of satellite mission simulators to generate synthetic scenes in a large variety of atmospheric conditions.In combination with satellite mission simulators, ALG would allow scientists and engineers validating and optimizing satellite data processing schemes.• Sensitivity analysis: As demonstrated in our previous study [46] and exploited in this paper, ALG and global sensitivity analysis can be used to identify the most influential atmospheric variables affecting satellite radiance data, which would serve as a tool to design new passive optical instruments.

Conclusions
Characterization of aerosol optical properties and vertical distribution are the main sources of error in the atmospheric correction of any satellite optical mission.Physically-based atmospheric correction algorithms rely on the implementation of assumptions or approximations with respect these aerosol characteristics.In this paper, we studied the accuracy of using approximations and assumptions related to aerosol properties to describe their net radiometric effects through their impact in synthetic atmospherically-corrected FLORIS data and subsequent retrieved Sun-induced Fluorescence (SIF).To do that, we used a newly developed Atmospheric Look-up Table Generator (ALG) tool to simulate three synthetic datasets based on the execution of MODTRAN RTM coupled with OPAC aerosol database.Unlike the commonly used strategy of discretizing aerosol optical properties by aerosol types, we demonstrated that a multi-parametric aerosol characterization based on effective variables is sufficient for the atmospheric correction of ESA's FLEX mission data.Especially, characterizing scattering effects can play an important role in an accurate atmospheric correction algorithm.However, the approximations of (1) a spectrally-invariant Henyey-Greenstein asymmetry parameter and (2) a spectrally-invariant single scattering albedo already account for the main scattering processes in the 650-780 nm range.We also demonstrated that errors in the effective Ångström exponent retrieved from the wider spectral range covered by Sentinel-3 OLCI and SLSTR instruments, flying in tandem with FLEX, has negligible impact within the O 2 regions.In addition, we found that the aerosol vertical distribution has a marginal influence on the TOA radiance for this type of satellite observations.This suggests that a predefined standard aerosol vertical distribution is appropriate to realize an accurate atmospheric correction of FLEX data.Altogether, the implemented approximations reach the required accuracy in the atmospheric correction of ESA's FLEX mission (and SIF retrieval) in nearly 85% (70%) of the considered cases with typical aerosol load conditions (i.e., AOT 550 < 0.2).

Figure 1 .
Figure 1.Modular architecture of the developed ALG software tool.

•
): Define input key atmospheric variables including among others: aerosol types (default and OPAC user-defined), aerosol vertical distribution in the boundary layer (model default or exponential function), phase function (Mie or HG), spectral dependency of aerosol extinction (model default or Ångström exponent), SSA (model default or spectrally constant).

Figure 2 .
Figure 2. Screen-capture of the ALG (v1.0) tool at the LUT key input parameters configuration step.

Figure 4 .
Figure 4. Approach for evaluating the impact of approximations in aerosol properties in FLORIS atmospherically-corrected surface reflectance.

Figure 5 .
Figure 5. Relative error threshold in apparent reflectance in the O 2 -B (left) and O 2 -A (right) regions based on an increase of fluorescence of 0.2 mW•m −2 •sr −1 •nm −1 .

Figure 6 .
Figure 6.Global sensitivity analysis for the O 2 -B (left) and O 2 -A (right) spectral regions.

Figure 7 .
Figure 7. Mean relative error in surface apparent reflectance derived from the retrieval dataset and its four subsets (see Table 4) within the O 2 -B (left) and O 2 -A (right) spectral regions.The figure legend includes the ranging key input aerosol variables within each subset.

Figure 8 .
Figure 8. Relative error in apparent reflectance, filtered by aerosol type, derived from the complete retrieval dataset within the O 2 -B (left) and O 2 -A (right) spectral regions.The following error statistic are provided: median (|), average ( * ), 25th and 75th percentiles (blue boxes) and extreme min/max values (horizontal dashed lines).The vertical dashed line indicates the error threshold in apparent reflectance within each O 2 absorption.Notice that aerosol types with same name are indentified as MODTRAN (M) or OPAC (O).

Figure 9 .
Figure 9. Relative error in apparent reflectance, filtered by AOT 550 , derived from the complete retrieval dataset within the O 2 -B (left) and O 2 -A (right) spectral regions.The following error statistics are provided: median (|), average ( * ), 25th and 75th percentiles (blue boxes) and extreme min/max values (horizontal dashed lines).The vertical dashed line indicates the error threshold in apparent reflectance within each O 2 absorption.

Figure 11 .
Figure 11.(Left) phase function at 700 nm (solid lines) and best fitting to HG approximation (dashed lines).(Right) wavelength dependency of asymmetry parameter.

Figure 12 .
Figure 12.Mean error statistic in retrieved SIF, filtered by AOT 550 , after the atmospheric correction of the reference dataset derived from the complete retrieval dataset within the O 2 -B (left) and O 2 -A (right) spectral regions.The following error statistic are provided: median (|), average ( * ), 25th and 75th percentiles (blue boxes) and extreme min/max values (horizontal dashed lines).The vertical dashed line indicates the error threshold in SIF.

Table 1 .
Key input aerosol variables for the analysis dataset.This dataset consists of 2000 samples with a LHS distribution.

Table 2 .
Key input aerosol parameters for the reference dataset.

Table 3 .
Key input aerosol parameters for the retrieval dataset.This dataset consists of 3500 samples with a LHS distribution.

Table 4 .
Fixed and ranging variables for each subset of the retrieval dataset.

Table 5 .
Percentage of samples in the reference dataset with a retrieved surface apparent reflectance below the error threshold within the O 2 -B and O 2 -A bands.

Table 6 .
Evaluation of retrieved AOT 550 against each reference dataset value.

Table 7 .
Evaluation of mean and standard deviation retrieved Ångström exponent, HG asymmetry parameter and SSA on OPAC aerosols.Reference values are given in bold font.Color code in the square symbols corresponds to those in Figures 10 and 11.