Abstract
Space-based solar-induced chlorophyll fluorescence (SIF) has been widely demonstrated as a great proxy for monitoring terrestrial photosynthesis and has been successfully retrieved from satellite-based hyperspectral observations using a data-driven algorithm. As a semi-empirical algorithm, the data-driven algorithm is strongly affected by the empirical parameters in the model. Here, the influence of the data-driven algorithm’s empirical parameters, including the polynomial order (np), the number of feature vectors (nSV), the fluorescence emission spectrum function, and the fitting window used in the retrieval model, were quantitatively investigated based on the simulations of the SIF Imaging Spectrometer (SIFIS) onboard the First Terrestrial Ecosystem Carbon Inventory Satellite (TECIS-1). The results showed that the fitting window, np, and nSV were the three main factors that influenced the accuracy of retrieval. The retrieval accuracy was relatively higher for a wider fitting window; the root mean square error (RMSE) was lower than 0.7 mW m−2 sr−1 nm−1 with fitting windows wider than 735–758 nm and 682–691 nm for the far-red band and the red band, respectively. The RMSE decreased first and then increased with increases in np range from 1 to 5 and increased in nSV range from 2 to 20. According to the specifications of SIFIS onboard TECIS-1, a fitting window of 735–758 nm, a second-order polynomial, and four feature vectors are the optimal parameters for far-red SIF retrieval, resulting in an RMSE of 0.63 mW m−2 sr−1 nm−1. As for red SIF retrieval, using second-order polynomial and seven feature vectors in the fitting window of 682–697 nm was the optimal choice and resulted in an RMSE of 0.53 mW m−2 sr−1 nm−1. The optimized parameters of the data-driven algorithm can guide the retrieval of satellite-based SIF and are valuable for generating an accurate SIF product of the TECIS-1 satellite after its launch.
1. Introduction
Traditional vegetation monitoring processes commonly make use of indicators like vegetation indices [1], leaf area index (LAI) [2], and aboveground biomass [3], etc. Recently, solar-induced chlorophyll fluorescence (SIF) has been regarded as a new way of estimating gross primary productivity (GPP) due to its close correlation with photosynthesis [4]. SIF retrieved from satellites has wide coverage and a large number of long-term measurements, which are expanding its application field [5,6,7,8,9,10].
As a common method used for SIF retrieval, the data-driven algorithm has been widely applied for SIF retrieval from satellite-based hyperspectral data. Guanter et al. [11] first presented a singular vector decomposition (SVD) data-driven algorithm to retrieve SIF from the Greenhouse gases Observing Satellite (GOSAT) space measurements with high spectral resolution (~0.04 nm). Following this, Joiner et al. [12] designed a principal component analysis (PCA) data-driven algorithm to map SIF from the Global Ozone Monitoring Experiment–2 (GOME–2) satellite data with moderate spectral resolution (~0.5 nm). Furthermore, Kohler et al. [13] presented a simplified forward linear model by estimating the atmospheric upward transmittance in advance and used a backward elimination algorithm to automatically determine the parameters of the data-driven algorithm, which was successfully applied for accurately retrieving satellite-based SIF from GOME-2 and the Scanning Imaging Absorption SpectroMeter for Atmospheric Chartography (SCIAMACHY) data. Moreover, a series of global satellite-based SIF products were generated by the data-driven algorithm using spectral measurements from the TROPOspheric Monitoring Instrument (TROPOMI) [14], Orbiting Carbon Observatory 2 (OCO-2) [15], OCO-3 [16], and Chinese Carbon Dioxide Observation Satellite Mission (TanSat) [17]. However, as was mentioned by Kohler et al. [13], there are inconsistencies between different products, even between products generated from the same satellite. Therefore, the accuracy of the satellite-based SIF datasets is dependent on both the specifications of sensors (spectral resolution, spectral range, and the signal-to-noise ratio (SNR), etc.) and the empirical parameters in the data-driven algorithms (fitting window, number of feature vectors, and polynomial order, etc.). Thus, the setting of the algorithm parameters for a specific satellite is very important. To date, simulated satellite data have been extensively used to verify the accuracy and precision of the SIF retrieval algorithm [12,13,18,19,20,21], but it is still not well understood how and how much the empirical parameters in the data-driven algorithm affect satellite-based SIF retrievals.
In this study, we aimed to investigate the influences of the empirical parameters of the data-driven algorithm on satellite-based SIF retrieval and develop an optimized data-driven algorithm for the upcoming First Terrestrial Ecosystem Carbon Inventory Satellite (TECIS-1) with a SIF Imaging Spectrometer (SIFIS) onboard, which will be launched at the beginning of 2022 (details listed in Du et al., 2020 [18]). For this purpose, we generated simulation datasets according to the spectral characteristics of SIFIS (Table 1).
Table 1.
The spectral characteristics of the solar-induced chlorophyll fluorescence Imaging Spectrometer (SIFIS) onboard the First Terrestrial Ecosystem Carbon Inventory Satellite (TECIS-1).
So far, little research has focused on the optimization of empirical parameters for SIF retrieval on TECIS-1, especially for red band SIF retrieval, and a detailed evaluation of empirical parameters is necessary to provide an accurate algorithm and generate satellite-based SIF products after the launch of TECIS-1. For this purpose, the simulation datasets were generated (Section 2.1) to investigate the influence of the empirical parameters of the data-driven algorithm on satellite-based SIF retrieval, including the influence of the fitting window (Section 3.1.1), the number of feature vectors, and polynomial order (Section 3.1.2), as well as the fluorescence spectrum function. The optimal parameter setting for the data-driven SIF retrieval algorithm was determined according to the specifications of the SIFIS onboard TECIS-1, and end-to-end retrievals were conducted to show the precision of the optimal combination of parameters (Section 3.2). The mechanism of the influence of empirical parameters on the retrieval is explained in the Discussion section.
2. Materials and Methods
2.1. Simulation Datasets
Assuming that the surface is Lambertian, the top-of-atmosphere (TOA) radiance over a vegetation target can be expressed as the following radiative transfer equation [22,23]:
where L0 is the atmospheric path radiance, LTOC is the radiance reaching the surface, ρs is the surface reflectance, S is the atmospheric spherical albedo, T↑ is the upward atmospheric transmittance, and SIF is the emitted fluorescence signal at the top-of-canopy (TOC), which should be omitted when simulating radiance over the non-vegetated surface.
The Soil Canopy Observation Photosynthesis and Energy (SCOPE) [24] model was employed to simulate the emitted fluorescence signals and reflectance of vegetation under 126 different canopy structures and leaf biochemic characteristics. Ten different reflectance spectra over snow and soil surface were derived from the spectral library of the Environment for Visualizing Images (ENVI) [25]. Three constant reflectance spectra were set in the Moderate-resolution atmospheric TRANsmission (MODTRAN) [26,27] model to calculate the atmospheric radiation transfer parameters (L0, LTOC, and S) under 1280 different atmospheric and observation conditions. The input parameters of the two models are shown in Table 2. Figure 1 shows the simulated canopy SIF spectra of different canopy and leaf conditions in the range of 640–800 nm. The fluorescence spectra have a bimodal distribution, with a more obvious peak at around 740 nm.
Table 2.
Look–up table (LUT) for the Moderate-resolution atmospheric TRANsmission (MODTRAN) and the Soil Canopy Observation Photosynthesis and Energy (SCOPE) models.
Figure 1.
Fluorescence signal at the top-of-canopy (TOC) simulated by SCOPE.
The spectra of the atmospheric radiation transfer parameters (L0, LTOC, S, and T↑) simulated by MODTRAN have a spectral resolution of 0.005 nm. The canopy SIF and the reflectance spectra over vegetated and non-vegetated surfaces were then resampled to the same interval. Then, the resampled SIF and reflectance spectra were used to determine the TOA radiance spectra according to Equation (1). Finally, 161,280 simulated radiance spectra over the vegetated surface were generated as the test dataset, and 12,800 simulated radiance spectra over the non-vegetated surface were generated as the training dataset.
Simulated radiance spectra with a high spectral resolution of 0.005 nm were then convolved and resampled according to the specifications of SIFIS (a spectral resolution of 0.3 nm and a sampling interval of 0.1 nm). Figure 2 shows a set of TECIS-like simulated spectra, including the canopy radiance spectra (Rad_Veg), solar irradiance spectra (Solar_Irr), upward atmospheric transmittance spectra (T_u), and SIF signal at the top-of-canopy and atmosphere. As can be seen in Figure 2, the absorption lines are obvious under a spectral resolution of 0.3 nm, and the radiance spectra are affected by the absorption of solar atmosphere and Earth’s atmosphere.
Figure 2.
Simulated signals with 0.3 nm spectral resolution and 0.1 nm sampling interval derived from the SCOPE and MODTRAN 5 models, including the top–of–atmosphere (TOA) radiance over the vegetated surface (Rad_Veg), solar irradiance (Solar_Irr), upward transmittance of the atmosphere (T_u), SIF signal at the top of the canopy (TOC_SIF) and atmosphere (TOA_SIF).
Since SIF is a weak signal, its retrieval is very sensitive to noise [13,28,29]. For satellite-based SIF retrieval, the SNR of the instrument is also taken into consideration. The noise added to each radiance signal can be expressed as [30]:
where λ is the wavelength, noiser is a base noise that obeys a standard normal distribution across the spectra, and SNR is a function of the radiance and wavelength, denoted as [14]:
where SNRref is the reference SNR at the reference radiance level .
Based on the specifications of SIFIS, we set the SNRref (denoted as SNR below) of 300, 350, 400, 450, and 500 to analyze the influence of noise on empirical parameter optimization. In addition, an SNR of 322 was set to select optimal parameters for SIFIS data.
2.2. The Data-Driven SIF Retrieval Algorithm
The first two terms in Equation (1) are non-fluorescent contributions, while the last term is the fluorescent contribution. According to the basic idea of the data-driven algorithm, the non-fluorescence contribution of LTOA is a combination of high-frequency information (including Fraunhofer lines and atmospheric absorption lines) and low-frequency information. The high-frequency part (caused by the absorption of the solar atmosphere and Earth’s atmosphere) has a specific pattern, which can be extracted from the training dataset composed of non-vegetated targets and reconstructed with a small number of feature spectra. The low-frequency part (caused by atmospheric radiation and surface reflection) can be expressed by a polynomial of wavelength (λ) [11,19]. Moreover, SIF spectra usually have a specific shape and can be expressed by a mathematical function. Thus, Equation (1) can be rewritten as:
where is the feature vector decomposed by the singular vector decomposition (SVD) method, and are the coefficients of the polynomial and the singular vectors, respectively, is the order of polynomial, is the number of selected singular vectors, Fs0 is the SIF signal at the specified wavelength, and T↑ is the upward atmospheric transmittance. is expressed as a Gaussian function with an average of and a standard deviation of :
According to Kohler et al. [13], the upward atmospheric transmittance T↑ can be estimated before retrieval, so when np, nSV, μh and σh were determined, only were left as unknown parameters; then, the retrieval could be turned into a linear least-squares problem.
The parameters mentioned above are mostly set by experience. For example, μh is generally 740 nm for far-red fluorescence, and 692 nm for red fluorescence, np is generally 2–4. However, the influence of different parameters on the results remains to be analyzed. In addition to parameters in the forward model, the fitting window, i.e., the spectral range in Equation (5), is also a key parameter, which determines the number of absorption lines used. Meanwhile, because the random noise of the instrument also fills the absorption line, it interferes with the retrieval so different SNR levels were also considered in this study.
For red SIF retrieval, the spectral range covered by O2-B absorption lines (682–697 nm) covered nearly all the fitting windows selected in previous studies [18,20,31]. For far-red SIF retrieval, the water vapor absorption band (range from 712 nm to 745 nm), the O2-A band (range from 755 nm to 775 nm), and the atmospheric window between them were commonly used. Thus, we completed a series of experiments to preliminarily select the range of the far-red fitting window (Table 3). Table 3 shows the retrieval results with the minimum RMSE in the fitting windows after trial and error. Fitting windows within the O2-A band were used in Exp. 1 and 2, Exp. 3–6 used the fitting window containing water vapor absorption lines, while the fitting window of Exp. 5 and 6 also contained the atmospheric window range from 747 nm to 758 nm. The results showed that retrievals using a fitting window containing oxygen absorption lines (755–778 nm and 747–780 nm) or merely water vapor absorption lines (724–747 nm and 715–748 nm) are not as good as using a fitting window that combines water vapor absorption lines and the atmospheric window (735–758 nm and 720–758 nm), which is also true for the fitting windows of the same width (735–758 nm, 724–747 nm and 755–778 nm). Exp. 6–14 showed that with the adjustment of np and nSV, reasonable retrievals (with RMSE less than 1 mW m−2 sr−1 nm−1) could be achieved when np was between 1–5 and nSV was less than 20. Through the optimization, the RMSE of SIF retrieval can be reduced from 1.18 mW m−2 nm−1 sr−1 to 0.63 mW m−2 nm−1 sr−1 at the far-red band and reduced from 0.62 mW m−2 nm−1 sr−1 to 0.53 mW m−2 nm−1 sr−1 at the red band.
Table 3.
Comparisons of retrieved SIF (SIFretrieved) and true SIF (SIFtrue) simulated using SCOPE model in different experiments. A certain number of feature vectors (nSV) and polynomial order (np) were used within the fitting window from the starting wavelength w1 to the ending wavelength w2. The statistical results included the root mean square error (RMSE), the mean difference (bias) between SIFretrieved and the SIFtrue, the correlation coefficient (r), as well as the slope and intercept of the linear fitting (SIFretrieved = slope * SIFtrue + intercept). The radiance units (denoted by *) are mW m−2 nm−1 sr−1.
Based on the analysis above, we designed the list of parameter values shown in Table 4. The fitting windows for far-red SIF retrieval were selected to cover the weak water vapor absorption lines and Fraunhofer lines between 712 and 758 nm; 758 nm was taken as the final wavelength, only the starting wavelength (λ1) of the fitting window was changed, and the width of the window was then determined. The fitting windows for red SIF retrieval were selected between 682 and 697 nm to cover the oxygen absorption lines. Then, 682 nm was taken as the starting wavelength, and the fitting window was determined by the ending wavelength (λ2).
Table 4.
The parameters involved in SIF retrieval using the data-driven algorithm.
We took the intensity of SIF at a specific wavelength (depending on μh) simulated by SCOPE as the true values, which were compared to the SIF retrievals. The root mean square error (RMSE) was used to evaluate the influence of the parameters, and the optimized parameters were determined for these settings with the smallest RMSE.
3. Results
3.1. Influence of Empirical Parameters on SIF Retrievals
The range of retrieval RMSE when different values were set for a parameter stands for the necessity of optimizing this parameter. In this study, we selected the median value of each parameter in Table 4 as a default value. Then, for each parameter, the RMSE of different values were recorded when other parameters were fixed as their default value in order to clarify the influence of different parameters on retrieval accuracy (Figure 3).
Figure 3.
The root mean square error (RMSE) of SIF retrieval when different values for each parameter were set. The influence of each parameter was investigated when other parameters were set as their default values, i.e., μh1 = 740 nm, σh1 = 30 nm, μh2 = 688 nm, σh2 = 10 nm, np = 3, nSV = 11, λ1 = 730 nm, and λ2 = 691 nm (SNR = 322). The horizontal bar in blue represents the median, diamonds in red represent the average, and the box bar covers 50% of the RMSE values. RMSE values that do not fall within the upper and lower limits of the boxplot are excluded as outliers.
The fitting window of SIF retrieval (represented by λ1 and λ2) notably affected the accuracy of SIF retrieval, both in the far-red and red fitting windows. Improper setting of the fitting window greatly increased the RMSE to 1.48 mW m−2 sr−1 nm−1 and 8.79 mW m−2 sr−1 nm−1 for far-red and red SIF retrieval, respectively. However, an RMSE of 0.59 mW m−2 sr−1 nm−1 and 0.87 mW m−2 sr−1 nm−1 could be achieved when the optimal fitting window was selected. The changing range of RMSE can be 104.51% and 295.52% of the average RMSE in different far-red and red fitting windows, respectively. The polynomial order (np) and the feature vectors (nSV) also have a great influence on the retrieval. For far-red SIF retrieval, the RMSE changed within 0.57–1.09 mW m−2 sr−1 nm−1 with different np (66.67% of the average RMSE), and when a different nSV was set, the RMSE varied between 0.57 and 0.96 mW m−2 sr−1 nm−1 (47.56% of the average RMSE). For red SIF retrieval, the influence of np and nSV was greater. An RMSE within 0.76–3.27 mW m−2 sr−1 nm−1 (138.67% of the average RMSE) and 1.02–3.36 mW m−2 sr−1 nm−1 (121.62% of the average RMSE) can be obtained when selecting different np and nSV, respectively.
On the contrary, the impact of μh and σh was negligible, and the changing ranges of RMSE using different μh and σh were no more than 5% of the average RMSE value for both far-red and red SIF retrieval.
Thus, the fluorescence function was fixed according to the study of Du et al. [18], i.e. μh1 is 740 nm, σh1 is 21 nm, μh2 is 692 nm and σh2 is 9.5 nm. The impacts of λ1, λ2, np, and nSV are evaluated in a more detailed below.
3.1.1. Influences of Fitting Window and SNR on SIF Retrievals
SIF retrieval is based on the filling effect in the absorption lines and the best fitting window may be dependent on the SNR. Thus, we calculated the RMSE of SIF retrieval using different fitting windows and simulation datasets with different SNRs (Figure 4).
Figure 4.
The root mean square error (RMSE) of (a) far-red and (b) red SIF retrieval using different fitting windows (λ1—758 nm at far-red band and 682—λ2 nm at red band) and simulation datasets with different SNRs. Other parameters listed in Table 4 are arbitrarily changed, and the minimal value of the RMSE is taken.
The retrieval accuracy was low in narrow fitting windows. For the far-red band, the RMSE was greater than 0.70 mW m−2 sr−1 nm−1 when λ1 was greater than 740 nm, and the accuracy did not appear to be significantly improved with the increase in the SNR in narrow fitting windows. In wider fitting windows (λ1 < 740 nm, λ2 > 691 nm) that contain enough absorption lines, a relatively low RMSE of less than 0.70 mW m−2 sr−1 nm−1 can be achieved, even when the SNR is 322, by appropriately adjusting other parameters in Table 4.
Figure 4 shows the best retrievals of SIF with different fitting windows, but it has not been tested whether the best retrievals in these fitting windows are robust with the changing of nSV and np. To select an optimal fitting window that can achieve an accurate and robust retrieval using SIFIS data, the simulation dataset with an SNR of 322 was used to investigate the distribution of RMSE in different fitting windows (Figure 5).
Figure 5.
The RMSE of SIF retrieval in different (a) far-red and (b) red fitting windows (λ1—758 nm at far-red band and 682—λ2 nm at red band) when the other parameters in Table 4 arbitrarily change (SNR = 322). The horizontal bar in blue represents the median, diamonds in red represent the average, the box bar covers 50% of the RMSE values, RMSE values that do not fall within the upper and lower limits of the boxplot are excluded as outliers.
For far-red SIF retrieval, the RMSE varied within wide limits when λ1 was greater than 735 nm, which increased the difficulty of the optimization of empirical parameters in these fitting windows. When λ1 was 735 nm, the RMSE was the smallest and most robust with a value of 0.93 ± 0.44 mW m−2 sr−1 nm−1. For red SIF retrieval, the varying range of RMSE continued to decrease as the fitting window widened. A fitting window of 682–697 nm achieved an optimal RMSE (0.95 ± 0.38 mW m−2 sr−1 nm−1).
3.1.2. Influence of Polynomial Order and the Number of Feature Vectors on SIF Retrievals
The influence of the polynomial order is clarified in the optimal fitting window selected in Section 3.1.1 (Figure 6). As can be seen, the RMSE decreased first and then increased with the increase in the polynomial order. The second-order polynomial was a better choice in both far-red and red fitting windows, with an RMSE of 0.82 ± 0.36 mW m−2 sr−1 nm−1 and 0.71 ± 0.16 mW m−2 sr−1 nm−1 for far-red and red SIF retrieval, respectively.
Figure 6.
The RMSE of SIF retrieval in the optimal (a) far-red (735–758 nm) and (b) red fitting windows (682–697 nm) with different polynomial orders when the other parameters in Table 4 arbitrarily change. The horizontal bar in blue represents the median, diamonds in red represent the average. The case where np is 5 in the far-red fitting window is excluded for the abnormal RMSE value with an average of 17.14 mW m−2 sr−1 nm−1.
The RMSEs of different numbers of feature vectors were evaluated (Figure 7). The accuracy of SIF retrieval decreased first and then increased with the increase in nSV in both the far-red and red fitting windows as too many feature vectors over-fit the random noise of the sensor. For far-red SIF retrieval, nSV was set as 4 with a minimal RMSE of 0.63 mW m−2 sr−1 nm−1, while for red SIF retrieval, nSV was set as 7 with a minimal RMSE of 0.53 mW m−2 sr−1 nm−1.
Figure 7.
The RMSE of SIF retrieval in (a) far-red and (b) red fitting windows using different numbers of feature vectors. The order of the polynomial is 2 and the fitting windows are 735–758 nm and 682–697 nm for the far-red and red band, respectively.
Except for the analysis above, we also investigated whether different fitting windows and SNR affected the optimal setting of np and nSV (Figure 8).
Figure 8.
The optimal polynomial order (a,c) and number of feature vectors (b,d) of SIF retrieval in different far-red (a,b) and red (c,d) fitting windows (λ1—758 nm at far-red band and 682—λ2 nm at red band). The influence of the SNR is also shown by the number of scattered points in the same fitting window. The optimal parameters were selected with the smallest RMSE.
Our results showed that in far-red fitting windows, there was an obvious positive linear correlation between window width and the optimal np and nSV, with an R2 of 0.71 and 0.46 respectively, which means that a larger number of feature vectors and polynomial orders are preferable in wider fitting windows. In the red band, the SIF signal was weaker and more affectable by the random noise. Thus, for red SIF retrieval, although the optimal np and nSV still had a positive linear correlation with the width of fitting windows, the correlation between λ2 and the optimal np or nSV was lower than that of far-red fitting windows with the R2 of 0.24 and 0.38 for the relationship between λ2 and the optimal np and the optimal nSV, respectively.
In addition to the influence of the fitting windows, the scatter points in a certain fitting window also represent the results corresponding to data with different SNR. In some fitting windows, such as 720–758 nm, there was more than one optimal value of nSV and np for data with different SNR, which indicates that the SNR also affects the optimization of the empirical parameters. However, in our selected fitting windows (682–697 nm for red SIF retrieval and 735–758 nm for far-red SIF retrieval), there was only one combination of optimal np and nSV which was not affected by the SNR.
3.2. End-to-end SIF Retrievals of the Optimal Empirical Parameters
End-to-end SIF retrieval for TECIS-like simulations was carried out to test the retrieval ability of the selected optimal combinations of empirical parameters. We calculated the mean and standard deviation of SIF retrieved under 1280 different atmospheric conditions (SIFretrieved) and took 126 SIF values simulated by the SCOPE model as true values (SIFtrue). Here, we show end-to-end SIF retrievals using the optimal empirical parameters for SIFIS onboard TECIS-1 (Figure 9).
Figure 9.
The end-to-end SIF retrieval for the TECIS-1 satellite using (a) far-red and (b) red fitting windows with optimized parameter setting (SNR = 322). For far-red SIF, the fitting window is 735–758 nm, the order of the polynomial is 2, and the number of feature vectors is 4. For red SIF, the fitting window is 682–697 nm, the order of the polynomial is 2, and the number of feature vectors is 7. The standard deviations of retrieved SIF are depicted by the error bars.
The scatter points distribute very close to the 1:1 line for both simulations in the far-red and red band. For far-red SIF retrieval, the optimized parameters setting in the data-driven algorithm are given with a 735–758 nm fitting window, second-order polynomial, and four feature vectors. When the SNR is 322, the standard error of SIF retrieval is 0.62 mW m−2 sr−1 nm−1 and the RMSE value is 0.63 mW m−2 sr−1 nm−1. For red SIF retrieval, the optimized parameters are second-order polynomial and seven feature vectors using a fitting window of 682–697 nm. The standard error of SIF retrieval was 0.44 mW m−2 sr−1 nm−1, and the RMSE value was 0.53 mW m−2 sr−1 nm−1. Furthermore, the error lines of different SIF intensities have almost the same amount under a certain noise level (e.g., SNR = 322), which indicates that the larger SIF levels have a smaller relative error and the absolute error is mainly affected by noise while independent of SIF intensity.
We also evaluated the fitting accuracy of the reconstructed radiance spectra between 735–758 nm and 682–697 nm with and without the filling of fluorescence under an SNR of 322, as shown in Figure 10. Figure 10 also shows excellent fitting accuracy of the forward model to reconstruct the TOA radiance. The residual value was less than 0.11 mW m−2 sr−1 nm−1 for far-red SIF retrieval and less than 0.04 mW m−2 sr−1 nm−1 for red SIF retrieval. The forward model fitted with SIF fit the radiance better with a lower residual error.

Figure 10.
Fitting (a,c) radiance spectra and (b,d) residual error using (a,b) far-red fitting window (735–758 nm) and (c,d) red fitting window (682–697 nm) when SIF is fitted and not fitted.
4. Discussion
Our results showed the variation of the retrieval accuracies using several empirical parameters in the model of the data-driven algorithm. Firstly, as shown in Figure 4 and Figure 5, more accurate retrievals can be achieved using wider fitting windows. This is largely because the broader spectral windows may contain more information about the absorption line [32], while reducing the influence of instrumental noise [19]. However, a wide window is not always correlated with an accurate retrieval, which can be explained by the fact that it is more difficult to accurately fit the variation of radiance in a wider window [19]. Secondly, the retrieval accuracy increased first and then decreased with the increase of np, while the optimal retrieval was achieved when np is 2 or 3. This is easy to understand, for the low-frequency part of the vegetation reflectance spectrum has a fixed smooth shape, which can be well fitted by a 2- or 3-order polynomial in a fitting window narrower than 30 nm [19]. Besides, the accuracy of retrieval first increases and then decreases as nSV increases from 2 to 20. This is because the variance that can be explained by feature vectors increased with nSV at first and gradually saturated [12]. Most of the variance of the training set can be carried out by our selected feature vectors (99.89% of the variance can be explained by the first four feature vectors at far-red band, while the first seven feature vectors at red band can explain 99.99% of the variance). The subsequent feature vectors can only explain less than 0.2% of the variance, which mostly comes from the random noise of the instrument.
It has to be admitted that our set of parameters is only designed for SIFIS-like data and may not apply to other instruments with different spectral characteristics. For satellites with higher spectral resolution (<0.1 nm), it is better to use a narrower fitting window, which only covers the solar Fraunhofer lines [11,15,18]. Nevertheless, the reasonable RMSE and the ability of the forward model to fit the radiance spectra indicate that accurate and robust retrievals can be achieved using our optimal parameters at both far-red and red bands. It can also be expected that the optimal combination of empirical parameters in this paper can not only be applied to TECIS-1, but also have the potential of SIF retrieval onboard other satellites to be launched, such as the FLuorescence EXplorer (FLEX) [33] and the Tropospheric Emissions: Monitoring of Pollution (TEMPO) [34] geostationary mission.
5. Conclusions
In this paper, simulation data were employed to investigate the influence of different empirical parameters on far-red and red SIF retrieval. The fitting window, the polynomial order, and the number of feature vectors were the three main factors that influenced SIF retrieval in both far-red and red band. The retrieval accuracy was relatively higher for a wider fitting window, and an RMSE of lower than 0.7 mW m−2 sr−1 nm−1 was achieved with fitting windows wider than 735–758 nm and 682 nm–691 nm for far-red band and red band, respectively. Moreover, the retrieval accuracy was dependent on the polynomial order and the number of feature vectors. The RMSE decreased first and then increased with the increase in np and nSV and can be lower than 0.7 mW m−2 sr−1 nm−1 when np is 2 or 3; such a result can also be obtained when nSV is 3–5 and 4–14 in the far-red and red band with an np of 2, respectively. According to the specifications of SIFIS onboard TECIS-1, a fitting window of 735–758 nm, second-order polynomial, and four feature vectors were the optimized parameters for far-red SIF retrieval, with an RMSE of 0.63 mW m−2 sr−1 nm−1. For red SIF retrieval, using second-order polynomial and seven feature vectors in the fitting window of 682–697 nm was the optimal choice and resulted in an RMSE of 0.53 mW m−2 sr−1 nm−1. The optimized parameters of the data-driven algorithm can guide the retrieval of satellite-based SIF and are valuable for generating an accurate SIF product of the TECIS-1 satellite after its launch.
Author Contributions
Conceptualization, C.Z., S.D., X.L., and L.L.; data curation, C.Z.; funding acquisition, L.L.; investigation, C.Z.; methodology, S.D.; project administration, L.L.; resources, X.L.; software, C.Z.; supervision, S.D. and L.L.; validation, X.L., Y.W., and Z.L.; visualization, Y.W. and Z.L.; writing—original draft, C.Z.; writing—review & editing, S.D. and L.L. All authors have read and agreed to the published version of the manuscript.
Funding
This research was funded by the Key Research Program of the Chinese Academy of Sciences (ZDRW-ZS-2019-1), the National Key Research and Development Program of China, grant number 2017YFA0603001, and the National Natural Science Foundation of China, grant number 41871239.
Data Availability Statement
Data is contained within the article.
Conflicts of Interest
The authors declare no conflict of interest.
References
- Sara Tokhi, A.; Ryozo, N.; Shusuke, M.; Tofael, A. Prediction of grape yields from time-series vegetation indices using satellite remote sensing and a machine-learning approach. Remote Sens. Appl. Soc. Environ. 2021, 22, 100485. [Google Scholar] [CrossRef]
- Chung-Te, C.; Pei, J.; Hsiang-Hua, W.; Teng-Chiu, L. Resilience of a subtropical rainforest to annual typhoon disturbance: Lessons from 25-year data of leaf area index. For. Ecol. Manag. 2020, 470-471, 118210. [Google Scholar] [CrossRef]
- Zhang, L.; Shao, Z.; Liu, J.; Cheng, Q. Deep Learning Based Retrieval of Forest Aboveground Biomass from Combined LiDAR and Landsat 8 Data. Remote Sens. 2019, 11, 1459. [Google Scholar] [CrossRef]
- Mohammed, G.H.; Colombo, R.; Middleton, E.M.; Rascher, U.; van der Tol, C.; Nedbal, L.; Goulas, Y.; Pérez-Priego, O.; Damm, A.; Meroni, M.; et al. Remote sensing of solar-induced chlorophyll fluorescence (SIF) in vegetation: 50 years of progress. Remote Sens. Environ. 2019, 231, 111–177. [Google Scholar] [CrossRef]
- Joiner, J.; Yoshida, Y.; Gu, L.; Marchesini, L.B.; Vasilkov, A.P.; Schaefer, K. The seasonal cycle of satellite chlorophyll fluorescence observations and its relationship to vegetation phenology and ecosystem atmosphere carbon exchange. Remote Sens. Environ. 2014, 152, 375–391. [Google Scholar] [CrossRef]
- Sun, Y.; Frankenberg, C.; Wood, J.D.; Schimel, D.S.; Jung, M.; Guanter, L.; Drewry, D.T.; Verma, M.; Porcar-Castell, A.; Griffis, T.J. OCO-2 advances photosynthesis observation from space via solar-induced chlorophyll fluorescence. Science 2017, 358, eaam5747. [Google Scholar] [CrossRef] [PubMed]
- Walther, S.; Voigt, M.; Thum, T.; Gonsamo, A.; Zhang, Y.; Köhler, P.; Jung, M.; Varlagin, A.; Guanter, L. Satellite chlorophyll fluorescence measurements reveal large-scale decoupling of photosynthesis and greenness dynamics in boreal evergreen forests. Glob. Chang. Biol. 2016, 22. [Google Scholar] [CrossRef] [PubMed]
- Zhang, Y.; Xiao, X.; Zhang, Y.; Wolf, S.; Zhou, S.; Joiner, J.; Guanter, L.; Verma, M.; Sun, Y.; Yang, X. On the relationship between sub-daily instantaneous and daily total gross primary production: Implications for interpreting satellite-based SIF retrievals. Remote Sens. Environ. New York 2018, 276–289. [Google Scholar] [CrossRef]
- Zhang, Y.; Guanter, L.; Joiner, J.; Lian, S.; Guan, K. Spatially-explicit monitoring of crop photosynthetic capacity through the use of space-based chlorophyll fluorescence data. Remote Sens. Environ. 2018, 210, 362–374. [Google Scholar] [CrossRef]
- Köhler, P.; Guanter, L.; Kobayashi, H.; Walther, S.; Wei, Y. Assessing the potential of sun-induced fluorescence and the canopy scattering coefficient to track large-scale vegetation dynamics in Amazon forests. Remote Sens. Environ. 2017, 204, 769–785. [Google Scholar] [CrossRef]
- Guanter, L.; Frankenberg, C.; Dudhia, A.; Lewis, P.E.; Gómez-Dans, J.; Kuze, A.; Suto, H.; Grainger, R.G. Retrieval and global assessment of terrestrial chlorophyll fluorescence from GOSAT space measurements. Remote Sens. Environ. 2012, 121, 236–251. [Google Scholar] [CrossRef]
- Joiner, J.; Guanter, L.; Lindstrot, R.; Voigt, M.; Vasilkov, A.P.; Middleton, E.M.; Huemmrich, K.F.; Yoshida, Y.; Frankenberg, C. Global monitoring of terrestrial chlorophyll fluorescence from moderate-spectral-resolution near-infrared satellite measurements: Methodology, simulations, and application to GOME-2. Atmos. Meas. Tech. 2013, 6, 2803–2823. [Google Scholar] [CrossRef]
- Köhler, P.; Guanter, L.; Joiner, J. A linear method for the retrieval of sun-induced chlorophyll fluorescence from GOME-2 and SCIAMACHY data. Atmos. Meas. Tech. 2015, 8, 2589–2608. [Google Scholar] [CrossRef]
- Guanter, L.; Aben, I.; Tol, P.; Krijger, J.M.; Hollstein, A.; Köhler, P.; Damm, A.; Joiner, J.; Frankenberg, C.; Landgraf, J. Potential of the TROPOspheric Monitoring Instrument (TROPOMI) onboard the Sentinel-5 Precursor for the monitoring of terrestrial chlorophyll fluorescence. Atmos. Meas. Tech. 2015, 8, 1337–1352. [Google Scholar] [CrossRef]
- Frankenberg, C.; O’Dell, C.; Berry, J.; Guanter, L.; Joiner, J.; Köhler, P.; Pollock, R.; Taylor, T.E. Prospects for chlorophyll fluorescence remote sensing from the Orbiting Carbon Observatory-2. Remote Sens. Environ. 2014, 147, 1–12. [Google Scholar] [CrossRef]
- Taylor, T.E.; Eldering, A.; Merrelli, A.; Kiel, M.; Yu, S. OCO-3 early mission operations and initial (vEarly) XCO2 and SIF retrievals. Remote Sens. Environ. 2020, 251, 112032. [Google Scholar] [CrossRef]
- Du, S.; Liu, L.; Liu, X.; Zhang, X.; Zhang, X.; Bi, Y.; Zhang, L. Retrieval of global terrestrial solar-induced chlorophyll fluorescence from TanSat satellite. Sci. Bull. 2018, 63, 1502–1512. [Google Scholar] [CrossRef]
- Du, S.; Liu, L.; Liu, X.; Zhang, X.; Gao, X.; Wang, W. The Solar-Induced Chlorophyll Fluorescence Imaging Spectrometer (SIFIS) Onboard the First Terrestrial Ecosystem Carbon Inventory Satellite (TECIS-1): Specifications and Prospects. Sensors 2020, 20, 815. [Google Scholar] [CrossRef]
- Guanter, L.; Rossini, M.; Colombo, R.; Meroni, M.; Frankenberg, C.; Lee, J.-E.; Joiner, J. Using field spectroscopy to assess the potential of statistical approaches for the retrieval of sun-induced chlorophyll fluorescence from ground and space. Remote Sens. Environ. 2013, 133, 52–61. [Google Scholar] [CrossRef]
- Joiner, J.; Yoshida, Y.; Guanter, L.; Middleton, E.M. New methods for the retrieval of chlorophyll red fluorescence from hyperspectral satellite instruments: Simulations and application to GOME-2 and SCIAMACHY. Atmos. Meas. Tech. 2016, 9, 3939–3967. [Google Scholar] [CrossRef]
- Wang, S.; Huang, C.; Zhang, L. Designment and Assessment of Far-Red Solar-Induced Chlorophyll Fluorescence Retrieval Method for the Terrestrial Ecosystem Carbon Inventory Satellite. Remote Sens. Technol. Appl. 2019, 3, 476–487. [Google Scholar]
- Guanter, L.; Alonso, L.; Gómez-Chova, L.; Meroni, M.; Preusker, R.; Fischer, J.; Moreno, J. Developments for vegetation fluorescence retrieval from spaceborne high-resolution spectrometry in the O2-A and O2-B absorption bands. J. Geophys. Res. 2010. [Google Scholar] [CrossRef]
- Liu, X.; Liu, L. Assessing Band Sensitivity to Atmospheric Radiation Transfer for Space-Based Retrieval of Solar-Induced Chlorophyll Fluorescence. Remote Sens. 2014, 6, 10656–10675. [Google Scholar] [CrossRef]
- Tol, V.D.C.; Verhoef, W.; Timmermans, J.; Verhoef, A.; Su, Z. An integrated model of soil-canopy spectral radiances, photosynthesis, fluorescence, temperature and energy balance. Biogeosciences 2009, 6, 3109–3129. [Google Scholar] [CrossRef]
- Clark, R.; Swayze, G. Automated Spectral Analysis: Mapping Minerals, Amorphous Materials, Environmental Materials, Vegetation, Water, Ice and Snow, and Other Materials: The USGS Tricorder Algorithm. Lunar Planet. Sci. Conf. 1995, 26, 255. [Google Scholar]
- Berk, A.; Bernstein, L.S.; Anderson, G.P.; Acharya, P.K.; Robertson, D.C.; Chetwynd, J.H.; Adler-Golden, S.M. MODTRAN Cloud and Multiple Scattering Upgrades with Application to AVIRIS. Remote Sens. Environ. 1998, 65, 367–375. [Google Scholar] [CrossRef]
- Berk, A.; Acharya, P.K.; Bernstein, L.S.; Anderson, G.P.; Chetwynd, J.H.; Hoke, M.L. Reformulation of the MODTRAN band model for higher spectral resolution. In Proceedings of the AeroSense 2000, Orlando, FL, USA, 24–26 April 2000; Volume 4049, pp. 190–198. [Google Scholar] [CrossRef]
- Ji, M.; Tang, B.; Li, Z. Review of Solar-induced Chlorophyll Fluorescence Retrieval Methodsfrom Satellite Data. Remote Sensing Technol. Appl. 2019, 3, 455–466. [Google Scholar]
- Parazoo, N.C.; Frankenberg, C.; Köhler, P.; Joiner, J.; Yoshida, Y.; Magney, T.; Sun, Y.; Yadav, V. Towards a Harmonized Long-term Spaceborne Record of Far-red Solar-induced Fluorescence. J. Geophys. Res. Biogeosci. 2019, 124. [Google Scholar] [CrossRef]
- Damm, A.; Erler, A.; Hillen, W.; Meroni, M.; Schaepman, M.E.; Verhoef, W.; Rascher, U. Modeling the impact of spectral sensor configurations on the FLD retrieval accuracy of sun-induced chlorophyll fluorescence. Remote Sens. Environ. 2011, 115, 1882–1892. [Google Scholar] [CrossRef]
- Wolanin, A.; Rozanov, V.V.; Dinter, T.; No?L, S.; Vountas, M.; Burrows, J.P.; Bracher, A. Global retrieval of marine and terrestrial chlorophyll fluorescence at its red peak using hyperspectral top of atmosphere radiance measurements: Feasibility study and first results. Remote Sens. Environ. 2015, 166, 243–261. [Google Scholar] [CrossRef]
- Kohler, P.; Guanter, L.; Frankenberg, C. Simplified Physically Based Retrieval of Sun-Induced Chlorophyll Fluorescence From GOSAT Data. IEEE Geosci. Remote Sens. Lett. 2015, 12, 1446–1450. [Google Scholar] [CrossRef]
- Drusch, M.; Moreno, J.; Del Bello, U. The FLuorescence EXplorer Mission Concept—ESA’s Earth Explorer 8. IEEE Trans. Geosci. Remote Sens. 2017, 55, 1273–1284. [Google Scholar] [CrossRef]
- Zoogman, P.; Liu, X.; Suleiman, R.M. Tropospheric emissions: Monitoring of pollution (TEMPO)—ScienceDirect. J. Quant. Spectrosc. Radiat. Transf. 2017, 186, 17–39. [Google Scholar] [CrossRef]
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).