1. Introduction
Healthy ecosystems are a primary source of vital resources for human welfare and survival while regulating the impacts of natural hazards and protecting human settlements against floods, landslides, and drought events, thus playing a key role in mitigating climate change effects [
1]. Conserving biodiversity and preventing biodiversity shifts is, therefore, of significant importance and a main target of multiple international conventions. The Sustainable Development Goals (SDGs) [
2] and the Paris Agreement by the United Nations Framework Convention on Climate Change (UNFCCC) [
3] promote the conservation of ecosystems and their associated biodiversity. The conservation, restoration and sustainable use of ecosystems are specifically mentioned in SDG 6.6 and SDG 15.1. The first, SDG 6.6, aims to “
protect and restore water-related ecosystems, including mountains, forests, wetlands, rivers, aquifers and lakes”, while SDG 15.1 aims at ensuring “
the conservation, restoration and sustainable use of terrestrial and inland freshwater ecosystems and their services, in particular forests, wetlands, mountains and drylands, in line with obligations under international agreements”. Other agreements and commitments include the Aichi Biodiversity Targets by Parties of the United Nations (UN) and the Convention on Biological Diversity (CBD) [
4,
5].
Providing information on the health of ecosystems status through monitoring activities is essential for applying suitable conservation and restoration practices. In recent years, Earth Observation (EO) technologies, particularly Satellite Remote Sensing (SRS), have been used to assist national authorities in monitoring programs for the achievement of the SDGs [
6]. The main advantages of SRS include the provision of data in a systematic and timely way, the cost-effectiveness of monitoring activities and accessibility to remote areas [
7,
8,
9]. SRS can capture biodiversity changes both at global and regional scales [
10,
11], thus the joint use of SRS data with in situ data and images obtained from other technologies (Light Detection and Ranging (LiDAR), Unmanned Aerial Vehicle (UAV), etc.) is a key factor to generate actionable knowledge in real-time for improved decision-making [
10].
To generate actionable knowledge, SRS data need to be translated into information and specifics metrics, maps, or consumable information products that can then be used to inform decision-makers. Structural and functional indicators are both required to evaluate the ecosystem’s health. Structural indicators reflect on the extension and composition of the ecosystems; functional indicators are useful to understand whether the areas are fulfilling their ecosystem role [
12]. Structural indicators are often used in monitoring programs whereas ecosystem functions are rarely measured [
13,
14,
15,
16], although they are sensitive indicators of ecosystem health [
17] and losses of biodiversity [
18]. Ecosystem functions are the aggregation of multiple processes that overall define the ecosystem’s capacity to bring benefits to a range of species [
9,
19]. Among these components, is Primary Productivity (PP). PP can be categorized into Gross Primary Production (GPP), the total amount of carbon or energy captured by plants and Net Primary Production (NPP), the carbon allocated to plant tissue after accounting for the costs of autotrophic respiration. PP is a process that underpins most of the ecosystem functions essential for the understanding of the global carbon cycle [
20]. Due to its relevance in the characterization of biodiversity change, PP is considered an Essential Biodiversity Variable (EBV). EBVs are defined as the derived measurements to study and manage biodiversity change, and their estimation is a step forward for integral monitoring programs and holistic health assessments in ecosystems [
4,
21].
PP has been widely related to the Vegetation Indexes (VIs), used as a proxy for vegetation productivity and to create long-term and consistent data series. Since their first use in the early 1970s, VIs have evolved rapidly together with the increased number of spectral bands of the sensors and the new demand for measuring specific ecological indicators [
7]. Particularly, the Normalized Difference Vegetation Index (NDVI) and the Enhanced Vegetation Index (EVI) have been widely used for the monitoring of primary productivity. Empirical models linking the NDVI and EVI with GPP or NPP in situ observations have shown accurate NDVI-GPP correlation in low biomass vegetation areas [
9,
22] and good performance of the EVI in high biomass vegetated areas such as dense grass or forest ecosystems [
23,
24]. Additional VIs have also been investigated, such as the two-band Enhanced Vegetation Index (EVI2) used by Cai et al. to assess the relationship between the EVI2, meteorological variables and GPP within the Nordic region [
25]. A recently suggested vegetation index, the Near-Infrared Reflectance of terrestrial vegetation (NIRv) [
26] has also shown a high correlation with GPP across different ecosystems [
27,
28] and temporal scales [
29,
30]. Recent outcomes in research suggest that given that GPP is not only connected to the greenest of vegetation, but also to other processes such as temperature or content water of leaves, different VIs sensitive to these factors need to be evaluated in carbon flux studies [
31]. Moreover, since specific VIs are more sensitive to particular ecosystems, a broader analysis for the selection of a specific VI depending on the study region is required for a better model formulation [
32,
33,
34].
Currently, remote sensing based global products of PP are provided by the National Aeronautics and Space Administration (NASA) through the Moderate Resolution Imaging Spectroradiometer (MODIS) sensor. This product, known as a MOD17 product, provides estimates of GPP on an 8-days basis, aggregated to annual products to compute NPP, with a spatial resolution of 250 m, 500 m and 1 km. MOD17, designed to provide an accurate regular measure of terrestrial vegetation growth, has been available since the mid-2000s [
35,
36,
37,
38]. Although the MOD17 product is widely utilized, its use and applicability in conservation and management practice are limited by the trade-off between its temporal resolution, spatial resolution, and spatial extent [
35,
39,
40]. MOD17 coarse spatial resolution (1 km) does not provide sufficient relevant information to monitor processes at a fine-local scale and represents a critical factor for carbon flux estimation, especially in heterogeneous landscapes [
41,
42]. Additionally, MOD17 GPP products are estimated by means of biome-specific parameters, parametrized and applied to biomes at a global scale [
35]. While this simplification is suitable for global estimations of GPP, the coarse inputs and the use of global Biome Parameter Look-up Table (BPLUT) parameters reduce the capacity of MOD17 to describe ecologically significant variation at finer scales [
35,
37,
43,
44].
Different authors have identified the Copernicus Sentinel-2 Multi-Spectral Instrument (MSI) as a potential sensor capable to improve the estimation of PP at a local scale, given its higher spatial resolution (10 m, 20 m, and 60 m). Pettorelli et al. [
20] listed several potential applications of Sentinel-2 sensors, including the generation of indicators of vegetation phenology, fire damage extent, defoliator control, habitat extent, habitat quality, production of biomass, etc. According to Cai et al. [
25], the Sentinel-2 data can better capture the spatial variation in heterogeneous landscapes in comparison to MOD17 products and enable the estimation of GPP at finer scales. Lin et al. [
33] have shown that the narrow red bands from Sentinel-2 sensors enhance the GPP estimations, increasing the accuracy of the empirical relationship between remote sensing based VIs and GPP which is influenced by the spectral resolution of the satellite products.
To provide information on ecosystems and biodiversity status, this study builds on the methodology proposed by Cai et al. [
25] and further investigates the use of Sentinel-2 MSI in modeling GPP. The methodology of Cai et al., is extended to evaluate not only several EVs but also multiple VIs, therefore, assessing the sensitivity of specific bands to the climate conditions and vegetation characteristics of the studied area. Eight Sentinel-2 derived VIs are integrated with in situ measurements of different EVs; their relationship with ground base GPP is investigated to select a model with the highest accuracy. A robust empirical approach employed by Cai et al. that follows previous research [
45,
46] is adopted. Additionally, a methodology is implemented to upscale the model across larger areas. This upscaling methodology relies on an unsupervised classification algorithm used to identify regions with similar reflectance properties to those within the EC area. Those regions are assumed to have vegetation with similar biophysical properties and photosynthesis activity. The study methodology is demonstrated in the case study of a marshland ecosystem located in Doñana National Park, Spain. High-resolution maps of GPP at the local scale are provided to support conservation and restoration practices, policy and decision-making. The integration of multiple remotely sensed indices and additional environmental variables (EVs) allows our workflow to be flexible, facilitating its uptake to different ecosystems.
4. Discussion
The development and application of a workflow to derive an empirical model of gross primary productivity from remote sensing imagery have been demonstrated at Doñana National Park wetlands. The model has been upscaled for the estimation of GPP in the marsh area surrounding the EC tower. The generated monthly maps of GPP have the potential to provide detailed information on the patterns and dynamics of primary production.
The model selection led to a linear regression model driven by the remote sensing based red-edge chlorophyll index CLr in combination with the rainfall data. This confirmed previous research outcomes on the sensitivity of the red-edge to canopy biomass, chlorophyll content and photosynthesis activity [
76,
77,
78] and thus on the enhanced estimation of the vegetation biophysical characteristics [
33,
79,
80]. In general, analyzing each VI proposed in the methodology, we observed that the greenness-sensitive VIs performed the best. Narrowband greenness VIs also showed good performances. The EVI and EVI-2 resulted to be more sensitive than NDVI, confirming that EVI can improve the estimation of productivity in areas with dense vegetation being less prone than NDVI to noise caused by soil and atmospheric effects [
81,
82]. Water-sensitive VIs, such as the LSWI and the MNDWI, did not show a high correlation with in situ GPP. Previous researchers have shown that these indexes are more sensitive to drought [
34,
83] than greenness-sensitive VIs and thus we suggest further investigation of their performances during dry phases and in ecosystems undergoing seasonal droughts.
The integration of satellite-derived VIs with environmental variables in the model formulation increased the correlation with in situ GPP. Particularly, rainfall data increased the model performances confirming that the biomass production of the Doñana marsh ecosystem is strongly dependent on precipitation [
58]. As a result of the high exploitation of water resources in the proximities of the Doñana National Park, the current flooding regime of the natural marsh is mainly determined by local precipitation [
16]. The wetlands’ size and depth change remarkably between years, driven mainly by the variability in the precipitation [
48], indicating that the flood levels and associated primary productivity in the ecosystems also correlate with the precipitation regimen. The association has also been supported by other authors studying species distribution stating that precipitation is the main driver of primary productivity and associated support of habitats for waterbirds in the ecosystem [
84].
The phenological cycle of the vegetation of the study area is also well reproduced, confirming the choice of our model predictors. [
48,
58] report that the biomass and associated photosynthesis activity observed through the total intake of carbon dioxide or GPP is low during the start of the rain season (October–December) and the initial stage of the more intense precipitation and flood season (January–February). Then, the GPP content increases in the middle of the wet season when the water availability in the ecosystem reaches a maximum (February–March) [
58] and continues to increase till the dry season when GPP starts to decrease. During the months with higher productivity, the vegetation contributed to 81% of the total GPP produced during the entire period.
This pattern can also be observed in the generated monthly GPP maps. Given the lack of validation data outside the EC tower, we attempt to validate the derived products at the location of the EC tower. Although the empirical regression model was generally accurate, an overestimation of the monthly GPP was observed when upscaling the method. Smaller time windows were also considered to check the influence of the window size on the model results. The model performances did not improve significantly; however, the smaller time window allowed for a better discretization in time and the identification of GPP variation within the months. Ideally, a smaller time window during the period with high productivity, when the VI selected for the model formulation is rapidly changing, should yield higher model accuracy and abides by the prerequisite of stable VI. At the same time, enlarging the time window could contribute to a reduction in the contamination and noises in the images, and therefore, lead to a smoother GPP estimation [
85]. We, therefore, suggest further investigation of the model development when upscaling it.
Including more specific land cover classes to account for different vegetation types or ecosystems [
86,
87,
88] or applying daily instead of annual climatological footprint [
25,
89] can also allow better estimations of GPP and should be pursued in further research. To perform the classification, we used the k-means clustering algorithm [
70]. The algorithm was quick and easy to run, a main advantage in comparison with supervised classification, which would have required extensive prior knowledge of the area to be able to identify and label the classes for the training data set [
90]. Supervised classification algorithms, however, can also be considered to further improve the land classification of the study area. Supervised classification methods are in general more expensive since require more time and prior knowledge of the area, but can perform better if a good quality training data set is available [
91].
The heterogeneity of the area surrounding the EC tower and the climatological footprint represents a critical factor for carbon flux estimation from satellite imagery [
41]. High-resolution products that accurately describe the observed patterns and processes occurring at different temporal and spatial scales are needed [
35]. We demonstrated that the high spatial resolution of the Sentinel-2 products allowed a detailed description at the time of the distribution of GPP over the ecosystem, enhancing the MOD17 coarse 500-m resolution GPP products. We conclude, therefore, that Sentinel-2 multispectral and high-resolution products can enhance the evaluation of ecosystem responses at a fine scale.
The model versatility is ensured by the integration and assessment of multiple remotely sensed indexes with different environmental variables. This can facilitate the uptake of the workflow in different ecosystems. The advantage of assessing multiple indexes is justified by the fact that the sensitivity to specific bands combination depends on the characteristics of the vegetation or climatic conditions [
32,
33]. A clear example of this is the saturation of NDVI in high biomass vegetation areas and the use of EVI instead [
92]. Regardless of the development of multiple vegetation indexes formulated to characterize specific vegetation features and processes in different regions, many studies on primary productivity still rely on the use of NDVI or EVI [
25,
41,
45,
46,
92]. It can be assumed that those VIs will have sensitivity to any ecosystem component or process. Nevertheless, ecosystem functioning processes such as primary productivity could be explained by multiple remotely sensed information and not exclusively by the fraction of absorbed photosynthetically active radiation (fAPAR), commonly approximated with the NDVI and EVI. An additional concern about the use of those indexes is that multiple studies have derived models for Nordic ecosystems [
25,
45]. Regardless of the accuracy derived for those ecosystems, upscaling the approach to other regions would require further analysis. For instance, the assessment of drier ecosystems for which the water or temperature availability can play an essential role rather than the light availability or fAPAR would require the use of VIs that reflect these limiting factors. Examples of these indices are the water-sensitive VIs such as the Land Surface Water Index (LSWI) and the Modified Normalized Difference Water Index (MNDWI), which are more sensitive to droughts [
34] or the Green Normalized Difference Vegetation Index (GNDVI) which is more sensitive to chlorophyll and so to photosynthesis activity [
32]. Additionally, narrowband VIs should be preferred to broadband VIs since they are more sensitive to water availability than other vegetation indices [
34,
83]. Further improvement of the workflow would, therefore, require the inclusion of other VIs which can be more sensitive to the study area. The same would apply to the environmental variables used for the model formulation that also help to better represent the condition of the ecosystem.
Challenges and Outlook
In the present study, the model could not be calibrated and validated for multiple years, or a full year. This was due to the lack of in situ measurements during the dry season that hampered the calculation of the annual GPP and the modeling of interseason variation of GPP. Moreover, about half of the total available Sentienl-2 data images had to be removed from the original dataset because the cloud coverage was higher than 30% of the total area of the tale. With the increased availability of both in situ measurements and satellite data in the future, it will be possible to fine-tune the model and study not only interseason but also interannual variation in GPP. This was, however, not the main objective of this study that aimed at assessing the capability of Sentinel-2 data for spatiotemporal upscaling of flux tower GPP measurements. Furthermore, the application of the workflow in different regions and different ecosystems may require further development of the tool and will make the upscaling methodology robust.
Further analysis to estimate the effect of the flux partitioning method in the calculation of the ground base GPP may need to be carried out. Although the night-time method of flux partitioning proposed by Reichstein et al. [
93] was generally accurate, further research could investigate the day-time method of flux partitioning suggested by Lasslop et al. [
94] or the night-time data-based method proposed by Falge et al. [
95]. Using multiple flux partitioning approaches would allow the evaluation of the robustness of the GPP prediction method [
96]. Moreover, an uncertainty analysis could also be implemented to better understand the accuracy of the in-situ GPP estimation and its impact on the workflow [
97].
Additionally, further analysis to investigate residual noise in the VI time series may be required. In the current study, the noise in the time series was reduced by removing cloudy pixels, water pixels and pixels not belonging to vegetation (e.g., water bodies, snow, temporal floods, or other temporal phenomena). However other noises arising from surface-viewing geometries or sun sensors can still be hindered [
98]. Noise-reduction algorithms such as the Savitzky–Golay filtering [
99], the asymmetric Gaussian function fitting [
100], the double logistic function fitting [
101], or the best index slope extraction (BISE) method [
102], might be also implemented in the methodology to increase the accuracies of the derived VIs.