1. Introduction
Due to their strong contribution to the water cycle, water surface dynamics are the predictive features for the eventuality of floods, hydric stress, water scarcity and severe drought [
1,
2,
3,
4]. Soaring needs of monitoring are being expressed by all types of end-users, such as hydrological and climate experts, administrations, intergovernmental and non-governmental agencies and financial investors. Through the requirements expressed by communities committed to water preservation, challenges of getting information in specific uncovered areas are being witnessed [
5]. Significant indicators encompass water levels, water discharge and water volume changes of inland reservoirs and rivers [
6,
7,
8,
9,
10,
11,
12]. Remote sensing, and especially the field of altimetry, offers a privileged access to the quantification of water availability and the computation of the relative change in terms of water resources [
13].
Several web platforms, described in
Table 1, allow access to water level observations at pre-determined locations, defined as virtual stations. Copernicus Global Operational Land Service [
14] uses Jason-3 and Sentinel-3 data to obtain water level measurements over 12,901 virtual stations. Hydroweb [
15] recently (November 2020) increased their number of virtual stations from 1733 to 11,336, in addition to 124 large lakes. Observations are refreshed at the latest 1.5 days after the availability of a new altimetric measurement in the Hydroweb platform. Dahiti [
16] currently provides not only water level measurements for 4411 stations but also for some of them, surface areas, volume variations, bathymetry, water occurrence masks, land-water masks, hypsometry and river discharge with a latency of 1–2 days. G-REALM [
17] monitors, in real time, lakes over 100 km
2 around the world, relying on Jason-3 and other past missions for historical data since 1992. However, the information provided suffer from spatio-temporal coverage sparsity at a global scale, as they are only available at predefined locations from existing web platforms. Not only large regions of the world are being underrepresented, but missing data are observed in complex topographies (mountainous landscapes, seasonal ice cover), which require more advanced processing algorithms. Cos et al. [
18] give a truthful and suited listing of the inherent reasons behind these limitations.
When water level data are not available, their computation requires altimetry data access at different levels of processing and from the various satellite missions providing measurements. However, radar altimeter orbits and elevation retrieval technology were originally designed for observing oceans, implying a design of their spatial resolution limiting to 1000 m river width measurements for most of the missions [
22]. Moreover, retrieving water heights from altimeter range data requires a great deal of processing for rivers, as surrounding land introduces noise in the system conceived for open water. As indicated in the Method column of
Table 1, references of the algorithms used to extract water level heights from altimetry data show that these large datasets rely on L2 data-based products with ocean or ice retrackers [
15,
20,
21,
23] and applying post-processing outlier rejections, while DAHITI [
16] uses L1 data-based algorithms. Although the BRAT toolbox [
24] was conceived to process these raw data, downloading and processing the altimetry data for large-scale sites remain a challenge, which the AltEx software [
25] intended to tackle in 2019. An open-source platform with a web-viewer was built to explore altimetry database availability and access water level time series information on-the-fly. However, considered missions were limited to Jason 2-3 and the web-viewer does not seem to be maintained anymore, neither does the API access to data. Moreover, AltEx does not verify whether users have selected points over water.
Recently, [
26] stated that the greatest challenge of extracting insights from altimetry is actually educational in order to set up synergies with hydrologists. There is a strong need for operational remote sensing-derived products with “continuity of data services, standardization of information, characterization of errors and accuracy” to engage miscellaneous types of end-users in benefiting from this information.
The call for world-scale tracking of water level heights firstly relies on mapping all possible water body areas that can be monitored on a regular basis, called virtual stations. In this study, a cutting-edge operational virtual station extractor algorithm (denominated
teroVIR) has been developed for multi-altimetry missions going beyond pre-determined location availability. Moreover, an operational, robust and consistent large geospatial–temporal data processor (denominated
teroWAT) has been implemented, relying on a constellation of Level 2 (L2) altimetry mission data (Sentinel 3A-B, Jason 1-2-3 and SARAL/Altika) to obtain continuous water level information from 2001 over large areas barely covered until today, such as the Arctic or the Ishim and Nura river basins in Kazakhstan. The platform agnostic software
teroWAT has then been adapted to systematically run in near real-time, covering all of West Africa and validated across the whole region (17 countries) with a standardized and state-of-the-art proposed metric. The community of scientific users of the
teroWAT software has contributed to the development of products such as water volume variations and embedded visualisation, fostering the use of remote sensing information in hydrological applications. An L2 data-based algorithm [
27] has been found to have limited accuracy in the Arctic region due to frozen water in winter time. Therefore, a Level 1 (L1) data-based algorithm [
28] has been developed in an operational fashion and plugged into the existing software to deliver unrivalled water heights information.
In
Section 2, the methodologies are introduced, firstly detailing the datasets. Secondly, a description of
teroVIR, a world scale virtual stations extractor, is provided. Thirdly, the modulable and all terrain
teroWAT software is presented by functional block highlighting its interoperability. Finally, we expose the conventional statistical metrics used to validate water level heights and propose a combined innovative one. In
Section 3, the results, such as unprecedented water surface dynamics of unsurveyed areas, are revealed, as well as large-scale validation of water level time series in West Africa. Furthermore, we show improvements by the L1-based processor for complex terrain handling and illustration of the compliance to the mosaic of users’ needs. In
Section 4, the methods and results are discussed.
2. Material and Methods
2.1. Datasets
Twenty-five years of available satellite altimetry missions [
26] provide the opportunity to build a long record of water level data.
Figure 1 depicts the possibility of temporal continuity of observations thanks to overlapping missions’ lifetime and high revisit time (roughly bi-monthly/monthly) at medium spatial resolution (300 m up to 2.2 km). However, each mission follows its specific nominal orbit, impeding the persistence of spatial coverage when the mission is over. The processors developed in this study only rely on currently flying missions, namely Saral/ALTIKA(SRL), Jason(J) missions 1-2-3 (which followed one after each other the nominal orbit) and Sentinel(S )3A-3B. While CryoSat-2 is currently active, this mission suffers from low revisit time (369 days) for most of the world since it is focusing on the polar regions, and is therefore not included as an input of the processor. As opposed to imaging satellite missions, the altimetry tracks are sparse and do not offer the possibility to fly over the entire world. Fortunately, water level heights disclose the advantage of being physical quantities spatially connected along the same water body and the sparsity of the measurements is then counterbalanced. As a result, observational requirements encompass permanent water bodies frequently flown over by altimetry missions in favour of spatio-temporal consistency. Permanent water extents are extracted from the JRC GSWE dataset [
29], which expresses the water occurrence value per pixel in percentage aggregated over thirty years (1979–2019) at 30 m resolution. Spatial resolution limitations of the altimetry missions and geometry of the water bodies entail constraints on their validity to extract water level heights. In this sense, the virtual stations extractor
teroVIR aims at inducing all possible locations in the world which would allow for continuous water level observations, without the need for setting them up a priori.
2.2. teroVIR: World Scale Coverage Virtual Stations Extractor
The
teroVIR processor derives virtual stations by crossing the altimeters’ buffered nominal reference tracks with water masks extracted from JRC GSWE. In order to fulfill long data record requirements [
26], no interleaved and drifting orbits are considered, since satellite missions follow their nominal reference tracks repeatedly during most of their operational life. In the following, we describe the built
teroVIR framework, illustrated in
Figure 2.
World coverage of the nominal satellite tracks is retrieved from the data providers of each satellite mission and read by the software as vector objects. Geometries of the tracks firstly have to be formatted to respect continuous coordinate sets of points encompassing latitudes between and degrees and longitudes between and degrees. Harmonisation of the multi-mission track geodata follows so that each satellite track object discloses the following metadata (essential for uniqueness denomination of the virtual stations): satellite name, absolute orbit, relative orbit.
Virtual stations are extracted for the areas of interest provided as polygons by the user. In the case of regions with high water occurrence coverage (like in the Arctic), river beds can be provided as linestrings. In those regions, main rivers and reservoirs of interest drive the water dynamics impact within the hydrological catchment. By coarsely pre-setting the lines of geometry of the river beds of interest, only virtual stations along these water streams are retrieved, significantly reducing the processing time. This process is called Geographic Selection and outputs filtering geometries of interest.
Surface water mapping is conditioned by downloading the JRC GSWE occurrence maps. An adapter parsing the bounding box around the filtering geometries had to be implemented to fit the formatting system used to geo-query the JRC GSWE tiles by their name. Each tile is then downloaded and processed in order to extract, as explained in the following sections, firstly water bodies and secondly virtual stations. The codebase is modulable so that any water bodies dataset released with higher accuracy could be plugged in.
For each tile downloaded, water occurence pixel values are thresholded (T) to derive a binary mask (water/land). Filtering geometries issued from the Geometric Selection step are intersected with the tile, forming the area of interest from which the water body geometries should be extracted. In case of string lines (representing the river beds), they are buffered by a distance in degrees () covering the full width potentiality of the river. The output collection of geometries is burned into a raster mask, to allow for the intersection with the water mask. The intersected mask is finally vectorised into a collection of water body geometries. As this collection could contain interrupted and disconnected components, which actually represent the same water body, traditional image processing operations of erosion followed by dilation are carried out. The water body dataset JRC GSWE has a resolution of 30 m, and parameters of erosion e and dilation d are chosen within the same order of magnitude. Finally, sanitised and non-empty geometries are kept as a collection of water bodies for the specified tile. This aforedescribed process represents the Water bodies module.
For each satellite mission, only harmonised tracks geometries lying in the tile are kept. As the track geometries are simplified compared to where the real altimetry signal could occur, their spatial expansion has to be modelled. To do so, each of these tracks is buffered by a certain distance in degrees (
), which covers the across-track resolution for JS and SRL nominal tracks and the derivation potentiality of the real track from the nominal track specific to the S3 mission. Then, water bodies and track geometries are intersected, outputting a collection of virtual stations. This collection is finally cleaned to only keep geometries within certain area extent boundaries (
and
). The inferior area extent bound ensures that the virtual station has a high probability to receive an altimetry pulse. JS and SRL work in Low Resolution Mode (LRM) operating mode whereas S3 works in Satellite Aperture Radar (SAR) mode, allowing for a higher resolution along track. Therefore, for JS and SRL missions, the minimum area extent is set as
, while for S3,
, where
and
, respectively, are the across and along track resolution. The upper boundary
is used to discard virtual stations following into oceans or seas, which would then be way too big. All parameters and values are summed up in
Table A1.
2.3. teroWAT: Interoperable All Terrains Water Level Processor
The water level processor
teroWAT is built under the following rationale. Altimeters send pulses at recurring points in time towards the earth’s surface. Onboard trackers gather echoes that are scattered back, which are then accumulated under a power distribution function over time known as “waveform” [
13]. Waveforms obtained over open water are accurately processed to obtain the satellite ranges (distance to the water) by estimating their “epoch” [
30] with algorithms named “Retrackers” [
31,
32]. The altimetry L2 products [
33] provide directly only the satellite ranges, whereas the L1 ones consist of geolocated waveforms generated by the signal received by the sensor on board of satellites. As shown in
Figure 3, the water level processor
teroWAT was built in blocks in order to ensure the modularity of the software. This way, additional features can be added to the processor so that it can be tailored to the users’ needs. The necessary inputs mainly consist of surface water geometries (the so-called virtual stations or the entire water body) and the period of observation. It is then possible to further filter by missions or processing level (L1 or L2). Since the purpose of
teroWAT is to obtain water level heights at a large number of relevant locations around the world, necessary automatic downloading of a large number of altimetry datasets has to be carried out. To optimize downloading times, only passes close to the virtual station geometries are queried to the data provider, with priority to the most consolidated processed products in terms of timeliness. Several virtual stations can be flown over by the same pass; therefore, duplicates of queries are removed to reduce downloading times. All passes are then downloaded in parallel for the case of multiple virtual stations ensuring an optimised processing. After this, the L2 processing module is also run in parallel for all virtual stations. In this module, for each virtual station, all geo-matching passes containing altimetry datasets are processed. Each altimetry dataset provides data for all pulses of the entire satellite pass crossing the globe but only data points intersecting the water mask geometry are kept for computation. The larger the water body is, the more data points are likely to be found. Satellite ranges and backscatter values for all retrackers available in L2 products and corresponding corrections (wet troposphere, dry troposphere, ionosphere, solid earth tide, geocentric pole tide and ocean loading tide) are computed [
27]. The best retracker is the one presenting the highest number of points falling into the mask and the smaller standard deviation of the basckatter values. An additional filtering step is performed to obtain valid points and reject outliers both within each water body, and within each water level time series. The Interquartile Range Rule [
34] is used to first discard pulses with out of range backscatter values for one water body at a certain date, but also to discard water level values from the entire time series associated with a water body at the end of the processing. The output of the
teroWAT consists of multiple time series of water level values for each of the surface water masks and each mission used as input.
While the L2 product is sufficient for large open water bodies (oceans, lakes), the retracked satellite range can be a complete outlier when land reverberations or frozen water presence contaminate the waveform. In this case, the state-of-the-art algorithm by Gao et al. [
28], based on L1 products, has been implemented to filter out too noisy waveforms (based on the number and amplitude of the peaks) and the ones with retracked epoch too far away from the corresponding local elevation value (obtained from a Digital Elevation Model). While in most of the regions of world, the Shuttle Radar Topography Mission (SRTM) DEM is available, for the northernmost and southernmest areas, an adapter has been developed to be able to use the Arctic DEM. The Copernicus DEM with global coverage will be considered in further work. Following the same strategy of modularity, other adapters can be included for regional DEM of higher resolution. As seen in
Figure 3, the L1 processing module is stand alone and only requires communication with the altimetry datasets. The L1 processor computes the retracked satellite range value, which becomes an input of the L2 processor.
As all satellite missions operate on different orbits, biases can be introduced that require a cross-calibration of inter-track and inter-satellite biases. For each water surface geometry, mean differences between the overlapping time series of the different missions are set as bias and therefore removed to the water heights computed for each mission. By post-processing the time series with satellite mission bias removal [
35], the final outputs of
teroWAT are mere water level time series for each virtual station.
The
teroWAT processor can be adapted to any online processing platforms facilitating tailored access as required by users [
26]. As shown in
Figure 3, platform adapters are only needed for reading the inputs, for communicating with the data providers available in the platform and for visualisation of the results. This demonstrates the capability of the system to be implemented in on-line platforms by slightly changing the processor, while keeping the main steps unaltered. In other words, the system is platform agnostic.
2.4. Consistent and Inclusive Evaluation Metrics
The water heights are validated by calculating a collection of usual metrics. The unbiased root mean square error (ubRMSE) [
28] allows an estimation of the errors between the in situ measurements and the water level datasets, removing the bias induced by the distance between the measurements (<50 km) on the same water body.
are the in situ measurements corresponding to
, the water level matching
N datapoints. ubRMSE is defined as:
The Median Absolute Deviation (MAD) provides an assessment of the errors by removing the impacts of potential outlier datapoints. MAD is defined as:
The Pearson coefficient (R) [
36] gives an indication on the correlation of the time series compared. R is defined as:
Considering the sparsity of the measurements in remote and difficult to access areas, not all in situ stations have the same sample size and reliability. The ubRMSE and R are more likely to be better for a time series of two datapoints than for more. Based on this observation, we propose to classify the time series extracted by virtual station-mission (VSM) in four reliability categories. If a virtual station is at the crossing of different mission tracks, each mission time series will be evaluated separately as a VSM. The normalised
TeroSCO (
) grants a consistent comparison, combining different metrics, of all virtual stations and is evaluated as:
where
c is the reliability category and corresponding weight
. The definition of the categories is as such:
Very low coverage category: datapoint available and since only one measurement does not define a reliable time series.
Low coverage category: datapoints available, defining a minimum of points for a reliable time series and
Medium coverage category: datapoints available, defining around at least one year of data and . For Jason mission, , whereas for other missions , considering their revisit time.
High coverage category: datapoints available, defining more than one year of data and
The TeroSCO ensures an interpretable metric (normalised range), favouring datasets with the highest number of measurements (N), highest accuracies (ubRMSE) and highest correlations (R).
4. Conclusions
Climate science often targets large-scale spatio-temporal phenomena depicted by long data records [
38], many of which can be derived from Earth Observation satellites on a recurrent basis, such as water level related products. We developed an operational software conceptualised with a modulable view, not only allowing to fit the requirements of miscellaneous users but also delivering datasets at a global scale extracted from up and running altimetry missions (since 2001). Within this software, the interoperability of the
teroVIR processor producing virtual stations and of the
teroWAT processor delivering corresponding water level time series, guarantees the extension to past (Geosat, ERS 1-2, Topex/Poseidon, GFO, Envisat) and future (SWOT) missions. The recently launched Sentinel-6 mission is planned to be integrated shortly in the
teroVIR and
teroWAT processors.
The virtual stations extractor
teroVIR was built on the scientific and physical knowledge of multi-mission/dataset types to map all possible water body areas that can be water level monitored on a regular basis. Unprecedented virtual stations in barely covered area to date have been found in high numbers in Kazakhstan and the Arctic. In West Africa, an increase of
,
,
of extracted virtual stations in comparison respectively with CGLOS, Hydroweb and Dahiti (dispensing predetermined stations), has been achieved. However, limitations are found for the virtual stations whose shape extension follows the same orientation as the satellite track in
Figure 7. Since the bathymetry and width of the water body, as well as the geoid local values, can significantly vary along the river bed, the average water height value computed at each datapoint will not give an accurate representation of the distribution of all water level values of each pulse. Further refinement of the
teroVIR based on the alignment of the virtual station with the track geometry should allow us to split it up. Moreover, the multitude of virtual stations found over large water bodies such as lakes should be merged into a unique one since they are all connected on a hydrological physical interpretation. Finally, since the
teroVIR targets the production of time continuous data records, only permanent water bodies have been considered, though many regions present a high inter-seasonal variation in water availability such as West Africa. An evolution of the processor considering seasonal water bodies would give unprecedented information on water heights over the years.
The
teroWAT processor modularity ensured a smooth evolution guided by the needs of the users. Key stakeholders asking for data in remote locations can also be data providers. The access given by the hydrologists of West Africa to gauges allowed, to the best of our knowledge, the first large scale validation for water level products across such a large region. An average of ubRMSE of
m and MAD of
m have been found for 36 virtual stations for the L2 processor. The
TeroSCO introduced, combining several metrics to assess with a normalised value, the reliability of a virtual station and corresponding water level heights accuracy, proved its effectiveness by favouring virtual stations with more datapoints (N), lower ubRMSE and higher Pearson coefficient (R). This score allows an unequivocal understanding of the quality of the validation of time series, by removing the bias introduced by the number of samples compared. Other datasets from CGLOS, Dahiti, Hydroweb providers have been compared against
teroWAT, the latest showing better conventional metrics (ubRMSE, MAD, pearson) while lower
TeroSCO. The accuracy of the L1
teroWAT processor has been improved, targeting complex terrain regions barely covered until today, like the Arctic, with the modular addition of the L1 module. Results significantly show the effectiveness of the L1 processor in discarding the polluted waveforms by terrain contamination and in improving the accuracy for the selected ones. The
teroWAT undeniably discloses a destructive strategy with its outlier processing steps, removing unreliable datapoints benefiting for smaller errors but penalizing the
TeroSCO. In parallel to removing the outliers postprocessing on water level heights, further adjustments on only retaining the high backscatter values (e.g., >15 dB in Ku band and >20 dB in S band [
24]) for each pulse could help get rid of outliers that remain in the results. These values are obtained on very flat surfaces, such as deserts, large river basins or wetlands due to the specularity of the return radar echo.
As demonstrated in
Section 3.3, any region of the world can then be monitored in near real time, opening for scientific collaborations and actual usage of EO data as a driver for change implementation. Among the sundry purposes presented, water surface dynamics have disclosed their relation to flood forecast and river discharge estimation in complex terrains with the L1 processor. Since the flexiblity of the processors have been at the center of the development of the software, improvement of the computation core can easily be implemented while connection to any platform or viewer is facilitated. Finally, the developed processors, providing water level time series information on-the-fly, could be adapted to an unrivaled platform that allows users to query the large global raw altimetry database dynamically.