1. Introduction
The operational water budget is an important water resources management and conservation tool. Organizations managing a collection of surface water reservoirs and associated distribution systems can, and do, directly implement quasi-real time accounting of amount of water in storage and in transmission to guide decision making. Bathymetry, and thus storage volume for each reservoir surface elevation, inflows, and outflows, are mostly observable and measurable for a surface water reservoir. Seepage losses are typically the only unknown outflow, which cannot be measured and are usually calculated based on observed changes in reservoir volume and estimates for surface water inflows, calculated evaporation, and observed outflows.
The ability to observe and measure a managed surface water reservoir means that the concept of residence time,
in Equation (
1), can guide resource management and conservation. In Equation (
1),
is time-averaged volume, e.g., annual average volume, and
is total outflow discharge from the reservoir, averaged over the same time interval as volume. Inflows to the managed reservoir are likely uncertain and dependent on climate and actions and management of other agencies. Equation (
1) provides for management of outflows, given the known amount of water currently available, to provide future supply and to generate a factor of safety to address uncertain inflows.
Groundwater reservoirs, in contrast to surface water reservoirs, have limited observability and measurability. Voids or spaces not filled by solid material in the subsurface provide for the storage of groundwater. The arrangement, configuration, and interconnection of voids tend to be heterogeneous and anisotropic, and there are no bathymetric or as-built surveys available for aquifers. The true areal extent and storage volume of an aquifer is rarely known. Additionally, inflows to and outflows from groundwater reservoirs are rarely observable. Recharge and inter-aquifer flows are typically not directly observed or even measurable. Wells provide direct point observations of water levels and outflow discharge from pumping when they are monitored. Outflow discharge at springs is also monitored in some cases.
Given the inherent uncertainty surrounding aquifer extent, total inflow discharge, and total outflow discharge created by limited observability and measurability, groundwater reservoir characteristics are often estimated using numerical models and “calibration-constrained uncertainty analyses [
1,
2]”, which use limited observations and soft information constraint in an inverse-style approach to select ensembles of aquifer characteristics that provide for history matching between simulation results and observations and generate equally feasible descriptions of the aquifer. Equally feasible collections of parameter values produce ensembles of possible storage volumes for an aquifer, which means that a value for
in Equation (
1) is not available as a simple guide to water conservation and management.
“Calibration-constrained uncertainty analyses [
1,
2]” are a type of data assimilation (DA). DA covers a collection of approaches for optimal combination of information from numerical model simulations with observations. It uses a “forward” numerical model to make predictions; measurements, or observed values, are assimilated with the predictions to derive updated values. The goal is to obtain the “best” description of a dynamical system and inherent uncertainty with the updated values. DA is frequently employed for two different purposes: (1) to compute the best possible estimate of a model state and (2) inverse-style approaches to estimate model parameters or deduce optimal model forcing [
3]. Best estimates of model state are often used in operational and forecasting implementations where the goal may be to use quasi-real time information to update or improve model forecasts. Inverse-style approaches, such as “calibration-constrained uncertainty analyses [
1,
2]”, focus on model calibration.
The Kalman filter [
4] is a digital filter and DA algorithm that provides “best” estimates of a system state. It recursively estimates state variables in a noisy linear dynamical system by leveraging a series of measurements in conjunction with initial state predictions from a forward model to generate estimates of unknown variables. It requires a linear model of system state and a Gaussian-like distribution of measurement errors, and its estimates, or updates, combine a model prediction with a measurement using a weighted average. More weight is allocated to estimates that have greater certainty. The result is generation of estimates that tend to be more accurate than estimates based on a single measurement. As part of the update process, the joint probability distribution over the variables for each time frame is estimated. The Kalman filter is used widely in many technical and quantitative fields and can often be implemented in real time [
5,
6,
7]. Linear or classical Kalman filters have been applied to hydrologic problems since the 1970s [
8,
9,
10,
11].
Ref. [
12] developed an ensemble form of the Kalman filter, the ensemble Kalman filter (EnKF), which uses a Monte Carlo framework to generate updates that combine predictions and measurements and which is applicable to highly nonlinear systems and non-Gaussian error terms [
13]. EnKF approaches have also been employed in a variety of hydrological DA studies [
14,
15,
16,
17,
18,
19,
20,
21,
22,
23,
24]. Previously, EnKF approaches have been extended with one or more “corrector” update steps to enforce water budget closure [
11,
25,
26,
27].
DA traditionally employs forward numerical models that are process- or physics-based. In contrast, statistical learning algorithms seek to discover rules which are statistics- rather than physics-based for executing a data analysis or comparison task based on known examples of inputs or features and corresponding outputs or outcomes. Machine learning (ML) and deep learning (DL) are sub-fields of artificial intelligence (AI) [
28] and are types of statistical learning [
29]. An ML approach involves training a statistical algorithm, or “machine”, to “learn” from input data; DL methods are a subset of ML algorithms [
28]. DL approaches are artificial neural network methods that can use multiple neuron layers and are deep in the sense of having more than one learning layer within the algorithm [
30].
DL and Kalman filter-style algorithms have been combined to estimate system state. Ref. [
31] uses DL to make predictions of battery charge and health state combined with an extended Kalman filter to update these predictions with observations. Ref. [
32] develop a neural network-based (a neural network is a DL algorithm) Kalman filter for the interpolation of sea surface dynamics, which is an alternative to the EnKF for DA.
A Kalman filter integration of long short-term memory (LSTM) network predictions to a process-based water budget model is developed and implemented in this paper. The integration is a three-step calculation that uniquely combines DL, i.e., LSTM, predictions into a mass-conservative water balance framework. LSTM predictions for water level elevations in wells are combined with the aquifer storage description in the water budget model so that the DA integration solves for the state variable of aquifer volume, which allows for water resources management using Equation (
1). The DL predicted stage generally replaces unavailable observations from missing or malfunctioning equipment. DA provides explicit and inherent inclusion and representation of data uncertainty, and the DA integration accounts for data uncertainty impacts to LSTM predictions externally to DL training, testing, and validation.
4. Discussion
DA provides methods for incorporating observations, or data, with numerical models. In this study, assimilated data are LSTM model predictions of the water level in five wells. Three of the five target wells, Willoughby, McBride #1, and McBride #3, have observations for only about 11 months. Observations are also available for a limited period, about 31 months, for a fourth target well, the Ehler well. Limited observations for these four wells means that there are insufficient data for training, testing, and validation and that the LSTM predictor is likely over-fitting estimates for these four wells, as discussed in
Section 3.1. Over-fitting is when a statistical learning predictor learns to reproduce inherent noise and systematic error in a data set in addition to “true” physical trends.
The fifth target well, J-27, is a monitoring well and has a relatively long record of observation as identified in
Table 5. Sufficient data are available for training, testing, and validation of the J-27 well. Because about 14% of the J-27 data set (see
Table 2) was reserved for independent validation and because the LSTM predictor demonstrates skill in predicting J-27 water levels during the independent validation period (see
Table 8), the J-27 portion of the complex graph LSTM predictor demonstrates generalization ability.
The Kalman filter integration combines the LSTM predictor “measurement” with the process-based water budget forward model prediction by weighting the “measurement” and the prediction with the dynamically evolving Kalman gain. The relative magnitude of the prediction variance to the “measurement” variance is used in determining the Kalman gain. The Kalman gain is external to or independent of LSTM predictor generalization ability, which is analyzed only in relation to the noisy and uncertain outcome data set used for training and validation, and it assimilates information from the process-based water budget model that is not available to the LSTM predictor.
The LSTM predictor projects water level elevation for the Edwards aquifer segment using the J-27 output, and
Table 9 lists a relatively small Kalman gain value for the Edwards aquifer segment. A Kalman gain value close to zero, such as 0.087 for the Edwards aquifer segment, denotes a relatively large variance for forward model predictions and provides more weight to forward model predictions, even though analysis of J-27 LSTM predictions, relative to solely water level outcomes, suggest generalizability. In contrast,
Table 9 identifies Kalman gain values of 0.624 and 0.673 for the Leona Gravels aquifer segment (Ehler index well) and the Buda #2 aquifer segment (McBride #3 index well), respectively. Kalman gain values closer to one provide more weight to LSTM predictor estimates in assimilation, even though it is expected that the LSTM predictor for the Ehler and Buda #2 water levels is over-fit and is producing biased estimates, corrupted by noise and error in the limited observation record used to train the “Other Wells” LSTM predictor.
An individual LSTM predictor was created for the J-27 well (see
Figure 5) to leverage data availability for training, testing, and validation of this well. A complex graph LSTM predictor, see
Figure 5, was then created to combine individual LSTM predictors so that the J-27 and river discharge model could trained with relatively long records of observations and could be subjected to independent validation. The primary means to improve a DL model is to train it on more and better data [
28], and statistical learning approaches assume and require that training, testing, and validation data sets are clean and “perfect”.
Hydrological data sets tend to be noisy, contain rare occurrences or extreme events, and be estimates of a desired parameter value from a different type of observation. Calculated potential evapotranspiration from observed temperature, estimated river discharge from observed stage, and calculated aquifer volume from index well stage are three commonly used, derived hydrologic data sets. A common-sense baseline, see
Section 2.3.3, can be used with outcome or target data sets that are known to be noisy and contain errors to assess statistical learning generalization ability. However, this assessment of generalizability is limited to the outcome data set used in training and its inherent flaws. In this study, a common-sense baseline is generated for the river discharge LSTM predictor (see
Figure 5) that provides two
thresholds: (1) a lower threshold that needs to be exceeded for demonstration of model skill and (2) an upper threshold. above which the model is assumed to be over-fitting.
DA, in contrast to statistical learning methods, does not require data perfection and provides for the explicit representation of expected measurement error in target observations through an observation error model, see
Section 2.4.1. An observation error model is employed for aquifer segment storage in this study. Measurements of water level stage in wells are transformed to volumes using the storage volume description within the forward model. The LSTM predictor provides the water level stage measurements that are assimilated by the Kalman filter-integrated water-balance approach. Integration of the LSTM predictor allows application of the observation error model to LSTM predictions and accounts for inherent target uncertainty and system representation uncertainty externally to the statistical learning algorithm.
The Kalman filter-based integrated LSTM predictor and process-based water budget calculation presented here is a three-step implementation that employs existing models and computer programs, with minimal modification, in the unique configuration explained in
Figure 3. Step one is the prediction of current aquifer segment storage with the forward model. Step two is the generation of the Kalman update aquifer segment storage through combination of LSTM predicted water levels, converted to aquifer segment storage, with forward model predicted storage using Kalman gain weighting. Step three is the “correction” step where the unobserved pumping and diversion volume are adjusted for the previous assimilation window, and the forward model is re-run with adjusted pumping and diversion discharge to predict current aquifer segment storage that approximates the Kalman update storage.
The forward model and the LSTM predictor are separate models and only the inputs to the forward model are adjusted. The forward model provides a water budget calculation where the budget framework enforces mass conservation. Because the LSTM predicted stage is used only to calculate the Kalman update storage and because pumping and diversion discharge are updated to account for the residual between step two storage and step one storage, the Kalman filter-integrated water balance is inherently mass conservative, even though it leverages statistical learning predictions for which mass conservation is undefined.
EnKF approaches have been extended previously with one or more “corrector” update steps to enforce water budget closure. Ref. [
11] incorporated constraints to the EnKF and applied a constrained EnKF in a two-step process to estimate a terrestrial water budget across the southern Great Plains region of the United States (US). Step one in this estimation is the standard EnKF, and step two is a constraint step that optimally redistributes water budget imbalance created in step one to adjust or correct the budget calculation. Ref. [
25] produced a two update, weakly constrained EnKF approach that enforces water balance closure and accounts for data uncertainty and applied it to assimilate gravity recovery and climate experiment (GRACE) terrestrial water storage observations with a global-scale hydrological model. This two-update, weakly constrained EnKF was subsequently modified to incorporate a more general, unsupervised framework that permits an unknown water-balance model covariance [
26], and the unsupervised and modified framework was applied for combined assimilation–calibration [
27]. These previous EnKF “corrector” applications are conceptually similar in implementation (but at disparate scales, and using different types of data sets that are observed, and which do not employ statistical learning predictions) to the three-step implementation in this study.
4.1. Integrated Volume Calculation—Advantages and Limitations
Kalman filters are a DA algorithm with the goal of continuously updating numerical model results to represent observations, and are frequently employed in active, “operational” environments with real-time or near real-time updates to optimize assembly line performance, track objects, and implement autonomous vehicle navigation and control [
6,
7]. Given the goal of continuous improvement, it only makes sense to use a classical Kalman filter implementation for “active” and rapid fusion of data with numerical models. In other words, a Kalman filter is not a “calibration” technique but is a technique for continuous optimization of dynamic system representation.
Here, Kalman filter integration produces an operations-focused water budget calculation that continuously evolves as additional data are acquired and that works with the state variables of water storage volume and rate of change of storage. The volume of water that is stored and volumetric additions to and extractions from storage are the primary concerns for resource management and conservation. The volume of water stored in the five aquifer segments in this study is an unobserved quantity. The explicit focus on storage volume and the adjustment of uncertain discharges allows for resource planning using Equation (
1) and the concept of residence time,
.
The main advantages of Kalman filter integration of DL predictions to a process-based water budget involve leveraging the advantages of each disparate approach, i.e., DL versus process-based approaches, to compensate for the deficiencies in the other approach. Disadvantages of process-based water budget calculations include: (1) model parameterization and process representation complexity increases with site complexity, which means that significantly more effort is required to make a water budget calculation for a physically complex site than for a simple site, and (2) model inputs and outputs must have dimensions and must balance dimensionally, which means that dimensionless, or proportional, soft information is difficult to employ within purely deterministic implementations of these models. Statistical learning advantages directly ameliorate the disadvantages of process-based calculations and include that (1) site complexity is uncoupled from DL model complexity so that there is no additional effort required for complex sites relative to simple sites and (2) dimensionless trends can be used as features, or inputs, as is utilized for pumping and water rights extractions in this study.
Similarly, the deficiencies of DL approaches are counterbalanced by the advantages of process-based approaches. Disadvantages of DL models include: (1) dimensionless and standardized inputs and outputs means no mass conservation or dimensionally consistent representation, which can lead to allocating too much importance to trivial correlations, and (2) standardization requires assumption of weak stationarity, which means that weak stationarity among training inputs and prediction inputs must be assumed. Contrasting advantages of process-based water budget calculations are: (1) explicit calculation of volume of water in storage, (2) physical process representation provides for representation of process function outside of the range of calibration data, and (3) inherent mass conservation and dimensional consistency as part of the budget, or balance, framework.
Kalman filter integration of these two fundamentally different approaches generates further advantages of (1) working directly with the unobserved state variable of water storage volume, (2) ability to combine dimensionless predictions of aquifer stage with the process-based volume description when stage observations are limited or unavailable, and (3) use of dimensionless extraction trends as inputs to predict aquifer stage in the LSTM predictor allows for “correction” of these unobserved, and uncertain, volumetric extractions and additions in the process-based model to yield “optimal” storage volume solutions.
The main limitations of the Kalman filter integration are: (1) it does not create new observations but merely spreads the uncertainty in previously observed values among the state variables of storage volume and rate of change in storage volume and (2) it is active, evolutional, and requires the continuous acquisition of additional observations to promote evolution and ongoing representation enhancement. If additional observations, i.e., additional data collection, are not planned, then using an active and evolutional approach is a waste of time because no new observations are available for the continued improvement of system state representation.
4.2. Future Work
The future work goal for this study is to implement this type of operational DA of DL predictions to process-based water budgets in an environment of continuous improvement via continuous acquisition of new observations analogous to assembly line function and process optimization for manufacturing. The LSTM predictor does not create “new” observations. It only estimates water level values based on learning from a limited set of observations. Consequently, the immediate need for future advancement in the current study area is to remedy mechanical issues with observation collection equipment so that new observations can be, and are, obtained.
Additional observations can be directly incorporated as “measurements” as soon as they are available, and the LSTM predictor should only be used when observations are not available. More data, and continual collections of observations, will allow for examination of different distributions of the
residual volume correction (from
Figure 3) between pumping and recharge. The correction volume distribution between pumping and recharge, used in this study, was arbitrarily derived to enable process-based and mass-conservative correction of the water budget calculation. Additional observations will eventually allow for iterative cycles of model development and optimization as part of the ongoing improvement to dynamic system representation.
Iterative cycles of model development will include both the process-based forward model and the LSTM predictor. An LSTM predictor should be re-trained and the updated representation validated whenever there are one to two years of additional data. The process-based forward model should be re-calibrated as part of this cycle of model optimization; additional complexity should be incrementally introduced to the process-based model as suggested by analysis of new observations. Finally, the process-based model will eventually need to be changed to a systems dynamics model that can inherently represent complex and interrelated water demands like feedback loops. The current process-based model is only capable of hydrologic routing, one-way movement of water across the interconnected aquifer and river segments, with a limit of five outflows.
5. Conclusions
LSTM-predicted water levels in wells are integrated to a process-based, mass-conservative water budget framework via classic Kalman filter DA. The integrated calculation is a three-step combination of a process-based water balance model, which is the forward model, with a DL predictor. The volume of water stored in aquifer segments is the important system state tracked in the assimilation.
In the first step, the forward model predicts aquifer storage volume. A Kalman filter calculates an updated aquifer storage volume by assimilating “measurements” from an LSTM predictor, predicted using similar inputs to the forward model, with initial forward model predictions in the second step. The final calculation step utilizes modified pumping and recharge inputs, adjusted to address the difference between the Kalman filter update and the forward model prediction, to recalculate the water budget for the current assimilation window in a “correction” step. Neither pumping nor recharge is observed in the study area, and values used in the prediction step, i.e., step one, are approximate.
This Kalman filter integration provides advantages of working directly with the unobserved state variable of water storage volume and combining dimensionless predictions of aquifer stage from the LSTM predictor with the process-based volume description for the water budget model in a mass-conservative fashion. It generates an operations-focused water budget calculation that provides for optimal system representation conditional upon existing observations and that can continuously evolve and improve as additional data are acquired.
DL requires clean and consistent data because statistical learning focuses solely on learning correlations from standardized data sets. Significant uncertainty issues are known to impact the target, or outcome, data sets used to train and validate the LSTM predictor in this study. A common-sense baseline is employed to facilitate interpretation of LSTM model generalizability accounting for uncertainty in river discharge observations. DA, in contrast to statistical learning, provides for explicit incorporation of target data set uncertainty including measurement error through the observation error model. An observation error model is used for the storage state variable in the integrated calculation to externally address data uncertainty impacts to LSTM predictions and to incorporate additional uncertainty inherent in the dynamic system representation of the water budget model.