Next Article in Journal
Research on the Index Calculation Method for the Impact of Drought on Water Quality in the Nakdong River, Korea
Previous Article in Journal
Assessing the Origin and Mapping the Extension of Salinity Around Shrimp Culture Ponds in Rio Grande Do Norte (Brazil)
Previous Article in Special Issue
Hydropolitical System Archetypes: Feedback Structures, Physical Environments, Unintended Behaviors, and a Diagnostic Checklist
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Essay

Some Remarks About Forward and Inverse Modelling in Hydrology, Within a General Conceptual Framework

by
Mauro Giudici
Dipartimento di Scienze della Terra “A. Desio”, Università degli Studi di Milano, Via Botticelli 23, 20133 Milan, Italy
Hydrology 2024, 11(11), 189; https://doi.org/10.3390/hydrology11110189
Submission received: 29 August 2024 / Revised: 24 October 2024 / Accepted: 7 November 2024 / Published: 9 November 2024
(This article belongs to the Collection Feature Papers of Hydrology)

Abstract

The solution to inverse problems is crucial for model calibration and to provide a good basis for model results to be reliable. This paper is based on a recently proposed conceptual framework for the development and application of mathematical models that require the solution of forward and inverse problems. The focus of this paper is on the discussion of some terminology related to the results of forward problems and their reanalysis, on the use of the proposed framework to revise and generalise some methods of solutions of the inverse problem, and to provide a non-standard insight in some aspects about the Bayesian approach to model calibration.
PACS:
92.40.-t; 92.40.Cy; 02.30.Zz; 02.50.-r; 02.50.Tt; 02.60.-x

1. Introduction

Since several decades ago, mathematical modelling has been common practice in hydrology, not only by researchers but also by practitioners. This approach is natural in the framework of modern science, especially in physics, which has been based on systematic experimentation and mathematical abstraction since the XVII century. Mathematical models should provide a mathematical twin of the real world. Analytical models, based on closed-form solutions of the physico-mathematical equations, are very useful for interpreting laboratory or field data, or to provide fast assessment of different alternatives during planning and design activities, but they are based on quite restrictive assumptions. In particular, they can be rarely applied to model complex systems, especially in the presence of non-linearity and heterogeneity. Numerical models for geophysics and environmental physics can handle these complex situations. Such models can be considered as mathematical tools to simulate the behaviour of a real system with a virtual representation of the natural system itself.
The results of a physical-based, discrete (or numerical) model are reliable and useful only if the following conditions are met: the physics is properly described, i.e., the phenomenological laws that are applied [1] are appropriate to describe physical phenomena at the relevant scale; the numerical methods are accurate and properly respect conservation principles (the basic principles of conservation of mass, energy, linear momentum, etc.); the parameters used in the model are correct, i.e., the model is properly calibrated and validated.
However, “appropriate”, “accurate”, and “correct” are qualitative adjectives: scientists and engineers seek to express them in a quantitative way. This is not always easy and straightforward. In fact, the phenomenological laws and the corresponding relevant parameters could depend on the time and space scales that are considered in the specific application. Moreover, the required accuracy in the simulation of the natural system depends on the goals of the model, on the extent and the quality of the available data set, on the scale at which processes are simulated, and on the relationship of model scale with measurement scale.
This paper is focused on numerical models in hydrology, but it is not a review of mathematical modelling and of the theory and application of inverse problems. Several books are devoted to the discussion of the discrete inverse problem (DIP) (see, e.g., [2,3,4,5,6,7,8] for an extensive introduction to this topic).
Following the seminal papers by Nelson [9,10], several researchers dealt with several applications and with different approaches in groundwater hydrology [11,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39] and soil physics [40,41,42,43]. The relevance of model calibration and of inverse problems to simulate groundwater dynamics in global models of the water cycle has been recently recognised [44]. The application of automatic calibration has been widespread also in the field of surface hydrology, at least from the ’70s of the XX century [45,46,47,48,49,50,51] and more extensively in the following decades (see, e.g., [52,53,54,55,56,57,58,59,60,61,62,63,64,65,66] as a largely incomplete list of the vast literature on this subject). These works considered the parameter fitting for different kinds of models [67], ranging from lumped to distributed models, empirical models based on a small number of parameters, conceptual models based on a simple structure, and physical-based distributed models. Interesting examples of model calibration for integrated surface/subsurface models [68,69] can be found in the scientific literature, as well as reviews of inverse problems in other sub-fields of geophysics [70].
The discussion proposed here is based on the conceptual framework to cast forward and inverse problems in geophysics proposed by the author with some of his coworkers [71]. The specific goals of this work are the following:
  • The introduction of a proper terminology, focused on the use of mathematical models in geophysics and environmental physics;
  • The discussion of some properties of the DIP by making use of a paradigmatic example of a non-linear model;
  • Some methods of solution to the DIP, which belong to the family of the so-called “direct” methods, are embedded in the abovementioned conceptual framework;
  • The Bayesian approach is also embedded in the abovementioned conceptual approach in order to discuss a couple of aspects that can be relevant.
The structure of this paper is as follows: Section 2 introduces a paradigmatic example useful to discuss the basic concepts about forward and inverse problems in hydrology. Section 3 is devoted to the introduction of the forward problem and to point 1 above (Section 3.3). In Section 4, the DIP is stated and its properties are discussed. Section 5 is focused on points 3 and 4 above. Finally, Section 6 is devoted to a discussion of some conclusive remarks.

2. A Paradigmatic Problem

Diffusive processes are very common in any sub-field of hydrology (catchment hydrology, rainfall–runoff, river hydraulics, groundwater hydrology, etc.). They are modelled with partial differential equations that are based on physical conservation principles (e.g., mass, energy, and linear momentum) and on phenomenological laws (Fick’s, Fourier’s, Darcy’s, etc.), are complemented with boundary and initial conditions (BICs), and most often assume the general form:
b · D t u = div ( a grad u ) + f ,
where u is the “potential” (e.g., hydraulic head for groundwater flow, solute concentration for contaminant transport, or temperature for heat flow), D t is the Lagrangian derivative [72,73,74], f is the source/sink term, and a and b are two phenomenological coefficients. Namely, a is a conductivity, diffusivity, or transmissivity that quantifies the ability of the medium to transfer the physical quantity under examination; b is a capacity that quantifies the ability of the system to store or release the physical quantity in response to variations of the potential.
Equation (1) can assume specific forms when applied to the different transport phenomena considered in groundwater hydrology, e.g., groundwater flow, solute transport, and heat transfer. In surface hydrology, the continuity equation (mass conservation) and Navier–Stokes equation (linear momentum conservation) can be further modified by introducing proper approximations, like those used to derive the classical Saint–Venant equation [75] or other models (see, e.g., [76]).
In the simplest case of a 1D, purely diffusive (i.e., convective terms are neglected), stationary process, (1) reduces to
d d x a d u d x = f .
For this example, a conservative finite-difference approximation can be synthetically written as
a i 1 / 2 u i 1 u i Δ x i 1 / 2 + a i + 1 / 2 u i + 1 u i Δ x i + 1 / 2 = φ i , i = 1 , , N 1 ,
where i { 0 , , N } is the index used to identify a node that is the centre of one of the N + 1 non-overlapping cells that cover the whole domain (see the geometry in Figure 1); a i 1 / 2 represents the phenomenological coefficient (e.g., internode or interblock conductance, given by the product of internode or interblock conductivity times the surface separating two adjacent blocks); u i is the potential at node i; Δ x i 1 / 2 is the spacing between adjacent nodes; and φ i is the sum of the source terms (expressed as a flow rate of the considered quantity per unit surface) in the cell i. Each of the terms on the left-hand side of (3) represents the specific flux per unit surface of the considered quantity (e.g., mass and energy) entering the cell i through the surface separating two adjacent cells. This is shown graphically through the grey arrows in Figure 1.
Dirichlet boundary conditions (BCs) are easily introduced by setting u 0 or u N equal to the prescribed value. Neumann BCs can be introduced by substituting a 1 / 2 u 0 u 1 / Δ x 1 / 2 with the prescribed value q 1 / 2 ; the term a N 1 / 2 u N u N 1 / Δ x N 1 / 2 can be substituted with its prescribed value q N 1 / 2 , analogously. Neumann BCs permit to write (3) also for i = 0 or i = N .

3. Forward Modelling

The goals of mathematical models in hydrology can be very widespread and differentiated, as they may include the following:
  • Comparison of the model outcomes with the results of laboratory and field experiments in order to test the validity of physical theories;
  • Interpretation of field data for the determination of physical properties of natural systems (e.g., interpretation of well tests);
  • Simulation of the impact of management and exploitation of water resources on the natural systems;
  • Simulation of the effects of remediation actions to support risk analysis concerning the pollution of water resources and the environment;
  • Simulations of the impacts of land use programs on water resources for strategic environmental assessment and for environmental impact assessment;
  • Simulation of the impacts of natural stressors (e.g., climate change) to support risk analysis related to hydro-meteorological intense events;
  • Support to the optimal design and setup of monitoring networks or laboratory and field experiments, including sampling frequency, precision of the instruments, and of the measuring procedures and gauges resolution.

3.1. Statement of the Forward Problem

A numerical mathematical model can be represented by a set of equations that describe the state of the physical system under study as a function of model parameters. The model parameters are included in an array p , whereas an array s includes the quantities that describe the state of the system. It is important to stress that the term “array”, rather than vector, is used here to denote mathematical structures that can play different roles. In fact, in this work, arrays denote a collection of physical quantities and parameters that will be considered either as a column vector, in order to apply standard methods of linear algebra, or as a finite set of elements. A discrete model can be written as the following system of equations:
f ( p , s ) = 0 ,
where the function f may assume different forms for different problems and may refer to time-evolving systems, for which s is split into sub-arrays corresponding to the state of the system at different time steps.
For the paradigmatic example considered in this work, the state of the system is the potential at the nodes, i.e., s = u i , i = 0 , , N , and (2) can be written as
f p , s = A p s b p = 0 ,
where A is a square matrix and b is used to model both the source terms and the BCs. The forward problem aims at solving (3) with respect to u i , if a i 1 / 2 and φ i are known, whereas the inverse problem aims at identifying the best values of the phenomenological parameters a i 1 / 2 , i = 1 , , N .
Equation (5) is a prototype also for the following:
  • Two-dimensional or three-dimensional problems;
  • Transient conditions, recalling the remark in the line after (4);
  • Different methods of numerical solution of the partial differential equations (e.g., finite elements and spectral methods);
  • Non-linear processes, for which A and b depend on s .
The abovementioned applications include different approaches for numerical integration of the mass and linear momentum equations in catchment hydrology, e.g., with a Muskingum–Cunge approach [77]. The general form given by (4) applies in a relatively easy way also to input–output models such as the advection–diffusion approach (e.g., [76,78]), the classical unit hydrograph concept (which goes back to Dooge’s work in the 1950s [79]) and similar approaches (e.g., [80]), or the curve number method ([81], see the remarks at the end of Section 4.2).
From a formal and general point of view, one can introduce the forward problem as follows:
Problem 1 
(Forward problem—FP). Let p P R N ( p ) be known, where P is the subset of R N ( p ) that takes into account any possible constraint on the model parameters (e.g., conductivity must be positive); then, the FP is stated as finding s S R N ( s ) such that (4) is satisfied.
If a unique solution of the FP can be found, it can be expressed in an explicit, possibly non-linear form as
s = g ( p ) .
For the sake of simplicity, in this paper, it is assumed that the FP is well-posed, i.e., a solution exists; is unique; and depends with continuity on p . However, this is not always the case [82].

3.2. Data and Parameters

Array p includes any model parameter, comprising those describing the geometry of the discretisation grid (e.g., the spacing of the grid or the time step for time-evolving processes). Therefore, some of these parameters are fixed before the application of the model; their values will depend on the available data, which are the elements of d . Then, the fixed parameters can be grouped in a “sub-array” p ( fix ) , whose dimension is N ( p ) and which depends on the data: p ( fix ) ( d ) .
For the paradigmatic example, d includes the following:
  • The position of the points where potential is measured;
  • The measured values of potential;
  • Other measurements necessary to estimate the values of the source terms or the BICs.
On the other hand, for rainfall–runoff models, d could include the following:
  • Time series of water level or discharge measured at stream gauging stations;
  • Time series of rainfall rate monitored at meteorological stations in the drainage basin;
  • Topographic data or a digital elevation model of the drainage basin.
For the paradigmatic example, p ( fix ) includes the values of Δ x i 1 / 2 , i { 1 , , N } , and possibly the source terms and the values of the BCs, e.g., u 0 and u N in the case of Dirichlet BCs.
Array p ( cal ) R N ( c ) denotes the model parameters, whose values are obtained through calibration, namely, as a solution of a DIP. The number of elements of p ( cal ) is N ( c ) . In general, one can write
p = p ( fix ) t , p ( cal ) t t = p 1 ( fix ) , , p N ( f ) ( fix ) , p 1 ( cal ) , , p N ( c ) ( cal ) t , or equivalently in set theory notation p = p ( fix ) p ( cal ) = p 1 ( fix ) , , p N ( f ) ( fix ) , p 1 ( cal ) , , p N ( c ) ( cal ) ,
where N ( f ) + N ( c ) = N ( p ) .
For the paradigmatic example, N ( c ) = N and p ( cal ) = { a i 1 / 2 , i = 1 , , N } .

3.3. Terminology About Results and Forecast of a Mathematical Model

The great variety of goals of mathematical models that were mentioned at the beginning of this section implies that model results are qualified with different words, which are usually considered as synonyms in common language [83,84,85,86]: estimate, forecast, guess, outlook, prediction, projection, and prospect. Other synonyms, which would not be used in scientific language, are foretelling, premonition, and prophecy. In the author’s opinion, a “fine-tuned” differentiation in the technical language is important.
Results or outcomes of a forward model should be considered as the state of the system array s that is the solution to the forward problem, i.e., the solution of (4), when the parameters p are given. Instead, forecasts of the model could be considered as a set of quantities that are derived from the model results, possibly also with the use of model parameters p and data d . The terms reanalysis and products are sometimes used to denote similar quantities, especially when data assimilation is performed. In particular, climatic and meteorological reanalysis is often considered in atmospheric physics (e.g., [87,88]). With reference to the paradigmatic example, flow rates could be computed by using the state of the system and the internode conductances. Therefore, the model forecasts are expressed as an array y that is a function of s , p , and d : y s , p , d .
The distinction between prediction and projection finds wide consensus in the scientific community. In fact, such a distinction is quite clearly stated in climate science and is well summarised in the Annex II. Glossary by IPCC [89] in 2007, from which the following sentences are borrowed.
Climate prediction
A climate prediction or climate forecast is the result of an attempt to produce an estimate of the actual evolution of the climate in the future, for example, at seasonal, interannual, or long-term time scales. Since the future evolution of the climate system may be highly sensitive to initial conditions, such predictions are usually probabilistic in nature. See also Climate projection and climate scenario.
Climate projection
A projection of the response of the climate system to emission or concentration scenarios of greenhouse gases and aerosols, or radiative forcing scenarios, often based upon simulations by climate models. Climate projections are distinguished from climate predictions in order to emphasise that climate projections depend upon the emission/concentration/radiative forcing scenario used, which are based on assumptions concerning, for example, future socioeconomic and technological developments that may or may not be realised and are therefore subject to substantial uncertainty.
Instead, the distinction between prediction and forecast is rather subtle and debatable (e.g., [90]). In many websites and blogs, forecast is often claimed to refer to the numerical simulation and prediction of the future behaviour of a system on the basis of past data. Prediction is used in a more general and wide sense. For instance, in statistics, linear regression of a variable against several other variables can be performed by comparing predicted values of the variable under examination with measured or observed values. In some cases, prediction is also considered to be a guess or an estimate of the behaviour of a system without referring to scientific arguments or data but based on assumptions, expert opinions, intuition, or subjective judgement. Therefore, as it is sometimes claimed on websites and blogs, all forecasts are predictions, but some predictions are not forecasts. When mathematical models are used in natural sciences, it seems that “forecast” is the most appropriate term to denote the estimate of quantities derived from the processing of the outcomes of a numerical model for a dynamical system and should be preferred to “prediction”.

4. Discrete Inverse Problems

With the notation and the definitions given in Section 3, the DIP should aim to solve the system
y s , p , d = t d , p ( fix ) ,
with respect to p ( cal ) , which is a sub-array of p , as shown by (7). Array t d , p ( fix ) consists in the calibration target, i.e., the values of the model forecast that are expected to be obtained from model outcomes. The calibration target is obtained from data processing, possibly supported by fixed parameters, i.e., p ( fix ) ; this phase might be tricky, because data and model parameters often refer to different spatial and temporal scales. The calibration target cannot depend on p ( cal ) .
An exact solution to (8) can rarely be found. The most common approach is to cast the DIP in the framework of optimal control and to look for the set of model parameters that minimises the discrepancy between the two sides of (8). This is performed by introducing an objective function O , given by
O p ( cal ) = d y s , p , d , t d , p ( fix ) ,
where d is a discrepancy function such that
  • d : R N ( t ) × R N ( t ) [ 0 , + ) ,
  • x = x d x , x = 0 .
The classical choice is the least-squares approach, for which
d y s , p , d , t d , p ( fix ) = i = 1 N ( t ) y i s , p , d t i d , p ( fix ) 2 .
Of course, many other choices are possible for the d function, among which are the sum of absolute differences ( 1 norm) and the maximum absolute difference ( norm), or any p norm of the difference y s , p , d t d , p ( fix ) . Therefore, the discrete inverse problem can be stated as follows:
Problem 2 
(Discrete inverse problem—DIP). Let s = g p be the solution to the FP from (6). Given p ( fix ) and d , given
P ( cal ) = p ( cal ) : p ( fix ) t , p ( cal ) t t P ,
given the functions y and t —in particular, the values of t d , p ( fix ) , and given the objective function O from (9), the DIP is stated as finding p ( cal ) P ( cal ) such that
O p ( cal ) O p ( cal ) , p ( cal ) P ( cal ) o r e q u i v a l e n t l y p ( cal ) = arg min p ( cal ) P ( cal ) O p ( cal ) .
A graphical scheme of the arrays used to state the FP and the DIP is shown in Figure 2. In that picture, the red arrows is the symbol to show the comparison between model forecast y and calibration target t .
As pointed out by one of the reviewers, the proposed formalism can embrace the approach to inversion through Physics-Informed Machine Learning [91], an approach that has been increasingly applied in geophysics [92] and in hydrology [93] in the last few years. Moreover, the training phase of Artificial Neural Networks can be included in this framework and shares some of the properties and difficulties of physical-based DIPs [94].

4.1. Remarks About the Discrepancy Function

The conditions that have been introduced for the discrepancy function are very similar to those that define a distance or metric among the elements of a given set. However, this function does not necessarily correspond to a distance as two of the properties that define a distance, in a rigorous mathematical sense, are not required here, namely, symmetry and triangle inequality. As an example, it is important to mention that even some commercial codes use the root mean square relative error given by
d y , t = 1 N ( t ) i = 1 N ( t ) y i t i t i 2 1 / 2 .
The idea behind this choice is to weight the discrepancy between model forecast and calibration target by considering the magnitude of the target. In other words, a small discrepancy could be negligible if the target value is great, whereas it is of great relevance if the target value is small, of the same order of magnitude as the discrepancy. Under this premise, the symmetry condition is not physically significant, i.e., it is physically very different to normalise with respect to t or to y . This also impacts on the asymmetry of the function.
Nevertheless, even if one disregards this conceptual point, it is easy to check that the function d defined by (12) does not satisfy the triangle inequality. For this, it suffices to consider N ( t ) = 1 so that d ( y , t ) = ( y t ) · t 1 , and the three numbers α = 4 , β = 1 , and γ = 2 to see that
3 = d ( α , β ) = α β β > α γ γ + γ β β = d ( α , γ ) + d ( γ , β ) = 2 ,
which violates the triangle inequality.
A generalisation of the relative error has been recently proposed in a COVID-19 spread model [95]. In that example, y and t include the cumulative number of infected (I), recovered (R), and deceased (D) persons:
y = { y n = y n ( 1 ) , y n ( 2 ) , y n ( 3 ) R 3 | y n ( 1 ) = I n , y n ( 2 ) = R n , y n ( 3 ) = D n , n = n min , , n max 1 } ;
t = { t n = ( t n ( 1 ) , t n ( 2 ) , t n ( 3 ) ) R 3 | t n ( 1 ) = I n ( ref ) , t n ( 2 ) = R n ( ref ) , t n ( 3 ) = D n ( ref ) , n = n min , n max 1 } ,
where (ref) means “obtained from reference data” taken from the Johns Hopkins University Coronavirus Resource Center (CRC), and n min and n max are the minimum and maximum time steps for which data are available, whereas n = 0 is the moment when the first person is infected. In the referenced work, the misfit between model forecast and calibration target is computed by means of the following objective function:
O p ( cal ) = i = 1 3 O ( j ) p ( cal ) ,
where
O ( j ) p ( cal ) = 1 n max n min n = n min n max 1 y n ( j ) t n ( j ) max ξ , t n ( j ) 2 1 / 2 ,
for i = 1 , 2 , 3 , where ξ 1 is a threshold-and-weight parameter. Such a parameter plays a double role:
  • A threshold that keeps positive the denominator of y n ( j ) t n ( j ) × max ξ , t n ( j ) 1 , even in the particular case when t n ( j ) = 0 ;
  • A weight that controls some characteristics of the objective function:
    • If ξ = 1 and t n ( j ) 1 , then O ( j ) reduces to the root-mean-squared relative difference between t n ( j ) and y n ( j ) .
    • If ξ assumes large values, it dampens the errors for low values of t n ( j ) , namely, if t n ( j ) ξ ; therefore, the relative errors corresponding to large values of t n ( j ) will be dominant, so that early-time behaviour, when I n ( ref ) , R n ( ref ) , and D n ( ref ) are small, is less relevant to the model fitting.
    • If ξ > max t n ( j ) , n = n min , n max 1 , then O ( j ) reduces to the standard root-mean-squared error.

4.2. Remarks About the DIP for the Paradigmatic Example

The paradigmatic example introduced in Section 2 is useful to discuss possible alternative choices for y and t . Four cases are mentioned here.
The first approach is to use measured data to build t in a straightforward way by assigning measured values of potential as the elements of t : t = u 1 ( meas ) , , u N m ( meas ) t , where u j ( meas ) is the potential measured at a measurement point x j . On the other hand, y is obtained by interpolation of the elements of s , so that the model forecast is the simulated potential at each measurement position: y = u 1 ( int ) , , u N m ( int ) t , where u j ( int ) is the potential interpolated at x j from s = u 0 , , u N t .
The second approach is the interpolation of field data at the node positions as a target: t = s ( int ) d , p ( fix ) . This array should be compared with the results of the model, i.e., with the solution to (4): in this case, y = s .
The third approach assumes that the model results are used to forecast the values of other quantities. For instance, if it is possible to estimate fluxes from field data and they can be included in the t array, then the y vector should include the forecast for the same quantities obtained from model results.
The fourth approach consists in substituting the interpolated field s ( int ) d , p ( fix ) in (4), or equivalently in (5) for this paradigmatic example. Then, y = A p s ( int ) d , p ( fix ) b p , which is often called equation (or balance) error. For this approach, the target is trivial: t = 0 . This approach is known in the literature on groundwater hydrology as the “direct approach” [12]. The conceptual framework used here [71] permits consideration of the whole process from data collection to parameter identification, if p ( fix ) includes the parameters that control the interpolation of field data. Then, the direct formulation of the DIP reduces to finding the parameters p ( cal ) that satisfy
f p , s ( int ) d , p ( fix ) = 0 .

4.3. Insights into the DIP Through the Example of the Curve Number Approach

In order to give further insights in the proposed framework, the simple curve number approach [81] could be considered. In its simplest formulation, it computes runoff (Q) as a function of rainfall (P), potential maximum retention after runoff begins (S), and initial abstraction ( I a ), i.e., all losses before runoff begins, including interception by vegetation, evaporation, infiltration, and water retention in surface depressions:
Q = ( P I a ) 2 P I a + S ,
where all the quantities are expressed as a length, i.e., they are specific per unit surface, possibly integrated over a given monitoring interval (24 h in many applications). Then, a simple linear relationship between I a and S is proposed, namely, I a = c 1 S ; moreover, the curve number C N is used to compute S through the relationship S = c 2 · C N 1 c 3 . c 1 = 0.2 , c 2 = 1000 in = 25,400 mm , and c 3 = 100 in = 2540 mm are the reference values that were originally proposed [81].
Under the light of the formalism used in this paper, one could collect field measurements for different rainfall events, which are denoted with the index m = 1 , , M . Rainfall measured at different stations could be included in d , together with discharge measurements at a gauge station. Array t could be simply obtained by the time integration of instantaneous discharge data over the prescribed time interval, whose duration is included in p ( fix ) . Some interpolation of the point measurements from the meteorological stations could provide the value of P m to be inserted in (18), to yield Q m . In other words, y = s = { Q m , m = 1 , , M } , and g is built with the application of (18) to each of the M-th rainfall events.
For this simple example, the optimal determination of the values of the parameters c 1 , c 2 , and c 3 , together with C N , could be the goal of model calibration. In this case, one could consider all these parameters as elements of p ( cal ) . An alternative choice could be to consider C N only as the fitting parameter and to fix the values of c 1 , c 2 , and c 3 to the reference values so that they belong to p ( fix ) .

5. Further Remarks About the DIP

The definitions given in the previous sections could be used to discuss several concepts relevant for the DIP, like identifiability, regularisation, multi-objective inversion, sensitivity analysis, and the use of the adjoint-state equation to compute sensitivity methods for the minimisation of the objective function. These aspects are discussed elsewhere ([71] and the references therein). Here, the focus is on the ill-posedness or ill-conditioning of the DIP and on different calibration methods that can only partially be embedded in the above-referenced conceptual framework. However, it is worth mentioning some literature references that are related to the importance of multi-objective calibration [96,97,98] and to the use of different metrics to assess and quantify model performance [7,99,100,101].

5.1. Some Remarks About Ill-Posedness of DIP

Since the beginning of the XX century, well-posedness (i.e., existence, uniqueness, and stability) is a key concept in mathematical problems [102]. Inverse problems are often ill-posed; because it is difficult to provide reasonable conditions to guarantee the existence of a solution, the latter is often non-unique and can depend in an unstable way on the input data. The framework considered in this paper permits stating the uniqueness and stability of the DIP in a relatively simple way, as follows:
Definition 1 
(Uniqueness). Let p and p be two solutions to Problem 2. The DIP is said to admit a unique solution if p = p .
Definition 2 
(Stability). Let p and p be two solutions to Problem 2, corresponding, respectively, to two data sets d and d . The DIP is said to admit a stable solution if d d 0 implies p p 0 .
From the practical point of view, the DIP could be affected by ill-conditioning—that is, even if a DIP admits a stable solution, a small error on the data could be sufficient to yield big, or even huge, errors in the identified parameters. This concept can be defined in a formal, but still qualitative, way as follows:
Definition 3 
(Lipschitz condition). Under the same hypotheses of Definition 2, the solution to the DIP is said to satisfy the Lipschitz condition if there exists a constant C > 0 such that p p C · d d 0 .
Definition 4 
(Conditioning). Under the same hypotheses of Definition 2, and if the solution to the DIP satisfies the Lipschitz condition, the DIP is said to be ill- (or, respectively, well-) conditioned if C is big (or, respectively, small).
In practical applications, the problems due to non-uniqueness, instability, and ill-conditioning might appear in a very similar manner. Moreover, the presence of local minima of the objective function is a very relevant and typical problem that arises in the solution of a DIP. It causes the failure of several algorithms, mostly those based on gradient descent: their final outcome depends on the initial guess and is often a local minimum. In addition, those algorithms suffer big problems when the objective function is almost flat around the minimum. However, no matter which minimisation algorithm is applied, the flatness of O implies that a large set of arrays p ( cal ) yields values of O , which differ by quantities smaller than the uncertainties due to measurement and modelling errors. This problem is somehow linked with the concept of “resolution” and is strictly related to the sensitivity of the model outcomes and forecast on the model parameters. These issues triggered the introduction of the concept of “equifinality” [56,103,104,105] and of the study of parameter uncertainty [106,107,108]. The latter, together with proper management of data and modelling errors, should be cast in the frame of probability theory and stochastic processes (see Section 5.3).

5.2. Solution Methods Based on Comparison Among Model Forecasts

Some methods of solution of the DIP are based on the comparison among different model forecasts; here, the focus is on two of them: the Comparison Model Method ([109,110,111,112,113,114,115,116], CMM), together with its variation named successive flux estimation [117], and the Dual Constraint Method ([118,119,120], DCM). Both methods are based on the use of the solution to the FP for a tentative set of parameters ( p ( 0 ) ), namely, for a tentative a field for the applications related to the paradigmatic example. The iterative updating of parameters is based on the use of phenomenological laws (Darcy’s law in the case of groundwater hydrology, Fourier’s law in the case of heat flow, etc.). Therefore, these methods have quite a strong physical foundation but are not based on the use of the gradient of O to search the minimum of the objective function.
Starting from a first guess of p ( cal ) , i.e., a i 1 / 2 ( 0 ) , i = 1 , , N for the paradigmatic example, CMM solves the FP and obtains s ( 0 ) , i.e., the values of u i ( 0 ) , i = 1 , , N 1 for the solution of (5). Then, in its simplest formulation, the method is applied iteratively by means of the following simple formula:
a i 1 / 2 ( + 1 ) u i ( int ) u i 1 ( int ) Δ x i 1 / 2 = a i 1 / 2 ( ) u i ( ) u i 1 ( ) Δ x i 1 / 2 , i = 1 , , N .
In other words, the fluxes computed at the -th step of the iterative algorithm are assumed to be equal to the fluxes that are computed from the internode conductivities at the next iteration and the potential interpolated at each node. Roughly speaking, from the physical point of view, at each iteration, the method increases the values of a i 1 / 2 , where the modelled discrete potential gradient is greater than the one computed with the interpolated potential and vice versa.
By abstracting from the specific paradigmatic example, the CMM algorithm could be specified as follows: starting from an initial guess of the parameters to be calibrated, compute the parameters at the -th iteration as the solution to
y ( s ( 1 ) , d , p ( 1 ) ) = y ( s ( int ) , d , p ( ) ) ,
where = 0 , 1 , and s ( 1 ) = g ( p ( 1 ) ) .
DCM starts from the idea of Calderón’s problem [121], who worked on the problem of obtaining images of the subsurface through electrical tomography inversion. He stated the inverse problem as the determination of the conductivity field in a region of the subsurface by means of potential and current measured at the border, through the use of the Dirichlet-to-Neumann map. The latter is the mathematical mapping that assigns to a given set of Dirichlet boundary values the correspondent currents at the border. In the case of DCM, two FPs are considered: the first FP uses “Dirichlet BCs”, i.e., it prescribes the values of the potential at some points, possibly corresponding to measurement points (the fixed parameters used for this FP and the corresponding solution are denoted with apex (D)); the second FP is solved by using “Neumann BCs”, i.e., it prescribes fluxes at the same measurement points as for the previous FP (the apex (N) is used for quantities referred to this FP). For the paradigmatic example, starting from an initial guess a i 1 / 2 ( 0 ) , i = 1 , , N , the solution is obtained by iteratively applying the following equation:
a i 1 / 2 ( ) = a i 1 / 2 ( ) u i ( N ) u i 1 ( N ) u i ( D ) u i 1 ( D ) , i = 1 , , N .
In the general case, the method can be stated as follows: starting from an initial guess of p ( cal ) , and considering two different subsets of fixed model parameters, corresponding, e.g., to different BCs, and denoted with p ( fix , D ) and p ( fix , N ) , compute p ( cal ) at the -th iteration as the solution to
y s ( D ) , d , p ( fix , D ) t , p ( cal ) , t t = y s ( N ) , d , p ( fix , N ) t , p ( cal ) , 1 t t ,
where = 0 , 1 , ,
s ( D ) = g p ( fix , D ) t , p ( cal ) , 1 t t ,
and
s ( N ) = g p ( fix , N ) t , p ( cal ) , 1 t t .

5.3. Bayesian Approach

The role of the measurement, approximation, and modelling errors on the solution of DIP is emphasised in [71]. Since such errors are very complex to predict (this verb is used here in a proper way, in the sense that, in some cases, only educated guesses can be applied on such errors), the Bayesian approach (see, e.g., [122] for an interesting discussion of the approach) could be of particular interest for model calibration. The common strategy invokes Bayes’ theorem, which can be cast as follows:
f posterior p ( cal ) | y t = f likelihood y t | p ( cal ) · f prior p ( cal ) f marginal y t ,
where
  • f posterior p ( cal ) | y t is the posterior probability density function (pdf);
  • f likelihood y t | p ( cal ) is the likelihood [123], i.e., the conditional pdf of the discrepancy between model forecasts and target values conditioned with respect to the p ( cal ) array;
  • f prior p ( cal ) is the prior pdf of p ( cal ) , i.e., the pdf of the values of the parameters to be calibrated, without any information from measured data and model outcomes;
  • f marginal y t is the marginal pdf, i.e., the pdf of the discrepancies between y and t , without any information from measured data and model outcomes.
The application of the Bayesian approach requires some guesses about the pdfs appearing in (23). The most common guess consists in assuming that both the likelihood and the prior pdf can be expressed as multi-Gaussian distributions. Under these hypotheses, the search for the maximum of the posterior pdf reduces to the least-squares approach. More precisely, it reduces to the weighted least-squares, where the differences y t and the discrepancy of p ( cal ) from its prior expected-value have zero mean and are weighted by the inverse of the corresponding covariance matrices. Such matrices take a diagonal form if the elements of y t are independent of each other; moreover, if the variances of each element of y t are equal, then the optimal set of parameters obtained from the Bayesian approach reduces to the standard least-squares solution. However, this is not the only possible choice. In fact, if outliers are present—i.e., if some values of y are very far from the expected target value t —an exponential distribution might provide a better approximation of the pdfs, particularly of the likelihood, and it practically leads to the minimisation of 1 -like norms.
The above remarks are fundamental, because they implicitly show which are the conditions and the physical requirements at the basis of the classical least-squares approach. Those remarks should not be considered as limitations to the relevance of the Bayesian approach, which is not restricted to find a single optimal set of parameter values. Such an approach should be applied to find the posterior pdf of parameter values so that it provides a potentially powerful tool to analyse uncertainties in the model parameters and, therefore, in the model outcomes and forecasts.
At the beginning of this section, it was mentioned that the discrepancy between model forecast and calibration target depends on measurement errors, on the non-optimal choice of parameter values, and also on model errors. An interesting and more thorough discussion about this topic, particularly about the different role of uncertainties in the model and in the model parameters, can be found in [106]. The fact that (23) is slightly different from the standard formulation proposed by other authors, who consider y t = s d , can be useful to stress some remarks. In fact, the framework applied in this paper allows the inclusion of different types of model forecasts and calibration targets. Some sort of prior information is contained not only in the prior pdf, which, by the way, implicitly introduces a regularisation. The likelihood and the marginal pdf depend on the prior assumptions and approximations introduced in the model, although this could be somehow masked by the notation. Moreover, few applications of the Bayesian approach consider the marginal pdf, because the latter is often assumed to be independent of p ( cal ) . However, the definition given here shows in a quite explicit way that this quantity is affected by modelling assumptions. One could discard this problem, if only the optimal set of parameters is sought. But if the ultimate goal is to obtain a posterior pdf to be used for further analysis, e.g., to infer the uncertainty on model forecast from the uncertainty on model parameters, an accurate estimate of the marginal pdf is necessary and cannot be substituted with a trivial, straightforward assumption.

6. Conclusions

The goal of this paper was to propose a discussion of some properties of forward and inverse modelling in hydrology. The work is focused on the paradigmatic example of the transport equation, which is very commonly used in hydrology, and particularly for the simplest case of a purely diffusive, stationary, 1D transport.
A few essential messages can be summarised here.
The conceptual framework proposed by the author of this paper and some of his coworkers [71] can be very useful to cast the FP and the DIP in a proper way.
Some methods of model calibration, e.g., CMM, DCM, and the Bayesian approach, do not perfectly fit that framework. Nevertheless, such a conceptual framework and its formalism can be useful to cast those methods in more general cases. Furthermore, it provides a non-standard approach that can be useful to discuss some problems, like the role of uncertainties on the model structure, as mentioned in the last remarks of Section 5.3.
With regard to the terminology used to describe the results of mathematical models for hydrology, and more generally for natural sciences, this work has the ambition to clarify the distinction between words that are considered as synonyms in common language and that are sometimes used without any specific care in scientific papers and technical reports. In particular, the conceptual framework permits us to differentiate model results or outcomes (i.e., the state of the system obtained from the solution to the FP) from model forecasts (or reanalysis, or products, whose evaluation could involve data and parameters). On the other hand, words like prediction should be carefully used, possibly only in specific contexts like, e.g., statics.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

This work is a theoretical one, so no data were created or analysed in this study.

Acknowledgments

The Managing Editor of Hydrology is warmly acknowledged for the kind invitation to submit a contribution to this journal, for his encouragement to resubmit it after the first revision, and for his valuable comments. The Academic Editor is warmly acknowledged for his excellent work and for his suggestions, which were very helpful to improve the quality of the manuscript in this revised version. Seven anonymous referees who reviewed the original and the revised version of the manuscript are gratefully acknowledged: their comments helped me to improve the quality of the manuscript during the revision process. This research has been partially conducted in the framework of the project “Geosciences for society: resources and their evolution” supported by the Italian Ministry of University and Research (MUR) through the funds ‘Dipartimenti di Eccellenza 2023/2027’.

Conflicts of Interest

The author declares no conflicts of interest.

Abbreviations and Symbols

The following abbreviations are used in this paper:
BCsboundary conditions;
BICsboundary and initial conditions;
CMMcomparison Model Method;
DCMdual constraint Method;
DIPdiscrete inverse problem;
FPforward problem;
pdfprobability density function.
The following mathematical symbols are used in this paper.
For the description of the general framework of the FP and the DIP:
d array of data;
f array of functions describing the FP;
g function of p denoting the solution to the FP;
p array including model parameters;
p ( cal ) array of model parameters whose values must be fitted;
p ( fix ) array of model parameters whose values are fixed;
p ( cal ) solution to the DIP;
s array of the state of the system, i.e.,the solution to the FP;
t array of calibration target;
y array of model forecasts;
P set of the admissible model parameters;
P ( cal ) set of potential solutions to the DIP;
S set of state of the system;
N ( c ) number of elements of p ( cal ) ;
N ( f ) number of elements of p ( fix ) ;
N ( p ) number of model parameters;
N ( s ) number of elements of s ;
N ( t ) number of elements of t and y ;
O objective function;
d discrepancy function;
x , x elements of R N ( t ) ;
α , β , γ numbers.
For the paradigmatic example:
aphenomenological coefficient (e.g., conductivity, diffusivity, and transmissivity);
a i 1 / 2 discrete phenomenological coefficient (e.g., internode or interblock conductance);
bphenomenological coefficient (capacity);
fsource/sink term;
inode or cell index;
Nmaximum node index;
upotential (e.g., hydraulic head, solute concentration, and temperature);
u i 1 potential at a node;
φ i sum of the source terms in a cell;
Δ x i 1 / 2 spacing between adjacent nodes;
A coefficient matrix for the paradigmatic example;
b known-term, including sink/source terms and BICs;
D t Lagrangian derivative;
x j measurement points;
u j ( meas ) measured value of the potential;
(int)superscript to denote interpolated quantities.
For the discussion of some remarks coming from the application of the DIP to the curve number method:
C N curve number;
c 1 , c 2 , c 3 phenomenological coefficients;
I a initial abstraction, i.e., losses before runoff begins;
mindex to denote different rainfall events;
Mnumber of rainfall events considered;
Prainfall;
Qrunoff;
Spotential maximum retention after runoff begins.
For the discussion of some remarks coming from the application of the DIP to a COVID-19 spread model:
I, R, Dnumbers of infected, recovered, and deceased persons;
ntime index;
n min , n max minimum and maximum values of n;
ξ threshold-and-weight parameter;
(j)apex used in the DIP;
(ref)apex used to denote values obtained from reference data.
For the discussion of the Bayesian approach and of some specific methods of solution to the DIP:
f likelihood likelihood;
f marginal marginal pdf;
f posterior posterior pdf;
f prior prior pdf;
(D)apex denoting quantities referred to the FP with Dirichlet BCs for DCM;
(N)apex denoting quantities referred to the FP with Neuman BCs for DCM;
iteration index for CMM and DCM.

References

  1. Narasimhan, T.N. Fourier’s heat conduction equation: History, influence, and connections. Rev. Geophys. 1999, 37, 151–172. [Google Scholar] [CrossRef]
  2. Menke, W. Geophysical Data Analysis: Discrete Inverse Theory, 3rd ed.; Academic Press: Cambridge, MA, USA, 2012. [Google Scholar]
  3. Tarantola, A. Inverse Problem Theory and Methods for Model Parameter Estimation; Society for Industrial and Applied Mathematics: Philadelphia, PA, USA, 2004. [Google Scholar]
  4. Parker, R. Geophysical Inverse Theory; Princeton University Press: Princeton, NJ, USA, 1994. [Google Scholar]
  5. Aster, R.; Borchers, B.; Thurber, C. Parameter Estimation and Inverse Problems, 2nd ed.; Academic Press: Boston, MA, USA, 2013. [Google Scholar] [CrossRef]
  6. Zhdanov, M. Inverse Theory and Applications in Geophysics, 2nd ed.; Elsevier: Oxford, UK, 2015. [Google Scholar] [CrossRef]
  7. Hill, M.C.; Tiedeman, C.R. Effective Groundwater Model Calibration: With Analysis of Data, Sensitivities, Predictions and Uncertainty; John Wiley & Sons: Hoboken, NJ, USA, 2006. [Google Scholar]
  8. Sun, N. Inverse Problems in Groundwater Modeling; Theory and Applications of Transport in Porous Media; Springer: Dordrecht, The Netherlands, 2013. [Google Scholar]
  9. Nelson, R.W. In-place measurement of permeability in heterogeneous media: 1. Theory of a proposed method. J. Geophys. Res. 1960, 65, 1753–1758. [Google Scholar] [CrossRef]
  10. Nelson, R.W. In-place measurement of permeability in heterogeneous media: 2. Experimental and computational considerations. J. Geophys. Res. 1961, 66, 2469–2478. [Google Scholar] [CrossRef]
  11. Emsellem, Y.; De Marsily, G. An Automatic Solution for the Inverse Problem. Water Resour. Res. 1971, 7, 1264–1283. [Google Scholar] [CrossRef]
  12. Neuman, S.P. Calibration of distributed parameter groundwater flow models viewed as a multiple-objective decision process under uncertainty. Water Resour. Res. 1973, 9, 1006–1021. [Google Scholar] [CrossRef]
  13. Neuman, S.P.; Yakowitz, S. A statistical approach to the inverse problem of aquifer hydrology: 1. Theory. Water Resour. Res. 1979, 15, 845–860. [Google Scholar] [CrossRef]
  14. Neuman, S.P.; Fogg, G.E.; Jacobson, E.A. A statistical approach to the inverse problem of aquifer hydrology: 2. Case study. Water Resour. Res. 1980, 16, 33–58. [Google Scholar] [CrossRef]
  15. Yakowitz, S.; Duckstein, L. Instability in aquifer identification: Theory and case studies. Water Resour. Res. 1980, 16, 1045–1064. [Google Scholar] [CrossRef]
  16. Neuman, S.P. A statistical approach to the inverse problem of aquifer hydrology: 3. Improved solution method and added perspective. Water Resour. Res. 1980, 16, 331–346. [Google Scholar] [CrossRef]
  17. Carrera, J.; Neuman, S.P. Estimation of Aquifer Parameters Under Transient and Steady State Conditions: 1. Maximum Likelihood Method Incorporating Prior Information. Water Resour. Res. 1986, 22, 199–210. [Google Scholar] [CrossRef]
  18. Carrera, J.; Neuman, S.P. Estimation of Aquifer Parameters Under Transient and Steady State Conditions: 2. Uniqueness, Stability, and Solution Algorithms. Water Resour. Res. 1986, 22, 211–227. [Google Scholar] [CrossRef]
  19. Yeh, W.W.G. Review of Parameter Identification Procedures in Groundwater Hydrology: The Inverse Problem. Water Resour. Res. 1986, 22, 95–108. [Google Scholar] [CrossRef]
  20. Carrera, J.R. State of the art of the inverse problem applied to the flow and solute transport equations. In Groundwater Flow and Quality Modelling; Custodio, E., Gurgui, A., Lobo Ferreira, J., Eds.; NATO ASI Series. Series C, Mathematical and Physical Sciences; D. Reidel Publishing Company: Dordrecht, The Netherlands, 1988; pp. 549–583. [Google Scholar]
  21. Ginn, T.R.; Cushman, J.H. Inverse methods for subsurface flow: A critical review of stochastic techniques. Stoch. Hydrol. Hydraul. 1990, 4, 1–26. [Google Scholar] [CrossRef]
  22. Sun, N.Z.; Yeh, W.G. Coupled inverse problems in groundwater modeling: 2. Identifiability and experimental design. Water Resour. Res. 1990, 26, 2527–2540. [Google Scholar] [CrossRef]
  23. Chavent, G. On the theory and practice of non-linear least-squares. Adv. Water Resour. 1991, 14, 55–63. [Google Scholar] [CrossRef]
  24. Certes, C.; de Marsily, G. Application of the pilot point method to the identification of aquifer transmissivities. Adv. Water Resour. 1991, 14, 284–300. [Google Scholar] [CrossRef]
  25. Carrera, J.; Glorioso, L. On geostatistical formulations of the groundwater flow inverse problem. Adv. Water Resour. 1991, 14, 273–283. [Google Scholar] [CrossRef]
  26. Parravicini, G.; Giudici, M.; Morossi, G.; Ponzini, G. Minimal a priori assignment in a direct method for determining phenomenological coefficients uniquely. Inverse Probl. 1995, 11, 611–629. [Google Scholar] [CrossRef]
  27. Giudici, M.; Morossi, G.; Parravicini, G.; Ponzini, G. A New Method for the Identification of Distributed Transmissivities. Water Resour. Res. 1995, 31, 1969–1988. [Google Scholar] [CrossRef]
  28. McLaughlin, D.; Townley, L.R. A Reassessment of the Groundwater Inverse Problem. Water Resour. Res. 1996, 32, 1131–1161. [Google Scholar] [CrossRef]
  29. Ferraresi, M.; Todini, E.; Vignoli, R. A solution to the inverse problem in groundwater hydrology based on Kalman filtering. J. Hydrol. 1996, 175, 567–581. [Google Scholar] [CrossRef]
  30. Vázquez González, R.; Giudici, M.; Ponzini, G.; Parravicini, G. The Differential System Method for the Identification of Transmissivity and Storativity. Transp. Porous Media 1997, 26, 339–371. [Google Scholar] [CrossRef]
  31. Yeh, T.C.J.; Liu, S. Hydraulic tomography: Development of a new aquifer test method. Water Resour. Res. 2000, 36, 2095–2105. [Google Scholar] [CrossRef]
  32. Knowles, I.; Yan, A. On the recovery of transport parameters in groundwater modelling. J. Comput. Appl. Math. 2004, 171, 277–290. [Google Scholar] [CrossRef]
  33. Kuhlman, K.L.; Hinnell, A.C.; Mishra, P.K.; Yeh, T.C.J. Basin-Scale Transmissivity and Storativity Estimation Using Hydraulic Tomography. Groundwater 2008, 46, 706–715. [Google Scholar] [CrossRef]
  34. Ginn, T.; Haeri, H.; Massoudieh, A.; Foglia, L. Notes on groundwater age in forward and inverse modeling. Transp. Porous Media 2009, 79, 117–134. [Google Scholar] [CrossRef]
  35. Mariethoz, G.; Renard, P.; Caers, J. Bayesian inverse problem and optimization with iterative spatial resampling. Water Resour. Res. 2010, 46, W11530. [Google Scholar] [CrossRef]
  36. Baratelli, F.; Giudici, M.; Vassena, C. Single and dual-domain models to evaluate the effects of preferential flow paths in alluvial sediments. Transp. Porous Media 2011, 87, 465–484. [Google Scholar] [CrossRef]
  37. Baratelli, F.; Giudici, M.; Parravicini, G. Single- and dual-domain models of solute transport in alluvial sediments: The effects of heterogeneity structure and spatial scale. Transp. Porous Media 2014, 105, 315–348. [Google Scholar] [CrossRef]
  38. Ackerer, P.; Trottier, N.; Delay, F. Flow in double-porosity aquifers: Parameter estimation using an adaptive multiscale method. Adv. Water Resour. 2014, 73, 108–122. [Google Scholar] [CrossRef]
  39. Liu, F.; Yeh, T.C.J.; Wang, Y.L.; Hao, Y.; Wen, J.C.; Wang, W. Characterization of basin-scale aquifer heterogeneity using transient hydraulic tomography with aquifer responses induced by groundwater exploitation reduction. J. Hydrol. 2020, 588, 125137. [Google Scholar] [CrossRef]
  40. Kool, J.; Parker, J.; van Genuchten, M. Parameter estimation for unsaturated flow and transport models—A review. J. Hydrol. 1987, 91, 255–293. [Google Scholar] [CrossRef]
  41. Hopmans, J.W.; Šimůnek, J.; Romano, N.; Durner, W. 3.6.2. Inverse Methods. In Methods of Soil Analysis; John Wiley & Sons, Ltd.: Hoboken, NJ, USA, 2002; pp. 963–1008. [Google Scholar] [CrossRef]
  42. Dai, Z.; Samper, J. Inverse problem of multicomponent reactive chemical transport in porous media: Formulation and applications. Water Resour. Res. 2004, 40, W07407. [Google Scholar] [CrossRef]
  43. Zheng, L.; Samper, J. Formulation of the inverse problem of non-isothermal multiphase flow and reactive transport in porous media. In Computational Methods in Water Resources: Volume 2; Developments in Water Science; Miller, C.T., Pinder, G.F., Eds.; Elsevier: Amsterdam, The Netherlands, 2004; Volume 55, pp. 1317–1327. [Google Scholar] [CrossRef]
  44. Condon, L.E.; Kollet, S.; Bierkens, M.F.P.; Fogg, G.E.; Maxwell, R.M.; Hill, M.C.; Fransen, H.J.H.; Verhoef, A.; Van Loon, A.F.; Sulis, M.; et al. Global Groundwater Modeling and Monitoring: Opportunities and Challenges. Water Resour. Res. 2021, 57, e2020WR029500. [Google Scholar] [CrossRef]
  45. Ibbitt, R.P.; O’Donnell, T. Fitting Methods for Conceptual Catchment Models. ASCE J. Hydraul. Div. 1971, 97, 1331–1342. [Google Scholar] [CrossRef]
  46. Ibbitt, R.P. Effects of random data errors on the parameter values for a conceptual model. Water Resour. Res. 1972, 8, 70–78. [Google Scholar] [CrossRef]
  47. Clarke, R. A review of some mathematical models used in hydrology, with observations on their calibration and use. J. Hydrol. 1973, 19, 1–20. [Google Scholar] [CrossRef]
  48. Natale, L.; Todini, E. A stable estimator for linear models: 1. Theoretical development and Monte Carlo Experiments. Water Resour. Res. 1976, 12, 667–671. [Google Scholar] [CrossRef]
  49. Natale, L.; Todini, E. A stable estimator for linear models: 2. Real world hydrologic applications. Water Resour. Res. 1976, 12, 672–676. [Google Scholar] [CrossRef]
  50. Johnston, P.R.; Pilgrim, D.H. Parameter optimization for watershed models. Water Resour. Res. 1976, 12, 477–486. [Google Scholar] [CrossRef]
  51. Mein, R.G.; Brown, B.M. Sensitivity of optimized parameters in watershed models. Water Resour. Res. 1978, 14, 299–303. [Google Scholar] [CrossRef]
  52. Sorooshian, S.; Dracup, J.A. Stochastic parameter estimation procedures for hydrologie rainfall-runoff models: Correlated and heteroscedastic error cases. Water Resour. Res. 1980, 16, 430–442. [Google Scholar] [CrossRef]
  53. Hino, M.; Hasebe, M. Analysis Of Hydrologic Characteristics from Runoff Data—A Hydrologic Inverse Problem. J. Hydrol. 1981, 49, 287–313. [Google Scholar] [CrossRef]
  54. Kuczera, G. Improved parameter inference in catchment models: 1. Evaluating parameter uncertainty. Water Resour. Res. 1983, 19, 1151–1162. [Google Scholar] [CrossRef]
  55. Duan, Q.; Sorooshian, S.; Gupta, V. Effective and efficient global optimization for conceptual rainfall-runoff models. Water Resour. Res. 1992, 28, 1015–1031. [Google Scholar] [CrossRef]
  56. Beven, K.; Binley, A. The future of distributed models: Model calibration and uncertainty prediction. Hydrol. Process. 1992, 6, 279–298. [Google Scholar] [CrossRef]
  57. Sorooshian, S.; Duan, Q.; Gupta, V.K. Calibration of rainfall-runoff models: Application of global optimization to the Sacramento Soil Moisture Accounting Model. Water Resour. Res. 1993, 29, 1185–1194. [Google Scholar] [CrossRef]
  58. Duan, Q.; Sorooshian, S.; Gupta, V.K. Optimal use of the SCE-UA global optimization method for calibrating watershed models. J. Hydrol. 1994, 158, 265–284. [Google Scholar] [CrossRef]
  59. Boyle, D.; Gupta, H.; Sorooshian, S. Toward improved calibration of hydrologic models: Combining the strengths of manual and automatic methods. Water Resour. Res. 2000, 36, 3663–3674. [Google Scholar] [CrossRef]
  60. White, L.W.; Vieux, B.; Armand, D. Surface flow model: Inverse problems and predictions. Adv. Water Resour. 2002, 25, 317–324. [Google Scholar] [CrossRef]
  61. White, L.W.; Vieux, B.; Armand, D.; LeDimet, F.X. Estimation of optimal parameters for a surface hydrology model. Adv. Water Resour. 2003, 26, 337–348. [Google Scholar] [CrossRef]
  62. Perrin, C.; Michel, C.; Andréassian, V. Improvement of a parsimonious model for streamflow simulation. J. Hydrol. 2003, 279, 275–289. [Google Scholar] [CrossRef]
  63. Madsen, H. Parameter estimation in distributed hydrologycal catchment modelling using automatic calibration with multiple objectives. Adv. Water Resour. 2003, 26, 205–216. [Google Scholar] [CrossRef]
  64. Kavetski, D.; Kuczera, G.; Franks, S.W. Bayesian analysis of input uncertainty in hydrological modeling: 1. Theory. Water Resour. Res. 2006, 42, W03407. [Google Scholar] [CrossRef]
  65. Kavetski, D.; Kuczera, G.; Franks, S.W. Bayesian analysis of input uncertainty in hydrological modeling: 2. Application. Water Resour. Res. 2006, 42, W03408. [Google Scholar] [CrossRef]
  66. Wagener, T.; Wheater, H.S. Parameter estimation and regionalization for continuous rainfall-runoff models including uncertainty. J. Hydrol. 2006, 320, 132–154. [Google Scholar] [CrossRef]
  67. Sitterson, J.; Knightes, C.; Parmar, R.; Wolfe, K.; Avant, B.; Muche, M. An Overview of Rainfall-Runoff Model Types; Technical Report EPA/600/R-17/482; U.S. Environmental Protection Agency: Washington, DC, USA, 2018. [Google Scholar]
  68. Blasone, R.; Madsen, H.; Rosbjerg, D. Parameter estimation in distributed hydrological modelling: Comparison of global and local optimisation techniques. Nord. Hydrol. 2007, 38, 451–476. [Google Scholar] [CrossRef]
  69. Abbas, S.A.; Bailey, R.T.; White, J.T.; Arnold, J.G.; White, M.J.; Čerkasova, N.; Gao, J. A framework for parameter estimation, sensitivity analysis, and uncertainty analysis for holistic hydrologic modeling using SWAT+. Hydrol. Earth Syst. Sci. 2024, 28, 21–48. [Google Scholar] [CrossRef]
  70. Vozoff, K.; Jupp, D.L.B. Joint Inversion of Geophysical Data. Geophys. J. R. Astron. Soc. 1975, 42, 977–991. [Google Scholar] [CrossRef]
  71. Giudici, M.; Baratelli, F.; Cattaneo, L.; Comunian, A.; Filippis, G.D.; Durante, C.; Giacobbo, F.; Inzoli, S.; Mele, M.; Vassena, C. A conceptual framework for discrete inverse problems in geophysics. arXiv 2021, arXiv:1901.07937. [Google Scholar] [CrossRef]
  72. Landau, L.; Lifshitz, E. Fluid Mechanics, 2nd ed.; Butterworth-Heinemann: Oxford, UK, 1987. [Google Scholar]
  73. Batchelor, G.K. An Introduction to Fluid Dynamics; Cambridge Mathematical Library, Cambridge University Press: Cambridge, UK, 2000. [Google Scholar]
  74. Bird, R.; Stewart, W.; Lightfoot, E. Transport Phenomena; Number v. 1 in Transport Phenomena; Wiley: Hoboken, NJ, USA, 2006. [Google Scholar]
  75. Saint-Venant, A.B. Théorie du mouvement non permanent des eaux, avec application aux crues des rivières et a l’introduction de marées dans leurs lits. C. R. Acad. Sci. 1871, 73, 147–154,237–240. [Google Scholar]
  76. Yen, B.; Tsai, C.S. On noninertia wave versus diffusion wave in flood routing. J. Hydrol. 2001, 244, 97–104. [Google Scholar] [CrossRef]
  77. Todini, E. A mass conservative and water storage consistent variable parameter Muskingum-Cunge approach. Hydrol. Earth Syst. Sci. 2007, 11, 1645–1659. [Google Scholar] [CrossRef]
  78. Yang, Y.; Endreny, T.A. Watershed hydrograph model based on surface flow diffusion. Water Resour. Res. 2013, 49, 507–516. [Google Scholar] [CrossRef]
  79. Dooge, J.C.I. A general theory of the unit hydrograph. J. Geophys. Res. 1959, 64, 241–256. [Google Scholar] [CrossRef]
  80. Wang, G.T.; Wu, K. The unit-step function response for several hydrological conceptual models. J. Hydrol. 1983, 62, 119–128. [Google Scholar] [CrossRef]
  81. Natural Resources Conservation Service Conservation Engineering Division, USDA. Urban Hydrology for Small Watersheds; Technical Report Technical Release 55; United States Department of Agriculture (USDA): Washington, DC, USA, 1986. [Google Scholar]
  82. Valota, G.; Giudici, M.; Parravicini, G.; Ponzini, G.; Romano, E. Is the Forward Problem of Ground Water Hydrology Always Well Posed? Ground Water 2002, 40, 500–508. [Google Scholar] [CrossRef]
  83. McIntosh, C. Cambridge Dictionary—English Thesaurus. Available online: https://dictionary.cambridge.org/thesaurus (accessed on 11 December 2023).
  84. CollinsDictionary.com. Collins Thesaurus. Available online: https://www.collinsdictionary.com/dictionary/english-thesaurus (accessed on 11 December 2023).
  85. Merriam-Webster, Inc. The Merriam-Webster Thesaurus. Available online: https://www.merriam-webster.com/thesaurus (accessed on 11 December 2023).
  86. Oxford English Dictionary. Historical Thesaurus. Available online: https://www.oed.com/thesaurus/start (accessed on 11 December 2023).
  87. Compo, G.P.; Whitaker, J.S.; Sardeshmukh, P.D.; Matsui, N.; Allan, R.J.; Yin, X.; Gleason, B.E.; Vose, R.S.; Rutledge, G.; Bessemoulin, P.; et al. The Twentieth Century Reanalysis Project. Q. J. R. Meteorol. Soc. 2011, 137, 1–28. [Google Scholar] [CrossRef]
  88. Kaiser-Weiss, A.K.; Borsche, M.; Niermann, D.; Kaspar, F.; Lussana, C.; Isotta, F.A.; van den Besselaar, E.; van der Schrier, G.; Undén, P. Added value of regional reanalyses for climatological applications. Environ. Res. Commun. 2019, 1, 071004. [Google Scholar] [CrossRef]
  89. IPCC. Climate Change 2007: Synthesis Report. Contribution of Working Groups I, II and III to the Fourth Assessment Report of the Intergovernmental Panel on Climate Change; International Panel on Climate Change: Geneva, Switzerland, 2007. [Google Scholar]
  90. Bray, D.; von Storch, H. “Prediction” or “Projection”?: The Nomenclature of Climate Science. Sci. Commun. 2009, 30, 534–543. [Google Scholar] [CrossRef]
  91. Karniadakis, G.E.; Kevrekidis, I.G.; Lu, L.; Perdikaris, P.; Wang, S.; Yang, L. Physics-informed machine learning. Nat. Rev. Phys. 2021, 3, 422–440. [Google Scholar] [CrossRef]
  92. Schuster, G.T.; Chen, Y.; Feng, S. Review of Physics-Informed Machine Learning Inversion of Geophysical Data. Geophysics 2024, 89, T337–T356. [Google Scholar] [CrossRef]
  93. Bandai, T.; Ghezzehei, T.A. Forward and inverse modeling of water flow in unsaturated soils with discontinuous hydraulic conductivities using physics-informed neural networks with domain decomposition. Hydrol. Earth Syst. Sci. 2022, 26, 4469–4495. [Google Scholar] [CrossRef]
  94. Aggarwal, C.C. Neural Networks and Deep Learning, 2nd ed.; Springer: Cham, Switzerland, 2023. [Google Scholar] [CrossRef]
  95. Comunian, A.; Gaburro, R.; Giudici, M. Inversion of a SIR-based model: A critical analysis about the application to COVID-19 epidemic. Phys. D Nonlinear Phenom. 2020, 413, 132674. [Google Scholar] [CrossRef] [PubMed]
  96. Yapo, P.; Gupta, H.; Sorooshian, S. Multi-objective global optimization for hydrologic models. J. Hydrol. 1998, 204, 83–97. [Google Scholar] [CrossRef]
  97. Gupta, H.; Sorooshian, S.; Yapo, P. Toward improved calibration of hydrologic models: Multiple and noncommensurable measures of information. Water Resour. Res. 1998, 34, 751–763. [Google Scholar] [CrossRef]
  98. Vrugt, J.; Gupta, H.; Bastidas, L.; Bouten, W.; Sorooshian, S. Effective and efficient algorithm for multiobjective optimization of hydrologic models. Water Resour. Res. 2003, 39, 1214. [Google Scholar] [CrossRef]
  99. Aitken, A. Assessing systematic errors in rainfall-runoff models. J. Hydrol. 1973, 20, 121–136. [Google Scholar] [CrossRef]
  100. Doherty, J.; Hunt, R.J. Two statistics for evaluating parameter identifiability and error reduction. J. Hydrol. 2009, 366, 119–127. [Google Scholar] [CrossRef]
  101. Bennett, N.D.; Croke, B.F.; Guariso, G.; Guillaume, J.H.; Hamilton, S.H.; Jakeman, A.J.; Marsili-Libelli, S.; Newham, L.T.; Norton, J.P.; Perrin, C.; et al. Characterising performance of environmental models. Environ. Model. Softw. 2013, 40, 1–20. [Google Scholar] [CrossRef]
  102. Hadamard, J. Sur les problèmes aux dérivées partielles et leur signification physique. Princet. Univ. Bull. 1902, XIII, 49–52. [Google Scholar]
  103. Beven, K. Changing ideas in hydrology—The case of physically-based models. J. Hydrol. 1989, 105, 157–172. [Google Scholar] [CrossRef]
  104. Beven, K. A manifesto for the equifinality thesis. J. Hydrol. 2006, 320, 18–36. [Google Scholar] [CrossRef]
  105. Beven, K.; Binley, A. GLUE: 20 years on. Hydrol. Process. 2014, 28, 5897–5918. [Google Scholar] [CrossRef]
  106. Todini, E.; Biondi, D. Chapter 22. Calibration, parameter estimation, uncertainty, data assimilation, sensitivity analysis, and validation. In Handbook of Applied Hydrology, 2nd ed.; Singh, V.P., Ed.; Mc Graw Hill Education: New York, NY, USA, 2016. [Google Scholar]
  107. Todini, E. Chapter 26. Predictive Uncertainty Assessment and Decision Making. In Handbook of Applied Hydrology, 2nd ed.; Singh, V.P., Ed.; Mc Graw Hill Education: New York, NY, USA, 2016. [Google Scholar]
  108. Vrugt, J.A. Markov chain Monte Carlo simulation using the DREAM software package: Theory, concepts, and MATLAB implementation. Environ. Model. Softw. 2016, 75, 273–316. [Google Scholar] [CrossRef]
  109. Cattaneo, L.; Comunian, A.; de Filippis, G.; Giudici, M.; Vassena, C. Modeling Groundwater Flow in Heterogeneous Porous Media with YAGMod. Computation 2016, 4, 2. [Google Scholar] [CrossRef]
  110. De Filippis, G.; Giudici, M.; Margiotta, S.; Negri, S. Conceptualization and characterization of a coastal multi-layered aquifer system in the Taranto Gulf (southern Italy). Environ. Earth Sci. 2016, 75, 686. [Google Scholar] [CrossRef]
  111. Giudici, M.; Margiotta, S.; Mazzone, F.; Negri, S.; Vassena, C. Modelling hydrostratigraphy and groundwater flow of a fractured and karst aquifer in a Mediterranean basin (Salento peninsula, southeastern Italy). Environ. Earth Sci. 2012, 67, 1891–1907. [Google Scholar] [CrossRef]
  112. Ponzini, G.; Crosta, G. The comparison model method: A new arithmetic approach to the discrete inverse problem of groundwater hydrology. Transp. Porous Media 1988, 3, 415–436. [Google Scholar] [CrossRef]
  113. Ponzini, G.; Lozej, A. Identification of aquifer transmissivities: The comparison model method. Water Resour. Res. 1982, 18, 597–622. [Google Scholar] [CrossRef]
  114. Vassena, C.; Durante, C.; Giudici, M.; Ponzini, G. The importance of observations on fluxes to constrain ground water model calibration. Phys. Chem. Earth Parts A/B/C 2008, 33, 1105–1110. [Google Scholar] [CrossRef]
  115. Vassena, C.; Rienzner, M.; Ponzini, G.; Giudici, M.; Gandolfi, C.; Durante, C.; Agostani, D. Modeling water resources of a highly irrigated alluvial plain (Italy): Calibrating soil and groundwater models. Hydrogeol. J. 2012, 20, 449–467. [Google Scholar] [CrossRef]
  116. Lesnic, D. The comparison model method for determining the flexural rigidity of a beam. J. Inverse-Ill-Posed Probl. 2010, 18, 577–590. [Google Scholar] [CrossRef]
  117. Pasquier, P.; Marcotte, D. Steady- and transient-state inversion in hydrogeology by successive flux estimation. Adv. Water Resour. 2006, 29, 1934–1952. [Google Scholar] [CrossRef]
  118. Brouwer, G.; Fokker, P.; Wilschut, F.; Zijl, W. A Direct Inverse Model to Determine Permeability Fields from Pressure and Flow Rate Measurements. Math. Geosci. 2008, 40, 907. [Google Scholar] [CrossRef]
  119. Zijl, W. A Direct Method for the Identification of the Permeability Field Based on Flux Assimilation by a Discrete Analog of Darcy’s Law. Transp. Porous Media 2004, 56, 87–112. [Google Scholar] [CrossRef]
  120. Zijl, W.; De Smedt, F.; El-Rawy, M.; Batelaan, O. The Double Constraint Inversion Methodology; Springer: Berlin/Heidelberg, Germany, 2018. [Google Scholar] [CrossRef]
  121. Calderón, A.P. On an inverse boundary value problem. In Seminar on Numerical Analysis and Its Applications to Continuum Physics (Rio de Janeiro, 1980); Sociedade Brasileira de Matemática: Rio de Janeiro, Brazil, 1980; pp. 65–73. [Google Scholar]
  122. Heße, F.; Comunian, A.; Attinger, S. What We Talk About When We Talk About Uncertainty. Toward a Unified, Data-Driven Framework for Uncertainty Characterization in Hydrogeology. Front. Earth Sci. 2019, 7, 118. [Google Scholar] [CrossRef]
  123. Edwards, A. Likelihood; Cambridge University Press: Cambridge, UK, 1972. [Google Scholar]
Figure 1. Discretisation grid for the paradigmatic example. Nodes are denoted as black dots; arrows show the incoming fluxes through the borders of the cell i, appearing on the left-hand side of (3).
Figure 1. Discretisation grid for the paradigmatic example. Nodes are denoted as black dots; arrows show the incoming fluxes through the borders of the cell i, appearing on the left-hand side of (3).
Hydrology 11 00189 g001
Figure 2. Scheme of the arrays introduced in the text to state the FP, the DIP, and their relationships.
Figure 2. Scheme of the arrays introduced in the text to state the FP, the DIP, and their relationships.
Hydrology 11 00189 g002
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Giudici, M. Some Remarks About Forward and Inverse Modelling in Hydrology, Within a General Conceptual Framework. Hydrology 2024, 11, 189. https://doi.org/10.3390/hydrology11110189

AMA Style

Giudici M. Some Remarks About Forward and Inverse Modelling in Hydrology, Within a General Conceptual Framework. Hydrology. 2024; 11(11):189. https://doi.org/10.3390/hydrology11110189

Chicago/Turabian Style

Giudici, Mauro. 2024. "Some Remarks About Forward and Inverse Modelling in Hydrology, Within a General Conceptual Framework" Hydrology 11, no. 11: 189. https://doi.org/10.3390/hydrology11110189

APA Style

Giudici, M. (2024). Some Remarks About Forward and Inverse Modelling in Hydrology, Within a General Conceptual Framework. Hydrology, 11(11), 189. https://doi.org/10.3390/hydrology11110189

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop