1. Introduction
This work deals with stochastic models in fluid mechanics. The literature on the subject is very large, but it is mostly of theoretical nature. Having in mind potential applications, two main questions arise: (i) Why should we use stochastic models in fluid mechanics? (ii) Which noise is more interesting, the classical additive noise or other forms? Among various answers to these questions, one is based on stochastic model reduction, the topic discussed in this work. In a sentence, it claims that stochastic models may reduce the complexity of interaction between scales and the noise arising from such a reduction is not the classical additive noise added to the equations in most of the literature (which however is interesting for other reasons), but a multiplicative one of transport type, described here and in related works. We address an audience made up of both mathematicians and practitioners, and our hope is to contribute to the understanding of fluid mechanics PDEs with transport noise and how they are related to applications.
Going into details, in this paper we are interested in general models of geophysical fluid-dynamics with the following form,
      
The spatial domain on which the equations are studied is denoted by D and, depending on the particular problem under investigation, can be either two- or three-dimensional:  or 3. The unknowns of the equations above are the velocity vector field  and the pressure scalar field .
The quantity  describes the density of the fluid and is deduced from u by conservation of mass, mathematically translated as the continuity equation .
The term 
 represents the inertial force per unit of mass acting on the fluid due to advection, for instance,
      
      for Navier–Stokes equations, but it can assume different forms in certain regimes, like the small aspect ratio regime proper of Primitive Equations.
 represents the force per unit of mass of any dissipation mechanism acting on the fluid. Usually dissipation occurs via viscosity
      
      or friction
      
      or a combination of the two, depending on the model under consideration. Indeed, in general viscosity is due to the interaction of the fluid particles with themselves and is therefore an intrinsic property of the fluid: typical experimental values of 
 at room temperature and pressure are 
 m
/s for air and 
 m
/s for water. On the other hand, friction describes well the interaction of a roughly two-dimensional fluid with a solid bottom (or top) layer: for instance, for a fluid of depth 
h the value of 
 is related to 
 via the relation
      
In particular, friction forces dominate viscous forces (at low wavenumbers) for values of h which are small compared to the other typical lengths of the fluid, while viscous forces dominate friction forces for larger values of h. It is worth mentioning that in certain idealised models dissipation is neglected: , see, for instance, Euler equations and related models.
The term f represents any other force per unit of mass acting on the fluid, either inertial (e.g., the Coriolis force) or not.
System (
1) is usually accompanied with suitable boundary conditions on 
u, depending on the geometry of the domain and physical meaning.
Under the assumption of incompressibility 
 is constant and (
1) assumes the form
      
      so that continuity equation simplifies into the condition that the velocity vector field 
u is solenoidal: 
. Mathematically speaking, we assume in the following that 
J is a bilinear operator and 
 is a linear operator, possibly unbounded.
It is clear to everybody that atmospheric and oceanic dynamics show a superposition of structures of different sizes, ranging from continental, with order of magnitude of 1000 km, to human scale structures of size 1 m. In this paper we propose a stochastic model reduction procedure for deterministic geophysical fluid dynamics models of the form (
2), which, in our opinion, is able to isolate the evolution of large scale structures via a closed equation which is a stochastic modification of (
2), where a transport-type Stratonovich noise is sufficient to model, with a certain degree of approximation, the influence of the small scale structures on the large scales ones.
The literature on the topic of either stochastic or deterministic model reduction is wide and the motivations beyond the interest in reduction procedures for geophysical fluid-dynamics models are several.
From the numerical point of view, especially when one is interested in the simulations of complex turbulent flows like weather forecast, one necessarily has to deal with the fact that limited computational power often implies an under-representation of the real physical processes with spatial or temporal scale smaller than a certain threshold, typically the length of the grid parametrisation and the time discretisation step. However, these small scale processes may have a non-trivial impact on the large scales ones, and thus it is important to take this impact into the account in order to obtain accurate description of the evolution of the simulated process, see in [
1] and the references therein.
Another field of application is climate prediction [
2,
3,
4,
5]; indeed, the high complexity of real geophysical models allows an accurate forecast only for relatively short time intervals, that is, it is impossible to have good weather predictions over a time span greater than a few days. On the other hand, decreasing the complexity of a model allows for better error control in long-term simulations, thus opening the way to the study of climate tendency.
By the theoretical point of view, model reduction has always played a primary role in geophysics and, more generally, in fluid mechanics; here, model reduction is meant in the broad sense, as the operation of reducing the complexity of a model in order to conveniently describe certain phenomena. For example, if one is interested in the evolution of a certain geophysical flow on a relatively small portion of Earth’s surface, then the spherical geometry of the problem is usually not so important and the use of spherical coordinates is an unnecessary complication: it is way more convenient to study the problem in Cartesian coordinates. The dynamical effects of Earth’s rotation are therefore captured with the so-called 
f-plane approximation [
6] (and more generally with the 
-plane approximation), which constitutes a nice simplification of the problem yet capable of describing very interesting phenomena, like the motion of cyclonic flows at geostrophic balance and the Taylor–Proudman effect.
Our reduction procedure consists in splitting (
2) into a system of two coupled equations, describing the evolution of the large scale component 
 and the small scale component 
 separately. As already explained, we are not interested in solving explicitly the equation for the small scale process 
, which instead is modelled stochastically as described in 
Section 2. This operation can be performed whenever the structures produced in a geophysical system have a wide range of spatial scales, which corresponds to a wide range of temporal scales. For the sake of modelling, among the various temporal scales, we select three particular of them satisfying certain relations, see below for details. The stochastic modelling depends on a parameter 
 describing the separation between these temporal scales, and our result, obtained by taking the limit of infinite separation of time scales, consists in the convergence of the large scale velocity 
 towards the solution of the stochastic equation
      
      where 
W is a Brownian motion, 
 and 
 are suitable coefficients and 
 denotes the large scale forces acting on the fluid. Our results therefore add further motivation to the study of transport-type noise in equations from fluid-mechanics, which started with the works in [
7,
8,
9,
10] and has received a lot of attention in the last years, see in [
11,
12,
13,
14,
15] and more recently in [
16,
17,
18].
Our approach differs from the many already available in the literature for being purely infinite-dimensional. In fact, although finite-dimensional models are usually sufficient to provide good numerical simulations of the real geophysical processes, for the theoretical motivations explained above it is important to have reduction procedures that act directly on the infinite-dimensional model under investigation. In our particular case, the special form of the limiting equation (stochastic PDE with Stratonovich transport noise) gives access to a vast range of results and techniques from stochastic analysis to study some properties of a geophysical system like, for instance, the existence of invariant measures, ergodicity, Large Deviations estimates for small intensity of the noise, and others.
  2. Main Results
First of all, we clarify from the beginning that the theory illustrated in this work applies to systems with a wide range of space-time scales, this sentence to be understood as explained below. Among this variety of scales, for the sake of modelling we identify three reference scales that constitutes the basis of our analysis.
Concerning the time scales, we need a small time scale 
, which is the characteristic time of the small scale dynamics, we need an intermediate scale 
, and then we need a third, large time scale 
 typical of the large scale dynamics. The following relation will play a role,
      
In terms of spatial scales, we take three reference scales: one small scale , one intermediate scale  and one large scale . The scales  and  are understood, respectively, as the characteristic length of small-scale and large-scale dynamics.
The specific values of scales , , ,  and  are not fundamental in our analysis, and can be modified for other applications of our arguments. Relations between spatial and temporal scales are specified below.
An example, although ideal, may be the lower-layer atmospheric fluid over a large region, which interacts with the irregularities of the ground. This system can be described, with a certain degree of approximation, by means of the ideal model (
2):
Suppose we are observing our system at a certain combination of space-time scales 
 and 
. Dimensional analysis of (
2) above gives the following identity,
      
      where 
 is the reference order of magnitude of velocities and 
 is the reference order of magnitude of forces per unit of mass in the system (
2). Hereafter, we adopt the natural choice
      
The last reference quantity we introduce is reference mass 
, which for convenience we take as
      
Following [
6], Equations (
2) can be non-dimensionalised via the substitutions:
      and take the form
      
      where 
 and 
∇ are nondimensional derivatives with respect to variables 
 and 
, and the non-dimensionalised density 
 is unitary thanks to (
4).
  2.1. Small Scale
By small scale we mean the system observed by the point of view of an observer whose characteristic unit of measure are small, that is,
        
Assume we split the initial conditions according to some reasonable rule (geometric, spectral…), in large and small scales
        
Small scales describe the fluid fluctuations at space distances of order ; large scales those which impact at the regional level (national, continental), namely, with structures with size of order . We assume this separation of scales at time .
Given this separation of the initial datum, we split (
2) into the following system of equations,
        
        where 
 corresponds to large scale external forces and 
 incorporates the small scale inputs due to ground irregularities. We assume that 
 acts on small scale, namely, it includes variations at distances of order 
, with changes in time in a range of order of 
. The property above can be reformulated in the following way; the non-dimensionalisation of 
 with reference magnitude given by 
 is of order one
        
        and 
 has typical variations at distances and times of order one. We assume that similar properties hold for the small scale dissipation term 
. In particular, under suitable assumptions on the initial condition 
, the non-dimensionalised small scale velocity 
 with reference magnitude given by 
 is of order one as well:
In addition, 
 undergoes appreciable changes over time intervals and distances of order one, in formulae
        
        where 
 and 
∇ are nondimensional derivatives with respect to variables 
 and 
.
Remark 1. It is easy to check that the splitting (5) is consistent with (2), in the sense that if  is a solution of (5), then  is a solution of (2). However, we point out that a priori one could have split the equation in a different way, for instance, exchanging the role of  and : both splittings would have been consistent with the initial equation. In other words, the physics only prescribes the evolution of  and not the evolution of  and  individually, and therefore the choice of a splitting for (2) corresponds de facto in the choice of a model for the evolution of  and  separately, and vice versa. The main issue here is that not every splitting is also consistent with the heuristic idea that the two components of the system model the dynamics of large and small structures separately. As far as this is concerned, the splitting (5) is part of the trend called location uncertainty [1], which prescribes the evolution of  in a way that is substantially equivalent to the splitting of (2) at the level of velocity. Nevertheless, motivated by the works in [15,19,20,21], we also point out as a possible alternative approach the so-called stochastic advection by Lie transport. According to this scheme, the evolution of  is prescribed in a manner that is basically equivalent to the splitting of (2) at the level of vorticity, see, for instance, in [22] and Theorem 1 below.  For the reader’s convenience, we rewrite system (
5) in non-dimensionalised variables:
  2.2. Intermediate Scale
Let us observe the same system from the viewpoint of an observer whose reference unit of measure are
        
Assume that the order of magnitude of 
 and 
 are comparable:
As a result, the non-dimensionalised velocity 
 has the same order of magnitude, independently of the choice of 
 or 
 as reference unit of measure. However, the typical time of the fluctuations of the small scale velocity 
 is 
: this implies that non-dimensionalising the velocity with respect to reference measure 
 gives a non-dimensionalised velocity process with fluctuations of typical period
        
Similarly, as 
 changes in space over distances of order 
, the non-dimensionalised velocity process with respect to reference measure 
 changes in space over distances of order
        
In formulae, the arguments above can be summarised as follows,
        
        and
        
        where 
 and 
∇ are nondimensional derivatives with respect to variables 
 and 
.
This motivates our main modelling assumption, see also in [
23,
24,
25]. We replace the small scales by a stochastic equation, Gaussian conditionally to the large scales; that is, we replace the second equation in (
6) by
        
        where 
 is a Brownian motion on the velocity space 
, with 
 and possibly additional boundary conditions, and 
 are positive constants. The condition 
 is not restrictive for our purpose, see Remark 4 below. For technical reasons we assume that the space covariance of 
 is sufficiently regular. For the sake of simplicity we take (cfr. also the discussion in [
26])
        
        where 
 is a family of independent standard Brownian motions on a given probability space 
 and 
 for every 
 with
        
We make this modelling choice for a number of reasons: first, we work under the implicit assumption that quickly varying fluctuations in the small scales dynamics are given by the combined effect of a large number of weakly coupled factors, so that Central Limit Theorem applies. Therefore, it is natural to model the self-interaction  and the external forcing  with a Gaussian source of noise.
The presence of the damping term 
 simulates dissipation, where 
 is of order one. The coefficient 
 in front of the damping is motivated by the fact that, in the regime under investigation, the velocity 
 is of order one, while the dissipative forces acting on the fluid are of order
        
        and therefore a coefficient 
 is needed to make damping of the same order of magnitude as dissipation.
Finally, given the factor 
 in front of the damping, we observe that the coefficient in front of the random term 
, which models 
 and 
, is the only compatible with the fact that 
 is of order one, with typical period of fluctuation of order 
. Indeed, neglecting for simplicity the terms 
 and 
 in the equation for 
 and taking 
, one has
        
        and the covariance between 
 and 
 is equal to
        
        in accordance with the fact that 
 is of order one, as its variance is approximately equal to 
, and has typical period of fluctuation of order 
, as the covariance between 
 and 
 decays approximately as 
. These two properties can not hold simultaneously with a random term of the form 
, 
, thus motivating our choice.
Moreover, in [
23] it is shown, under certain hypotheses on the spatial correlation of the noise, that a model similar to that considered here is capable of representing in silico the main statistical properties of two-dimensional turbulence: energy spectra, inverse energy cascade and direct enstrophy cascade. This fact adds further justification to our modelling choice.
We remark that, in addition to the physical motivation just discussed behind our modelisation, there is also a practical reason: indeed, the Ornstein–Uhlenbeck process 
 given by
        
        is mathematically very treatable, thus making possible explicit computations for (
7).
Remark 2. A posteriori, we will see that the large scale non-dimensionalised large scale velocity process  is of order ϵ when expressed with respect to reference measure  (see subsection below). Using this, together with the fact that  is of order  when expressed with respect to , one has at intermediate scales. On the other hand, for the quadratic self-interaction and external forces we haveat intermediate scales. This suggests that the scattering term  plays only a minor role in the dynamics of , which can be made rigorous in some particular case.    2.3. Large Scale
By this we mean the same system, lower atmospheric layer over a large region, observed by a satellite. The unit of measure are
        
We assume now the following relation,
        
        which corresponds to
        
Equation (
7) becomes
        
        where 
 satisfies 
, in particular 
 also is a Brownian motion.
Now go back to the equation for the large-scale velocity:
Assume that the typical order of magnitude of 
 is 
, that is, the large-scale structures travel a distance of order 
 in a time of order 
. This means that the non-dimensionalisation of 
 with reference magnitude given by 
 is of order one
        
        and also
        
        where 
 and 
∇ are nondimensional derivatives with respect to variables 
 and 
.
Similarly, the forces acting on 
 due to pressure, dissipation and external sources are of magnitude 
, so that their non-dimensionalisation with reference magnitude given by 
 is of order one as well. Therefore, looking at the whole system in non-dimensionalised variables, with unit of measure 
, it takes the following form (recall that by assumptions 
 is divergence-free),
        
To ease the notation we denote  in the following.
Remark 3. Looking at (11) above at large scales, one immediately notice that all the terms in the equation for  are of order one, except . Indeed, as the non-dimensionalised small scale velocity  is of order  when expressed with respect the reference velocity , the term  is of order  as well. However, for small ϵ, the quickly varying-in-time of  has an averaging effect on the term , which thus converges (in a suitable sense) to noise of transport type, see subsection below.    2.4. Asymptotic Behaviour of Coupled System
As already said, we are interested in the large scale component 
 of (
11) above. In particular our goal is to find a new equation for 
 which is closed in 
, namely, we do not want to solve for 
 in order to compute the coupling term 
. We notice that, in the limit as 
, the small-scale velocity is well approximated by the stationary Ornstein–Uhlenbeck process 
 given by
        
        almost independently on the initial condition 
, as time correlation decays as 
. The process 
 formally converges to a white-in-time noise, because of the following computation,
        
The asymptotic behaviour of (
11) as 
 can therefore be studied in a rigorous mathemathical framework as an example of Wong–Zakai approximation principle for stochastic PDEs. Starting from the seminal work of Wong and Zakai [
27], a number of results have been obtained in this direction: we mention among others the works in [
28,
29,
30,
31,
32,
33] and, more recently, those in [
34,
35] based on rough path theory. The aforementioned results suggest, as a rule of thumb, to interpret the formal limit of 
 as a white-in-time noise in Stratonovich sense, that is, for every suitable process 
 and some appropriate notion of convergence
        
        where the latter is a limit in mean square.
Therefore, the candidate limit equation for the sole large scale velocity 
 is the following,
        
        where 
 stands for stochastic integration in the Stratonovich sense. In the particular case
        
        by bilinearity of 
J Equation (
12) above takes the more explicit form
        
Remark 4. In the argument above we have used the approximation , thus neglecting the terms  and , which is indeed the case if . We point out that in the general case  the process  does not converge to , but thank to the presence of the stochastic pressure term it converges to , where  is the solenoidal part in the Helmhotz decomposition of , satisfying In particular, the limit Equation (12) would have been the same, except for  replacing , and therefore the assumption  is not restrictive when investigating the limit behaviour of the large-scale velocity process.  Remark 5. By a physical point of view, taking the limit  in (11) corresponds to implicitly assume infinite separation of time scales. This hypothesis, although not matching strictly speaking with reality, constitutes a sufficiently good approximations and is a very practical working assumption. This also motivates the interest in the identification of the rate of convergence of Wong–Zakai approximations to their limits, see for instance [29,36]. However, we do not treat this problem here.  We summarise the heuristic discussion above with the following.
Conjecture 1. Fix. For every denote the solution of (
11) on the time interval 
 and let  be the solution of (12) on the time interval . Then, the following convergence in probability holds,where  and  are intended as random variables in .  A few remarks are in order.
First of all, we are tacitly assuming well-posedness of (
11) and (
12); otherwise, the result just conjectured may not have a precise meaning. A global well-posedness result in this abstract setting, however, is not available: the theory of incompressible equations from fluid-dynamics is very different depending on the dimension 
n, on the type of dissipation and on the boundary conditions. Therefore, it is impossible to unify everything in one single theorem, and each case must be treated separately.
At least two different situation are worth of special mention. The first is the case where well-posedness holds globally for (
12), but only locally for (
11), up to a (possibly random) time 
 converging to 
∞ as 
. In this case, the convergence of Conjecture 1 may still hold if we replace 
 with its stopped version 
: we are in front of global well-posedness 
in the limit. The second scenario is when (
12) is globally well-posed, but its deterministic counterpart (
2) is not: in some sense, the presence of the noise regularises the equations. Regularisation by noise has been widely investigated (see in [
37] and the references therein) and is still an active topic of research.
The second remark concerns the strategy of the proof of convergence 
. As already said, the validity of this result heavily depends on many factors, so we do not aim to give an universal approach to this problem, but rather some ideas. If dissipation in the large scale component of (
11) is sufficiently strong, then the evolution semigroup 
 is regularising and one can consider the mild formulation
        
If the quantities , , etc. are sufficiently well-behaved, then one can prove the desired convergence arguing as in some of the available works on Wong–Zakai principle we already mentioned.
Another strategy, that is specific for equations of transport type, may be the following. For simplicity, and having in mind Remark 1, we present the idea contained in [
22] for 2D Euler equations on the two dimensional torus 
 in vorticity form
        
        where 
K is the Biot–Savart kernel on 
, so that 
 and 
. Splitting the system above in large scale and small scale and non-dimensionalising we obtain
        
The large-scale component of the system above has an explicit solution: 
        where 
 is the initial condition and 
, 
 are the characteristics, which are given by the solution of
        
        and a similar formula holds for the limit equation. Therefore, it is possible to deduce a convergence result for the vorticity 
 (and as a consequence also for the velocity 
) from a convergence result at the level of characteristics. To be precise, in [
22] it is proved the following.
Theorem 1. Let  and take  such that (8) and (9) hold. For a zero-mean initial vorticity , let  be the solution of (13) and let  be the unique solution of the stochastic equation where . Then, the process  converges as  to  in the following sense; for every :as , for every fixed  and in  for every finite p. In addition, the velocity field  converges as , in mean value, to , as random variables in .