1. Introduction
Mesoscale oceanic eddies populate nearly all parts of the global ocean and play important roles in maintaining the oceanic general circulation (e.g., [
1]). The most straightforward, but also the most computationally intensive and, thus, unfeasible, way of accounting for the eddy effects on the large-scale circulation is by resolving them dynamically with eddy-resolving ocean general circulation models (OGCMs). This brute-force approach requires computational grids with nominal resolution of about 1 km, which makes it feasible only for relatively short-time simulations, whereas the Earth system and climate modelling routinely require multiple and much longer simulations over centuries and millenia. The only way to afford these time scales is to
parameterize the important eddy effects with simple and affordable but nevertheless accurate models embedded in non-eddy-resolving OGCMs. In this context, an
eddy parameterization is a parametric mathematical model for use in some coarse-grained, reduced-dynamics ocean circulation models. Ideally, parameters involved are to be related to the explicitly resolved, large-scale circulation properties, thus resulting in a turbulence closure for the eddy scales. Over the last few decades, the search for suitable eddy parameterizations has remained a challenging theoretical topic with a clear practical dimension.
Development of mesoscale eddy parameterizations can be also viewed as a strategy for advancing our knowledge about the eddy dynamics and eddy/large-scale flow interactions. The prevailing approach strives to find some sort of diffusive closure for the eddy transport, assuming that eddies transport the corresponding flow property down its large-scale gradient and along isopycnals. In this sense, all ocean models implement eddy viscosity (e.g., Laplacian or biharmonic) for downgradient diffusion of momentum. Many models also implement eddy buoyancy diffusion [
2], which simulates real effects of the baroclinic instability on large-scale currents and, thus, substantially improves model solutions in many parts of the global ocean. The baroclinic instability effects can be also parameterized in terms of large vertical viscosity in the momentum equations [
3,
4]. Eddy diffusion of potential vorticity is another parameterization approach [
5,
6], but its closure and ultimate implementations in OGCMs remain to be worked out. Along these lines, Eden [
7] argued that, for some flows, eddy diffusivities of buoyancy and potential vorticity (PV) can be identical, but for general flows this is not true [
8].
Most of the diffusive parameterization theories focus on either estimating various eddy diffusivities from data (e.g., [
9,
10]) or deriving eddy diffusivities from the large-scale flow properties. Several ideas were proposed for the latter. The classical idea is to relate linearly eddy diffusivity and local strain rate tensor of the large-scale flow [
11], but this approach is phenomenological, and there is no theory for the linear coefficient involved. One idea is to find eddy diffusivity from the most unstable normal mode of the locally homogeneous linear-stability analysis (e.g., [
12,
13]). This can be interpreted as the single-wavenumber approach, and it was recently extended to incorporate a band of wavenumbers [
14]. The other idea is that an eddy diffusivity coefficient can be found by assuming that the relevant time scale is given by the Eady growth rate, which in turn depends on the local stratification and magnitude of the large-scale vertical shear [
15]. This can be viewed as an
f-plane argument that does not take into account the planetary PV gradient and its orientation relative to the large-scale velocity and its shear. There are also eddy diffusivity theories for horizontally homogeneous flows [
16], but their applicability to inhomogeneous flows is unclear. Another idea for deriving an eddy diffusivity closure is that a fully developed equilibrium state of the baroclinic turbulence is characterized by the comparable growth rates of primary and secondary instabilities of the large-scale flow patterns [
17]. Finally, various spectral homogeneous-turbulence diffusivity approaches (e.g., [
18]) base derivation of scale-dependent eddy diffusivity on the existence of a universal turbulent energy spectrum and on the mixing-length arguments; therefore, they do not apply to oceanic mesoscale eddies that are spatially inhomogeneous and exhibit no universal spectra.
Eventually, eddy diffusivities must be somehow derived from the flow dynamics, but until this happens, one can try at least to constrain the involved eddy diffusivities by the physical conservation laws assumed by the underlying dynamics. There were several studies introducing energy and momentum conservation constraints [
19,
20,
21,
22], but each approach takes into account conservation of a quadratic quantity at the expense of solving a dynamical prognostic equation for its evolution. This equation involves its own assumptions and approximations. It is argued that taking into account horizontal and vertical variations, as well as anisotropy of eddy diffusivities should be a high priority (e.g., [
8,
9]), but it remains unclear to what extent these factors can be captured along with assuming local flow homogeneity. Alternatively, one can assume specific spatial patterns of the eddy diffusivity (e.g., [
23]), but this approach remains to be generalized. Finally, it remains unclear what to do with negative eddy diffusivities that arise from common situations with upgradient eddy fluxes [
24] and make the whole eddy diffusion model ill-posed. Along this line, Jansen [
25] proposed parameterizing upscale energy transfers by the negative Laplacian eddy viscosity stabilized by the additional hyperviscosity. Overall, relatively few studies propose abandoning the diffusion and look for other frameworks.
An emerging theoretical alternative to the diffusion is to account for the negative-diffusion eddy effects by imposing random forcing that induces upscale energy transfers and flow rectification. The forcing randomness is justified by highly transient and structurally complicated patterns of the actual eddy fluxes (e.g., [
26,
27,
28]). First studies of this kind were made within the classical homogeneous-turbulence approach (e.g., [
29,
30,
31]), and only more recent studies are spatially inhomogeneous (e.g., [
32,
33,
34]). Determining random forcing, deriving its parameters and relating them to the large-scale flow remains to be done. So far, construction of random forcing is achieved by fitting patterns from the eddy-resolving simulations (e.g., [
35,
36]).
Another way to overcome problems of the diffusion approach is to solve explicitly some intermediate-complexity dynamical model for the momentum, buoyancy or PV ([
37] (hereafter B15), [
38] (hereafter G15)). In G15, the intermediate-complexity model relied on the local homogeneity assumption. It was driven by spatially correlated white-noise stochastic forcing, under assumption that the damping rates are consistent with
and
turbulent energy spectra for the long and short length scales, respectively. Here, parameters controlling the damping rates and stochastic-forcing structural properties remain to be justified and constrained. Choice of the applied time-averaging interval is another concern because intermediate-complexity model possesses modes that are linearly unstable and exponentially growing in time. Although the intermediate dynamics require extra computational costs, they can be alleviated by pre-computing the solutions for all possible configurations of the large-scale flow. The B15 study offers another direct dynamical approach, which is a precursor of this paper. This approach is based on analysis of the linear-dynamics responses to spatially localized, periodic forcing functions representing an elementary eddy flux divergence and referred to as
transient impulses or
plungers. Note that spatially localized forcing is very different from the classical spectral random forcing because it involves many phase-correlated Fourier harmonics with different wavenumbers. The plunger effectively rearranges PV and induces permanent changes of the background flow ([
39,
40,
41,
42]). In B15, the plunger effect is interpreted in terms of its
footprint (i.e., large-scale feedback) that strongly depends on the underlying large-scale flow—this dependence provides the vital link for the eddy parameterization closure.
We start by introducing the model and its solution, and by explaining the relationship between transient impulses and the eddy backscatter mechanism that is responsible for maintaining the eastward jet and its adjacent recirculation zones. Then, we extend B15 results in two main directions. First, we overcome the local homogeneity assumption and demonstrate its consequences. This is achieved by considering full background flow and finite-time, rather than periodic, transient forcing impulses. Second, we use the upgraded plunger-footprint analyses to derive the dynamically consistent, anisotropic, and spatially inhomogeneous, equivalent eddy diffusivity tensor that incorporates information about the eddy backscatter. On the one hand, the eddy diffusivity map provides a new and powerful tool for understanding the eddy effects. On the other hand, the corresponding equivalent eddy fluxes can be used directly as the simple source term parameterizing the eddies. In
Section 2, we describe the ocean model, the reference eddy-resolving solution and the eddy backscatter mechanism, as well as the linearized model and formulation of transient impulses. We present the main results in
Section 3, followed by the discussion and conclusions in
Section 4.
3. Transient-Impulse Solutions and Analyses
We consider a uniform
horizontal grid, so that its nodes are at least
away from the lateral boundaries, solve the linearized model (8)–(10) for
transient impulses, one at a time, and analyze the outcome. First, we look at the transient-impulse flow patterns (
Figure 4a–c) and find that, in the vicinity of the plunger and at the end of the forcing time interval, they resemble the periodic solutions studied in B15. Second, for each transient-impulse solution, we obtain the corresponding induced time-mean eddy PV flux (the layer index is omitted for simplicity, and the overbar denotes the time averaging over
)
(not shown). The corresponding “footprint” (
Figure 4d–f) is defined as the horizontal divergence of the eddy PV flux taken with the minus sign
so that
can be interpreted as the cumulative PV forcing to be exerted on the background flow by the nonlinear interactions of the transient-impulse solution, if this solution were nonlinear. Transient-impulse solutions differ from the temporally periodic ones (not shown, but we checked this by comparing responses in a double-periodic domain with uniform background flows), especially in the far field, but this difference has a relatively weak effect on the footprints. The other noticeable difference from the periodic solutions is due to the inhomogeneous background-flow effect—we observe it by replacing the 2D background circulation with the uniform flow given by the velocity at the plunger location
(the effect of the local homogeneity assumption is discussed further below). The third noticeable difference is due to the fact that the periodic solutions do not take into account natural instabilities of the background flow, whereas the transient-impulse solutions pick them up. The main instabilities grow in the eastward jet (
Figure 4a–c), even if the plunger is located far away from it, but more vigorously if the plunger is close to the jet. However, these instabilities yield nearly negligible contributions to the footprints, especially for plungers located far away from the jet. To summarize, we find that the transient-impulse solutions not only produce meaningful and structurally rich maps of the footprints, but they also can be viewed as physically justified upgrades of the periodic solutions.
3.1. Equivalent Eddy PV Flux
The footprint pattern
can be also viewed as a nonlinearly driven redistribution of PV anomalies supplied by the imposed plunger forcing; hence, we can recast it in terms of the equivalent PV flux. Let us introduce this concept and show that it can be useful both for interpretation of the eddy effects and for upgrading the earlier simple treatment of footprints in B15. First, in each layer (layer indices are omitted), we calculate
anomalies accumulated to the north and south of the reference point
and similarly to the west and east of it, that is,
and
Next, we calculate the “centers of mass” for these anomalies as illustrated by
and
and
are found analogously. Although, the area of integration may cover the full basin, in practice, we integrate over the square with size
centered at
in order to avoid remote and small contaminations of the footprint due to the above-discussed instabilities of the eastward jet. The equivalent eddy flux components are found as
so that the equivalent eddy flux is a simple measure of the amplitude of the PV anomaly generated by the transient impulse, and of its spatial orientation. The main difference between the equivalent and full eddy flux (14) is that the latter is a 2D vector field, whereas the former is a local single-vector quantity for each transient impulse. Of course, there is more information in the footprint pattern, but we treat it simplistically, motivated by the initial investigation purpose. Further below, we will refer to
simply as the eddy flux, while keeping in mind its definition (18). Note that the eddy flux can be straightforwardly decomposed into the relative-vorticity and isopycnal-stretching terms that can be interpreted as Reynolds and form stresses, but we leave the corresponding analyses beyond the scope of this paper.
We decompose the isopycnal eddy fluxes (e.g., as in [
45]) into their gradient and agradient components found with respect to the time-mean large-scale PV field
so that
or, equivalently,
where the eddy diffusivity components are defined as
Thus, the eddy diffusivity tensor can be defined as the sum of its symmetric and antisymmetric parts (The importance of both symmetric (diffusive) and antisymmetric (advective) eddy-induced tracer transports has been addressed by Griffies [
46]. More recently, Bachman and Fox-Kemper [
47] estimated and compared passive-tracer
κ and
in the Eady spin-down problem, and argued that their values are similar, but to what extent this can be applied to other geophysical flows remains unclear):
and the flux-gradient relation can be compactly written as
We calculate the eddy fluxes on the grid, as described above, scaled them by from the reference solution and found their divergence. By local rotation of the coordinate system, each flux field is also recast in terms of the gradient and agradient components. The fluxes themselves show how the transient forcing redistributes the PV, and their divergence shows the corresponding accumulation of the time-mean PV—hence, the large-scale effect induced by transient impulses. The negative/positive divergence of the flux describes the local accumulation of the positive/negative PV anomaly, and following B15, this anomaly can be added to the non-eddy-resolving model as the source term.
The upper- and deep-ocean eddy fluxes and their divergences, as well as the gradient and agradient components of the fluxes, are shown in
Figure 5 and
Figure 6. These patterns suggest several conclusions. First, almost everywhere, the meridional component of the eddy flux is larger than the zonal one. Second, orientation of the flux relative to the background PV gradient tells us that although the gradient component dominates, in the upper ocean, the agradient component is also significant. In the deep ocean, the gradient component is always negative (i.e., PV is fluxed down the gradient), whereas, in the upper ocean, it changes the sign to negative in the northern and southern parts of the double gyres and in the recirculation zones around the eastward jet. The latter sign changes are due to the sign changes in the PV gradient, rather than to the real flux reversals. The upper-ocean agradient flux component exhibits many sign reversals and characteristic recirculation cells to the north and south of the eastward jet. These recirculation cells are such that, in the immediate vicinity of the eastward jet, they advect downstream positive/negative PV anomalies. This is consistent with the downstream enstrophy advection mechanism ([
26,
42]). The upper-ocean flux divergence is characterized by two types of patterns: the anomalies around the eastward jet are such that they enhance the recirculation zones, and the anomalies in the northern and southern parts of the double gyres are such that they slow down the background westward return flows. The deep-ocean flux divergence is characterized only by the latter pattern. All these equivalent eddy flux effects are consistent with the actual eddy effects in the eddy-resolving double-gyre solutions (e.g., [
26]).
Next, we find the eddy diffusivity tensor following (20) and (21) and analyze its properties (
Figure 7). The eddy diffusivity components
κ and
are similar to the corresponding eddy flux components, except around
curves corresponding to zero background PV gradient; this is where
κ and
become singular. Since we work with discrete solutions, these singularities cause no problems, because they are poorly resolved and manifested only by sharp differences between large, positive and negative values on the neighboring grid points. Positive values of
such as in the westward return flows of the gyres, correspond to the downgradient redistribution of PV, and the negative values describe the upgradient, antidiffusive eddy fluxes. The eddy diffusivity pattern in the recirculation zones is dominated by
and along the core of the eastward jet
κ is mildly negative and decreasing in the downstream direction. In the deep ocean,
is relatively small, whereas
κ is positive and significant only in the westward return flows of the gyres. Overall, we conclude that the eddy diffusivity is asymmetric, and its
part is significant and has to be taken into account. More importantly, the diffusivity components are negative and even singular in many parts of the basin, and this makes the whole idea of the diffusive eddy parameterization ill-posed. On the other hand, an eddy parameterization that directly imposes diverging eddy fluxes can be well-posed and more straightforward.
3.2. Local Homogeneity Assumption
We assess accuracy of the local homogeneity assumption by comparing the eddy fluxes calculated without it (all results in this paper) and with it. For this, we remove the flow inhomogeneity by assuming that transient impulses act on the horizontally homogeneous flow with the velocity components estimated at each reference location
on the same
grid. These solutions are similar to those from B15, but with several important differences: transient impulses are finite rather than periodic in time, solutions are found in the closed rather than double-periodic domain, and both zonal and meridional components of the background flow are taken into account. The resulting outcome is illustrated by
Figure 8, which has to be compared with
Figure 5a–c, and several conclusions can be drawn from it. First, the local homogeneity assumption completely fails in the eastward jet region, which is characterized by the largest values of the background PV gradient. Second, significant differences are found even in the interior gyres, where they are more quantitative rather than qualitative. Third, we simplify the local homogeneity even further, by setting to zero the meridional background-flow component, and find that the resulting differences are small. This suggests that taking into account only zonal background-flow component (as in B15) is justified simplification away from the eastward jet. Overall, we conclude that the transient-impulse approach indeed bypasses the local homogeneity assumption and is well justified.
As we showed above, large inhomogeneities of the background flow have a significant effect on the transient-impulse responses and the resulting equivalent eddy fluxes. We explore this further by considering a highly idealized situation of the narrow eastward jet that was fitted to approximate the jet from the reference eddy-resolving solution (
Figure 9a–c). The idealized jet profile (in all three isopycnal layers) is given by the following exponentially decaying oscillation:
where the nondimensional meridional coordinate and the jet width are related to the dimensional variables as
and
respectively, and parameter
is always kept fixed. The idealized jet is controlled by amplitude
U and width
and it closely resembles the reference jet when
km. We place the idealized jet in the center of the basin, find its velocity streamfunction by integration of the velocity profile, and taper the streamfunction to zero around the boundaries (
Figure 9d,e), in order to satisfy the no-flow-through boundary condition for the synthetic flow.
Next, we position our standard transient-impulse forcing in the center of the basin and calculate its responses and footprints for a broad range of
U and
w values (
Figure 10). The outcome clearly shows that the (across-jet) eddy flux strongly depends on the width of the jet, and for a strong enough jet it changes sign from the positive to negative. On the one hand, this result is a simple conceptual illustration of how the local homogeneity assumption fails. Applying this assumption to any point
in
Figure 10 is equivalent to taking
w to infinity; as the Figure suggests, this results in large error in the equivalent eddy flux for fast and narrow eastward jets. On the other hand, this result implies that the tendency of transient eddy forcing to maintain the jet weakens with the jet strength and even reverses for strong enough jet. This is consistent with the eddy diffusivity across the eastward jet becoming more negative further downstream (
Figure 7a).
4. Discussion and Conclusions
In this paper, we continue development of the new framework for parameterizing mesoscale eddy effects in non-eddy-resolving ocean circulation models. The central theme of the approach is its reliance on explicit dynamical solutions of an idealized model of the eddy effects; therefore, we refer to the approach as
dynamically consistent. In Part I of this paper, we (a) focused on the eddy backscatter effect of the
transient part of mesoscale eddy forcing; (b) proposed the “plunger-footprint” approach that translates simple and spatially localized, elementary transient forcing (i.e., plunger) into its nonlinearly rectified response (i.e., footprint); (c) related the footprint to the underlying large-scale flow; and (d) implemented the resulting cumulative effect of many footprints as the eddy parameterization in a non-eddy-resolving model of wind-driven ocean gyres ([
28] hereafter, B15). Despite several significant shortcuts made, especially with how the large-scale flow information was treated, the parameterization demonstrated remarkable skills. In this paper, by making significant modifications of the approach, we not only fixed several problematic shortcuts but also gained new dynamical insights into the eddy/mean interactions.
We continued to focus on the classical, wind-driven double-gyre model and showed how the eddy backscatter mechanism maintains the eastward jet via positive correlations between the upper-ocean transient eddy forcing and the jet itself. In order to understand and model nonlinear flow rectification in response to the transient component of the eddy forcing, we constructed elementary, spatially localized and temporally transient forcing functions, referred to as the transient impulses (plungers), and imposed them on the governing quasigeostrophic potential vorticity dynamical equations linearized around the time-mean double-gyre reference solution. For each position of the transient impulse inside the double gyres, we found the corresponding flow solution and its nonlinear self-interaction, referred to as the footprint. The resulting footprints strongly depend on the underlying large-scale flow, and, therefore, vary across the gyres. At this point, we fixed the following shortcuts made in the previous theory and its implementation (B15). First, when solving the linearized dynamics, we took into account not only zonal but also meridional component of the mean large-scale flow. Second, we upgraded the time-periodic transient impulses to more general and realistic, finite-time impulses which take into account the decaying autocorrelations of the actual eddy forcing. We chose a simple wavelet-like piecewise-linear shape of the transient-impulse time dependence: one larger pulse is surrounded by two equal and smaller pulses with the opposite sign. In the presentation, we considered 50-day-long impulses, and the sensitivity study in the interval from 45 to 70 days showed that the results are qualitatively similar. Third, we overcame the local spatial homogeneity assumption that used only local flow information at each plunger position and allowed the Fourier transform, which greatly simplified the problem. Without the assumption, we had to solve the problem differently—by forward time stepping combined with the second-order finite differences. By comparing transient-impulse solutions with and without the local homogeneity assumption, we found that this assumption completely breaks down in the eastward jet, which is characterized by the most energetic eddies and significant eddy effects, and yields significant quantitative errors in other parts of the gyres. To summarize, we concluded that the proposed extensions of the theory and parameterization are important.
The transient-impulse approach provided us with several new insights into the double-gyre eddy/mean interactions. First, the footprints provided us with maps of the PV anomalies expected to be induced by a general transient eddy forcing acting on the top of the considered background circulation. This information was used to calculate the equivalent eddy PV fluxes and their divergences, and thus to partition the double-gyre circulation into distinct geographical regions with specific eddy effects. In particular, these results suggested that the eddies have to maintain the eastward jet extension of the western boundary currents and its adjacent recirculation zones and to slow down the western parts of the subtropical and subpolar gyres. In turn, these results are consistent with the actual eddy effects in the nonlinear double-gyre solutions (e.g., [
26,
48]). Second, from the equivalent eddy PV fluxes and the underlying large-scale PV gradients, we calculated a spatially inhomogeneous and anisotropic eddy diffusivity tensor. Both of its components along and across the large-scale PV gradient have multiple singularities where the gradient changes sign. Existence of these singularities is consistent with the fact that the eddy fluxes are driven mostly by nonlocal dynamical mechanisms; therefore, they can not be determined only by local gradients. Third, in many parts of the gyres, especially around the eastward jet and its adjacent recirculation zones, the resulting eddy diffusivity is found to be negative, that is, counter-gradient, which is consistent with some earlier studies (e.g., [
26,
45,
49]). All these eddy diffusivity properties suggest that imposing estimated diverging eddy fluxes directly on non-eddy-resolving solutions is better parameterization strategy than relying on the eddy diffusion model. Finally, our results provided a new eddy diffusivity theory, which is complimentary to the existing ideas, such as invoking the linear-stability analysis under the local homogeneity assumption, constraining by the conservation laws, scaling by the Eady growth rate, and relying on the universal turbulent energy spectra (see
Section 1).
There are several avenues for future upgrades of the proposed parameterization framework, and the most important one is extension beyond the quasigeostrophic to primitive-equation dynamics, which is the main tool of comprehensive OGCMs. Extension from linear to fully nonlinear transient impulses is another anticipated future development. Accounting for the large-scale low-frequency variability of the eastward jet and upgrading the transient-impulse forcing towards much more realistic multiscale patterns (e.g., by data-driven statistical models; see also [
36]) will provide further advances to the whole approach. The transient-impulses approach can be also advanced as a powerful analytical tool for estimating eddy fluxes and diffusivities for both passive and active tracers. The fluxes and diffusivities can be used not only for eddy parameterizations but also for dynamical interpretations of the observations and model solutions, as well as for various practical predictions. To summarize, the framework being developed in this paper and its prequel (B15) is promising but requires further understanding and development. The main advantage, as well as the added complexity and extra capabilities of the new framework stems from its explicit use of the high-resolution dynamics—this constitutes the main difference from the mainstream approaches [
2] that use dynamical arguments implicitly.