2. Materials and Methods
We begin with the two-box energy balance climate model (EBM), notionally of the Earth’s climate. A low- and high-latitude zone each receive different amounts of sunlight, and each reject heat as a function of their temperature, T
0 and T
1, respectively. For the present-day Earth, the insolation I
0 in the low-latitude box (taking into account the albedo) is about 300 Wm
−2, while at higher latitudes, we have I
1 = 170 Wm
−2, following [
7]. We ignore the greenhouse effect and assume that heat leaves the planet as thermal radiation from the two zones, with the outgoing emission related to temperature, e.g., E
x = σT
x4 (many EBMs use a more or less empirical linear relationship instead, the results are not significantly different here). If heat is transferred between the boxes at a rate F (see 
Figure 1), then the system of equations is easily solved and the temperatures T
0 and T
1 are determined.
Clearly, a physically reasonable system (i.e., one where T0 > T1, so that heat is transported only down a temperature gradient) must have 0 < F < (I0 − I1)/2. In some EBMs, F is not unreasonably assumed to be proportional to the temperature difference, in effect acting as a diffusion process, i.e., F = k∆T with ∆T = (T0 − T1); but for simplicity of exposition, we will discuss only F in physical units. However, there is no universal ‘first principles’ means of determining F (or, equivalently, k), and they have generally been estimated in EBMs from empirical data.
Properties of interest are the Carnot limit on the rate of generation of mechanical work by this heat flux, i.e., W = F∆T/T
0, and its entropy generation dS/dt = F(1/T
1 − 1/T
0). As noted by E. Lorenz [
2,
9], W tends to zero for small F and for large F, but has a maximum at some intermediate value, at which the Earth happens to sit (
Figure 2). Simply put, at low F, there is no heat flux to convert into work, whereas for the maximum F, the system has been made isothermal (∆T = 0) and thus the efficiency with which heat transport is converted into work drops to zero: it is clear that there must be a maximum W at some intermediate value of F. The distinction of the F values for which W is a maximum and for which dS/dt is a maximum is not significantly different; also, since in a steady state, the frictional dissipation in the system must balance the generation of mechanical energy W, this steady configuration is often also termed one of ‘maximum dissipation’. Consideration of the fundamentals of thermodynamics suggests that the entropy production metric may well be the most fundamentally important (and in a general climate problem, sources of entropy production other than mechanical dissipation, such as that by mixing, are likely to be significant). However, here we again strive for clarity of exposition, so we consider in this paper only mechanical work and friction as concepts that are easier to understand. Note that in this paper, ‘work’ refers to the rate of generation of work (i.e., in W or W/m
2). Similarly, in real planets, there is substantial vertical transport of heat and associated generation of work and entropy (e.g., [
18,
19]), whereas here we consider only the horizontal temperature contrasts and associated flows and generation. The arguments in the present paper are general and could be applied in the vertical dimension, but for clarity, we consider only the horizontal dimension (It has been remarked that it is probably not coincidental that the diffusion parameter k for the Earth has not only the same dimensions, but also the same numerical value (~1 W/m
2/K) as the characteristic planetary average (radiative) entropy production, which is 0.5[(I
0 − F) (1/T
0) − (I
0/T
S) + (I
1 + F) (1/T
1) − (I
1/T
S)], where T
S~6000 K is the temperature of the sun. Since T
S >> T
0~T
1, this reduces to approximately 0.5(I
0 + I
1)/0.5(T
0 + T
1), or the average solar flux divided by the average temperature. This same ‘coincidence’ is true for Titan, even though the respective values are almost two orders of magnitude smaller (~0.01 W/m
2/K), contrary to what one would expect from scaling Earth’s k value to Titan’s pressure and rotation rate. In the EBM literature, k is often given the symbol D, although in this paper, we preferred to avoid confusion with dissipation).
Returning to the 1-D latitudinal EBM, the crux of the problem is that F is unknown. Even for a pure fluid on a homogenous and flat rotating planet, F cannot be uniquely deduced from first principles, and of course the real Earth has heat transports through complex flows in both the atmosphere and the ocean, with heat conveyed in part by the latent heat of water as well as by sensible heat in the two main fluids.
If only a single transport mode is present (with some dissipation function D(F), which is likely nonlinear (e.g., D = αF
3, see later, although nonlinearity is not essential to the argument), then for a given set of boundary conditions (I
0, I
1) which determines the work output W(F), it follows that a steady state (e.g., S1 in 
Figure 2) occurs where the curves cross (W = D). The system is then uniquely specified (and such a constrained system recalls the minimum entropy production arguments of Prigogine, Jaynes, and others e.g., [
20,
21]). It is only if the dynamics of the transport mode (α) are ‘lucky’ such that W = D happens to be at the maximum work output that the maximum dissipation state is found.
Only with very mechanically restricted arrangements is it specified, however, since a fluid can take many paths, each with different α (or indeed, there may be multiple fluids present—see 
Figure 3). The argument I advance here essentially notes that this underdetermination can be resolved statistically—it is the very fact that the heat transport is the result of the combination of many elements that allows its net effect to be predictable, just as conventional thermodynamics allows the prediction of large-scale properties of a gas without requiring knowledge of the energies of every individual molecule (an analogy noted by Paltridge [
1] and Dewar [
22,
23]). Some elements of the paradigm proposed here (notably that the matching of maximum work output to maximum dissipation can be achieved by the combination of low- and high-dissipation heat transports) were articulated briefly in, e.g., a commentary article in 2003 [
24], but the present paper permits a more comprehensive development. In fact, elements of the paradigm were actually articulated by E. Lorenz [
1] in his observation that the Earth’s atmosphere seems to be in a state of maximal generation of available potential energy (APE). This energy (or work) is generated by the radiative deposition of heat at the ‘hot’ end of the engine. In the steady state, this is equivalent to maximum mechanical dissipation, since the APE becomes kinetic energy which is then dissipated by friction and viscosity. Aspects of ‘how’ the system accesses and reaches the maximum were noted in that work, and by [
5,
6], and in a review by Ozawa et al. [
9], but the 
combinatorial emergence of the optimum was not described.
  3. Results
Let us assume that the heat transport F is the sum of two components, F
1 and F
2, corresponding to some large-scale overturn of the atmosphere and some smaller-scale eddies (or, perhaps, F1 might correspond to ocean transport and F2 the atmosphere—see 
Figure 3). If we consider sensible heat transport, then these two terms will be associated with corresponding flow velocities V
1 and V
2, and there will be associated frictional dissipation D
x~V
x3 (since the drag force per unit area is proportional to V
2), or since F is proportional to V, we can write D
x = α
xF
x3 with α
x as a constant for that transport mode x = [1, 2, …]: α
x encodes the density of the fluid, the surface roughness which controls the effective drag coefficient, and some geometric factor describing the flow pattern, e.g., the size of the characteristic eddy. Clearly, since a set of small eddies has a more tortuous path from low to high latitude than a large-scale overturn, then α
1 >> α
0. Schematically, such flows are shown in 
Figure 2; some aspects of these flows are discussed in the MaxEP context by Kleidon and colleagues [
25].
So far, we have not simplified the problem—we just introduced another unknown. In steady state, we can choose F
1 and F
2 to be any arbitrary values up to
The work output, W, varies as a function of F = (F
1 + F
2), with some maximum value and an intermediate F as discussed earlier (solid line in 
Figure 2). Now, if we set F
2 = 0 and imagine that the large-scale transport F
1 has a rather weak dissipation D
1 = α
1F
13, which is a monotonically increasing function of F
1 (dashed line in 
Figure 2), the D(F) and W(F) curves cross at a position determined by the various parameters, notably α
1, for an F value F
s1 large than that for the peak F
m. At this crossing point, mechanical energy generation and frictional dissipation are equal, so the system has a possible steady state. The constraints on the system (embodied in α
1) are such that the system can in principle access the configuration F
m; however, the dissipation here is lower than the mechanical energy generation rate, so the system would tend to speed up, moving away from the peak. These aspects of system behavior were discussed previously by E. Lorenz, Ozawa, and others.
Contrariwise, we may imagine that F
1 can be set to zero, and F
2 can vary. However, the F
2 transport mode is much less efficient in the sense that there is a much higher frictional dissipation per unit heat transport (i.e., α
2 >> α
1) and the D
2(F) function grows much steeper than in the previous example. Thus, the intercept of this curve (dotted line in 
Figure 3) with the W(F) mechanical energy generation curve (solid line) occurs at a much lower value of F, i.e., to the left of the peak in W. In this case (i.e., if the F
2 mode was the only one accessible to the system due to dynamical constraints on density, rotation and so on, as encoded in α
2), the system cannot access the maximum dissipation state, because W(F
m) < D(F
m). As soon as the system spins up beyond the crossing point, W < D and mechanical energy is lost, tending to reduce F
2 so F
m cannot be reached.
It is easy to imagine that one could have hybrid configurations with F1 > 0 and F2 > 0, which will have a net dissipation intermediate between these two end members. Algebraically, we could choose combinations of F1 and F2 that have dissipations equal to the work output for F above, below, or exactly at the optimum Fm. We could generalize further to some arbitrarily large number of different modes.
We can see, then, that as long as one or more modes exist with dissipations both suitably low and high, then it is theoretically possible to have a MaxEP steady state. Since arbitrarily tortuous flow paths can be imagined, it is always possible to obtain suitably high dissipation modes, thus, the necessary condition becomes that at least one permitted configuration has dissipation weaker than that needed to reach the optimum. However, this does not explain why a given system should find this optimum.
To address this question, consider the set of available states (for discussion purposes, to make this a finite set, let us assume that the heat transports in the two modes must have integer values in Wm
−2). This then defines a triangular set of points in the state space of (F
1, F
2)—see 
Figure 4. Some of these states have dissipation higher than the work output of the system in that configuration (i.e., W < D), in which case the system would be expected to spin down quickly, i.e., change to a lower energy state with lower F
1, lower F
2, or both. The opposite evolution is expected when W > D.
However, some configurations may have dissipations rather close to the work output—in such configurations, one may imagine that the system will change state only slowly since kinetic energy is being added at nearly the same rate W as that at which it is being removed by D. However, the time evolution of F1, F2, etc., depends on the actual dynamics of the system, specifically how kinetic energy is transferred to and from the different modes F1 and F2.
In general, one could imagine a set of couplings of the form.
The parameters β
1 and β
2 describe how available potential energy (i.e., the temperature difference) drives the motions, while parameters γ
1 and γ
2 describe how friction transfers momentum from F
1 to F
2. The dynamical interpretation is that coupling exists between the different modes; for example, energy will flow from a large-scale circulation to smaller eddies in the familiar Kolmogorov cascade, or wind stress at the sea surface will cause ocean currents to develop. In the first instance, the dissipation per unit heat transport increases (since the smaller-scale eddies have more shear than the large-scale flow), whereas in the second, the dissipation per unit heat transport decreases, because the density and heat capacity of the ocean water column is much larger than that of the ocean (for the present-day Earth, at least). Physical models, typically with empirically derived parameters (such as the sea surface drag coefficient) can be developed for these couplings, and this is how general circulation models typically are constructed (we may incidentally recall that the system of Equations (2) and (3) has some similarities with the convecting system, whose nondeterministic behavior was noticed by Lorenz [
26], a discovery—‘chaos’—that become altogether more famous than his work on the extremal state of the climate).
The details needed to accurately model such modes and couplings are not always available, however. Ocean circulation on Earth is driven, for example, to some extent by the equator-to-pole temperature difference (although warm surface waters must be mechanically mixed downwards by difficult-to-model effects of, e.g., hurricanes, in order to provide the buoyant force driving the large-scale circulation). Large-scale wind stress over the ocean is also an important factor. We could reasonably posit that the acceleration of the ocean circulation mode has two terms, one related to the temperature difference (i.e., β), and one to the speed of the atmospheric mode, i.e., γ (Strictly, a frictional coupling would imply that the drag force per unit area accelerating the ocean current would be proportional to the square of the difference between surface wind speed and the ocean surface current. However, since the ocean column has such a large heat capacity, the ocean speeds are small relative to the wind and can be ignored. In this respect, the kinetic energy transfer is effectively unidirectional. Planetary environments, e.g., with very shallow oceans and with very dense atmospheres, could be contrived where two-way coupling would need to be considered.). Note that if these couplings are too strong, the system loses degrees of freedom (e.g., forcing F1 and F2 to be strongly connected is algebraically equivalent to choosing an intermediate k value and thus explicitly determining the steady state W = D crossing point).
The essence of the MaxEP hypothesis is that these details 
do not need to be known. What matters is that at least one mode exists with a dissipation less than the maximum and at least one exists with a dissipation more than it, and some coupling in both directions allows the system to evolve. In the planetary climate context, satisfaction of the first condition is not always met (e.g., the atmosphere is too thin, or the planet rotates too fast, etc., and it is the lack of a statement about this condition that caused dynamical meteorologists to be deeply uneasy with the MaxEP notion). In situations where the first condition is met, it seems that the second will always be met too, simply as a result of the character of fluid turbulence (e.g., [
27]). One can always invoke ever smaller eddies to soak up excessive kinetic energy by viscosity, per Richardson’s poem.
It follows, then, that if the system can evolve throughout the state space, then the likelihood of some observed property (such as near-maximum dissipation) will depend on the fraction of the state space showing that property. 
Figure 5 shows the relative frequencies of dissipations in the state space of 
Figure 4 (i.e., assuming the system resides for equal amounts of time on each point in the allowed space)—higher frequencies are seen for higher dissipations. However, the propensity to observe near-maximum dissipations is much higher when we consider only those points which are near steady state, i.e., with W~D. There are simply proportionately more such states when the dissipation is higher, because there are more possible combinations of (D
1, D
2) that sum to a higher number (W) than there are that sum to a smaller one.
More generally then, the dimensionality of the state space is higher than two, and the location in that space is described by a large vector (F1, F2, F3…Fn) of (assumed positive) quantities describing the vigor of each of the modes. The net heat transport is evaluated by a weighted sum of these values—if the vector was specified as mass flows, then the weights would incorporate factors that encapsulate the transport properties of each mode (e.g., the horizontal size of eddies, the column mass and heat capacity of the atmosphere, and so on), although more generally, additional factors, such as latent heat, could be included. The dissipation is a weighted sum of some probably nonlinear (notionally cubic, in the classic drag formulation noted earlier) functions of the heat transports. The propensity of the system to be close to a maximum in dissipation simply arises from the number or density of microstates for which dissipation is α1F13 + α2F23 + α3F33…+ αnFn3~W, which is higher for large W.
As another example, then, we can consider a more general three-mode system, with α values of 0.01, 0.001, and 0.0001, and allowable F values for each having distributions uniform in the logarithm of F in the range 0.01–65 Wm
−2. Although this choice is entirely arbitrary, it avoids the restriction of integer values in the previous example (which in turn biased the order of magnitude that resulted). Again, however, it is seen (
Figure 6) that the distribution of W values peaks at the maximum, and indeed, this results without requiring the selection of the steady state (W~D). The principle is rather general.
Now, while the explicit dynamics of the system could be specified by couplings in a higher-dimensional analogy of Equations (2) and (3), one could instead imagine evolution via a conditional random walk, similar to the search algorithm of an optimizer. A rational physical rule would be, for example, to simply increment one coordinate Fi (i randomly chosen from 1 to n) if the dissipation is less than the work output such that the overall kinetic energy increases, or decrement it if dissipation exceeds work (i.e., spin down). It is easy to see that such an evolution would naturally spin up from rest and will tend to fluctuate about a steady state, which will most frequently be near the maximum of total dissipation.