1. Introduction
The transfer of large-scale available potential energy to mesoscale eddy energy by baroclinic instability is one of the most important energy pathways in the dynamics of the global oceans [
1]. In the Lorenz energy cycle [
2], nonlinear baroclinic instability processes transfer large-scale potential energy to eddy potential energy (EPE); EPE is then converted to eddy kinetic energy (EKE); and EKE is either returned to the large-scale flow or dissipated through a wide range of mechanisms. Theories of geophysical macro-turbulence concern themselves, inter alia, with predicting the length scales at which the EPE production and conversion to EKE occur. An early theory proposed by Salmon [
3] has become a touchstone of the modern understanding [
4], having been revised and expanded by a wide range of authors. This early theory, and many successors, is formulated in the context of a highly idealized two-layer quasigeostrophic model whose vertical structure can be formulated in terms of two ‘modes’: a barotropic mode that is depth-independent, and a baroclinic mode that is antisymmetric in the vertical. (The term ‘mode’ here simply means elements of a basis, not the normal modes of a mechanical system, cf. [
5,
6].) These modes diagonalize the energy, and Salmon’s theory was framed in terms of energy in the modes rather than in terms of kinetic and potential energy. The barotropic mode has no potential energy, while the baroclinic mode has both potential and kinetic energy, so the connection of the modal energy perspective to the perspective based on kinetic and potential energy is murky. Nevertheless, the modern theory of geophysical turbulence (cf. [
4] Section 9.3) inherits the following picture from Salmon’s theory: EPE is generated at scales larger than the deformation radius (defined below) and then cascades downscale; conversion from EPE to EKE occurs primarily at scales near the deformation radius; EKE is then dissipated by a range of processes, especially by frictional interaction with the bottom boundary at scales somewhat larger than the deformation radius.
The goal of the present investigation is to study the process of conversion from EPE to EKE from a mathematical rather than a physical perspective. Rather than studying the dynamical processes responsible for conversion, the goal is to investigate the mathematical limits on the rate of conversion per unit energy. The approach is based on considering all possible flow configurations at a fixed energy level and then seeking a configuration that will maximize the rate of potential to kinetic energy conversion. These flow configurations need not be steady solutions of the governing equations, or even states that might realistically be produced by the dynamics, so the rates obtained in this way should be understood as a limit on what a realistic flow can achieve, rather than as a prediction of what a realistic flow will achieve. The method is closely related to methods used to provide upper bounds on the heat flux in Rayleigh–Bénard convection, e.g., [
7,
8,
9].
Barham and Grooms studied the mathematical limits on the rate by which eddies can extract potential energy from the mean flow in a fluid model incorporating only the hydrostatic approximation [
10], and in a quasigeostrophic model [
11]. They found that the eddies are able to extract potential energy from the mean flow at a rate that is independent of the length scale of the eddies. This was something of a null result, in the sense that the bounds on what the flow can do are not closely related to what the flow actually does: a wide range of investigations have found that the EPE is primarily generated at scales larger than the deformation radius [
12,
13,
14,
15,
16,
17,
18,
19,
20]. The results of the present investigation make a closer connection to the phenomenology of geostrophic turbulence than the results of [
10,
11]: it is shown that conversion from EPE to EKE is much less efficient than the generation of EPE at scales large compared to the deformation radius. While the results do not provide a rigorous mathematical explanation for why EPE is typically generated at scales larger than the deformation radius (there are many dynamical theories to predict this, e.g., [
3,
14,
15]), it does connect to the downscale cascade of EPE and to the fact that conversion from potential to kinetic energy is observed to be weak at scales larger than the deformation radius.
A limitation of the current results is that the maximal rates of conversion obtained through the analysis are not proven here to be truly maximal. The conversion rate is a function of the flow configuration, and there are an infinite number of flow configurations that are stationary points of the function, i.e., these flow configurations correspond to local extrema or saddle points of the function. The analysis here identifies an infinite set of these stationary points that can be analyzed using Fourier methods and linear eigenvalue theory. Whether there are other flow configurations that could result in higher conversion rates remains an open question. The difficulty is related to the fact that conversion, in a quasigeostrophic approximation, is a cubic function of the flow state, which is more difficult to analyze than the production of EPE (cf. [
10,
11]), which is a quadratic function of the flow state.
The paper is organized as follows. In
Section 2, conversion is studied in the context of the hydrostatic Eady problem, similar to the development in [
10]. As in [
10], the results are complicated by the presence of unrealistic ageostrophic flow configurations, so, as in [
11], the quasigeostrophic (QG) version of the problem is studied next. The QG problem is studied analytically in
Section 3 and numerically in
Section 4. Conclusions are offered in
Section 5.
2. The Hydrostatic Eady Problem
The linear perturbation equations in the nondimensional Eady problem with hydrostatic and Boussinesq approximations are [
10,
21]
where
z is the coordinate along the axis of rotation and gravity and
x is the coordinate along the direction of mean flow. The domain is periodic in the
x and
y directions and
. The subscript
denotes the horizontal component of a vector, e.g.,
.
The buoyancy frequency is N and the dimensional background velocity is . The Richardson number is , where . The time scale for nondimensionalization is , where f is the Coriolis parameter; the vertical length scale is H; the horizontal length scale is the deformation radius . The horizontal velocity scale is ; the vertical velocity scale is ; the buoyancy scale is .
The perturbation kinetic energy equation is obtained by taking the dot product of (
1) with
and integrating over the domain (with several integrations by parts)
where
denotes an integral over the physical domain
V. The perturbation available potential energy equation is obtained by multiplying (3) by
and integrating over the domain
The term of interest here is the conversion from potential to kinetic energy
Because this is a hydrostatic model, the vertical velocity
is obtained from the horizontal velocity using
where the variable
s is a stand-in for the vertical coordinate.
The goal is to obtain a configuration of
that maximizes the conversion
C at a fixed energy level
. We therefore define the Lagrangian
where
is the Lagrange multiplier and the total energy is
The Lagrangian is a quadratic function of the buoyancy and velocity and the energy level can be scaled out of the problem, so, without loss of generality, let .As usual, the Euler–Lagrange equations are derived by finding conditions that describe stationary points of the Lagrangian.
The Euler–Lagrange equations for this constrained optimization problem are obtained as follows. We first consider the Fréchet derivative of the energy, which is simply
To derive the Fréchet derivative of the shear production, start from the expression
To proceed, we need the following simple integration by parts identity, also used in [
10]
which is valid for integrable functions
g and
h. This expression enables the following manipulation of the first term in (
12):
The fact that
at both the upper and lower boundaries sets
. The Fréchet derivative of the conversion is therefore
Configurations of
that are stationary with respect to the conversion rate subject to the condition of fixed energy satisfy
for all
. The Euler–Lagrange equations are therefore
Perturbation fields
that satisfy these equations are associated with conversion rates
. Note that horizontally incompressible flow with
is a stationary point of the Lagrangian, but an uninteresting one since it has no conversion.
A system of partial differential equations for the optimal flow configurations can be obtained by taking the partial derivative of these equations with respect to
z and then condensing to a single equation
Noting that (
19) implies that, for these optimal configurations
on the top and bottom boundaries, we can expand solutions as
where ‘c.c.’ denotes the complex conjugate and the amplitude of
is determined by the condition of unit total energy. The conversion rate
satisfies
The conversion rate exhibits an ultraviolet catastrophe where the conversion rate approaches infinity at small scales. Ultraviolet catastrophes can call into question the well-posedness of a system of equations; in this case, this is not a concern. Conversion is not a component of the total energy budget, since the contributions to the kinetic and potential energy budgets cancel on adding, so this ultraviolet catastrophe does not cause unbounded growth of energy at small scales. Instead, it means that certain small-scale flow configurations will almost instantaneously convert their potential energy to kinetic energy, which is of course not problematic for the dynamics.
The flow configurations associated with these high conversion rates have divergent horizontal velocities: (
19) implies
As
, the magnitude of the divergence grows without bound, which causes the magnitude of the vertical velocity to also grow without bound per (
8). These flow configurations are mathematically permissible—vertical velocity is not included in the conserved energy for this model, so infinite vertical velocity is not precluded by the constraint of finite energy—but physically unrealistic for balanced ocean dynamics, which have small vertical velocities. Thus, as in [
11], the next section pursues the question in the context of a quasigeostrophic model. An alternative approach, not taken here, would be to analyze the conversion in the non-hydrostatic model; this would include the vertical velocity as part of the kinetic energy, which would prevent it from growing unboundedly in the progression to small scales.
3. Conversion in the Quasigeostrophic Model
The nondimensional eddy vorticity and buoyancy evolution equations in an inviscid quasigeostrophic (QG) approximation in the presence of a zonal mean flow are
where
is the eddy vorticity,
is the eddy buoyancy,
is the nondimensional buoyancy frequency, and
. The depth
H is used to nondimensionalize
z, and the deformation scale
is used to nondimensionalize
x and
y, where
f is the Coriolis parameter at a fixed reference latitude, and
is the maximum value of the dimensional buoyancy frequency. The time scale is
; the scale of
is
and of
is
. The nondimensional planetary vorticity gradient is
, where
is the meridional rate of change of the Coriolis parameter at the same reference latitude.
These two evolution equations for derivatives of
could, in principle, be incompatible; the incompatibility is prevented by vertical velocity
, which acts to keep the two evolution equations consistent. The condition on
that keeps the two equations consistent is obtained by applying
to (
24), applying
to (
25), and adding the results. This leads to the Omega equation [
22] for
:
The evolution of kinetic and available potential energy in the QG model can be obtained, respectively, by multiplying (
24) by
and integrating over the domain, and by multiplying (
25) by
and integrating over the domain. The results are
where one integration by parts was performed to obtain the kinetic energy equation, using
conditions on the boundaries.
3.1. Derivation of the Lagrangian
The form of the potential to kinetic conversion and of the total energy are the same as in the hydrostatic Eady problem, with the important difference that in the QG problem, the velocity and buoyancy are all derived from a single variable
. In terms of the QG streamfunction, we can write the total energy as
Unlike in the hydrostatic Eady problem, conversion is a cubic function of
, although it can be split into a sum of cubic and quadratic components. For notational convenience,
is defined to be the differential operator acting on
, i.e.,
The operator
is self-adjoint when acting on the Sobolev space of functions with homogeneous Dirichlet boundary conditions whose weak derivatives of order
belong to
, which implies that
is also self-adjoint (cf. [
23], examples 10.1 and 10.4). Noting that
is linear, we can split conversion as follows:
where
is cubic and
is quadratic. The fact that
is self-adjoint and commutes with
implies that
From this, we conclude that the vertical velocity generated by the
term in the Omega equation does not lead to any conversion, and thus
Note that
has a profound influence on the dynamics, and thus an indirect influence on the rate of conversion of EPE to EKE.
With this notation, the problem of interest is to maximize the rate of conversion at fixed energy level
. The Lagrangian for this constrained optimization problem is
Unlike the non-QG case, the presence of a cubic term implies that the energy level
cannot be set to unity by a rescaling of
.
3.2. Analysis of the Cubic Term
The presence of the cubic term
implies that the Euler–Lagrange equations for this Lagrangian constitute a quadratic eigenvalue problem, unlike the linear eigenvalue problems associated with EPE production analyzed in [
10,
11]. However, the cubic term has a curious property: for all flow configurations
for which
, we have that
. This implies that any conclusions drawn from analyzing only the quadratic part of the Lagrangian apply also to the full Lagrangian, with the only caveat being that the optimal conversion rates discovered in this way may not be globally optimal for the full problem.
The following theorem justifies the foregoing statement about the cubic part of the Lagrangian.
Theorem 1 (Univariate degeneracy of ). Any flow configuration that satisfies sets for any and .
Proof. The fact that for any follows from the fact that the Jacobian advection operator is zero for two univariate arguments: , .
To prove that
for any
, begin by using the fact that
is self-adjoint to write
Integrations by parts in the vertical and horizontal now produce the following, where horizontal periodicity has been used, as well as the fact that
produces a function that is zero on the upper and lower boundaries:
With this expression in hand, we can consider the Fréchet derivative of
The first line is zero whenever
because of the property of the Jacobian noted above. The remaining terms can all be expressed in the form
where either
r or
s is a perturbation. (Note that
q does not refer to potential vorticity in this expression.) The identity
allows us to consider
s to be the perturbation without loss of generality. Integration by parts puts these terms into the form
If
, then, for all of terms on the second and third lines of (
39),
q and
r are of the form
and
. For functions of this form,
which implies that
. □
Note that no claim is made that these are the only flow configurations for which . A complete, rigorous analysis of the stationary points of the full Lagrangian is outside the scope of this investigation.
3.3. Euler–Lagrange Equations for the Quadratic Part
Having established the foregoing property of
, we next derive Euler–Lagrange equations for the quadratic component of the Lagrangian
As with the hydrostatic Eady problem, the fact that this Lagrangian is quadratic implies that the energy level
can be set to unity without loss of generality. The Fréchet derivative of the energy is
The Fréchet derivative of the quadratic component of conversion is
Expand both
and
as Fourier series,
where
and
are the nondimensional length and width of the periodic domain. With this expansion, we have the following representations of
and
:
where
is the complex conjugate of
. Stationary points of the quadratic Lagrangian are thus single Fourier modes that satisfy the Euler–Lagrange equations
Because stationary points of the quadratic part of the Lagrangian are single Fourier modes, they are of the form
. Theorem 1 implies that
for these configurations, so these flow configurations are also stationary points of the full Lagrangian.
3.4. Asymptotic Analysis
Exact solutions of the Euler–Lagrange equations for the quadratic part (
50) remain elusive due to the presence of the
operator. Before proceeding directly to numerical investigations of conversion, it is valuable to provide some asymptotic analysis to complement the numerics. Two limits of interest are large and small scales. To analyze these limits, we return to the Omega Equation (
26), written for a single Fourier mode and omitting the passive
term
Clearly,
as
, which implies that conversion also goes to zero as
. More generally, at large scales, i.e.,
, it is clear that
, so conversion must go to zero at scales much larger than the deformation radius.
For small scales,
, it is evident that
Exponentially thin boundary layers return
to zero on the boundary if necessary, but these layers contribute an asymptotically subdominant amount to the total conversion. The total conversion for large
thus scales as twice the real part of
The elementary inequalities
and
imply that the conversion in this limit is bounded by
which further implies that the rate of conversion per unit energy is bounded by twice the Richardson number
This boundedness at small scales is in contrast to the behavior seen in the hydrostatic Eady problem in
Section 2, and mirrors the behavior of EPE production rates found in [
11].
4. Numerical Investigation into Conversion Rates
This section provides a numerical investigation of maximal conversion rates, applying only to the quadratic component of conversion in the presence of a zonal mean flow. Rather than discretize the Euler–Lagrange Equations (
50), the analysis is framed directly in terms of the conversion for a single Fourier mode
By applying standard equispaced, second-order, centered finite differences to the Omega equation for a single Fourier mode (
51) and to
, one obtains an expression of the form
where
is a Hermitian matrix and
is a vector of values of
at
equispaced points on
. The energy may similarly be numerically approximated using
where
and
is a discrete approximation to the operator
. With these discrete approximations, the generalized eigenvalues
of
are the conversion rates per unit energy associated with flow configurations given by the generalized eigenvectors
. The fact that
and
are both Hermitian and that
is positive definite imply that there are exactly
real generalized eigenvalues
for each Fourier mode
.
For comparison, the same treatment is applied to the rate of eddy potential energy production following [
11]; the continuous expression is
The discrete approximation takes the form
and the optimal rates of production per unit energy are found as generalized eigenvalues
of the Hermitian generalized eigenvalue problem
The code used to generate the
,
, and
matrices can be found in [
24].
Results are computed for three mean flow profiles:
all with
and using
points. These three profiles represent three different ways of violating the Charney–Stern–Pedlosky criteria for baroclinic stability ([
4], Section 6.4.3; see also [
25]). For each mean flow configuration and over a range of wavenumbers
, the optimal rates of conversion and production per unit energy are computed.
These conversion rates are shown as functions of
and
in
Figure 1, and as a function of
for
in
Figure 2. As predicted by the asymptotic theory, the conversion rates go to zero as
, which is in contrast with the production rates. For the latter, there is a singularity at
such that, for
and
, the production rate remains nonzero, whereas for
, the production rate goes to zero as
. This contrast is a key finding of this investigation: QG flow is able to produce EPE at scales larger than the deformation radius much more efficiently than it is able to convert EPE to EKE, at least when considering only linear conversion processes. However, this mismatch only occurs near the
axis; at large scales away from this axis, the maximal rates of conversion and production are both weak.
Figure 2 compares the conversion and production rates on the same plot for
. For all mean flow configurations, the maximum possible rate of EPE production is larger than the maximum possible rate of linear conversion at large scales. The rate of conversion increases towards small scales, and the maximum possible rate of conversion exceeds that of production at
for the Eady and Phillips mean flows; for the Charney mean flow, conversion exceeds production at much smaller scales (not shown). None of the computed conversion rates violates the asymptotic bound derived in
Section 3.4.
Figure 3 shows the flow configurations
that optimize conversion and production for all three mean flow configurations at
,
. At this wavenumber, the maximal production rate is significantly greater than the maximal conversion rate, and the flow configurations that achieve these maximal rates are markedly different: the vertical tilt of the horizontal flow is opposite for optimal conversion and optimal production.
5. Conclusions
The goal of this paper was to study the mathematical limits on the rate of conversion from eddy potential energy (EPE) to eddy kinetic energy (EKE) with reference to the ocean’s Lorenz energy cycle. The first result parallels one of the results of [
10], namely that when the problem is studied using only the hydrostatic approximation, the results are clouded by the presence of highly unrealistic ageostrophic flow configurations that convert EPE to EKE with an efficiency that is unbounded in the progression to infinitesimal horizontal length scales. Thus, following [
11], the quasigeostrophic version of the problem is studied next so that, somewhat ironically, the more restrictive approximation might lead to more realistic results.
In the quasigeostrophic approximation, the conversion rate is a cubic function of the eddy flow configuration because the vertical velocity is a quadratic function of the QG streamfunction through the Omega Equation (
26) [
22]. Nevertheless, in the presence of a mean flow, the vertical velocity can be written as the sum of two components that depend linearly and quadratically on the eddy flow state; the conversion rate can thus also be written as the sum of two components that depend quadratically and cubically on the eddy flow state. The second result of this study is the discovery that eddy flow configurations that correspond to local extrema of the quadratic part of the conversion rate also correspond to local extrema of the full conversion rate including the cubic term. This enables the use of methods based on Fourier analysis and linear eigenvalue problems to find eddy flow configurations that correspond to local maxima of the conversion rate; whether these local maxima are also global maxima remains an open question.
Studying the local maxima only, it is found that the rate of EPE to EKE conversion goes to zero as the horizontal length scale of the eddy flow configuration grows above the deformation radius, and that as the horizontal length scale decreases past the deformation radius, the rate of conversion appears to grow towards an upper bound. Putting these results together with those of [
11] on the maximal rate of EPE generation leads to the following picture: at large scales, the maximal rate of EPE generation is much larger than the maximal rate of conversion from EPE to EKE. Thus, if EPE is generated near the maximal rate at large scales, it cannot be converted equally rapidly to EKE; to achieve a statistically steady state, the EPE must be transferred towards smaller scales, where it can be converted to EKE efficiently. As these results are only concerned with limits on what the flow can do, they do not predict what the flow will do. Nevertheless, the results are consistent with the phenomenological theory of QG turbulence.
This study raises some questions that remain unresolved. An obvious example is whether there are eddy flow configurations that can achieve higher conversion rates than the ones identified here. The question can be addressed using more advanced analytical methods, or using numerical methods for partial-differential-equation-constrained optimization. Barotropic shear could be introduced into the mean flow profiles; this would prevent the problem from being partially diagonalized by a Fourier basis, and would similarly require more advanced analytical methods, or using numerical methods for partial-differential-equation-constrained optimization. Aside from the cubic problem, there still remain open questions about the quadratic component of the problem. For example, one might attempt to derive a rigorous bound on the quadratic conversion rate to complement the asymptotic analysis provided in
Section 3.4. Finally, the problems in the hydrostatic model could potentially be alleviated by studying conversion in the non-hydrostatic model. In the non-hydrostatic model, all three components of velocity are included in the kinetic energy, which prevents the vertical velocity from growing unboundedly at small scales.