1. Introduction
The mixing of ocean tracers occurs predominantly along directions tangent to the locally referenced potential density surface (Iselin 1939; Montgomery 1940; Solomon 1971; Redi 1982; Olbers et al. 1985; McDougall 1987a; Gent and McWilliams 1990; Ledwell et al. 1993; Kunze and Sanford 1996). The analyses from Ledwell et al. (1993) and Kunze and Sanford (1996) establish the large degree to which tracer mixing along these neutral directions dominates the cross or dianeutral mixing. Their results substantiate the long standing hypothesis that the ocean interior is highly adiabatic in the sense that tracer properties are only very slowly mixed across the neutral directions. In turn, the measurements support the relevance of layer models for simulating the adiabatic aspects of ocean dynamics (e.g., Bleck et al. 1992; Hallberg 1995). Furthermore, they provide a severe constraint that the z-coordinate models must satisfy in order to provide physically realistic simulations of the ocean interior.
As argued by Redi (1982), a large component of ocean tracer mixing can be parameterized as downgradient diffusion along neutral directions, referred to in the following as isoneutral diffusion. Redi’s work provides a conceptual framework allowing the z models to move away from the physically unrealistic horizontal/vertical diffusion in which the diffusive fluxes are oriented according to the local geopotential direction. What is necessary, therefore, is a straightforward rotation of the diffusion tensor to align the diffusive fluxes along the neutral directions. Cox (1987, hereafter C87) implemented a diffusion scheme in the Cox (1984) version of the GFDL ocean model, which attempted to numerically realize Redi’s isoneutral/dianeutral diffusion. The C87 diffusion scheme improved certain aspects of the z model’s fidelity when compared to the traditional horizontal diffusion simulations. In particular, C87 produced a more realistic thermocline and mitigated the Veronis effect (Veronis 1975; Gough and Lin 1995; GFDL Ocean and Climate Groups 1996, personal communication). Cox’s work forms the foundations upon which numerous other models have grown over the years, and the original C87 discretization of isoneutral diffusion has remained fundamentally unchanged.
Over the course of long-term climate integrations (order ≥ 100 years), the effects from how tracer mixing is parameterized become especially visible in a numerical simulation. It is on such timescales that the positive affects from the C87 scheme have been diagnosed. Unfortunately, it is on such timescales that the problems with this scheme are also most apparent. Namely, C87 contains a numerical instability, whose characteristics are described in this paper, that prevents it from being run without an added nonneglible amount of horizontally aligned background diffusion. This background diffusion contributes to the general overly diffused nature of the coarse resolution z models. One result is that tracer properties are not well preserved over the thousands of kilometers seen in observations; a problem intimately related to the models having too much dianeutral mixing. Large-scale preservation of tracer properties is essential in order to utilize the z models for climate simulations. This paper focuses on the question: Is it possible to realize isoneutral diffusion in a z-coordinate ocean model in a physically based and numerically stable fashion so that no added background diffusion is required? The new diffusion scheme documented in this paper indicates that it is possible to do so within certain limitations.
The problem with background horizontal diffusion is most apparent when gauged in terms of the smallness of the measured dianeutral diffusivity. Background diffusion can create a tracer flux that dominates the flux parameterized with dianeutral diffusivity in regions where the isoneutral slope S is larger than (AD/Aback)1/2, where Aback is the horizontal background diffusivity and AD is the dianeutral diffusivity. The typical background diffusivity used with the C87 scheme is normally larger than 10% of the isoneutral diffusivity, which means Aback ≈ 106 cm2 s−1. With an observed AD ≈ 0.1 cm2 s−1 (Ledwell et al. 1993; Kunze and Sanford 1996), background horizontal diffusion contributes to a larger dianeutral flux than that explicitly parameterized by AD when the isoneutral slopes are greater than 3 × 10−4, which is a modest slope commonly realized in the ocean. The problems associated with overly large dianeutral diffusion have been pointed out in various studies (Veronis 1975; McDougall and Church 1986; Gough and Welch 1994; Böning et al. 1995; Gough and Lin 1995; Hirst et al. 1996). Some of these references point to the problems with traditional horizontal/vertical diffusion, for which the C87 scheme was meant to remedy. As mentioned previously, C87 indeed mitigated the problems, yet because of the need for background diffusion, it did not solve them completely [see especially Hirst et al. (1996) for focus on problems with horizontal background diffusion]. Additionally, the recent confidence taken in the AD ≈ 0.1 cm2 s−1 measurements places a high priority on finding the means to eliminate essentially all background diffusion in the z models.
In addition to diffusive tracer mixing, Gent and McWilliams (1990, hereafter GM90) argued for the presence of a quasi-adiabatic stirring mechanism that conserves all tracer moments between isoneutral layers, yet systematically reduces the isoneutral slopes and so acts as a sink for available potential energy. This transport can be parameterized by a divergence-free advective velocity (Gent et al. 1995), or equivalently with an antisymmetric stirring tensor (Plumb and Mahlman 1987), which gives rise to a skew-diffusive flux (Griffies 1998). The GM90 stirring therefore complements that mixing obtained with the symmetric Redi diffusion tensor. Many ocean modelers have implemented the GM90 scheme in z models, in addition to C87 diffusion, and have found the simulations to have an added amount of realism over models without GM90 (e.g., Danabasoglu and McWilliams 1995; Large et al. 1996). As a result, the GM90 ideas, along with Redi diffusion, may provide a useful, albeit incomplete, framework for parameterizing ocean tracer mixing. It follows that in order to evaluate the affects of isoneutral diffusion and GM90 eddy-induced advection (or skew diffusion) in z models, it is necessary to provide a clean numerical realization of both processes. This paper focuses on the Redi diffusion process, and the paper by Griffies (1998) focuses on GM90 skew diffusion.
Before entering the main body of this paper, it is useful to summarize the two central results that provide a foundation for our new isoneutral diffusion scheme: First, we find that it is necessary to build the property of downgradient tracer diffusion, and the consequent variance reduction, into a numerical isoneutral diffusion scheme. Otherwise, the scheme can produce uncontrollable upgradient fluxes and result in model blowups. Second, it is necessary to balance the isoneutral fluxes of the active tracers in order to realize a zero isoneutral flux of locally referenced potential density. Otherwise, even for a variance reducing scheme, grid-scale noise can result. Both of these properties are fundamental to what is meant by isoneutral diffusion, both in the continuum and on the numerical lattice.
The plan of this paper is the following. Section 2 presents numerical experiments that illustrate the problems with the C87 diffusion scheme. In section 3, kinematical properties of isoneutral/dianeutral diffusion are discussed. In section 4, problems of the C87 scheme are interpreted in terms of these kinematical properties. Section 5 describes a new discretization of isoneutral diffusion based on a functional formalism. Section 6 presents results from model simulations using this scheme. Section 7 presents summary and conclusions. There are five appendices: appendix A critiques a linear stability analysis that incorrectly concludes that the C87 scheme is unstable for large isoneutral slopes; appendix B discusses basic mathematical properties of cabbeling and thermobaricity; appendix C considers issues related to realizing steep isoneutral slopes and provides a critique of the commonly used slope clipping scheme of C87; appendix D discusses what is meant by “downgradient diffusion” in the new isoneutral diffusion scheme; and appendix E provides details for discretizing the full Redi tensor.
Some conventions should be noted. The word “isoneutral” is preferred to the more commonly employed“isopycnal,” by which we mean processes occuring in the neutral directions. “Temperature” refers to potential temperature θ, which is temperature referenced to the ocean surface. Tensorial notation is employed, in which the labels m, n, which represent the coordinates x, y, z, are assumed to be summed if repeated. Vectorial notation is also employed for added clarification. The labels i, j, k refer to discrete spatial grid points in the x, y, z directions used for expressing equations on the lattice, and no summation will be assumed for these labels. The diffusive flux discretization is lagged one time step in order to ensure stability of the time discretization (Haltiner and Williams 1980); the time step is omitted for brevity. Model results in this paper are computed with the GFDL MOM 2 Ocean Model (Pacanowski 1996).
2. Model experiments
For the numerical experiments presented in this paper, a sector domain is employed that extends from 5°N to 65°N latitude and 60° wide in longitude. The horizontal grid resolution is 2.4° × 2.4° and there are 18 unevenly spaced vertical levels with a flat bottom at 4000 m. A rigid-lid boundary condition is applied to the surface. The external mode is solved using the streamfunction method with a conjugate gradient algorithm and 9-point numerics for the Laplacian. For the cases in which both temperature and salinity are active, the surface level is forced by restoring to the zonally averaged annual mean Levitus (1982) climatology. When salinity is uniform and constant, the linear temperature profile of Cox and Bryan (1984) is used. Both cases use a 50-day timescale for the restoring, defined over the 35-m top level. The model diagnoses density by using the Bryan and Cox (1972) equation of state, which consists of separate cubic approximations to the UNESCO equation of state (Gill 1982) for each vertical model level. The dianeutral diffusivity is constant with depth. Momentum is dissipated with constant viscosity of 109 cm2 s−1 in the horizontal and 10 cm2 s−1 in the vertical and no wind forcing is used. The scheme of Rahmstorf (1993) is employed to completely gravitationally stabilize the vertical columns at each time step. In regions where the slopes of the neutral directions steepen, the scaling of the isoneutral diffusion coefficient as described by Gerdes et al. (1991) is used in order to maintain linear stability of the diffusion equation. Further discussion of this scaling is provided in appendix C. The time steps are 1 h for the momentum and external mode, 16 h for the tracers, and no other acceleration is used. The basic steady-state thermohaline circulation seen in such a model was discussed by Bryan (1975).
Using this model configuration, various test problems are carried out based on the following approach. First, the model is spun up to a pseudoequilibrium using a vertical diffusivity of 0.5 cm2 s−1 and horizontal diffusivity1 of 107 cm2 s−1. The integration is long enough to establish the thermocline and outcropping in the northern part of the basin associated with free convection. Figure 1 provides a meridional snapshot of the temperature after 3400 years for the case with temperature and salinity both active. After such a spinup, horizontal diffusion is switched to various isoneutral diffusion schemes. These “switching” experiments provide an inexpensive means to evaluate the integrity of the numerics. Model years will refer to years subsequent to this switch.
a. Experiments with C87
The special case of an ocean model with buoyancy dependent on a single variable, such as a single active tracer, is a useful place to begin testing the C87 diffusion scheme. In the continuum, the isoneutral portion of the diffusion tensor will not act on this tracer since the neutral directions are parallel to the isotracer surfaces. This property is true for both linear and nonlinear equations of state (section 3 and appendix B provide further discussion). To test the ability of the C87 scheme to respect this property, we ran the sector model using horizontal diffusion with salinity fixed throughout the domain at 35 psu, and allowed temperature to be active with the nonlinear equation of state. We then switched from horizontal to C87 diffusion using 107 cm2 s−1 for the isoneutral diffusion coefficient, removed all surface forcing, and turned off tracer advection.
Figure 2a shows the solution 600 years after the switch. As expected, vertical gradients of isotherms are weakened due to the nonzero dianeutral diffusivity. However, there is a near complete loss of numerical integrity. In the north, temperature has undergone a vigorous amount of vertical mixing due to convection acting on the unstable solution. In the south, where convection is absent, the presence of unstable grid waves is clear. These waves grew unbounded in subsequent years of integration, which caused the solution to eventually blow up. Figure 2b shows the temperature at year 300 for the case in which salinity is also allowed to change. This solution for two active tracers is consistent with the single active tracer experiment. For both cases, setting the dianeutral diffusivity to zero increased the growth rate of the instability by roughly one order of magnitude.
Dispersion errors associated with grid Peclet number violations encountered with centered advection (Bryan et al. 1975; Weaver and Sarachik 1990) have traditionally been considered one of the main reasons for employing horizontal background diffusion with C87 (see comment at the end of C87). Since tracer advection is removed from these experiments, Peclet grid noise has nothing to do with this unstable behavior.
Another example of the problems with the diffusion scheme can be seen when performing the switching experiment while maintaining an Eulerian advective transport. For centered differenced advection, the solution blows up (not shown) even sooner than without advection. This behavior might be expected based on grid Peclet arguments. A reasonable question to ask is whether a completely monotonic advection scheme, such as flux corrected transport (FCT) (see Gerdes et al. 1991), is sufficient to suppress the C87 instability. Although FCT acts only on the advective fluxes, it might provide enough dissipation to stabilize the C87 scheme. Figure 2c shows that this possibility is not completely realized, even with a single active tracer for which the diffusive fluxes should vanish. After switching the diffusion from horizontal to C87, switching the advection from centered to FCT, and releasing the surface forcing the model undergoes a spindown in which the isotherms eventually flatten and spread apart due to the dianeutral diffusion and lack of surface forcing. However, by year 900, there is a nontrivial grid wave whose growth is able to overcome the stabilizing aspects of the FCT scheme. The amplitude of this grid wave continued to grow in subsequent years. Turning off all diffusion and using FCT alone resulted in smooth and flat isotherms, with no sign of grid noise.
The implementation of GM90 eddy-induced advective transport in z-coordinate models has been associated with the ability to remove the background horizontal diffusion otherwise necessary with the C87 scheme. It is noted that GM90 acts to reduce the isoneutral slopes, thus producing more horizontally aligned diffusive fluxes. Such a reduction in slopes may provide for increased numerical stability according to the linear stability analysis discussed in appendix A. However, that analysis is in error and, so, is not relevant for isoneutral diffusion. A different reasoning for why GM90 stabilizes certain experiments is provided in Griffies (1998). Unfortunately, as seen in Fig. 2d, the switching experiment with GM90 eddy-induced advection, C87 diffusion, two active tracers, and 0.5 cm2 s−1 dianeutral diffusivity is not stable. Indeed, when compared to Fig. 2b, the solution looks even worse, and it blows up sooner than without GM90 advection. Such behavior is consistent with the problems encountered when implementing GM90 eddy-induced advection as described by Weaver and Eby (1997). As discussed by Griffies (1998), the problems with this GM90 experiment are related to the method of implementing both the GM90 closure as well as isoneutral diffusion.
b. Comments on the switching experiments
We performed numerous other switching experiments in this and other model configurations with various permutations of subgrid-scale parameterizations. Consistently, the only way to eliminate the unstable grid waves was to add at least 10% background horizontal diffusion to the C87 diffusion scheme. As discussed in the introduction, such background diffusion is not a viable choice for realistic climate modeling. It should be noted that when running the C87 scheme with centered advection from the start of a spinup from uniform tracer fields, the sector model remained stable, with only modest grid noise. Therefore, the switching experiments provide substantially stronger tests of the numerical integrity than the spinup experiments. The reason is that the diffusive fluxes of the active tracers have the potential to be much larger in the period after switching between diffusion processes than during a spin up from rest. Within the spirit of evaluating the performance of a scheme under various conditions, some of which could be realized in more realistic models run under time-varying forcing and with bottom topography, we conclude that the C87 scheme is unsound. The remainder of this paper is devoted to providing the physical and numerical understanding necessary to interpret these problems with C87 and thereafter to derive a new scheme that aims to rectify them.
3. Kinematics of isoneutral diffusion
a. The diffusion tensor and the diffusion operator
A diffusion tensor
b. Tracer variance
The previous discussion suggests the following example in order to illustrate an important point. Consider a tracer field with its power concentrated in the long wavelengths; that is, it is a smooth yet nonuniform tracer field. Allow this tracer to diffuse downgradient along static neutral directions that have a lot of spatial power in high wavenumbers: for example, the density field has a grid noise structure. Isoneutral diffusion will result in the tracer becoming aligned along the neutral directions, which means that it will have power transferred to the high wavenumbers. In the process, the total tracer variance will reduce due to the dissipative nature of diffusion. Therefore, even though tracer variance will not increase with isoneutral diffusion, it is possible to introduce grid noise into the tracer field if such noise is in the density field. This point will prove fundamental in the development of section 4.
c. Balance of the active tracer isoneutral diffusive fluxes
The isoneutral diffusion operator is a nonlinear function of active tracers through the dependence of the diffusion tensor on temperature and salinity. In this way, isoneutral diffusion differs fundamentally from more traditional forms of linear diffusion in which the diffusion tensor is independent of tracers. Since the isoneutral/dianeutral diffusion tensor is constructed locally according to the dianeutral unit vector
In the ocean, neutral directions are those for which an adiabatic displacement of a parcel is not affected by buoyancy forces. As a familiar example, note that after a free convective event, a vertical column is typically neutrally buoyant and the vertical is correspondingly a neutral direction. McDougall (1987a) formalized this idea to all three directions, and argued that because mixing can act unopposed by buoyancy forces, neutral directions are relevant for orienting the tracer diffusive fluxes.
As discussed by McDougall and Jackett (1988), it is not possible to define a coordinate γ that can globally describe the envelope of neutral directions, unless one neglects the affects of cabbeling and thermobaricity through linearizing the equation of state (see appendix B for further details). However, all that is necessary for our purposes is a local description, which is available for the general case of two active tracers with a nonlinear equation of state. For this description, start by noting that through any point in the ocean with pressure p, there passes a potential density surface ρp(θ, s), which is referenced to the same pressure p. McDougall (1987a) showed that the neutral directions at this point lie within the tangent plane to the ρp surface at this point. It follows that the plane’s unit normal vector ∇ρp|∇ρp|−1 at this point is perpendicular to the neutral directions within this plane, so it can be identified with the dianeutral unit vector
It is useful to relate ∇ρp to the active tracer gradients when computing
4. Analysis of the C87 diffusion scheme
a. Grid-scale computational modes
The practical difficulty encountered when discretizing isoneutral diffusion in the z models concerns how to handle the off-diagonal terms. This issue is not unique to the B grid used in the GFDL model since the physical process (tracer diffusion) concerns only tracer points and tracer cells. The discretization given by C87 considers tracer gradients and the corresponding isoneutral slopes to be independent, each of which needs to be spatially averaged in order to bring it onto the relevant face of the T-cell. In addition, the horizontal and vertical density gradients appearing in the isoneutral slopes are handled independently of one another. Therefore, the average operator appears individually on only the numerator or denominator. There is nothing fundamental about this particular discretization. Rather, it merely represents a series of convenient choices based on details of the model grid.
In general, when working on the B grid and acting on a single field, such combinations of an average in one direction combined with a derivative in the same direction introduce computational modes. The potential for grid splitting and possible amplification of the modes must be addressed when such modes exist. Namely, if these modes are either completely invisible to the dissipation, or worse if they are amplified, then they can be very harmful to the integrity of the numerical solution. In particular, when appearing in the active tracer fields, and therefore in the density field, certain computational modes can lead to model instabilities, as we now show.
b. Increasing tracer variance
In general, an upgradient component to the diffusive flux vector can be the result of projecting the downgradient isoneutral flux vector onto orthogonal Cartesian axes. As such, it is important to distinguish between particular flux components, which may be upgradient, and the diffusive flux vector, which should be downgradient with respect to the tracer gradient (Property I in section 3b). The above discrete configuration is an example of how, in the C87 numerical scheme, an upgradient component in the x direction is not compensated by a downgradient component in the z direction, since the z component identically vanishes. Therefore, the numerically realized diffusive flux vector is upgradient. Furthermore, this upgradient flux can in general be distributed over the extent of the model domain, thus ensuring the increase in tracer variance. The rate of increasing variance is directly proportional to the difference between the slopes, and to the value of the diffusion coefficient. Since the strength of the growth in variance is proportional to AI, the larger the isoneutral diffusivity, the larger the background horizontal diffusivity needed to suppress the variance increase.
Figure 5 shows a simple realization of this discussion in which a passive tracer field is diffused in the background of a density held constant in time with a 2Δy structure (ρi,j+1,k = ρi,j−1,k), constant in longitude, and stratified in depth. The passive tracer is initialized with unity on the surface level and zero below. There is no surface forcing, advection is turned off, and there is zero dianeutral diffusivity. As anticipated, the result is a uniform increase in tracer variance.
c. Active tracer fluxes and the basic problem with C87
Figure 6 illustrates an unstable profile corresponding to the above inequality combined with a 2Δx computational mode. With stable stratification, θ1 > θ2, and so point A is a local minimum with respect to the horizontal direction x̂. Upgradient fluxes along x̂, with zero compensating vertical flux (recall that the vertical flux vanishes for this 2Δx mode), will cause point A to cool. The converse occurs at point B. The tendency is to increase the amplitude of the wave thus providing for its instability. The upgradient fluxes induce an unbounded increase in temperature variance so long as inequality (16) is maintained. The situation for two active tracers is similar, only now the dashed lines in Fig. 6 denote isolines of locally referenced potential density. In effect, the nonlinear equation of state, combined with the upgradient fluxes and computational modes, feeds “energy” into the numerical solution, which causes the tracer variance to grow unbounded. It is this instability that must be stabilized by the introduction of Aback ≈ AI background horizontal diffusion.
d. A nonlinear instability due to spurious density fluxes
Upgradient diffusive fluxes are generally destabilizing in a numerical diffusion scheme. Furthermore, they do not correspond to the physical process of interest. Therefore, in developing the new isoneutral diffusion scheme, much effort is focused toward correcting this problem (see section 5). As discussed here, however, reducing tracer variance is not sufficient to stabilize the profile shown in Fig. 6, or any wave structure in which the inequality (16) is respected. Stability is realized only if there is exact alignment between the locally referenced potential density surfaces (dashed lines) and the computed neutral directions (solid line).
To see why this profile is generally unstable, consider any profile in which the inequality (16) is satisfied, for which Fig. 6 provides a particular case. Point
The instability represented by Fig. 6 is equivalent to the following analytical discussion. Consider a temperature profile that in some local interior region of the ocean takes the form θ = θo + bz + B(t) cos(2πx/L) and let the directions along which θ is diffused downgradient be determined by ρ = ρo + az + A(t) cos(2πx/L). Here ρ surfaces equal neutral surfaces when the slope ratio Sρ/Sθ = (Ab)/(Ba) is unity. As suggested by Fig. 6, let ρ be directly coupled to θ, yet let it be an imperfect approximation to locally referenced potential density. The misalignment introduces an unphysical degree of freedom. The zonal wavelength L corresponds to the grid scale over which the slopes are computed in the numerical case, and the constants a, b determine a stable linear vertical stratification. Assume the isoneutral fluxes are computed with the small slope tensor and a constant diffusivity. The downgradient diffusion of temperature along the ρ surfaces induces the evolution ∂tB(t) = AI(2π/L)2(Ab/a − B). A normal diffusive adjustment to the misaligned slopes causes the temperature wave amplitude to change. However, because the neutral directions are directly dependent on temperature, as the temperature wave changes, so does the ρ wave. The amplitude of both waves grows as long as Sρ/Sθ > 1. Even if the slope misalignment is small, the diffusivity AI ≈ 107 cm2 s−1 can provide a nontrivial growth. This growth is largest when the wave is short, as for the case when L corresponds to the grid scale in a numerical model. Variance is reduced if the diffusion is downgradient and the boundaries are either insulated or held with a fixed tracer value (see section 5). Therefore, all waves in a model will not grow; only those for which the slope misalignment is relevant. What occurs, therefore, is a reduction in variance with some of the original spectral density of variance being transferred preferentially to the grid scale. This process must saturate since variance is bounded from below (i.e., it is ≥0). At saturation, the amount of small-scale spectral density is bounded above by the variance in the initial condition.
Ideally, both prescriptions A and B eliminate the numerical instability by zeroing out the isoneutral diffusive flux of locally referenced potential density along the computed neutral directions; that is, they balance the active tracer fluxes and hence provide a self-consistent discretization of isoneutral diffusion. In practice, prescription A performs slightly less satisfactorily than B perhaps due to truncation errors allowing for slight imbalances. Additionally, as discussed in appendix C, in order to solve the diffusion equation in regions of steep isoneutral slopes it is necessary to solve the vertical piece of the diffusion equation implicitly. The effect is to split the vertical flux Fz(T) = Kzn∂nT into a piece (Kzz∂zT) solved implicitly and another (Kzx∂xT + Kzy∂yT) solved explicitly. The split introduces the possibility of numerical mismatch between the two parts and, so, cannot in general ensure
5. The new isoneutral diffusion scheme
a. Functional formalism
As previously mentioned, the functional formalism is strictly useful for the case of linear diffusion of the passive tracers. Since the active tracers are diffused with the same operator as the passive tracers, this restriction is of no substantial limitation. Yet, in order to prevent the nonlinear computational instability described in section 4, it is crucial to add to the functional formalism the constraint that the slopes be computed so that the active tracers closely approximate the self-consistency or balance condition on the isoneutral diffusive fluxes αFI(θ) = βFI(s). Otherwise, even though the scheme will not increase tracer variance, it will be subject to grid noise.
In summary, for any symmetric and positive semidefinite diffusion tensor, the functional formalism allows for a straightforward incorporation of the variance reducing properties implied by such a mixing tensor to be readily built into a discretization scheme. The procedure is to first discretize the functional
b. Discretizing the functional
The functional given by either Eq. (23) for the full tensor, or (25) for the small angle approximation, consists of quadratic terms that take the form (∂mT∂nρ − ∂nT∂mρ)2, with m ≠ n. Their discretization defines grid stencils in the corresponding two-dimensional (m, n) plane. This observation motivates a discretization of the functional where its different pieces are discretized separately within their respective two-dimensional plane. The exception to this two-dimensionality arises from the term |∇ρ|−2, which occurs due to the two factors of the dianeutral unit vector
In the longitudinal–depth plane, consider the term (∂xT∂zρ − ∂zT∂xρ)2. A simple discretization is to use nearest neighbor tracer grid points for constructing the discrete differential operators. Since the model grid is staggered, such a choice employs second-order accurate difference operators in the functional, and will likewise be the case for the corresponding diffusion operator. Higher order in accuracy schemes can be derived by extending the stencil outward to include more points. Using the nearest neighbors, a typical component of the discretizaton of (∂xT∂zρ − ∂zT∂xρ)2 consists of triads of tracer and density values. For example, one such triad contains contributions from the grid points (i − 1, k), (i, k), and (i, k − 1), where the corresponding term in the functional is (δxTi−1,kδzρi,k−1 − δzTi,k−1δxρi−1,k)2. As seen in Fig. 8, this triad is one of 12 that contain contributions from the central tracer point Ti,k.
These details about quarter-cells, triads, and their respective volumes are important since they make precise the notion about downgradient diffusion on the lattice. Namely, these 12 quarter-cells define the finite size volume, mentioned in the discussion of Eq. (22), for which the new scheme provides downgradient fluxes of the tracer. Since the finite volume encompasses many grid cells, the locally defined diffusive fluxes discretized on the faces of the T-cell (next subsection) will not generally satisfy the downgradient property individually. In other words, the new scheme is not“positive definite” for each cell, rather it is positive-definite only over the semilocal finite volume defined by the 12 quarter-cells. This is an important qualification that must be kept in mind when interpreting results from this scheme. Note, however, that the new scheme will not increase tracer variance since variance reduction depends only on the negative semidefiniteness of the globally defined functional. The finite volume interpretation of downgradient diffusion in the new scheme is made more mathematically precise in appendix D.
c. Flux discretization with density triads
For the purpose of describing the discretization, it is sufficient to discretize the diffusion flux from the small angle tensor since the basic ideas are the same for both the small angle and full slope diffusion tensors. It is also sufficient to continue considering the two-dimensional longitudinal–depth geometry. We start by discretizing the x component Fx(T) = −AI(∂xT + Sx∂zT) on the east face of the T-cell (i, k) (see Fig. 3). Recall that in the C87 discretization discussed in section 4, the central practical difficulty is how to handle the off-diagonal term AISx∂zT = −AI(∂xρ/∂zρ)∂zT. When employing density triads as fundamental units, notice that there are four triads surrounding the east face of each T-cell. Each defines a discretization of the isoneutral slope Sx, so each has an associated diffusion coefficient AI. The diffusion coefficients are selected according to one of the chosen slope constraints discussed in appendix C in order to satisfy the requirements of linear stability for steep isoneutral sloped regions. In addition, for each triad it is necessary to choose a reference point for calculating the locally referenced potential densities. This point can be chosen anywhere as long as there is a unique point for each of the triads. Since the equation of state is already computed for each tracer point, it is convenient to choose the corner of the triad as the reference point. By constructing AISx∂zT as an average over these four triads, each multiplied by their respective vertical tracer gradient, the discretized diffusive flux component Fx is correctly placed at the east face of the T-cell. Additionally, in order to account for nonuniform grids, it is important to weight each term in the average by its associated vertical grid spacing. Figure 9 summarizes this discussion by showing the stencil for the new scheme. Importantly, it requires the same six densities and tracers as for the C87 scheme (see the C87 stencil in Fig. 4), so the grid “footprint” is identical to that of C87.
It is important to highlight the direct computation of the expansion coefficients ρθ and ρs for the purposes of computing the density gradients and, hence, for computing the isoneutral slopes. As discussed in section 4, a proper alignment of the computed neutral directions will eliminate spurious and unstable fluxing of locally referenced potential density. Otherwise, the new scheme, even though it cannot increase tracer variance, would be exposed to a form of the nonlinear instability described in section 4d and hence subject to grid noise. A viable alternative to computing the density gradients as given in Eq. (32) is to directly diagnose one of the active tracer fluxes from the other (i.e., prescription B discussed in section 4d). Such a scheme was tested, but it did not alter the solution substantially. The choice of computing the slopes in terms of the expansion coefficients represents a decision to emulate as closely as possible the kinematics discussed in section 3, and in particular Eq. (7) for the dianeutral unit vector. The direct calculation of ρθ and ρs is therefore a fundamental and novel aspect of the new scheme.
The diagonal piece of the flux consists of the horizontal gradient of the tracer weighted by the four diffusion coefficients, which correspond to the four triads used for constructing the off-diagonal term. Should the slopes for the four triads all lie within the stability range and therefore not require scaling, then the sum collapses and the diagonal piece becomes AIδxTi,k, where AI is the unscaled diffusion coefficient.
The resulting diffusive flux can be thought of as an average of four “subfluxes” associated with each of the four triads. Each of the subfluxes correctly vanishes individually when locally referenced potential density is substituted for the tracer. In this manner, the new scheme provides for a completely symmetric or democratic sampling of the diffusive fluxes associated with each of the four vertical tracer gradients surrounding the east face of the T-cell. Should any one of these subfluxes lie adjacent to a solid boundary, its contribution to the average over the four triads is eliminated. Additionally, the vertical tracer gradient and the corresponding density triad are fundamentally coupled. Therefore, it is not possible to identify a discretization of the Kxz off-diagonal diffusion tensor component independently of the particular tracer, whereas it was possible to do so with the C87 scheme. The resulting discretization of the diffusion operator recovers the traditional five-point Laplacian in the limit of flat isoneutral directions. This operator has well-known stability properties; that is, it fluxes tracer downgradient and it has no computational modes.
6. Numerical tests of the new diffusion scheme
As described in section 4 and illustrated in Fig. 7, the switching experiments to isoneutral diffusion are greatly stabilized upon providing a self-consistent balance of the isoneutral diffusive fluxes of the active tracers so that αFI(θ) = βFI(s). This stabilization was achieved independently of discretizing with the density triads. Analogous tests with the new scheme show similar results (not shown). Furthermore, they indicate that it is possible to remove all explicit dianeutral diffusivity when solving the isoneutral diffusion problem, even in the especially difficult case of strongly evolving active tracers.2 The ability to do so in a z-coordinate model opens up interesting possibilities for exploring mixing effects due to the nonlinear equation of state (see appendix B), which have heretofore been swamped by the horizontal background diffusion. This section briefly illustrates further aspects of the new scheme. All experiments are conducted with zero horizontal background diffusivity.
a. Effects due to the triad discretization
Consider the effects in a two-dimensional model with a 2Δx profile in salinity yet vertical stratification in temperature and allow the equation of state to be linear.3 The 2Δx salinity structure induces a 2Δx density structure. With this structure, C87 produces a zero vertical flux [Eq. (12)] and an upgradient horizontal flux, which means that the variance created in the x direction is not removed in the z direction. The effect is to move water parcels along neutral directions in θ–s space until there is a 1–1 relationship between θ and ρ. The initial spread in salinity is replaced by a spread in temperature, with the densest water colder than any water initially in the domain, and the lightest water warmer than any initially in the domain. The final range is roughly αΔθ ≈ βΔs. In cold water, α is small so that small horizontal variations in salinity will drive large horizontal variations in temperature, thus creating very cold water. By contrast, the new scheme has no computational mode and so diffuses temperature and salinity in both the vertical and horizontal. The result is the more reasonable temperature–salinity distribution shown in Fig. 10b. All the temperatures now lie within the initial range. There are a few points, however, where the salinity is slightly outside the initial range. These unphysical results may be attributed to small amounts of upgradient diffusion occurring for the individual cell faces (see discussion of the finite volume interpretation of downgradient fluxes in section 5 and appendix D).
Now reconsider the passive tracer experiment shown in Fig. 5 using the new scheme. The tracer variance for this experiment, shown in Fig. 11a, reduces immediately. Figure 11b shows a meridional slice at the surface in the middle of the basin for both the C87 and new results. This figure shows unphysical values (those values outside the range [0, 1]) for the tracer associated with the increasing variance in the C87 scheme. The new scheme produces physically realistic values.
b. Diffusion and GM90 advection
The unsuccessful relaxation experiment with GM90 advection and C87 diffusion shown in Fig. 2d, which used 0.5 cm2 s−1 dianeutral diffusivity, was cleanly simulated with the new diffusion scheme (not shown). A more difficult problem to solve with the C87 scheme is the case with identically zero dianeutral diffusivity. Such an experiment is possible with the new diffusion scheme (as seen in Fig. 12). The solution relaxes toward a horizontally uniform state, as expected since the GM90 scheme acts to reduce the available potential energy of the system. We found a similar smooth relaxation when allowing the Eulerian velocity to act on the tracers through the addition of a centered difference advective flux, in addition to the GM90 advection and isoneutral diffusion (not shown).
c. Computational timing requirements
The computational requirements of the new diffusion scheme are more than the C87 scheme. With the idealized sector model used in this study, the new diffusion scheme using the small angle tensor took roughly 10%–20% more computational time than the MOM 2 implementation of the C87 scheme. These numbers represent the time for the model as a whole. This result is consistent with comparisons made in a realistic coarse-resolution global ocean model (not shown). The discretization provided in appendix E for the full isoneutral diffusion tensor took roughly six times longer than the new small angle tensor. If further justification is provided for employing the full tensor (see appendix C for more discussion), then some sort of approximation to the scheme provided in appendix E should be considered in order to reduce the large time requirements for using this tensor. In general, for the small tensor, the 10%–20% increased time requirements over the C87 scheme appear modest considering the substantially improved numerical representation of the isoneutral diffusion process.
7. Summary and conclusions
The main purpose of this paper was to present the physical and numerical properties that are necessary in order to stably and accurately realize isoneutral/dianeutral diffusion of tracers in z-coordinate ocean models. Model tests with the C87 (Cox 1987) diffusion scheme were seen to result in unphysical and unstable solutions. In addition to showing problems when diffusing with the C87 scheme, problems occured when C87 diffusion was run with either FCT Eulerian advection or the GM90 eddy-induced advective transport. Although exceptions to these negative results may occur in other flow regimes, the goal of our study was to provide a diffusion scheme that performs well under a broad spectrum of flows, including those undergoing large perturbations such as might occur in realistic climate variability and climate change experiments. Therefore, our evaluation of the C87 scheme leads us to conclude that it does not generally provide a numerically sound or physically realistic simulation of isoneutral diffusion in z-coordinate ocean models.
After the model tests with the C87 scheme, some basic kinematical properties of isoneutral/dianeutral diffusion were discussed. From this analysis, two central properties desired from a numerical diffusion scheme were identified: (I) It should flux tracers downgradient, which implies that it will not increase tracer variance. (II) It should provide for a proper orientation of the neutral directions so that there is a balance between the isoneutral fluxes of the active tracers, thus providing for a zero isoneutral diffusive flux of locally referenced potential density. Property II is a self-consistency condition for isoneutral diffusive fluxes. This property distinguishes isoneutral diffusion of active tracers, which is a nonlinear process due to the dependence of the diffusion tensor on the active tracers, from linear diffusion processes such as that for passive tracers. Based on this discussion, the following core problems with the C87 scheme were identified: (i) The presence of computational grid-scale modes and (ii) the ability to increase tracer variance when (a) these computational modes appear in the density field and (b) the equation of state is nonlinear, which results in a computed neutral direction that is steeper than the isolines of locally referenced potential density. The nonlinear equation of state, combined with the upgradient flux associated with the computational mode, feeds the growth in the numerical instability that causes the active tracer variance to grow unbounded.
Besides upgradient fluxes, there is another problem with the unstable density profile associated with the C87 instability (Fig. 6). The problem is fundamentally associated with the misalignment of the computed neutral directions and isolines of locally referenced potential density. A result of the misalignment is that the isoneutral diffusive flux of the active tracers is not constrained to satisfy the balance αFI(θ) = βFI(s). The imbalance of active tracer fluxes in C87 effectively provides for an extra computational degree of freedom. This extra degree of freedom will manifest as an unstable mode if the computed neutral directions are steeper than the isolines of locally referenced potential density. Any scheme satisfying this misalignment, even if it reduces variance, will produce an accumulation of power at high wavenumbers; that is, grid noise. We have seen this result in tests with the variance reducing scheme in which we purposefully computed the neutral directions as in C87 (not shown). What occurs amounts to an unstable form of self-diffusion, where the scheme attempts to diffuse locally referenced potential density downgradient along spuriously computed neutral directions. The process will saturate since variance reduces, yet variance is nonnegative. Depending on the initial variance, the result could be a substantial amount of power at the grid scale: an effective ripping apart of the solution. The instability depends on the nonlinear nature of active tracer diffusion with the Redi diffusion tensor. The reason is that, if there were no feedback, locally referenced potential density isolines would simply align with the spurious neutral directions, as will occur for a passive tracer in the background of a static density field. Instead, as the locally referenced potential density tries to align, the spurious neutral directions are themselves altered, thus leading toward a continued growth in grid-scale waves until the process finally saturates.
As a corollary, this nonlinear instability does not occur in isopycnal layer models since, when they employ two active tracers, the evolution of the second tracer is diagnosed from the first in such a way to preserve the integrity of the potential density layers. The z models can either perform an analogous diagnosis (prescription B in section 4c), or properly compute the neutral directions (prescription A in section 4c) in order to eliminate the instability.
An appropriate framework for building the downgradient property of diffusive mixing into the discretized isoneutral diffusion operator is a functional formalism. Use of this formalism produces a scheme that does not increase tracer variance. Furthermore, it will diffuse tracers downgradient when considering the net effects over a finite volume region of the grid (see Fig. 8 and appendix D). A central strength of the formalism is that the functional is negative semidefinite, which means that any consistent discretization will also be negative semidefinite. Since the negative semidefiniteness is the crucial property that ensures the downgradient nature of the resulting diffusive flux, any consistent discretization starting from the functional will yield a conditionally stable diffusion operator.
We derived the discretized diffusion operator both through performing the discrete functional derivative [details given in Pacanowski (1996)], as well as more heuristically by identifying triads of density points as fundamental building blocks for use in constructing the isoneutral slopes. The triads, when weighted by their respective tracer gradients, eliminate computational modes since there is no longer any grid splitting. By building in the constraint that the scheme properly orients the neutral directions so that the isoneutral fluxes of the active tracers are balanced, the formalism readily incorporates both the active and passive tracers. The result is a diffusion operator that is consistent, stable, and aims to satisy physical properties that are fundamental to isoneutral diffusion.
There are five appendices that clarify certain details not central to the discussion in the main text. We point to appendix C as being quite important as it provides arguments for employing more sensible and physically based numerical techniques for maintaining linear stability in regions of steep isoneutral slopes. Most notably, the commonly used slope clipping scheme introduced by C87 was shown to be especially unphysical in convective regions. This analysis motivated slope scaling, as suggested by Gerdes et al. (1991), Danabasoglu and McWilliams (1995), or the full slope isoneutral diffusion tensor, as a sensible alternative to slope clipping. Currently, we have no arguments motivating one form of scaling over another [however, see Large et al. (1996) for one argument].
Model tests with the new diffusion scheme indicate that it effectively eliminates the problems with the C87 scheme and, hence, allows for removal of horizontal background diffusivity. The effects from the finite volume downgradient fluxing of tracers, as rendered by use of density triads, provides for a more physically reasonable simulation of passive tracer diffusion as well as a more sensible action on water mass properties. Tests of the new scheme alone or combined with the GM90 advection allow for a clean separation of the effects due to isoneutral diffusion and GM90.
In closing, it is important to highlight some of the shortcomings of the new diffusion scheme. Most notably, the two fundamental physical properties described in this paper that we aimed for in the numerical diffusion scheme are not completely realized. First, the scheme does not satisfy a “downgradient” property at each of the tracer cells. Rather it does so only over a finite volume that incorporates more than one cell (see section 5a and appendix D). As such, for a pure diffusion problem, it is possible to realize tracer values within a particular tracer cell that are outside a physically realistic range due to spurious upgradient fluxes at the cell face. Preliminary experiments suggest that this problem may be most severe for passive tracers (Gnanadesikan 1998, manuscript submitted to J. Mar. Res.) since they are not constrained in a manner similar to the active tracers. The results of Beckers et al. (1997, personal communication) suggest that it is not possible to do better within the constraints of a consistent, linear, numerical isoneutral diffusion operator whose grid stencil extends over the nine points (for a two-dimensional small angle tensor) used here. Second, the new diffusion scheme does not precisely balance the active tracer fluxes in the vertical due to the splitting of the vertical flux into a temporally explicit and implicit piece. The degree to which each of these problems arise will depend on the particulars of the experimental configuration. Some preliminary results reported in Griffies (1998) and Gnanadesikan (in preparation 1997) indicate that in simulations with isoneutral diffusion combined with eddy-advection processes, the solutions are less problematical than with only isoneutral diffusion. Given these caveats, it remains our conclusion that the new diffusion scheme provides a sound tool for systematically addressing various subgrid-scale parameterization theories, including isoneutral diffusion, in z coordinate ocean climate models.
Acknowledgments
We wish to thank Jeff Anderson, Jean-Marie Beckers, Kirk Bryan, Bob Hallberg, Isaac Held, Jerry Mahlman, Trevor McDougall, Tony Rosati, Robbie Toggweiler, Eli Tziperman, and Martin Visbeck for discussions and comments. JKD and RDS wish to acknowledge the support of the DOE CHAMMP program. We dedicate this paper to the memory of Mike Cox.
REFERENCES
Beckers, J.-M., H. Burchard, J. M. Campin, E. Deleersnijder, and P. P. Mathieu, 1998: Another reason why simple discretizations of rotated diffusion operators cause problems in ocean models: Comments on “Isoneutral diffusion in a z-coordinate ocean model.” J. Phys. Oceanogr., in press.
Bell, M. J., 1994: Notes on how to choose parameter values for the Cox numerical ocean circulation model. U.K. Met. Office Forecasting Research Tech. Rep. No. 135, 43 pp.
Bleck, R., C. Rooth, D. Hu, and L. T. Smith, 1992: Salinity-driven thermocline transients in a wind and thermohaline forced isopycnic coordinate model of the North Atlantic. J. Phys. Oceanogr.,22, 1486–1505.
Böning, C., W. R. Holland, F. O. Bryan, G. Danabasoglu, and J. C. McWilliams, 1995: An overlooked problem in model simulations of the thermohaline circulation and heat transport in the Atlantic Ocean. J. Climate,8, 515–523.
Bryan, K., 1975: Three-dimensional numerical models of the ocean circulation. Numerical Models of Ocean Circulation, Natl. Acad. Sci., 94–105.
——, and M. D. Cox, 1972: An approximate equation of state for numerical models of the ocean circulation. J. Phys. Oceanogr.,2, 510–514.
——, S. Manabe, and R. C. Pacanowski, 1975: A global ocean–atmosphere climate model. Part II: The oceanic circulation. J. Phys. Oceanogr.,5, 30–46.
Courant, R., and D. Hilbert, 1953: Methods of Mathematical Physics. Vols. 1 and 2. Wiley, 560 and 830 pp.
Cox, M. D., 1984: A primitive equation, 3-dimensional model of the ocean. GFDL Ocean Group Tech. Rep. 1, GFDL. [Available from GFDL, Route 1, Box 308, Princeton, NJ 08542.].
——, 1987: Isopycnal diffusion in a z-coordinate ocean model. Ocean Modelling, (unpublished manuscripts), 74, 1–5.
——, and K. Bryan, 1984: A numerical model of the ventilated thermocline. J. Phys. Oceanogr.,14, 674–687.
Danabasoglu, G., and J. C. McWilliams, 1995: Sensitivity of the global ocean circulation to parameterizations of mesoscale tracer transports. J. Climate,8, 2967–2987.
Gent, P. R., and J. C. McWilliams, 1990: Isopycnal mixing in ocean circulation models. J. Phys. Oceanogr.,20, 150–155.
——, J. Willegrand, T. J. McDougall, and J. C. McWilliams, 1995: Parameterizing eddy-induced tracer transports in ocean circulation models. J. Phys. Oceanogr.,25, 463–474.
Gerdes, R., C. Köberle, and J. Willebrand, 1991: The influence of numerical advection schemes on the results of ocean general circulation models. Climate Dyn.,5, 211–226.
Gill, A. E., 1982: Atmosphere–Ocean Dynamics. Academic Press, 662 pp.
Goloviznin, V. M., A. A. Samarskii, and A. P. Favorskii, 1977: A variational approach to constructing finite-difference mathematical models in hydrodynamics. Sov. Phys. Dokl.,22, 432–434.
Gough, W. A., and W. J. Welch, 1994: Parameter space exploration of an ocean general circulation model using an isopycnal mixing parameterization. J. Mar. Res.,52, 773–796.
——, and C. A. Lin, 1995: Isopycnal mixing and the Veronis effect in an ocean general circulation model. J. Mar. Res.,53, 189–199.
Griffies, S. M., 1998: The Gent–McWilliams skew flux. J. Phys. Oceanogr.28, 831–841.
Hallberg, R., 1995: Some aspects of the circulation in ocean basins with isopycnals intersecting the sloping boundaries. Ph.D. dissertation, University of Washington, 244 pp.
Haltiner, G. J., and R. T. Williams, 1981: Numerical Prediction and Dynamic Meteorology. 2d ed. Wiley, 477 pp.
Hirst, A. C., D. R. Jackett, and T. J. McDougall, 1996: The meridional overturning cells of a World Ocean model in neutral density coordinates. J. Phys. Oceanogr.,26, 775–791.
Iselin, C. O., 1939: The influence of vertical and lateral turbulence on the characteristics of the waters at mid-depth. Trans. Amer. Geophys. Union,20, 414–417.
Kershaw, D. S., 1981: Differencing of the diffusion equation in Lagrangian hydrodynamical codes. J. Comput. Phys.,39, 375–395.
Korshiya, T. K., V. F. Tishkin, A. P. Favorskii, and M. Y. Shashkov, 1980: Flow-variational difference schemes for calculating the diffusion of a magnetic field. Sov. Phys. Dokl.,25, 832–834.
Kunze, E., and T. B. Sandford, 1996: Abyssal mixing: Where it is not. J. Phys. Oceanogr.,26, 2286–2296.
Large, W. G., G. Danabasoglu, S. C. Doney, and J. C. McWilliams, 1997: Sensitivity to surface forcing and boundary layer mixing in a global ocean model: Annual-mean climatology. J. Phys. Oceanogr.,27, 2418–2447.
Ledwell, J. R., A. J. Watson, and C. S. Law, 1993: Evidence for slow mixing across the pycnocline from an open-ocean tracer-release experiment. Nature,364, 701–703.
Levitus, S., 1982: Climatological Atlas of the World Ocean. NOAA Prof. Paper No. 13, U.S. Govt. Printing Office, Washington, DC,173 pp.
McDougall, T. J., 1987a: Neutral surfaces. J. Phys. Oceanogr.,17, 1950–1967.
——, 1987b: Thermobaricity, cabbeling, and water-mass conversion. J. Geophys. Res.,92, 5448–5464.
——, 1991: Parameterizing mixing in inverse models. Dynamics of Oceanic Internal Gravity Waves, Proc. Sixth ’Aha Huliko ’a Hawaiian Winter Workshop, P. Müller and D. Henderson, Eds., Manoa, HI, University of Hawaii at Manoa, 355–386.
——, and J. A. Church, 1986: Pitfalls with numerical representations of isopycnal and diapycnal mixing. J. Phys. Oceanogr.,16, 196–199.
——, and D. R. Jackett, 1988: On the helical nature of neutral trajectories in the ocean. Progress in Oceanography, Vol. 20, Pergamon, 153–183.
Montgomery, R. B., 1940: The present evidence on the importance of lateral mixing processes in the ocean. Bull. Amer. Meteor. Soc.,21, 87–94.
Olbers, D. J., M. Wenzel, and J. Willebrand, 1985: The inference of North Atlantic circulation patterns from climatological hydrographic data. Rev. Geophys.,23, 313–356.
Pacanowski, R. C., 1996: MOM 2 documentation, user’s guide and reference manual. GFDL Ocean Tech. Rep. 3.1, Geophysical Fluid Dynamics Laboratory/NOAA. [Available from GFDL, Princeton University, Princeton, NJ 08542.].
Plumb, R. A., and J. D. Mahlman, 1987: The zonally averaged transport characteristics of the GFDL general circulation/transport model. J. Atmos. Sci.,44, 298–327.
Rahmstorf, S., 1993: A fast and complete convection scheme for ocean models. Ocean Modelling, (unpublished manuscripts), 101, 9–11.
Redi, M. H., 1982: Oceanic isopycnal mixing by coordinate rotation. J. Phys. Oceanogr.,12, 1154–1158.
Send, U., and J. Marshall, 1995: Integral effects of deep convection. J. Phys. Oceanogr.,25, 855–872.
Solomon, H., 1971: On the representation of isentropic mixing in ocean models. J. Phys. Oceanogr.,1, 233–234.
Tishkin, V. F., A. P. Favorskii, and M. Y. Shashkov, 1979: Variational-difference schemes for the heat-conduction equation on nonregular grids. Sov. Phys. Dokl.,24, 446–448.
Veronis, G., 1975: The role of models in tracer studies. Numerical Models of Ocean Circulation, Natl. Acad. Sci., 133–146 pp.
Visbeck, M., J. Marshall, T. Haine, and M. Spall, 1997: Specification of eddy transfer coefficients in coarse resolution ocean circulation models. J. Phys. Oceanogr.,27, 381–402.
Weaver, A. J., and E. S. Sarachik, 1990: On the importance of vertical resolution in certain ocean general circulation models. J. Phys. Oceanogr.,20, 600–609.
——, and M. Eby, 1997: On the numerical implementation of advection schemes for use in conjunction with various mixing parameterizations in the GFDL ocean model. J. Phys. Oceanogr.,27, 369–377.
APPENDIX A
Clarification of a Linear Stability Analysis
Bell (1994) provides a linear numerical stability analysis of the equation ∂tT = AI(∂xxT + S∂zxT), where S is spatially constant. His analysis, assuming an explicit time step and the usual discretized spatial derivatives, indicates the presence of linear modes whose amplitude grows when |S| > 2Δz/Δx. Assuming that this equation is a relevant idealization of isoneutral diffusion, Bell concluded that the discretization of isoneutral diffusion as given by C87 is linearly unstable if the given slope condition is satisfied. However, the assumption that this equation is relevant for isoneutral diffusion in a z model is incorrect since it neglects an essential diagonal term. Namely, the simplest form of isoneutral diffusion is given by ∂tT = AI(∂xxT + S2∂zzT + 2S∂zxT) = AI(∂x + S∂z)2T, where S is some constant slope. With the ∂zzT term added, a linear analysis of the discrete equation allows for a conditionally stable scheme. Another way to understand this result is to note that with the extra term, it is possible to produce a variance reducing scheme as shown in this paper. Through the usual energy arguments, such a scheme is at least conditionally stable.
APPENDIX B
Cabbeling and Thermobaricity
It is useful to present the explicit form for the mixing effects induced by the nonlinear equation of state. It is worth noting that these effects are trivially implemented within a z-coordinate ocean model by simply employing a nonlinear equation of state when diagnosing density. These effects provide irreversible changes to the locally referenced potential density. An alternative formulation of the following, in much more detail, can be found in McDougall (1991).
In the manner written here, both cabbeling and thermobaricity effects can be readily diagnosed in z-level models. Use of the new isoneutral diffusion scheme described in this paper, which does not rely on background horizontal diffusion, allows for a potentially clean manifestation of these sometimes subtle effects on the formation and transformation properties of water masses.
APPENDIX C
Linear Stability Constraints Related to Steep Slopes
Traditionally, the implementation of isoneutral diffusion in the z-coordinate models has assumed that slopes of the neutral directions are small (C87, GM90). Additionally, this assumption is implicit in potential density (isopycnal) layer models of the type described by Bleck et al. (1992) or Hallberg (1995), which employ a diagonal diffusion tensor (see GM90 for the full, nondiagonal diffusion tensor in isopycnal coordinates). The small slope approximation is valid over a good part of the World Ocean where slopes larger than 1/100 are uncommon. Computationally, it allows fewer terms to be computed in the diffusion operator and so results in significant computational savings. However, in order to satisfy a linear numerical stability condition described below, it is necessary to incorporate ad hoc prescriptions in order to admit steep slopes. The importance of steep slope regions in the ocean are related to their largely determining air–sea exchange processes especially relevant for climate modeling. In addition, recent studies indicate that effects of mesoscale eddies are important in regions near areas of active deep convection where isoneutral slopes are essentially infinite (e.g., Send and Marshall 1995). Therefore, it could be that parameterizing dynamics in steep sloped regions will include quasi-adiabatic stirring, such as GM90, and isoneutral diffusion. For example, Visbeck et al. (1997) have employed the full slope tensor in their simulations of mixing within convection regions since this tensor is valid for all slopes. The purpose of this extended appendix is to provide a linear stability analysis for the full isoneutral tensor. In the process, we provide a summary of the various means available for enforcing the constraints implied from the linear stability analysis relevant for steep isoneutral sloped regions.
Linear stability analysis
Just as for the small angle tensor, the stability constraint for the full tensor is satisfied for small isoneutral slopes. Additionally, the (1 + Σ2)−1 self-regulating factor allows the numerical constraint to also be satisfied for large angles. Thus, the full tensor has two domains of numerical stability: 0 ⩽ Σ ⩽ S(−) and S(+) ⩽ Σ < ∞. The slopes S(±) are determined from solutions to Σ = δ(1 + Σ2). This equation has two roots S(±) = [1 ± (1 − 4δ2)1/2]/(2δ). Note that S(+)S(−) = 1, and so S(−) =
The above analysis is based on the conservative assumption that, if all components to the diffusion tensor produce linearly stable diffusion, then the scheme is linearly stable. Although conservative, our experience has shown that violation of these constraints can result in unacceptably large numerical inaccuracies. These inaccuracies are of special concern since they make it more difficult to realize the balance
As slopes steepen to greater than the grid aspect ratio, the slope that is estimated by the scheme becomes less accurate due to the need to extrapolate outside of the differencing scheme’s stencil. However, in so far as the scheme is respecting the balance in the active tracer fluxes along the computed neutral directions, the scheme is respecting the integrity of the neutral directions. In other words, the new scheme provides a self-consistent discretization of isoneutral diffusion regardless of the neutral direction slopes. In this sense, the errors from this extrapolation are no different in principle than the errors incurred with isopycnal layer models in regions where the neutral directions depart from the isopycnal layers. In addition, as seen in the discussion of section 4, these extrapolation errors are not the cause of the instability in the C87 scheme.
Comparing the slope checking prescriptions
Although perhaps cleaner from a numerical perspective due to its natural rescaling properties, it is not clear whether the full tensor is a physically relevant alternative to the small slope tensor. Further studies are necessary to clarify this issue.
APPENDIX D
Downgradient Diffusion over Finite Volumes
APPENDIX E
Full Tensor Diffusive Flux
Note added in proof. Soon after this paper was accepted, the work of Kershaw (1981) and that of Beckers et al. (1998) came to our attention. The functional approach employed in this paper was also used by Kershaw for his discretization of a diffusion operator. Beckers et al. provide a general proof that the resulting linear rotated diffusion scheme cannot maintain positive definiteness of a tracer field.
With a 107 cm2 s−1 diffusion coefficient, the horizontal diffusion equation grid CFL number AHΔt/(Δx)2 is 2 × 10−4, which is well within the constraints of stability for the linear diffusion equation (see appendix C).
Dispersion errors from advective fluxes may qualify this statement.
This is the only model result in this paper not computed with MOM 2.
We thank T. McDougall for emphasizing this point.