1. Introduction
In the climate modeling community, unforced trends in coupled model simulations are commonly referred to as model drift. Given the potential for drift to contaminate forced signals in climate simulations, it has been a topic of interest throughout the phases of the Coupled Model Intercomparison Project (CMIP). For CMIP2+ (Covey et al. 2006), CMIP3 (Sen Gupta et al. 2012), and CMIP5 (Sen Gupta et al. 2013), drift represented a nonnegligible fraction of historical forced trends in global depth-integrated quantities such as ocean heat content (OHC) and steric sea level over recent decades. For surface and atmospheric variables such as global mean temperature or precipitation (i.e., variables that are less influenced by the slowly evolving deep ocean) drift is less important, but on regional scales it can still represent a substantial fraction of recent historical trends (Sen Gupta et al. 2013).
There are a number of causes of drift in coupled climate models. When a model simulation is initiated, an imbalance inevitably exists between the prescribed initial state (which is commonly estimated from observations) and the representation of physics in the model (i.e., the simulated ocean dynamics, advection and mixing). A coupling shock may also occur when the various model components (e.g., atmosphere, ocean, sea ice) are first joined together, resulting in discontinuities in boundary fluxes (e.g., Rahmstorf 1995). In response, a model will typically drift from its initial state toward a quasi-steady state over time. The time scale over which the system reaches equilibrium depends on how long it takes anomalies to be advected or mixed through the deep ocean, which is typically many thousands of years (e.g., Peacock and Maltrud 2006). The adjustment of the atmosphere and land surface is much faster. The most obvious solution to this issue would be to let the model run to equilibrium before performing any experiments of interest. The problem is that state-of-the-art coupled climate models are computationally expensive, which makes a “spinup” period of many thousands of years impractical. Instead, models are generally spun up for a few hundred years. Experiments will therefore exhibit changes/trends associated with incomplete model spinup, as well as changes related to external forcing or internal climate variability. The overall reduction in drift from CMIP2+ to CMIP5 has been primarily attributed to longer spinup times and more careful initialization of the coupled ocean–atmosphere system (Sen Gupta et al. 2013).
In addition to incomplete model spinup, drift is also caused by spurious mass or energy “leakage” into or out of the simulated climate system. This nonclosure of the global mass and energy budgets arises due to small inconsistencies in the model treatment of energy (Lucarini and Ragone 2011; Hobbs et al. 2016) and/or water (Liepert and Previdi 2012; Liepert and Lo 2013). In relation to the global energy budget, an essential characteristic is a close correspondence between the globally integrated net top-of-the-atmosphere radiation (netTOA) and OHC, because the latter represents Earth’s primary energy store (Palmer et al. 2011). In CMIP5 models, the difference between the time-integrated global netTOA and changes in OHC is overwhelmingly characterized by an approximately time-constant bias that is insensitive to changes in model forcing (i.e., it is the same for all experiments; Hobbs et al. 2016). This means it is generally possible to correct (or “dedrift”) output from a coupled model experiment by subtracting a drift signal taken from the corresponding control experiment. When calculating this drift signal there is the potential to overfit and thus remove low-frequency signals associated with internal variability, so there are a number of (somewhat subjective) decisions to be made about fitting a linear or higher-order polynomial (or high-pass filter) to either the full length or a shorter segment the control time series (Sen Gupta et al. 2013). Once the data have been dedrifted, most CMIP5 models are approximately energy conserving (Hobbs et al. 2016).
The practice of dedrifting is commonplace in studies concerned with forced trends in model variables that have an obvious link to the slowly evolving deep ocean (e.g., OHC and steric sea level; Palmer et al. 2018; Bilbao et al. 2019), but it is less well understood and applied in the context of net time-integrated heat and water fluxes into the atmosphere and ocean. For instance, changes in meridional transports of ocean heat and freshwater can be inferred from cumulative surface heat and freshwater fluxes (e.g., Levang and Schmitt 2015; Nummelin et al. 2017; Irving et al. 2019) and changes in ocean salinity can be used to infer global water cycle changes (Skliris et al. 2016), but only if there is approximate closure of the relevant global budgets. If model leakage causes a substantial mismatch between changes in globally integrated OHC and the time-integrated net ocean heat flux, for instance, then any inferred change in meridional ocean heat transport is invalid. If dedrifting does not restore budget closure for any particular model, then that model may need to be excluded from the analysis ensemble (e.g., Palmer and McNeall 2014; Irving et al. 2019).
In this study, we extend the physically based approach to drift analysis used by Hobbs et al. (2016) by considering both energy and mass conservation in the CMIP6 ensemble (Eyring et al. 2016) before and after dedrifting. Relevant comparisons are made with the CMIP5 ensemble (Taylor et al. 2012) in order to report on progress/improvements.
2. Methods
To assess drift in the CMIP6 and CMIP5 ensembles, we analyze data from the preindustrial control (piControl) experiment. For each model, the drift in globally integrated OHC is decomposed into a temperature and barystatic (mass related; Gregory et al. 2013) component. This decomposition provides insights into the cause of the drift, and the temperature component can be compared against the time-integrated heat flux into the ocean to assess energy conservation. To assess ocean mass and salt conservation, we compare the global ocean mass to the time-integrated surface freshwater flux and global mean salinity, respectively (remembering that the ocean integrated salt content should be constant). Similarly, atmospheric mass conservation is assessed by comparing the global mass of water in the atmosphere to the time-integrated moisture flux into the atmosphere (i.e., evaporation minus precipitation). Each quantity in the OHC decomposition and mass and energy conservation analysis is derived/defined below.
a. Ocean heat decomposition
b. Conservation
The drifts in global oceanic mass, atmospheric mass and OHC are approximately linear (e.g., Figs. 1 and 2 ), so the time derivatives defined above were calculated as a simple linear trend (using ordinary least squares regression) over the length of the control simulation. Any significant residual in Eqs. (3), (4), or (5) indicates a spurious source/sink of heat or mass in the simulated climate system, which we refer to as model leakage.
To put the magnitude of the model drifts into perspective, we compare them to estimates of current observed trends. For the global energy budget, we compare against estimates of the planetary energy imbalance, which range from 0.4 to 1.0 W m−2 for various estimation methods and time periods over the last couple of decades (Meyssignac et al. 2019; von Schuckmann et al. 2020). This comparison is achieved by dividing the model energy drift by the planetary surface area of 5.1 × 1014 m2. For the ocean mass budget, we compare against the current barystatic sea level rise of +1.8 mm yr−1 (or approximately 6.6 × 1014 kg yr−1). This value represents 58% of the estimated total (i.e., steric plus barystatic) global sea level rise during the altimetry era (3.1 mm yr−1 from 1993 to the present), as per the findings of WCRP Global Sea Level Budget Group (2018). Finally, for the atmospheric mass budget we compare against a constant relative-humidity warming rate of 1.68 × 1013 kg yr−1. This value represents the Clausius–Clapeyron response of 7% °C−1 to a trend in global average surface temperature of approximately 0.2°C decade−1 over the 1990–2019 period (from the NOAA Merged Land Ocean Global Surface Temperature Analysis, version 5; Zhang et al. 2019) for an approximate average mass of water vapor in the CMIP atmospheres of 1.2 × 1016 kg.
c. Model diagnostics
Each of the variables discussed in the equations above (Table 1) can be related to a CMIP diagnostic(s) (Table 2). Detailed definitions for each diagnostic are available from the CMIP5 standard output (PCMDI 2013) and CMIP6 data request (WGCM Infrastructure Panel 2017) documentation, with additional information regarding ocean diagnostics provided by Griffies et al. (2016). Tables S1–S4 provide precise details of exactly which diagnostics and data file versions were used for each model in this study. We note that none of the models for which we present ocean surface heat or water flux results archived a heat flux correction (hfcorr) or water flux correction (wfcorr) diagnostic, respectively.
Variable definitions. For models where cpocean and/or rhozero were not provided, default values of 4000 J (kg K)−1 and 1026 kg m−3 were used. See Table 2 for more details on the CMIP diagnostics.
CMIP diagnostics used in this study. The evaporation (evspsbl) diagnostic includes transpiration and sublimation, while precipitation (pr) includes liquid and solid phases from all types of clouds; TOA = top of atmosphere.
Identifying the correct diagnostics for use in this study was mostly straightforward, except in the case of the global ocean mass. Almost all of the CMIP6 and CMIP5 ocean models apply a Boussinesq approximation, which means volume is conserved rather than mass (and seawater density is only considered in so far as it influences ocean dynamics). As such, steric processes (i.e., contraction/expansion of seawater due to temperature and/or salinity change) are represented as a change in density, from which an implied change in mass is often inferred and reported by modeling groups (the so-called Boussinesq ocean mass), as opposed to the real world where temperature and/or freshwater input leads to direct changes in ocean volume. To avoid any confusion, Boussinesq models in CMIP6 were asked to archive a global ocean mass variable (masso) equal to the reference density (rhozero) multiplied by the ocean volume (volo), as opposed to the Boussinesq ocean mass (Griffies et al. 2016). A small number of modeling groups did not follow this direction (and it was not a requirement for Boussinesq models in CMIP5), so for those models we performed the density-times-volume calculation in order to obtain the variable M used in the equations above. All models for which a global ocean mass time series could be constructed were included in the final ensemble (Table 3). The small number of (mostly CMIP5) models that archived a virtual salt flux diagnostic were left out of the ensemble, as it is not clear from the CMIP documentation how those fluxes impact/modify the global ocean mass, salinity, and surface water flux diagnostics. Monthly mean data were converted to annual mean (accounting for the different number of days in each month) prior to analysis and results for only the first member from each model ensemble is presented, because all ensemble members from a given model tended to produce similar results.
Models used in the study. The first ensemble member was used for all models, which was r1i1p1 for CMIP5 and typically r1i1p1f1 for CMIP6 (an asterisk indicates r1i1p1f2 was the first member). Citations are for the piControl experiment dataset.
The change in the definition of the global ocean mass diagnostic for CMIP6 means that for models that apply a Boussinesq approximation (which is almost all the models), neither the global mass nor volume diagnostics respond to steric processes—they both only respond to barystatic changes. It is possible to derive some steric information from the global average thermosteric sea level change diagnostic (zostoga), however, another new development in CMIP6 is that the full steric sea level change (zossga) is not archived. That diagnostic would incorporate thermosteric changes, halosteric changes and the so-called non-Boussinesq steric effect, which relates to reorganization of ocean mass (Griffies and Greatbatch 2012). In the absence of any diagnostic that fully captures steric changes, our analysis does not consider changes to the volume of the ocean.
3. Results
a. Example model
To illustrate the various aspects of our analysis, the results for a typical model (ACCESS-CM2; Bi et al. 2020) are shown in Fig. 1. The first thing to note is the clear drift/nonzero trend in OHC (black curve, Fig. 1a). If the model were energy conserving, the time series corresponding to the OHC temperature component anomaly (red curve), time-integrated ocean surface heat flux (orange curve) and time-integrated netTOA (gold curve) would approximately overlay one another, as per Eqs. (3) and (7). To put the magnitude of these drifts into perspective, the linear trend in those time series is 0.18, 0.02, and 0.37 W m−2, respectively. These values (and the leakage of approximately 0.19 W m−2 between the TOA and ocean storage) are trivial compared to the corresponding climatological energy flows in the climate system but are not an insignificant fraction of the anthropogenic signal (i.e., the current planetary energy imbalance of 0.4–1.0 W m−2).
Similar principles apply for the ocean mass budget (Fig. 1b). The time series corresponding to the ocean mass anomaly (blue curve) and time-integrated freshwater flux (gray curve) approximately overlay one another, indicating approximate water conservation. The corresponding linear trend is equivalent to a drop in global sea level of 0.2 mm yr−1, which is trivial compared to individual surface freshwater fluxes (e.g., the annual precipitation or evaporation flux) but is not an insignificant fraction of the estimated current rate of barystatic sea level rise (1.8 mm yr−1). Global mean salinity has been converted to an equivalent change in ocean mass [as per Eq. (8); green curve] and it also approximately overlays the ocean mass time series, indicating approximate salt conservation. Finally, it is clear that the atmosphere does not conserve water (Fig. 1c). The drift in the mass of atmospheric water vapor is negligible (linear trend of 3.2 × 1011 kg yr−1), but the drift in time-integrated water flux into the atmosphere (i.e., evaporation minus precipitation; −1.8 × 1014 kg yr−1) is not. While trivial compared to the individual annual fluxes of precipitation or evaporation, the magnitude of the drift in time-integrated atmospheric water flux is larger than our estimated observed trend in atmospheric water vapor (+1.68 × 1013 kg yr−1) and represents a loss of approximately 1.5% of total atmospheric water vapor every year.
Given that the ACCESS-CM2 model does not conserve energy and atmospheric mass, it is important for data users to know whether conservation can be achieved after dedrifting. To test this, we quantify the drift signal by fitting a cubic polynomial to the full-length of various time series shown in Figs. 1a–c. That signal is then subtracted from the original time series in order to produce corresponding dedrifted time series (Figs. 1d–f). Approximate conservation is achieved for the energy and ocean mass budget after drift removal, but the atmospheric moisture budget time series still do not overlay one another. In a practical sense, this means that after dedrifting the mass and heat content of the global ocean responds appropriately to time-integrated changes in surface heat and freshwater fluxes, whereas the mass of water vapor in the atmosphere does not respond in a physically consistent manner to time-integrated changes in precipitation and evaporation. This is problematic for data users looking to infer anomalous atmospheric moisture transports (for instance) from regional changes in water vapor and evaporation minus precipitation. With this description of an example model in mind, we can expand our analysis to the entire CMIP6 (and CMIP5) ensemble.
b. Drift and conservation
1) Temporal evolution
We begin our description of the CMIP6 ensemble by considering the temporal evolution of the drift in globally integrated ocean mass and heat content (drift in atmospheric water vapor is negligible and thus not shown). Drift in both quantities is overwhelmingly characterized by linear trends that are relatively constant throughout the length of the control experiment (Figs. 2a,c). To visualize any coherent drift signals other than the linear trends, detrended OHC and ocean mass time series were calculated (Figs. 2b,d). The removed trend was estimated using ordinary least squares regression on the annual mean time series. For most models, removal of the linear trends transforms the time series into stationary red noise, which is the expected regime under an equilibrium climate. However, some of the models show clear coherent signals, particularly in OHC. These signals could represent low-frequency oscillations that are cut off by the control run length (i.e., multicentury variability in the models), but most appear to be an asymptotic progression to some stable “red noise plus trend” state that is more indicative of incomplete spinup (Lucarini and Ragone 2011). Of course, there is no way of testing this hypothesis unless the control simulation is run for long enough that either the second-order trend becomes zero (indicating the arrival at a stable state) or reverses (indicating oscillatory behavior).
2) Energy budget
Since the ocean is the biggest energy reservoir in the climate system, we anchor our energy budget analysis around the drift in OHC. Similar to ACCESS-CM2 (Fig. 1a), the drift in OHC is dominated by the temperature (as opposed to barystatic) component for essentially all models (Fig. 3). The direction of that drift has a positive bias across the ensemble, which was also true for the CMIP3 ensemble (Sen Gupta et al. 2013). This is important because it means the drifts will not cancel in the calculation of an ensemble mean. While there are fewer outliers in CMIP6, the ensemble median magnitude of the drift in OHC is similar for CMIP5 and CMIP6 (Table 4).
Drift in the CMIP5 and CMIP6 ensembles. The ensemble median [interquartile range] drift magnitude, calculated as the linear trend over the full length of the piControl experiment, is shown. Bold values indicate where drift in one of the CMIP projects is clearly smaller than the other (defined as a median drift magnitude at least 50% smaller). Drift in ocean salinity was calculated by first converting to an equivalent change in ocean mass [as per Eq. (8)].
Drift in OHC tends to be much smaller than for time-integrated netTOA, indicating a net leakage of energy in the simulated climate system (Fig. 4a). In fact, while drift in OHC is typically a small but nonnegligible fraction of the current planetary energy imbalance, the drift in time-integrated netTOA (and indeed the net systemwide energy leakage; Fig. 5) is larger than the observed planetary imbalance for a number of models. Most of this leakage occurs somewhere between the TOA and ocean surface, as ocean energy leakage (i.e., the discrepancy between the time-integrated heat flux into the ocean and change in OHC temperature component; Fig. 4b) is relatively modest. Similar to OHC, the ensemble median magnitude of the drift in time-integrated heat flux into the ocean has changed very little from CMIP5 to CMIP6. In contrast, the magnitude of the drift in time-integrated netTOA is substantially smaller in CMIP6, which explains the reduced total system energy leakage in CMIP6 (Table 4).
3) Mass budget
Drift in the ocean mass budget shares many similarities with the energy budget. First, like drift in OHC, the magnitude of drift in global ocean mass typically represents a small but nonnegligible fraction of observed trends (Fig. 4c) and has changed very little from CMIP5 to CMIP6 (Table 4). Drift in time-integrated surface freshwater flux on the other hand is larger than observed sea level trends for a number of models (Fig. 4c), indicating substantial nonclosure of the ocean mass budget. The ensemble median magnitude of the drift in freshwater flux is larger/worse in CMIP6, due in part to a number of large outliers (Table 4). Many models do a relatively good job of conserving salt (Fig. 4d) and the magnitude of the drift in ocean salinity has been reduced in CMIP6 (Table 4).
Given that atmospheric water vapor is not directly linked to the slowly evolving deep ocean, it is perhaps not surprising that the ensemble median drift magnitude (Table 4) represents a negligible fraction of estimated current trends (i.e., atmospheric variables tend not to exhibit much drift). The same cannot be said for the time-integrated moisture flux into the atmosphere (i.e., evaporation minus precipitation), which for most models is larger than estimated current trends in atmospheric water vapor (Fig. 4e). In fact, for many models the gain or loss of water associated with the drift in time-integrated moisture flux represents an appreciable fraction of the total mass of atmospheric water vapor (1.2 × 1016 kg) every year. In the CMIP3 ensemble, the drift in time-integrated atmospheric moisture flux was overwhelmingly negative (i.e., precipitation dominated over evaporation for most models; Liepert and Previdi 2012) but the CMIP5 (Liepert and Lo 2013) and CMIP6 models are relatively evenly distributed between positive and negative drifts (Fig. 4e). As was the case for the freshwater flux into the ocean, the ensemble median magnitude of the drift in time-integrated atmospheric moisture flux is larger in CMIP6 than it was in CMIP5 (Table 4).
c. Dedrifting
With the exception of atmospheric water vapor, we have shown that the magnitude of the drift in various global energy and mass budget terms typically represents a nonnegligible fraction of estimated current observed trends (OHC, ocean mass, and time-integrated ocean heat flux) or approaches/exceeds the magnitude of those trends (time-integrated netTOA, ocean freshwater flux, and atmospheric moisture flux). To avoid contamination of analyzed trends it is therefore important to quantify and remove this drift from forced experiments, particularly as the direction of the drift is biased for some variables (e.g., Fig. 3) and thus will not cancel when calculating ensemble statistics. Since the temporal evolution of these drifts is quasi-linear (with slight curvature likely related to incomplete spinup; Fig. 2) and insensitive to changes in model forcing (Hobbs et al. 2016), this can be achieved by fitting a simple polynomial (we fit a cubic, although a linear or quadratic fit yields similar results) to the control experiment and then subtracting the relevant segment of that polynomial from the forced data.
An additional motivation for dedrifting relates to budget closure. We saw earlier that approximate energy and ocean mass budget closure was achieved after dedrifting for the ACCESS-CM2 model (Figs. 1d,e), but nonclosure of the atmospheric water budget remained (Fig. 1f). To extend this budget closure analysis to the entire ensemble, we regress the various (decadal mean) dedrifted time series against one another to test for corresponding variability (Fig. 6). For reference, the ACCESS-CM2 linear regression coefficients were 0.99 (Qr vs Qh), 0.98 (Qr vs HT), and 0.98 (Qh vs HT) for the dedrifted energy budget time series (Fig. 1d); 0.89 (M vs S), 1.02 (Qm vs M), and 0.91 (Qm vs S) for the ocean mass budget time series (Fig. 1e); but only 0.39 (Ma vs Qep) for the atmospheric water budget (Fig. 1f).
Looking at the regression coefficients across the ensemble, it is clear that like ACCESS-CM2 most CMIP6 models show approximate energy budget closure after dedrifting (i.e., regression coefficients close to 1.0; Fig. 6a). Most CMIP5 models also achieve approximate energy budget closure, but there has been a small improvement between CMIP5 and CMIP6. Energy budget coefficients slightly less than 1.0 were common across the ensemble because the variance in the dedrifted time-integrated netTOA time series was typically marginally larger than the time-integrated heat flux into the ocean time series, which had a variance marginally larger than the OHC time series. The larger netTOA variance might be explained by the additional nonocean heat stores represented in the models (e.g., continental energy storage; Cuesta-Valero et al. 2016), but it is unclear why the time-integrated heat flux into the ocean would have a slightly larger variance than the OHC. In other words, while perfect/expected closure would normally be a regression coefficient of 1.0, for the comparisons against the time-integrated netTOA the expected coefficient might be slightly less than 1.0. See Fig. S1 for energy budget regression coefficients for individual models.
In contrast to the energy budget, even after drift correction there remain large discrepancies between the time-integrated surface freshwater flux and both ocean mass and salinity for a number of models (Fig. 6b). The ensemble median closure has improved from CMIP5 to CMIP6, but aside from the approximate closure between ocean mass and salinity the ocean mass budget closure across the CMIP6 ensemble falls short of that achieved for the energy budget (see Fig. S2 for mass budget regression coefficients for individual models). Closure of the atmospheric mass budget after drift correction also eludes the models, with many showing essentially no meaningful relationship between variability in the dedrifted atmospheric water vapor and time-integrated moisture flux time series (Fig. 6b).
4. Discussion
In the early coupled ocean–atmosphere models, drift was so large that it was necessary to constrain simulations via the use of offline flux adjustments (e.g., Fanning and Weaver 1997). This was still the case for most models participating in the first phase of CMIP (Covey et al. 2003), but with each subsequent CMIP iteration model drift improved to the point where no flux adjustment was required for most CMIP5 models to achieve drifts in global, depth-integrated quantities (e.g., OHC or steric sea level) of magnitude less than corresponding forced historical trends (Sen Gupta et al. 2013). Our analysis suggests that when it comes to globally integrated OHC, there has been little improvement from CMIP5 to CMIP6 (fewer outliers, but a similar ensemble median magnitude). This indicates that model drift still represents a nonnegligible fraction of historical forced trends in global, depth-integrated quantities and existing advice regarding the need to dedrift data from forced experiments still applies (Sen Gupta et al. 2013).
To better understand the component of model drift related to nonclosure of the global energy and mass budgets, we compare drift in ocean state variables (global ocean mass, salinity, and heat content) with accumulated heat and freshwater fluxes at the ocean surface and TOA. We find that drift in OHC is typically much smaller than in time-integrated netTOA, indicating a leakage of energy in the simulated climate system. Most of this energy leakage occurs somewhere between the TOA and ocean surface and has improved (i.e., it has a reduced ensemble median magnitude) from CMIP5 to CMIP6 due to reduced drift in time-integrated netTOA. To put these drifts and leaks into perspective, the time-integrated netTOA and systemwide energy leakage approaches or exceeds the estimated current planetary imbalance for a number of models.
A similar story is true for ocean mass conservation. Drift in ocean mass is typically a small but nonnegligible fraction of observed trends in barystatic sea level, while the time-integrated freshwater flux is typically much larger and approaches/exceeds the magnitude of recent observed trends for some models. Unlike the global energy budget, the ensemble median drift magnitude in time-integrated ocean freshwater flux is worse for CMIP6 than it was for CMIP5. In contrast, most models do a relatively good job of conserving salt and the drift in ocean salinity is reduced/improved in CMIP6. Given the importance of modeling and understanding changes in the global water cycle, we also consider the atmospheric mass budget. While drift in the global mass of atmospheric water vapor is negligible relative to estimated current trends, the drift in time-integrated moisture flux into the atmosphere (i.e., evaporation minus precipitation) and the consequent nonclosure of the atmospheric moisture budget is relatively large (and worse for CMIP6), approaching/exceeding the magnitude of current trends for many models.
The causes of the energy and mass leaks we identify are many and varied but must essentially belong to one of two categories. The first relates to deficiencies in model coupling, numerical schemes and/or physical processes. For example, the heat flux associated with water transport across the ocean boundary generally represents a global net heat loss for the ocean, because evaporation transfers water away at a temperature typically higher than precipitation adds water. The documented size of this global heat loss ranges from 0.15 (Delworth and Dixon 2006) to 0.30 W m−2 (Griffies et al. 2014). In a steady state, this heat loss due to advective mass transfer is compensated by ocean mass and heat transport, which is in turn balanced by atmospheric transport. However, most atmospheric models do not account for the heat content of their moisture field, meaning they represent the moisture mass transport but not the heat content transport (Griffies et al. 2016). Leakage in the simulated global heat budget therefore arises due to a basic limitation of the modeled atmospheric thermodynamics.
The second category has nothing to do with deficiencies of the model itself and instead relates to potential issues with the data that are archived and made available to the research community. For example, in discussions about ocean heat budget closure with people familiar with the ACCESS-CM2 model (R. Holmes 2020, personal communication), it was discovered that the discrepancy between the OHC temperature component anomaly (Fig. 1a, red curve) and time-integrated ocean surface heat flux (Fig. 1a, orange curve) could be explained by a minor mistake in the construction of the ocean surface heat flux CMIP diagnostic (hfds; Table 2). In particular, the hfds diagnostic was missing contributions from the heat flux into the ocean associated with sea ice–ocean volume exchanges and frozen precipitation as well as the effects of frazil ice formation below the surface layer of the model. When these terms are correctly included in hfds, there is closure between the OHC temperature component and time-integrated ocean surface heat flux. Given the high level of model-specific knowledge (and access to data) required to precisely diagnose the cause of an apparent energy leak like this, a detailed examination of the underlying causes of nonconservation across the CMIP6 ensemble would be a difficult undertaking (and is beyond the scope of this study). A detailed assessment of energy and mass conservation is therefore best undertaken by the relevant modeling groups.
While the causes of nonconservation are of interest to model developers, for CMIP data users it is more important to know whether closure of global energy and mass budgets can be achieved after dedrifting. In other words, does the state of a reservoir like the ocean (i.e., its mass and heat content) respond appropriately to a time-integrated change in boundary heat and water fluxes once drift has been removed? In this regard, we find that almost all CMIP5 and CMIP6 models achieve approximate energy budget closure between the time-integrated netTOA flux, time-integrated ocean heat flux and OHC after dedrifting, whereas a number of models do not achieve ocean mass budget closure. The situation is even worse for the atmospheric water budget, with no models showing a strong relationship between variability in the global mass of atmospheric water vapor and time-integrated moisture fluxes into the atmosphere after dedrifting. In the case of the global energy and ocean mass budgets, CMIP6 closure represents an improvement over CMIP5. It appears that while progress in reducing the magnitude of global energy and ocean mass drifts is something of a mixed bag, the physical consistency between variations in surface fluxes and the ocean state after dedrifting has improved across the ensemble.
Acknowledgments
This study was supported by the Centre for Southern Hemisphere Oceans Research (CSHOR), jointly funded by the Qingdao National Laboratory for Marine Science and Technology (QNLM, China) and the Commonwealth Scientific and Industrial Research Organisation (CSIRO, Australia), and the Australian Research Council’s Discovery Project funding scheme (project DP190101173). We acknowledge the World Climate Research Programme, which, through its Working Group on Coupled Modelling, coordinated and promoted CMIP6. We thank the climate modeling groups for producing and making available their model output, the Earth System Grid Federation (ESGF) for archiving the data and providing access, and the multiple funding agencies who support CMIP6 and ESGF. We also thank Ryan Holmes, Angeline Pendergrass, and Martin Dix for their insightful comments on drafts of this work.
Data availability statement
The CMIP5 and CMIP6 model output used in this study is publicly available through a distributed data archive developed and operated by the Earth System Grid Federation (ESGF). The citation webpage for each unique model dataset (Table 3) provides a link to access the data from the relevant ESGF node. Following established best practices for reproducible computational research in the weather and climate sciences (Irving 2016), the code we wrote to analyze those data has been uploaded to a Figshare repository (Irving 2020) along with details of the associated software environment and data processing steps for each figure we present.
REFERENCES
AORI, NIES, and JAMSTEC, 2015: MIROC4h model output prepared for CMIP5 piControl, served by ESGF. World Data Center for Climate, DKRZ, accessed 9 September 2020, https://doi.org/10.1594/WDCC/CMIP5.MIMHPC.
Bader, D. C., R. Leung, M. Taylor, and R. B. McCoy, 2018: E3SM-Project E3SM1.0 model output prepared for CMIP6 CMIP piControl. Earth System Grid Federation, accessed 9 September 2020, https://doi.org/10.22033/ESGF/CMIP6.4499.
Bader, D. C., R. Leung, M. Taylor, and R. B. McCoy, 2019: E3SM-Project E3SM1.1 model output prepared for CMIP6 CMIP piControl. Earth System Grid Federation, accessed 9 September 2020, https://doi.org/10.22033/ESGF/CMIP6.11489.
Bentsen, M., and Coauthors, 2011: CMIP5 output1 NCC NorESM1-M piControl, served by ESGF. World Data Center for Climate, DKRZ, accessed 9 September 2020, https://doi.org/10.1594/WDCC/CMIP5.NCCNMpc.
Bi, D., and Coauthors, 2016a: ACCESS1-0 model output prepared for CMIP5 piControl, served by ESGF. World Data Center for Climate, DKRZ, accessed 9 September 2020, https://doi.org/10.1594/WDCC/CMIP5.CSA0pc.
Bi, D., and Coauthors, 2016b: ACCESS1-3 model output prepared for CMIP5 piControl, served by ESGF. World Data Center for Climate, DKRZ, accessed 9 September 2020, https://doi.org/10.1594/WDCC/CMIP5.CSA3pc.
Bi, D., and Coauthors, 2020: Configuration and spin-up of ACCESS-CM2, the new generation Australian Community Climate and Earth System Simulator Coupled Model. J. South. Hemisphere Earth Syst. Sci., 70, 225–251, https://doi.org/10.1071/ES19040.
Bilbao, R. A. F., J. M. Gregory, N. Bouttes, M. D. Palmer, and P. Stott, 2019: Attribution of ocean temperature change to anthropogenic and natural forcings using the temporal, vertical and geographical structure. Climate Dyn., 53, 5389–5413, https://doi.org/10.1007/s00382-019-04910-1.
Boucher, O., S. Denvil, A. Caubel, and M. A. Foujols, 2018: IPSL IPSL-CM6A-LR model output prepared for CMIP6 CMIP piControl. Earth System Grid Federation, accessed 9 September 2020, https://doi.org/10.22033/ESGF/CMIP6.5251.
Caubel, A., and Coauthors, 2016: IPSL-CM5A-LR model output prepared for CMIP5 piControl experiment, served by ESGF. World Data Center for Climate, DKRZ, accessed 9 September 2020, https://doi.org/10.1594/WDCC/CMIP5.IPILpc.
CMCC, 2013a: CMCC-CESM model output prepared for CMIP5 pre-industrial control, served by ESGF. World Data Center for Climate, DKRZ, accessed 9 September 2020, https://doi.org/10.1594/WDCC/CMIP5.CMC1pc.
CMCC, 2013b: CMCC-CMS model output prepared for CMIP5 pre-industrial control, served by ESGF. World Data Center for Climate, DKRZ, accessed 9 September 2020, https://doi.org/10.1594/WDCC/CMIP5.CMC2pc.
Covey, C., K. M. AchutaRao, U. Cubasch, P. Jones, S. J. Lambert, M. E. Mann, T. J. Phillips, and K. E. Taylor, 2003: An overview of results from the Coupled Model Intercomparison Project. Global Planet. Change, 37, 103–133, https://doi.org/10.1016/S0921-8181(02)00193-5.
Covey, C., P. J. Gleckler, T. J. Phillips, and D. C. Bader, 2006: Secular trends and climate drift in coupled ocean-atmosphere general circulation models. J. Geophys. Res., 111, D03107, https://doi.org/10.1029/2005JD006009.
Cuesta-Valero, F. J., A. García-García, H. Beltrami, and J. E. Smerdon, 2016: First assessment of continental energy storage in CMIP5 simulations. Geophys. Res. Lett., 43, 5326–5335, https://doi.org/10.1002/2016GL068496.
Delworth, T. L., and K. W. Dixon, 2006: Have anthropogenic aerosols delayed a greenhouse gas-induced weakening of the North Atlantic thermohaline circulation? Geophys. Res. Lett., 33, L02606, https://doi.org/10.1029/2005GL024980.
Dix, M., and Coauthors, 2019: CSIRO-ARCCSS ACCESS-CM2 model output prepared for CMIP6 CMIP piControl. Earth System Grid Federation, accessed 9 September 2020, https://doi.org/10.22033/ESGF/CMIP6.4311.
Dunne, J., and Coauthors, 2014a: NOAA GFDL GFDL-ESM2G, piControl experiment output for CMIP5 AR5, served by ESGF. World Data Center for Climate, DKRZ, accessed 9 September 2020, https://doi.org/10.1594/WDCC/CMIP5.NGEGpc.
Dunne, J., and Coauthors, 2014b: NOAA GFDL GFDL-ESM2M, piControl experiment output for CMIP5 AR5, served by ESGF. World Data Center for Climate, DKRZ, accessed 9 September 2020, https://doi.org/10.1594/WDCC/CMIP5.NGEMpc.
EC-Earth Consortium, 2019a: EC-Earth-Consortium EC-Earth3 model output prepared for CMIP6 CMIP piControl. Earth System Grid Federation, accessed 9 September 2020, https://doi.org/10.22033/ESGF/CMIP6.4842.
EC-Earth Consortium, 2019b: EC-Earth-Consortium EC-Earth3-Veg model output prepared for CMIP6 CMIP piControl. Earth System Grid Federation, accessed 9 September 2020, https://doi.org/10.22033/ESGF/CMIP6.4848.
Eyring, V., S. Bony, G. A. Meehl, C. A. Senior, B. Stevens, R. J. Stouffer, and K. E. Taylor, 2016: Overview of the Coupled Model Intercomparison Project Phase 6 (CMIP6) experimental design and organization. Geosci. Model Dev., 9, 1937–1958, https://doi.org/10.5194/gmd-9-1937-2016.
Fairhead, L., and Coauthors, 2016: IPSL-CM5B-LR model output prepared for CMIP5 piControl experiment, served by ESGF. World Data Center for Climate, DKRZ, accessed 9 September 2020, https://doi.org/10.1594/WDCC/CMIP5.IPIBpc.
Fanning, A. F., and A. J. Weaver, 1997: On the role of flux adjustments in an idealized coupled climate model. Climate Dyn., 13, 691–701, https://doi.org/10.1007/s003820050191.
Foujols, M. A., and Coauthors, 2016: IPSL-CM5A-MR model output prepared for CMIP5 piControl experiment, served by ESGF. World Data Center for Climate, DKRZ, accessed 9 September 2020, https://doi.org/10.1594/WDCC/CMIP5.IPIMpc.
Giorgetta, M., and Coauthors, 2012a: CMIP5 simulations of the Max Planck Institute for Meteorology (MPI-M) based on the MPI-ESM-LR model: The piControl experiment, served by ESGF. World Data Center for Climate, DKRZ, accessed 9 September 2020, https://doi.org/10.1594/WDCC/CMIP5.MXELpc.
Giorgetta, M., and Coauthors, 2012b: CMIP5 simulations of the Max Planck Institute for Meteorology (MPI-M) based on the MPI-ESM-MR model: The piControl experiment, served by ESGF. World Data Center for Climate, DKRZ, accessed 9 September 2020, https://doi.org/10.1594/WDCC/CMIP5.MXMRpc.
Gregory, J. M., and Coauthors, 2013: Twentieth-century global-mean sea level rise: Is the whole greater than the sum of the parts? J. Climate, 26, 4476–4499, https://doi.org/10.1175/JCLI-D-12-00319.1.
Griffies, S. M., and R. J. Greatbatch, 2012: Physical processes that impact the evolution of global mean sea level in ocean climate models. Ocean Modell., 51, 37–72, https://doi.org/10.1016/j.ocemod.2012.04.003.
Griffies, S. M., and Coauthors, 2014: An assessment of global and regional sea level for years 1993–2007 in a suite of interannual CORE-II simulations. Ocean Modell., 78, 35–89, https://doi.org/10.1016/j.ocemod.2014.03.004.
Griffies, S. M., and Coauthors, 2016: OMIP contribution to CMIP6: Experimental and diagnostic protocol for the physical component of the Ocean Model Intercomparison Project. Geosci. Model Dev., 9, 3231–3296, https://doi.org/10.5194/gmd-9-3231-2016.
Guo, H., and Coauthors, 2018: NOAA-GFDL GFDL-CM4 piControl model output. Earth System Grid Federation, accessed 9 September 2020, https://doi.org/10.22033/ESGF/CMIP6.8666.
Hobbs, W., M. D. Palmer, and D. Monselesan, 2016: An energy conservation analysis of ocean drift in the CMIP5 global coupled models. J. Climate, 29, 1639–1653, https://doi.org/10.1175/JCLI-D-15-0477.1.
Horowitz, L., and Coauthors, 2014: NOAA GFDL GFDL-CM3, piControl experiment output for CMIP5 AR5, served by ESGF. World Data Center for Climate, DKRZ, accessed 9 September 2020, https://doi.org/10.1594/WDCC/CMIP5.NGG3pc.
Irving, D., 2016: A minimum standard for publishing computational results in the weather and climate sciences. Bull. Amer. Meteor. Soc., 97, 1149–1158, https://doi.org/10.1175/BAMS-D-15-00010.1.
Irving, D., 2020: A mass and energy conservation analysis of drift in the CMIP6 ensemble: Supplementary metadata. Figshare, accessed 9 September 2020, https://doi.org/10.6084/m9.figshare.12143562.
Irving, D., S. Wijffels, and J. Church, 2019: Anthropogenic aerosols, greenhouse gases, and the uptake, transport, and storage of excess heat in the climate system. Geophys. Res. Lett., 46, 4894–4903, https://doi.org/10.1029/2019GL082015.
JAMSTEC, AORI, and NIES, 2015a: MIROC-ESM-CHEM model output prepared for CMIP5 piControl, served by ESGF. World Data Center for Climate, DKRZ, accessed 9 September 2020, https://doi.org/10.1594/WDCC/CMIP5.MIM7pc.
JAMSTEC, AORI, and NIES, 2015b: MIROC-ESM model output prepared for CMIP5 piControl, served by ESGF. World Data Center for Climate, DKRZ, accessed 9 September 2020, https://doi.org/10.1594/WDCC/CMIP5.MIMEpc.
Ji, D., L. Wang, J. Feng, Q. Wu, and H. Cheng, 2015: BNU-ESM model output prepared for CMIP5 piControl experiment, served by ESGF. World Data Center for Climate, DKRZ, accessed 9 September 2020, https://doi.org/10.1594/WDCC/CMIP5.BUBUpc.
Jungclaus, J., and Coauthors, 2012: CMIP5 simulations of the Max Planck Institute for Meteorology (MPI-M) based on the MPI-ESM-P model: The piControl experiment, served by ESGF. World Data Center for Climate, DKRZ, accessed 9 September 2020, https://doi.org/10.1594/WDCC/CMIP5.MXEPpc.
Jungclaus, J., and Coauthors, 2019: MPI-M MPI-ESM1.2-HR model output prepared for CMIP6 CMIP piControl. Earth System Grid Federation, accessed 9 September 2020, https://doi.org/10.22033/ESGF/CMIP6.6674.
Levang, S. J., and R. W. Schmitt, 2015: Centennial changes of the global water cycle in CMIP5 models. J. Climate, 28, 6489–6502, https://doi.org/10.1175/JCLI-D-15-0143.1.
Liepert, B. G., and M. Previdi, 2012: Inter-model variability and biases of the global water cycle in CMIP3 coupled climate models. Environ. Res. Lett., 7, 014006, https://doi.org/10.1088/1748-9326/7/1/014006.
Liepert, B. G., and F. Lo, 2013: CMIP5 update of ‘Inter-model variability and biases of the global water cycle in CMIP3 coupled climate models.’ Environ. Res. Lett., 8, 029401, https://doi.org/10.1088/1748-9326/8/2/029401.
Lucarini, V., and F. Ragone, 2011: Energetics of climate models: Net energy balance and meridional enthalpy transport. Rev. Geophys., 49, RG1001, https://doi.org/10.1029/2009RG000323.
Meyssignac, B., and Coauthors, 2019: Measuring global ocean heat content to estimate the Earth energy imbalance. Front. Mar. Sci., 6, 432, https://doi.org/10.3389/fmars.2019.00432.
NASA GISS, 2018: NASA-GISS GISS-E2.1g model output prepared for CMIP6 CMIP piControl. Earth System Grid Federation, accessed 9 September 2020, https://doi.org/10.22033/ESGF/CMIP6.7380.
NASA GISS, 2019: NASA-GISS GISS-E2-1-G-CC model output prepared for CMIP6 CMIP piControl. Earth System Grid Federation, accessed 9 September 2020, https://doi.org/10.22033/ESGF/CMIP6.11856.
Neubauer, D., and Coauthors, 2019: HAMMOZ-Consortium MPI-ESM1.2-HAM model output prepared for CMIP6 CMIP piControl. Earth System Grid Federation, accessed 9 September 2020, https://doi.org/10.22033/ESGF/CMIP6.5037.
Nummelin, A., C. Li, and P. J. Hezel, 2017: Connecting ocean heat transport changes from the midlatitudes to the Arctic Ocean. Geophys. Res. Lett., 44, 1899–1908, https://doi.org/10.1002/2016GL071333.
Palmer, M. D., and D. J. McNeall, 2014: Internal variability of Earth’s energy budget simulated by CMIP5 climate models. Environ. Res. Lett., 9, 034016, https://doi.org/10.1088/1748-9326/9/3/034016.
Palmer, M. D., D. J. McNeall, and N. J. Dunstone, 2011: Importance of the deep ocean for estimating decadal changes in Earth’s radiation balance. Geophys. Res. Lett., 38, L13707, https://doi.org/10.1029/2011GL047835.
Palmer, M. D., G. R. Harris, and J. M. Gregory, 2018: Extending CMIP5 projections of global mean temperature change and sea level rise due to thermal expansion using a physically-based emulator. Environ. Res. Lett., 13, 084003, https://doi.org/10.1088/1748-9326/aad2e4.
PCMDI, 2013: CMIP5 standard output. Lawrence Livermore National Laboratory, accessed 8 April 2020, https://pcmdi.llnl.gov/mips/cmip5/requirements.html.
Peacock, S., and M. Maltrud, 2006: Transit-time distributions in a global ocean model. J. Phys. Oceanogr., 36, 474–495, https://doi.org/10.1175/JPO2860.1.
Rahmstorf, S., 1995: Climate drift in an ocean model coupled to a simple, perfectly matched atmosphere. Climate Dyn., 11, 447–458, https://doi.org/10.1007/BF00207194.
Ridley, J., M. Menary, T. Kuhlbrodt, M. Andrews, and T. Andrews, 2018: MOHC HadGEM3-GC31-LL model output prepared for CMIP6 CMIP piControl. Earth System Grid Federation, accessed 9 September 2020, https://doi.org/10.22033/ESGF/CMIP6.6294.
Scoccimarro, E., and S. Gualdi, 2014: CMCC-CM model output prepared for CMIP5 piControl, served by ESGF. World Data Center for Climate, DKRZ, accessed 9 September 2020, https://doi.org/10.1594/WDCC/CMIP5.CMC0pc.
Seferian, R., 2018: CNRM-CERFACS CNRM-ESM2-1 model output prepared for CMIP6 CMIP piControl. Earth System Grid Federation, accessed 9 September 2020, https://doi.org/10.22033/ESGF/CMIP6.4165.
Sen Gupta, A., L. C. Muir, J. N. Brown, S. J. Phipps, P. J. Durack, D. Monselesan, and S. E. Wijffels, 2012: Climate drift in the CMIP3 models. J. Climate, 25, 4621–4640, https://doi.org/10.1175/JCLI-D-11-00312.1.
Sen Gupta, A., N. C. Jourdain, J. N. Brown, and D. Monselesan, 2013: Climate drift in the CMIP5 models. J. Climate, 26, 8597–8615, https://doi.org/10.1175/JCLI-D-12-00521.1.
Skliris, N., J. D. Zika, G. Nurser, S. A. Josey, and R. Marsh, 2016: Global water cycle amplifying at less than the Clausius-Clapeyron rate. Sci. Rep., 6, 38752, https://doi.org/10.1038/srep38752.
Tang, Y., S. Rumbold, R. Ellis, D. Kelley, J. Mulcahy, A. Sellar, J. Walton, and C. Jones, 2019: MOHC UKESM1.0-LL model output prepared for CMIP6 CMIP piControl. Earth System Grid Federation, accessed 9 September 2020, https://doi.org/10.22033/ESGF/CMIP6.6298.
Taylor, K. E., R. J. Stouffer, and G. A. Meehl, 2012: An overview of CMIP5 and the experiment design. Bull. Amer. Meteor. Soc., 93, 485–498, https://doi.org/10.1175/BAMS-D-11-00094.1.
Tjiputra, J., C. Roelandt, M. Bentsen, D. Lawrence, T. Lorentzen, J. Schwinger, Ø. Seland, and C. Heinze, 2012: CMIP5 output1 NCC NorESM1-ME piControl, served by ESGF. World Data Center for Climate, DKRZ, accessed 9 September 2020, https://doi.org/10.1594/WDCC/CMIP5.NCCNEpc.
Voldoire, A., 2018: CMIP6 simulations of the CNRM-CERFACS based on CNRM-CM6-1 model for CMIP experiment piControl. Earth System Grid Federation, accessed 9 September 2020, https://doi.org/10.22033/ESGF/CMIP6.4163.
von Schuckmann, K., and Coauthors, 2020: Heat stored in the Earth system: Where does the energy go? Earth Syst. Sci. Data, 12, 2013–2041, https://doi.org/10.5194/essd-12-2013-2020.
WCRP Global Sea Level Budget Group, 2018: Global sea-level budget 1993–present. Earth Syst. Sci. Data, 10, 1551–1590, https://doi.org/10.5194/essd-10-1551-2018.
WGCM Infrastructure Panel, 2017: CMIP6 data request. Accessed 8 April 2020, https://www.earthsystemcog.org/projects/wip/CMIP6DataRequest.
Wieners, K.-H., and Coauthors, 2019: MPI-M MPI-ESM1.2-LR model output prepared for CMIP6 CMIP piControl. Earth System Grid Federation, accessed 9 September 2020, https://doi.org/10.22033/ESGF/CMIP6.6675.
Wu, T., and X. Xin, 2015a: BCC-CSM1-1-M model output prepared for CMIP5 piControl experiment, served by ESGF. World Data Center for Climate, DKRZ, accessed 9 September 2020, https://doi.org/10.1594/WDCC/CMIP5.BCBMpc.
Wu, T., and X. Xin, 2015b: BCC-CSM1-1 model output prepared for CMIP5 piControl experiment, served by ESGF. World Data Center for Climate, DKRZ, accessed 9 September 2020, https://doi.org/10.1594/WDCC/CMIP5.BCB1pc.
Wu, T., and Coauthors, 2018: BCC BCC-CSM2MR model output prepared for CMIP6 CMIP piControl. Earth System Grid Federation, accessed 9 September 2020, https://doi.org/10.22033/ESGF/CMIP6.3016.
Zhang, H.-M., and Coauthors, 2019: Updated temperature data give a sharper view of climate trends. Eos, Trans. Amer. Geophys. Union, 100, https://doi.org/10.1029/2019EO128229.
Zhang, J., and Coauthors, 2018: BCC BCC-ESM1 model output prepared for CMIP6 CMIP piControl. Earth System Grid Federation, accessed 9 September 2020, https://doi.org/10.22033/ESGF/CMIP6.3017.
Ziehn, T., and Coauthors, 2019: CSIRO ACCESS-ESM1.5 model output prepared for CMIP6 CMIP piControl. Earth System Grid Federation, accessed 9 September 2020, https://doi.org/10.22033/ESGF/CMIP6.4312.