Earth system models (ESMs) simulate a large spread in carbon cycle feedbacks to climate change, particularly in their prediction of cumulative changes in terrestrial carbon storage. Evaluating the performance of ESMs against observations and assessing the likelihood of long-term climate predictions are crucial for model development. Here, we assessed the use of atmospheric growth rate variations to evaluate the sensitivity of tropical ecosystem carbon fluxes to interannual temperature variations. We found that the temperature sensitivity of the observed growth rate depended on the time scales over which atmospheric observations were averaged. The temperature sensitivity of the growth rate during Northern Hemisphere winter is most directly related to the tropical carbon flux sensitivity since winter variations in Northern Hemisphere carbon fluxes are relatively small. This metric can be used to test the fidelity of interactions between the physical climate system and terrestrial ecosystems within ESMs, which is especially important since the short-term relationship between ecosystem fluxes and temperature stress may be related to the long-term feedbacks between ecosystems and climate. If the interannual temperature sensitivity is used to constrain long-term temperature responses, the inferred sensitivity may be biased by 20%, unless the seasonality of the relationship between the observed growth rate and tropical fluxes is taken into account. These results suggest that atmospheric data can be used directly to evaluate regional land fluxes from ESMs, but underscore that the interaction between the time scales for land surface processes and those for atmospheric processes must be considered.
Evaluating feedbacks between tropical ecosystems and long-term climate change is crucial since terrestrial ecosystems currently act as a sink for 25%–30% of annual fossil fuel emissions (Le Quéré et al. 2016), and humid and semiarid tropical ecosystems are thought to contribute substantially to both the mean strength and the interannual variability of the sink (Ahlström et al. 2015). Several studies have noted that these ecosystems are highly sensitive to variations in temperature (Cox et al. 2013; Wang et al. 2013) and drought stress (Phillips et al. 2009; Gatti et al. 2014), suggesting that the tropical sink may be modified by changing climate. Tropical ecosystems store an estimated 500 PgC as biomass that may be vulnerable to long-term climate change (Pan et al. 2011), since under a high temperature future, ecosystem carbon loss due to higher tree mortality or enhanced rates of heterotrophic respiration may exceed net uptake by plants. Moreover, gross ecosystem fluxes in the tropics are large, between 50% and 60% of global primary productivity (Beer et al. 2010), suggesting that even small changes in climate could have a large impact on global carbon uptake. Recent studies have suggested that improving observational inferences about tropical carbon–climate interactions is necessary, since these ecosystems may be near a high temperature threshold in which further increases to air temperature reduce net assimilation (Doughty and Goulden 2008).
Long-term changes in land carbon stocks (; equivalent to cumulative net fluxes) are commonly attributed to a carbon–concentration feedback and a carbon–climate feedback [Boer and Arora (2009); Equation (1)]:
where β [PgC ppm−1] represents the sensitivity of land carbon stocks to CO2 fertilization, and [PgC K−1] represents their sensitivity to long-term temperature changes (Friedlingstein et al. 2006). Coupled Earth system model (ESM) predictions of through 2100 vary widely and do not even agree in sign (Friedlingstein et al. 2006, 2014). The mechanistic attribution for among ESMs participating in phase 5 of the Coupled Model Intercomparison Project (CMIP5; Taylor et al. 2012) likewise disagreed: model β factors differed by a factor of 6, while model coefficients differed by a factor of 5 (Arora et al. 2013).
The lack of model agreement in both the magnitude of and the mechanism driving the land carbon feedback to anthropogenic climate change underscores the need to evaluate models against observations. When evaluating coupled ESMs, simulated ecosystem properties may disagree with observational metrics due either to misparameterization of the relevant biogeochemical or biogeophysical processes or to biases in the physical climate drivers thereof. Model development and improvement, therefore, requires evaluating simulations using metrics that constrain functional responses—in other words, the relationships between driver and response variables—rather than simply comparing time series or spatial distributions of individual variables (Randerson et al. 2009).
It is further necessary to identify methods to gauge the realism of future predictions, since model agreement with present-day observations merely improves confidence that the model represents relevant processes, but cannot ensure predictive skill (Tebaldi and Knutti 2007; Bonan and Doney 2018). One method that has become increasingly prominent in climate change literature is the use of emergent constraints, which provides a methodology to evaluate long-term predictions within the context of a multimodel ensemble when a correlation exists between a short-term functional response, which can be evaluated against observations, and a long-term feedback governed by the same mechanism (Klein and Hall 2015). For example, Hoffman et al. (2014) showed that in the CMIP5 ensemble of ESMs, the atmospheric mole fraction at 2100 was correlated to the simulated mole fraction in 2010. This relationship forms the basis for an emergent constraint because 1) the accuracy of the short-term model output ( at 2010) can be evaluated against observations, and 2) the same processes that govern the rate of atmospheric carbon accumulation through 2010 [viz., the magnitude of the β and effects in Equation (1)] are among the important processes that govern atmospheric carbon accumulation through 2100. While this example relates to carbon cycle feedbacks, emergent constraints have been used to evaluate the likelihood of long-term predictions for several different components of the Earth system, including cryospheric feedbacks (Hall and Qu 2006; Boé et al. 2009) and cloud feedbacks (Gordon and Klein 2014) to climate change.
Unfortunately, developing functional response metrics for tropical ecosystem carbon–climate interactions is difficult, in part because of a lack of large-scale observations of tropical ecosystem function. There are limited tropical sites at which fluxes are measured directly via eddy covariance (Schimel et al. 2015). Moreover, these sites may not be representative of the entire tropics, given a high degree of ecosystem heterogeneity driven both by biological diversity and abiotic factors, such as soil chemistry and local hydrology (e.g., Araujo et al. 2002; Townsend et al. 2008). Thus, we propose to exploit long-term observations of the atmospheric mole fraction to gain insight into functional relationships within tropical ecosystems. The atmospheric mole fraction, which can be observed in situ, measured via flask sampling, or inferred from remote sensing, integrates variations in fluxes at spatial scales compatible with the resolution of ESMs (on the order of 104 to 106 km2) (Keppel-Aleks et al. 2013). Given the large concentration footprint, atmospheric variations in space and time are an apt constraint for regional- to global-scale carbon–climate feedbacks (Keppel-Aleks et al. 2013). A previous study that analyzed variations in atmospheric , its isotopic composition, and the atmospheric mole fraction at both seasonal and interannual time scales found terrestrial, rather than oceanic, fluxes are the primary drivers of variations in the atmospheric mole fraction (Battle et al. 2000), a result corroborated by atmospheric inverse modeling of both and its isotopic composition (Rayner et al. 2008). At interannual time scales, therefore, variability in the atmospheric growth rate may be used to estimate the sensitivity of terrestrial ecosystems to variations in climate (Wang et al. 2013; Keppel-Aleks et al. 2014).
Several studies have shown that the single variable to which the interannual growth rate is most strongly correlated is tropical temperature variations, suggesting high temperature stress in tropical ecosystems (Wang et al. 2013). Cox et al. (2013) further showed a strong correlation between interannual growth rate sensitivity to temperature (; Table 1) and long-term carbon losses in response to warming [Table 1; Equation (1)] across C4MIP coupled models. By using 50 years of atmospheric observations to identify the models whose was consistent with the observationally derived temperature sensitivity, Cox et al. (2013) concluded that the most likely long-term sensitivity to warming was weaker than that of the unconstrained multimodel mean and had narrower uncertainty. More recently, Wenzel et al. (2014) applied the same method to the CMIP5 ensemble and found that the constrained value on was consistent between the two model ensembles.
While the use of emergent constraints provides a promising method to link contemporary observations to the likelihood of future model outcomes, the results must be interpreted with caution. Previous studies have acknowledged that different processes, including vegetation mortality and shifts in vegetation, operate at long time scales, which could cause decoupling between the short-term and long-term temperature responses (Randerson 2013). Here, we focus on the challenge of evaluating modeled land fluxes against observed atmospheric since atmospheric reflects additional processes, such as atmospheric mixing, that are characterized by time scales independent of those that affect terrestrial ecosystems. Furthermore, atmospheric contains the imprint of drought, temperature, and fire across the tropics and the Northern Hemisphere (NH) (Keppel-Aleks et al. 2014; Wunch et al. 2013), so accounting for the influence of these drivers on atmospheric growth rate variability is necessary in developing a functional response metric focused on tropical ecosystems.
The goal of this paper is to identify methods to use atmospheric observations to isolate the temperature sensitivity of tropical net terrestrial exchange (; defined in Table 1). As a second step, we seek to quantify the difference between using land fluxes and atmospheric observations for calculating the from simulations. Finally, we extend the application of this metric as an emergent constraint for the long-term temperature sensitivity (; Table 1); this step will permit quantification of the uncertainty embedded in the emergent constraint approach due to uncertainties in the observational constraint. Overall, we hypothesize that any dissimilarity between the model output used to quantify short-term variations and the observations used as benchmarks may contribute bias both to the functional response metric and to the emergent constraint. For example, Cox et al. (2013) spatially integrated global land fluxes to represent the variation in atmospheric , implicitly assuming the atmosphere is instantaneously well mixed. In reality, atmospheric varies with location due to atmospheric transport patterns and is sampled only at discrete points in space and time. The estimated from atmospheric therefore depends on the extent of spatial and temporal averaging of the observations (Keppel-Aleks et al. 2014). The structure of the paper is as follows: in section 2, we describe the method to calculate interannual variability and its relationship to temperature for both observations and ESMs. In section 3, we report results using atmospheric observations for benchmarking ESM sensitivities. Finally, in section 4, we describe implications for using atmospheric as a benchmark for functional responses and requirements for future model development.
2.1. growth rate from atmospheric observations
We calculated the temperature sensitivity of the atmospheric mole fraction growth rate derived from observations at marine boundary layer (MBL) sites in the NOAA network (Dlugokencky et al. 2016; Table 2). These sites are located away from fossil fuel sources and regions of strong terrestrial uptake, and monthly mean data are derived from flask samples that are accurate to within 0.2 ppm of the WMO scale with a 1σ precision of 0.1 ppm (Conway et al. 1994). We calculated the interannual component of variability by detrending the monthly mean time series from each site, using a 3rd-order polynomial over the 24-yr period from 1982 to 2005 and then subtracting the mean annual cycle (Keppel-Aleks et al. 2013). Site-specific interannual variability [IAV (ppm)] was averaged within six latitude belts corresponding to the tropics (0°–23°), midlatitudes (23°–60°), and high latitudes (60°–90°) of each hemisphere, and a global mean time series of variability was then calculated as the area-weighted average of IAV within these latitude belts (Figure 1). We analyzed the time period after 1982 since there are multiple observing sites within all latitude belts. Furthermore, our zonal averaging approach ensured that the globally averaged variability was not overly influenced by the Northern Hemisphere, where the sampling network is more dense. For consistency with previous work (e.g., Cox et al. 2013; Wang et al. 2013; Wenzel et al. 2014), we filtered out IAV from measurements during the 24 months following the major volcanic eruptions of El Chichon (1982) and Mt. Pinatubo (1991).
We aggregated the monthly mean interannual variability into either quarters (e.g., January–March and April–June) or years and differenced across sequential periods to calculate a growth rate (ppm yr−1). The seasonal growth rate anomalies were therefore centered on 1 January (winter; difference between JFM and OND), 1 April (spring; difference between AMJ and JFM), 1 July (summer; difference between JAS and AMJ), and 1 October (fall; difference between OND and JAS), and the annual mean growth rate was centered on 1 January. Growth rate anomalies at all temporal frequencies were then converted to units of PgC yr−1 using the global mass of atmospheric dry air.
The interannual growth rate anomaly was related to variations in tropical temperature via linear regression to estimate (Figure 2). We calculated an area-weighted tropical land temperature between 23°S and 23°N using the gridded Hadley Centre Climate Research Unit (CRU) dataset at 4° × 5° resolution (Jones et al. 2012). The tropical land temperature time series was linearly detrended and the mean annual cycle removed to reveal interannual variations at monthly time steps, which were averaged to the same quarterly or annual resolution as the atmospheric observations. Because the atmospheric growth rate is calculated as a difference and therefore centered at the start of each quarter (e.g., 1 January and 1 April) or calendar year, and the temperature anomaly is centered midquarter or midyear, we averaged two successive quarters or years of temperature anomalies over land (cf. Cox et al. 2013) before calculating via linear regression of the two quantities.
Climate variability is not normally distributed, so the length of and gaps in the and temperature records may affect the estimate of . We quantified uncertainty in the regression coefficient due to data gaps and outliers using bootstrap Monte Carlo to generate 1000 synthetic datasets of length 24 years by sampling observed –temperature pairs with replacement, and we report the median and standard deviation of across these realizations. The same method was used to calculate from simulated carbon cycle diagnostics (section 2.2).
2.2. CMIP5 carbon cycle diagnostics
We analyzed the interannual climate sensitivity of terrestrial carbon uptake for historical simulations in eight CMIP5 ESMs (Table 3) whose outputs were obtained from the Earth System Grid Federation (Williams et al. 2011). The historical simulations covered the period 1850–2005 and were forced with historical atmospheric composition changes, including greenhouse gas and aerosol concentrations resulting from anthropogenic and natural sources. CMIP5 models were also forced with historical land-use change patterns, although the implementation varied across models (Taylor et al. 2012). From these simulations, we analyzed temperature and net biospheric production (NBP) output. NBP encompasses all terrestrial processes that leave an imprint on atmospheric , including net ecosystem exchange (NEE; defined as the sum of gross primary production and ecosystem respiration), carbon loss from disturbance, and harvest (Chapin et al. 2006). Each CMIP5 ESM is fully prognostic, meaning the carbon cycle in each model was coupled to different patterns of internal climate variability. Thus, evaluation of the interannual variability requires a functional response metric, since the timing and magnitude of internal climate variations, such as those from El Niño, are unique to each model run.
To examine our hypothesis that neglecting atmospheric processes induces a bias between inferred from modeled fluxes and the that must be calculated from atmospheric , it was necessary to use an offline atmospheric transport model to simulate atmospheric at the MBL observatories since only a few CMIP5 models propagated surface carbon fluxes through their own atmospheric model. To simulate the imprint of transport, we used the GEOS-Chem atmospheric transport model (version 9.1.2; Nassar et al. 2010) to simulate the spatial distribution of from CMIP5 NBP. GEOS-Chem was driven by MERRA reanalysis data (Rienecker et al. 2011) at 4°lat × 5°lon horizontal resolution, with NBP data from each CMIP5 model regridded to the resolution of the transport model. We ran the model from 1979 through 2005, discarding the first 3 years to allow for the tracer to spin up. We then sampled monthly mean output from GEOS-Chem at the locations of NOAA MBL sites and detrended the model output using the method described for the observations. Simulated values were likewise determined by regressing the growth rate against each model’s tropical land temperature anomaly.
We note that in comparing the simulated atmospheric to the observations, we neglect nonterrestrial influences on the observations. For example, the observed growth rate anomaly includes a small contribution from ocean and fossil fluxes (Keppel-Aleks et al. 2014), while the simulated growth rate includes only the influence of NBP. Thus, any covariance between temperature and ocean carbon fluxes or fossil fuel emissions will modify but none of the model diagnostics.
In addition to calculating for the CMIP5 ensemble members using simulated atmospheric , which most closely resembles the observations available to evaluate ESMs, we calculated directly from an area-weighted integral of NBP. This quantity represents the apparent growth rate that would be measured in an instantaneously well-mixed atmosphere and contains a source of bias in addition to those described above since it further lacks the imprint of atmospheric mixing. We calculated by regressing globally integrated NBP at both monthly and annual time steps against the tropical temperature anomaly, similar to the method used for atmospheric . We also calculated from NBP confined to the tropics. In terms of developing a tropical functional response metric, this is the most relevant target; however, it is a quantity that is not directly observed. Thus, much of the focus of this paper is on identifying an atmospheric-based metric that can be used to evaluate the inferred from model tropical NBP and temperature variations. Since both the NBP and the temperature anomalies were centered in the middle of the calendar month or year, we simply used monthly or annual land temperature averages, rather than averaging two successive time periods, as was required for the regressions against the atmospheric growth rate.
2.3. Calculating a constraint on
We calculated an emergent constraint on the long-term temperature sensitivity of terrestrial carbon storage based on the agreement of calculated from annual model data (described in section 2.2) with the observed (section 2.1). We used the value for each CMIP5 model reported in Wenzel et al. (2014), which were calculated from simulations with a 1% increase in radiative forcing by comparing the response in runs with and without biogeochemical coupling. We fit a slope and intercept to the and data, allowing error in both variables (York et al. 2004). To parameterize errors on , we used the standard deviation from bootstrap Monte Carlo, and we set the errors on to 10 PgC K−1, the value that minimized the reduced value of the fit. It was necessary to assume an error on each model’s to obtain reasonable error estimates on the regression coefficients. The error affected the width of the probability density function for the constrained ; however, the constrained estimate for was insensitive to the choice of the error between 1 and 30 PgC K−1. We derived the probability density function for by propagating the uncertainty in the observational constraint ( calculated at the corresponding monthly or annual time step) and the error on the fitted slope and intercept.
3.1. Temperature sensitivities inferred from atmospheric observations
The observed temperature sensitivity of land carbon uptake () depended strongly on the season analyzed and the frequency over which observations were averaged (Figure 2). The determined from annual mean data was PgC yr−1 K−1 (Figure 2a), compared to a value of PgC yr−1 K−1 determined from monthly data (Figure 2b). Both values for are consistent with previous estimates from global atmospheric (Cox et al. 2013; Keppel-Aleks et al. 2014), but it is worth noting that the methodological uncertainty (a 1.1 PgC yr−1 K−1 difference between annual and monthly averaging) is as large as the uncertainty on the fits themselves (1.1–1.3 PgC yr−1 K−1). Calculating separately for each quarter revealed strong seasonality to the relationship between the atmospheric growth rate and tropical temperature variations (Figures 2c–f). The quarterly growth rate for the NH spring (Figure 2d) had the highest slope (5.9 ± 2.2), while the quarterly growth rate centered on fall (Figure 2f) had a modest temperature sensitivity (2.4 ± 1.0). When quarterly regressions between the growth rate and temperature anomalies were performed for NH and tropical separately, we found weaker correlations (not shown), suggesting that it was necessary to aggregate observations from stations globally to define a relevant constraint for model benchmarking.
When the monthly or quarterly growth rate and tropical temperature anomalies were recentered by the annual mean values before fitting a value, there were only weak relationships between the two quantities (Figure 3), suggesting that global growth rate anomalies are most responsive to slowly varying temperature anomalies at annual time scales, as opposed to more rapid fluctuations in temperature at monthly time scales. We note that there is little structure to the residuals as a function of time (colors in Figure 3). The slopes fit to monthly and seasonal residuals were generally weakly positive (Figures 3a–d), with the exception of fall residuals (Figure 3e), indicating that high-frequency temperature stress induced further net carbon release to the atmosphere (Figures 3b–d). For NH fall, the residuals were negatively correlated (Figure 3e), suggesting that after accounting for annual mean temperature anomalies, anomalously warm autumns were associated with more carbon uptake.
3.2. A functional response metric for for tropical ecosystem fluxes
Since our goal was to develop a functional response metric for the tropical ecosystem temperature sensitivity, we first analyzed the difference in temperature sensitivity owing to tropical versus global fluxes. In section 3.1., we showed that annual temperature variations were a more important driver of variations in the observed growth rate than temperature variations at higher frequencies; therefore, we analyzed the annually integrated response of land carbon fluxes from CMIP5 simulations. Across the model ensemble, the values inferred from both global and tropical NBP spanned an order of magnitude due to differences among model structures and parameterizations (Table 3), consistent with previous results (Wenzel et al. 2014). For most individual models, the temperature sensitivity of tropical land fluxes was different from the temperature sensitivity calculated from global fluxes. Across the model ensemble, the ratio of these two sensitivities ranges from 0.5 to 1.2, with a median value of 0.9. This is a key result because it confirms that for most of the models (six out of eight), extratropical carbon fluxes are also positively correlated with tropical temperature variations, consistent with a prominent role for El Niño–driven teleconnections outside the tropics (Keppel-Aleks et al. 2014). With respect to model evaluation, it confirms that the use of global diagnostics (e.g., globally integrated fluxes or the global atmospheric growth rate) to make inferences about the tropics introduces substantial bias.
Second, we analyzed the impact that atmospheric transport leaves on by comparing results from globally integrated land fluxes with those from atmospheric . For most models, the derived from annual NBP was smaller than that derived from annual growth rate, with differences from −5% to 200% (Table 3). On average, the atmospheric growth rate within a model was 30% larger than the corresponding annual integrated net fluxes. If the atmospheric time series derived from the available MBL stations were representative of the entire atmosphere, this quantity would be unity by mass balance. The 30% average difference arises from atmospheric transport processes. Because we used the same atmospheric transport to advect terrestrial fluxes for all members of the ensemble, the different ratio from model to model was due to cross-model differences in the spatial distribution of fluxes relative to dominant patterns of atmospheric transport.
Throughout the manuscript, we calculated the atmospheric growth rate from a subset of atmospheric observations within the MBL. Typically, air samples are obtained at these sites within 100 m of the surface; therefore, this type of observation does not reflect the vertical gradients in that arise due to the interaction of atmospheric transport and fluxes at the surface. As a sensitivity test, we calculated the total column-integrated mole fraction from the GEOS-Chem model output at the same MBL sites, emulating the total column currently observed by satellites, such as OCO-2 (Eldering et al. 2017) and GOSAT (Yokota et al. 2009). When using total column , the ratio between calculated from fluxes and from atmospheric total column was reduced but still represented a 20% mismatch, owing to the fact that the spatially sparse sampling network is not uniformly sensitive to global vegetated land surfaces.
Together, these results suggest that a functional response metric to evaluate the temperature sensitivity of tropical ecosystems should be 1) minimally sensitive to the imprint of extratropical fluxes to better isolate the temperature sensitivity of tropical fluxes and 2) maximally representative of the spatially integrated fluxes despite sparse atmospheric sampling. With these criteria in mind, the relationship between the tropical land-derived and the atmosphere-derived was highest among CMIP5 models when was calculated using the January-centered (winter) atmospheric growth rate (Figure 4). Using the seasonal diagnostic, the slope between from tropical fluxes and from atmospheric was 0.9 ± 0.3 (R2 = 0.86), showing both improved correlation and smaller error on the slope, compared to the relationship between the annual growth rate and tropical fluxes (slope of 1.2 ± 0.8 and R2 = 0.71). In other seasons, the R2 value ranged from 0.4 to 0.7, the slope showed greater deviation from 1, and there was larger fractional uncertainty on the slope.
The strong relationship between tropical land-derived and the winter atmospheric-derived arises from a seasonally quiescent Northern Hemisphere biosphere. This result was corroborated in two ways: first, the relationship between Northern Hemisphere land-derived and the derived from the winter growth rate was minimal for the winter quarter output (R2< 0.05), suggesting that Northern Hemisphere ecosystems do not contribute to the seasonal temperature sensitivity. Second, the value from winter-only tropical land fluxes was highly correlated with the value from winter atmospheric , with a slope of 1.0 ± 0.2 and an R2 of 0.96, confirming minimal interference from the Northern Hemisphere biosphere during the winter season. Together, these relationships confirmed that the winter atmospheric value isolates the temperature sensitivity of the tropical carbon–climate functional responses.
3.3. An emergent constraint on long-term tropical sensitivity
The winter growth rate metric can be used not only to evaluate model representations of tropical carbon fluxes at contemporary time scales, but also to calculate an emergent constraint on the long-term sensitivity of tropical carbon fluxes. When the value derived from tropical NBP in each CMIP5 model (Table 3) was evaluated against the observed from winter growth rate, the estimated long-term climate sensitivity was 58 ± 16 PgC K−1 (Figures 5a, 6), in contrast to a value of 71 ± 16 PgC K−1 when the observational constraint is from annual growth rate (Figures 5b, 6). The 20% high bias when using annual, rather than seasonal, atmospheric results from the positive sensitivity between temperature and Northern Hemisphere fluxes for seasons other than NH winter. Extratropical fluxes thereby increased the apparent tropical temperature sensitivity, and a subset of models with greater interannual sensitivity was identified as consistent. In contrast, when the winter was used, the consistent model ensemble was weighted toward a lower long-term sensitivity (Figures 5a,b). As expected, the optimized value was consistent when CMIP5 values were calculated from each model’s winter growth rate (Figure 5c), as opposed to directly from tropical land (Figure 5a). We found, however, that both these values were shifted toward a higher sensitivity than the emergent constraint on derived from annual (Figure 5d; 48 ± 14 PgC K−1). Again, it is noteworthy that the methodology used to determine the observational constraint introduces as much uncertainty into the emergent constraint (13 PgC K−1) as does the uncertainty in the fit itself.
Our results suggest that atmospheric observations can provide direct constraints on tropical land fluxes. We identified a functional response metric to diagnose the sensitivity of tropical carbon fluxes to local temperature variations using the winter seasonal atmospheric growth rate. When winter atmospheric growth rate anomalies were used to evaluate the temperature sensitivity of net tropical ecosystem exchange, the interference from Northern Hemisphere ecosystems was minimized. Additionally, bias from sparse atmospheric sampling was small, based on offline analysis from eight CMIP5 ESMs whose terrestrial fluxes have been advected using an atmospheric tracer transport model. We found that a different set of models was consistent with the observational constraint, depending on whether atmospheric data were integrated annually versus isolating only one season, a finding that affects the benchmarking and subsequent tuning of ESMs. In this study, six ESMs whose tropical ranged from 2.8 to 8.1 PgC yr−1 K−1 were consistent with the observational constraint. Moreover, we found that the predicted long-term temperature sensitivity of tropical ecosystems was 58 ± 16 PgC K−1, a value that was smaller by 20% than if an annual constraint had been used.
Atmospheric observations provide one of the few regional- to global-scale diagnostics for carbon cycle processes. Although atmospheric has been used previously for model benchmarking in terms of the long-term increase (Hoffman et al. 2014) or seasonal diagnostic (Keppel-Aleks et al. 2013), determining functional responses that can be used to benchmark large-scale feedbacks in the context of biogeochemical and land model benchmarking systems [e.g., the International Land Model Benchmarking Project (ILAMB); Mu et al. 2014; Hoffman et al. 2017] is important for improving the predictions of terrestrial models. Determining robust ways to work with atmospheric presents challenges, however, since it integrates land, ocean, and anthropogenic fluxes over a range of geographic scales. Developing a CO2 metric that isolates a single functional response, as we have done here, opens up possibilities to use observations in different ways to isolate each of these different sensitivities that control long-term feedbacks. Especially in the context of an emergent constraint, tropical and Northern Hemisphere ecosystems will likely experience different responses to long-term temperature increases and, thus, must be considered separately.
In the context of an emergent constraint, our analysis shows that different averaging time scales for the observations affect the optimized constraint, as does using atmospheric to evaluate land fluxes directly, rather than evaluating fluxes transported through an atmospheric model. In the introduction, we identified two key components to an emergent constraint: the ability to evaluate short-term predictions against observations and a mechanistic link between the short- and long-term model-predicted quantities. While substantial attention has been paid in the literature to concerns that long-term feedbacks may include processes that do not contribute substantially to present-day interannual variability in land carbon storage (Randerson 2013; Keppel-Aleks et al. 2014), less attention has been given to how differences among model output and observations affect an emergent constraint. Here, we showed that there is potential for bias in emergent constraints when the differences between observed and simulated quantities are ignored. In fact, our results suggest that the method used to average the observational constraint for functional response metrics or for emergent constraints is as important as the fitting uncertainty itself.
Our results indicated that the emergent constraint on terrestrial temperature sensitivity from a multimodel ensemble shows a systematic dependence on the choice of observational constraint and the treatment of model output. There was a 20% difference in the expected value for the tropical when the annual versus winter atmospheric growth rate was used to evaluate short-term variations. This difference can be explained by the fact that annual CO2 growth rate anomalies include the additional temperature sensitivity of extratropical Northern Hemisphere ecosystems. It is more difficult to explain, however, why the emergent constraint on γLT derived from the annual CO2 growth rate from CMIP5 models and as the observational constraint (Fig. 5d) yields a lower value for γLT (48 ± 14 PgC K−1), beyond noting that small differences in the γIAV values can lead to large differences in the fitted slope. Thus, the error on the optimized may be underestimated by the error propagation and quantification methods generally used in the calculation of emergent constraints. The error on may further be underestimated because the interannual variations in the growth rate used to calculate may not be predictive of nonlinearities within the carbon cycle due to, for example, changes in vegetation mortality or fire, which are not well represented in CMIP5 ESMs. These nonlinearities may change the long-term trajectory of tropical ecosystems and invalidate the entire approach of calculating an emergent constraint.
Given the persistent uncertainties in prediction of the land carbon sink over the twenty-first century and the fact that CMIP5 coupled models simulated as large of a spread in cumulative net carbon uptake as did an earlier generation of C4MIP models (Friedlingstein et al. 2006, 2014), analyzing model output fields that have direct relationships to observational datasets is crucial. Our results show that there is a 25% bias when was calculated from atmospheric data versus integrated NBP within the model ensemble (Table 2). In the analysis presented here, it was necessary to translate fluxes through a transport model or emulator to reproduce model data as similar to the atmospheric observations as possible. However, using atmospheric transport inconsistent with the climate used to force land surface fluxes contributes additional uncertainty that could be avoided by making the time-varying, three-dimensional structure of atmospheric a standard model output.
The research was supported in part by the RUBISCO Scientific Focus Area (SFA), which is sponsored by the Regional and Global Climate Modeling (RGCM) Program in the Climate and Environmental Sciences Division (CESD) of the Office of Biological and Environmental Research in the U.S. Department of Energy Office of Science. Oak Ridge National Laboratory is managed by UT-Battelle, LLC, for the U.S. Department of Energy under Contract DE-AC05-00OR22725. We acknowledge the World Climate Research Programme’s Working Group on Coupled Modelling, which is responsible for CMIP, and we thank the climate modeling groups producing and making available their model output. For CMIP, the U.S. Department of Energy’s Program for Climate Model Diagnosis and Intercomparison provides coordinating support and led development of software infrastructure in partnership with the Global Organization for Earth System Science Portals.