EU Surface Temperature for All Corners of Earth (EUSTACE, www.eustaceproject.org) is a 4-yr research project funded by the European Union Horizon 2020 research and innovation program (EU H2020; Grant Agreement 640171; see appendix A for a list of the Consortium’s institutions) that started on 1 January 2015. EUSTACE has used temperature estimates from satellites to boost the amount of information available beyond that provided by weather stations and ships to help to construct a prototype global, multidecadal daily air temperature record presented on a 0.25° latitude × 0.25° longitude grid.
Near-surface air temperature (typically measured at a height of about 2 m above ground level at meteorological stations) is a fundamental quantity for many of the activities undertaken in climate science and in many of the societal concerns that climate services aim to support; it is something that we all experience directly in our day-to-day lives. Near-surface air temperature has been measured almost continuously in some places and across the global oceans by ships for well over a century. Designated as an Essential Climate Variable (ECV), these records allow for the construction of a useful climate data record (CDR) in those places for the period covered. Globally, however, there a number of locations where either access to the measurements is not possible, or no air temperature records exist. As well as long records of direct measurements of near-surface air temperature, we have information from satellite retrievals (i.e., remotely sensed, indirect estimates) of temperature. However, satellite retrievals tend not to pertain to the air temperature that we experience directly, but either to an average temperature of a higher layer in the atmosphere or to the skin temperature of the surface of the Earth. These quantities are related to near-surface air temperature, more or less tightly depending on the type of surface and the surface–lower-atmosphere interactions. Therefore, it is possible to use satellite-derived temperatures together with near-surface air temperature measurements to create a more complete climate data record of air temperature. Thus, EUSTACE created a prototype global climate data record of near-surface air temperature for every day since January 1850 using both direct measurements of air temperature and estimates of it based on satellite skin temperature retrievals.
Near-surface air temperature products provide valuable information for a range of activities, from the monitoring of current conditions (e.g., Sánchez-Lugo et al. 2019) to the assessment of past variability (e.g., Osborn et al. 2017) to their use in seasonal-to-decadal forecasting (e.g., Kushnir et al. 2019), climate model evaluation (e.g., Walters et al. 2019), detection and attribution of climate change (e.g., Jones and Kennedy 2017), Intergovernmental Panel on Climate Change assessments (e.g., Hartmann et al. 2013), agricultural modeling (e.g., Weedon et al. 2011), health modeling (e.g., Xu et al. 2019) and other downstream uses. Such a daily surface air temperature product could form part of the future operational monitoring system for surface air temperature over the polar regions, over Africa and South America. EUSTACE has already enabled monitoring of lake surface water temperature to be included in the annual State of the Climate reports (for the years 2015, 2016, 2017, and 2018; Woolway et al. 2016, 2017a, 2018a; Carrea et al. 2019). EUSTACE products are complementary to products from dynamical reanalyses (e.g., Buizza et al. 2018) with much of the work dedicated to the preparation of input surface temperature observations, for which EUSTACE has performed thorough uncertainty analyses, which were previously lacking.
Dynamical reanalyses combine historical and recent observations with numerical weather prediction models to produce dynamically consistent reconstructions of past weather and climate. These reanalyses require observational data with well-characterized uncertainties. The new, validated estimates of uncertainty in satellite surface skin temperature observations developed by EUSTACE are of benefit to them. EUSTACE products also provide an alternative source of near-surface air temperature data that is independent from numerical weather prediction models and extends further back in time than most dynamical reanalyses.
Results from scientific projects are often not produced in a format that can be used easily by others; in general, processing or translation is needed. Two-way interaction with potential users from the start of a project helps to increase the relevance and usability of products to various potential user groups. EUSTACE collected information on user requirements in several ways, via user consultation workshops; questionnaires and interviews; a literature review on user requirements (Bessembinder 2016; Bessembinder et al. 2017, including the results from a large number of national and EU projects); testing of example mock-up datasets; and describing specific use cases with “trail blazer” users.
These activities resulted in greater insight into how climate data are used, data format preferences, and which variables are needed (i.e., not just daily mean temperature, but also minimum and maximum temperature), among other things. We used many of the user requirements collected to design the EUSTACE data file structure and the user guides; for example, a quick start guide is provided as part of the product user guide, together with example use cases.
While many of the ideas used within EUSTACE have been trialed elsewhere for individual regions (e.g., Cristóbal et al. 2008), or for different time scales (e.g., Kilibarda et al. 2014), EUSTACE has brought them together for the first time to create global, multidecadal daily products. EUSTACE has performed an integrating function, bringing together products and expertise from a wide range of European, national, and international initiatives. EUSTACE has also followed much of the road map of “recommended steps towards meeting societal needs for surface temperature understanding and information” set out previously in the EarthTemp Network Community Paper (Merchant et al. 2013). In particular, EUSTACE has made progress in 7 out of the 10 broad aims identified therein:
-
develop more integrated, collaborative approaches to observing and understanding Earth’s various surface temperatures;
-
build understanding of the relationships between different surface temperatures, where presently inadequate;
-
make surface temperature datasets easier to obtain and exploit for a wider constituency of users;
-
consistently provide realistic uncertainty information with surface temperature datasets;
-
communicate differences and complementarities of different types of surface temperature datasets in readily understood terms;
-
rescue, curate, and make available valuable surface temperature data that are presently inaccessible; and
-
build capacities to accelerate progress in the accuracy and usability of surface temperature datasets.
Computer code has been developed both to estimate air temperature from satellite data and to create daily maps of mean air temperature; this code has been publicly released (Rayner 2019). Information contained in the satellite retrievals helps to create more-complete fields in the past, via statistical models of how surface air temperature varies in a connected way from place to place. As the data volumes involved are considerable, the EUSTACE partnership included statisticians and computer scientists, enabling the development of efficient analysis methods. As a result, EUSTACE has been able to demonstrate that these methods can be built into a fully functional processing system, with research-level maturity (EUMETSAT 2014) that exploits the features of modern high-performance computing resources to deliver the more-complete datasets described below. This system could be used in future to update some of the EUSTACE datasets described here to enable their use in climate monitoring.
The datasets that are currently commonly used to monitor surface temperatures globally are constructed as a combination of air temperature observations over land and sea surface temperature observations over ocean. The current versions of the most widely used global near-surface temperature datasets, HadCRUT4 (Morice et al. 2012), NOAAGlobalTemp (Smith et al. 2008; Vose et al. 2012), and GISTEMP (Hansen et al. 2010), extend from the mid-nineteenth century to present and are derived from in situ observations only; temperature retrievals from satellites are not used in their construction. These global temperature datasets are presented at monthly resolution because summaries of monthly average temperatures are more commonly available for individual meteorological stations and cover a greater region of the Earth than daily or subdaily summaries in the nineteenth century and early twentieth century. The density distribution of available in situ temperature observations limits the spatial resolution of these products. For example, HadCRUT4 is provided as monthly fields on an equi-angle latitude–longitude grid at 5° resolution.
Surface air temperature datasets covering land regions, but not ocean or sea ice, are available at higher spatial and temporal resolutions. For example, Rohde et al. (2013a,b) use a larger number of meteorological stations than do HadCRUT4, NOAAGlobalTemp, or GISTEMP, together with a statistical interpolation algorithm, to produce a monthly surface air temperature dataset at higher spatial resolution; an experimental daily analysis has also been produced. Other high-resolution datasets of air temperatures over land are available and are commonly used in climate modeling (Harris et al. 2013) and hydrological modeling (Weedon et al. 2011). Higher-temporal-resolution air temperatures derived from land meteorological station observations are also available, including the Global Historical Climatology Network–Daily (GHCN-D) databank (Menne et al. 2012) and the subdaily Hadley Integrated Surface Database (HadISD) databank (Dunn et al. 2016). Gridded temperature fields based on GHCN-D are available in the HadGHCN-D dataset (Caesar et al. 2006) covering a time period from 1950 to the present. HadISD is presented as time series for individual meteorological stations only. However, none of these latter datasets are based on homogenized data (see below).
The existing coarse-resolution global temperature datasets are widely used in global and regional climate assessments; however, their utility is limited in some applications that require information at high temporal and/or spatial resolutions, such as the assessment of temperature extremes, national climate assessments, regional impact studies, and validation of climate simulations from high-resolution climate models. These global temperature datasets are also often expressed in terms of temperature anomalies (temperatures relative to average conditions over some reference period), rather than in terms of absolute temperature information, which is commonly needed in these applications. EUSTACE provides products that can be used for the study of absolute temperatures, as well as providing information relevant to temperature anomalies.
Figure 1 provides an overview of the EUSTACE process and shows how different activities linked together to transform the source datasets (appendix B) into the series of EUSTACE products (appendix C). Source datasets were chosen to maximize our opportunity to quantify the components of uncertainty (in the case of satellite data) and the amount of historical daily information (in the case of weather station data). Wrapped around these scientific developments were interactions throughout the project with potential users. Evaluation against independent reference measurements (Veal 2019a) and comparison with other related products (Veal 2019b) put EUSTACE work into context.
Through this development process, EUSTACE has contributed to advancing and enabling climate science in five main areas:
-
Detecting and correcting for nonclimatic discontinuities in weather station series: to provide an accurate picture of variations in air temperature, measurements at weather stations have been checked for any jumps in the series and then corrected (Squintu et al. 2019a,b). Such discontinuities might have arisen from changes in the surroundings of the weather station, the instruments used, the location of the station, or the measurement procedure (Brugnara et al. 2019).
-
Estimating consistent skin temperature uncertainties: EUSTACE used satellite data on the surface skin temperature of the land, ocean, and ice, obtained from European reprocessing projects with diverse approaches to estimating uncertainty. Therefore, we derived consistent uncertainty estimates for these data over all surfaces in order to use them together effectively (Ghent et al. 2019; Nielsen-Englyst et al. 2019a).
-
Estimating air temperature from satellite data: while in some locations air temperature records can span periods of a century or more, in many areas there is a lack of information. EUSTACE has helped to provide daily air temperature information by using temperature estimates from satellite measurements to boost the amount of information beyond that already available from weather station records and ships (Nielsen-Englyst et al. 2019a; Høyer et al. 2018; J. J. Kennedy and E. C. Kent 2019, unpublished manuscript).
-
Understanding the role of lakes: a number of EUSTACE studies explored various aspects of the relationship between lake surface water temperature and air temperature, demonstrating the place of lakes in the global climate system, their response to climate change, and the importance of using spatially resolved data to explore aspects of the response of lakes to climate change (Woolway and Merchant 2017, 2018; Woolway et al. 2017b,c,d, 2018b).
-
Estimating complete fields: EUSTACE used cutting-edge statistical methods to exploit the links between air temperature in different places and through time to estimate daily air temperatures in places and at times when neither direct measurements nor estimates from satellites were available
Hereafter, we will briefly discuss these activities, together with the independent validation of EUSTACE products.
Detecting and correcting for nonclimatic discontinuities in weather station series
Most instrumental temperature series suffer from nonclimatic artifacts (i.e., discontinuities or “breaks,” e.g., due to the relocation of weather stations, changes in the instrument shelter, changes in observation practices) which often result in sudden changes in the time series (e.g., Peterson et al. 1998; Brandsma and Können 2006). Changes like this are not often adequately documented, so we need to use an automated method to detect them that we can apply to a global dataset. Correcting for these changes is termed “homogenization.” Until recently, homogenization efforts have mostly addressed the monthly or annual time scales and have only adjusted shifts in the mean value. This is not sufficient when dealing with daily data as inhomogeneities can affect not just the mean, but the entire distribution of variables (Trewin 2013). The effects of, for example, shelter changes on temperature depend nonlinearly on the ambient weather conditions such as sunshine and wind.
Homogenization of daily and subdaily data has received more attention in recent years (e.g., Aguilar et al. 2008), but efforts are still rare compared to work on monthly data (Venema et al. 2012). Existing methods correcting daily or subdaily temperature data can be grouped into three basic categories:
-
Corrections of the mean: Methods that start from monthly mean break sizes (i.e., sizes of nonclimatic discontinuities), which are then distributed to individual days. Daily corrections are computed by fitting a spline or piecewise linear function between monthly mean corrections (e.g., Vincent et al. 2002). This is the easiest approach, but comes with a risk that the tails of the distribution would not be properly corrected.
-
Corrections of higher-order moments of the distribution: Methods that directly adjust the distribution of daily temperature based on a daily reference series (e.g., Trewin 2013). This is better suited for extremes, but it requires longer and better correlated reference series than method 1.
-
Methods that incorporate basic physics such as the effects of radiation and ventilation on the temperature shield (e.g., Auchmann and Brönnimann 2012). This requires detailed metadata that are not usually available for large datasets.
Until quite recently, no global dataset of homogenized daily land surface air temperature was available. Corresponding homogenization work was restricted to a few regions such as Canada (Vincent et al. 2002), the Mediterranean region (e.g., Brunet et al. 2006; Kuglitsch et al. 2009), Australia (Trewin 2013), and China (Xu et al. 2013).
Most break-detection methods require highly correlated reference series. However, a nonclimatic network-wide break point (e.g., the simultaneous introduction of new instruments) can be difficult to detect if reference series are from the same network. For global studies, only unhomogenized daily temperature data have been available through GHCN-Daily and other sources, which are not suitable in all locations for analyzing trends in extremes, for example. Berkeley Earth has produced an experimental gridded daily temperature product over land [see a description of their method in Rohde et al. (2013a,b)], but their homogenized daily station series are not available and the analysis was constructed without directly homogenizing daily data. Rather, Rohde et al. (2013a,b) constructed fields of daily anomalies (from their monthly mean values) and added them to the existing homogenized monthly dataset.
EUSTACE has combined multiple break-detection algorithms [those of Caussinus and Mestre (2004), Toreti et al. (2012), and Wang (2008)]. We applied them either to annual and semiannual averages of differences between each station and neighboring reference series (our relative tests; all methods used), or to the averages of the target station alone [our absolute test; Wang (2008) only used], in the absence of neighboring stations or if available reference series are not suitable [Brugnara et al. (2019) provides details]. Using multiple methods of detecting discontinuities provides an ability to assess the robustness of the results. Figure 2 illustrates the coverage of the EUSTACE station dataset and indicates the type of break detection method applied to each station (relative or absolute) and also where application of the break detection methods has not been possible because of insufficient record length (i.e., less than 10 years). A simple likelihood index is formed from a 50-member break detection ensemble and users of the EUSTACE global station dataset can select a likelihood threshold appropriate to their needs, such that the detection power is maximized while minimizing the false alarm rate. This is the first global daily station dataset with estimated locations of nonclimatic discontinuities and their likelihood, together with valuable metadata, e.g., on resolution of measurements.
In addition to break detection, the EUSTACE global station dataset has undergone other quality checks both on the air temperature measurements themselves and on reported station altitudes (Brugnara et al. 2019). Appendix C provides a link to the resulting dataset of daily mean, maximum, and minimum temperature.
For European weather station series, EUSTACE has made adjustments, where possible, to reduce the impact of nonclimatic discontinuities. Briefly, we used an iterated quantile-matching approach (an example of method type 2 above) to adjust the distributions of the measurements, not just their means, by comparing to the measurement distributions at nearby reference stations [Squintu et al. (2019a,b) give details]. The homogenization brings the distributions before and after each station change much closer together, adjusting for the nonclimatic effects of such discontinuities.
Applying the quantile matching to the whole European station dataset has an impact on the apparent trends in temperature over Europe (see Squintu et al. 2019a). Sometimes, the EUSTACE corrections increase the trend and sometimes they decrease it. Where stations previously showed negative trends since 1951, they show positive trends in most cases after homogenization; in all cases making them more consistent with their neighboring stations.
This is the first time that a pan-European station dataset of daily data has been homogenized to reduce the impact of nonclimatic discontinuities. The homogenized European station dataset is provided separately from the global station dataset and comprises part of the European Climate Assessment and Dataset (ECA&D) product. A gridded 100-member ensemble dataset available either on a 0.1° latitude × 0.1° longitude grid or a 0.25° latitude × 0.25° longitude grid, based on the homogenized station records has also been developed as a contribution to the next version of the E-OBS dataset (Cornes et al. 2018). A two-step method (documented in Cornes et al. 2018) was used to create the ensemble: (i) the daily values were fitted with a generalized additive model, to capture large-scale spatial trends and (ii) the residuals from this were then interpolated using stochastic Gaussian random field simulation. Appendix C provides a link to the Centre for Environmental Data Analysis (CEDA) catalogue record for these datasets of daily mean, maximum and minimum temperature.
Estimating consistent skin temperature uncertainties
EUSTACE uses surface temperature retrievals over land, ocean, and ice based on information gathered by infrared satellite sensors. One of our key aims is to estimate the uncertainty in our air temperature products, so first we addressed the inconsistency in the availability of uncertainty estimates for skin temperature retrievals over different surfaces. Here skin temperature is the temperature at a few microns below the topmost surface of the land, ocean, or ice.
Uncertainty in surface skin temperature retrieved from satellites arises from various sources (Merchant et al. 2015):
-
Radiometric noise in the measurements made by the satellite sensor. This is usually the simplest component of uncertainty, and a standard “uncertainty propagation” can be applied to derive the surface skin temperature uncertainty associated with any surface skin temperature retrieval, given information about the radiometric noise. There is usually no or negligible correlation of error from this source between different surface skin temperature retrievals.
-
Limitations of the retrieval process would introduce uncertainty into the surface skin temperature even if the actual radiometric measurements made had zero error. For physically derived retrievals, this component can be isolated and estimated if representative simulations of the retrieval process are available; this is not the case where purely empirical relationships are used. An important aspect of this component of uncertainty is that the errors are likely to be correlated in space and time, and therefore may not “average out” in a simple way when transforming data from finer to coarser spatiotemporal scales.
-
Effects that are more systematic, principally: sensor calibration (which may drift over time) and radiative transfer simulation [including the effects of imperfect instrument characterization and incorrect surface emissivity assumptions, although subpixel emissivity variability over land is usually considered random despite having local, coherent structure. See Ghent et al. (2019) for further discussion of uncertainties arising from misspecification of emissivity].
In addition to the above, error is introduced into surface skin temperature estimates because of imperfect cloud detection (when infrared sensors are used, as in EUSTACE; see Bulgin et al. 2018), unrecognized atmospheric aerosol, sensor anomalies, signal contamination, geolocation error, corrupted data streams, etc. Errors arising from these contributing sources are often far from Gaussian in their distributions, with complex effects on surface skin temperature uncertainty. These uncertainties have not been quantified in EUSTACE.
For all surfaces, EUSTACE estimated uncertainties partitioned according to the correlation structure of the different contributing error sources, following the method developed by Merchant et al. (2014) and expanded in Merchant et al. (2015). Uncertainties are split into those arising from uncorrelated random effects, from effects which are locally correlated (these arise from atmospheric effects and/or from uncertainties in the specification of emissivity) and from effects that are correlated over large space and time scales. The derivation of uncertainties in land surface temperature is documented in Ghent et al. (2019) and in Nielsen-Englyst et al. (2019a) for ice surface temperature. Uncertainties in sea surface temperature are as calculated by Merchant et al. (2014).
Links to EUSTACE products containing these consistently estimated uncertainties are given in appendix C.
Estimating air temperature from satellite skin temperature
Before we can use the satellite data to estimate air temperature, we have to understand the relationship between surface air temperature and surface skin temperature and how it varies throughout the day, by surface type, and through the seasons. The challenges are different in each domain, so EUSTACE explored the relationship separately over land, ocean, and ice. Based on our understanding of the factors influencing the relationship in each case, we developed multiple linear regression relationships. As well as in situ measurements and satellite skin temperature estimates, these use extra information to help to categorize the way the skin/air temperature relationship behaves, such as vegetation, latitude, and snow cover. Inclusion of altitude was found to provide no additional skill due to a lack of high-altitude weather stations, although it does affect the relationship. Wind speed has a clear influence on the relationship (Good 2016), but use of wind speed information (from a dynamical reanalysis) in the regression provided no additional skill. The changing vegetation fraction information used also acts as a proxy for some other relevant surface effects, such as urbanization, but there was no explicit attempt here to model the impact of urbanization. The uncertainty arising from excluded effects is also not dealt with explicitly in the error model. We withheld a predefined set of in situ measurements from the regression to use in validation of the results. We then used the regression relationships to estimate air temperature when and wherever a satellite skin temperature retrieval is available, i.e., in clear-sky conditions over the period of record.
The relationship between skin and air temperature is not straightforward; Good (2016) explores this over land. Simultaneously measured air and skin temperature vary relative to each other over the course of a day. Depending on conditions, the skin temperature can become much warmer than the air temperature when the sky is clear, but when cloud is present, the skin temperature quickly decreases to a value close to the air temperature. The daily maxima and minima in the skin and air temperatures usually occur at different times of day and the amplitudes of their diurnal cycles are often quite different. These differences also vary with season and with location. Nielsen-Englyst et al. (2019b) found a very different relationship over ice-covered surfaces in Greenland with the closest coupling between skin and air temperature occurring at noon in the summer under clear skies, when the sun warms the surface. At other times, particularly in darkness, the surface is often colder than the air above it through radiative cooling and the formation of a surface inversion layer. Under overcast skies, the surface can become warmer than the overlying air during more of the day. Spatial mismatches between satellite retrievals and in situ measurements mean that care needs to be taken on the resolution of satellite data used to develop the relationships. Consequently, we train our regression over land on skin temperature at 0.05° latitude × 0.05° longitude resolution, as the relationship with air temperature has been shown to peak at this resolution (Sohrabinia et al. 2014). Weather stations were preferentially selected for model training if their land cover type matched the dominant land cover type in the surrounding 0.05° latitude × 0.05° longitude area. Retrievals from infrared sensors are only available in clear-sky conditions, so we might expect that to bias our understanding of the relationship. By using in situ measurements from both clear and cloudy conditions, we mitigate the impact of this [see Høyer et al. (2018), Nielsen-Englyst et al. (2019a), and J. J. Kennedy and E. C. Kent (2019, unpublished manuscript) for details on the relationships between skin and air temperature across different surfaces].
Once a regression relationship has been derived, that relationship is used to estimate air temperature where we have skin temperature retrievals. We perform this procedure separately over land, ocean, and ice and build up a global picture of air temperature based on the available satellite measurements (see an example in Fig. 3). Global regression coefficients are used over land. Here, the estimation is most challenging, largely due to a lack of representative station measurements, in high-altitude regions (for both daily minimum and maximum temperatures) and at high latitudes and/or with high snow cover (for daily maximum).
Since we previously estimated our skin temperature retrieval uncertainties arising from components with different correlation structures, when we propagate those through the regression-based air temperature estimation together with the uncertainties inherent in the estimation, we can also derive components of uncertainty in the air temperature estimates arising from random, locally correlated, and systematic effects. This means that the uncertainties in our air temperature estimates are also estimated consistently across the different surfaces and can be propagated appropriately through an application.
EUSTACE air temperature estimates from satellite are provided on a 0.25° latitude × 0.25° longitude grid in separate files for each surface (land, ocean, and ice). Daily mean temperatures are provided over ocean and ice and daily maximum and minimum is provided over land. Appendix C provides access information.
Understanding the role of lakes
EUSTACE has undertaken work using both lake surface water temperature from satellites and from in situ measurements gathered by the project to better understand the relationship between lake surface water temperature and near-surface air temperature.
Lakes can show an amplified response of summer surface water temperature to near-surface air temperature variability over the lake. This amplification of response is variable, but greater for cold lakes (e.g., those situated at high latitude and high elevation) and for deep lakes (Woolway and Merchant 2017). Over-lake atmospheric boundary layer stability is found to be more frequently unstable, with over-lake air temperature lower than lake surface water temperature, at lower latitudes (Woolway et al. 2017b). In summer, the frequency of unstable conditions decreases with increasing lake area, as a result of an increase in wind speed with lake size, affecting heat and carbon fluxes between the atmosphere and the lake. A study of central European lakes shows variable warming rates across the year, but these lakes have warmed most in spring with significant trends seen over the last few decades (Woolway et al. 2017c). Abrupt changes seen in these lakes in the 1980s are consistent with abrupt changes in air temperature at the same time. Warming trends seen across 19 large Northern Hemisphere lakes (Woolway and Merchant 2018) vary significantly across lakes as well as between them. Deeper areas of large lakes exhibit longer correlation time scales of lake surface water temperature anomalies and a shorter stratified warming season. Deep areas of large lakes consequently display higher rates of increase of summer lake surface water temperature.
Wind speed has a substantial impact on stratification of lakes, which can have a greater influence than air temperature (Woolway et al. 2017d), and is a controlling factor on lake–air turbulent heat fluxes. Variations in turbulent heat fluxes over lakes have a marked seasonal cycle in some cases, with heat loss higher over large lakes and at low latitudes (Woolway et al. 2018b). The relative contribution of latent and sensible heat fluxes to the total heat flux differs between lakes and with latitude.
The relationship between lake surface water temperature and near-surface air temperature is a two-way interaction. Air temperature influences lake temperature (via its role in turbulent fluxes) and the presence of a lake has an impact on the air temperature in its vicinity; an impact that metaphorically has some “memory” of earlier air temperature anomalies by virtue of the thermal inertia of the lake. The lake influence can be substantial, and in some instances be in excess of 2°C. In some regions, in particular where lakes are abundant (e.g., northern Europe), their influence on the surrounding climate needs to be considered. For EUSTACE, the key question is how the lake modifies the dynamics over time of the daily minimum, maximum, and mean air temperature in its vicinity. EUSTACE has estimated the region of influence of lakes globally, provided in the supplemental material (https://doi.org/10.1175/BAMS-D-19-0095.2) to facilitate the inclusion of this effect in future air temperature analyses.
Estimating more-complete fields
Having used surface skin temperature retrievals over all surfaces of Earth to estimate near-surface air temperature, we have global, but not globally complete, fields covering the last few decades. Gaps remain due to the impact of clouds on the satellite estimates, for example. We also have over a century and a half of spatially incomplete data from ships and weather stations. Night-only ship data were used, to avoid daytime biases, and adjusted to represent air temperature at 2 m following Kent et al. 2013. To try to complete the picture, we needed to use statistical modeling to capture information on how temperature covaries between locations. This information is contained in both the satellite estimates from the recent past and the weather station and ship measurements (Woodruff et al. 2011). The statistical modeling helps us understand unobserved regions on any given day.
The state-of-the-art in the spatial statistics research community was previously far ahead of the methods that had been introduced to the Earth sciences, both in terms of generality and computational efficiency. In particular, methods capable of propagating uncertainty from multiple input data sources and realistic modeling of uncertainty due to spatial variability had seen only very limited use in the Earth sciences.
Current methods for spatial interpolation in Earth sciences that also include statistical uncertainty estimates fall mainly into two categories: low-dimensional function representations (e.g., Banerjee et al. 2008; Wikle 2010) and local covariance-based kriging methods (e.g., Furrer et al. 2006). Given a realistic computational effort, none of these approaches provide full quantification of uncertainties on long and short spatial and temporal scales simultaneously; low-dimensional basis methods cannot capture small-scale variability and dealing with statistical nonstationarity is challenging for covariance-based methods. New techniques for statistical spatiotemporal models have been developed recently by combining numerical methods for stochastic partial differential equations (SPDEs) with efficient Bayesian computations for Markov random fields. When combined with methods for fast computations for hierarchical statistical models (e.g., Rue et al. 2013) they can handle multiple scales as well as nonstationarity (Lindgren et al. 2011; Bolin and Lindgren 2011), for a cost similar to that of low-dimensional models. Previously, these methods have successfully been used in ecology, epidemiology, and geology, but not until now for datasets of the size and resolution of global historical daily temperature datasets. EUSTACE development has made extensive use of these methods to create a global daily mean air temperature analysis on a 0.25° latitude × 0.25° longitude grid.
We model daily mean air temperature measurements, first, as an average of each day’s maximum and minimum temperature and, second, as a combination of the true temperature plus bias terms (including accounting for locally correlated biases in the air temperature estimates from satellite) and other errors affecting each measurement type. We then assume that the true daily mean air temperature can be modeled as a linear combination of three different components: a moving long-term average climatology; a large-scale component representing interannual variability and a daily, weather-related component. Each component is modeled as a linear combination of Gaussian variables and is solved conditioned on the other components, starting with the climatology. The solution is improved iteratively starting with the climatology, followed by the large-scale and then the local component, moving from the broadest and slowest scales, to the shortest and fastest. The process is then repeated. The estimation of the climatology component benefits directly from the inclusion of satellite-derived data. The time variation of the large-scale component is informed largely by the long-term in situ measurements from ships and weather stations. The correlations captured by the local component benefit from both the satellite-derived and in situ data. Different types of errors in the input measurements are associated with the individual component to which they are most relevant. For example, station biases arising from nonclimatic discontinuities are associated with and estimated as part of the large-scale component, because breaks in the station series are identified at an annual resolution. To make the computation tractable, we use a combination of local linear basis functions. These basis functions combine to describe variation in space (for the daily component) and, in some cases, also in time (for the large-scale component). The basis functions are defined on a nested triangular mesh that also helps to speed up the computation. This Bayesian method allows us to represent uncertainty in the process by drawing samples from the posterior distributions of the model components. Figure 4 illustrates the additional information this generates and the uncertainty in different components of the process for 1 January 2006.
We generate 10 samples of possible representations of mean near-surface air temperature for each day from 1 January 1850. The usefulness of the complete field is determined strongly by the availability of measurements to constrain the analysis. Therefore, where we have estimated values which add no additional information (as defined by climatology or large-scale uncertainties greater than a threshold), we mask these out of the analysis (white areas in top-right panel of Fig. 4). In addition, in a few limited areas the statistical model produced extreme climatological values; these were also masked. Consequently, the analysis is not globally complete.
The purpose of EUSTACE is to provide information on daily near-surface air temperature to enable assessments of vulnerability to its daily variations, rather than for monitoring of large-scale changes on longer time scales. Nonetheless, it is important to know how the global analysis compares to datasets developed for large-scale monitoring. The top panels of Fig. 5 show regional annual average near-surface air temperature anomaly in the EUSTACE global analysis v1.0 since 1850 for Europe and North America, together with the same quantity in a blend of Climatic Research Unit Temperature version 4 (CRUTEM4; Jones et al. 2012) and Hadley Night Marine Air Temperature version 2 (HadNMAT2; Kent et al. 2013); NOAAGlobalTemp (Smith et al. 2008; Vose et al. 2012); GISTEMP (Hansen et al. 2010); and Berkley Earth (Rohde et al. 2013a,b). From 1895 onward, the datasets agree closely. Prior to 1895, there are very few daily station measurements in the EUSTACE global station dataset, so the EUSTACE analysis v1.0 relies on night marine air temperature to infer values over Europe. This causes a discrepancy in the EUSTACE analysis when compared to the global surface temperature–monitoring datasets, which are themselves instead based on monthly weather station values. Monthly average data are more plentiful for the late nineteenth century, having been digitized separately from daily values. Over North America, the agreement is good back to 1870.
More pertinent to the aims of EUSTACE is the ability of the global analysis v1.0 to represent the evolution of daily near-surface air temperature at a particular location. Having withheld a large number of station records from the development of the analysis, we can examine how the analysis compares to these records over the course of example years. The bottom panels of Fig. 5 show this for Cimbaj, Uzbekistan, in 1975 and for Fort Nelson, British Columbia, Canada, in 2003. The station records for these locations were not included in the analysis, so they provide an independent comparator. The uncertainty in the analysis is larger for Cimbaj than for Fort Nelson (shown by the envelope around the EUSTACE analysis v1.0 time series). Nonetheless, in both locations, the analysis compares well on a day-to-day basis with the record of daily mean near-surface air temperature from GHCN-D v3.26. In particular, we see that the gaps in the Fort Nelson record for 2003 are completed by the EUSTACE analysis method, which uses information from other weather station records and air temperature estimated from satellite to infer the missing values.
The EUSTACE prototype global daily air temperature ensemble is openly available via the CEDA archive (see appendix C).
Validation
The EUSTACE daily air temperature estimates (both the air temperatures estimated from satellite and the global analysis) were matched with withheld validation measurements from land stations, ice stations, moored buoys, ships, and ice buoys. These data were excluded from both the derivation of regression relationships between skin temperature retrievals from satellite and air temperature and from the production of the global daily analysis fields. Veal (2019a) presents the full evaluation, but Fig. 6 summarizes the results for the EUSTACE global analysis.
Over ocean, the EUSTACE global analysis v1.0 performs well over the period 1850–2015, with a global median discrepancy [robust standard deviation (RSD)] of +0.00 K (1.76 K) against withheld ship measurements (Woodruff et al. 2011) adjusted to a height of 2 m. The highest discrepancies (analysis minus validation data) are found in the Southern Ocean, although matchups are sparse here. The global analysis also performs well in most land regions with a global median discrepancy (RSD) against weather station measurements of −0.23 K (1.76 K). Seasonal median discrepancies over central Asia, however, are high, 6–10 K in winter at some stations, these most erroneous data have been masked out of the final product. Over permanent ice domains, the global analysis performs less well, especially over sea ice: regional median discrepancies (RSDs) against ice buoy data are +1.19 K (4.60 K) in the Arctic and +4.76 K (6.81 K) in the Antarctic; note that these latter two statistics are affected by the sparsity of in situ measurements against which to compare the EUSTACE analysis in these regions, but are dominated by a drift over the poles in the analysis, which has largely been masked out of the final product. The regional median discrepancies (RSDs) over land ice, including the Antarctic ice shelf, against weather station data are lower: +0.37 K (4.04 K) in the Arctic and +0.47 K (2.68 K) in the Antarctic.
In addition, estimates of uncertainty are also evaluated using the withheld data. The uncertainty estimates are assessed by first binning the matchup discrepancies by the value of the uncertainty on the EUSTACE temperature estimate. Matchup statistics (median and RSD of the matchup discrepancies) are calculated for each bin. The matchup discrepancy has contributions from the uncertainty in the in situ reference data as well as the uncertainty on the EUSTACE temperature estimate. There is also a contribution from matching two different spatial scales, i.e., a point in situ value with the EUSTACE 0.25° gridbox estimate. The expected match up variance can be modeled as the sum of the squares of these contributions. The actual and modeled matchup discrepancy variances are plotted in Fig. 7. Assuming our estimates of the uncertainty in the reference data and the matchup process are good then, if the EUSTACE uncertainty estimates are also good, for each bin the matchup RSD (blue bar) should match the modeled value (dashed line). If the blue bars are higher than the dashed line then the matchup discrepancy RSD exceeds the modeled value, indicating that the EUSTACE uncertainty estimate is too low. The uncertainty estimates for the EUSTACE global analysis v1.0 show little agreement with expectation over ocean (overestimated and showing little variation with actual discrepancy), but good agreement over land. Since the EUSTACE analysis validates extremely well in comparison to withheld data over the ocean, this mitigates the impact of the less effective uncertainty estimates here. Analysis uncertainties are underestimated over ice regions, particularly in the Northern Hemisphere and over Southern Hemisphere land ice; here, this arises from assumptions in the analysis method about the correlation structure of errors in the oversampled air temperature estimates from satellite.
The EUSTACE matchup database is available for noncommercial use (see appendix C for details).
Priorities for future work
EUSTACE relies on good retrievals of surface skin temperature from infrared satellite instruments. Adequate removal of values contaminated by cloud between the surface and the sensor is crucial for accurate skin temperature retrieval, but also for correct estimation of uncertainties and for accurate estimation of air temperature from skin temperature. The skin temperature datasets currently used in EUSTACE are sporadically contaminated by uncleared clouds. Use of improved satellite retrievals will improve the EUSTACE products.
As a proof of concept, EUSTACE has demonstrated that inclusion of air temperatures estimated from satellite enables the more stable estimation of the climatological component of the global analysis (where biases in air temperature estimates from satellite are not large or there are sufficient in situ measurements to inform their correction), as compared to use of in situ measurements alone. Use of longer satellite datasets would improve the amount of information available to the analysis and improve results further. Since the inputs to the EUSTACE analysis were fixed, more satellite data have become available [i.e., version 2 of the Arctic and Antarctic Ice Surface Temperatures from thermal infrared satellite sensors (AASTI) dataset over ice, GlobTemperature land surface skin temperature from a further Moderate Resolution Imaging Spectroradiometer sensor, and stable sea surface temperatures from the Advanced Very High Resolution Radiometer series in the ESA SST Climate Change Initiative (CCI) v2.1 dataset].
With more satellite skin temperature information would come the possibility of developing and applying regionally varying regression relationships over land. EUSTACE air temperature estimates from satellite over land currently employ a global relationship determined by latitude, snow cover, and fractional vegetation cover; this results in some (sometimes large) regionally varying biases in the resultant air temperature estimates, which are reduced in the global analysis through the additional statistical modeling undertaken there and the inclusion of measurements made in situ.
Interactions with users have demonstrated that information on daily maximum and minimum temperatures are needed in addition to the daily mean. Although EUSTACE undertook modeling work to enable the production of a global analysis of maximum and minimum temperatures via the mean and the diurnal temperature range, it proved impossible to pull it through into production within the timeframe of the project. Methods developed demonstrate promise and have applicability beyond surface temperature diurnal temperature range to other non-Gaussian variables. These prototyped methods would also enable full propagation of components of uncertainty with different correlation length scales through to the final analysis; the current EUSTACE global analysis simplifies the assumptions made to enable the calculations, but consequently results in underestimated uncertainties, especially over polar regions where satellite data are plentiful.
Pull-through of the lake influence mask (see the online supplemental material) as a covariate (as distance from coast or altitude are currently specified) in the EUSTACE global analysis has the potential to improve the air temperature fields local to large lakes (with an influence on the scale of the EUSTACE grid box or larger, i.e., 0.25° in latitude and longitude).
The availability of daily measurements made in situ could be increased substantially by continuing the current international data rescue and digitization efforts (see, e.g., Brönnimann et al. 2018) and by making these and other daily measurements openly available. Each new set of digitized data has the potential to improve a global analysis of air temperature by better constraining the statistical modeling, particularly when targeted to regions currently underrepresented in the EUSTACE global station dataset (see Fig. 2) or in undersampled areas of the ocean, such as the Southern Ocean (Brönnimann et al. 2018).
In the course of our work, we have identified the following needs to extend the current observing system: more simultaneous voluntary observing ship measurements of sea surface and near-surface air temperature (because the network is declining and provides the only means of measuring near-surface air temperature over ocean globally) and more weather station measurements of near-surface air temperature in certain surface regimes (e.g., desert, deep forest, ice, high elevation, high latitude) both to better define the relationship between skin and near-surface air temperature there and to provide more data for validation.
Summary and conclusions
The potential for future improvements outlined above notwithstanding, EUSTACE has produced a number of novel outcomes:
-
a global daily station dataset with estimated locations of nonclimatic discontinuities and their likelihood;
-
a pan-European station dataset homogenized to reduce the impact of nonclimatic discontinuities and gridded ensemble analyses for Europe;
-
consistently estimated components of uncertainty in satellite skin temperature retrievals over different surfaces of Earth;
-
air temperature estimates from satellite for each surface (land, ocean, and ice) with propagated uncertainty components;
-
a deeper understanding of the role of lakes in responding to and influencing surrounding surface air temperature;
-
a global, multidecadal daily analysis of surface air temperature incorporating both measurements made in situ and estimated from satellite data; and
-
validation of products using withheld reference data.
These data have been made publicly available, where not restricted by source data licenses, both for direct use and to form the basis of future onward developments (see appendix C for details).
Acknowledgments
The EUSTACE project received funding from the European Union’s Horizon 2020 research and innovation program under Grant Agreement 640171. We thank Professor Doug Nychka for his advice throughout EUSTACE as part of the External Expert Advisory Board and Gary Corlett for validation of interim versions of the EUSTACE air temperature estimates over ocean. We also thank Esther Conway for her work on the data management plan and Roy Mandemakers for provision of a step-by-step guide for using the Climate4Impact portal as part of the EUSTACE user guides. ECK contribution was funded under NERC Grant NE/J020788/1. We thank three anonymous reviewers for their comments, which improved the manuscript.
APPENDIX A
The EUSTACE Team
The EUSTACE consortium included nine organizations: 1) Met Office (United Kingdom), 2) University of Reading (United Kingdom), 3) Science and Technology Facilities Council (United Kingdom), 4) University of Leicester (United Kingdom), 5) Koninklijk Nederlands Meteorologisch Instituut (KNMI) (Netherlands), 6) University of Bern (Switzerland), 7) University of Bath (United Kingdom), 8) Danmarks Meteorologiske Institut (Denmark), and 9) University of Edinburgh (United Kingdom).
An External Expert Advisory Board comprised Prof. Peter Thorne (University of Ireland, Maynooth, Ireland), Dr. Elizabeth Kent (National Oceanography Centre, Southampton, United Kingdom), and Prof. Doug Nychka (National Centers for Atmospheric Research and Colorado School of Mines, Colorado, United States).
APPENDIX B
EUSTACE Input Data
The EUSTACE data products are based on a number of input data sources, summarized in Tables B1–B3.
Satellite data on which EUSTACE products are based and period of data used.
Weather station air temperature measurements on which EUSTACE products are based and period of data used.
Marine in situ measurements on which EUSTACE products are based and period of data used.
APPENDIX C
EUSTACE Products
The EUSTACE data products have been catalogued in the Centre for Environmental Data Analysis (CEDA) archive, with individual download pages pointing to the data. Two products, the European homogenized data and the gridded European dataset, which also form part of the European Climate Assessment and Dataset (ECA&D), are made available separately via ECA&D.
The EUSTACE data products and their availability and licenses are summarized in Table C1.
EUSTACE products and their access and licensing information.
Data are made available on an open license (Open Government Licence, www.nationalarchives.gov.uk/doc/open-government-licence/version/3/) where possible. For the station datasets and the matchup database, this was not possible due to the licensing conditions of the input datasets, which meant they could only be made available for noncommercial use. These have been made available under a noncommercial license (Non-Commercial Government, www.nationalarchives.gov.uk/doc/non-commercial-government-licence/version/2/).
In addition, EUSTACE has produced user requirements reports; product user guides, including detailed guidance on uncertainties and information content in the products; and peer-reviewed journal articles.
Links to all of these can be found on the EUSTACE website (www.eustaceproject.org).
References
Aguilar, E. , M. Brunet , J. Sigró , F. S. Rodrigo , Y. L. Rico , and D. R. Alvarez , 2008: Homogenization of Spanish temperature series on a daily resolution: A step forward towards an analysis of extremes in the Iberian Peninsula. Proc. Seventh European Conf. on Applied Climatology, Amsterdam, Netherlands, European Meteor. Soc., A-00-697.
Auchmann, R. , and S. Brönnimann , 2012: A physics-based correction model for homogenizing sub-daily temperature series. J. Geophys. Res., 117, D17119, https://doi.org/10.1029/2012JD018067.
Banerjee, S. , A. E. Gelfand , A. O. Finley , and H. Sang , 2008: Gaussian predictive process models for large spatial datasets. J. Roy. Stat. Soc., B70, 825–848, https://doi.org/10.1111/j.1467-9868.2008.00663.x.
Bessembinder, J. , 2016: User requirement specification for product design. EUSTACE Deliverable 4.1, 13 pp., www.eustaceproject.org/eustace/static/media/uploads/Deliverables/eustace_d4-1.pdf.
Bessembinder, J. , N. Rayner , D. Ghent , K. Madsen , and J. Mitchelson , 2017: Report on user requirements: results from second round of user consultations. EUSTACE Deliverable 4.9, 58 pp., www.eustaceproject.org/eustace/static/media/uploads/Deliverables/eustace_d4-9.pdf.
Bolin, D. , and F. Lindgren , 2011: Spatial models generated by nested stochastic partial differential equations, with an application to global ozone mapping. Ann. Appl. Stat., 5, 523–550, https://doi.org/10.1214/10-AOAS383.
Brandsma, T. , and G. P. Können , 2006: Application of nearest-neighbor resampling for homogenizing temperature records on a daily to sub-daily level. Int. J. Climatol., 26, 75–89, https://doi.org/10.1002/joc.1236.
Brönnimann, S. , and Coauthors, 2018: Observations for reanalyses. Bull. Amer. Meteor. Soc., 99, 1851–1866, https://doi.org/10.1175/BAMS-D-17-0229.1.
Brugnara, Y. , R. Auchmann , S. Brönnimann , A. Bozzo , D. Cat Berro , and L. Mercalli , 2016: Trends of mean and extreme temperature indices since 1874 at low-elevation sites in the Southern Alps. J. Geophys. Res. Atmos., 121, 3304–3325, https://doi.org/10.1002/2015JD024582.
Brugnara, Y. , E. Good , A. A. Squintu , G. van der Schrier , and S. Brönnimann , 2019: The EUSTACE global land station daily air temperature dataset. Geosci. Data J., 6, 189–204, https://doi.org/10.1002/gdj3.81.
Brunet, M. , and Coauthors, 2006: The development of a new dataset of Spanish Daily Adjusted Temperature Series (SDATS) (1850–2003). Int. J. Climatol., 26, 1777–1802, https://doi.org/10.1002/joc.1338.
Buizza, R. , and Coauthors, 2018: The EU-FP7 ERA-CLIM2 project contribution to advancing science and production of Earth system climate reanalyses. Bull. Amer. Meteor. Soc., 99, 1003–1014, https://doi.org/10.1175/BAMS-D-17-0199.1.
Bulgin, C. , C. Merchant , D. Ghent , L. Klüser , T. Popp , C. Poulsen , and L. Sogacheva , 2018: Quantifying uncertainty in satellite-retrieved land surface temperature from cloud detection errors. Remote Sens ., 10, 616, https://doi.org/10.3390/rs10040616.
Caesar, J. , L. Alexander , and R. Vose , 2006: Large-scale changes in observed daily maximum and minimum temperatures: Creation and analysis of a new gridded data set. J. Geophys. Res., 111, D05101, https://doi.org/10.1029/2005JD006280.
Carrea, L. , and Coauthors, 2019: Lake surface temperature [in “State of the Climate 2018”]. Bull. Amer. Meteor. Soc., 100 (9), S13–S14, https://doi.org/10.1175/2019BAMSStateoftheClimate.1.
Caussinus, H. , and O. Mestre , 2004: Detection and correction of artificial shifts in climate series. J. Roy. Stat. Soc., 53, 405–425, https://doi.org/10.1111/j.1467-9876.2004.05155.x.
Cornes, R. C. , G. van der Schrier , E. J. M. Besselaar , and P. D. Jones , 2018: An ensemble version of the E-OBS temperature and precipitation data sets. J. Geophys. Res. Atmos., 123, 9391–9409, https://doi.org/10.1029/2017JD028200.
Cristóbal, J. , M. Ninyerola , and X. Pons , 2008: Modeling air temperature through a combination of remote sensing and GIS data. J. Geophys. Res., 113, D13106, https://doi.org/10.1029/2007JD009318.
Dunn, R. J. H. , K. M. Willett , D. E. Parker , and L. Mitchell , 2016: Expanding HadISD: Quality-controlled, sub-daily station data from 1931. Geosci. Instrum. Methods Data Syst ., 5, 473–491, https://doi.org/10.5194/gi-5-473-2016.
EUMETSAT, 2014: CORE-CLIMAX system maturity matrix instruction manual. EUMETSAT Doc. CC/EUM/MAN/13/002 v4, 34 pp., www.eumetsat.int/website/wcm/idc/idcplg?IdcService=GET_FILE&dDocName=PDF_CORE_CLIMAX_MANUAL&RevisionSelectionMethod=LatestReleased&Rendition=Web.
Furrer, R. , M. G. Genton , and D. Nychka , 2006: Covariance tapering for interpolation of large spatial datasets. J. Comput. Graph. Stat., 15, 502–523, https://doi.org/10.1198/106186006X132178.
Ghent, D. , K. Veal , T. Trent , E. Dodd , H. Sembhi , and J. Remedios , 2019: A new approach to defining uncertainties for MODIS land surface temperature. Remote Sens ., 11, 1021, https://doi.org/10.3390/rs11091021.
Good, E. J. , 2016: An in situ-based analysis of the relationship between land surface “skin” and screen-level air temperatures. J. Geophys. Res. Atmos., 121, 8801–8819, https://doi.org/10.1002/2016JD025318.
Hansen, J. , R. Ruedy , M. Sato , and K. Lo , 2010: Global surface temperature change. Rev. Geophys., 48, RG4004, https://doi.org/10.1029/2010RG000345.
Harris, I. , P. D. Jones , T. J. Osborn , and D. H. Lister , 2013: Updated high-resolution grids of monthly climatic observations - The CRU TS3.10 dataset. Int. J. Climatol., 34, 623–642, https://doi.org/10.1002/joc.3711.
Hartmann, D. L. , and Coauthors, 2013: Observations: Atmosphere and surface. Climate Change 2013: The Physical Science Basis, Stocker, T. F. et al., Eds., Cambridge University Press, 159–254.
Høyer, J. L. , E. Good , P. Nielsen-Englyst , K. S. Madsen , I. Woolway , and J. Kennedy , 2018: Report on the relationship between satellite surface skin temperature and surface air temperature observations for oceans, land, sea ice and lakes. EUSTACE Deliverable 1.5, 101 pp., www.eustaceproject.org/eustace/static/media/uploads/d1.5_revised.pdf.
Hunziker, S. , and Coauthors, 2017: Identifying, attributing, and overcoming common data quality issues of manned station observations. Int. J. Climatol., 37, 4131–4145, https://doi.org/10.1002/joc.5037.
Jones, G. S. , and J. J. Kennedy , 2017: Sensitivity of attribution of anthropogenic near-surface warming to observational uncertainty. J. Climate, 30, 4677–4691, https://doi.org/10.1175/JCLI-D-16-0628.1.
Jones, P. D. , D. H. Lister , T. J. Osborn , C. Harpham , M. Salmon , and C. P. Morice , 2012: Hemispheric and large-scale land surface air temperature variations: An extensive revision and an update to 2010. J. Geophys. Res., 117, D05127, https://doi.org/10.1029/2011JD017139.
Kent, E. C. , N. A. Rayner , D. I. Berry , M. Saunby , B. I. Moat , J. J. Kennedy , and D. E. Parker , 2013: Global analysis of night marine air temperature and its uncertainty since 1880: The HadNMAT2 data set. J. Geophys. Res. Atmos., 118, 1281–1298, https://doi.org/10.1002/JGRD.50152.
Kilibarda, M. , T. Hengl , G. B. M. Heuvelink , B. Gräler , E. Pebesma , M. Perčec Tadić , and B. Bajat , 2014: Spatio-temporal interpolation of daily temperatures for global land areas at 1 km resolution. J. Geophys. Res. Atmos., 119, 2294–2313, https://doi.org/10.1002/2013JD020803.
Klein Tank, A. M. G. , and Coauthors, 2002: Daily dataset of 20th-century surface air temperature and precipitation series for the European Climate Assessment. Int. J. Climatol., 22, 1441–1453, https://doi.org/10.1002/joc.773.
Kuglitsch, F. G. , A. Toreti , E. Xoplaki , P. M. Della-Marta , J. Luterbacher , and H. Wanner , 2009: Homogenization of daily maximum temperature series in the Mediterranean. J. Geophys. Res., 114, D15108, https://doi.org/10.1029/2008JD011606.
Kushnir, Y. , and Coauthors, 2019: Towards operational predictions of the near-term climate. Nat. Climate Change, 9, 94–101, https://doi.org/10.1038/s41558-018-0359-7.
Lindgren, F. , H. Rue , and J. Lindström , 2011: An explicit link between Gaussian fields and Gaussian Markov random fields: The stochastic partial differential equation approach. J. Roy. Stat. Soc., B73, 423–498, https://doi.org/10.1111/j.1467-9868.2011.00777.x.
Menne, M. J. , I. Durre , R. S. Vose , B. E. Gleason , and T. G. Houston , 2012: An overview of the global historical climatology network-daily database. J. Atmos. Oceanic Technol., 29, 897–910, https://doi.org/10.1175/JTECH-D-11-00103.1.
Merchant, C. J. , and Coauthors, 2013: The surface temperatures of Earth: Steps towards integrated understanding of variability and change. Geosci. Instrum. Methods Data Syst., 2, 305–321, https://doi.org/10.5194/gi-2-305-2013.
Merchant, C. , and Coauthors, 2014: sea surface temperature datasets for climate applications from phase 1 of the European space agency climate change initiative (SST CCI). Geosci. Data J., 1, 179–191, https://doi.org/10.1002/gdj3.20.
Merchant, C. , D. Ghent , J. Kennedy , E. Good , and J. Høyer , 2015: Common approach to providing uncertainty estimates across all surfaces. EUSTACE Deliverable 1.2, 20 pp., www.eustaceproject.org/eustace/static/media/uploads/Deliverables/eustace_d1-2.pdf.
Morice, C. P. , J. J. Kennedy , N. A. Rayner , and P. D. Jones , 2012: Quantifying uncertainties in global and regional temperature change using an ensemble of observational estimates: The HadCRUT4 dataset. J. Geophys. Res., 117, D08101, https://doi.org/10.1029/2011JD017187.
Nielsen-Englyst, P. , J. L. Høyer , K. S. Madsen , R. Tonboe , and G. Dybkjær , 2019a: Deriving Arctic 2-m air temperatures from satellite. Cryosphere Discuss ., https://doi.org/10.5194/tc-2019-126.
Nielsen-Englyst, P. , J. L. Høyer , K. S. Madsen , R. Tonboe , G. Dybkjær , and E. Alerskans , 2019b: In situ observed relationships between snow and ice surface skin temperatures and 2 m air temperatures in the Arctic. Cryosphere, 13, 1005–1024, https://doi.org/10.5194/tc-13-1005-2019.
Osborn, T. J. , P. D. Jones , and M. Joshi , 2017: Recent United Kingdom and global temperature variations. Weather, 72, 323–329, https://doi.org/10.1002/wea.3174.
Peterson, T. C. , and Coauthors, 1998: Homogeneity adjustments of in situ atmospheric climate data: A review. Int. J. Climatol., 18, 1493–1517, https://doi.org/10.1002/(SICI)1097-0088(19981115)18:13<1493::AID-JOC329>3.0.CO;2-T.
Rayner, N. A. , 2019: Final verified products delivered to CEMS for verification. EUSTACE Deliverable 2.6, 6 pp., www.eustaceproject.org/eustace/static/media/uploads/d_2.6_final.pdf.
Rennie, J. J. , and Coauthors, 2014: The international surface temperature initiative global land surface databank: Monthly temperature data release description and methods. Geosci. Data J., 1, 75–102, https://doi.org/10.1002/gdj3.8.
Rohde, R. , and Coauthors, 2013a: A new estimate of the average Earth surface land temperature spanning 1753 to 2011. Geoinfor. Geostat: Overview, 1 (1), http://doi.org/10.4172/2327-4581.1000101.
Rohde, R. , and Coauthors, 2013b: Berkeley Earth temperature averaging process. Geoinfor. Geostat: Overview, 1, 2, https://doi.org/10.4172/GIGS.1000103.
Rue, H. , S. Martino , F. Lindgren , D. Simpson , and A. Riebler , 2013: R-INLA: Approximate Bayesian inference using integrated nested Laplace approximations. www.r-inla.org/.
Sánchez-Lugo, A. , P. Berrisford , C. Morice , and J. P. Nicolas , 2019: Global surface temperature [in “State of the Climate 2018”]. Bull. Amer. Meteor. Soc., 100 (9), S11–S12., https://doi.org/10.1175/2019BAMSStateoftheClimate.1.
Smith, T. M. , R. W. Reynolds , T. C. Peterson , and J. Lawrimore , 2008: Improvements to NOAA’s historical merged land–ocean surface temperatures analysis (1880–2006). J. Climate, 21, 2283–2296, https://doi.org/10.1175/2007JCLI2100.1.
Sohrabinia, M. , P. Zawar-Reza , and W. Rack , 2014: Spatio-temporal analysis of the relationship between LST from MODIS and air temperature in New Zealand. Theor. Appl. Climatol., 119, 567–583, https://doi.org/10.1007/s00704-014-1106-2.
Squintu, A. A. , G. van der Schrier , Y. Brugnara , and A. Klein Tank , 2019a: Homogenization of daily temperature series in the European climate assessment & dataset. Int. J. Climatol., 39, 1243–1261, https://doi.org/10.1002/joc.5874.
Squintu, A. A. , G. van der Schrier , E. J. M. van den Besselaar , R. C. Cornes , A. Klein Tank , 2019b: Building long homogeneous temperature series across Europe: A new approach for the blending of neighboring series. J. Appl. Meteor. Climatol., 59, 175–189, https://doi.org/10.1175/JAMC-D-19-0033.1.
Stickler, A. , and Coauthors, 2014: ERA-CLIM: Historical surface and upper-air data for future reanalyses. Bull. Amer. Meteor. Soc., 95, 1419–1430, https://doi.org/10.1175/BAMS-D-13-00147.1.
Toreti, A. , F. G. Kuglitsch , E. Xoplaki , and J. Luterbacher , 2012: A novel approach for the detection of inhomogeneities affecting climate time series. J. Appl. Meteor. Climatol., 51, 317–326, https://doi.org/10.1175/JAMC-D-10-05033.1.
Trewin, B. , 2013: A daily homogenized temperature data set for Australia. Int. J. Climatol., 33, 1510–1529, https://doi.org/10.1002/joc.3530.
Veal, K. , 2019a: Validation report for the final in-filled EUSTACE surface air temperature product. EUSTACE Deliverable 3.5, 41 pp., https://www.eustaceproject.org/eustace/static/media/uploads/d3-5_final.pdf.
Veal, K. , 2019b: Intercomparison report for the final in-filled EUSTACE surface air temperature product. EUSTACE Deliverable 3.4, 39 pp., https://www.eustaceproject.org/eustace/static/media/uploads/d3-4_final.pdf.
Venema, V. , 2012: Detecting and repairing inhomogeneities in datasets: Assessing current capabilities. Bull. Amer. Meteor. Soc., 93, 951–954, https://doi.org/10.1175/1520-0477-93.7.947.
Vincent, L. , X. Zhang , B. R. Bonsall , and W. D. Hogg , 2002: Homogenization of daily temperatures over Canada. J. Climate, 15, 1322–1334, https://doi.org/10.1175/1520-0442(2002)015<1322:HODTOC>2.0.CO;2.
Vose, R. S. , and Coauthors, 2012: NOAA’s Merged Land–Ocean Surface Temperature analysis. Bull. Amer. Meteor. Soc., 93, 1677–1685, https://doi.org/10.1175/BAMS-D-11-00241.1.
Walters, D. , and Coauthors, 2019: The Met Office Unified Model Global Atmosphere 7.0/7.1 and JULES Global Land 7.0 configurations. Geosci. Model Dev., 12, 1909–1963, https://doi.org/10.5194/gmd-12-1909-2019.
Wang, X. L. , 2008: Accounting for autocorrelation in detecting mean shifts in climate data series using the penalized maximal t or f test. J. Appl. Meteor. Climatol., 47, 2423–2444, https://doi.org/10.1175/2008JAMC1741.1.
Weedon, G. P. , and Coauthors, 2011: Creation of the WATCH forcing data and its use to assess global and regional reference crop evaporation over land during the twentieth century. J. Hydrometeor., 12, 823–848, https://doi.org/10.1175/2011JHM1369.1.
Wikle, C. K. , 2010: Low-rank representations for spatial processes. Handbook of Spatial Statistics, Gelfand, A. E. et al., Eds., Chapman and Hall/CRC Press, 107–118.
Woodruff, S. D. , and Coauthors, 2011: ICOADS release 2.5: Extensions and enhancements to the surface marine meteorological archive. Int. J. Climatol., 31, 951–967, https://doi.org/10.1002/joc.2103.
Woolway, R. I. , and C. Merchant , 2017: Amplified surface temperature response of cold, deep lakes to inter-annual air temperature variability. Sci. Rep., 7, 4130, https://doi.org/10.1038/S41598-017-04058-0.
Woolway, R. I. , and C. Merchant , 2018: Intra-lake heterogeneity of thermal responses to climate change: A study of large Northern Hemisphere lakes. J. Geophys. Res. Atmos., 123, 3087–3098, https://doi.org/10.1002/2017JD027661.
Woolway, R. I. , and Coauthors, 2016: Lake surface temperatures [in “State of the Climate in 2015”]. Bull. Amer. Meteor. Soc., 97 (8), S17–S18, https://doi.org/10.1175/2016BAMSStateoftheClimate.1.
Woolway, R. I. , and Coauthors, 2017a: Lake surface temperature [in “State of the Climate in 2016”]. Bull. Amer. Meteor. Soc., 98 (8), S13–S14, https://doi.org/10.1175/2017BAMSStateoftheClimate.1.
Woolway, R. I. , and Coauthors, 2017b: Latitude and lake size are important predictors of over-lake atmospheric stability. Geophys. Res. Lett., 44, 8875–8883, https://doi.org/10.1002/2017GL073941.
Woolway, R. I. , M. T. Dokulil , W. Marszelewski , M. Schmid , D. Bouffard , and C. J. Merchant , 2017c: Warming of Central European lakes and their response to the 1980s climate regime shift. Climatic Change, 142, 505–520, https://doi.org/10.1007/s10584-017-1966-4.
Woolway, R. I. , P. Meinson , P. Nõges , I. D. Jones , and A. Laas , 2017d: Atmospheric stilling leads to prolonged thermal stratification in a large shallow polymictic lake. Climatic Change, 141, 759–773, https://doi.org/10.1007/s10584-017-1909-0.
Woolway, R. I. , and Coauthors, 2018a: Lake surface temperature [in “State of the Climate in 2017”]. Bull. Amer. Meteor. Soc., 99 (8), S13–S15, https://doi.org/10.1175/2018BAMSStateoftheClimate.1.
Woolway, R. I. , and Coauthors, 2018b: Geographic and temporal variation in turbulent heat loss from lakes: A global analysis across 45 lakes. Limnol. Oceanogr., 63, 2436–2449, https://doi.org/10.1002/lno.10950.
Xu, W. , Q. Li , X. L. Wang , S. Yang , L. Cao , and Y. Feng , 2013: Homogenization of Chinese daily surface air temperatures and analysis of trends in the extreme temperature indices. J. Geophys. Res. Atmos., 118, 9708–9720, https://doi.org/10.1002/JGRD.50791.
Xu, X. , and Coauthors, 2019: The effects of temperature on human mortality in a Chinese city: Burden of disease calculation, attributable risk exploration, and vulnerability identification. Int. J. Biometeor., 63, 1319–1329, https://doi.org/10.1007/s00484-019-01746-6.