Observed Temperature Changes in the Troposphere and Stratosphere from 1979 to 2018

Temperature observations of the upper-air atmosphere are now available for more than 40 years from both ground-and satellite-based observing systems. Recent years have seen substantial improvements in reducing long-standing discrepancies among datasets through major reprocessing efforts. The advent of radio occultation (RO) observations in 2001 has led to further improvements in vertically resolved temperature measurements, enabling a detailed analysis of upper-troposphere/lower-stratosphere trends. This paper presents the current state of atmospheric temperature trends from the latest available observational records. We analyze observations from merged operational satellite measurements, radiosondes, lidars, and RO, spanning a vertical range fromthelowertropospheretotheupperstratosphere.Thefocusisonassessingclimatetrendsandonidentifying thedegreeofconsistencyamongtheobservationalsystems.Theresultsshowarobustcoolingofthestratosphere of about 1–3K, and a robust warming of the troposphere of about 0.6–0.8K over the last four decades (1979– 2018). Consistent results are found between the satellite-based layer-average temperatures and vertically resolved radiosonde records. The overall latitude–altitude trend patterns are consistent between RO and ra-diosonde records. Signiﬁcant warming of the troposphere is evident in the RO measurements available after 2001, with trends of 0.25–0.35K per decade. Ampliﬁed warming in the tropical upper-troposphere compared to surface trends for 2002–18 is found based on RO and radiosonde records, in approximate agreement with moist adiabatic lapse rate theory. The consistency of trend results from the latest upper-air datasets will help to improve understanding of climate changes and their drivers.


Introduction
Earth's atmosphere is an essential component of the climate system.Improving knowledge of the natural variability and trends in atmospheric temperature is of vital importance for a better understanding of climate change and its causes.Therefore, consistent long-term observational records of essential climate variables (ECVs), such as upper-air temperature, are required for the detection and attribution of climate change and for verifying climate model simulations.
This topic is a research focus of the international climate science community acting through the World Climate Research Programme (WCRP) and of relevance for the Intergovernmental Panel on Climate Change (IPCC).Sustaining global climate data is the dedicated goal of the World Meteorological Organization (WMO) via the implementation of the Global Climate Observing System (GCOS) based on principles for climate monitoring systems and for the generation of climate data records (CDRs) (GCOS 2011(GCOS , 2016)).
In this context, the activity on Atmospheric Temperature Changes and Their Drivers (ATC) is a long-standing activity within the WCRP/Stratospheric-Tropospheric Processes and Their Role in Climate (SPARC) program.The activity has made substantial contributions to assessments of stratospheric temperature trends, based on analyses of observations and model simulations, with regular contributions to the WMO/United Nations Environment Programme (UNEP) Scientific Assessments of Ozone Depletion (Ramaswamy et al. 2001;Shine et al. 2003;Randel et al. 2009;Thompson et al. 2012;Seidel et al. 2016;Maycock et al. 2018).
Observations from meteorological satellites have become an important source of upper-air data for more than 40 years, while radiosonde measurements from weather balloons are available since the 1950s and earlier.Evaluating long-term temperature changes from these data is challenging, as the instruments were primarily intended for weather observation.Climate monitoring requires higher accuracy (e.g., Karl et al. 2006;Trenberth et al. 2013).Uncertainties due to such factors as instrument changes over time, intersatellite offsets, changes in diurnal sampling, and other modifications in the observational network require homogenization and intercalibration procedures for the construction of CDRs.
Substantial efforts have been put into the reconciliation of atmospheric temperature trends from different observational platforms (e.g., Karl et al. 2006;Randel et al. 2009).Homogenized radiosonde data (e.g., Titchner et al. 2009;Haimberger et al. 2008Haimberger et al. , 2012) ) and calibrated records from microwave soundings (e.g., Christy et al. 2007;Mears and Wentz 2009a,b;Zou et al. 2009) confirmed tropospheric warming and stratospheric cooling since the mid-twentieth century.In the low-to-midtroposphere, temperature trends from independent observations and models were found to be consistent, but differences remained in the upper troposphere and stratosphere (e.g., Fu et al. 2011;Mitchell et al. 2013;Lott et al. 2013).Independent observational temperature estimates from the Stratospheric Sounding Unit (SSU) showed large discrepancies and also differed from model trends (Thompson et al. 2012).
There has been substantial interest in comparisons of modeled and observed tropospheric temperature trends.Basic theory of moist adiabatic processes predicts larger warming in the tropical free troposphere compared to near the surface, referred to as tropical tropospheric amplification (Stone and Carlson 1979).A number of previous studies have found that most observational datasets appear to show weaker tropical tropospheric amplification for decadal-scale trends, while this remains a robust feature across several generations of model simulations (Santer et al. 2005(Santer et al. , 2008;;Po-Chedley and Fu 2012a;Santer et al. 2017b).While various homogenization efforts and reprocessing activities have generally reduced the observation-model differences, the trend amplification estimates from radiosondes and most reprocessed microwave sounding satellite products generally remain smaller than those from climate models and moist adiabatic lapse rate considerations (Thorne et al. 2011;Mitchell et al. 2013).Factors to consider when interpreting model-observation differences include possible errors in climate model forcings (Solomon et al. 2011(Solomon et al. , 2012;;Mitchell et al. 2013;Sherwood and Nishant 2015;Santer et al. 2017a), differences between simulated and observed sea surface temperature (SST) trend patterns (Mitchell et al. 2013;Kamae et al. 2015;Tuel 2019), internal variability (Suárez-Gutiérrez et al. 2017;Kamae et al. 2015;Santer et al. 2019), and, for satellite retrievals, effects from broad vertical weighting functions (Santer et al. 2017b).
While the vertical profile of temperature trends is an important fingerprint of climate change (e.g., Santer et al. 2013), the magnitude and vertical structure of trends is often dependent on details of the datasets and homogenization details (Hartmann et al. 2013), ''limiting the ability to draw robust and consistent inferences about the true long-term trends.''The IPCC Fifth Assessment Report (Hartmann et al. 2013)

stated that this was a key uncertainty:
There is only medium to low confidence in the rate of change of tropospheric warming and its vertical structure.Estimates of tropospheric warming rates encompass surface temperature warming rate estimates.There is low confidence in the rate and vertical structure of the stratospheric cooling.
Despite these uncertainties, it remains difficult to explain the observed pattern of stratospheric and tropospheric temperature change without anthropogenic forcing (Ramaswamy et al. 2006;Santer et al. 2013;Lott et al. 2013).
In recent years, substantial efforts have resulted in further improvements to layer-averaged temperatures from microwave sounding unit observations (Po-Chedley and Fu 2012b;Po-Chedley et al. 2015;Mears andWentz 2016, 2017;Spencer et al. 2017;Zou et al. 2018).Several merged satellite-based datasets have been constructed for providing continuous climate records in the stratosphere from 1979 to the present (McLandress et al. 2015;Zou and Qian 2016;Randel et al. 2016).Revisiting and reprocessing of stratospheric observations (Zou et al. 2014;Nash and Saunders 2015) led to improved consistency of the revised data versions; however, some differences remain (Seidel et al. 2016).Stratospheric temperature trends from the reprocessed observations and from new models of the SPARC Chemistry Climate Model Initiative (CCMI) showed substantial improvement in the agreement between modeled and observed trends, mainly due to updates of the satellite observations.The range of simulated trends was similar to that in the previous generation of models (Maycock et al. 2018;Karpechko et al. 2019).
The work of Maycock et al. (2018) also contributed to the recent Ozone Assessment Report (WMO 2018;Karpechko et al. 2019).Results confirmed a cooling of the stratosphere and an increase in stratospheric cooling with height; this effect is mainly due to increasing greenhouse gases and is modulated by evolving ozone changes.In the upper stratosphere, both greenhouse gases and ozone were found to contribute to the cooling, whereas in the midstratosphere, greenhouse gases are found to be dominant.In the lower stratosphere, ozone depletion was found to be the dominant factor for the cooling until the mid-1990s.Observed stratospheric cooling trends are weaker since around 1998 (Randel et al. 2016;Seidel et al. 2016;Zou and Qian 2016;Randel et al. 2017), reflecting a decline of ozone-depleting substances and the onset of recovery of the ozone layer (e.g., Harris et al. 2015;Solomon et al. 2016Solomon et al. , 2017)).
Vertical profiles of atmospheric temperature are available from limb-viewing satellite sounders and from groundbased observations, specifically radiosonde and lidar measurements.Reference radiosonde stations have been established over the past decade within the GCOS Reference Upper Air Network (GRUAN), adhering to the GCOS climate monitoring principles (e.g., Seidel et al. 2009;Bodeker et al. 2016).However, such series are still too short for trend retrievals.Gridded radiosonde records (Haimberger et al. 2012) have been updated recently, as well as observations from light detection and ranging (lidar) instruments (e.g., Keckhut et al. 2004;Wing et al. 2018a).
Since 2001, emerging novel satellite-based observations from Global Positioning System (GPS) radio occultation (RO), generically termed Global Navigation Satellite System (GNSS) RO, have become available for atmospheric and climate studies (e.g., Anthes 2011;Steiner et al. 2011Steiner et al. , 2020;;Ho et al. 2017Ho et al. , 2020) ) and have been identified as a key component for the GCOS (GCOS 2011).These long-term stable observations provide profile information with high vertical resolution in the upper troposphere and lower stratosphere and are well suited for climate studies (Lackner et al. 2011;Steiner et al. 2009Steiner et al. , 2011)).
We have deliberately chosen not to include reanalysis datasets in this study.While we acknowledge the high value of these products, we chose to compare observational records that are as independent of each other as possible.Since reanalyses strive to assimilate all available data sources, reanalysis products depend on all those datasets and details of the assimilation systems determine whether a reanalysis draws to one dataset more than to another.Several state-of-the-art reanalyses are currently available or in production (Fujiwara et al. 2017;Hersbach et al. 2020;Simmons et al. 2020).
In this study, we present the latest observational estimates of tropospheric and stratospheric temperature trends based on updated climate records, including novel GNSS RO satellite observations.These estimates include information from gridded radiosonde records and from lidar instruments.We provide an overview of presently available atmospheric observations and recent advances in their development, as well as some of the limitations of the datasets.We discuss variability and trends in both layer-averaged temperatures and vertically resolved data, as well as the associated uncertainties in these results.We also examine the representation of tropical tropospheric amplification in the observations.See the appendix for a list of acronyms used throughout this paper.

Observational datasets
We begin with a brief description of the observational data that are used for temperature trend analyses and discuss advantages and limitations of the data records.

a. Satellite-based observations
Instruments flown on polar-orbiting satellites of the National Oceanic and Atmospheric Administration (NOAA) provide the longest-running records of remotely sensed temperatures.These instruments include the Microwave Sounding Unit (MSU), the Advanced Microwave Sounding Unit (AMSU), and the SSU.SSU measurements are available from late 1978 to 2006 and are the only long-term temperature record in the mid-upper stratosphere with global coverage.The MSU instrument provided data from late 1978 until 1998.The follow-up AMSU instrument provides measurements from 1998 to the present.The sensors measure the radiance of Earth in a cross-track geometry and provide information on broad layer-averages of temperature.Information with higher vertical resolution is given by sensors in limb-viewing geometry, which scan the atmosphere in the vertical.Novel data for climate monitoring with long-term stability are available since 2001 from GNSS radio occultation, the latter exploiting atmospheric refraction.

1) MICROWAVE SOUNDING OBSERVATIONS
MSU and AMSU sounders are available from a suite of satellites that partially overlap in time.These passive microwave radiometers measure the radiance of Earth at microwave frequencies.The thermal emission line of oxygen near 50-60 GHz is used for retrieving atmospheric temperature information since oxygen is well mixed in the atmosphere.Measuring at different frequencies near the oxygen absorption line corresponds to weighting functions peaking at different heights, which provide information on bulk temperatures over a typical vertical width of about 10 km.
The MSU instrument had four different channels delivering temperature information on four thick atmospheric layers until the NOAA-14 satellite ceased in 2005.The AMSU-A instrument began operation in 1998 with a larger number of 15 channels, sampling more atmospheric layers with better resolution.The MSU data record has been extended to the present by using the AMSU-A channels that most closely match the MSU channels from 1979 to the present based on satellites from NOAA TIROS-N through NOAA-19, the National Aeronautics and Space Administration (NASA) Aqua satellite, and the European Meteorological Operational (MetOp) satellite series.
For this study we use MSU-AMSU-A records from three groups: Remote Sensing Systems (RSS, Santa Rosa, California), the Center for Satellite Applications and Research (STAR) of NOAA/National Environmental Satellite, Data, and Information Service (NESDIS, College Park, Maryland), and the University of Alabama (UAH, Huntsville, Alabama).We use the latest versions of MSU-AMSU-A climate data products, which include RSS, version 4.0 (RSS 2019;Mears and Wentz 2009a,b, 2016, 2017); STAR, version 4.1 (NOAA STAR 2019; Zou and Wang 2011); and UAH, version 6.0 (UAH 2019; Spencer et al. 2017).Monthly averaged time series and anomaly time series are available at a resolution of 2.58 3 2.58 in longitude and latitude (Table 1).
The records contain layer-average temperatures computed from single channels from near-nadir views for the midtroposphere (TMT; MSU channel 2/AMSU-A channel 5), the upper troposphere (TUT or TTS or TTP; MSU channel 3/AMSU-A channel 7), and the lower stratosphere (TLS; MSU channel 4/AMSU-A channel 9).The contributions for the temperature averages originate from broad layers peaking near 5 km for TMT, near 10 km for TUT, and near 17 km for TLS (Fig. 1).The TMT and TTS weighting functions extend into the stratosphere and contaminate tropospheric information.To accentuate tropospheric information, a TMT corrected temperature (TMTcorr) can be constructed by subtracting the stratospheric contribution from the TMT channel (Fu et al. 2004;Fu and Johanson 2005;Johanson and Fu 2006;Po-Chedley et al. 2015).We computed TMTcorr by a linear combination of TMT and TLS after Johanson and Fu (2006).Similarly, we computed a TTS corrected temperature (TTScorr) by a linear combination of TTS and TLS: 1.18 3 TTS 2 0.18 3 TLS.
Additionally, RSS and UAH provide a product for the lower troposphere (TLT) from a weighted average of measurements made at different incidence angles (RSS) or a weighted combination of TMT, TTS, and TLS observations (UAH) to extrapolate MSU channel 2 and AMSU-A channel 5 lower into the lower troposphere, with a peak contribution near 2 km (Mears and Wentz 2017;Spencer et al. 2017).AMSU-only stratospheric temperature datasets are available from mid-1998 to present from single channels (Wang and Zou 2014).
The merging of MSU and AMSU measurements from many different instruments requires a number of adjustments, since inhomogeneities from different sources can result in spurious trends in retrieved temperatures.Instrument changes over time using different channel frequencies introduce differences due to sampling of slightly different atmospheric layers.Sampling errors occur also from sampling at different local times when satellites are in different orbits.Orbital decay over time can cause brightness temperatures from the near-limb views to warm faster relative to those from the near-nadir views (Wentz and Schabel 1998).The calibration of the electric signal conversion to radiances is also a potential error source when it drifts over time.The absolute calibration uncertainty is estimated to be 0.5-1 K.An overview of known errors is given by Zou et al. (2018).
Over time, the three processing groups developed improved algorithms to account for calibration issues and time-varying biases before the measurements are compiled into a long-term temperature record (Christy et al. 2000(Christy et al. , 2003;;Mears and Wentz 2009a,b;Zou andWang 2010, 2011).Mears et al. (2011) performed a detailed uncertainty assessment for RSS data.They discussed uncertainty estimates that arise from the methodological choices in accounting for sampling error, diurnal adjustment, and merging procedures.The different methodological approaches by the processing groups lead to differences in climate data records.However, the latest product versions including improved diurnal drift correction based on observations (Po-Chedley et al. 2015) show much better agreement than the earlier versions (e.g., Seidel et al. 2016;Santer et al. 2017b).Our analyses include each of the datasets to provide a measure of uncertainty due to the differing methodologies.

2) STRATOSPHERIC SOUNDING UNIT AND MERGED DATASETS
The SSU is a nadir-sounding instrument that flew on NOAA operational satellites from November 1978 to April 2006.The sensor measured the thermal emission of atmospheric carbon dioxide (CO 2 ) in the infrared absorption line near 15 mm.The measurement made use of the pressure modulation technique by putting a cell of CO 2 gas in the instrument's optical path.By modulating the gas pressure of the CO 2 cell, the single CO 2 absorption line was split into three channels with their weighting functions peaking at 30 km for channel 1 (SSU1), 35 km for channel 2 (SSU2), and 45 km for channel 3 (SSU3), respectively.Accordingly, the main contributions for layer-average temperatures stem from heights between 20 and 40, between 25 and 45, and between 35 and 55 km, respectively, spanning the whole stratosphere as illustrated in Fig. 1.
The creation of a consistent homogeneous climate data record from SSU is a challenging task due to several issues and limitations inherent in the SSU measurements that require corrections for radiometric, spectroscopic, and tidal differences (Nash and Saunders 2015).Gas leakage from the onboard CO 2 cell caused the cell pressure to decrease, which caused weighting functions to peak at different layers over time.Moreover, the weighting functions were sensitive to CO 2 changes in the atmosphere (Shine et al. 2008).Orbital drift also caused biases through sampling of the diurnal cycle at different observation times.Detailed descriptions of these issues are provided by Wang et al. (2012), Nash andSaunders (2013, 2015), and Zou et al. (2014).
In recent work, two groups reprocessed all SSU measurements and generated improved CDRs by correcting for CO 2 cell pressure changes, satellite orbit drift, changes in atmospheric CO 2 , and viewing angle differences, and by accounting for the effects of solar diurnal tides in local time sampling.NOAA/STAR provides version STAR SSU v2.0 (NOAA STAR 2019) as monthly means on a 2.58 3 2.58 latitude and longitude grid (Wang et al. 2012;Zou et al. 2014).The Met Office (Exeter, United Kingdom) provides version UKMO SSU v2.0 only as 6-month-average global means (Nash and Saunders 2015), so that an analysis of latitudinal and seasonal variability is not possible for the UKMO dataset.Comparison of these independently derived versions of SSU show global-mean temperature differences of about 0.5 K, especially from 1979 to 1990 for the upper channels SSU2 and SSU3.A consistency check of SSU data is taking the average of the lowermost channel SSU1 and the uppermost channel SSU3 and subtracting the middle channel SSU2, which should be close to zero.This difference was found to be within 0.2 K for NOAA/STAR data but much larger for UKMO data (see Fig. 7 in Seidel et al. 2016).We therefore use the STAR SSU v2.0 dataset in the current study.
Several merged data records have been constructed by extending the SSU data, which ended in April 2006, with satellite-based datasets from nadir or limb sounders.These data have higher vertical resolution and are integrated vertically with SSU weighting functions to provide SSUequivalent data that are combined with SSU.McLandress et al. (2015) merged SSU and AMSU data by bridging them with Michelson Interferometer for Passive Atmospheric Sounding (MIPAS) measurements (Fischer et al. 2008).That record is only available until 2012.Randel et al. (2016) have provided a merged SSU record by combining SSU with Microwave Limb Sounder (MLS) data on the Aura satellite.MLS measures microwave emission from O 2 and delivers temperatures at 10-90-km altitude with a vertical resolution of about 4-7 km over 20-50 km (Schwartz et al. 2008).Randel et al. (2016) also combined SSU with data from the Sounding of the Atmosphere Using Broadband Emission Radiometry (SABER) instrument based on CO 2 emissions, with temperatures retrieved over 16-100 km and a vertical resolution of 2 km (Remsberg et al. 2008).Zou and Qian (2016) have derived a merged dataset, STAR SSU-AMSU v3.0, by combining SSU and AMSU measurements with a variational approach for optimally merging the data.In this work, we use those merged SSU records that have been updated to present, that is, STAR SSU-AMSU v3.0 (NOAA STAR 2019; Zou and Qian 2016) and SSU-MLS (Randel et al. 2016).
In addition to MLS, SABER, and MIPAS, there are other limb-viewing satellite instruments that provide temperature observations with relatively high vertical resolution (2-4 km).These include the Atmospheric Chemistry Experiment-Fourier Transform Spectrometer (ACE-FTS) from 2004 to the present (Bernath 2017) and the Global Ozone Monitoring by Occultation of Stars (GOMOS) for 2002-12.For GOMOS, two temperature datasets have been released recently: one for the stratosphere (Sofieva et al. 2019), and another dataset for the upper stratosphere and the mesosphere (Hauchecorne et al. 2019).Exploration of these data for potential use in trend analyses is the subject of future research.

3) GNSS RADIO OCCULTATION OBSERVATIONS
Since 2001, a new type of temperature observation from GNSS RO is available (Anthes 2011).RO is based on the refraction of GNSS radio signals by the atmospheric refractivity field during their propagation to a receiver on a low-Earth orbit satellite.Scanning the atmosphere in limb sounding geometry provides profiles of high vertical resolution of about 100 m in the troposphere and tropopause, and about 1 km in the stratosphere (Kursinski et al. 1997;Gorbunov et al. 2004;Zeng et al. 2019).Horizontally, the resolution is about 1.5 km across ray and ranges from about 60 to 300 km along ray in the lower troposphere to the stratosphere (Melbourne et al. 1994;Kursinski et al. 1997).The uncertainty of individual RO temperature profiles is about 0.7 K near the tropopause, gradually increasing into the stratosphere (Scherllin-Pirscher et al. 2011a,b, 2017).For monthly zonal means, the total uncertainty estimate is smaller than 0.15 K in the upper troposphere and lower stratosphere, and up to 0.6 K at higher latitudes in wintertime (Scherllin-Pirscher et al. 2011a).
As the time delay measurement of the refracted signals is based on precise atomic clocks, this enables longterm stability and traceability to the Système International (SI) unit of the second (Leroy et al. 2006).Therefore, data from different RO missions can be merged to a seamless record without intercalibration or requiring substantial temporal overlap (Foelsche et al. 2011;Angerer et al. 2017).Continuous observations are available from several RO satellite missions.So far, most missions have used GPS signals at wavelengths of 0.19 and 0.24 m in the microwave.At these wavelengths, the signals are not affected by clouds and observations are available in nearly all weather conditions.Bending angles are computed from the refracted signals.At high altitudes, the signal-to-noise ratio of the bending angle decreases (above about 50 km depending on the thermal noise of the receiver) and an initialization of bending angle profiles with background information is performed.
Refractivity is computed from bending angle and is related directly to temperature under dry atmospheric conditions.This is the case in the upper tropospherelower stratosphere (UTLS) where water vapor is negligible (Kursinski et al. 1997;Scherllin-Pirscher et al. 2011a).In the moist lower-to-middle troposphere, the retrieval of (physical) atmospheric temperature or humidity requires a priori information in order to resolve the wet-dry ambiguity information inherent in refractivity (e.g., Kursinski et al. 1995;Kursinski and Gebhardt 2014).In this study, we use RO dry temperatures (without a priori information) above 9 km to avoid the wet-dry ambiguity.For this altitude range, we found that the difference in trends from dry air temperature to trends from a moist retrieval is negligible (not shown).
The data processing adds structural uncertainty to the data as different processing centers use different background information and methods.For the early RO period, based on the single-satellite CHAMP mission only, this uncertainty increases above 25 km (Ho et al. 2012;Steiner et al. 2013), due to receiver noise and therefore larger impact of the high-altitude bending angle initialization (Leroy et al. 2018).Thus, for climate trend studies, CHAMP is regarded as a limiting factor.In addition, only 150 occultation profiles per day are available from CHAMP.However, uncertainty due to the changing number of observations is reduced by correcting for the sampling error in RO climatological fields (Foelsche et al. 2008).For later missions, based on advanced receivers, data are usable to higher altitudes (Steiner et al. 2020).Overall, structural uncertainty in trends is lowest at 8-25-km altitude globally for all inspected RO variables and different missions (Steiner et al. 2020).Data products comprise individual profiles and gridded fields of bending angle, refractivity, pressure, geopotential height, temperature, and specific humidity.These products have been used in a number of different atmosphere and climate studies (Ho et al. 2010;Anthes 2011;Steiner et al. 2011;Ho et al. 2020).
Comparison of Wegener Center (WEGC) RO data against MSU-AMSU records showed slight differences in TLS trends (Steiner et al. 2007;Ladstädter et al. 2011) while good agreement of Radio Occultation Meteorology Satellite Application Facility (ROM SAF) RO stratospheric trends to Aqua AMSU records was found (Khaykin et al. 2017).Comparisons with collocated radiosondes (detailed in the following section), Vaisala RS90/92 and GRUAN, showed very good agreement with global annualmean temperature differences of less than 0.2 K. Radiosonde daytime radiation biases were identified at higher altitudes (Ladstädter et al. 2015;Ho et al. 2017).The stability of RO makes it a useful calibration reference for AMSU (Chen and Zou 2014) and radiosondes (Ho et al. 2017;Tradowsky et al. 2018).

b. Ground-based observations
Ground-based temperature observations are available from radiosonde measurements made with weather balloons.Radiosonde measurements extend into the lower stratosphere only, while lidar measurements extend to the mid-and upper stratosphere.Observations are limited to ground stations and have limited coverage in space and time.

1) RADIOSONDE OBSERVATIONS
Reliable radiosonde temperature records commenced in 1958.Observations are made once or twice per day at stations that are mainly located on Northern Hemisphere continents.The weather balloons reach up to about 25 km until they burst.Depending on wind conditions, typical drift distances are a few kilometers in the lower troposphere to about 50 km in the lower stratosphere (Seidel et al. 2011).All radiosonde datasets have limited coverage in the tropics.Different countries use different instrument types, and instrumentation has changed over time (e.g., Thorne et al. 2011).A further problem is that radiosondes are affected by radiation biases during daytime measurements (Sherwood et al. 2005;Ladstädter et al. 2015).To reduce data discontinuities and residual cooling biases in radiosonde-derived CDRs, a number of different adjustment techniques have been developed.
Several centers have produced homogenized radiosonde products using different methods.NOAA's Radiosonde Atmospheric Temperature Products for Assessing Climate (RATPAC) (Free et al. 2004) is based on spatial averages of adjusted temperature data (Lanzante et al. 2003) from 1958 to 1995.Since 1996, it is based on the Integrated Global Radiosonde Archive (IGRA) station data using a first difference method (Free et al. 2004) and the record is not fully homogenized.RATPAC-A, version 2, data (NOAA NCEI 2019) are provided as zonal yearly anomalies.
The Hadley Centre Atmospheric Temperature dataset (HadAT) from the Met Office (Thorne 2005;McCarthy et al. 2008) uses a larger number of stations; however, it is only available for 1958-2012 and not updated to the present (www.metoffice.gov.uk/hadobs/hadat/).Sherwood et al. (2008) constructed a radiosonde record based on an iterative universal kriging (IUK) method.The record is available only until 2015 and not updated to present (Sherwood and Nishant 2015).Haimberger et al. (2008) introduced the homogenized Radiosonde Observation using Reanalysis (RAOBCORE) and Radiosonde Innovation Composite Homogenization (RICH) datasets.Break points are determined either by using composites of neighboring observations as reference (RICH) or by comparing to departures from a reanalysis background (RAOBCORE).RICH is independent of the background, but interpolation errors may be large where sampling is sparse such as in the tropics and the Southern Hemisphere.RAOBCORE reduces interpolation errors at the cost of slight background dependence on ERA-Interim (Haimberger et al. 2012).Gridded RAOBCORE and RICH data have been updated to the end of 2018 (Haimberger 2019).The data are provided as monthly means at 108 3 108 resolution.
Radiosonde temperature data are also available from GRUAN radiosonde stations (https://www.gruan.org/;Bodeker et al. 2016).GRUAN is a reference observing network of quality measurements of ECVs to reduce uncertainty in climate monitoring (Seidel et al. 2009;Thorne et al. 2013).As of 2019, GRUAN comprises of 26 sites, 12 of which have been certified.We do not use GRUAN data because records start in 2009 and are too short for reliable trend estimation.
In this study, we use the RICH and RAOBCORE radiosonde records of the University of Vienna.In addition, we also use radiosonde data from the ERA-Interim archive (denoted RS-VAIS), restricting our attention to data from the Vaisala RS80, RS90, RS92, and RS41 radiosondes from 1995 onward.These measurements are known to be of high quality (Steinbrecht et al. 2008;Nash et al. 2011;Ladstädter et al. 2015).

2) LIDAR OBSERVATIONS
Stratospheric and lower-mesospheric temperature lidar measurements are available at several locations from the Network for the Detection of Atmospheric Composition Change (NDACC).The Rayleigh lidar technique uses molecular backscattering of a pulsed laser beam to derive the vertical profile of atmospheric density.The collected signal is sampled as a function of time, that is, geometric altitude.The intensity of scattered light is directly related to the air density at the backscatter altitude considered.Using a priori temperature information at the top of the profile, temperature can be retrieved with high spatiotemporal resolution from the measured relative density profile (Hauchecorne and Chanin 1980).Accuracy and precision both depend on altitude, and typically range from less than 0.1 K in the stratosphere to 10 K or more at the very top of the profile (80 km or higher).Descriptions of the Rayleigh lidar temperature retrieval and its uncertainty can be found in Hauchecorne and Chanin (1980), Keckhut et al. (2011), Leblanc et al. (2016), and Wing et al. (2018a).Validation studies showed that the accuracy of individual lidar profiles is better than 1 K in the altitude range of 35-65 km (Keckhut et al. 2004).A variety of studies have assessed stratospheric temperature variability and trends from lidar (e.g., Randel et al. 2009;Steinbrecht et al. 2009;Li et al. 2011;Funatsu et al. 2011Funatsu et al. , 2016)).Lidar temperatures for the stratosphere and mesosphere were used as reference data for detecting biases in satellite-based observations from limb sounders (Wing et al. 2018b).
In addition, we show data available since 1993 from the JPL lidar at the tropical station of Mauna Loa Observatory (MLO) in Hawaii (19.548N, 155.588W;Leblanc and McDermid 2001).Under clear-sky conditions, lidar temperature measurements are usually made on 5-20 nights per month at each station.These measurements were then averaged to monthly mean time resolution for each station.The time series for OHP, HOH, TMF, and MLO were analyzed in this study.

Trend analysis
For estimation of atmospheric trends from observations, we used global-mean and zonal-mean monthly mean temperature time series of layer-average brightness temperatures and of vertically resolved temperature observations.RO and RS-VAIS zonal-mean fields were corrected to account for their incomplete sampling of the full spatial and temporal variability of the atmosphere (Scherllin-Pirscher et al. 2011a;Ladstädter et al. 2015).The sampling error is estimated from the difference between a field of averaged collocated profiles and a full atmospheric field (Foelsche et al. 2008).The sampling error is subtracted from the gridded climatologies, leaving a small residual sampling error (Scherllin-Pirscher et al. 2011a).The atmospheric fields used in this study for estimating the sampling error were reanalysis fields from ERA5.1 (Simmons et al. 2020) for ROM SAF RO, WEGC RO, and RS-VAIS, and ERA-Interim for UCAR RO.RICH and RAOBCORE gridded fields were not corrected for sampling error.
We computed anomaly time series by subtracting the monthly climatology of the common reference period 2002-18 from the absolute time series.Trend estimates were computed for a number of different periods: 1979-2018, 1979-98, 1999-2018, and 2002-18.Trends were computed by applying a linear ordinary least squares fit as well as by multiple regression analysis.The uncertainty estimates of the trends are expressed as 95% confidence level, accounting for lag-1 autocorrelation of the regression residuals.Trends are deemed to be ''significantly different from zero'' if the confidence interval does not contain the null hypothesis value (zero trend).
The multivariate regression model includes a linear trend term and natural variability terms accounting for the solar cycle, El Niño-Southern Oscillation (ENSO), stratospheric volcanic eruptions, and the quasi-biennial oscillation (QBO) (Fig. 2).Commonly used indices describe these terms.Solar variability is represented by the radio emission flux from the sun at a wavelength of 10.7 cm.The period 1979-2018 covers almost four solar cycles.Daily observed solar flux values (Natural Resources Canada 2019) were averaged to monthly means.
ENSO originates in the tropical Pacific Ocean with warm SSTs during El Niño phases and cold anomalies during La Niña phases.It dominates interannual variability in the troposphere up to the lowermost stratosphere.During an El Niño event, the tropical troposphere warms and the lowermost tropical stratosphere cools (Free and Seidel 2009;Randel et al. 2009).Deviations from the zonal mean are seen as eddy signals in the subtropics (Scherllin-Pirscher et al. 2012).We use the Niño-3.4SST index as an ENSO proxy.This is the spatially averaged SST in the Niño-3.4region (58S-58N and 1708-1208W).By definition, El Niño or La Niña periods occur if 5-month running means of SST anomalies in this region exceed 10.4 K or 20.4 K, respectively, for at least six months (Trenberth 1997).Our multiple regression relied on version 5 of the Extended Reconstructed Sea Surface Temperature dataset (ERSSTv5; Huang et al. 2017).To account for lags between this measure of ENSO variability and the response of tropospheric temperature, we used a lag of 3 months for the monthly ERSSTv5 (1981-2010 base period) Niño-3.4index (CPC 2019).
Tropical stratospheric variability is dominated by the QBO, which has a period of about 28 months.The QBO is characterized by alternating easterly and westerly wind regimes propagating downward to the tropopause at about 1 km per month.This is also seen in the stratospheric temperature structure as positive and negative temperature anomalies of several degrees; anomalies are proportional to the vertical gradient of the zonal winds (Randel et al. 1999;Baldwin et al. 2001).This distinctive thermal structure makes it possible to investigate the QBO with RO temperature anomalies (Wilhelmsen et al. 2018).Here, we use the QBO index of monthly mean zonal winds of the Freie Universität of Berlin (FU Berlin 2019) produced by combining observations of three radiosonde stations: Canton Island, Gan/Maldives, and Singapore (Naujokat 1986).Applying a principal component analysis to the wind profiles over 70-10 hPa, we use the first two orthogonal basis functions, PC1 and PC2, as proxies for the QBO (Wallace et al. 1993).
Explosive volcanic eruptions such as El Chichón in 1982, Mount Pinatubo in 1991 (Robock 2000) and also minor volcanic eruptions after 2000 affect short-term temperature trends in the troposphere and stratosphere (Solomon et al. 2011;Stocker et al. 2019).As a proxy for the effects of volcanic eruptions we compute the stratospheric aerosol optical depth over 15-25 km from the monthly mean Global Space-Based Stratospheric Aerosol Climatology (GloSSAC), version 1.0, averaging over the tropics and subtropics (Thomason 2017;Thomason et al. 2018).
In addition, we used observed surface temperature trends to compare with trends in the free atmosphere.We employed the HadCRUT4 dataset for this purpose (Met Office and the Climatic Research Unit, University of East Anglia, United Kingdom; HadCRUT4 2020; Morice et al. 2012).

a. Long-term time series and linear trends
Here we present multidecadal time series and linear trends over the 40-yr period 1979-2018 for the stratosphere and the troposphere.Results are from SSU and MSU layeraverage temperatures as well as from lidar temperatures.Figure 3 shows near-global-average (858S-858N) anomaly time series of stratospheric temperatures for the lowerstratospheric TLS channel from three MSU-AMSU records and for SSU channels in the mid-upper stratosphere from two merged records, SSU-AMSU and SSU-MLS.Stratospheric temperatures show the impact of the major eruptions of El Chichón in 1982 and Mount Pinatubo in 1991.These large warming signals have peak amplitude in the lower stratosphere and last for roughly two years after the eruptions; only minor TLS changes occurred between the two eruptions.
The linear trend over the last four decades shows cooling of the stratosphere.This is also the case if the anomalous years after the major volcanic eruptions are disregarded in the trend computation, which has minimal impact on trend values but substantially reduces the trend uncertainty (not shown).Accounting for the main modes of natural variability by applying multiple regression analysis also reduces the trend uncertainty with only small impact on trend values.Stratospheric trends increase from the lower stratosphere to the upper stratosphere.For results from the STAR group, for example, the least squares linear trends for the period 1979-2018 are 20.25 6 0.16 K decade 21 for TLS, and 20.56 6 0.13, 20.62 6 0.13, and 20.70 6 0.14 K decade 21 for channels SSU1, SSU2, and SSU3 (respectively; see Fig. 3).The corresponding trends from the multiple regression model, also obtained with STAR data, are 20.17 6 0.08 K decade 21 for TLS, and 20.50 6 0.09, 20.58 6 0.09, and 20.67 6 0.10 K decade 21 for SSU1, SSU2, and SSU3 (respectively).These results indicate that the long-term trends are robust to the statistical methodology used for fitting trends.The implied trend uncertainty is smaller in the multiple regression analysis, although some degree of collinearity between several of the predictor variables (see Fig. 2) can hamper assessment of trend uncertainty (Santer et al. 2001).All trends are significant at the 95% level, and results from different research groups are reasonable consistent.The overall decrease in stratospheric temperature is about 1-3 K over the last four decades, but the characteristics change over time and as a function of atmospheric layer.
Cooling is larger in the first half of the record.The amplitude of trends decreases since the late 1990s, particularly in the lowermost stratosphere.This nonlinear behavior in TLS is due to the decline of stratospheric ozone in the early period (Ramaswamy et al. 2001) and ozone recovery after roughly 1998 due to the effectiveness of the Montreal Protocol (WMO 2018).These results are consistent with a number of previous studies (e.g., Randel et al. 2009Randel et al. , 2016;;Seidel et al. 2016;Zou and Qian 2016;Randel et al. 2017;Polvani et al. 2017;Solomon et al. 2017;Maycock et al. 2018).Interestingly, since about 2015 the cooling seems to be enhanced, which may be related to the onset of solar minimum conditions.
Lidar time series (Fig. 4) are the only long-term temperature series in the stratosphere suitable for comparison to SSU temperatures.Monthly mean temperature time series from the four selected lidar stations are presented.Equivalent temperatures have been computed from lidar profiles using the SSU3 weighting function from STAR, by sampling and weighting the lidar temperature at the respective height levels (ftp://ftp.star.nesdis.noaa.gov/pub/smcd/emb/mscat/data/SSU/SSU_v2.0/Weighting_Function/).The vertically weighted lidar results are compared to collocated SSU3 data at grid points corresponding to the location of the respective lidar stations.A constraint in this comparison is the limited temporal sampling of the lidars at individual stations (measurements are limited to clear nights) in contrast to the full monthly sampling of SSU.This may partly explain the larger variability of the lidar temperature series.
In general, the variability in lidar temperatures is higher at the more northerly HOH and OHP stations (47.88 and 448N, respectively) and smaller for the tropical MLO station (19.58N).Lidar temperature anomalies are well correlated with the SSU3 time series: correlations are 0.76, 0.70, 0.73, and 0.72 for SSU-AMSU versus HOH, OHP, TMF, and MLO (respectively).All linear trends in Fig. 4, both for the lidar data and SSU, were computed for the time periods dictated by the length of individual lidar records.At HOH, the linear trend of 20.39 6 0.59 K decade 21 for the lidar is smaller than the trend of 20.59 6 0.38 K decade 21 for SSU-AMSU (Fig. 4a).Although the latter is statistically significant, the trend in HOH lidar data is not significant because of the larger variability of lidar data.At TMF, the lidar trends are larger than SSU trends, specifically due to the differences in the first half of the time series.These differences are likely associated with a warm temperature bias in the first few years of the lidar record.The warm bias was caused by the presence of signal-induced noise in the raw lidar data complicating the extraction of background noise.The pre-1996 temperature data at TMF should therefore be considered with caution.A full reanalysis of these data is currently being undertaken, with the expectation of a more accurate TMF record during these early years.At OHP, lidar trends are of a similar magnitude as SSU-AMSU trends over the time period considered, but these are not statistically significant due to the large variability.Note that data at the end of the OHP time series are not included in this work as they are currently being investigated for biases.For the MLO station, there is very close agreement between the lidar trend of 20.37 6 0.20 K decade 21 and the SSU-AMSU trend of 20.31 6 0.12 K decade 21 .
Tropospheric temperature anomalies are shown in Fig. 5 based on time series of MSU-AMSU layer-average temperature anomalies.As expected, the interannual variability is strongly correlated with ENSO behavior-for example, positive tropospheric temperature anomalies coincide with El Niño events in 1983, 1997, 2010, and 2016.Statistically significant warming trends are detected over the last four decades in the lower troposphere (TLT), over the total troposphere (TMTcorr), and, to a lesser amount, for TTScorr.The trend is weaker for the unadjusted midtroposphere channel (TMT) because it contains information from cooling of the stratosphere.The upper-troposphere channel TTS (see Fig. 1) also reaches into the lower stratosphere (to about 20 km above Earth's surface) and therefore integrates over tropospheric warming and stratospheric cooling; this results in a near-zero trend (not shown).
Note that the TTS time series of RSS starts in 1987, and is therefore shorter than the TLT, TMT, TMTcorr, and TTS records.
The warming of the troposphere is about 0.6-0.8K over the last four decades (Fig. 5).The RSS least squares linear trend for TMTcorr (0.19 6 0.04 K decade 21 ) is similar to the trend obtained from multiple linear regression of (0.16 6 0.02 K decade 21 ).Trend values are lowest for the UAH record (see, e.g., Santer et al. 2017a,b).

b. Latitude structure of trends
We used multiple regression to calculate trends as a function of latitude for 108 zonal bands.The latitudinal structure of stratospheric trends (Fig. 6a) shows a consistent picture of cooling over all latitudes that increases with height.Cooling is statistically significant in all four stratospheric layers and at all latitudes, except poleward of ;508S and 508N for TLS and at very high latitudes in the Southern Hemisphere for the SSU channels.Trends range from approximately 20.25 K decade 21 in the lower stratosphere (TLS) to 20.5 to 20.7 K decade 21 in the midupper stratosphere (SSU1 to SSU3).At northern high latitudes, cooling is up to 21 K decade 21 in the uppermost SSU channel while at southern high latitudes it is weaker.In the lower stratosphere, the latitudinal trend structure is different from the upper stratosphere, with largest cooling at high latitudes in the Southern Hemisphere and smallest cooling over the northern polar region.This structure in TLS trends arises because the strengthening of the Brewer-Dobson circulation (BDC) over 1979-2018 leads to cooling at low latitudes and high-latitude warming.The BDC partly compensates for the radiative cooling associated with high-latitude ozone depletion, especially in the Southern Hemisphere (Fu et al. 2015(Fu et al. , 2019)).Both ozone and atmospheric circulation changes are important factors in determining the latitudinal pattern of the lower-stratospheric cooling trend (e.g., Solomon et al. 2017;Maycock et al. 2018).In addition, an enhanced lower-stratospheric cooling is seen in the midlatitudes (Fu et al. 2006), which is caused by the poleward shift of subtropical jets associated with tropical expansion (Fu and Lin 2011;Polvani et al. 2017;Maycock et al. 2018).
Tropospheric trends (Fig. 6b) show significant warming over all latitudes from the lower troposphere to the midtroposphere (channels TLT, TMTcorr, TMT), except at southern high latitudes where the trend is near zero.At northern high latitudes, warming trends are largest and reach about 0.3-0.5 K decade 21 .TTScorr shows significant trends only in the tropics.Although RSS and UAH have large differences in TLT trends in this region, both products clearly show significant tropical warming.Only channel TTS shows near-zero trends that can be explained by the broad weighting function, which receives contributions from both tropospheric warming and lower-stratospheric cooling (see Fig. 1).
We also calculated zonal-mean trends for the period 2002-18 (Fig. 7), thus facilitating direct comparison with GNSS RO observations.Stratospheric trends (Fig. 7a) show larger uncertainties than were evident for the 1979-2018 period, particularly at high latitudes.This is due to combined effects of the large dynamical variability and the shorter analysis period.In the lowermost stratosphere, the trend over all latitudes is near zero except at southern high latitudes, where it reaches 21 K decade 21 .This result is highly dependent on the end points of the short data record, as Antarctic TLS trends beginning in 1998 are positive (Randel et al. 2017), while trends beginning in 2000 or 2002 are negative (Fu et al. 2019;Fig. 7); the results are strongly influenced by the Antarctic stratospheric warming in 2002 (e.g., Newman and Nash 2005).This sensitivity highlights the uncertainty of polar stratospheric trends derived from short data records with arbitrary end points.Note that Antarctic ozone has been recovering since the late 1990s (Solomon et al. 2017), leading to radiative heating within the background of large dynamic variability.In the mid and upper stratosphere, cooling trends are found to be significant over 508S-508N.Almost all TLS trends over 2002-18 fail to achieve statistical significance.
Tropospheric trends for the period 2002-18 (Fig. 7b) show a latitudinal structure similar to that found over the full 1979-2018 period, with significant trends throughout most of the tropics and subtropics.At high latitudes, however, trends have large uncertainties and are not significant.

c. Vertically resolved trends
Vertically resolved trends from radiosonde data in Fig. 8 are presented for the period 1979-2018 together with trends from layer-average temperatures from MSU-AMSU and merged SSU records.This provides an overview of upper-air trends from the lower troposphere to the stratopause for near-global averages (708S-708N) (Fig. 8a) and for the tropics (208S-208N) (Fig. 8b).Overall, the different records show remarkably good agreement.Surface temperature trends are also indicated and are similar to TLT trends.
Global-mean temperature trends from the homogenized, gridded RICH and RAOBCORE radiosonde records are plotted at standard pressure levels and show warming of near 0.2 K decade 21 in the lower and midtroposphere (from 0.15 6 0.03 K to 0.19 6 0.03 K decade 21 at 5-9 km).Close agreement of radiosondes with layer averages of MSU-AMSU is found for TLT and TMTcorr, except in the case of UAH satellite data (which has less warming than the RSS and STAR MSU-AMSU products).
In the lower stratosphere, cooling rates inferred from radiosonde records range from about 20.2 to 20.4 K decade 21 (from 20.26 6 0.09 to 20.39 6 0.06 K decade 21 ).In the mid-upper stratosphere, the radiosonde-estimated cooling increases from 20.5 to 20.7 K decade 21 (from 20.52 6 0.05 to 20.68 6 0.07 K decade 21 ), in close accord with merged SSU records.
Tropical average trends (Fig. 8b) show a similar structure to the near-global mean (708S-708N).As expected, the trend uncertainties are slightly larger because tropical variability is larger than for near-global averages.Tropospheric warming rates are about 0.15 K decade 21 (from 0.12 6 0.04 to 0.18 6 0.03 K decade 21 ) from radiosondes and correspond well with MSU-AMSU TLT and TMTcorr trends.In addition, the vertical amplification obtained by comparing the RSS and STAR TMTcorr trends with the surface trends is seen in the tropics (Fig. 8b), which is consistent with results from earlier studies by Fu et al. (2004) and Po-Chedley et al. (2015).UAH TMTcorr trends do not yield amplification of tropical surface warming.The UAH upper-troposphere channel TTScorr shows smaller trends but the STAR TTScorr agrees well with radiosondes, showing impact on trends from different bias correction algorithms in satellite merging.
In the tropical lowermost stratosphere, the cooling rate is about 20.25 K decade 21 and increases to about 20.35 K decade 21 , as observed by radiosondes.The cooling increases from 20.54 6 0.08 to 20.70 6 0.08 K decade 21 in the mid-upper stratosphere (SSU1 to SSU3) as seen from merged SSU records.
Figure 9 shows the variance explained by the different components of the multiple regression model.Results are for time series of tropical temperature anomalies for 1979-2018, and are for TMTcorr, MSU TLS, and SSU1, SSU2, and SSU3.In the troposphere (TMTcorr), about 45% of the variance is explained by ENSO.In the lower stratosphere (TLS), stratospheric volcanic aerosols and the QBO explain about 20% and 15% of the variance (respectively).In the mid-upper stratosphere (SSU1, SSU2, SSU3), ,10% variance is explained by the QBO and about For the shorter period 2002-18, our comparison of upper-air trends (Fig. 10) includes vertically resolved trends from three GNSS RO records (ROM SAF, UCAR/ NOAA, WEGC), the RICH, RAOBCORE, and RS-VAIS radiosonde records, and the MSU-AMSU and merged SSU records.In the troposphere up to roughly 10-km altitude, radiosondes show global warming of 0.3 K decade 21 , whereas trends from layer-averaged MSU-AMSU are smaller (Fig. 10a).Trends from radiosondes agree well with trends from RO data in the midtroposphere and in the lower stratosphere.For the latter, a global-mean cooling of about 20.2 K decade 21 near 25-km altitude is observed (from 20.16 6 0.11 to 20.35 6 0.09 K decade 21 for WEGC and UCAR/ NOAA, ROM SAF).The cooling increases in the midupper stratosphere from 20.42 6 0.06 to 20.58 6 0.08 K decade 21 (SSU-MLS).
In the tropics (Fig. 10b), tropospheric trends from the different RO records (ROM SAF, UCAR/NOAA, WEGC) are consistent with the RS-VAIS record (within their respective uncertainty estimates) up to 14-km altitude.While gridded radiosonde records (RICH and RAOBCORE) show a warming of about 0.2 K decade 21 throughout the troposphere (from 0.17 6 0.14 to 0.26 6 0.14 K decade 21 ), RO and RS-VAIS show a larger warming in the mid-upper troposphere of about 0.3 K decade 21 (from 0.31 6 0.11 to 0.38 6 0.15 K decade 21 at 9-16 km for ROM SAF).A possible reason for the different trend values in RICH and RAOBCORE is that they only represent conditions over land, while in RS-VAIS this sampling bias is corrected.In addition, slightly stronger warming is observed around the tropical tropopause (;15-18 km) by RO compared to the radiosondes.Observed trends in the tropical stratosphere point to a slight cooling from about 20.1 to 20.22 K decade 21 in the lower stratosphere observed in RO and radiosondes, which increases to 20.45 K decade 21 in the upper stratosphere in the SSU observations.
The largest trend differences among the different datasets are evident in the tropopause region.In this region we are more confident in trends from RO observations than in those from radiosonde observations.Inai et al. (2015) reported a pressure bias in Vaisala RS80 sensors that could lead to an artificial trend in temperature.Transitioning from the RS80-dominated time period before 2007 to the RS92-dominated period thereafter may have contributed to weaker TTS trends in the radiosonde record.In addition, radiosonde information is sparser in the tropics and Southern Hemisphere and clustered over Northern Hemisphere continents.While we account for sampling biases in the RS-VAIS record, this is not the case for the gridded RAOBCORE and RICH radiosonde products.Furthermore, the gridded radiosonde records are given on standard pressure levels and do not fully resolve the tropopause region (see Fig. 11).In contrast, the RS-VAIS record provides higher vertical sampling and agrees better with RO.These differences in homogeneity, coverage, and vertical resolution must contribute to the principal trend differences between the vertically resolved data records in the tropopause region.
Slight differences in RO trends at stratospheric altitudes mainly stem from the early CHAMP period before 2006.Due to the larger noise of the CHAMP receiver, the impact of high-altitude initialization propagates further down in the temperature retrieval while later RO missions have improved receivers (Leroy et al. 2018;Shangguan et al. 2019;Steiner et al. 2020).To highlight this sensitivity, we compare trends from 2007 based on a standard linear fit. Figure 11 shows that RO trends from different centers are highly consistent and agree well with the RS-VAIS and the other gridded radiosonde records.We note that the linear changes in Fig. 11  statistically significant warming of the troposphere from about 0.25 to 0.35 K decade 21 .Relatively weak cooling is found throughout much of the stratosphere, apart from a distinct warming of about 0.5 K decade 21 above the southern tropical tropopause.The latter feature is likely due to variability in the short-term data record.Largest cooling is observed in the southern high-latitude stratosphere but is not significant due to large atmospheric variability.

d. Tropospheric amplification
In this section we further investigate the observed warming of the tropical upper troposphere and the evidence for tropical amplification of surface warming in the free troposphere.We computed temperature amplification metrics following Santer et al. (2005), calculating ratios of temperature standard deviation and trends at tropospheric vertical levels with respect to HadCRUT4 surface temperatures.We first examine the ratio between the temporal standard deviations of monthly mean tropospheric and surface temperature anomalies as a measure of month-to-month variation.We then inspect the ratio between the multidecadal trends of tropospheric and surface temperature anomalies.
These scaling ratios were computed for radiosondes, RO records, and MSU data and compared to theoretically expected scaling ratios from moist adiabatic lapse rate theory.The theoretical scaling factors were computed by taking the difference between two moist adiabats with specified tropical-mean surface temperatures of 1-K difference.The difference between these two adiabats at higher altitudes then corresponds to the scaling factor.We estimate a range of plausible surface temperatures in the tropics by averaging over ERA5 2 m temperature and humidity.These contain a minimum and maximum estimate as well as mean conditions (approximately 298-300 K and 73%-84% relative humidity).The scaling factor depends sensitively on the relative humidity at the surface.Assuming saturation at the surface would yield a maximum scaling factor of ;2.8 in the upper troposphere (;13 km), whereas with 80% relative humidity, which is closer to the observed average surface relative humidity in the tropics, the maximum scaling factor is ;2.4 at slightly lower altitudes.Secondary effects such as phase changes have only a small effect on the amplification (less than 0.1).
Comparison of the standard deviation ratios for the period 1979-2018 (Fig. 13a) shows that for short-term variability, the RICH and RAOBCORE radiosonde records follow the expected moist adiabatic lapse rate up to 8-10 km.Above 10 km, these radiosonde datasets show no further amplification.There is also close agreement between the theoretical expectation of the standard deviation ratios and the MSU TLT and TMTcorr results (with weighting function peaks near 2 and 5 km, respectively, which represent thick vertical layers but are indicated as points).MSU TTScorr shows no further amplification relative to TLT and TMTcorr.Now consider scaling ratios for multidecadal tropospheric and surface trends over the period 1979-2018 (Fig. 13b).The scaling ratios of RICH and RAOBCORE do not yield amplification with height and exhibit damping of surface warming (scaling factor , 1) above 10 km.The STAR MSU TMTcorr shows amplification that is consistent with theory, while RSS and UAH MSU data have (respectively) minimal amplification and less warming aloft than at the surface.For TTScorr, surface warming is damped in UAH and has minimal amplification in STAR, showing impact on trends from different bias correction algorithms in satellite merging.
For the RO period 2002-18, we find all radiosonde and RO data records show approximate agreement with the moist adiabatic amplification in terms of short-term variability (Fig. 13c); RO and RS-VAIS exhibit amplification that is larger than the theoretical expectation.For scaling ratios based on trends over 2002-18 (Fig. 13d), each of the datasets show amplification with altitude, but with considerable differences in detail.The RS-VAIS radiosonde and RO trends show strongest amplification (up to a factor of 3.0) in the upper troposphere above 8 km, while the RAOBCORE and RICH data have nearly constant amplification (;2.0) with height above ;3 km.MSU RSS and STAR data show small amplification with STAR TMTcorr being closest to the theoretical lapse rate.TTScorr does not show any amplification, with only slight amplification from UAH TTScorr.This is most likely because the TTScorr weighting function still contains nonnegligible cooling contribution from the lower stratosphere that causes the TTScorr to underestimate the warming trends in the upper troposphere.
When interpreting Fig. 13, one should note that only the best estimate trend curves are shown for clarity.
From Fig. 10, however, one can see that there is substantial uncertainty in trends derived from all datasets.In addition, the scaling ratios depend sensitively on the HadCRUT4 surface trends, and a reduction of the best estimate from 0.14 to 0.11 K decade 21 , which is well within the observational uncertainty range in the tropics (Morice et al. 2012), would enhance the scaling factors of the upper-air datasets by 20%.
Despite this caveat, the datasets in Fig. 13d provide evidence for amplified warming in the tropical upper troposphere over the RO period (2002-18).All observations except MSU TTScorr are able to represent the amplification of month-to-month variability, with RO and RS-VAIS showing larger upper-troposphere variability.For the multidecadal trends (Figs.13b,d), the STAR MSU TMTcorr shows an amplification while TTScorr does not.The gridded radiosonde datasets RAOBCORE and RICH show weaker upper-tropospheric amplification for the recent 2002-18 period, and no amplification for 1979-2018.The RO and RS-VAIS trends exhibit the largest increases in amplification with altitude.

Summary and conclusions
This study provides an overview of updated atmospheric temperature trends estimated from observations and assesses the uncertainties and limitations of currently available data records.Atmospheric observations over the satellite era  have been investigated from satellite-based sensors as well as from groundbased instruments.We constrained our analysis to data records that have been updated until the end of 2018.We analyzed layer-average temperature from new merged data records of microwave sounders (MSU-AMSU) provided by three data centers and from two merged records of stratospheric sounders (SSU-AMSU and SSU-MLS).We compared these datasets to lidar temperatures from four long-term stations and to two gridded homogenized radiosonde records.We also relied on novel GNSS RO from three different centers.The RO data are available since 2001 and have the advantage of long-term stability and high vertical resolution.
Atmospheric trends were estimated for layer-average temperatures and for vertically resolved temperatures based on global-mean and zonal-mean monthly time series.Trends were computed by applying a linear ordinary least squares fit as well as multiple regression analysis.Natural variability terms used in the regression analysis account for the solar cycle, ENSO, stratospheric volcanic eruptions, and the QBO.
Resulting trends show a cooling of the stratosphere with a robust temperature decrease of about 1-3 K over the last four decades.The rate of cooling evolves over time, particularly in the lower stratosphere.In the first half of the record, cooling is larger and interrupted by volcanically induced stratospheric warming signals.Lower-stratospheric cooling trends are weaker since about 1998.Stronger cooling trends reflect the decline of stratospheric ozone in the early period, while the partial recovery of ozone since about 1998 is consistent with smaller stratospheric cooling trends.In the upper stratosphere, it seems that cooling is enhanced since about 2015.
The latitude structure of trends shows a consistent picture of stratospheric cooling over all latitudes that increases with height.It amounts to about 20.25 K decade 21 in the lower stratosphere (TLS) up to from 20.5 to 20.7 K decade 21 in the mid-upper stratosphere (SSU1 to SSU3).At northern high latitudes, cooling is up to 1 K decade 21 , while it is weaker at southern high latitudes.Model calculations (Randel et al. 2017;Maycock et al. 2018) show large variability in polar stratospheric trends derived from 40-yr samples, especially in the Northern Hemisphere.This indicates that a forced response cannot be easily separated from internal variability and implies that there may be a nonnegligible internally generated component to the larger stratospheric trends in the Northern Hemisphere.The smaller trends over the Antarctic continent are reproduced in the ensemble model simulations of Randel et al. (2017), suggesting a link to systematic circulation changes in response to ozone depletion during the Austral summer.In the lower stratosphere, the latitudinal trend structure shows cooling over all latitudes and a near-zero trend at northern high latitudes.Changes in ozone and the BDC are important factors in the latitudinal pattern of the lower-stratospheric cooling trend, while greenhouse gases and ozone changes dominate above.
In the troposphere, the observed warming over the last four decades is roughly 0.6-0.8K, with prominent variability associated with ENSO.Tropospheric trends show significant warming over all latitudes, except at southern high latitudes.Largest warming trends occurred at northern high latitudes.Overall, the different types of atmospheric record show reasonable agreement.
Vertically resolved trends for the period 2002-18 based on radiosonde data and novel GNSS RO records have consistent warming trends in the tropical troposphere up to 14 km.While gridded radiosonde records (RICH and RAOBCORE) warm by about 0.2 K decade 21 throughout the troposphere, GNSS RO and Vaisala sonde-based radiosondes (RS-VAIS) yield larger warming of about 0.3 K decade 21 .In the lower tropical stratosphere, slight cooling is observed in the last two decades.
Over this shorter RO period, warming in the troposphere tends to be uniform over all latitudes except for southern high latitudes.Importantly, the RO period provides evidence for amplified warming of surface temperature trends in the tropical upper troposphere.This evidence is in high vertical resolution RO data and radiosonde records, both of which yield amplification behavior that is in approximate agreement with moist adiabatic lapse rate theory.
For trends over the longer period 1979-2018 the results are more ambiguous.While there is no evidence for tropospheric amplification in the gridded radiosonde records at standard pressure levels, amplification in the STAR MSU TMTcorr data is consistent with theory.For the upper-tropospheric MSU TTScorr, however, no amplification is currently observed over the full satellite era.
Differences in trends in the different types of measurement platform are generally largest in the tropopause region (13-18 km), which may be due to changes in the radiosonde instrumentation and sampling.RO observations are of highest quality in the upper-tropospherelower-stratosphere region.We are therefore more confident in trends in this region estimated from RO observations than in trends inferred from radiosonde observations.Further research is required to better explain remaining differences between different measurement types and between the estimates of one measurement type obtained by different groups.Although at stratospheric altitudes there are slight differences in RO trends over 2002-18 due to limitations in the early CHAMP mission, we found RO trends after 2006 to be highly consistent across different research groups.
There is also consistency between the overall latitudealtitude trend patterns from GNSS RO and radiosondes.Significant warming of the troposphere is clearly revealed by both measurement platforms, with values of approximately 0.25-0.35K decade 21 and distinct warming of about 0.5 K decade 21 above the southern tropical tropopause.
In conclusion, we find overall consistency in observed trends over 1979-2018 obtained from the latest observational records for satellite-based layer-average temperatures and vertically resolved radiosonde records.Novel records from GNSS RO, available for 2002-18, show consistent trends with radiosondes over this shorter period of record.
Long-term climate records are essential for gaining fundamental understanding of climate variability and climate trends, for identification of externally forced climate change, and for estimating the sensitivity of Earth's climate to the changing composition of trace gases and aerosols.To sustain long-term climate data records of key climate variables, it will be essential to maintain reliable long-term monitoring of Earth's climate with such platforms as the Global Climate Observing System (GCOS).
Acknowledgments.The authors express their gratitude to SPARC for supporting the ATC activity in their work described in this article.These activities have been undertaken under the guidance and sponsorship of the World Climate Research Programme.We acknowledge the groups at NOAA/STAR, RSS, and UAH who have produced and made available the satellite temperature data sets used in this study.We acknowledge UCAR/CDAAC (Boulder, CO, United States) for providing access to RO phase, orbit data, and temperature data, as well as to ECMWF (Reading, United Kingdom) for giving access to analysis and forecast data.We also thank the WEGC EOPAC team for providing the OPSv5.6RO data, and the ROM SAF team for providing the ROM SAF CDR v1.The lidar data used in this publication were obtained from the Norwegian Institute for Air Research (NILU) as part of the Network for the Detection of Atmospheric Composition Change (NDACC) and are publicly available (see http://www.ndacc.org).The GloSSAC data set was obtained from the NASA Langley Research Center Atmospheric Sciences Data Center.We acknowledge FU Berlin (Berlin, Germany) for providing Singapore wind data.For C.-Z. Zou, the views, opinions, and findings contained in this report are those of the authors and should not be construed as an official NOAA or U.S. Government position, policy, or decision.Part of this research was carried out at the Jet Propulsion Laboratory, California Institute of Technology, under a contract with the National Aeronautics and Space Administration (80NM0018D0004).Q. Fu was supported by the NASA grant 80NSSC18K1031.L. Haimberger was supported by FWF grant P28818-N29 and EU 7FP grant 607029 (ERA-CLIM2).A.K. Steiner and F. Ladstädter were funded by the Austrian Science Fund (FWF) under research grant P27724-NBL (VERTICLIM), and by the FFG-ALR project ATROMSAF1(ASAP-13 859771).

FIG. 1 .
FIG. 1. Vertical weighting functions of stratospheric and tropospheric temperature observations from the SSU (STAR) and MSU (RSS) instruments.

FIG. 2 .
FIG. 2. Normalized indices of atmospheric variability modes used in the multiple regression analysis for 1979-2018 (top to bottom) solar flux, SST Niño-3.4,aerosol index, and the first two principal components PC1 and PC2 of QBO winds.

FIG. 3 .
FIG. 3. Stratospheric time series 1979-2018 and trends of near-global averages (858S-858N) are shown for layer-average brightness temperatures from SSU and MSU records.Shown are (bottom to top) channel MSU TLS (13-22 km), and SSU channels SSU1 (20-40 km), SSU2 (25-45 km), and SSU3 (35-55 km).Data records from different centers are displayed, SSU-AMSU and SSU-MLS, as well as TLS from three centers RSS, STAR, and UAH.The temperature anomalies are plotted with respect to the period for 1980-2018.Overplotted are equivalent TLS brightness temperatures from RO since 2001 for comparison (anomalies mapped with reference period 1980-2018 minus 2002-18 from RSS).Linear trends are indicated for the full period and for split trends 1979-97 and 1998-2018 (except RO for September 2001-December 2018).Trend values are given in this order (left to right).

FIG. 4 .
FIG. 4. Stratospheric time series from four lidar stations in the form of SSU3 equivalents compared to collocated SSU3 time series from SSU-AMSU.Shown are lidar temperature anomalies for the stations (a) HOH, (b) OHP, (c) TMF, and (d) MLO.Linear trends for the respective time periods are indicated as well.

FIG. 5 .
FIG. 5. Tropospheric time series 1979-2018 and trends of near-global averages (858S-858N) are shown for layeraverage brightness temperatures from MSU records.Shown are MSU channels (bottom to top) TLT, TMTcorr, TMT, and TTScorr, peaking at about 2, 5, 5, and 10 km, respectively.Data records from different centers are displayed for RSS, STAR, and UAH.Linear trends are indicated for the period 1979-2018 (except TTScorr: STAR for 1981-2018, RSS for 1987-2018).The temperature anomalies are plotted with respect to the period for 1980-2018.

FIG. 8 .
FIG. 8. Upper-air temperature trends 1979-2018 from different observations for (a) near-global averages (708S-708N) and for (b) the tropics (208S-208N).Layer-average temperature trends are shown for MSU-AMSU (RSS, STAR, UAH) and for merged SSU records.Vertically resolved trends are shown for radiosonde records (RICH, RAOBCORE).Surface temperature trends from HadCRUT4 are also indicated.Trends were computed with multiple linear regression.Uncertainty of trends is indicated at the 95% confidence level.
FIG. 9. Percentage of variance explained by the regressors and by the residuals in tropical temperature anomalies 1979-2018 for the tropospheric channel MSU TMTcorr (RSS), and the stratospheric channels MSU TLS (RSS), and SSU1-SSU3 (SSU-MLS).Results are shown for one exemplary record per channel and are consistent for all respective datasets.
FIG. 10.Upper-air temperature trends 2002-18 from different observations for (a) near-global averages (708S-708N) and (b) the tropics (208S-208N).Layer-average temperature trends are shown for MSU-AMSU (RSS, STAR, UAH) and for merged SSU records.Vertically resolved trends are shown for radiosonde records (RS-VAIS, RICH, RAOBCORE) and for RO records (ROM SAF, UCAR/NOAA, WEGC).Surface temperature trends from HadCRUT4 are also indicated.Trends were computed with multiple linear regression.The uncertainty of trends is indicated at the 95% confidence level.

FIG. 12 .
FIG. 12. Altitude vs latitude resolved trends 2002-18 shown for (a) RO (ROM SAF) and (b) radiosondes (RICH).Trends were computed with multiple regression analysis.Trend values that are significant at the 95% confidence level are indicated with dots.

TABLE 1 .
Overview on observational datasets, version, time period, horizontal format, references.Datasets in italics are discussed but not used in the analysis.
Ho and X. Zhou 2020, unpublished manuscript).An overview on RO data processing and a description of retrieval steps for each specific dataset is given inSteiner et al. (2020, their Table