1. Introduction
Extreme temperatures have profound societal, ecological, and economic impacts. It is known that average temperatures in the conterminous United States since 1900 have warmed on aggregate, with the west, northern Midwest, and New England showing the most warming and the Southeast showing little change (Lund et al. 2001). In fact, a linear trend estimate for the conterminous U.S. series of Menne et al. (2010), which aggregates over a thousand stations in the region on a day-by-day basis since 1895, is about 0.7°C century−1. This trend applies to mean temperatures.
It is less clear whether minimum and/or maximum temperatures have changed during this period. In fact, maxima and averages are statistically independent in large samples. Specifically, if {Xt} is a stationary time series, then
This paper proposes methods to accurately estimate trends in monthly extreme temperature time series and applies these methods to the U.S. record. Specifically, monthly maximum time series from 923 stations and monthly minimum time series from 932 stations located in the 48 conterminous United States are examined. Here, a monthly extreme temperature is the highest/lowest daily high/low temperature observed during the calendar month. For example, an extreme June high is the largest daily high temperature observed over 1–30 June. Hereafter, MaxTmax and MinTmin are used to denote monthly maximum and minimum temperatures, respectively.
Other authors have studied changes in extreme temperatures. For example, DeGaetano and Allen (2002) and DeGaetano et al. (2002) fix temperature thresholds and examine trends in the frequency of exceedances of these thresholds. These studies find that low temperatures are changing more (getting warmer) than high temperatures are (staying approximately the same), a pattern that we affirm later. While this gives good rudimentary guidance, it does not incorporate the magnitudes of the extreme exceedances. Peterson et al. (2008) also studies changes in the frequency of exceedances above and below various thresholds (such as the 10th and 90th distributional percentiles) and reaches similar conclusions for North America as a whole; again, the study does not consider the magnitude of the exceedances. Peterson et al. (2008) conjectures that the changes could be due to increasing carbon dioxide and/or increasing precipitation (among other factors). Van de Vyver (2012), in perhaps the most methodologically similar study to ours, quantifies changes in extremes via extreme value peaks over threshold methods; however, gauge and station relocation effects are not considered and that study only considers Belgium.
Two prominent issues tackled below involve periodicities and changepoints. The MaxTmax and MinTmin series tend to be more variable than monthly averaged series. Intuitively, this is because averaging pulls quantities toward a mean, while any extreme observation can set a record (persistence of the temperature is not needed to set an extreme). The MaxTmax and MinTmin series have periodic cycles in their mean and variance that are as pronounced as those for series of monthly average temperatures, with winter extremes being cooler and more variable than summer extremes. This increased variability makes changepoints harder to detect than changepoints in monthly averaged series; MinTmin series are slightly more variable than MaxTmax series.
Changepoints here refer to mean shift structures that are induced by station location moves, gauge changes, etc. Moving a station can shift average temperatures by several degrees. U.S. stations average roughly six relocations, gauge changes, or time of observation changes per century (Mitchell 1953). Lu and Lund (2007) and the references therein show that neglecting changepoint information can give unreliable trend estimates for monthly averaged series at a local station. Many changepoints are undocumented in station metadata records; for monthly averaged series, roughly half of the documented changepoint times do not impart actual mean shifts. Below, we investigate whether or not changepoints in MaxTmax and/or MinTmin series are also changepoints in monthly averaged series and how they correspond to the metadata record. However, given length restrictions, our picture is somewhat incomplete. Section 7 will affirm the importance of changepoints in trend estimation for MaxTmax and MinTmin series. DeGaetano et al. (2002) recognize the importance of homogenizing extreme data for changepoint effects. Homogenized data are also useful in other climate studies.
One issue common to all trend studies involves what type of trend function to fit. For simplicity and ease of interpretability, this study examines linear trends only. While true temperature changes are surely nonlinear in time, linear trends describe average changes over the period of record and provide good rudimentary guidance. Linear trend studies are ubiquitous in the climate sciences (Jones 1988; Bloomfield and Nychka 1992; Lund et al. 1995; Fomby and Vogelsang 2002).
The rest of this paper proceeds as follows. Section 2 describes our data and introduces a series from Jacksonville, Illinois, that will be analyzed as a case study in section 7. Sections 3–6 then discuss methods to obtain our trend estimates. In particular, section 3 presents the extreme value statistical background needed to estimate trends in MaxTmax and MinTmin time series. Here, it is shown how to obtain trend estimates in monthly extreme series under a general specified changepoint configuration. Section 4 describes how reference series are constructed. This is used in section 5 to estimate the unknown changepoint numbers and locations in our MaxTmax and MinTmin series. Here, the fact that station minus reference series are approximately Gaussian is used to estimate an initial changepoint configuration (this configuration is later refined). Technical algorithmic issues are collected into section 6. Section 7 then moves to a case study of the MaxTmax time series from Jacksonville, Illinois. The intent of this section is to inject some feel for the analyses, including changepoint issues. Section 8 reports results for all stations, and section 9 concludes with comments and a summary.
2. Data description
Our monthly extremes are taken from the National Climatic Data Center’s (NCDC’s) U.S. Historical Climatology Network (USHCN) data. The USHCN data contain daily maximum and minimum temperatures for 1218 stations located throughout the 48 conterminous United States through December of 2010. The USHCN data are available online (http://cdiac.ornl.gov/epubs/ndp/ushcn/ushcn.html).
Erroneous data entries do exist, but are not overly prevalent. Sometimes erroneous negative signs or extra digits were keyed in with the data. For example, some observations exceed the U.S. record high temperature (56.7°C at Death Valley, California, on 10 July 1913) or are lower than the U.S. record low (−56.7°C at Rogers Pass, Montana, on 20 January 1954). The NCDC has flagged inconsistent entries with quality control checks. All flagged temperatures are regarded as missing. Burt (2004) contains a good compilation of U.S. temperature records. The Death Valley extreme is now regarded as the world’s hottest naturally occurring temperature (El Fadli et al. 2013).
Almost all stations in the USHCN daily data set have some missing data. One missing daily observation could alter a monthly MaxTmax or MinTmin value if the extreme, in truth, occurred on that missing day. Because of this, a monthly extreme is flagged as missing if one or more of the days within that month are missing. About 75% of the months in the MaxTmax series are nonmissing.
Stations where data begin or end in the interior of a calendar year are cropped to full calendar years: each station’s record begins with a January observation and ends with a December observation. This simplifies our notation and analysis. After this cropping, a station is required to have at least 75 yr of data with a missing rate of at most 33.3%, or have at least 50 yr of record with a missing rate of at most 5%, to make this study.
These requirements leave 923 stations with analyzable MaxTmax series. Figure 1 graphically depicts the spatial location of these stations. The spatial coverage over the 48 conterminous United States is reasonable (longitudes and latitudes of the stations are used later). When the above requirements are applied to construct the MinTmin series, 932 stations remain. The spatial coverage of the MinTmin series is similar to that for the MaxTmax series. Missing data are assumed to occur “completely at random” in time; accounting for other structures (e.g., missing values might be more likely to occur immediately after a changepoint) is beyond our scope.
Map of MaxTmax station locations.
Citation: Journal of Climate 27, 11; 10.1175/JCLI-D-13-00283.1
The longest MaxTmax record comes from Atlantic City, New Jersey (137 yr), and the shortest MinTmin record occurs at three stations, Bedford and Reading, Massachusetts, and Las Cruces, New Mexico (51 yr). Over 75% of the MaxTmax and MinTmin series start between 1890 and 1910. Figure 2 presents a time series plot of the MaxTmax and MinTmin series for Jacksonville, Illinois. This station will be analyzed in detail in section 7. The Jacksonville MaxTmax series begins in January 1896, ends in December 2010, and has 115 yr of monthly data with a missing rate of 4.42%. The Jacksonville minimum series spans for 114 yr, January 1897–December 2010, with a missing rate of 16.23%. Both extreme series exhibit periodicity, with winter temperatures being cooler and more variable than summer temperatures. The seasonal variability cycle is seen by comparing the year-to-year jaggedness of the summer peaks (smaller) to their winter counterparts (larger).
(top) Monthly MaxTmax series from January 1896 to December 2010 and (bottom) monthly MinTmin series from January 1897 to December 2010 at Jacksonville, Illinois (°C).
Citation: Journal of Climate 27, 11; 10.1175/JCLI-D-13-00283.1
3. Statistical background: Extreme value methods
This section narrates our extreme value statistical methods. Extreme value methods are techniques especially suited for extreme data. Extreme data are often highly skewed and non-Gaussian; because of this, suboptimal inferences can be made if Gaussian techniques are used. In practice, one simply replaces a Gaussian likelihood with a generalized extreme value (GEV) likelihood; unfortunately, most of the common closed-form parameter estimator expressions in Gaussian series may be suboptimal in extreme settings. The reader is cautioned not to use Gaussian methods in extreme analyses without serious thought—different conclusions could be made (Gumbel 1958; Reiss and Thomas 2007). Also worth mentioning are quantile regression techniques (Koenker 2005). Quantile regression methods examine changes in a fixed quantile (e.g., the 95th percentile of the distribution) over time. As about 30 calendar days exist in a month, one could view our analysis as approximate 96.667 and 3.333 quantile analyses (there are technical differences between extreme and quantile analyses).


Because the data are extreme series, autocorrelation in {Xt} is not allowed in our analysis. While correlation is not totally absent in extremes, month-to-month temperature extremes typically exhibit weaker dependence than month-to-month temperature averages (again, any irregular observation can set a monthly extreme while monthly sample means are pulled toward a central tendency via the daily averaging). This said, some dependence in the extremes likely exists. A cold snap, for example, occurring on the last two days of a month and the first two days of the next month, might set MinTmin values for both months. Some authors (Smith 1989; Coles 2001) combat this dependence by blocking runs of hot and cold temperatures into distinct blocks, and then report the extreme of all days within each block. Unfortunately, blocking is not feasible given the changepoint and periodicity features considered, automation issues, and the large number of stations in our study. Also, correlation often does not appreciably change the limiting GEV distribution of the extremes (see Leadbetter et al. 1983). Residual autocorrelation plots will be analyzed later to scrutinize this issue in finite samples. The issue is not found to be overly problematic.

A standard error
4. Reference series for changepoint estimation
The likelihood in (6) applies to cases where the changepoint numbers and times are known (specified). Unfortunately, this is not the case in practice. Whereas files exist showing some of the station relocation and instrumentation change histories (the so-called metadata), these files are notoriously incomplete—many changepoints are not entered into the metadata logs. Of the changepoint times that are documented, only about half of these induce true shifts in average series. Because daily temperatures are often formed by averaging daily maximum and minimum temperatures, one hopes that any inhomogeneity time in the extremes will also be an inhomogeneity time in the means (and vice versa). On a strictly mathematical level, neither condition implies the other. Elaborating, if each day’s high becomes one degree warmer and each day’s low becomes one degree cooler, the extremes will change but the means will not. Monthly means can also change without altering the extremes (add a degree to each day’s high whenever this high is not the monthly high). In what follows, we hope to impart some feel for how the extreme and mean changepoints relate, and whether or not they correspond to the metadata.
Trends for individual stations are usually distrusted if homogenization has not been first attempted. The case study in section 7 will reinforce this point with extreme series. Our homogenization methods take the classic reference series approach. A reference series is a series taken from a location near the series being studied (the series being studied is called the target series). A good reference series is relatively changepoint free and experiences similar weather to the target. The target minus reference subtraction serves to reduce variabilities, seasonal cycles, and autocorrelation, thereby illuminating the locations of any shifts. In good target minus reference comparisons, the seasonal mean cycle and variances are “reduced” compared to those in the target series.
The reference methods in this section allow us to construct a reasonable reference series for each target series, which in turn will allow us to estimate the changepoint times and locations in the target series. Once the changepoint counts and location times are estimated, it is relatively easy to fit the GEV model to {Xt} and obtain an estimate of the trend.
Multiple reference series for a given target series are often helpful (Menne and Williams 2005, 2009). Current NCDC methods often compare over 40 or more references to a given target (Menne and Williams 2009) before making changepoint conclusions. Issues arise in the comparisons. Foremost, any changepoint in the reference series will likely impart a changepoint in the target minus reference series—one adds to the changepoint numbers by making reference comparisons. Menne and Williams (2009) devise the so-called pairwise algorithm to address this issue. The pairwise procedure becomes complicated when assigning which station is responsible for an occurring changepoint, especially when there is disagreement among the reference comparisons. To keep changepoint issues manageable but realistic, our approach will construct a composite reference series by averaging many individual reference series. Strength is gained by considering multiple references, but issues of additional changepoints induced by the reference series are minimized by the averaging.
For each station, the 100 nearest (“as the crow flies”) neighboring stations are first identified. Since good reference stations are heavily correlated with the target, the correlation between the target and these 100 nearest neighbors is next computed. This correlation is computed after differencing at lag T = 12. Differencing at lag T = 12 eliminates the seasonal cycle and most of the changepoint location shifts. In fact, δt − δt−T is nonzero only when one or more changepoints occur between times t − T and t. Lag one differencing may not eliminate the seasonal cycle in the series and should be avoided with monthly data unless the seasonal cycle has been a priori removed in some other reliable way.


Our reference series will average the 40 neighboring series with the largest correlation, as computed in (7), to the target. One caveat is made in selecting these 40 stations: only stations whose correlation to the target, as computed in (7), exceeds 0.5 are used. Subtracting a reference series whose correlation does not exceed 0.5 can actually increase data variability. For our 923 MaxTmax stations, only 76 stations had less than 40 candidate reference series with the required >0.5 correlation. Should there be less than 40 such reference stations, our composite reference simply averages over the number of stations that have the required correlation. Only four MaxTmax series had no references (and these are only for the maxima series): Eureka, California; Fort Lauderdale, Florida; Tarpon Springs, Florida; and Brookings, Oregon. Interestingly, these stations are all coastal and are known for microclimates, especially Eureka. These four stations are analyzed without a reference.
A more subtle issue involves the starting date for some of the longer series. Specifically, no reference station exists for the January 1874 data point at Atlantic City, New Jersey, the longest record in the study. To accommodate, the starting year of the Atlantic City series was advanced to 1901, which is the median starting year of the 40 reference stations with the highest correlation over times that are common to both records. By doing this, there are at least 20 reference stations at all times past 1901 for the Atlantic City series. A similar rubric is used for ending years, although this issue arises less frequently. If data are missing in one or more of the references, the composite reference is simply averaged over the number of references with nonmissing data. Because of this, composite reference series do not usually have any missing data.
Figure 3 shows our composite reference series for the MaxTmax and MinTmin time series at Jacksonville, Illinois. Figure 4 displays histograms of the target minus reference differences. While not truly Gaussian (a formal Shapiro–Wilks normality test is not passed at a 5% significance level), it may be surprising that the target minus reference series’ marginal distribution is not radically non-Gaussian (they are certainly unimodal).
Jacksonville composite reference series from 40 neighbors for the (top) MaxTmax and (bottom) MinTmin series (°C).
Citation: Journal of Climate 27, 11; 10.1175/JCLI-D-13-00283.1
Histogram of the Jacksonville target minus reference series (top) maxima and (bottom) minima (°C).
Citation: Journal of Climate 27, 11; 10.1175/JCLI-D-13-00283.1
Computations with the target minus reference series reveal seasonal means and variances (neither of these features was completely eliminated by the target minus reference differencing), but no other periodic structure. Elaborating, the coherence tests of Lund et al. (1995) were applied to assess whether or not the target minus reference series is stationary after subtraction of a linear trend and monthly sample means and division by a monthly sample standard deviation. Figure 5 shows a coherence plot with a 99% pointwise confidence threshold for these seasonally adjusted differences for the Jacksonville MaxTmax series. As there are no large exceedances of the 99% threshold, one concludes that the target minus reference series, beyond monthly means and variances, has no additional periodic structure. This simplifies our changepoint model in the next section.
Average squared coherences for the seasonally adjusted Jacksonville maxima target minus reference series. The absence of values exceeding the pointwise 99% confidence threshold suggests that no periodic features remain in the series.
Citation: Journal of Climate 27, 11; 10.1175/JCLI-D-13-00283.1
Other stations were also scrutinized; in all cases, the conclusions are the same as those for the Jacksonville MaxTmax series. Hence, we move to our next task—finding the changepoint locations in any target minus composite reference series.
5. MDL estimation of the changepoint configuration
Suppose that a target minus composite reference difference series {Dt}, where





We now develop the penalty term in (8). In computing an MDL penalty, three principles are needed. First, the penalty for a real-valued parameter estimated from g data points is log2(g)/2. Second, the penalty for an integer-valued parameter I that is known to be bounded by the integer M is log2(M). If no bound for I is known, the parameter is penalized log2(I) units. Third, the model penalty P is obtained by adding the penalty for all individual parameters.









A serious computational issue now arises. It is not feasible to compute the penalized likelihood in (8) over all possible changepoint numbers k and configurations {τ1, …, τk} when N is large. Indeed, there are
6. Genetic algorithm and spatial smoothing methods
A genetic algorithm (GA), which is essentially a Markov stochastic search, will be used to estimate the number of changepoints and their times in the target minus composite reference difference series. As changepoint effects in the composite series should be minimal, any identified changepoints are attributed to the target series. The GA development here is similar to that in Li and Lund (2012), but has seasonal aspects.
Genetic algorithms are described via chromosomes. Chromosomes here have the form (k; τ1, …, τk) and contain all changepoint information. Each different chromosome is viewed as a different individual in a population. One can compute an MDL score for a fixed chromosome from the methods in the last subsection. Individuals in the population are termed fitter (relatively) when they have a smaller (relatively) MDL score.
GAs need to breed two chromosome configurations, called the mother and father, in a probabilistic manner to form a child. The better fit individuals will be more likely to breed and pass on their chromosomes to the next generation, thus mimicking natural selection principles. Suppose a generation contains L individuals (we use L = 200 later). A mother and father are selected from these L chromosomes as follows. The ith chromosome is selected as the father with probability
From a mother and father chromosome, a child chromosome is randomly generated as follows. Suppose (i; ς1, …, ςi) and (j; τ1, …, τj) are the mother and father chromosomes, respectively. The child’s chromosome is produced in three steps. First, the mother’s and father’s chromosomes are combined by forming the chromosome (i + j; κ1, …, κi+j). Here, the κℓs contain the ordered changepoint times of both mother and father. The number of changepoints is strictly less than i + j should the mother and father have some common changepoint times. Second, the κℓ changepoint times are then either retained or discarded with independent coin flips with success probability 0.5. This acts to thin the number of changepoints. Finally, we allow the changepoint times that remain to move their locations slightly: each changepoint location stays the same with probability 0.4, moves to one time smaller with probability 0.3, or moves to one time larger with probability 0.3 (subject to the changepoint time being in {1, …, N}). For example, with N = 8, suppose that a mother and father have the chromosome (1; 6) and (3; 3, 5, 6), respectively. Then the child chromosome is first set to (3; 3, 5, 6). Three fair coins are then flipped independently. Should this have resulted in success, failure, and success, the chromosome is thinned to (2; 3, 6). Two draws from the above location shift generation mechanism might then, for example, keep the time 3 changepoint where it is and shift the time 6 changepoint to 7. This yields the end chromosome (2; 3, 7). Once one child is generated, the process is repeated until L new children are formed. These children represent the next generation. We do not allow different children to have the exact same chromosome; however, a mother and father could be the parents of more than one child.
Mutation is an aspect of GAs added to prevent premature convergence to poor solutions (local minima). Our mutation mechanism allows a small portion of children to have “extra changepoints.” Specifically, after each child is formed from its parents, each and every nonchangepoint time is independently allowed to become a changepoint time with probability pmut. In the computations below, pmut = 0.003 is used.
In this manner, successive generations are simulated. The solution to the optimization problem is taken as the fittest chromosome in the terminating generation. One terminates the GA when there is little or no improvement to the fittest member of a few successive generations. The specifics of how this is done are usually of little consequence.

The head-banging algorithm is a robust median-polished smoother that capably extracts general structure from noisy data. It is named from a child’s game where a face is banged against a board of nails protruding at various lengths, leaving an impression of the face, but smoothing the residual nail lengths. The algorithm is recursive in nature and unwieldy to quantify with equations [refer to Hansen (1991) for details]. However, head-banging techniques are local median smoothing methods that group stations into many subsets of neighboring stations, over which median trends are taken. Taking local medians accounts for spatial correlation in the trend estimates in a nonparametric manner. To run the head-banging algorithm, one only needs to set a parameter, called the number of triples. The number of triples essentially represents the number of neighboring stations that will be used to compute the smoothed values.
7. A case station study
This section examines the monthly MaxTmax series from Jacksonville, Illinois, introduced in section 2. The GA estimates two changepoints at the times 49 (January 1900) and 730 (October 1956). The estimated AR(1) coefficient in the fitted model (9) is
(top) Jacksonville monthly maxima temperature anomaly, (middle) reference temperature anomaly, and (bottom) Jacksonville minus reference difference anomaly, with estimated changepoint times demarcated with dashed vertical lines.
Citation: Journal of Climate 27, 11; 10.1175/JCLI-D-13-00283.1
When the two changepoints are ignored and the GEV model is fitted—this allows for general monthly means and a first-order Fourier expansion for
Jacksonville GEV monthly location estimates and their standard errors (all units are in °C).
Aspects change when the two changepoints are considered. While all changepoints are deemed to induce significant location shifts by the GA, they may not be significant as judged by the GEV likelihood. We discard all GEV nonsignificant changepoint times. Elaborating, the least “GEV significant” changepoint time (at the 5% significance level) is identified, and the GEV likelihood is refitted ignoring this changepoint time. This yields improved estimates of the location shifts and their standard errors. Such a “backwards elimination process” is repeated until all changepoints are deemed GEV significant at level 5%. Elaborating, the jth changepoint, where j = 2, …, k + 1, is GEV significant if Δj − Δj−1 is significantly nonzero. To gauge this, the Z score
For the Jacksonville MaxTmax series,
The crux here is that the estimated trend
GEV estimated linear trend line with mean shifts included (solid) and ignored (dashed) for the monthly Jacksonville maxima anomalies with seasonal cycle removed.
Citation: Journal of Climate 27, 11; 10.1175/JCLI-D-13-00283.1
It is worth comparing the changepoints found in the Jacksonville MaxTmax series to the metadata and to those found in a corresponding monthly average series from Jacksonville. To obtain changepoints in the monthly average series, a Gaussian MDL analysis akin to that in Lu et al. (2010) was used—a reference series was again constructed from the 40 most correlated neighbors. Table 2 lists our findings. The metadata identify station location or temperature gauge changes in 1895, 1927, 1962, and 1974. There is no metadata record after 1986. The MDL analysis of the monthly averaged series estimates changepoints in 1900, 1931, 1941, 1961, and 1970. Obviously, there are fewer changepoints in the extreme series than the mean series. This is expected as extremes are more variable than averages—abrupt shifts in them should be relatively harder to detect. This said, 1900 is estimated as a changepoint time in both the average and MaxTmax series. A changepoint is listed in the metadata at 1895. The 1961 estimated changepoint for averages corresponds better to the 1962 metadata changepoint than the estimated 1956 changepoint in the MaxTmax series, but not radically so. Obviously, additional study on this issue is needed.
Jacksonville changepoint comparison.
The Jacksonville MaxTmax trend estimate, with all significant extreme changepoints estimated via the GA, is
Sample autocorrelation function (ACF) of the seasonally scaled monthly Jacksonville MaxTmax residuals.
Citation: Journal of Climate 27, 11; 10.1175/JCLI-D-13-00283.1
8. Results for all stations
Trend estimates for the monthly MaxTmax series for all 923 stations are displayed in Fig. 9. While it may be surprising that 583 of the 923 stations had negative trends, a “warming hole” in the eastern United States has been previously noted (Lund et al. 2001; Robinson et al. 2002; DeGaetano and Allen 2002; Lu et al. 2005; Kunkel et al. 2006; Meehl et al. 2012, among many others). A histogram of the 923 GEV trends is supplied in Fig. 10. The estimated trends are right-skewed with a median trend of −0.63 and a 90% interior range (5th–95th percentile) between −3.56 and 2.97 (all units here are °C century−1). The head-banging algorithm was applied to the raw trends with a smoothing parameter of 10 triples. The result is depicted in Fig. 11. Here, color shades run from bright red (the most warming) to deep blue (the most cooling). In aggregate, maximum temperatures are decreasing in the eastern United States, with the exception of New England. In contrast, the western U.S. maximum temperatures are slightly warming for the most part. Head-banging smoothed Z scores for the trends are displayed in Fig. 12. Because of the larger absolute Z scores, our inferences are most confidently made for the majority of the eastern United States (cooling) and the southern Rockies (warming).
Map of GEV trends of U.S. monthly MaxTmax series (°C century−1).
Citation: Journal of Climate 27, 11; 10.1175/JCLI-D-13-00283.1
Histogram of the GEV trends of U.S. monthly MaxTmax series (°C century−1).
Citation: Journal of Climate 27, 11; 10.1175/JCLI-D-13-00283.1
Head-banging smoothed GEV trends of U.S. monthly MaxTmax series (°C century−1). The eastern United States shows cooling and the western United States shows warming.
Citation: Journal of Climate 27, 11; 10.1175/JCLI-D-13-00283.1
Z scores for the GEV trends of U.S. monthly MaxTmax series.
Citation: Journal of Climate 27, 11; 10.1175/JCLI-D-13-00283.1
The trends for monthly MinTmin series exhibit a different pattern. The raw trends for the 932 stations are displayed in Fig. 13. Here, the majority of the trends are increasing (728 of the 932 stations). A histogram of the trends is shown in Fig. 14. The median of the 932 trends is 1.65 and the 5th–95th percentile of trends spans from −2.15° to 5.53°C century−1. The head-banging smoothed minimum trends (10 triples again) are shown in Fig. 15. With the exception of two localized pockets in the Southeast and Colorado, cooling is sparse. Head-banging smoothed Z scores for the trends (10 triples again) are displayed in Fig. 16. The Southeast is the most significantly cooling location; confidence in warming is comparatively large for the western United States, northern Midwest, and New England.
Map of GEV trends of U.S. monthly MinTmin series (°C century−1).
Citation: Journal of Climate 27, 11; 10.1175/JCLI-D-13-00283.1
Histogram of the GEV trends of U.S. monthly MinTmin series (°C century−1).
Citation: Journal of Climate 27, 11; 10.1175/JCLI-D-13-00283.1
Head-banging smoothed GEV trends of U.S. monthly MinTmin series (°C century−1).
Citation: Journal of Climate 27, 11; 10.1175/JCLI-D-13-00283.1
Z scores for the GEV trends of U.S. monthly MinTmin series.
Citation: Journal of Climate 27, 11; 10.1175/JCLI-D-13-00283.1
For overall conclusions, the average trend in the MaxTmax series is −0.468°C century−1 with an average standard deviation of 2.048°C century−1 (over all 923 stations). The average trend in the MinTmin series is 1.646°C century−1 with an average standard deviation of 2.385°C century−1 (this is over 932 stations). It is interesting to assess the role of changepoints in these trend estimates. The average number of GEV significant changepoints is 1.74 for the MaxTmax series and 1.91 for the MinTmin series. When GEV likelihoods are fitted to the MaxTmax series without changepoint effects, the average trend is −0.420°C century−1; the corresponding average trend in the MinTmin series when changepoints are ignored is 1.013°C century−1. Including changepoints has slightly increased cooling in MaxTmax series and appreciably increased warming in the MinTmin series. Figure 17 displays a histogram of the estimated magnitudes of all GEV significant changepoint shifts for all stations. About 95% of the GEV significant changepoint magnitudes are within ±5.0°C for maxima and ±5.9°C for minima.
Histogram of the estimated shift magnitudes of all GEV significant changepoints for U.S. (top) MaxTmax and (bottom) MinTmin series (°C).
Citation: Journal of Climate 27, 11; 10.1175/JCLI-D-13-00283.1
We now do some comparisons. First, trend calculations were conducted when the only changepoint times allowed are those specified in the metadata. Specifically, GEV trend calculations were made when the only changepoints allowed are gauge changes or station relocation times that are listed in the metadata. Any metadata changepoint time that does not induce a significant series shift is eliminated in a backward regression procedure, akin to what was done before. When only metadata changepoint times are used, the average trend in MaxTmax series increases from −0.468° to 0.151°C century−1; the average trend in MinTmin series changes from 1.646° to 1.615°C century−1. Table 3 summarizes our findings.
Trend quantile and mean comparisons (°C century−1).
Second, Table 4 examines the effect of differing starting times on the computed trends. This table summarizes estimated trends in the MaxTmax and MinTmin series in aggregate (averaged over all stations) for varying starting times, including 1900, 1950, and 1979. The table shows that the most warming has taken place recently. Here, the changepoint times were taken as those that were estimated by the GA algorithm.
Trend mean comparisons (°C century−1) for different series starting dates.
Comparing to the trends in mean U.S. temperatures reported in Lu et al. (2005), similar spatial patterns emerge: a cooling southeastern United States with warming elsewhere. One noticeable discrepancy perhaps is the cooling seen here in MaxTmax series in the southern Great Lakes and Ohio Valley. The overall results also support the statement that minimum temperatures are warming much more rapidly than maximum temperatures, a hypothesis generally believed to be consistent with warming induced by carbon dioxide and/or increasing precipitation (Karl et al. 1991; Jones et al. 1999).
9. Comments and conclusions
About 75% of MaxTmax and MinTmin series start within 10 years of 1900. Since a linear trend is constant over time, to minimize statistical variability, it makes sense to use the longest record possible. Balanced against this, true climate trends are surely nonlinear, perhaps stemming from multidecadal variability (Williams et al. 2012). While our diagnostics do not reveal radical departures from linear assumptions, adding nonlinear components would improve the methods here. As Table 4 shows, trend estimators from different time segments do vary.
An improvement to our methods would allow estimation of the changepoint times and locations in the GEV likelihood, accounting for autocorrelation and reference station aspects. This would eliminate the Gaussian analysis of target minus references series step to estimate the station changepoint configuration. We attempted to develop such methods but failed. Handling autocorrelation in extremes is difficult.
It would be desirable to more deeply understand the differences between changepoints in extreme and mean series, along with how they relate to the metadata records. Such “validations” have been conducted for trends in means (Wendland and Armstrong 1993; Vose et al. 2005; Menne et al. 2009; Williams et al. 2012; Zhang et al. 2012). It is worth repeating that accounting for changepoints made overall trends cooler in MaxTmax series and made overall trends warmer in MinTmin series. In contrast, accounting for changepoint aspects substantially increased mean monthly maximum trends and slightly decreased mean monthly minimum trends (Menne et al. 2009). Reasons for this are worthy of further study and should provide greater confidence in our results.
Overall, the climate change inferences made here support the conclusions of previous authors that minimum temperatures are increasing more than maximum temperatures (Karl et al. 1991; Jones et al. 1999; DeGaetano and Allen 2002; Peterson et al. 2008). We also find that the western United States is warming more than the eastern United States, consistent with Kunkel et al. (2006) and Menne et al. (2009). As changepoint information is crucial for obtaining a realistic trend estimate at a specific location, warming and/or cooling conclusions for larger-scale regions may also depend on changepoint aspects, especially if a particular type of changepoint is systematic (e.g., a transition to electronic resistance thermometers or location moves to an airport). Impacts of changepoints on trends have already been verified in the literature, including Menne et al. (2009) and Williams et al. (2012) at regional (conterminous U.S.) scales and Lawrimore et al. (2011) on a global scale.
Acknowledgments
Jaechoul Lee’s research was supported by National Science Foundation Grant DMS 1107225. Robert Lund’s and Shanghong Li’s research was supported by National Science Foundation Grant DMS 0905770. Comments made by three referees significantly improved this article. The authors acknowledge color graphics help from Dongmei Wang.
REFERENCES
Bloomfield, P., and D. Nychka, 1992: Climate spectra and detecting climate change. Climatic Change, 21, 275–287, doi:10.1007/BF00139727.
Brockwell, P. J., and R. A. Davis, 1991: Time Series: Theory and Methods. 2nd ed. Springer, 579 pp.
Burt, C. C., 2004: Extreme Weather: A Guide and Record Book. W. W. Norton, 304 pp.
Coles, S., 2001: An Introduction to Statistical Modelling of Extreme Values. Springer, 208 pp.
Davis, R. A., T. C. M. Lee, and G. A. Rodriguez-Yam, 2006: Structural break estimation for nonstationary time series models. J. Amer. Stat. Assoc., 101, 223–239, doi:10.1198/016214505000000745.
DeGaetano, A. T., and R. J. Allen, 2002: Trends in twentieth-century temperature extremes across the United States. J. Climate, 15, 3188–3205, doi:10.1175/1520-0442(2002)015<3188:TITCTE>2.0.CO;2.
DeGaetano, A. T., R. J. Allen, and K. P. Gallo, 2002: A homogenized historical extreme dataset for the United States. J. Atmos. Oceanic Technol., 19, 1267–1284, doi:10.1175/1520-0426(2002)019<1267:AHHTED>2.0.CO;2.
El Fadli, K. I., and Coauthors, 2013: World meteorological organization assessment of the purported world record 58°C temperature extreme at El Azizia, Libya (13 September 1922). Bull. Amer. Meteor. Soc., 94, 199–204, doi:10.1175/BAMS-D-12-00093.1.
Fomby, T. B., and T. J. Vogelsang, 2002: The application of size-robust trend statistics to global-warming temperature series. J. Climate, 15, 117–123, doi:10.1175/1520-0442(2002)015<0117:TAOSRT>2.0.CO;2.
Gallagher, C., R. B. Lund, and M. Robbins, 2013: Changepoint detection in climate series with long-term trends. J. Climate, 26, 4994–5006, doi:10.1175/JCLI-D-12-00704.1.
Gumbel, E. J., 1958: Statistics of Extremes. Columbia University Press, 375 pp.
Hansen, K. M., 1991: Head-banging: Robust smoothing in the plane. IEEE Trans. Geosci. Remote Sens., 29, 369–378, doi:10.1109/36.79427.
Jones, P. D., 1988: Hemispheric surface air temperature variations: Recent trends and an update to 1987. J. Climate, 1, 654–660, doi:10.1175/1520-0442(1988)001<0654:HSATVR>2.0.CO;2.
Jones, P. D., M. New, D. E. Parker, S. Martin, and I. G. Rigor, 1999: Surface air temperature and its changes over the past 150 years. Rev. Geophys., 37, 173–199, doi:10.1029/1999RG900002.
Karl, T. R., G. Kukla, V. N. Razuvayev, M. J. Changery, R. G. Quayle, R. R. Heim Jr., D. R. Easterling, and C. B. Fu, 1991: Global warming: Evidence for asymmetric diurnal temperature change. Geophys. Res. Lett., 18, 2253–2256, doi:10.1029/91GL02900.
Katz, R. W., and B. G. Brown, 1992: Extreme events in a changing climate: Variability is more important than means. Climatic Change, 21, 289–302, doi:10.1007/BF00139728.
Koenker, R., 2005: Quantile Regression. Cambridge University Press, 349 pp.
Kunkel, K. E., X.-Z. Liang, J. Zhu, and Y. Lin, 2006: Can CGCMs simulate the twentieth-century “warming hole” in the central United States? J. Climate, 19, 4137–4153, doi:10.1175/JCLI3848.1.
Lawrimore, J. H., M. J. Menne, B. E. Gleason, C. N. Williams, D. B. Wuertz, R. S. Vose, and J. Rennie, 2011: An overview of the Global Historical Climatology Network monthly mean temperature data set, version 3. J. Geophys. Res.,116, D19121, doi:10.1029/2011JD016187.
Leadbetter, M. R., G. Lindgren, and H. Rootzén, 1983: Extremes and Related Properties of Random Sequences and Processes. Springer-Verlag, 336 pp.
Li, S., and R. B. Lund, 2012: Multiple changepoint detection via genetic algorithms. J. Climate, 25, 674–686, doi:10.1175/2011JCLI4055.1.
Lu, Q., and R. B. Lund, 2007: Simple linear regression with multiple level shifts. Can. J. Stat., 35, 447–458, doi:10.1002/cjs.5550350308.
Lu, Q., R. B. Lund, and P. L. Seymour, 2005: An update of U.S. temperature trends. J. Climate, 18, 4906–4914, doi:10.1175/JCLI3557.1.
Lu, Q., R. B. Lund, and T. C. M. Lee, 2010: An MDL approach to the climate segmentation problem. Ann. Appl. Stat., 4, 299–319, doi:10.1214/09-AOAS289.
Lund, R. B., H. Hurd, P. Bloomfield, and R. L. Smith, 1995: Climatological time series with periodic correlation. J. Climate, 8, 2787–2809, doi:10.1175/1520-0442(1995)008<2787:CTSWPC>2.0.CO;2.
Lund, R. B., P. L. Seymour, and K. Kafadar, 2001: Temperature trends in the United States. Environmetrics, 12, 673–690, doi:10.1002/env.468.
Lund, R. B., X. L. Wang, Q. Lu, J. Reeves, C. Gallagher, and Y. Feng, 2007: Changepoint detection in periodic and autocorrelated time series. J. Climate, 20, 5178–5190, doi:10.1175/JCLI4291.1.
McCormick, W., and Y. Qi, 2000: Asymptotic distribution for the sum and maximum of Gaussian processes. J. Appl. Prob., 37, 958–971.
Meehl, G. A., J. M. Arblaster, and G. Branstator, 2012: Mechanisms contributing to the warming hole and the consequent U.S. east–west differential of heat extremes. J. Climate, 25, 6394–6408, doi:10.1175/JCLI-D-11-00655.1.
Menne, M. J., and C. N. Williams Jr., 2005: Detection of undocumented changepoints using multiple test statistics and composite reference series. J. Climate, 18, 4271–4286, doi:10.1175/JCLI3524.1.
Menne, M. J., and C. N. Williams Jr., 2009: Homogenization of temperature series via pairwise comparisons. J. Climate, 22, 1700–1717, doi:10.1175/2008JCLI2263.1.
Menne, M. J., C. N. Williams Jr., and R. S. Vose, 2009: The U.S. Historical Climatology Network monthly temperature data, version 2. Bull. Amer. Meteor. Soc., 90, 993–1007, doi:10.1175/2008BAMS2613.1.
Menne, M. J., C. N. Williams Jr., and M. A. Palecki, 2010: On the reliability of the U.S. surface temperature record. J. Geophys. Res.,115, D11108, doi:10.1029/2009JD013094.
Mitchell, J. M., Jr., 1953: On the causes of instrumentally observed secular temperature trends. J. Meteor., 10, 244–261, doi:10.1175/1520-0469(1953)010<0244:OTCOIO>2.0.CO;2.
Peterson, T. C., X. Zhang, M. Brunet-India, and J. L. Vázquez-Aguirre, 2008: Changes in North American extremes derived from daily weather data. J. Geophys. Res.,113, D07113, doi:10.1029/2007JD009453.
Reiss, R. D., and M. Thomas, 2007: Statistical Analysis of Extreme Values with Applications to Insurance, Finance, Hydrology and Other Fields. 3rd ed. Springer, 530 pp.
Robinson, W. A., R. Reudy, and J. E. Hansen, 2002: General circulation model simulations of recent cooling in the east-central United States. J. Geophys. Res., 107, 4748, doi:10.1029/2001JD001577.
Smith, R. L., 1989: Extreme value analysis of environmental time series: An application to trend detection in ground-level ozone. Stat. Sci., 4, 367–377, doi:10.1214/ss/1177012400.
Van de Vyver, H., 2012: Evolution of extreme temperatures in Belgium since the 1950s. Theor. Appl. Climatol., 107, 113–129, doi:10.1007/s00704-011-0456-2.
Vose, R. S., D. R. Easterling, and B. Gleason, 2005: Maximum and minimum temperature trends for the globe: An update through 2004. Geophys. Res. Lett., 32, L23822, doi:10.1029/2005GL024379.
Wendland, W. M., and W. Armstrong, 1993: Comparison of maximum–minimum resistance and liquid-in-glass thermometer records. J. Atmos. Oceanic Technol., 10, 233–237, doi:10.1175/1520-0426(1993)010<0233:COMRAL>2.0.CO;2.
Williams, C. N., M. J. Menne, and P. W. Thorne, 2012: Benchmarking the performance of pairwise homogenization of surface temperatures in the United States. J. Geophys. Res.,117, D05116, doi:10.1029/2011JD016761.
Zhang, J., W. Zheng, and M. J. Menne, 2012: A Bayes factor model for detecting artificial discontinuities via pairwise comparisons. J. Climate, 25, 8462–8474.