1. Introduction
Estimating tornado and casualty (injuries and fatalities) counts (hereinafter referred to as characteristics) of severe weather outbreaks is an important and challenging problem. It is important because of the potential for loss of life and property damage. It is challenging because of the uncertainties associated with exactly how many and where the tornadoes will occur; however, progress is being made. Guidance from dynamical models help forecasters outline areas of possible severe weather threats days in advance (Hitchens and Brooks 2014) while guidance from statistical models helps forecasters quantify probabilities for given severe weather events (Thompson et al. 2017; Cohen et al. 2018; Elsner and Schroder 2019; Hill et al. 2020). For example, Cohen et al. (2018) use a regression model to specify the probability of tornado occurrence given certain environmental and storm-scale conditions (circulation above radar level, rotational velocity, circulation diameter, etc.). Elsner and Schroder (2019) extend this model by making use of the cumulative logistic link function that estimates probabilities for each damage rating using storm-relative helicity, bulk shear, and convective available potential energy (CAPE). These studies put statistical guidance for estimating severe weather outbreak characteristics on a firm mathematical foundation (Cohen et al. 2018; Elsner and Schroder 2019). Room for additional work in this area motivates the present study. For instance, the cumulative logistic regression (Elsner and Schroder 2019) provides a distribution for the percentage of tornadoes within each enhanced-Fujita (EF) rating category (Fujita 1981), but a model is needed to estimate the overall number of tornadoes given the likelihood of at least some tornadoes.
Tornado outbreaks pose a risk of significant loss of life and property. Anderson-Frey and Brooks (2019) consider the role environmental factors play in the number of outbreak fatalities. They use self-organizing maps on the significant tornado parameter (STP) and find that more damaging tornadoes (>EF3) present a higher risk for fatalities. However, they also note that both deadly and nondeadly tornadoes are associated with high values of STP. Self-organizing maps are useful for describing the role of environmental variables on casualties, but a statistical model is needed to quantify the relationship between casualty counts and environmental factors. Here we demonstrate a method to model “outbreak”-level tornado and casualty counts from environmental conditions and predefined tornado clusters. A cluster is defined (informally) as a group of 10 or more tornadoes occurring over a relatively short time scale (e.g., one day) and over a relatively limited spatial domain (e.g., one–three states) (Fig. 1). The model allows us to quantify the associative relationships between environmental variables and tornado counts. Moreover, the approach might eventually help to extend the available statistical guidance for predicting outbreak characteristics particularly when combined with other models.
Example tornado clusters. Each point is the tornadogenesis location shaded by EF rating. The black line is the spatial extent of the tornadoes occurring on that convective day and is defined by the minimum convex hull encompassing the set of locations.
Citation: Weather, Climate, and Society 13, 3; 10.1175/WCAS-D-20-0130.1
In this paper, we focus on outbreaks rather than on individual tornadoes. The larger space and time scales associated with the outbreak matches our interest in the larger-scale environmental factors like CAPE and shear. In what follows, we call the outbreaks “clusters” as is done in Schroder and Elsner (2019) because we make no attempt to associate the cluster with a particular weather system. We do not reference the weather feature(s) responsible for the cluster, although it is likely that many of them are associated with a single synoptic-scale system.
Clusters in the United States are most frequent during April, May, and June (Dixon et al. 2014; Tippett et al. 2012; Dean 2010) with most of them occurring across the central plains and the Southeast. Clusters are less common in the Southeast and the southern plains during the summer months as the jet stream migrates north taking the necessary wind shear with it (Concannon et al. 2000; Gensini and Ashley 2011; Jackson and Brown 2009). The percentage of all tornadoes occurring in clusters has recently been found to be increasing over time (Moore 2017; Tippett et al. 2016; Elsner et al. 2015; Brooks et al. 2014).
This paper has two objectives: 1) to demonstrate that environmental conditions prior to the occurrence of any tornadoes can be used to skillfully model the number of tornadoes in a cluster containing at least 10 tornadoes (tornado-count model), and 2) to show that these same environmental conditions can be used to estimate the number of casualties if the number of people “in harm’s way” is known (casualty-count model). We accomplish these objects by fitting negative binomial regressions to cluster-level data including environmental variables and tornado and casualty counts on a convective day (24-h period between 1200 and 1200 UTC) when the number of tornadoes is at least 10 (see Elsner and Schroder 2019). The paper is outlined as follows: The data and methods are discussed in section 2c, including the mathematics of a negative binomial regression. Statistics describing the response (i.e., tornado-casualty counts) and environmental variables and model results are given in section 3. A summary of the results with conclusions is given in section 4.
2. Data and methods
We fit regression models to a set of tornado and reanalysis data aggregated to the level of tornado clusters. Here, we describe how we organize the data and the procedures to aggregate values to the cluster level. For our purposes, a cluster is a group of at least 10 tornadoes occurring relatively close to one another in both space and time over a convective day. Ten is chosen as a compromise between too few clusters leading to greater uncertainty and too many clusters leading to excessive time required to fit the models (Elsner and Schroder 2019). Ten is also the number that is sometimes used formally to define an outbreak (Galway 1977; Anderson-Frey et al. 2018). The number of tornadoes in each cluster is the response variable in the tornado-count regression model, and the number of casualties is the response variable in the casualty-count regression model. Explanatory variables include outbreak size and location as well as environmental variables from reanalysis data representing conditions before the occurrence of the first tornado in the cluster.
a. Tornado clusters
First, we extract the date, time, genesis location, and magnitude of all tornado reports between 1994 and 2018 from the Storm Prediction Center (SPC; https://www.spc.noaa.gov/gis/svrgis/). We choose 1994 as the start year because it is the first year of the extensive use of the WSR-88D (Heiss et al. 1990). In total, there are 30 497 national tornado reports during this period. The geographic coordinates for each genesis location are converted to Lambert conic conformal coordinates, where the projection is centered on 96°W longitude.
Next, we assign to each tornado a cluster identification number based on the space and time differences between genesis locations. Two tornadoes are assigned the same cluster identification number if they occur close together in space and time (e.g., 1 km and 1 h). When the difference between individual tornadoes and existing clusters surpasses 50 000 s (~14 h), the clustering ends. The space–time differences have units of seconds because we divide the spatial distance by 15 m s−1 to account for the average speed of tornado-producing storms. This speed is commensurate with the magnitude of the steering-level wind field across the clusters. The clustering is identical to that used in Elsner and Schroder (2019), who developed a cumulative logistic model to the damage scale at the individual tornado level. Additional details on the procedure, as well as a comparison of the identified clusters to well-known outbreaks, are available in Schroder and Elsner (2019).
We keep only clusters having at least 10 tornadoes occurring within the same convective day, which results in 768 clusters with a total of 17 069 tornadoes. The average number of tornadoes per cluster is 22 and the maximum is 173 (27 April 2011). There are 80 clusters with exactly 10 tornadoes. Each cluster varies by area and by where it occurs geographically (see Fig. 1 for examples of clusters). The cluster area is defined by the minimum convex hull (black polygon) that includes all of the tornado genesis locations. An example of a small cluster is the 19 July 1994 cluster with nine tornadoes over northern Iowa and one over northwestern Wisconsin had an area of 33 359 km2 and lasted about 4 h. The largest cluster in the data is the 27 April 2011 cluster with 173 tornadoes spread over more than a dozen states, and it had an area of 1 064 337 km2 with tornadoes occurring throughout the 24-h period (from 1200 UTC day 1 to 1200 UTC day 2).
For each cluster, we sum the number of injuries and deaths across all tornadoes to get the cluster-level number of casualties (sum of injuries and fatalities). Further, we estimate the population within the cluster area and the geographic center of the cluster. Population values are U.S. Census Bureau estimates in cities with at least 40 000 people (Steiner 2019). Population is used as an explanatory variable in place of cluster area in the casualty-count model.
We encountered situations in which there were multiple clusters on a given day. For example, there were two clusters on 11 November 1995 (Fig. 2). The first cluster was responsible for 15 tornadoes resulting in three casualties. The second cluster was responsible for 10 tornadoes resulting in four casualties. Both clusters were the result of a single cold front along the eastern coast of the United States. However, they are not grouped into a single cluster because the minimum between-tornado space–time distance was larger than our threshold.
Example of multiple clusters on a single convective day. Each point is a tornado genesis location. The black line is the spatial extent of the tornadoes for each cluster and is defined by the minimum convex hull encompassing the set of locations.
Citation: Weather, Climate, and Society 13, 3; 10.1175/WCAS-D-20-0130.1
b. Environmental variables
Large-scale environmental conditions for producing tornadoes are well studied and include large magnitudes of convective available potential energy, bulk shear, and weak convective inhibition (Brooks et al. 1994; Rasmussen and Blanchard 1998; Thompson et al. 2003; Shafer and Doswell 2011; Doswell et al. 2006). We obtain variables associated with these environmental conditions from the National Centers for Atmospheric Research’s North American Regional Reanalysis (NARR), which is supported by the National Centers for Environmental Prediction (Mesinger et al. 2006). Each variable has numeric values given on a 32-km raster grid with the values available in 3-h increments starting at 0000 UTC. In the severe weather literature, these environmental variables are called “parameters.” However, since we employ statistical models, here we call them variables to be consistent with the statistical literature where the word “parameter” denotes unknown model coefficients and moments of statistical distributions (e.g., the mean).
We select environmental variables at the nearest 3-h NARR time prior to the occurrence of the first tornado in the cluster. For example, we use the environmental variables given at 1500 UTC if the first tornado in a cluster occurs at 1630 UTC. This selection criteria results in a sample of the environment that is less contaminated by the deep convection itself but at a cost that underestimates the severity in cases where environmental conditions rapidly change favoring tornado development. About 60% of all clusters have the initial tornado occurring between 1800 and 0000 UTC (Table 1). However, there are more tornadoes in clusters when the first tornado occurs between 1500 and 1800 UTC on average.
Cluster statistics by time of day. Each cluster is categorized by the closest 3-h time (defined by the NARR data) prior to the first tornado.
The environmental variables that we consider include CAPE and convective inhibition (CIN) as computed using the near-surface layer (from 0 to 180 hPa above the ground level) consistent with Allen et al. (2015b). We also include deep layer bulk shears (DLBS; 1000–500 hPa) and shallow layer bulk shears (SLBS; 1000–850 hPa) computed as the square root of the sum of the squared differences between the u- and υ-wind components at the respective levels consistent with Tippett et al. (2012). Climate researchers use these NARR variables at the climatological scale as proxies for the more traditional variables used in forecasting severe weather (Allen et al. 2015b; Moore et al. 2016; Tippett et al. 2012).
We take the highest (lowest for CIN) value across the grid of values within the area defined by the cluster’s convex hull (Fig. 3). This is done to capture environmental conditions that represent the unadulterated pretornado environment. In contrast, the mean (or median) value is influenced by conditions throughout the domain including earlier occurring non-tornado-producing convection and in areas within the clusters that did not experience tornado activity. Histograms of the maximums (not shown) show no evidence of extreme behavior.
Example of the environmental factors for the 6 May 2003 tornadoes. The black line is the spatial extent of the cluster on that convective day. Shading represents the intensity of the environment. CAPE is purple, storm-relative helicity is blue, CIN is green, and deep-layer shear is red. The black square is the location of the maximum value for the environmental factor.
Citation: Weather, Climate, and Society 13, 3; 10.1175/WCAS-D-20-0130.1
We do not include storm-relative helicity (SRH), lifted condensation level (LCL), or dewpoint temperature (DEW) in this research, although these variables have proven to be indicators of favorable environments for tornado production. SRH is not used because it is correlated with DLBS and SLBS (Table 2). Likewise, dewpoint temperature and LCL height are not used because of their relatively high correlation with CAPE. A model that included SRH, DEW, and LCL height as predictors showed no significant improvement over a model without them. Further, we do not use composite variables including the STP and the supercell composite parameter (SCP). STP, for example, is the product of variables including CAPE, SRH, CIN, and LCL height. A moderate value of STP can result from either high CAPE and low shear or low CAPE and high shear environments holding the other variables constant. Here we separate this composite relationship to examine the direct relationships between CAPE and shear on tornado activity at the scale of outbreaks.
Correlation matrix of the environmental variables (defined in the text) that are considered in this study. Variables that are highly correlated (value > 0.5) are in boldface type. Therefore, CAPE, CIN, DLBS, and SLBS are used as explanatory variables in the models.
c. Negative binomial regression
We fit a negative binomial regression model to the cluster-level data. We chose this type of regression because the response variable in the tornado (and casualty) model is a count. A count variable is described by a discrete distribution like the Poisson or negative binomial rather than by a continuous distribution like the normal (Gausssian). The choice of which discrete distribution is made in favor of the negative binomial since the mean number of tornadoes (casualties) per cluster is substantially smaller than the variance in the number of tornadoes (casualties), whereas the Poisson distribution assumes the mean is equal to the variance.
Regression model skill is evaluated by comparing the observed tornado and casualty counts with what is predicted by the model. The predicted rates for each cluster are obtained by plugging the values of the associated explanatory variables into the model. Predicted rates are under dispersed (lower variation) relative to the observed counts. Comparisons are made using the metrics of Pearson correlation coefficient and mean absolute error. Predictive skill using these metrics is evaluated using in-sample and out-of-sample predictions. In-sample predictions are made using all clusters to fit a single model while out-of-sample predictions are made by successively holding one cluster out of the model fitting procedure and using the particular model to predict the counts from the cluster left out (hold-one-out cross validation; see Elsner and Schmertmann 1994).
3. Results
a. Descriptive statistics
The number of clusters decreases exponentially with an increasing number of tornadoes per cluster (Fig. 4). There are 80 clusters with 10 tornadoes but only 10 clusters with 30 tornadoes. The right tail of the count distribution is long with the 27 April 2011 cluster having 173 tornadoes [47 (6%) of the clusters have more than 50 tornadoes and are not shown]. However, more clusters have 20 or 21 tornadoes than expected from a simple decay function. This deviation is unlikely the result of physical processes, and it appears too large to be sampling variability. It might be due to a consistent rounding of the totals to the nearest 5 or 10. There is an upward trend in the number of tornadoes per cluster (not shown) consistent with recent studies (Elsner et al. 2015). The distribution of casualties is also skewed toward many clusters having only a few casualties and a few have many; 36% of all clusters (275) are without a casualty, and 56% of the clusters have fewer than four casualties.
Histograms of the (a) number of clusters by number of tornadoes and (b) number of clusters by number of casualties. The histograms are right truncated at 50 to show detail on the left side of the distributions. Only clusters with at least 10 tornadoes are considered in this study.
Citation: Weather, Climate, and Society 13, 3; 10.1175/WCAS-D-20-0130.1
There is a seasonality to the chance of at least one tornado cluster (Fig. 5). The empirical 7-day probability of at least one cluster is between 20% and 30% for much of the year except between the middle of March and early July (Fig. 5a). The probabilities approach 80% between mid- and late May The number of tornadoes per cluster is less variable ranging between about 10 and 35 tornadoes per week with no strong seasonality although clusters during July and August tend to have somewhat fewer tornadoes (Fig. 5b). The casualty rate, defined as the number of casualties per 100 000 people within the cluster area, has a distinct seasonality with rates being highest between the periods of March to April and August to September (Fig. 5c).
(a) Probability of a cluster, (b) average number of tornadoes per cluster, and (c) average number of casualties per 1 000 000 people per cluster by week of the year.
Citation: Weather, Climate, and Society 13, 3; 10.1175/WCAS-D-20-0130.1
Across the 768 clusters the mean of the maximum values of CAPE is 2225 J kg−1 and the mean of the minimum values of CIN is −114 J kg−1 (Table 3). The maximum deep-layer bulk shear values range from 5.6 to 47.9 m s−1. Cluster areas range from 361 to 1 064 337 km2 with an average of 167 990 km2.
Variables considered in the regression models. Values include the range and the average across the 768 tornado clusters.
b. A model for the number of tornadoes
First, we fit a negative binomial regression to the cluster-level tornado counts using the explanatory variables given in Table 3. This is our tornado-count model. We divide the cluster area by 10 million so that it has units of 100 km2. We divide CAPE by 1000 so it has units of 1000 J kg−1 and we divide CIN by 100 so it has units of 100 J kg−1. This simplifies interpretation of the model coefficients but does not affect the goodness of fit.
All terms have signs on the coefficient that are physically reasonable (Table 4). The number of tornadoes in a cluster increases with cluster area, CAPE, and bulk shear (deep and shallow layers) and increases for decreasing CIN (i.e., less inhibition) as expected. The significance of the variable in statistically explaining tornado counts is assessed by the corresponding z value given as the ratio of the coefficient estimate to its standard error (SE). We reject the null hypothesis that a particular variable has no explanatory power if its corresponding p value is less than 0.01. Here, we fail to reject the null hypothesis for the variables latitude, longitude, and year, which indicates that these nonphysical variables have a relatively small impact on tornado counts relative to the physical variables given the data and the model. In particular, there is no significant trend over time in the number of tornadoes in these clusters. The only physical variable that is not statistically significant is CIN. This is likely a result of the NARR data being too coarse to adequately represent CIN. We remove all statistically insignificant variables before fitting a final model.
Coefficients in the tornado-count models. The size parameter n is 6.27 ± 0.393 [standard error (SE)] for the initial model and 6.25 ± 0.392 (SE) for the final model.
All variables in the final model are significant although the magnitudes of the coefficients have changed a bit relative to their values in the initial model. The in-sample correlation between the observed counts and predicted rates is 0.59 [(0.54, 0.64), 95% uncertainty interval (UI)] (Fig. 6). We find that the model is not improved by using the average values of these same environmental variables. The model statistically explains almost 60% of the variation in cluster-level tornado counts but tends to over predict the number of tornadoes for smaller clusters and slightly under predict the number of tornadoes for larger clusters. We test the significance of area and find the model performance decreases by 25% when excluding area from the model. The mean absolute error between the observed counts and expected rates is 8.6 tornadoes or 5.2% of the range in observed counts and 9.3% of the range in predicted rates. The out-of-sample errors are quite similar due to the large sample size (768 clusters). A hold-one-out cross validation exercise (Elsner and Schmertmann 1994) results in an out-of-sample correlation of 0.58 and a mean absolute error of 8.6 tornadoes. The lag-1 temporal autocorrelation in cluster-level tornado counts is 0.13.
Observed cluster-level tornado counts vs predicted rates from a negative binomial regression. The thin black line is the line of best fit. The thick line is the slope of the model indicating the relationship between the observed and predicted tornado counts, and the shading indicates the associated standard error.
Citation: Weather, Climate, and Society 13, 3; 10.1175/WCAS-D-20-0130.1
The value of β0 (Table 4) is the regression estimate when all variables in the model are evaluated at zero. The effect size for a given explanatory variable is given by the magnitude of its corresponding coefficient. The coefficient is expressed as the difference in the logarithm of the expected tornado counts for a unit increase in the explanatory variable holding the other variables constant. For example, the scaled units of CAPE are 1000 J kg−1. An increase in CAPE of 1000 J kg−1 results in an [exp(0.0459) − 1] × 100% = 4.7% increase in the expected number of tornadoes, conditional on at least 10 tornadoes. Continuing, units of deep-layer bulk shear are 10 m s−1 so that an increase in shear of 10 m s−1 results in a 13% increase in the expected number of tornadoes. A similar increase in shallow-layer bulk shear results in an 11.1% increase in the number of tornadoes.
For example, on 12 April 2020, the 1200 UTC guidance from the SPC convective outlook defined an area with a 10% chance of at least one tornado occurring within 40 km of any location (10% tornado risk). The area of the polygon was approximately 400 000 km2 (much larger than the average cluster area) centered on Mississippi (Fig. 7). With an area of that size, the model estimates the probability of at least 30 tornadoes for a range of deep-layer shear values and conditional on the amount of CAPE while holding shallow-layer shear at the average value of all clusters (Fig. 8). Given an average amount of shallow-layer shear, a deep-layer shear of 10 m s−1 and low CAPE (fifth-percentile value), the model predicts a 17% [(9%, 26%) UI] chance of at least 30 tornadoes (given a cluster with at least 10 tornadoes). In contrast, given a deep-layer shear of 40 m s−1 and high CAPE (95th percentile value), the model predicts a 65% [(56%, 71%) UI] chance of at least 30 tornadoes. There were more than 100 tornadoes on that day.
Convective outlook (shapefiles: https://www.spc.noaa.gov/cgi-bin-spc/getacrange.pl?date0=20200412&date1=20200412) issued by the SPC at 1200 UTC 12 Apr 2020, along with the locations of tornado reports (https://www.spc.noaa.gov/climo/reports/200412_rpts.html) over the 24-h period starting at that time. The outlook category numbers indicate the chance of observing severe weather within 40 km of any location.
Citation: Weather, Climate, and Society 13, 3; 10.1175/WCAS-D-20-0130.1
Estimated probability of at least 30 tornadoes given an outbreak of at least 10 tornadoes and the regression model. The predicted count from the model is a parameter in a negative binomial distribution with cluster area set at 400 000 km2, and shallow-level bulk shear is set to its mean value.
Citation: Weather, Climate, and Society 13, 3; 10.1175/WCAS-D-20-0130.1
The model quantifies the empirical relationship between CAPE and, independently, shear in terms of a probability distribution on the number of tornadoes. It predicts the expected count given values for the explanatory variables. The negative binomial distribution uses the model’s predicted count and the size parameter to generate a distribution of probabilities. For example, the model gives predicted probabilities across a range of CAPE and deep-layer shear values (holding shallow-layer shear at its mean value) that provides a picture of the relationship (Fig. 9). The predicted probabilities of at least 30 tornadoes given an outbreak covering an area of 400 000 km2 increase from low values of both CAPE and shear to high values of both CAPE and shear.
Estimated probability of at least 30 tornadoes given an outbreak of at least 10 tornadoes and the regression model across a range of CAPE and deep-layer bulk shear values while holding the shallow-layer bulk shear at a mean value.
Citation: Weather, Climate, and Society 13, 3; 10.1175/WCAS-D-20-0130.1
c. Sensitivity of the results to cluster definition
The clustering method is taken from Schroder and Elsner (2019), in which a sensitivity analysis was performed to examine the reliability of the resulting clusters. They examined various stopping thresholds and determined that a space–time distance of 50 000 s best matched the subjectively identified outbreaks of Forbes (2006) with an agreement of 88%. Larger and smaller stopping thresholds resulting in lower agreement percentages. Still this objective method results in clusters with varying levels of tornado density (tornadoes per unit area), which might influence the results of the regression model. For example, Fig. 1 shows that the 6 June 1999 cluster has much lower tornado density relative to the 5 February 2008 cluster.
To directly test the sensitivity of our cluster definition, we first correlate the model residuals (observed count minus the predicted rate) for each cluster with the tornado density. The Pearson product-moment correlation coefficient is 0.02 indicating that tornado density is not a significant factor in the model’s ability to predict the conditional tornado counts from the environmental variables. Second, we refit the model using all clusters except the five clusters having the lowest tornado density. The mean absolute error is only marginally improved from 8.6 to 8.4 tornadoes, providing further evidence that model results are not particularly sensitive to the inclusion of clusters with low tornado density.
d. A model for the number of casualties
Next, we fit a negative binomial regression to the cluster-level casualty counts (direct injuries and deaths) using the same explanatory variables (Table 3) with the exceptions that population (scaled by 100 000 residents) replaces cluster area and C (casualty count) replaces T (tornado count) as the dependent variable. This is our casualty-count model. We find that CIN is the only variable not significant in the initial model (Table 5). We remove it before fitting a final model.
Coefficients in the casualty-county models. The size parameter n is 0.261 ± 0.014 (SE) for the initial and final models.
The in-sample correlation between the observed casualty counts and predicted rates is 0.43 [(0.37, 0.48) UI] (Fig. 10). We test the significance of population and find that the model performance decreases by 12% when excluding population from the model. The mean absolute error between the observed counts and expected rates is 39 casualties or 1.3% of the range in observed counts and 3.4% of the range in predicted rates. The out-of-sample correlation is 0.36 and the mean absolute error is 40 casualties. The skill is lower than the skill of the tornado-count model as there is additional uncertainty associated with the number of casualties given a tornado. The lower skill is also a result of the many other factors that can influence casualties including demographic variables and location (see summary and conclusions below).
Observed cluster-level casualty counts vs predicted rates from a negative binomial regression. Clusters without casualties are plotted at the far left. The thin black line is the line of best fit. The thick line is the slope of the model indicating the relationship between the observed and predicted casualty counts, and the shading indicates the associated standard error.
Citation: Weather, Climate, and Society 13, 3; 10.1175/WCAS-D-20-0130.1
As expected from the tornado-count model, the number of casualties resulting from a cluster of tornadoes increases with CAPE and with the two bulk shear variables (Table 5), which is consistent with Anderson-Frey and Brooks (2019). Holding all other variables constant, an increase in CAPE of 1000 J kg−1 results in a 28% increase in the expected number of casualties. An increase in deep-layer bulk shear of 10 m s−1 results in a 98% increase in the expected number of casualties per cluster and a similar increase in shallow-layer bulk shear results in a 76% increase in the expected number of casualties per cluster, conditional on at least 10 tornadoes. Additionally, the model indicates that casualties decrease at a rate of 3.6% yr−1. This is very likely the result of improvements made by the National Weather Service in warning coordination and dissemination leading to better awareness especially for these large outbreak events.
Also, as expected, the number of people in harm’s way is a significant explanatory variable for the cluster-level casualty count. The relationship between population and number of casualties is quantified at the tornado level in Elsner et al. (2018) and Fricker et al. (2017) so we expect the relationship to hold at the cluster level. Here, we are able to compare the influence of shear and CAPE on the probability of casualties as modulated by population (Fig. 11). Model results are shown for three levels of population. The probability of a large number of casualties increases with increasing shear and increasing CAPE, while keeping the other variables at their mean values and the year at 2018.
Probability of at least 50 tornado casualties as a function of (left) DLBS and (right) CAPE and modulated by the number of people in harm’s way. The other variables are set at their mean values, and the year is set at 2018.
Citation: Weather, Climate, and Society 13, 3; 10.1175/WCAS-D-20-0130.1
Importantly, we also find that where the cluster occurs has a significant influence on the number of casualties consistent with other studies (Ashley and Strader 2016; Fricker and Elsner 2019). For every 1°N move in latitude, the casualty rate decreases by 5.5%; for every 1°E move in longitude, the casualty rate increases by 2.9%. Thus, cluster-level casualties are highest over the Southeast. This effect is independent of the number of tornadoes since location was not a significant factor in the tornado-count model. The result is also independent of the number of people in harm’s way since population is included as an exploratory variable in the model.
To visualize the difference of the combined effects of latitude and longitude on the difference in the probability of many casualties, we plot modeled casualty probabilities (at least 25) as a function of CAPE and deep-layer shear for two hypothetical outbreaks that are the same in every way except that one outbreak is centered on Sioux City, Iowa (42.5°N, 96.4°W), and the other is centered on Birmingham, Alabama (33.5°N, 86.8°W) (Fig. 12). The modeled probabilities are lowest (around 5%) for low CAPE and shear values and highest (above 30%) for high CAPE and shear values. The difference in modeled probabilities across these two locations peaks at about +12 percentage points for high-CAPE and high-shear regimes when the outbreak is centered on Birmingham.
Probability of at least 25 tornado casualties as a function of DLBS and CAPE and modulated by location for two hypothetical outbreaks: one centered over Sioux City and the other centered over Birmingham. The SLBS is set to its mean value, the year is set to 2018, and population is set to 4 million.
Citation: Weather, Climate, and Society 13, 3; 10.1175/WCAS-D-20-0130.1
4. Summary and conclusions
Estimating characteristics of severe weather outbreaks (e.g., tornado and casualty counts) is challenging but important. Forecasters use a combination of numerical weather prediction and empirical guidance to outline areas of severe convective weather. Here, we demonstrate a statistical regression model that can take advantage of the large sample of independent tornado “outbreaks” as a way to statistically explain the number of tornadoes and the number of casualties in a cluster of at least 10 tornadoes. We fit negative binomial regressions to tornado characteristics aggregated to the level of tornado clusters where a cluster is a space–time group of at least 10 tornadoes occurring between 1200 UTC day 1 and 1200 UTC day 2 over the period 1994–2018. The number of tornadoes in each cluster is the response variable in the tornado-count model, and the number of casualties (deaths plus injuries) is the response variable in the casualty-count model. Environmental explanatory variables for the models are extracted from reanalysis data representing conditions before the occurrence of the first tornado in the cluster consistent with Schroder and Elsner (2019). Additional explanatory variables include cluster area, population, location, and year.
The estimated tornado rates, conditional on there being at least 10 tornadoes, explain 59% of the observed tornado counts in-sample, and the estimated casualty rates explain 43% of the observed casualty counts in-sample. Because of the large sample size, the out-of-sample skill is lower but still useful. The models show that a 1000 J kg−1 increase in CAPE results in a 4.7% increase in the expected number of tornadoes conditional on at least 10 tornadoes and a 28% increase in the expected number of casualties, holding the other variables constant. The models further show that a 10 m s−1 increase in deep-layer bulk shear results in a 13% increase in the expected number of tornadoes and a 98% increase in the expected number of casualties, holding the other variables constant. This research is consistent with Anderson-Frey and Brooks (2019) who found the number of tornadoes and casualties to increase with both CAPE and shear. This study quantifies these increases. The casualty-count model also shows a significant decline in the number of casualties at a rate of 3.6% yr−1. Casualty rates depend on where the outbreak occurs, with more deaths and injuries, on average, over the Southeast, controlling for the other variables—a result that is consistent with the recent work of Fricker and Elsner (2019) and Biddle et al. (2020).
Some of the unexplained variability in cluster-level tornado counts (and casualty counts) arises from the uncertainty associated with the preferred storm mode and the evolution of mesoscale convective systems, neither of which are captured by a single maximum value in the variable space of CAPE and shear. The counts are also limited by the quality of the NARR data. The NARR tends to unrealistically favor tornado environments during specific convective setups (Gensini and Ashley 2011; Gensini et al. 2014; Allen et al. 2015a). Additionally, we use only the maximum values (minimum for CIN) of the environmental variables, which may limit the representation of the cluster environment. Also, outbreaks associated with tropical cyclones likely add a bit of noise to both models since the number of tornadoes is sensitive to the extent and location of convective bursts within overall evolution of the landfalling storm.
The casualty-count model would be improved by including a skillful estimate of the number of tornadoes. Indeed, in a perfect-prognostic setting, where we know the number of tornadoes in the outbreak, the out-of-sample correlation between the observed number of casualties and the modeled estimated rate of casualties increases to 0.79. Further, although our approach to extracting signal from noise in the tornado dataset is sound, exclusive focus on clusters with at least 10 tornadoes is a type of selection bias meaning that the sample of data used to fit the model does not represent the population of all outbreaks, which limits what we can say in general about the effect of convective environments on the probability distribution of casualty counts.
The tornado-count model can be modified to provide guidance to forecasters given a convective outlook that highlights an area of elevated threat for tornadoes and a prediction of CAPE and shear across the threat area. The model needs to be calibrated using threat polygons (not cluster areas as was done here) and include predicted environmental values, but the same model equation can be used to provide a forecast probability distribution on the number of tornadoes. Further, a numerical convolution of this probability distribution with a forecast probability distribution for each EF-rating category (Elsner and Schroder 2019) will result in a prediction of the expected number of counts by category as well as the associated uncertainties.
The casualty-count model can be employed in a research setting to help better understand the socioeconomic, demographic, and communication factors that make some communities particularly vulnerable to deaths and injuries (Dixon and Moore 2012; Senkbeil et al. 2013; Klockow et al. 2014; Fricker and Elsner 2019). Work along this line has been done at the individual tornado level by identifying unusually devastating events (Fricker and Elsner 2019), but scaling this type of analysis to the cluster level to identify unusually devastating outbreaks might provide additional insights.
It is possible that the models could be improved by including nonlinear effects. One type of nonlinearity is interaction where the effect of CAPE on casualties is modulated by shear, for example. However, interaction effects usually must be specified without reference to the data, so additional research on this is needed. The model could also be improved by including variables that represent other environmental factors, convective modes, and efficiency of tornado production. The models also might be improved by adjusting the threshold definition of a cluster. Increasing the threshold on the tornado-count model from 10 to 14 decreases the sample size to 505 clusters and reduces the effect sizes on CAPE and shear by around 25%. Decreasing the threshold from 10 to 6 increases the sample size, and thus reduces the standard error assuming the effect size stays the same. A casualty-count model might also be improved by relaxing the assumption that the numbers of people injured or killed are independent. Casualty counts are typically not independent at the household level where multiple people live under the same roof. In this case, a better model might include a zero-inflated count process.
Acknowledgments
The negative binomial regression models in this paper were implemented with the glm.nb function from the MASS R package (Venables and Ripley 2002). Graphics were made with the ggplot2 framework (Wickham 2017). The code and data to fit all of the models are available on GitHub (https://github.com/jelsner/cape-shear).
REFERENCES
Allen, J. T., M. K. Tippett, and A. H. Sobel, 2015a: An empirical model relating U.S. monthly hail occurrence to large-scale meteorological environment. J. Adv. Model. Earth Syst., 7, 226–243, https://doi.org/10.1002/2014MS000397.
Allen, J. T., M. K. Tippett, and A. H. Sobel, 2015b: Influence of the El Niño/Southern Oscillation on tornado and hail frequency in the United States. Nat. Geosci., 8, 278–283, https://doi.org/10.1038/ngeo2385.
Anderson-Frey, A. K., and H. Brooks, 2019: Tornado fatalities: An environmental perspective. Wea. Forecasting, 34, 1999–2015, https://doi.org/10.1175/WAF-D-19-0119.1.
Anderson-Frey, A. K., Y. P. Richardson, A. R. Dean, R. L. Thompson, and B. T. Smith, 2018: Near-storm environments of outbreak and isolated tornadoes. Wea. Forecasting, 33, 1397–1412, https://doi.org/10.1175/WAF-D-18-0057.1.
Ashley, W. S., and S. M. Strader, 2016: Recipe for disaster: How the dynamic ingredients of risk and exposure are changing the tornado disaster landscape. Bull. Amer. Meteor. Soc., 97, 767–786, https://doi.org/10.1175/BAMS-D-15-00150.1.
Biddle, M. D., R. P. Brown, C. A. Doswell, and D. R. Legates, 2020: Regional differences in the human toll from tornadoes: A new look at an old idea. Wea. Climate Soc., 12, 815–825, https://doi.org/10.1175/WCAS-D-19-0051.1.
Brooks, H. E., C. A. Doswell, and J. Cooper, 1994: On the environments of tornadic and nontornadic mesocyclones. Wea. Forecasting, 9, 606–618, https://doi.org/10.1175/1520-0434(1994)009<0606:OTEOTA>2.0.CO;2.
Brooks, H. E., G. W. Carbin, and P. T. Marsh, 2014: Increased variability of tornado occurrence in the United States. Science, 346, 349–352, https://doi.org/10.1126/science.1257460.
Cohen, A. E., J. B. Cohen, R. L. Thompson, and B. T. Smith, 2018: Simulating tornado probability and tornado wind speed based on statistical models. Wea. Forecasting, 33, 1099–1108, https://doi.org/10.1175/WAF-D-17-0170.1.
Concannon, P., H. E. Brooks, and C. A. Doswell, 2000: Climatological risk of strong and violent tornadoes in the United States. Second Symp. on Environmental Applications, Long Beach, CA, Amer. Meteor. Soc., 212–219.
Dean, A. R., 2010: An analysis of clustered tornado events. 25th Conf. on Severe Local Storms, Denver, CO, Amer. Meteor. Soc., P2.19, https://ams.confex.com/ams/pdfpapers/176148.pdf.
Dixon, P. G., A. E. Mercer, K. Grala, and W. H. Cooke, 2014: Objective identification of tornado seasons and ideal spatial smoothing radii. Earth Interact., 18, https://doi.org/10.1175/2013EI000559.1.
Dixon, R. W., and T. W. Moore, 2012: Tornado vulnerability in Texas. Wea. Climate Soc., 4, 59–68, https://doi.org/10.1175/WCAS-D-11-00004.1.
Doswell, C. A., R. Edwards, R. L. Thompson, J. A. Hart, and K. C. Crosbie, 2006: A simple and flexible method for ranking severe weather events. Wea. Forecasting, 21, 939–951, https://doi.org/10.1175/WAF959.1.
Elsner, J. B., and C. P. Schmertmann, 1994: Assessing forecast skill through cross validation. Wea. Forecasting, 9, 619–624, https://doi.org/10.1175/1520-0434(1994)009<0619:AFSTCV>2.0.CO;2.
Elsner, J. B., and Z. Schroder, 2019: Tornado damage ratings estimated with cumulative logistic regression. J. Appl. Meteor. Climatol., 58, 2733–2741, https://doi.org/10.1175/JAMC-D-19-0178.1.
Elsner, J. B., S. C. Elsner, and T. H. Jagger, 2015: The increasing efficiency of tornado days in the United States. Climate Dyn., 45, 651–659, https://doi.org/10.1007/s00382-014-2277-3.
Elsner, J. B., T. Fricker, and W. D. Berry, 2018: A model for U.S. tornado casualties involving interaction between damage path estimates of population density and energy dissipation. J. Appl. Meteor. Climatol., 57, 2035–2046, https://doi.org/10.1175/JAMC-D-18-0106.1.
Forbes, G. S., 2006: Meteorological aspects of high-impact tornado outbreaks. Symp. Challenges of Severe Convective Storms, Atlanta, GA, Amer. Meteor. Soc., P1.12, https://ams.confex.com/ams/pdfpapers/99383.pdf.
Fricker, T., and J. B. Elsner, 2019: Unusually devastating tornadoes in the United States: 1995–2016. Ann. Amer. Assoc. Geogr., 110, 724–738, https://doi.org/10.1080/24694452.2019.1638753.
Fricker, T., J. B. Elsner, and T. H. Jagger, 2017: Population and energy elasticity of tornado casualties. Geophys. Res. Lett., 44, 3941–3949, https://doi.org/10.1002/2017GL073093.
Fujita, T. T., 1981: Tornadoes and downbursts in the context of generalized planetary scales. J. Atmos. Sci., 38, 1511–1534, https://doi.org/10.1175/1520-0469(1981)038<1511:TADITC>2.0.CO;2.
Galway, J. G., 1977: Some climatological aspects of tornado outbreaks. Mon. Wea. Rev., 105, 477–484, https://doi.org/10.1175/1520-0493(1977)105<0477:SCAOTO>2.0.CO;2.
Gensini, V. A., and W. Ashley, 2011: Climatology of potentially severe convective environments from the North American Regional Reanalysis. Electron. J. Severe Storms Meteor., 6 (8), https://ejssm.org/ojs/index.php/ejssm/article/viewArticle/85.
Gensini, V. A., T. L. Mote, and H. E. Brooks, 2014: Severe-thunderstorm reanalysis environments and collocated radiosonde observations. J. Appl. Meteor. Climatol., 53, 742–751, https://doi.org/10.1175/JAMC-D-13-0263.1.
Heiss, W. H., D. L. McGrew, and D. Sirmans, 1990: NEXRAD: Next generation weather radar (WSR-88D). Microwave J., 33, 79.
Hilbe, J., 2011: Negative Binomial Regression. Cambridge University Press, 553 pp.
Hill, A. J., G. R. Herman, and R. S. Schumacher, 2020: Forecasting severe weather with random forests. Mon. Wea. Rev., 148, 2135–2161, https://doi.org/10.1175/MWR-D-19-0344.1.
Hitchens, N. M., and H. E. Brooks, 2014: Evaluation of the Storm Prediction Center’s convective outlooks from day 3 through day 1. Wea. Forecasting, 29, 1134–1142, https://doi.org/10.1175/WAF-D-13-00132.1.
Jackson, J. D., and M. E. Brown, 2009: Sounding-derived low-level thermodynamic characteristics associated with tornadic and non-tornadic supercell environments in the southeast United States. Natl. Wea. Dig., 33, 16–26.
Klockow, K. E., R. A. Peppler, and R. A. McPherson, 2014: Tornado folk science in Alabama and Mississippi in the 27 April 2011 tornado outbreak. GeoJournal, 79, 791–804, https://doi.org/10.1007/s10708-013-9518-6.
Mesinger, F., and Coauthors, 2006: North American Regional Reanalysis. Bull. Amer. Meteor. Soc., 87, 343–360, https://doi.org/10.1175/BAMS-87-3-343.
Moore, T. W., 2017: On the temporal and spatial characteristics of tornado days in the United States. Atmos. Res., 184, 56–65, https://doi.org/10.1016/j.atmosres.2016.10.007.
Moore, T. W., R. W. Dixon, and N. J. Sokol, 2016: Tropical Cyclone Ivan’s tornado cluster in the Mid-Atlantic region of the United States on 17–18 September 2004. Phys. Geogr., 37, 210–227, https://doi.org/10.1080/02723646.2016.1189299.
Rasmussen, E. N., and D. O. Blanchard, 1998: A baseline climatology of sounding-derived supercell and tornado forecast parameters. Wea. Forecasting, 13, 1148–1164, https://doi.org/10.1175/1520-0434(1998)013<1148:ABCOSD>2.0.CO;2.
Schroder, Z., and J. B. Elsner, 2019: Quantifying relationships between environmental factors and power dissipation on the most prolific days in the largest tornado “outbreaks.” Int. J. Climatol., 40, 3150–3160, https://doi.org/10.1002/joc.6388.
Senkbeil, J. C., D. A. Scott, P. Guinazu-Walker, and M. S. Rockman, 2013: Ethnic and racial differences in tornado hazard perception, preparedness, and shelter lead time in Tuscaloosa. Prof. Geogr., 66, 610–620, https://doi.org/10.1080/00330124.2013.826562.
Shafer, C. M., and C. A. Doswell, 2011: Using kernel density estimation to identify, rank, and classify severe weather outbreak events. Electron. J. Severe Storms Meteor., 6 (2), https://ejssm.org/ojs/index.php/ejssm/article/viewArticle/74.
Steiner, E., 2019: Spatial history project. Stanford University Center for Spatial and Textual Analysis, http://web.stanford.edu/group/spatialhistory/cgi-bin/site/index.php.
Thompson, R. L., R. Edwards, J. A. Hart, K. L. Elmore, and P. Markowski, 2003: Close proximity soundings within supercell environments obtained from the rapid update cycle. Wea. Forecasting, 18, 1243–1261, https://doi.org/10.1175/1520-0434(2003)018<1243:cpswse>2.0.co;2.
Thompson, R. L., and Coauthors, 2017: Tornado damage rating probabilities derived from WSR-88D data. Wea. Forecasting, 32, 1509–1528, https://doi.org/10.1175/WAF-D-17-0004.1.
Tippett, M. K., A. H. Sobel, and S. J. Camargo, 2012: Association of U.S. tornado occurrence with monthly environmental parameters. Geophys. Res. Lett., 39, L02801, https://doi.org/10.1029/2011GL050368.
Tippett, M. K., C. Lepore, and J. E. Cohen, 2016: More tornadoes in the most extreme U.S. tornado outbreaks. Science, 354, 1419–1423, https://doi.org/10.1126/science.aah7393.
Venables, W. N., and B. D. Ripley, 2002: Modern Applied Statistics with S. 4th ed. Springer, 495 pp.
Wickham, H., 2017: tidyverse: Easily install and load ‘Tidyverse’ packages, version 1.1.1. R package, https://CRAN.R-project.org/package=tidyverse.