1. Introduction
Ensemble forecasting aims to characterize forecast uncertainty associated with the growth of small uncertainties in the initial conditions of the forecast (Molteni et al. 1996). A natural interpretation of an ensemble forecasting system is as an estimator of the probability of occurrence of future weather events of interest (Leith 1974); for example, in the case of flood prediction, what is the chance of rainfall accumulation exceeding a particular critical threshold value in a given catchment? However, what is meant by future weather events of interest is highly dependent on the end user. This makes the identification of key predictands to be used in the assessment of forecast skill an important challenge (Morss et al. 2008). One rational choice may be to consider that in the extratropics the occurrence of significant weather such as intense rainfall or high winds is often associated with the development and passage of cyclonic weather systems (Bengtsson et al. 2005; Hawcroft et al. 2012). Therefore, the skill in predicting the location and intensity of these systems is of particular importance, their representation in models having implications for both weather and climate (Zappa et al. 2013).
A relatively recent development in the prediction of extratropical weather systems has been the application of objective identification and tracking of cyclonic features to global ensemble forecasting systems. In particular the methodology of Hewson and Titley (2010) was introduced to run on the 15-day version of the Met Office Global and Regional Ensemble Prediction System (MOGREPS-15) (Bowler et al. 2009) forecasts in 2006, and has since been implemented in the European Centre for Medium-Range Weather Forecasts Integrated Forecast System. The tracking methodology is one of those being used in the Intercomparison of Mid Latitude Storm Diagnostics (Neu et al. 2013). MOGREPS-15 was run as part of the THORPEX Interactive Grand Global Ensemble (TIGGE; Park et al. 2008) from 2006 to 2014, and has now been superseded by the higher-resolution 7-day MOGREPS-G ensemble. The Hewson and Titley (2010) cyclone tracking system will be implemented in the 7-day MOGREPS-G forecasts in 2015.
The feature tracking within these ensemble systems provides a compelling visual representation of the prediction uncertainty of ensemble forecasts (Swinbank et al. 2015) and the forecast products from the feature tracking have been used by Met Office operational forecasters for several years. For example, Fig. 1 is a sequence of “Dalmatian plots” from MOGREPS-15 at increasing lead time but with identical validity times. These forecasts correspond to a severe windstorm that struck the United Kingdom on 28 October 2013 causing severe disruption to transport and power supplies as well as resulting in several deaths. The contours show the mean sea level pressure from the control forecast and the various colored dots show the locations of all objectively identified cyclonic features from all ensemble members, colored according to the maximum wind speed at 1 km above the surface within 300 km of the feature center. The increasing uncertainty with lead time is apparent through the increasing scatter of the cyclonic features. At T + 0 h (Fig. 1a), the features are densely clustered into almost totally overlapping groups. As time progresses, the feature points become increasingly scattered, but at T + 72 h and T + 120 h (Figs. 1b,c) they are still in coherent and distinct groups of similar features, meaning that although the exact location and strength of the features becomes increasingly uncertain, it is still possible to be confident that there will be a severe weather system approaching the United Kingdom. In light of the forecasts, the Met Office issued a severe weather warning to the general public for high winds over southern England 5 days in advance of the storm. At T + 168 h, the features are sufficiently scattered that it is no longer possible to identify coherent clusters of features of similar strength. For example, the red and orange dots are mixed together with blue and green dots. The large numbers of red and orange dots do indicate a high likelihood that a strong cyclone will occur, but the mixing with other colored dots indicates that the uncertainty in its location is larger than the distance separating it from adjacent cyclonic features. Despite this, the effect of the larger-scale flow is still very clear in, for example, the very low number of cyclonic features in the vicinity of the Mid-Atlantic Ridge seen in the control forecast mean sea level pressure.
“Dalmatian plots” showing (a) 0-, (b) 3-, (c) 5-, and (d) 7-day MOGREPS-15 forecast valid at 0000 UTC 28 Oct 2013. Contours show the mean sea level pressure from the control forecast. Colored dots show locations of cyclonic feature centers from all forecast members. Coloring indicates the max wind [knots (kt; where 1 kt = 0.51 m s−1)] within 300 km of feature center at 1 km ZAGL.
Citation: Weather and Forecasting 30, 5; 10.1175/WAF-D-15-0039.1
In this paper, we present an assessment of the changes in the climatological statistics of these features tracked over forecast lead times and an assessment of the skill in the probabilistic predictions of them.
The rest of the paper is divided into four sections. In section 2, we give a brief overview of the tracked data used in this paper and MOGREPS-15. In section 3, the statistical properties of the mesoscale features are examined. In section 4, the predictive skill of the mesoscale features is assessed. A summary of our results and conclusions is presented in section 5.
2. Tracked feature data
a. Tracking methodology
The data used in this paper consist of objectively identified and tracked cyclonic features from MOGREPS-15 forecasts. The data consist of twice-daily forecasts produced at 0000 and 1200 UTC, covering a period of more than 6 yr from 1 December 2006 to 31 December 2012. To aid in the understanding of what is to follow, in this section we shall give a brief outline of the tracking method used. See Hewson and Titley (2010) for a full description of the objective feature identification and tracking methodology.
The tracking algorithm aims to identify and track developing cyclonic storms through their entire life cycle from small kinks in fronts (labeled diminutive frontal waves) through developed frontal waves to closed low pressure centers (labeled barotropic lows). The algorithm uses a hybrid of objective fronts, pressure minima, and vorticity maxima to identify features and define their locations. As in Hewson (1998), the fronts themselves are defined to coincide with sharp gradients in wet-bulb potential temperature. Warm and cold fronts are distinguished by the sign of the local geostrophic advection of wet-bulb potential temperature, with warm fronts defined for positive advection and cold fronts defined for negative advection. Following the terminology of Hewson and Titley (2010), the three classes of cyclonic feature are defined using data at 1-km geopotential height above ground level (hereafter 1 km ZAGL) as follows:
a diminutive frontal wave is a maximum in the alongfront component of geostrophic vorticity,
a frontal wave is the intersection of a cold and warm front at which the cross frontal geostrophic vorticity is positive, and
a barotropic low is defined as any other isolated minimum in the 1000-hPa geopotential height.
The tracking is performed on a 50-km-resolution equal-area grid within a subdomain of the full model domain, spanning the region of approximately 11.5°–85.5°N, 117.2°W–76.8°E. The methodology uses a contour intersection methodology so that the locations of feature centers are not restricted to grid locations but can vary continuously in space. The use of 1000-hPa geopotential height to identify barotropic lows has some disadvantages. One of these is that anomalously large numbers of barotropic lows are identified over high topography (particularly Greenland). The sensitivity of the statistical properties of tracked cyclones in analysis fields to the inclusion and exclusion of such orographic features has been examined by Rudeva et al. (2014). They found that bulk measures of cyclone behavior such as mean minimum central pressure and maximum deepening rate have only weak dependence on the inclusion or exclusion of orographic features. However, in the study presented here we are interested in the prediction of the location of mobile systems. For this reason, we shall restrict our analysis of the statistics of features to a North Atlantic control region. This control region is defined to lie within 30°–60°N, 50°–10°W, and covers a total area of ~107 km2.
Within this work, we shall not consider the distinction between different classes of frontal waves. This choice is motivated by the observation that tracked features frequently switch between classes during their evolution and therefore the same feature can be classified differently in different ensemble members. We will instead use vorticity at the feature center 1 km ZAGL as a means of quantifying feature strength. For reference, Fig. 2 shows the pdf of 1 km ZAGL feature vorticity for different feature classes calculated from the analysis fields for the whole of the dataset.
Gaussian kernel smoothing probability density estimate of the relative vorticity at 1 km ZAGL of cyclonic features of different classes identified in analysis fields within the North Atlantic control region. A kernel bandwidth of 10−5 s−1 was used to create the distributions.
Citation: Weather and Forecasting 30, 5; 10.1175/WAF-D-15-0039.1
b. MOGREPS-15
MOGREPS-15 consists of 23 perturbed members plus one unperturbed control forecast, produced using the global 33-km-resolution configuration of the Met Office Unified Model. Control and perturbed forecasts are run using the same model resolution and dynamics; however, perturbed members are run with stochastic physics and the control forecast without. The perturbed members are initialized from the control analysis using additive perturbations derived from an ensemble transform Kalman filter (ETKF; Wang and Bishop 2003). Because of the pseudo-operational nature of the data, the period under consideration spans 13 versions of MOGREPS-15. Details of the differences between these 13 versions, as well as descriptions of subsequent versions and the new MOGREPS-7, are available online (http://www.ecmwf.int/en/research/projects/tigge). Since our main interest is the generic predictability properties of cyclonic features, we shall not consider the different versions of the model in detail here. However, one change of interest is that starting in April 2012 the configuration of the ensemble was changed so that one of the perturbed members (member 23) was run directly from the control analysis without initial condition perturbations. The effect of stochastic parameterizations on the cyclonic features is discussed in the appendix.
3. Systematic differences between features in forecast and analysis fields
In this section, we shall highlight the evolution of the climatological statistics of cyclonic features with increasing lead time. We shall focus particularly on the differences between the analysis, control, and perturbed forecasts. Since the method of generation of member 23 changes part way through the dataset, we exclude it from the analysis in this section so that only 22 perturbed members are considered.
Figure 3 shows the mean areal density of features versus lead time for the control and perturbed forecasts. Figure 3a shows the density of frontal waves and Fig. 3b shows the density of barotropic lows. The thick solid line shows the control member, and the dashed solid line the average over all perturbed forecast members. For reference, the thin solid line shows the same statistic computed for the corresponding valid time of the analyses data. To give an illustration of how large differences between the control and perturbed forecasts would be anticipated to be through sampling, the yellow-shaded region illustrates the range of values of the statistic obtained from perturbed forecasts by selecting one perturbed member at random for each analysis time. This is plotted at the 99th percentile of the resampling distribution estimated from a sample of 10 000 realizations, but this is not intended to denote a 99th percentile confidence interval.
The instantaneous mean number of (a) frontal waves and (b) barotropic lows per unit area as a function of lead time, calculated for the North Atlantic control region with the 6-yr dataset. Shown is the control forecast fields (thick solid line), perturbed forecast fields (dashed line), and concurrent analysis fields (thin solid line). Yellow shading indicates the 99th percentile region for resampling between perturbed ensemble members.
Citation: Weather and Forecasting 30, 5; 10.1175/WAF-D-15-0039.1
Several points can be made about the feature density. First, the number density is a decreasing function of lead time, with the mean density of all feature types below that of the analysis fields by 24-h lead time. Second, the expected number of features in perturbed forecasts is systematically larger than that of the control forecasts at all lead times. At the initial time, this must be attributable to the initial condition perturbations added to the perturbed members. At later lead times, this is a consequence of the stochastic parameterizations, which are applied to perturbed forecasts but not the control forecast (see the appendix). The third point to be made is that the rate at which the number of frontal waves decreases with increasing lead time is significantly faster than that of barotropic lows.
For comparison, Fig. 4 shows the square root of the area-averaged 850-hPa enstrophy (vorticity squared) calculated for the North Atlantic region at different forecast lead times. There is a clear decay in enstrophy with increasing lead time (e-folding time of about 18 h), which is the same for both control and perturbed forecasts, although there is an approximately constant offset between the two. The exponential decay of enstrophy is very suggestive of numerical diffusion but may also relate to a spindown of unbalanced features in the initial conditions. At the initial time, the presence of the offset in enstrophy between the control and perturbed members must be attributable to the additive ETKF perturbations since this is the only difference between the two. After the first couple of days, however, we attribute the offset primarily to the additional enstrophy supplied to the perturbed members by stochastic parameterizations (see the appendix), most likely the stochastic kinetic energy backscatter scheme (Shutts 2005; Bowler et al. 2009). The 18-h e-folding time for enstrophy decay is comparable to the e-folding time scale of the frontal-wave feature number density (~17.5 h) shown in Fig. 3a. The number densities of barotropic lows (Fig. 3b) have a longer e-folding time scale ~24 h. This longer e-folding time scale in mean feature density reflects the higher vorticity of barotropic lows (Fig. 2); that is, there are fewer weak features that rapidly decay with diffusion. But it may also be interpreted as a reflection of the fact that barotropic lows are defined from pressure minima meaning that decay requires significant movement of mass, whereas frontal waves are defined from the wind or vorticity field at fronts and therefore have more rapid evolution (Neu et al. 2013).
The area-averaged 850-hPa enstrophy as a function of lead time, calculated for the North Atlantic control region from 1° × 1° wind fields using all forecasts starting in January, April, July, and October between 2007 and 2012. Shown is the control forecast fields (thick solid line) and perturbed forecast fields (dashed line). Yellow shading indicates the 99th percentile region for resampling between perturbed ensemble members.
Citation: Weather and Forecasting 30, 5; 10.1175/WAF-D-15-0039.1
Figures 5a and 5b show the mean vorticity of frontal waves and barotropic lows, respectively, as a function of forecast lead time. Considering first Fig. 5b, there is rapid decay in the mean vorticity of barotropic lows in both the perturbed and control forecasts. The vorticity saturates at ~90% of the value in the analysis for perturbed forecasts, with the control forecasts saturating at a slightly lower (~89%) value. By contrast, the mean vorticity of frontal waves in forecast fields increases. Comparing the vorticity tendencies along the tracks of features that are matched one to one between the control forecast and analysis, it is found that the average vorticity of both classes of features in the control forecasts decrease with lead time relative to the analysis field. At 24-h lead time, the vorticity of the frontal waves in the control forecast fields is on average 3.3 × 10−6 s−1 lower than that of the corresponding frontal waves in the analysis fields. For barotropic lows, the vorticity is on average 1.6 × 10−5 s−1 lower in the control forecast than the analysis. This implies that the increase in the mean vorticity of frontal waves in forecasts can be seen as an effect of the change in frontal-wave population; that is, the analysis fields contain a large number of weak features that are not sustained in the forecast.
The mean vorticity of (a) frontal waves and (b) barotropic lows at 1 km above the ground as a function lead time, calculated for features within the North Atlantic control region. Shown is the control forecast fields (thick solid line), perturbed forecast fields (dashed line), and concurrent analysis fields (thin solid line).
Citation: Weather and Forecasting 30, 5; 10.1175/WAF-D-15-0039.1
4. Forecast skill of features
a. Definition of predictand and skill













Results showing the variation in skill with vorticity threshold are presented using deciles of the vorticity distribution so that the figure axes are linear with respect to frequency. The deciles were calculated from the analysis data within the North Atlantic region and are the deciles of the distribution shown in Fig. 2. One possibility with this definition of vorticity threshold would be to calculate the strike probabilities using deciles of the vorticity distribution at differing lead times, so that some forecast recalibration would be included in the results. The results presented do not include such a recalibration.
b. Verification of strike probabilities
In this section we shall consider how the Brier skill score varies with lead time, strike radius, and vorticity threshold. The results are calculated for forecasts produced every 12 h between 1 December 2006 and 30 November 2012. This time range spans the change in the generation method of member 23; however, the results were found not to be sensitive to the inclusion or exclusion of these dates. Some forecasts had one or more individual forecast members, or verifying analyses, missing from the tracked dataset and these forecast dates have been excluded from the calculations. The forecasts with missing data were distributed randomly in time.
Figure 6 shows the Brier skill score (color shading) as a function of vorticity threshold and strike radius, for lead times of 1, 3, and 5 days. The white and dark blue contours show the resolution and uncertainty, respectively. These were all calculated as the average of the values calculated at single locations. For simplicity we used sample locations on a regular 2.5° grid. Figures produced for individual locations are qualitatively similar but have some quantitative differences.
Color shading showing BSSs for (a) 1-, (b) 3-, and (c) 5-day forecasts, plotted as a function of strike radius and vorticity threshold, and aggregated over multiple spatial locations. Shown is the resolution component (white contours) and the uncertainty component (blue contours) of the BS decomposition.
Citation: Weather and Forecasting 30, 5; 10.1175/WAF-D-15-0039.1
The skill is significantly less than 1.0 at 1-day lead time, with a maximum value of 0.65. The magnitude of the forecast skill lies somewhere between the values expected for small-scale predictands, such as rainfall, and large-scale predictands such as large-scale flow regimes. For example, Hamill (2012) find 1-day forecasts of 1°-resolution 24-h accumulated rainfall greater than 1 mm to be ~0.4, whereas Frame et al. (2011) find the skill in predicting movements of the North Atlantic jet to be ~1.0 at 1-day lead times. The strike probability depends on multiple scales of motion and can be associated with position uncertainties in the centers of large and perhaps fairly predictable features such as cyclones, and the existence uncertainty in smaller and perhaps inherently unpredictable features such as small kinks on frontal surfaces. The strike probability does not distinguish between these two paradigms with all features being reduced to a single point in space by identifying turning points in the forecast fields. It may therefore be considered both a synoptic- and mesoscale predictand.
The variation in skill with strike radius can be interpreted as follows. At small radii, the skill will be low since the locations of the features are uncertain. For sufficiently large radii, the forecast strike probability will saturate at the climatological value and therefore provides no new information beyond climatology. The scales for which BSS is maximized provide an estimator of the transition point between these two paradigms. The Brier score is strongly linked to the information content of the forecasts (Weijs et al. 2010), so these scales may also be interpreted as those that contain the most forecast information. Figure 6 suggests that the strike radius at which skill is maximized is an increasing function of lead time. This can be seen clearly in Fig. 7, which shows the area-averaged BSS as a function of strike radius at different lead times for forecasts of features with vorticity above the seventh decile. The strike radius at which maximum skill occurs (shown by the dashed line) increases almost linearly at a rate of 43 km day−1 (estimated from a linear fit) from about 650 km at 12-h lead time to about 950 km at 7-day lead time. Similar figures for different vorticity thresholds between the first and ninth deciles have very similar results, with the rate of increase in strike radius of maximum skill in the range 38–43 km day−1. When no threshold is applied, the strike radius of maximum skill increases more rapidly with lead time at 50 km day−1. We attribute this difference to the loss of weak features with increasing lead time noted in section 3 essentially leading to a bias in the forecast strike probability relative to the analysis. As can be seen in Fig. 6, for a given lead time, the strike radius at which maximum skill occurs is an increasing function of the vorticity threshold, so that the skill is maximized at smaller radius when lower vorticity thresholds are used. This may be due to the fact that there are larger numbers of small features, so that climatology becomes quite difficult to beat once the radius becomes large.
BSS vs radius for strike probability forecasts of features with vorticity greater than the seventh decile. Each solid line shows a different lead time, with lighter shading indicating longer lead time. Plotted data have a resolution of 100 km. The dashed line indicates the radius at which the max skill occurs. The location of the max was obtained by cubic-spline fitting to the plotted data.
Citation: Weather and Forecasting 30, 5; 10.1175/WAF-D-15-0039.1
The fact that maximum skill occurs at such large radii even at very short lead time (e.g., 650-km scale at 12 h) may seem surprising considering that Froude (2010) finds mean position errors of one geodetic degree (~111 km) at day 1 and eight geodetic degrees (~888 km) at day 7; however, it must be remembered that the tracking in Froude (2010) was performed using a different methodology and much coarser–resolution data (T42, ~2.8°) than that used in the present paper, meaning that it focused on large synoptic-scale cyclones. More significantly the analysis performed by Froude (2010) was heavily conditioned on the verifying analysis; for example, all feature tracks that were unmatched to analysis tracks or had position error greater than four geodetic degrees before day 4 were removed. So while Froude (2010) provides useful information about the development of forecast errors in situations that are relatively well forecast, the use of verifying analysis to filter the forecast data would make it an overly optimistic assessment of the predictive capability of models. It is also worth noting that having maximum skill at 650 km does not imply that forecasts are not skillful for smaller radii; for example, Fig. 7 indicates there is still a small amount of skill for 300-km radii out to 7 days.
A more subtle feature of Fig. 6 is the way in which the relative skill at different r and
BSS vs lead time for a strike radius of 600 km. The different lines correspond to different vorticity thresholds. Solid lines shows the zeroth to eighth deciles, with lighter shading indicating a higher threshold. Dashed line shows the ninth decile.
Citation: Weather and Forecasting 30, 5; 10.1175/WAF-D-15-0039.1
The shapes of the BSS curves shown in Fig. 8 are typical of those calculated for all strike radii and vorticity thresholds, with the Brier skill decaying as a quadratic function of lead time. The lead time at which BSS reaches zero can be interpreted as the practical limit of the predictability of cyclonic features for forecasts produced by MOGREPS-15. This may of course be longer for other forecast models or increased ensemble sizes. Figure 9 shows the lead time at which the BSS reaches zero for different strike radii and vorticity thresholds. The longest lead time for which Brier skill remains above zero is 14 days for features with vorticity above the ninth vorticity decile and strike radius ~950–1000 km. The skill at these scales and intensities at such long lead times is likely due to constraints the larger, more predictable, scales place on the location and propagation of cyclones, as was illustrated in Fig. 1d. It may also be associated with the longer lifetime associated with strong cyclones.
The lead time (days) at which the BSS of strike probability forecasts reaches zero as a function of strike radius and vorticity threshold.
Citation: Weather and Forecasting 30, 5; 10.1175/WAF-D-15-0039.1
Figures 10 and 11 (color shading) show the radius at which maximum skill occurs and the density of features with vorticity above the seventh decile at different geographic locations calculated from the analysis fields. For reference, the black contours in Fig. 10 show the corresponding maximum BSS. We show the radius at which skill is maximized as opposed to the radius at which skill meets a particular threshold since this provides an upper bound on the skill. For strike radii larger or smaller than this, the skill will be lower. There is a clear, if imperfect, inverse proportionality between cyclone density and the radius at which skill is maximized, which becomes stronger as lead time increases. We attribute the existence of this proportionality to the strength of the constraint the large-scale flow has on the evolution of cyclones. For example, the high density of cyclones in the northwestern region of Fig. 11 is associated with the jet stream, with a region of particularly high density near the tip of Greenland. This second feature is likely due to lee cyclogenesis at the tip of Greenland. Similar density maxima are seen in studies such as that of Hoskins and Hodges (2002). The jet region provides both the baroclinicity needed for cyclone development and the strong steering that constrains the cyclone’s trajectory to remain within it. We attribute the high skill at relative small radius to the strong steering by the large-scale and hence more predictable flow. This region is also associated with larger cyclones, which have longer lifetimes (Rudeva and Gulev 2007) and may therefore be more predictable. In contrast, the southeastern half of the domain has greatest skill at much larger radius. We attribute this to the lack of strong steering from the large-scale flow, and to the predominance of secondary cyclones or frontal waves in this area of the domain. Figures 6–9 show area-averaged quantities, so they will be either over- or underestimates for many specific locations. Regions where the maximum skill is low are expected to reach the zero skill limit of predictability sooner. For example, the maximum skill is noticeably lower in the northwestern region of Fig. 10c so that we expect skill in this region to reach zero at shorter lead times.
Color shading with the radius (km) at which the max BSS of a forecast of features with vorticity above the seventh decile occurs. Values are calculated on a 2.5° × 2.5° grid and smoothed for plotting using a 2 × 2 gridpoint–averaging kernel. Black contours show the corresponding values of BSS.
Citation: Weather and Forecasting 30, 5; 10.1175/WAF-D-15-0039.1
Areal number density of features (10−7 km−2) with vorticity above the seventh decile, estimated using 300-km-radius flat circular kernel.
Citation: Weather and Forecasting 30, 5; 10.1175/WAF-D-15-0039.1
5. Summary and conclusions
This paper has summarized the statistical properties and forecast skill of objectively tracked cyclonic features in MOGREPS-15. It has been shown that there is a noticeable reduction in the areal density of features in the forecast relative to the analysis, which saturates at about 7 days lead time for barotropic lows and 5 days for frontal waves. The perturbed forecasts saturate at a slightly higher feature density than do those of the control, which is attributable to the presence of stochastic parameterization in the perturbed forecast. This stochastic effect may simply be that the increased vorticity in the stochastically forced forecast members puts larger numbers of features over thresholds built into the objective identification algorithm, or it may be a more subtle (and perhaps more physical) effect.
The forecast skill of strike probabilities has been assessed. It is found that the maximum skill is largely uniform in the Atlantic to the east of Greenland; however, the strike radius at which this occurs is much smaller along the region associated with the North Atlantic storm track, indicating that it is easier to forecast the locations of cyclones in this region. Furthermore, it is found that for a given vorticity threshold as lead time increases, the radius at which forecast skill is maximized increases linearly with time. The longest lead time at which the skill of strike probability forecasts remains greater than zero is found to be ~14 days but only for cyclonic features with vorticity greater than the ninth decile of the vorticity distribution and for strike radii ~1000 km. When all features are accounted for, the Brier skill score falls to zero within 9 days on average, for any strike radius.
Acknowledgments
This work was funded by the DIAMET project (NERC Grant NE/I005234/1). Access to the MOGREPS-15 cyclonic feature database was enabled as a Met Office contribution to DIAMET. The authors thank Richard Swinbank, for his involvement in DIAMET, and Piers Buchanan, for facilitating access to the complete dataset. We thank Tim Hewson for helpful discussions about the feature identification methodology and the research presented. This paper benefited from the helpful comments of three anonymous reviewers.
APPENDIX
Impact of Stochastic Parameterizations
Starting in April 2012 MOGREPS-15 was constructed from one unperturbed control member without stochastic parameterizations, 22 perturbed members with stochastic parameterizations, and one unperturbed member with stochastic parameterizations. This configuration provides a convenient means of determining whether stochastic parameterization is the cause of differences in the statistics of cyclonic features between the control and perturbed members described in section 3. Figure A1 shows the evolution of the mean areal density of features over the forecast lead time for the subset of data in which MOGREPS-15 has been configured with an unperturbed stochastic member. The lines and shading are as in Fig. 3, with the addition of the red line, which shows values for the unperturbed stochastic member. There is clear evidence that while the control member is distinct from the perturbed members the unperturbed stochastic member is not. This is more evident when considering the area-averaged enstrophy shown in Fig. A2. The enstrophy of the control members remains distinct from that of the perturbed members throughout the forecast, whereas the enstrophy of the members with stochastic parameterizations but without initial condition perturbations converges rapidly toward the enstrophy of the perturbed members. Within 2 days it is closer in terms of enstrophy to the perturbed members than the control, and by 5 days, at the latest, it is indistinguishable from the perturbed members.
As in Fig. 3, but for a subset of analysis times from 1 Apr to 1 Dec 2012 in which one stochastically forced member is run from unperturbed initial conditions. Red line shows values computed for this member.
Citation: Weather and Forecasting 30, 5; 10.1175/WAF-D-15-0039.1
The area-averaged 850-hPa enstrophy as a function of lead time, calculated for the North Atlantic control region from 1° × 1° wind fields using all forecasts starting in April, July, and October 2012. Shown is the control forecast fields (thick solid line) and the perturbed forecast fields (dashed line). Yellow shading indicates the 99th percentile region for resampling between perturbed ensemble members. Red line shows values for the ensemble member run with stochastic parameterizations but without initial condition perturbation.
Citation: Weather and Forecasting 30, 5; 10.1175/WAF-D-15-0039.1
REFERENCES
Bengtsson, L., Hodges K. I. , and Froude L. S. R. , 2005: Global observations and forecast skill. Tellus, 57A, 515–527, doi:10.1111/j.1600-0870.2005.00138.x.
Bowler, N. E., Arribas A. , Beare S. E. , Mylne K. R. , and Shutts G. J. , 2009: The local ETKF and SKEB: Upgrades to the MOGREPS short-range ensemble prediction system. Quart. J. Roy. Meteor. Soc., 135, 767–776, doi:10.1002/qj.394.
Brier, G. W., 1950: Verification of forecasts expressed in terms of probability. Mon. Wea. Rev., 78, 1–3, doi:10.1175/1520-0493(1950)078<0001:VOFEIT>2.0.CO;2.
Frame, T. H. A., Ambaum M. H. P. , Gray S. L. , and Methven J. , 2011: Ensemble prediction of transitions of the North Atlantic eddy-driven jet. Quart. J. Roy. Meteor. Soc., 137, 1288–1297, doi:10.1002/qj.829.
Froude, L. S., 2010: TIGGE: Comparison of the prediction of Northern Hemisphere extratropical cyclones by different ensemble prediction systems. Wea. Forecasting, 25, 819–836, doi:10.1175/2010WAF2222326.1.
Hamill, T. M., 2012: Verification of TIGGE multimodel and ECMWF reforecast-calibrated probabilistic precipitation forecasts over the contiguous United States. Mon. Wea. Rev., 140, 2232–2252, doi:10.1175/MWR-D-11-00220.1.
Hamill, T. M., and Juras J. , 2006: Measuring forecast skill: Is it real skill or is it the varying climatology? Quart. J. Roy. Meteor. Soc., 132, 2905–2923, doi:10.1256/qj.06.25.
Hawcroft, M., Shaffrey L. , Hodges K. , and Dacre H. , 2012: How much Northern Hemisphere precipitation is associated with extratropical cyclones? Geophys. Res. Lett., 39, doi:10.1029/2012GL053866.
Hewson, T. D., 1998: Objective fronts. Meteor. Appl., 5, 37–65, doi:10.1017/S1350482798000553.
Hewson, T. D., and Titley H. A. , 2010: Objective identification, typing and tracking of the complete life-cycles of cyclonic features at high spatial resolution. Meteor. Appl., 17, 355–381, doi:10.1002/met.204.
Hoskins, B. J., and Hodges K. I. , 2002: New perspectives on the Northern Hemisphere winter storm tracks. J. Atmos. Sci., 59, doi:10.1175/1520-0469(2002)059<1041:NPOTNH>2.0.CO;2.
Jolliffe, I. T., and Stephenson D. B. , 2012: Forecast Verification: A Practitioner’s Guide in Atmospheric Science. 2nd ed. J. Wiley and Sons, 292 pp.
Leith, C., 1974: Theoretical skill of Monte Carlo forecasts. Mon. Wea. Rev., 102, 409–418, doi:10.1175/1520-0493(1974)102<0409:TSOMCF>2.0.CO;2.
Molteni, F., Buizza R. , Palmer T. N. , and Petroliagis T. , 1996: The ECMWF Ensemble Prediction System: Methodology and validation. Quart. J. Roy. Meteor. Soc., 122, 73–119, doi:10.1002/qj.49712252905.
Morss, R. E., Lazo J. K. , Brown B. G. , Brooks H. E. , Ganderton P. T. , and Mills B. N. , 2008: Societal and economic research and applications for weather forecasts: Priorities for the North American THORPEX program. Bull. Amer. Meteor. Soc., 89, 335–346, doi:10.1175/BAMS-89-3-335.
Neu, U., and Coauthors, 2013: IMILAST: A community effort to intercompare extratropical cyclone detection and tracking algorithms. Bull. Amer. Meteor. Soc., 94, 529–547, doi:10.1175/BAMS-D-11-00154.1.
Park, Y.-Y., Buizza R. , and Leutbecher M. , 2008: TIGGE: Preliminary results on comparing and combining ensembles. Quart. J. Roy. Meteor. Soc., 134, 2029–2050, doi:10.1002/qj.334.
Rudeva, I., and Gulev S. K. , 2007: Climatology of cyclone size characteristics and their changes during the cyclone life cycle. Mon. Wea. Rev., 135, 2568–2587, doi:10.1175/MWR3420.1.
Rudeva, I., Gulev S. K. , Simmonds I. , and Tilinina N. , 2014: The sensitivity of characteristics of cyclone activity to identification procedures in tracking algorithms. Tellus, 66A, 24961, doi:10.3402/tellusa.v66.24961.
Shutts, G., 2005: A kinetic energy backscatter algorithm for use in ensemble prediction systems. Quart. J. Roy. Meteor. Soc., 131, 3079–3102, doi:10.1256/qj.04.106.
Swinbank, R., and Coauthors, 2015: The THORPEX Interactive Grand Global Ensemble (TIGGE) and its achievements. Bull. Amer. Meteor. Soc., doi:10.1175/BAMS-D-13-00191.1, in press.
Wang, X., and Bishop C. H. , 2003: A comparison of breeding and ensemble transform Kalman filter ensemble forecast schemes. J. Atmos. Sci., 60, 1140–1158, doi:10.1175/1520-0469(2003)060<1140:ACOBAE>2.0.CO;2.
Weijs, S. V., Van Nooijen R. , and Van De Giesen N. , 2010: Kullback–Leibler divergence as a forecast skill score with classic reliability-resolution-uncertainty decomposition. Mon. Wea. Rev., 138, 3387–3399, doi:10.1175/2010MWR3229.1.
Wilks, D. S., 2011: Statistical Methods in the Atmospheric Sciences. 3rd ed. Elsevier, 676 pp.
Zappa, G., Shaffrey L. C. , and Hodges K. I. , 2013: The ability of CMIP5 models to simulate North Atlantic extratropical cyclones. J. Climate, 26, 5379–5396, doi:10.1175/JCLI-D-12-00501.1.