1. Introduction
Stochastic parameterizations have the potential to increase forecast skill and decrease model biases by capturing the inherently turbulent nature of many subgrid processes [for a comprehensive overview, see Berner et al. (2016)]. In the case of atmospheric deep convection, the fluctuations around the mean state within a grid box become significant for model grid spacing less than 100 km (Jones and Randall 2011). This subgrid noise can feed back onto the resolved scales, impacting tropical oscillations (Wang et al. 2016; Christensen et al. 2017) and the upscale growth of forecast errors (Selz and Craig 2015b). Designing a stochastic parameterization requires some model of the subgrid-scale variability. Simple approaches include perturbing parameterized model tendencies in a multiplicative way (Buizza et al. 1999) or perturbing parameters in the parameterizations (Ollinaho et al. 2017). More complex schemes have been designed based on subgrid Markov chains (Dorrestijn et al. 2013) and cellular automata (Bengtsson et al. 2013).
In this study, we focus on a theory of convective variability based on statistical physics developed by Craig and Cohen (2006; the theory is hereafter abbreviated CC06). Its application in a stochastic parameterization framework (Plant and Craig 2008) proved beneficial in a number of ways: it produces scale-aware fluctuations in a mesoscale model (Keane et al. 2014); forecast errors grow upscale realistically, as opposed to deterministic parameterizations in which errors grow too slowly (Selz and Craig 2015a); and in global climate simulations, precipitation variability and tropical wave activity, such as the Madden–Julian oscillation, are improved (Wang et al. 2016; Wang and Zhang 2016). Recently, Sakradzija et al. (2015) extended the approach to shallow convection, which improves coupling the resolved model dynamics (Sakradzija et al. 2016). For all its benefits, the CC06 theory is based on strongly simplifying assumptions about how convection behaves. The main purpose of this study is to test the applicability of the CC06 theory outside of its comfort zone.
a. The CC06 theory
b. Previous tests of the CC06 theory
So far, few studies have directly tested the assumptions and predictions of CC06. Cohen and Craig (2006) used a convection-permitting model in a radiative–convective equilibrium setup with different large-scale forcing and vertical wind shear strengths and found that p(m) was well approximated by an exponential distribution for all settings but that p(N) was broader than predicted by Eq. (2) because of cloud clustering. Despite this, the simulated mass flux variability was up to 20% lower than predicted, which they attributed to compensating errors. Davoudi et al. (2010) confirmed this result in their simulations with a diurnal cycle of radiation. Scheufele (2014) tested the sensitivity of the Cohen and Craig (2006) results to model resolution and found that clouds are more strongly clustered at higher resolutions. Furthermore, to reproduce the exponential cloud mass flux distributions, a separation of connected clouds into individual updrafts becomes necessary. Davies (2008) focused on the variation of the CC06 predictions in an idealized diurnal cycle setup. Cloud clustering and convective variability were strongest shortly after convective initiation and during the decline of convective activity. Studying the variability scaling in trade wind shallow cumulus, Sakradzija et al. (2015) saw a drastic increase in cloud organization once precipitation started to form. Consequently, the mass flux variability was many times larger than the predictions of the statistical theory.
c. Motivation and aims of this paper
The studies mentioned above have two things in common: first, they show that the cloud field is organized in contradiction with the CC06 assumptions, and second, all the studies were conducted with simplified, idealized setups. These setups generally aim to represent tropical equilibrium convection or a diurnal cycle over land in the absence of any large-scale changes in forcing or other complications such as land surface variations or orography. Stochastic parameterizations in general circulation models, however, must be able to cope with a wide variety of convection around the globe. It remains unclear if, and to what extent, the CC06 theory and the stochastic parameterizations based on it are useful for representing complex real-world weather situations.
In this study, we aim to test the assumptions and predictions of CC06 in simulations of midlatitude summertime convection over land. In particular, we ask the following research questions:
How well do the CC06 predictions hold up for complex, nonequilibrium convection?
Are there systematic deviations, and can we explain them?
What role does convective organization play?
To answer these questions, we set up ensemble simulations (section 2) that allow us to quantitatively measure convective variability in real-world case studies (section 3). The analysis details are described in section 4. In section 5, we compare the predictions of the CC06 theory to our simulation results and identify systematic deviations. We then focus on cloud organization (section 6) and the cloud mass flux distribution (section 7). A discussion of the implications for stochastic parameterizations follows in section 8 before a summary in section 9.
2. Numerical simulations, observations, and computational reproducibility
a. COSMO model and ensemble setup
The numerical simulations are done with the Consortium for Small-Scale Modeling (COSMO) model (Baldauf et al. 2011). The horizontal grid spacing Δx is 0.025°, roughly 2.8 km, with 50 levels in the vertical. There is no parameterization of deep convection, but shallow convection is parameterized using the Tiedtke scheme [for complete information on the parameterizations used, see Doms et al. (2011)]. The domain spans 357 grid points in either horizontal direction centered over Germany at 50°N, 10°E. For the analysis, a 256 × 256 gridpoint subdomain, roughly 717 km × 717 km, at the center of the simulation domain is considered to avoid boundary spinup effects. The 50-member ensemble simulations are started at 0000 UTC on each of the 12 consecutive days (see section 3) with a simulation time of 24 h. For initial and boundary conditions, we use hourly interpolated deterministic COSMO European version (COSMO-EU) analyses, which have a horizontal resolution of 7 km. Each ensemble member differs only in the random seed used for the stochastic perturbation scheme, described below, which has the effect of randomly shuffling the convective cells. Additionally, one deterministic simulation is run without the stochastic perturbation scheme. The output frequency is 60 min. The model name lists are saved in the online repository accompanying this paper (see section 2e).
Given unlimited computational resources, it would be desirable to run the model at a higher resolution to actually resolve the cloud features. With our computational constraints, however, a trade-off between resolution, ensemble size, and length of the simulation period is necessary. To ensure the simulations create realistic cloud features, we compare the model to radar observations (see section 2c). We will also discuss the potential impact of resolution on our results in section 9.
b. Stochastic boundary layer perturbations
The physically based stochastic perturbation (PSP) scheme was first proposed by Kober and Craig (2016). It represents a general framework for adding process-specific perturbations to reintroduce missing variability on the grid scale of convection-permitting models that is associated with unresolved processes. The process considered here is boundary layer turbulence. In the appendix, we present an updated formulation of the scheme to clarify the physical rationale. In this study, the stochastic scheme enables us to obtain many different realizations of the convective cloud fields for the same large-scale flow. One limitation of this approach is the fact that only subgrid turbulence is considered in the stochastic perturbations. Other subgrid processes such as orography might preferentially trigger convection in particular locations, thereby violating the CC06 assumptions.
c. Radar-derived precipitation observations
To validate our simulations we use the Radar Online Aneichung (RADOLAN) radar-weighted (RW) product provided by the German Weather Service (DWD; DWD 2017). These data contain estimates of 1-hourly precipitation accumulations based on radar reflectivities adjusted using rain gauges. Even though the precipitation values are not direct observations, we use the term observation for the RADOLAN RW product in the rest of the text. The original data, which have a spatial resolution of 1 km, are adapted to the COSMO model grid. For all model–observation comparisons, only grid points with observation data are used. If daily time series are computed, a joint mask is used only including grid points for which observation data are available at all times throughout the day. The hourly RADOLAN RW products are valid at 10 min to the full hour, whereas our model precipitation accumulations are written to the full hour. For the purposes of this study, we think it is reasonable to neglect this difference. Last, because of the automatic adjustment of the radar observations with rain gauge data, some unrealistically high precipitation values occur, sometimes exceeding 300 mm h−1 (K. Stephan 2017, personal communication). Since we are not computing forecast scores in this study, an ad hoc measure of removing all grid points with hourly precipitation accumulations larger than 100 mm h−1 turned out to be sufficient to remove any significant artifacts.
d. Displacement growth of stochastic perturbations
Figure 1a shows precipitation snapshots of the observations and two ensemble members. The large-scale precipitation pattern is very similar, but the individual convective elements appear to be completely uncorrelated. To quantitatively assess the displacement at different scales, we look at the saturation of the ensemble difference spectra of kinetic energy and precipitation [Figs. 1b,c; for a detailed description of the calculation of the spectra, see Selz and Craig (2015b)]. The kinetic energy spectrum shows typical signs for upscale error growth: small scales saturate first, followed by saturation at increasingly larger scales. At the scale of large precipitation patterns, around 300 km, the difference between the ensemble members is below 20%. This confirms that the large-scale forcing is similar between all ensemble members. In contrast, the precipitation spectrum at the scale of individual convective elements, approximately 50 km, is almost fully decorrelated after 6 h. The combination of similar large-scale conditions and displaced convection allows us to investigate convective variability in real-weather case studies.
e. Computational details and reproducibility
This subsection closely follows the guidelines on publishing computational results proposed by Irving (2016). The analysis and plotting of model and observation data were done using Python. The Python libraries Numerical Python (NumPy; van der Walt et al. 2011) and Scientific Computing Tools for Python (SciPy; Jones et al. 2001) were used heavily. The raw data were read with the Python module cosmo_utils (code available upon request). The figures were plotted using the Python module Matplotlib (Hunter 2007). Plotting colors were chosen according to the hue–chroma–luminance color space (Stauffer et al. 2015). Some plots were postprocessed using the vector graphics program Inkscape.
To enable reproducibility of the results, this paper is accompanied by a version-controlled code repository (https://github.com/raspstephan/convective_variability_analysis) and a Figshare repository (Rasp 2017), which contains a snapshot of the code repository at the time of submission and supplementary log files for each figure. These log files contain information about the computational steps taken from the raw data to the generation of the plots. While the model code and initial data are not openly available, a detailed technical description of the model simulations can be found in the cosmo_runscripts directory of the code repository. The Jupyter notebooks (Kluyver et al. 2016) mentioned in the text are stored in the directory jupyter_notebooks of the repository. Links to noninteractive versions of the notebooks can be found on the front page of the Github repository; rendered PDF versions are also added to the supplement of this paper.
3. Weather situation and precipitation in model and observations
a. Synoptic situation and convective regime
The period from 26 May to 9 June 2016 was characterized by extraordinary extreme weather over central Europe and, in particular, Germany (Piper et al. 2016). Heavy precipitation, exceeding a 200-yr return period in some regions of southern Germany, caused flash floods that, together with hail measuring up to 5 cm in diameter and 12 confirmed tornadoes, resulted in damages of over EUR 5 billion. A persistent heavy-precipitation period of similar length is unprecedented in a 55-yr climatology. For this study, we selected 12 contiguous days from 28 May to 8 June.
The period can be roughly divided into two phases (Fig. 2). In the first, from 28 May to approximately 3 June, an upper-level trough dominated European weather, subsequently developing into a cutoff low. This upper-level feature caused strong synoptic lifting and several weak surface low pressure systems. These were accompanied by cyclonic circulation centered over the Alpine region and southeasterly advection of moist air over central Europe. This lead to a destabilization of the atmosphere, particularly over southern Germany. In the second phase, from 4 to 8 June, the cutoff gave way to a stationary upper-level ridge, typical for an omega-blocking situation. This caused very persistent weather with large instability building up over southern Germany.
The synoptic instability, combined with strong surface heating, provided a favorable environment for the development of deep convection. Precipitation followed a diurnal pattern on most days (Fig. 3) but was modulated by synoptic lifting on several days. This is most noticeable in the night of 29–30 May, in which a mesoscale convective system in association with large-scale ascent covered most of southern Germany (domain-mean precipitation plots for each individual day can be found in the supplement). The precipitation lags the convective available potential energy (CAPE) by around 4 h. The chosen period represents a variety of nonequilibrium convection over land and is therefore well suited to test the CC06 theory outside of the regime for which it was originally designed.
b. Precipitation in simulations and observations
Finally, we want to compare the mean precipitation in our simulations with the PSP scheme (ens), without the PSP scheme (det), and in the observation (obs) in Fig. 3. The PSP scheme causes an earlier onset and a higher maximum in precipitation on several days, which is the expected systematic effect (Kober and Craig 2016). The maximum precipitation increase caused by the stochastic perturbations is 10%, indicating that the model behavior is not drastically altered.
Larger differences can be seen between the model simulations and the observations. It is important to note that we are not interested in obtaining simulations that are as close to the observations as possible. Rather, our aim is to test whether the general weather situation is reproduced and if there are any systematic differences. On most days, the precipitation amounts in the simulations and observations match well. There is, however, a systematic early decline of precipitation in the late afternoon. Inspecting the precipitation stamps (available in the supplement) suggests that, in some situations, the model is not able to reproduce lasting organized convection when the forcing becomes weaker (an example for this can be seen in Fig. 8). While we are confident that the general characteristics of the convection are well captured in our simulations, it is important to keep this systematic deviation in mind for the subsequent analysis. Furthermore, there is a spinup peak in the first few forecast hours caused by the stochastic perturbations, which is typical when starting from a downscaled analysis. To avoid this effect and to allow the perturbations to develop, we start all our subsequent analyses from 0600 UTC.
4. Cloud identification and computation of statistics
To test the CC06 predictions and assumptions, we need to identify the individual clouds and compute statistics of the modeled updraft mass flux field. The process of identifying cloud objects and computing the radial distribution function is illustrated in an accompanying Jupyter notebook (called cloud_identification_and_rdf.ipynb). We identify convective updrafts by using a vertical velocity threshold w > 1 m s−1 combined with a positive cloud water content at model level 30. This corresponds to a height above ground level of around 3000 m in the terrain-following COSMO grid. Ideally, one would choose the cloud base since this is what many convection parameterizations use, but this is difficult to determine. Previous studies have shown that the mass flux statistics are relatively insensitive to changes in the analysis height in a reasonable range around the chosen level [see, e.g., Fig. 13 of Davoudi et al. (2010) or Fig. 5.6 in Davies (2008)]. From the resulting binary field, objects are classified as pixels that share an edge. Visual inspection suggested that many “objects” are in fact conglomerates of several touching updrafts [this has also been found by Scheufele (2014)]. We therefore separate the cloud objects using a local maximum filter in combination with a watershed algorithm (Beucher and Meyer 1992). For the local maximum filter, we use a search footprint of 3 × 3 grid points. All subsequently presented analysis is done using the separated objects unless otherwise stated. For further information and sensitivity tests of the cloud separation algorithm, see the aforementioned Jupyter notebook cloud_identification_and_rdf.ipynb.
Computation of statistics
Additionally, we compute statistics for the mean heating rate Q for each coarse box j at the same model level. Note that, unlike M, Q can be negative. The variables M and Q are well correlated, with a correlation coefficient of 0.8 across all scales.
5. Comparison of CC06 predictions with simulations
a. Scaling of standard deviation with the mean
Our first research question is whether the CC06 theory is applicable for complex real-weather situations. Assuming that the variation of the mean cloud mass flux 〈m〉 is small, an assumption we will revisit later, the CC06 theory states that the domain-total mass flux standard deviation increases with the square root of its mean [Eq. (4)]. The combined data, including all coarse boxes for all scales n for all days and time steps, show that, over more than three orders of magnitude in 〈M〉 and almost two orders of magnitude in horizontal scale, the mass flux standard deviation is well described by the proposed square root relation (Fig. 4a). The fit parameter b is 9.62 × 107 kg s−1. The variability drops off at both ends, which is reflected in a change in b if the curve is fitted to each scale individually—smaller scales have a steeper slope.
The heating rate Q, being a horizontally averaged quantity, needs to be multiplied by the area of the coarse box to test for the scaling; Q × A behaves similarly to M (Fig. 4b). The square root scaling applies over an even larger range of scales, but the uncertainty increases toward small values of Q × A. In contrast to the updraft mass flux, which is by definition positive, the heating rate can also be negative. Therefore, the heating rate standard deviation is not constrained to go to zero as its mean goes to zero, which is a potential cause for the deviations at smaller scales.
Additionally, we fit a linear relation, which corresponds to multiplicative noise as in the stochastically perturbed parameterization tendencies (SPPT) scheme (Shutts and Palmer 2007). Note that changing b does not change the apparent slope on a log–log plot but only displaces the line. The linear relation is not able to capture the standard deviation scaling of 〈M〉 or Q × A in our simulations. This is in line with the findings of Shutts and Pallarès (2014), who argue that, for convection, the standard deviation is better described by a square root scaling.
This first general test of the main prediction of the CC06 theory confirms its applicability even in the complex situations in our simulations. One key feature of the theory is its scale awareness. This can be seen in the slope of the fit b, which varies by a factor of 2 as the mean mass flux increases by more than three orders of magnitude (see insert of Fig. 4a). A small variation of the slope indicates that the CC06 scaling describes the variability for a range of different coarsening resolutions without the need for retuning of the parameters. This resolution independence is a desirable trait for stochastic parameterizations, particularly as the gray zone is approached. Multiplicative perturbations, in contrast, appear to be inherently resolution dependent. We discuss the implications of our findings for SPPT in section 8.
b. Deviations from the CC06 theory
The analysis above highlights some systematic deviations of our simulation results from the CC06 predictions. The cloud number distribution p(N), an indicator of cloud organization, varies with the time of day, predominantly affecting larger scales. Furthermore, the cloud mass flux distribution p(m) is narrower for smaller scales. In the following two sections, we explore the physical processes behind these deviations.
6. Cloud clustering
The parameter α indicates that cloud clustering strongly affects convective variability. Now we introduce another, independent metric of cloud clustering, the radial distribution function (RDF), which allows us to better understand how clouds organize. The RDF measures how many times more likely than random a cloud is located at a certain distance r to another cloud. A completely random distribution would give RDF = 1 for all r. The mathematical and computational details of the algorithm are explained in the accompanying Jupyter notebook cloud_identification_and_rdf.ipynb. To enable comparison with observation data, we use the hourly precipitation field for the RDF analysis. The results for the mass flux field are qualitatively similar. The RDF shows some variability between simulation days (see supplement). In particular, the first, synoptically dominated phase shows relatively constant RDFs, while the second, locally forced phase shows a stronger diurnal variation. This diurnal cycle also shows up in the composite in Fig. 7. Figure 7a shows the RDF as a function of the radius for two times, 1400 and 2100 UTC. The simulations with and without stochastic perturbations are very similar, confirming that the PSP scheme does not drastically alter the behavior of clouds. The observations have their peak RDF value at larger distances. This agrees with the visual impression that clouds in the model are more intermittent than in the observations, a characteristic of cloud-resolving models also observed in other studies (Hanley et al. 2015; Nguyen et al. 2017). In general, the RDF indicates that cloud clusters have a scale of around 50–100 km (25–50 km is the radius at which the RDFs drop to half their peak value) and that the clustering is stronger in the evening.
Since the RDF changes mostly in amplitude, not in shape, we use the maximum RDF value as a proxy of clustering strengths. Figure 7b therefore illustrates the diurnal variation in clustering. The results strongly correspond to the evolution of α: clouds are more clustered in the morning and evening. Similar results were found by Davies (2008) in her idealized diurnal cycle experiments. The scale dependence of α may be explained by the typical cluster size of around 50–100 km. The small scales are close to the typical cloud separation distance, around 10 km. Consequently, the clustering does not fully impact these scales. The medium scales correspond to the typical clustering size and, therefore, experience the strongest impact resulting in larger values of α. Finally, the large scales can contain many cloud clusters. During the day, these clusters appear to be more regularly spaced, leading to values of α smaller than one. This could be related to orography, the land surface, or synoptic variations within the domain. Toward the evening, the convective activity drops off rapidly. The few remaining active clusters are less constrained by the large-scale forcing and rely more on internal processes to maintain convection. Supporting evidence for this argument can be found in the ensemble precipitation variability (Fig. 3), which stays approximately constant even as the total precipitation amount rapidly declines.
To understand the nature of cloud clustering, it is important to note that the RDF measures the cloud clustering relative to all clouds in the domain. In fact, the absolute cloud number density around existing clouds changes little in the diurnal cycle. The variation in the RDF stems primarily from the increased isolation of the existing clusters in the morning and evening. Figure 8 illustrates a typical sequence of these events. In the morning, convection is clustered because of two processes: larger precipitation patches from the previous day in the south and spatially confined regions of early convective cells in the north. In the early afternoon, when convection is strongest, convective cells are distributed over most of the domain. Local clustering still occurs, but the clusters are simply more numerous. In the evening, as the forcing subsides, only strong, congregated convection survives. On this particular day, the formation of a squall line over western Germany is missed by all of the simulations. As mentioned in section 3, this failure of the model to produce larger organized features is apparent on several days.
7. Cloud mass flux distribution
The parameter β shows a substantial dependence on the coarse-graining scale and a weaker systematic diurnal cycle. In this section, we further investigate the cloud mass flux distribution. Specifically, we inspect the variations of the mean mass flux, the geographical variation of 〈m〉, and the impact of the cloud separation algorithm.
a. Temporal and spatial variations of 〈m〉
Figure 9 shows how the mean cloud mass flux and size varies with the time of day. Note that 〈m〉 increases as the total convective activity picks up, but its peak is delayed around 3 h behind 〈M〉. Also 〈σ〉 is smallest during the increase of convection around 1200 UTC and then increases toward the evening. The difference between the separated and nonseparated 〈m〉 are also strongest at that time; 〈m〉 fluctuates by about ±20% throughout the diurnal cycle, while 〈M〉 varies by a factor of 4. This is in line with previous findings of how the mean cloud mass flux changes with the forcing (Cohen and Craig 2006; Scheufele 2014): the primary effect of increasing the large-scale cooling is an increase in the number of clouds; the changes in the cloud properties are secondary. For this reason, 〈m〉 is often a prescribed constant, for example, in the Plant and Craig (2008) parameterization. To assess the impact of using a fixed 〈m〉, we compute RV using the overall mean cloud mass flux of 5.07 × 107 kg s−1 (Fig. 6a). This increases the diurnal variation for the small and medium scales. Additionally, the spread of these scales is increased. These changes indicate that the variations in 〈m〉 impact the CC06 variance predictions, but the magnitude is secondary compared to other systematic biases.
In section 5b, we found that small and medium scales seem to have a narrower than expected cloud mass flux distribution. The precipitation snapshots (Figs. 1 and 8 and supplemental material) lead us to hypothesize that 〈m〉 varies geographically. There appear to be regions with mostly larger clouds and other regions with mostly smaller clouds. This could result from differences in the synoptic situation, orography, cloud–cloud interactions, or land surface variations leading to changes in the Bowen ratio, which was found to be important for the shallow cumulus mass flux distribution (Sakradzija and Hohenegger 2017). On the domain scale, regions with different mean cloud sizes are included, potentially increasing the width of cloud mass flux distribution.
b. Overall cloud mass flux distribution—The impact of cloud separation
Next, we take a closer look at the overall cloud size distribution for all dates, times, and ensemble members (Fig. 10). If the clouds are separated using the local maximum method, the distribution is close to exponential. Clouds collect near the grid scale, an observation also made by Scheufele (2014). Without the cloud separation, the distribution is closer to a power law. Windmiller (2017) argues that a power-law cloud size distribution is the result of cloud clustering when individual clouds drawn from an exponential distribution overlap. It is curious, however, that this effect already shows up at resolutions of 2.8 km. Previous kilometer-scale studies in idealized setups (Cohen and Craig 2006; Scheufele 2014) found that cloud overlap only became significant at much higher resolutions. This seems to suggest that in our real-weather case studies, clouds tend to congregate more. The deterministic run and the stochastically perturbed ensemble agree well for smaller mass fluxes. The differences for the larger bins are most likely caused by the much smaller sample size of the deterministic run.
8. Implications for (stochastic) parameterizations of convection
Much of the interest in a theory for convective variability such as CC06 comes from the application to stochastic convective parameterization. The theory provides guidance for how the stochastic variability should change with meteorological situation and model resolution. The evaluation of the theory presented here has both positive and negative implications for a convection scheme, like that of Plant and Craig (2008), which is based on the CC06 theory. Most importantly, the basic square root scaling of mass flux standard deviation with the total mass flux in any finite region holds to a reasonable degree of approximation. Although the convective cases considered here are much more complex than the radiative–convective equilibrium environment for which the theory was first developed and tested, the additional complexity does not completely alter the behavior. Since it is the variability scaling that relates the amplitude of convective variability to both the convective closure and the model resolution, its use in stochastic parameterization is supported. In contrast, the most commonly used stochastic parameterization method in numerical weather prediction, SPPT, scales the standard deviation linearly proportional to the convective amount, which is not supported by our results. This does not imply that SPPT should not be used since it is not a convection parameterization but an “all inclusive” method to account for model errors and primarily increase ensemble spread. Furthermore, SPPT must also represent other processes that may scale differently. However, to the extent that small-scale variability is associated with convective clouds, the incorrect scaling implies that the scheme will need to be retuned whenever factors such as model resolution are changed.
However, the significant deviations from CC06 theory found in this study should be accounted for in parameterization. This is not simple, since the dominant deviation was related to convective clustering on a characteristic scale of order 100 km, varying in intensity throughout the diurnal cycle. Unlike the simple model of unorganized convection, the clustering couples different grid columns in the large-scale model—an effect that can be difficult to implement. The model dynamics can only represent organization on scales larger than the effective resolution, approximately 5Δx (Skamarock 2004). Furthermore, even within a grid box, convective clustering may impact other parameterized processes including radiation and surface fluxes, which would need to be accounted for in the relevant schemes. The changes in 〈m〉 are technically easier to implement in a stochastic convection scheme, but a good understanding of what determines 〈m〉 is still lacking. Additionally, we note that it is important to consider the individual updrafts rather than overlapping features to construct distributions that agree with the theory.
Finally, it should be emphasized that we focused on spatial variability in this study and did not address the important question of temporal structures or convective memory, both of which are important for parameterization. The Plant and Craig (2008) parameterization has some memory by giving each random cloud a fixed cloud lifetime larger than the convective time step. Sakradzija et al. (2016) further related the cloud lifetime to the mass flux. No attempt has been made, however, to include the effect of organization on convective memory.
9. Conclusions
In this study, we tested a minimally simple theory of convective variability developed by Craig and Cohen (2006) in complex summertime weather over land. The theory’s core building blocks, a random distribution of clouds in space and an exponential mass flux distribution, were mainly developed with tropical maritime convection in mind. We chose 12 consecutive, high-impact weather days over Germany to test the theory’s prediction outside of its comfort zone. To quantify the variability of convection in real-weather case studies, we set up a 50-member ensemble in which all members had the same large-scale conditions but the convective cells were displaced using stochastic boundary layer perturbations.
In general, the mass flux standard deviation scales with the square root of its mean, in accordance with the theory. We did find systematic deviations, however. First, clouds tend to be clustered, violating the no cloud–cloud interaction assumption, more strongly so in the morning and evening. The typical cluster size is around 50–100 km. Second, the mean mass flux per cloud varies geographically—clouds of a certain size appear to congregate—and temporally, by around ±20% in the diurnal cycle. This indicates that the cloud properties are not entirely independent of the large-scale conditions, as assumed by the theory.
Our findings support the applicability of the CC06 theory for stochastic parameterizations in global models but also highlight areas for improvement. Particularly, the organization of clouds, most likely caused by cold pool dynamics, remains an outstanding issue. The incorporation of cloud organization in convection parameterization requires either a diagnostic grid-scale indicator of subgrid clustering or a prognostic subgrid variable.
While the 12-day period in our study presents a variety of nonequilibrium convections, they still only represent a narrow selection of all possible convective situations. In particular, it would be interesting to investigate how well the theory holds up for larger organized systems such as mesoscale convective systems or even tropical cyclones. Furthermore, the realism of our simulations is somewhat limited by our grid spacing. As shown in comparison with radar observations, our kilometer-scale model produces a large number of gridcell storms and is, on several days, not able to create lasting organization into the night. A more detailed investigation of the resolution dependence of cloud statistics would certainly be desirable, particularly as recent studies (Craig and Dörnbrack 2008; Scheufele 2014; Hanley et al. 2015; Heath et al. 2017) show a lack of convergence even at horizontal grid spacings of around 100 m.
Acknowledgments
The research leading to these results has been done within the subproject “Representing forecast uncertainty using stochastic physical parameterizations” of the Transregional Collaborative Research Center SFB/TRR 165 “Waves to Weather” funded by the German Research Foundation (DFG). We thank Robert Redl for technical support and the German Weather Service (DWD) for providing the COSMO model and the initial and boundary conditions, as well as ECMWF for providing the data for our synoptic charts. Last, we would like to thank the three anonymous reviewers for their help in improving this paper.
APPENDIX
Rationale and Formulation of the Stochastic Boundary Layer Perturbations
In the atmosphere, the boundary layer is characterized by turbulent eddies, which occur on a wide range of scales from millimeters to approximately the height of the boundary layer, typically around 1 km. Convection-permitting models with horizontal grid spacings of order 1 km are not able to represent these eddies. Therefore, parameterizations are necessary to describe the effect of the unresolved turbulence on the grid scale. Usually, these boundary layer parameterizations are assumed to represent the average effect of many eddies and are deterministic in nature: given a certain grid-scale condition, they always produce the same mean response. The only variance thus comes from the resolved boundary layer circulation. On a scale equivalent to the grid length of a kilometer-scale model, however, the turbulent response can vary significantly from realization to realization. Therefore, the variability in the model, expressed by the joint probability density function (PDF) of boundary layer quantities, can be much smaller than the corresponding variability in nature. While the mean boundary evolution might still be adequately represented, this lack of variability can drastically alter the grid-scale behavior if nonlinear convection occurs.
On typical summer days, turbulence is driven by surface heating, leading to a growth of the boundary layer after sunrise until the capping inversion is reached. At this point, only parcels with enough momentum as well as positive humidity and temperature perturbations can break through the inversion layer to eventually trigger deep convection. Parcels that have these properties originate from the extreme end of the joint boundary layer PDF. Reduced variability accordingly reduces the probability of such parcels existing. This can lead to systematic biases in model behavior. In other words, there is a disparity between the convection, which is assumed to be resolved, and the process responsible for triggering it, namely, boundary layer turbulence, which is not resolved. Note that operational models such as the COSMO model, which is specifically designed for precipitation forecasts, are often tuned to produce the correct diurnal cycle of precipitation at the expense of other biases.
The subgrid standard deviation
REFERENCES
Baldauf, M., A. Seifert, J. Förstner, D. Majewski, M. Raschendorfer, and T. Reinhardt, 2011: Operational convective-scale numerical weather prediction with the COSMO model: Description and sensitivities. Mon. Wea. Rev., 139, 3887–3905, https://doi.org/10.1175/MWR-D-10-05013.1.
Bengtsson, L., M. Steinheimer, P. Bechtold, and J.-F. Geleyn, 2013: A stochastic parametrization for deep convection using cellular automata. Quart. J. Roy. Meteor. Soc., 139, 1533–1543, https://doi.org/10.1002/qj.2108.
Berner, J., and Coauthors, 2016: Stochastic parameterization: Toward a new view of weather and climate models. Bull. Amer. Meteor. Soc., 98, 565–588, https://doi.org/10.1175/BAMS-D-15-00268.1.
Beucher, S., and F. Meyer, 1992: The morphological approach to segmentation: The watershed transformation. Mathematical Morphology in Image Processing, E. R. Dougherty, Ed., Marcel Dekker, 433–481.
Bierdel, L., P. Friederichs, and S. Bentzien, 2012: Spatial kinetic energy spectra in the convection-permitting limited-area NWP model COSMO-DE. Meteor. Z., 21, 245–258, https://doi.org/10.1127/0941-2948/2012/0319.
Buizza, R., M. Milleer, and T. N. Palmer, 1999: Stochastic representation of model uncertainties in the ECMWF ensemble prediction system. Quart. J. Roy. Meteor. Soc., 125, 2887–2908, https://doi.org/10.1002/qj.49712556006.
Christensen, H. M., J. Berner, D. R. B. Coleman, and T. N. Palmer, 2017: Stochastic parameterization and El Niño–Southern Oscillation. J. Climate, 30, 17–38, https://doi.org/10.1175/JCLI-D-16-0122.1.
Cohen, B. G., and G. C. Craig, 2006: Fluctuations in an equilibrium convective ensemble. Part II: Numerical experiments. J. Atmos. Sci., 63, 2005–2015, https://doi.org/10.1175/JAS3710.1.
Craig, G. C., and B. G. Cohen, 2006: Fluctuations in an equilibrium convective ensemble. Part I: Theoretical formulation. J. Atmos. Sci., 63, 1996–2004, https://doi.org/10.1175/JAS3709.1.
Craig, G. C., and A. Dörnbrack, 2008: Entrainment in cumulus clouds: What resolution is cloud-resolving? J. Atmos. Sci., 65, 3978–3988, https://doi.org/10.1175/2008JAS2613.1.
Davies, L., 2008: Self-organisation of convection as a mechanism for memory. Ph.D. thesis, University of Reading, 157 pp.
Davoudi, J., N. A. McFarlane, and T. Birner, 2010: Fluctuation of mass flux in a cloud-resolving simulation with interactive radiation. J. Atmos. Sci., 67, 400–418, https://doi.org/10.1175/2009JAS3215.1.
Doms, G., and Coauthors, 2011: A description of the nonhydrostatic regional COSMO model. Part II: Physical parameterization. COSMO Tech. Rep., 161 pp.
Dorrestijn, J., D. T. Crommelin, J. A. Biello, and S. J. Böing, 2013: A data-driven multi-cloud model for stochastic parametrization of deep convection. Philos. Trans. Roy. Soc. London, 371A, 20120374, https://doi.org/10.1098/rsta.2012.0374.
DWD, 2017: Analysen radarbasierter stündlicher (RW) und täglicher (SF) Niederschlagshöhen. DWD, https://www.dwd.de/DE/leistungen/radolan/radolan.html.
Hanley, K. E., R. S. Plant, T. H. M. Stein, R. J. Hogan, J. C. Nicol, H. W. Lean, C. Halliwell, and P. A. Clark, 2015: Mixing-length controls on high-resolution simulations of convective storms. Quart. J. Roy. Meteor. Soc., 141, 272–284, https://doi.org/10.1002/qj.2356.
Heath, N. K., H. E. Fuelberg, S. Tanelli, F. J. Turk, R. P. Lawson, S. Woods, and S. Freeman, 2017: WRF nested large-eddy simulations of deep convection during SEAC4RS. J. Geophys. Res. Atmos., 122, 3953–3974, https://doi.org/10.1002/2016JD025465.
Hunter, J. D., 2007: Matplotlib: A 2D graphics environment. Comput. Sci. Eng., 9, 90–95, https://doi.org/10.1109/MCSE.2007.55.
Irving, D., 2016: A minimum standard for publishing computational results in the weather and climate sciences. Bull. Amer. Meteor. Soc., 97, 1149–1158, https://doi.org/10.1175/BAMS-D-15-00010.1.
Jones, E., and Coauthors, 2001: SciPy: Open source scientific tools for Python. Accessed 28 August 2017, http://www.scipy.org.
Jones, T. R., and D. A. Randall, 2011: Quantifying the limits of convective parameterizations. J. Geophys. Res., 116, D08210, https://doi.org/10.1029/2010JD014913.
Keane, R. J., G. C. Craig, C. Keil, and G. Zängl, 2014: The Plant–Craig stochastic convection scheme in ICON and its scale adaptivity. J. Atmos. Sci., 71, 3404–3415, https://doi.org/10.1175/JAS-D-13-0331.1.
Kluyver, T., and Coauthors, 2016: Jupyter notebooks—A publishing format for reproducible computational workflows. Positioning and Power in Academic Publishing: Players, Agents and Agendas, IOS Press, 87–90, https://doi.org/10.3233/978-1-61499-649-1-87.
Kober, K., and G. C. Craig, 2016: Physically based stochastic perturbations (PSP) in the boundary layer to represent uncertainty in convective initiation. J. Atmos. Sci., 73, 2893–2911, https://doi.org/10.1175/JAS-D-15-0144.1.
Mellor, G. L., and T. Yamada, 1982: Development of a turbulence closure model for geophysical fluid problems. Rev. Geophys., 20, 851–875, https://doi.org/10.1029/RG020i004p00851.
Nguyen, H., A. Protat, H. Zhu, and M. Whimpey, 2017: Sensitivity of the ACCESS forecast model statistical rainfall properties to resolution. Quart. J. Roy. Meteor. Soc., 143, 1967–1977, https://doi.org/10.1002/qj.3056.
Ollinaho, P., and Coauthors, 2017: Towards process-level representation of model uncertainties: Stochastically perturbed parametrizations in the ECMWF ensemble. Quart. J. Roy. Meteor. Soc., 143, 408–422, https://doi.org/10.1002/qj.2931.
Piper, D., M. Kunz, F. Ehmele, S. Mohr, B. Mühr, A. Kron, and J. Daniell, 2016: Exceptional sequence of severe thunderstorms and related flash floods in May and June 2016 in Germany—Part 1: Meteorological background. Nat. Hazards Earth Syst. Sci., 16, 2835–2850, https://doi.org/10.5194/nhess-16-2835-2016.
Plant, R. S., and G. C. Craig, 2008: A stochastic parameterization for deep convection based on equilibrium statistics. J. Atmos. Sci., 65, 87–105, https://doi.org/10.1175/2007JAS2263.1.
Raschendorfer, M., 2001: The new turbulence parameterization of LM. COSMO Newsletter, No. 1, DWD, Offenbach, Germany, 89–97, http://www.cosmo-model.org/content/model/documentation/newsLetters/newsLetter01/newsLetter_01.pdf.
Rasp, S., 2017: Variability and clustering of mid-latitude summertime convection—Metadata. Figshare, accessed 23 November 2017, https://doi.org/10.6084/m9.figshare.c.3864490.
Sakradzija, M., and C. Hohenegger, 2017: What determines the distribution of shallow convective mass flux through a cloud base? J. Atmos. Sci., 74, 2615–2632, https://doi.org/10.1175/JAS-D-16-0326.1.
Sakradzija, M., A. Seifert, and T. Heus, 2015: Fluctuations in a quasi-stationary shallow cumulus cloud ensemble. Nonlinear Processes Geophys., 22, 65–85, https://doi.org/10.5194/npg-22-65-2015.
Sakradzija, M., A. Seifert, and A. Dipankar, 2016: A stochastic scale-aware parameterization of shallow cumulus convection across the convective gray zone. J. Adv. Model. Earth Syst., 8, 786–812, https://doi.org/10.1002/2016MS000634.
Scheufele, K., 2014: Resolution dependence of cumulus statistics in radiative-convective equilibrium. Ph.D. thesis, Ludwig–Maximilian University of Munich, 121 pp.
Selz, T., and G. C. Craig, 2015a: Simulation of upscale error growth with a stochastic convection scheme. Geophys. Res. Lett., 42, 3056–3062, https://doi.org/10.1002/2015GL063525.
Selz, T., and G. C. Craig, 2015b: Upscale error growth in a high-resolution simulation of a summertime weather event over Europe. Mon. Wea. Rev., 143, 813–827, https://doi.org/10.1175/MWR-D-14-00140.1.
Shutts, G. J., and T. N. Palmer, 2007: Convective forcing fluctuations in a cloud-resolving model: Relevance to the stochastic parameterization problem. J. Climate, 20, 187–202, https://doi.org/10.1175/JCLI3954.1.
Shutts, G. J., and A. C. Pallarès, 2014: Assessing parametrization uncertainty associated with horizontal resolution in numerical weather prediction models. Philos. Trans. Roy. Soc. London, 372A, 20130284, https://doi.org/10.1098/rsta.2013.0284.
Skamarock, W. C., 2004: Evaluating mesoscale NWP models using kinetic energy spectra. Mon. Wea. Rev., 132, 3019–3032, https://doi.org/10.1175/MWR2830.1.
Stauffer, R., G. J. Mayr, M. Dabernig, and A. Zeileis, 2015: Somewhere over the rainbow: How to make effective use of colors in meteorological visualizations. Bull. Amer. Meteor. Soc., 96, 203–216, https://doi.org/10.1175/BAMS-D-13-00155.1.
Taylor, H. M., and S. Karlin, 1998: An Introduction to Stochastic Modeling. Academic Press, 646 pp.
van der Walt, S., S. C. Colbert, and G. Varoquaux, 2011: The NumPy array: A structure for efficient numerical computation. Comput. Sci. Eng., 13, 22–30, https://doi.org/10.1109/MCSE.2011.37.
Wang, Y., and G. J. Zhang, 2016: Global climate impacts of stochastic deep convection parameterization in the NCAR CAM5. J. Adv. Model. Earth Syst., 8, 1641–1656, https://doi.org/10.1002/2016MS000756.
Wang, Y., G. J. Zhang, and G. C. Craig, 2016: Stochastic convective parameterization improving the simulation of tropical precipitation variability in the NCAR CAM5. Geophys. Res. Lett., 43, 6612–6619, https://doi.org/10.1002/2016GL069818.
Windmiller, J. M., 2017: Organization of tropical convection. Ph.D. thesis, Ludwig–Maximilian University of Munich, 137 pp., https://edoc.ub.uni-muenchen.de/21245/.
Note that the angle brackets used throughout the text describe ensemble means.