1. Introduction
Recent large floods in Europe, such as those that have occurred in the Meuse and Rhine basins in 1995, over large areas of the United Kingdom in 1998 and 2000, in the Elbe basin in 2002 and in the Danube basin in 2005 and 2006, have led to increased interest in research on upgrading existing operational flood forecasting systems. In this context special emphasis is put on the quantification of the uncertainty associated with a forecast. A variety of operational users are getting progressively accustomed to the reality that typical predicants such as precipitation, river stages, discharges, or volumes at critical locations need to be associated with an accurate measure of the predictive uncertainty. Several approaches [Beven and Binley 1992; Lardet and Obled 1994; Schaake and Larson 1998; Krzysztofowicz 1999; Liu et al. 2005; the Hydrological Ensemble Prediction Experiment (HEPEX) hosted at the National Weather Service and the European Centre for Medium-Range Weather Forecasts; Schaake et al. 2006] have been pursued with the aim of quantifying predictive uncertainty due to different sources.
The sources of predictive uncertainty can be manifold. They may be attributable to uncertainty of precipitation forecasts and observations, errors in the conceptualization of the system, model initialization, or process parameterization. Whatever the source of the uncertainty, the final probabilistic forecast must be accompanied by an adequate measure of the total uncertainty in the form of probability density functions, which are revised on the basis of a priori knowledge from historical information.
For the particular area of flood forecasting it is desirable to detach the responsibility of the forecaster, which consists in predicting a critical river level by means of operational models, from those of civil protection services in charge of making the decision of whether an evacuation or controlled flooding is necessary. This is only feasible if a reliable measure of the uncertainty associated with a discharge or river stage prediction is available and communicated by the forecaster to the decision maker.
There is thus a general need to provide robust uncertainty assessment tools, which can be implemented and used in operational flood forecasting systems. These tools should be internally consistent, statistically self-calibrating (Krzysztofowicz 1999), and require modest computational effort by performing the bulk of the computations offline. The latter attribute is especially relevant, because in emergency situations the time available between performing a forecast, issuing a warning, and organizing response actions is generally limited. This fact is exacerbated in river basins with typically short response times.
The general framework for a Bayesian forecasting system (BFS) presented by Krzysztofowicz (1999) constitutes a major effort in formalizing the quantification of uncertainty in the flood forecasting process, given arbitrary deterministic models. The BFS consists of three components: (i) an input uncertainty processor (IUP), (ii) a hydrologic uncertainty processor (HUP), and (iii) an integrator of uncertainty (IU). The BFS is based on a complete and consistent theory that identifies and separates relevant sources of uncertainty. It provides a general reference framework for building the uncertainty processors and the integrator, components that can be readily implemented in a real-time flood forecasting system. Sequel papers have implemented various BFS components as a one-branch processor (precipitation-independent model; Kelly and Krzysztofowicz 2000; Krzysztofowicz and Kelly 2000) or two-branch processor (precipitation-dependent model; Krzysztofowicz and Herr 2001), analyzed their statistical properties and carried out preliminary verifications. One of the most salient features of the BFS theory is the provision of a formal basis for the development and successive revision of its components, while preserving internal consistency and statistical properties, which make it appealing for operational use.
Operational experience has shown that the most prominent source of uncertainty in the flood forecasting process is due to the quantitative precipitation forecast (QPF). The aim of this paper is to present an application of a Bayesian processor for QPF output (BPO), which is to be used as an IUP for the BFS. The proposed processor quantifies the uncertainty of the QPF and is suited for offline execution. In contrast to the BFS introduced in Krzysztofowicz (1999), we suggest modifying the system whereby the originally proposed IUP, formulated as a precipitation uncertainty processor (Kelly and Krzysztofowicz 2000), is substituted by a Bayesian processor of output (BPO) (Krzysztofowicz 2004; Maranzano and Krzysztofowicz 2004).
The original IUP (Krzysztofowicz 1999) is formulated in terms of probability distributions of hydrological model output, conditional on measured precipitation. Here we propose to separate the hydrological model output from QPFs obtained from numerical weather prediction (NWP) in the Bayesian forecasting system. The aim is to enhance the segregation of sources of uncertainty by means of dedicated processors that are designed for processing hydrological and numerical weather prediction model output individually. In this manner the uncertainty on precipitation, given a numerical weather forecast, is treated explicitly in the BFS. Thus we propose in this paper an input uncertainty processor based on probability of precipitation and conditional on two variates: 1) the QPF and 2) the relative air humidity acting as ancillary indicator on the state of the atmosphere. Further below we implement the processor.
Given the large attention of the scientific and operational community on the ensemble techniques, the present application needs to be put into perspective in this context (Schaake et al. 2006). The IUP presented here is used for a deterministic precipitation forecast. An extension of the BPO for use with the ensemble prediction system is straightforward. An ensemble weather forecast represents an auxiliary randomization of the future precipitation and can be handled naturally within the BFS theory (Krzysztofowicz 2001a). The restriction of the present application to deterministic forecasts has been chosen to simplify the exposure. Finally we observe that attention needs to be paid to the fact that the present application, as well as the applications by Krzysztofowicz (1999), Krzysztofowicz and Kelly (2000), and Krzysztofowicz and Herr (2001) are restricted to typically small river basins, where the impact of spatial and temporal distribution of precipitation on the overall forecast uncertainty can be ignored. For larger basins however, this aspect needs to be addressed explicitly, as spatial and temporal variability of precipitation may have significant impact on the predictive uncertainty of the basin outflow forecasts. In such cases the predictive uncertainty on precipitation at individual locations within the basin and over a range of temporal scales needs to be integrated into the procedure. However, this aspect remains at present matter of further investigation. The following exposure is structured into five parts: Section 2 gives a brief overview of the BFS and introduces the BPO as an IUP in the BFS. Section 3 summarizes the theoretical concepts of the theory underlying the BPO, section 4 describes the actual implementation of the BPO, and section 5 is dedicated to a discussion of the results and draws the conclusions.
2. The Bayesian forecasting system
a. Assumptions
The BFS proposed by Krzysztofowicz (1999) rests on the general assumption that the principal sources of uncertainty in the flood forecasting process can be split into two groups: 1) the input uncertainty and 2) the hydrologic uncertainty.
The first group is attributable to all the forcing inputs of the hydrologic models (e.g., precipitation, temperature, and evaporation), assuming the model to be perfect and the internal model states to be well known.
The second group is due to the conceptualization errors in the description of the hydrological and hydraulic processes, the uncertainty in the choice of the parameters in the governing equations, the uncertainty of the knowledge of the internal model states, and the measurement errors of environmental variables, assuming that the precipitation is perfectly known and certain.
In this fashion the two principal sources of uncertainty are isolated from each other and treated separately. With particular attention to short-term flood forecasting (in the order of several hours), evaporation or temperature (their impact on the hydrologic response can be considered medium term) are assumed to have negligible contribution to the forecast, in contrast to precipitation. For this reason input uncertainty is considered attributable to precipitation only. The uncertainty due to the remaining meteorological variables is lumped into the hydrological uncertainty. However, the general structure of the BFS allows a relaxation of this hypothesis if a separate treatment of such sources of uncertainty is deemed necessary.
b. Variates
c. Predictive uncertainty
The integration operator (IU) in Eq. (9) combines the hydrological uncertainty and the uncertainty on the precipitation, yielding the total predictive uncertainty on H. The resulting probability density χ(·|h0, u, z, t), the PQFF, expresses the predictive uncertainty on the predictand H, conditional on the set (h0, u, z, t) and marginalized with respect to the actual precipitation w and the predictand estimate r.
3. Bayesian processor
a. Operational requirements
The implementation of the BFS based on Eq. (9) requires the estimation of the marginal densities ψ(·|z, t), and θ(·|s, h0, u, w), which represent the two groups of uncertainties. This constitutes a major challenge and is the key to a successful application of the processor. The distribution estimators IUP and HUP are both formulated as Bayesian processors, in which one revises a prior density by means of a likelihood function, which statistically represents the performance of the forecasting model over a historical period. For an application in a real-time forecasting system the operational requirements impose clear limitations on the methods that can be used in specifying the two conditional densities. Carrying out a large number of simulations, as for instance required when putting Monte Carlo–sampled data though a deterministic model (Lardet and Obled 1994), should be possibly avoided in a real-time forecasting situation because of time constraints. This issue becomes even more important when using computationally intensive process-based hydrological models. The method should also meet the criteria of robustness and protect the user automatically from producing an unrealistic forecast (Krzysztofowicz 2001b), if for instance model predictions turn out to be noninformative. Thus modeling the densities with the aid of analytical-numerical expressions, which can be prepared offline but evaluated and updated by processing a limited amount of information at the time of the forecast, is a valuable solution.
The IUP formulated as precipitation uncertainty processor (PUP) in Kelly and Krzysztofowicz (2000) and the HUP (Krzysztofowicz and Kelly (2000)) meet both criteria. These requirements, as will become clear later on, are also met by the BPO (Krzysztofowicz 2004), which is implemented hereunder and proposed as IUP in (9). The application of the HUP will not be addressed further in this paper and remains a matter for future investigations.
b. Input uncertainty processor
The IUP quantifies the uncertainty on the actual precipitation, given a precipitation forecast. Precipitation can be modeled as an intermittent process, in which the variate assumes either the value zero in the absence of rain or a positive value if an event occurs. Stochastically, precipitation is represented (Krzysztofowicz 2004) by the predictand W, a mixed (binary–continuous) variate combining two processes: (i) the binary process of event occurrence/nonoccurrence, expressed through the binary predictand V = 1 for an event, and V = 0 in absence of an event, and (ii) a continuous process w representing the precipitation depth and conditional on event occurrence. The predictand W has a predictor Z, which is analogously a mixed (binary–continuous) variate. A forecasted event at a location and at a point in time can thus be represented stochastically by the joint realization (υ, z) of the binary and the continuous predictands. The precipitation depth and its prediction are represented by the joint realization (w, z). A revised posterior distribution on the model input conditional on a forecast is obtained from an assumed prior by means of Bayesian inference as proposed by Krzysztofowicz (2004) and reported herein.
1) Bayesian revision of the binary process
2) Bayesian revision of the continuous process
3) The probabilistic quantitative precipitation forecast
4. Application
The HIRLAM model delivers hourly precipitation depths for each model cell, which we aggregated to 6-hourly cumulative values. The forecasts are carried out every 6 h with base times 0000, 0600, 1200, and 1800 UTC + 1. The measured precipitation, recorded at hourly intervals, is accumulated into 6-hourly cumulative precipitation amounts. The hourly temperature–dewpoint temperature differences observations are averaged over 6-h periods. The length of the total investigation period (NWP forecasts and measurements) ranges from 1 January 2000 to 31 October 2005. The Bayesian revision is performed with the 6-hourly QPFs, whereby lead times of 24 and 48 h are chosen. Since the observed and forecast precipitation variables are mixed (binary–continuous), the prior density functions, as well as the posterior probability and posterior distribution functions, need to be evaluated separately for the binary and continuous processes. The binary variable is assumed 0 in the absence of a precipitation event, and 1 otherwise. The distinction between an event and a nonevent is made on the basis of a minimum precipitation threshold of 1 mm in 6 h.
a. Preliminary analysis of forecasted and measured precipitation
First the 6-h cumulative forecasted and measured precipitation variables Z and W, including the temperature differences T, are grouped into four seasons: winter (1 January–31 March), spring (1 April–30 June), summer (1 July–30 September), and autumn (1 October–31 December). This grouping is necessary because the precipitation process is nonstationary and it is not recommended to carry out a stationary statistical analysis, as precipitation patterns change during the annual cycle. In principle, to properly account for effects of nonstationarity, a data analysis on a monthly basis should be performed for applications in many parts of the world. However, given the climate of the study area, which is typically rather stationary over longer periods of time, we consider only seasonal variations. The separation into seasons yields separate statistics that pay tribute to the particular climatic characteristics of the season. Next, the predictor realizations z produced by the NWP model for the two selected lead times (24 or 48 h) are sorted into four series, conditional on the actual event occurrence υ = 1 or nonoccurrence υ = 0, indicated with z0n and z1n, where n = {24, 48} indexes the 24- or 48-h forecast, respectively.
Figure 1 shows the empirical cumulative distributions conditional on υ = 0 and υ = 1 for winter and summer, including the distribution of the actually measured precipitation w. The left plot (winter) shows five distributions. The red dotted curve represents the measured precipitation. The green (24-h forecast) and blue dotted curve (48-h forecast) represent the distribution of the forecasted precipitation depth for events, at which precipitation has actually been observed (υ = 1). The remaining two curves (crosses) show the probabilities of precipitation depth for those forecasts, in which no actual precipitation has been recorded (υ = 0) and thus represent the lowest skill predictions. The right plot shows the same empirical probability distributions for the summer season. It is evident that the model has a higher skill in the summer season, yielding a probability distribution on precipitation depth conditional on υ = 1 more similar to the one of the observations than for the winter season.
b. Normal quantile transform
The next procedural step is to map the respective variates into the normal space through the application of the normal quantile transform (NQT) (e.g., Kelly and Krzysztofowicz 1994). The application of the NQT yields Gaussian surrogate variates in the normal space. In the normal space, expressions for moments of distributions are obtained analytically and relationships between statistical variables, as necessary for deriving the likelihood, can be fitted via (multi)linear regressions. Parametric expressions for conditional marginal densities such as a likelihood, a priori, and expected or posterior density are obtained. The subsequent transformation back into the original space yields the sought-after parametric expressions for the conditional densities and distributions required for the Bayesian formulation of the uncertainty processors.
The upper plot in Fig. 2a shows the forecasted precipitation z versus the measured precipitation w at Maastricht, during winter for lead times of 24 and 48 h, respectively. Table 1 reports the number of events (υ = 1) and nonevents (υ = 0) for the predictand W and the predictor Z, grouped per seasons. The lower plot presents the normal quantile transforms Q−1[Γ(w)] against Q−1[Δ(z)] of the same data series in the normal space. The data for the 24-h forecast are indicated with dots, while those relative to the 48-h forecast are indicated with diamonds. The linear regressions of the data in the normal space are indicated with a dashed line for the 24-h forecasts and with a dashed–dotted line for the 48-h period. Figure 2b shows the same data for the summer season. It is evident that the correlation between the forecasted and the measured 6-hourly cumulative precipitation is generally poor. The evidently poor correlation between measured and forecasted indicates a low forecasting skill as far as the precipitation depth is concerned. As will become clear further below, this uncertainty on precipitation depth propagates through the IUP into the revised posterior probability through the Bayesian processor.
1) Prior densities
The prior densities need to be derived separately for the binary and the continuous process. For the binary process the prior density reduces to a numerical value, while for the continuous process it is given by an actual density. The necessary procedural steps are given next.
(i) Binary process
The prior probability of precipitation occurrence g in Eq. (10) is derived directly from the historical sample w of precipitation recorded at Maastricht, grouped into the four seasons. Table 2 summarizes the prior probabilities of precipitation occurrence over the chosen historical period 1 January 2000–31 October 2005. At Maastricht the probability of precipitation occurrence is slightly higher in spring and summer than in autumn and winter.
(ii) Continuous process
2) Likelihood
The likelihood functions need also to be established separately for the binary and the continuous process, whereby the binary process leads to a likelihood that is parametric with respect to υ = {0, 1}, while the likelihood function of the continuous process results in family of densities, as shown below. Maranzano and Krzysztofowicz (2004) presented the equations for the binary process [Eq. (10)] based on two cases for the predictor Z (Z = 0 and Z > 0) and performed preliminary verifications. Here we extend the application to the combined binary–continuous process.
(i) Binary process
The cumulative distribution functions Fυn = P(Z < z|Z > 0, V = υ), υ = {0, 1}, n = {24, 48} have the corresponding density functions fυn. The distributions Fυn are derived from historic data for each season and for the two forecasting lead times. Subsequently, a parametric cumulative distribution function is fitted to the data by searching optimal scale and shape parameters. The parametric density functions are obtained from the fitted cumulative distribution functions by derivation. Figure 6a shows the empirical distributions of the forecasted precipitation depths z versus precipitation depth for the two lead times already reproduced in Fig. 1, and the respective Weibull fits, for υ = {0, 1} and n = {24, 48}. Once again this parametric model has proven to give the best fit to the particular set of data. Figure 6b shows the corresponding parametric densities fυn. Also, these densities are monotonically decreasing functions indicating a higher probability concentration for low precipitation depths.
(ii) Continuous process
3) Posterior probability
The posterior revised densities for the binary and the continuous process are derived next. The continuous process requires an inverse transformation from the Gaussian to the original space.
(i) Binary process
Given the family of likelihood functions, and given a prior probability g, the posterior probability Π is given by appropriate substitutions in Eq. (12). Figure 8 shows the probabilities of actual precipitation occurrence during the winter season for a forecasted precipitation depth, given different values of the prior probability g and forecasting lead times 24 and 48 h, respectively. We observe that the probabilities of precipitation occurrence are monotonically increasing for increasing forecasted precipitation in the 24-h forecasts, while the same probabilities are increasing and successively decreasing in the 48-h forecasts. This implies that for 48-h forecasts with high precipitation depths an event occurrence is less likely than for forecasted depths that lie in the medium range. The implications of this behavior of Π on the BPO will be discussed in more depth in section 5.
(ii) Continuous process
4) The probabilistic quantitative precipitation forecast
For the PQPF, the posterior density function ψ given by (19) and the posterior distribution function Ψ given by (18) are obtained by combining Eqs. (42) and (44) with the posterior probability Πz given by (12). Figures 9a and 9b show the posterior conditional density ψ(wn|zn, tn, s) given by the BPO and corresponding distribution functions Ψ for (a) winter and (b) summer, for forecasted values of zn and tn and an observed value of s0 at the begin of the forecast. Equation (19) represents the input uncertainty processor ψ(·|z, t, s) (IUP), which then needs to be integrated with the hydrologic uncertainty processor θ(·|r, h0, u, w) (HUP) to obtain the total predictive uncertainty of the forecast [Eq. (9)].
5. Discussion
The precedent exposure shows the necessary procedural steps for the implementation of an IUP as a Bayesian processor of output for operational use. In this section we discuss some of the most relevant properties of the processor for operational flood forecasting. A series of general properties of the proposed IUP are highlighted first:
1) The processor is based exclusively on QPFs and the measured historical precipitation time series. It does not require output information from the hydrological model. As such, the BPO strictly separates the precipitation uncertainty due to numerical weather prediction from uncertainty attributable to the hydrological component of the forecast.
2) The computations required for operating the processor can be carried out offline. From the statistical analysis of observed and forecasted precipitation, the parameters An, Bn, Cn, Dn, and K2n in Table 5 are fitted for the four seasons. These parameters need to be updated periodically as the historical forecasted and measured series expand in time. In this sense the processor is well suited for an application in an operational context, where as little as possible computational effort should be spent at forecasting time. The execution of the processor consists in evaluating first expressions (12), (42), and (44) for a given forecasted precipitation depth z and the (surrogate) relative air humidities t and s and then the mixed cumulative distributions and densities (18) and (19). The latter two expressions are plotted in Figs. 9a and 9b for selected forecasted precipitation depths and relative air humidity (i.e., temperature–dewpoint temperature differences).
On the present application of the BPO to a chosen set of measured and forecasted data, the following observations can be made:
3) The correlations between the 6-hourly accumulated forecast data for either lead time with the measured precipitation are very poor. To improve these correlations, areal averages of measured precipitation over larger areas need to be compared with corresponding forecasts that are averaged over more adjacent model cells. After the actual data have been transformed into the normal space (Figs. 2a and 2b) a linear regression between the transformed variables yields very low correlation coefficients as evident from Table 4. As a result the informational added value of the QPF with respect to precipitation depth is low, resulting in negligible added value of the Bayesian revision of the prior probability density g(w|s). This fact underscores that the contribution to the uncertainty of a flood forecast that can be attributed to the numerical weather forecast is significant. This uncertainty needs to be quantified systematically when QPFs are used to drive hydrological models in predicting critical discharges at a basin outlet. The BPO provides such a quantification tool.
4) Similarly, the correlation between observed (surrogate) relative air humidity and forecasted precipitation is also low (see Table 4) and would be even lower if forecasted humidity would have been used in estimating the likelihood instead of observations. From this we deduce that the inclusion of additional auxiliary indicators on the state of the atmosphere, such as relative air humidity on the ground or ground air pressure, adds little informational value to the Bayesian revision process of the prior probability g(w|s) because of the inherently limited skill of the model in predicting precipitation depth.
5) Graphs in Figs. 9a and 9b show the mixed posterior distribution function given by (18) and posterior density function given by (19), which are plotted for winter and summer and selected values of the forecasted precipitation depth z, the (pseudo) forecasted humidity t, and the humidity s0 observed at the begin of the forecast. The posterior mix Ψ is significantly influenced by the weighting with the probability Π of precipitation occurrence. The weighting is responsible to a large extent for the shift of the posterior mix Ψ with respect to the prior distribution G(w|s0). The posterior probability Φ and density ϕ of precipitation depth (not shown graphically) resemble in shape the respective priors. This similarity between prior and posterior is explained by the fact that the QPF has significantly more skill in predicting precipitation occurrence/nonoccurrence rather than predicting precipitation depth at a location (when comparing the QPF with one precipitation measurement at a single location). The utilization of an uncertainty processor like the BPO allows us to identify and quantify to what extent the model adds value to a forecast. With decreasing information content of the QPF with respect to precipitation depth and probability of event occurrence, the posterior mix of probabilities collapses onto the prior distributions. The forecaster will not be able to deliver a posterior probability, which is better than what he could have predicted without a new model run, solely on the basis of information on historical model performance expressed by the likelihood function.
6) To effectively assess the validity of the BPO procedure and the benefits of using it in producing PQPFs a sound validation is required. In the present case a series of actual observations in the January 2000–October 2005 period could be compared against probabilities of occurrence produced by the processor. However, given the low skill of the forecasts for the available dataset, we have refrained from performing such validation. Further validation can be done by applying the BPO to a set of forecasts at different locations. Systematic comparisons with other common methods for probability of precipitation determination, for example, the model output statistics (MOS) technique (Glahn and Lowry 1972), need also to be investigated. Such an analysis however is beyond the scope of this paper and is omitted here.
7) The impact of spatial and temporal variability of precipitation on the predictive uncertainty has not been addressed here. The focus is exclusively on a single small-scale plot. We emphasize however that this assumption does not hold if the processor is to be used to assess the predictive uncertainty for larger areas. In this case precipitation may vary strongly in space and in time according to the particular spatial (e.g., local, regional, continental) and temporal (e.g., 6-, 12-, 24-, 48-hourly, or longer periods) scale of observation. The wide range of spatiotemporal scales must be taken into account by identifying focus areas of various size and by aggregating forecasts over periods representing typical problem-specific time scales of interest. These time scales typically discern small basins with response times of a few hours from large basins, whose contraction times lie in the order of days.
For the actual implementation of the full BFS the proposed IUP needs to be integrated with an appropriate hydrologic uncertainty processor. The design and implementation of such a processor is given in Krzysztofowicz and Kelly (2000) (precipitation occurrence-independent model) and Krzysztofowicz and Herr (2001) (precipitation-dependent model). A combination of the BPO as IUP with the HUP through an integrator of uncertainty will be matter of sequel work.
6. Summary and conclusions
An implementation of a modified BPO (previously described in Krzysztofowicz 2004) to process output from a numerical weather prediction model has been presented, in which relative air humidity is used as an ancillary indicator of the state of the atmosphere. The purpose of the processor is to quantify the uncertainties associated with an event for both the event occurrence and the precipitation depth, on the basis of quantitative precipitation forecasts (QPFs) produced by the HIRLAM model. The application is carried out for 6-h QPFs and lead times of 24 and 48 h. The processor can be used as a input uncertainty processor (IUP) for the forcing input into hydrological models. Integration of the IUP with a corresponding hydrologic uncertainty processor yields a Bayesian forecasting system (BFS) for quantification of the total predictive uncertainty in flood prediction.
The precipitation process is represented as a binary–continuous mixture of random variables. The processor is Bayesian in the sense that prior knowledge on the predictive capability of the QPF over a historical period is used to obtain a revised probabilistic quantitative precipitation forecasts expressed in terms of posterior revised statistics.
The processor has been tested with forecasted precipitation series produced by the HIRLAM QPF. The forecasts are compared with precipitation measured at one single station. A preliminary analysis on the measured data is performed to establish prior probabilities of precipitation occurrence and depth for the four seasons. From the analysis of the forecasts it is evident that the skill of the model lies mainly in predicting the event occurrence/nonoccurrence (binary process) rather than in predicting the event depth (continuous process).
The Bayesian processor has the underlying property that poor model performance through noninformative forecasts, such as seen here, can be promptly identified, because the revised probabilistic forecasts show little improvement in terms of information content with respect to prior knowledge. From the results it is evident that the mixed binary–continuous posterior probabilities are mainly influenced by the revised probabilities of the binary process, where the larger forecasting skill of the QPF actually lies. The revised posterior probabilities of the continuous process show limited improvement with respect to the prior information and have thus negligible impact on the posterior mix. This is attributable to the rather low skill of the QPF with respect to precipitation depth. This example shows how the input uncertainty processor formulated as a Bayesian processor of output and shown here is suited as instrument to quantify sources of uncertainty in the forecasting process and emphasizes the utility for employing Bayesian theory in uncertainty quantification.
The present investigation ignores spatial and/or temporal variability of rainfall and is therefore only applicable for small basins, in which these characteristics are negligible. The uncertainty due to spatiotemporal variability of the precipitation fields must be adequately addressed and remains a matter for future investigations.
Acknowledgments
We thank the Dutch Royal Meteorological Institute (KNMI) for having provided the HIRLAM model results and the weather station data records. We would also like to thank three anonymous referees for suggestions that have helped improve the manuscript.
REFERENCES
Beven, K., and Binley A. , 1992: The future of distributed models: Model calibration and uncertainty prediction. Hydrol. Processes, 6 , 279–298.
De Groot, M. H., 1970: Optimal Statistical Decisions. McGraw-Hill, 489 pp.
Glahn, H. R., and Lowry D. A. , 1972: The use of model output statistics (MOS) in objective weather forecasting. J. Appl. Meteor., 11 , 1203–1211.
Herr, H. D., and Krzysztofowicz R. , 2005: Generic probability distribution of rainfall in space: The bivariate model. J. Hydrol., 306 , 234–263.
Kelly, K. S., and Krzysztofowicz R. , 1994: Probability distributions for flood warning systems. Water Resour. Res., 30 , 1145–1152.
Kelly, K. S., and Krzysztofowicz R. , 2000: Precipitation uncertainty processor for probabilistic river stage forecasting. Water Resour. Res., 36 , 2643–2653.
Krzysztofowicz, R., 1999: Bayesian theory of probabilistic forecasting via deterministic hydrologic model. Water Resour. Res., 35 , 2739–2750.
Krzysztofowicz, R., 2001a: Reply. Water Resour. Res., 37 , 441–442.
Krzysztofowicz, R., 2001b: The case for probabilistic forecasting in hydrology. J. Hydrol., 249 , 2–9.
Krzysztofowicz, R., 2004: Bayesian processor of output: A new technique for probabilistic weather forecasting. Preprints, 17th Conf. on Probability and Statistics in the Atmospheric Sciences, Seattle, WA, Amer. Meteor. Soc., 4.2.
Krzysztofowicz, R., and Kelly K. S. , 2000: Hydrologic uncertainty processor for probabilistic river stage forecasting. Water Resour. Res., 36 , 3265–3277.
Krzysztofowicz, R., and Herr H. D. , 2001: Hydrologic uncertainty processor for probabilistic river stage forecasting: Precipitation-dependent model. J. Hydrol., 249 , 46–68.
Lardet, P., and Obled C. , 1994: Real-time flood forecasting using a stochastic rainfall generator. J. Hydrol., 162 , 391–408.
Liu, Z., Martina M. L. V. , and Todini E. , 2005: Flood forecasting using a fully distributed model: Application of the TOPKAPI model to the Upper Xixian Catchment. Hydrol. Earth Syst. Sci., 1 , 9. 69–86.
Maranzano, C. J., and Krzysztofowicz R. , 2004: Bayesian processor of output for probabilistic forecasting of precipitation occurrence. Preprints, 17th Conf. on Probability and Statistics in the Atmospheric Sciences, Seattle, WA, Amer. Meteor. Soc., 4.3.
Schaake, J., and Larson L. , 1998: Ensemble streamflow prediction (ESP): Progress and research needs. Preprints, Special Symp. on Hydrology, Amer. Meteor. Soc., Phoenix, AZ, J19–J24.
Schaake, J., Franz K. , Bradley A. , and Buizza R. , 2006: The Hydrologic Ensemble Prediction Experiment (HEPEX). Hydrol. Earth Syst. Sci. Discuss., 3 , 3321–3332.
Undén, P., and Coauthors, 2002: HIRLAM-5 scientific documentation, December 2002. HIRLAM Tech. Rep., 132 pp.+appendix.
APPENDIX
Acronyms
BFS Bayesian forecasting system
BPO Bayesian processor of weather model output
HIRLAM High-Resolution Limited-Area Model
HUP Hydrologic uncertainty processor
IU Integrator of uncertainty
IUP Input uncertainty processor
PQPF Probabilistic quantitative precipitation forecast
PQFF Probabilistic quantitative flood forecast
QPF Quantitative precipitation forecast
Cumulative probability of precipitation distributions at Maastricht: (a) winter and (b) summer.
Citation: Journal of Hydrometeorology 9, 1; 10.1175/2007JHM858.1
(a) (top) Forecasted vs measured precipitation and (bottom) normal quantile transform, 24- and 48-h forecasting lead time, winter. (b) (top) Forecasted vs measured precipitation and (bottom) normal quantile transform, 24- and 48-h forecasting lead time, summer.
Citation: Journal of Hydrometeorology 9, 1; 10.1175/2007JHM858.1
(a) Empirical and modeled cdf, modeled pdf of w, all seasons. (b) Empirical and modeled cdf, modeled pdf of s, all seasons.
Citation: Journal of Hydrometeorology 9, 1; 10.1175/2007JHM858.1
Prior: homoscedasticity and normality of residuals.
Citation: Journal of Hydrometeorology 9, 1; 10.1175/2007JHM858.1
(a) Scatterplots of observed humidity S vs observed precipitation W and of NQT of S vs NQT of W, winter and spring. (b) Scatterplots of observed humidity S vs observed precipitation W and of NQT of S vs NQT of W, summer and autumn.
Citation: Journal of Hydrometeorology 9, 1; 10.1175/2007JHM858.1
(a) Binary process: parametric cdfs. (b) Binary process: parametric pdfs.
Citation: Journal of Hydrometeorology 9, 1; 10.1175/2007JHM858.1
Likelihood: homoscedasticity and normality of residuals, lead time 24 h.
Citation: Journal of Hydrometeorology 9, 1; 10.1175/2007JHM858.1
Posterior probability of precipitation occurrence for a range of values of the prior probability, 24- and 48-h forecast, winter.
Citation: Journal of Hydrometeorology 9, 1; 10.1175/2007JHM858.1
(a) Prior probability distribution and posterior mix of revised distributions, prior probability density functions, and posterior mix of revised density functions, winter. (b) Prior probability distribution and posterior mix of revised distributions, prior probability density functions, and posterior mix of revised density functions, summer.
Citation: Journal of Hydrometeorology 9, 1; 10.1175/2007JHM858.1
Number of events for 𝗪 and 𝗭, per season, Maastricht.
Prior probabilities of precipitation occurrence per season, Maastricht.
Prior parameters, with d and e the regression constants, μ the std dev of residuals, and R the correlation coefficients.
Likelihood parameters, with an, bn, and cn the regression constants, νn the std dev of residuals, and Rn correlation coefficients.
Coefficients of the posterior density in the normal space.