## 1. Introduction

Improved assimilation of atmospheric sounding data over land from sensors such as the Advanced Microwave Scanning Radiometer for Earth Observing System (AMSR-E) and the Special Sensor Microwave Imager (SSM/I) requires better estimates of the land surface microwave emissivity. Although this problem pertains to all of the land areas of the globe, the particulars are different for snow-covered and snow-free areas owing to substantially different dielectric properties of snow and ice as compared with snow-free areas. Microwave brightness temperature measured by spaceborne sensors over snow-covered areas originates from radiation emitted from the underlying surface, the snowpack, the vegetation, and the atmosphere. In theory, the dielectric constant of frozen water is altered relative to that of water in its liquid form, and the effect of snow on the emissivity can be used in algorithms to estimate snow water equivalence (SWE) from spaceborne emissions, typically at 18.7 and 37 GHz. In practice, though, the emissivity of snow depends not only on SWE but also on snowpack microstructure, especially grain size and temperature.

These complications with SWE retrievals have motivated an alternative approach, which focuses on top of the atmosphere (TOA) emissions in the microwave frequencies and attempts to assimilate satellite radiance observations with model predictions rather than retrieving SWE. This approach is particularly relevant for applications such as the retrieval of atmospheric moisture in which the SWE problem is incidental. However, even where there is a motivation to update land surface variables, such as SWE, the assimilation of brightness temperatures (*T _{b}*), rather than derived the SWE products, requires knowledge of snow physical properties because they affect the (surface) emissivity. National Centers for Environmental Prediction (NCEP) operational models currently use the Community Radiative Transfer Model (CRTM), which predicts TOA microwave and infrared radiances and brightness temperatures. CRTM is generally much more sophisticated in its representation of atmospheric radiative transfer than of the land surface. The land surface emissivity in CRTM is based on snow depth and surface temperature, from which they use an empirical regression for grain size as inputs into the land emission model (Weng et al. 2001). Whether or not snow is present is determined by the output of the (Noah) land scheme snow depth prediction, which is updated daily using (in operations) a direct insertion approach. The observations are the Air Force Weather Agency (AFWA) global snow depth analysis (SNODEP), which is based on the interpolation of daily station reports (K. Mitchell 2007, personal communication).

The approach we propose to develop in this paper is a part of a broader National Oceanic and Atmospheric Administration–National Aeronautics and Space Administration (NOAA–NASA) Joint Center for Satellite Data Assimilation (JCSDA) sponsored project that is intended to incorporate recent improvements in snow emissions modeling into CRTM. The current approach for estimating snow emissivity in CRTM will be compared with one, or a combination of, forward snow emission models (SEMs) coupled with the Noah land surface scheme. Technically, the objective of the present contribution is twofold: i) to test a new methodology for estimating microphysical snow forcings by a land surface scheme and ii) to evaluate the feasibility of coupling the land surface scheme with SEMs in terms of accuracy of *T _{b}* predictions. The first objective is accomplished by improving the existing parameterization of snow physics in the Variable Infiltration Capacity (VIC) model (Liang et al. 1994). The new algorithm accounts for snow metamorphism, compaction from the weight of new snowfall, and an effective internal snowpack compaction. It also calculates the average snow grain size using a crystal growth equation. In the future, this parameterization is intended to be transferred into the operational Noah model to improve CRTM estimates of snow

*T*. To accomplish the second objective, we evaluate the performance of three SEMs coupled with the VIC model using AMSR-E satellite observations of

_{b}*T*(at 18.7, 37, and 89 GHz for both horizontal and vertical polarization) at two sites from the NASA Cold Land Processes Experiment (CLPX) in Colorado during the winter of 2003. The three SEMs are the Microwave Emission Model of Layered Snowpacks (MEMLS) of Mätzler and Wiesmann (1999), a modification of the Land Surface Microwave Emission Model (LSMEM) of Gao et al. (2004), and the Dense Media Radiative Transfer (DMRT) model of Tsang et al. (2000). Additionally, an example of the multimodel

_{b}*T*estimate, based on Bayesian model averaging, is computed and compared to AMSR-E measurements. It demonstrates, using a bootstrap validation procedure, that the multimodel estimate increases the mean prediction accuracy as well as provides a nonparametric estimate of the error distributions.

_{b}As discussed above, the motivation of this paper is assessing the assimilating current satellite microwave brightness temperature (from AMSR-E) into microwave emission snow models (often referred to as forward models). The reader should recognize the multitude of challenges in this assimilation, which include the poorly posed problem of predicting microwave emissions as a result of their sensitivity to snow depth, the freeze–thaw cycle that affects snow grain morphology, the development of layering, and snowpack water. The challenge also includes scaling effects from the low-resolution (25 km) AMSR-E pixel resolution and the spatial heterogeneity in vegetation, topography, and snowfall. Therefore, this paper focuses mainly on the most favorable period within CLPX to best assess our ability to assimilate AMSR-E brightness temperatures.

This paper is organized as follows: section 2 provides a description of the new snow module implemented in VIC. The three SEMs used in this study are discussed in section 3. The mathematical formulation of the Bayesian model averaging is given in section 4. The CLPX 2003 measurements and the models’ setup are described in section 5. The results are discussed in section 6. The paper closes with the summary and a short discussion of future research directions in section 7.

## 2. Variable Infiltration Capacity model

### a. Basic description

VIC is a macroscale hydrology model that solves the energy and water balance over model grid cells that represent the modeled basin domain (Liang et al. 1994). Each grid cell can have multiple soil layers and be partially covered by different vegetation types in a mosaic-type representation, whereas topography is represented by a maximum of five elevation bands. Soil moisture storage capacity is characterized by a spatial probability distribution, although precipitation can also be spatially nonuniform. Baseflow is calculated as a nonlinear function of the lower soil layer moisture. Moisture and energy fluxes are computed separately for each vegetation class and elevation band within each grid cell and then area-weighted and summed over the grid cell, thus allowing the model to account for subgrid variability in topography, land cover, soil moisture, and precipitation. Streamflow is then simulated by routing subsurface and surface runoff using the method of Lohmann et al. (1998). Snow accumulation and ablation processes are simulated using a two-layer energy and mass balance approach (Cherkauer and Lettenmaier 2003). The surface layer is used to model the energy exchanges between the snowpack and the atmosphere while the lower layer acts as a reservoir for the excess snow mass from the surface layer (Wigmosta et al. 1994). Snowfall can be intercepted by an overstory canopy and then released to the ground snowpack through meltwater drip, mass release, or throughfall. The model accounts for melting/refreezing water within each layer, with water percolation being simulated based on a preset liquid water holding capacity for each layer.

### b. Modification of snow module

^{3}for newly fallen snow and accounted for densification by compacting the snowpack with the weight of new snowfall. Rather than a constant value, we incorporated the algorithm by Hedstrom and Pomeroy (1998) that calculates the density of fresh snow based on air temperature. We also modified the snow densification algorithm by calculating a compaction rate according to Jordan (1991). This model accounts for settling as a result of snow metamorphism, compaction from the weight of new snowfall, and an effective internal snowpack compaction. In addition, a snow crystal growth algorithm was added to the model based on the one used by the snow thermal model SNTHERM (see Jordan 1991). This algorithm is based on the equation that describes growth by sintering in metals and ceramics:

*d*(mm) is the average grain size,

*T*(K) is the temperature, and

*a*,

*b*are adjustable parameters. For dry snow in SNTHERM, we considered using a simple function of the form

*T*(K) is snow temperature,

_{s}*P*(hPa) represents the atmospheric pressure,

_{a}*z*(m) is snow depth, and

*g*

_{1}(m

^{4}kg

^{−1}) is an adjustable parameter (here we used the value of 7 × 10

^{−7}). The value of effective diffusion coefficient for water vapor in snow

*D*

_{eos}at 1000 hPa and 0°C is 0.92 × 10

^{−4}(m

^{2}s

^{−1}), and the value of variation of saturation vapor pressure with temperature relative to phase

*C*(kg m

_{kT}^{−2}K

^{−1}) can be calculated by Eq. (20) in Jordan (1991). Note that

*d*in (2) is expressed in meters. When implementing (2) in VIC, we approximated the absolute vertical thermal gradient ∂

*T*/∂

_{s}*z*by |

*T*−

_{s}*T*|/Δ

_{g}*z*, where

*T*(K) is ground temperature. For wet snow when liquid water content

_{g}*θ*within the snowpack exceeds a threshold (0.0001), grain growth rate increases and, as proposed by Jordan (1991), is modeled using the similar growth function to that for dry snow:

_{l}*θ*is set to a maximum of 0.09 if it exceeds that value, and adjustable parameter

_{l}*g*

_{2}is taken as 4 × 10

^{−12}(m

^{2}s

^{−1}). The grain size estimated by the model is a depth-weighted average of the grain size of newly fallen snow (taken as 0.1 mm), and the grain size of the existing snowpack is obtained from (2) or (3).

## 3. The snow emission models

In this section, we provide a brief description of the three SEMs used in this study. For the sake of clarity, the key aspects of these models are listed in Table 1. Before characterizing each individual model, we introduce the inputs required by the models. The inputs are nearly the same for all models and include snow depth, snow density, snowpack temperature, ground temperature, and surface roughness of the air/snow boundary, except that the LSMEM and DMRT models require the “mean grain size” of snow particles as an input, whereas the MEMLS model requires the “correlation length.” The mean grain size is taken as the radius of spherical ice particles approximating the ice grains in snow. The correlation length is related to snow grain size, shape, and volumetric distribution of snow grains (e.g., Jin 1993). However, this relationship is not straightforward (see, e.g., Pulliainen et al. 1999). To derive a value for the correlation length from the mean grain size, we multiplied the value of the mean grain size by the factor of 0.15 (C. Mätzler 2006, personal communication). This was done so comparisons can be made with results for models using either the mean grain size or a correlation length as inputs.

### a. The Land Surface Microwave Emission Model

The LSMEM is a radiative transfer model that can predict the brightness temperature of a surface that can be partially covered with snow and/or vegetation using four different modules that account for emission from vegetation, bare soil, snow, and atmospheric effects (Gao et al. 2004). The snow emission model is based on the Helsinki University of Technology (HUT) model (Pulliainen et al. 1999) that treats the snowpack as a single homogeneous layer. The model describes the emission contribution of a snowpack as a function of snow depth, snow density, snow grain size, snow temperature, temperature at the snow–ground interface, frequency, and incidence angle. In addition to the upward emitted radiation, the model takes into account the contribution emitted downward and reflected upward from the snow/soil boundary, the emission contribution from underlying soil, and the atmospheric radiation reflected from the snow cover. The model also considers the multiple reflections caused by snow/soil and air/snow boundaries. The basic assumption in the HUT snow emission model is that scattering is mostly concentrated in the forward direction. The snow extinction coefficient is calculated from a modified empirical relationship and is a function of grain diameter and frequency (Hallikainen et al. 1987). The dielectric constants of ice and snow (permittivities) can be calculated using different optional models—here we used the model by Hallikainen et al. (1986).

### b. The Dense Media Radiative Transfer model

The DMRT model describes the propagating and scattering of particles in a dense medium, which allows the particles to occupy a fractional volume larger than 10%. In a nontenuous electrically dense medium, the dielectric properties of the particles are significantly different from those of the background medium, and the assumption of independent scattering is no longer valid because there is more than one scatterer within a wavelength distance. Under these conditions, the classical radiative transfer (CRT) theory is not valid. Several methods have been used to derive the DMRT equations including the effective field approximation (EFA), also called Foldy’s approximation (Tsang and Kong 1981); the quasi-crystalline approximation (QCA; Tsang and Kong 1981; Jin 1993); and the QCA with sticky particles (Tsang et al. 2000). In this study, we use the equations of the DMRT derived under the QCA approximation for moderate-size particles (Tsang et al. 2000). The Percus–Yevick equation is used to describe the pair distribution function (Tsang and Kong 1981). The effective propagation constant is computed on the basis of the generalized Lorentz–Lorenz law and the generalized Ewald–Oseen extinction theorem (e.g., Tsang et al. 2000). The extinction coefficient is then calculated from the imaginary part of the effective propagation constant. The formula used for ice permittivity is the same as the one used in the HUT model. The scattering and absorption coefficients are derived from, and the equations of the radiative transfer theory are solved by, the Gaussian quadrature method and the eigenvalues and eigenvectors technique (e.g., Jin 1993). The brightness temperatures are obtained by considering the boundary conditions, which provide the weights of the elements of the base of eigenvectors.

### c. The Microwave Emission Model of Layered Snowpack model

In MEMLS (Mätzler and Wiesmann 1999; Wiesmann and Mätzler 1999), the snow cover is thought to be a stack of horizontal layers. Each layer is characterized by a thickness, a correlation length, its density, liquid water content, and temperature. The layer interfaces are assumed to be planar. The sandwich model, based on multiple scattering radiative transfer, is used (Wiesmann et al. 1998) to combine internal scattering and reflections at the interfaces. Internal volume scattering is accounted for by a two-flux model (up- and downwelling streams) derived from a six-flux approach (fluxes in all spatial directions). The absorption and scattering coefficients are functions of the six-flux parameters. The absorption coefficient can be obtained by density, frequency, and temperature, and the scattering coefficient depends on the correlation length, density, and frequency. The MEMLS model is based on the studies carried out by Wiesmann et al. (1998). The measurements of these authors lead to the empirical approach to determine the scattering coefficient of snow in the frequency range of 5–100 GHz and a correlation length range of 0.01–0.3 mm.

## 4. Bayesian model averaging

Bayesian model averaging (BMA) is a statistical method that infers from an ensemble of competing predictions the probabilistic prediction that possesses more skill and reliability than the original ensemble members (Raftery et al. 2003). BMA has been principally used in generalized linear regression applications. Recently (Raftery et al. 2003, 2005), it has been successfully applied to numerical weather prediction. In this study, we apply BMA to construct multimodel brightness temperature predictions using individual predictions from the three SEMs described in section 3. The BMA scheme is briefly described below.

### a. Problem statement

*y*be a scalar quantity to be forecast, and let

*f*be the forecast of

_{k}*y*produced by model

*k*. The forecast

*f*is then characterized by a conditional probability density function (pdf),

_{k}*g*(

_{k}*y|f*), which can be interpreted as the pdf of

_{k}*y*conditional on

*f*, given that

_{k}*f*is the best forecast in the ensemble. The BMA predictive pdf for the

_{k}*k*-member ensemble of forecasts can be written as the following mixture model (see Hoeting et al. 1999; Raftery et al. 2003, 2005):

*w*is the weight such that ∀

_{k}_{k}

*w*

_{k}≥ 0 and Σ

^{K}

_{k=1}

*w*

_{k}= 1. When predicting the brightness temperature, it is reasonable to assume that

*g*

_{k}(

*y*|

*f*

_{k}) ∼

*μ*

_{k},

*σ*

^{2}

_{k}). Note that although the latter conditionals are Gaussian, (4) is a nonparametric Gaussian mixture model (McLachlan and Peel 2000). The model is fitted to data sample

*χ*= {

*y*

_{t},

*f*

_{1,t},

*f*

_{2,t}, . . . ,

*f*

_{K,t}}

^{t=N}

_{t=1}, where the subscript

*t*denotes time,

*f*,

_{k}*is referred to as the*

_{t}*k*th forecast in the ensemble for time

*t*, and

*y*represents the corresponding verification. An estimate of the parameter vector

_{t}**= [**

*θ**w*

_{1}, . . . ,

*w*

_{k},

*μ*

_{1}, . . . ,

*μ*

_{k},

*σ*

_{1}, . . . ,

*σ*

_{k}]

^{T}is obtained by maximizing the log-likelihood function

### b. Solution by the EM algorithm

*θ*_{0}, for the parameter vector

**. In the E step, the posterior probabilities**

*θ**ẑ*

_{k,t}are estimated given the current guess for the parameters. For the BMA in (4), the E step is

*j*refers to the

*j*th iteration of the EM algorithm, and

*g*[

*y*

_{t}|

*μ*

^{(j−1)}

_{k,t},

*σ*

^{(j−1)}

_{k}] is a normal density with mean

*μ*

^{(j−1)}

_{k,t}and standard deviation

*σ*

^{(j−1)}

_{k}evaluated at

*y*. The mean is modeled as a function,

_{t}*ϕ*, that depends on

*f*,

_{k}*and a set of parameters,*

_{t}

*β**. We choose a local model that has linear parameters:*

_{k}*ϕ*, the form of the local model. In the absence of any prior information, a local linear model is a good choice [so

*ϕ*

_{1}(

*f*,

_{k}*) = 1 and*

_{t}*ϕ*

_{2}(

*f*,

_{k}*) =*

_{t}*f*,

_{k}*]. For simplicity of further notation, we define the posterior-weighted average of a quantity*

_{t}*x*as

*w*,

_{k}*μ*, and

_{k}*σ*using the current estimates of

_{k}*z*,

_{k}*as weights. Thus,*

_{t}*χ*is scarce, some adjustments are needed to keep the number of parameters entering EM, that is dim(

**), low compared to the sample size. A simplification that we incorporated here is that instead of estimating the coefficients**

*θ*

*β**iteratively using (13), we fixed their values prior to EM by performing linear regression of*

_{k}*y*onto each

_{t}*f*,

_{k}*, as in Raftery et al. (2005).*

_{t}### c. BMA predictive mean and variance

## 5. Data description and model identification

Meteorological measurements were recorded in 10-min intervals at 10 sites within the CLPX small regional study area (SRSA) located in north-central Colorado (39.5°–41°N, 105°–107.5°W) between 20 September 2002 and 1 October 2003. In this study, we used data records from two meteorological towers within the Fraser mesocell study area (MSA), Fraser Alpine (FA) and Fraser Experimental Forest headquarters (FHQ), from the period of 1 February to 31 May 2003. The dataset included air temperature, atmospheric pressure, relative humidity, wind speed and direction, and shortwave and longwave radiation as well as snow depth [see Feng et al. (2008) for a detailed description of these data]. The observed 10-min values were averaged to 1-h intervals to conform to VIC input requirements. It is important to stress that precipitation was not directly measured at the two CLPX sites. Therefore, for both FA and FHQ sites, we used hourly precipitation data based on 1/8th degree (∼12 km) hourly merged gauge–radar precipitation product available from the North American Land Data Assimilation System (NLDAS). Driven by the above mentioned meteorological forcings, VIC was integrated with the time step of 1 h at both FA and FHQ sites. The three SEMs described in section 3 were then forced with the ground temperature, snow temperature, depth, density, and grain size from VIC. The estimates of *T _{b}* were obtained for passive microwave frequencies at 18.7, 37, and 89 GHz for both horizontal (H) and vertical (V) polarization except for DMRT, which currently cannot handle a 89-GHz channel. Further, the

*T*simulations were restricted only to dry snow conditions. This was because the retrieval of SWE is not possible when liquid water content in snow increases as a consequence of melting. The penetration depth at microwave frequencies when snow is wet is of the order of a few centimeters as a consequence of the absorption coefficient, which limits our capability of deriving SWE/SD (where SD is snow depth) in wet snow conditions from microwave data in general. Moreover, LSMEM and DMRT versions used in this study can only estimate

_{b}*T*for dry snow. Next, the estimated

_{b}*T*was compared to the

_{b}*Aqua*AMSR-E satellite data at the incidence angle of 55°. These data were gridded to the geographic (latitude–longitude) grids of the CLPX 2003 large regional study area (LRSA) and interpolated from swath space using inverse distance squared resampling (Brodzik 2003). The grid resolution was approximately 1/4° (∼25 km), so both FHQ and FA sites are located within the same AMSR-E pixel.

## 6. Results

### a. Comparison of snow depth

Before estimating *T _{b}*, we compared VIC predictions of snow depth

*z*(in cm) with the available in situ data. The results are shown in Fig. 1. At the FHQ site, the VIC predictions consistently capture the snow accumulation. However, peaks of snow deposition events are oversmoothed and underestimated compared to the measured snow depth. This is mainly a result of the scale mismatch; VIC was forced with a combination of in situ forcings and the NLDAS precipitation, so the estimated snow depth is at the mix of scales as opposed to the point scale in situ measurements. As a result, the timing of early summer snowmelt at the end of May 2003 is delayed. At the FA site, the spatial variability of snow depth is determined by both snowdrift and snowmelt (see Feng et al. 2008). The former process is not accounted for in the current version of VIC. This, together with the scale mismatch, causes a significant overestimation of snow depth. In addition, Fig. 1 shows the fluctuations in VIC-predicted average grain size

*d*(in mm). Because hourly time series of measured grain size are not available for the considered study period, the grain size predictions can only be evaluated qualitatively.

^{1}Especially for the FHQ site, the crystal growth equation (see section 2b) implemented in VIC consistently describes two processes that govern the snowpack dynamics: i) metamorphism (snow grain coalescence reducing the voids in between, causing grain size to increase) and ii) compaction (reduction in average grain size as a result of new snowfall). Metamorphism is shown in the upper panel of Fig. 1 during several snowmelt episodes, and compaction is shown during several snow accumulation episodes.

### b. Comparison of brightness temperatures

*f*represents fractional tree cover,

*T*

_{b}_{,tree}is the brightness temperature of trees and

*T*

_{b,}_{model}is raw estimate of brightness temperature from a particular SEM. Here, Fraser area

*f*= 0.53, and

*T*

_{b,}_{tree}was taken as the mean of the distribution of brightness temperatures for trees (see Tedesco et al. 2005 for details), that is, 268 K at 18.7 GHz and 271 K at 37 GHz. At 89 GHz, the value of

*T*

_{b}_{,tree}was assumed to be 272 K. Figure 2 shows

*T*estimates for FHQ site together with the corresponding AMSR-E measurements (ascending and descending overflights are grouped together).

_{b}It is clear that for 18.7 and 37 GHz, all three SEMs overestimate AMSR-E brightness temperature in the beginning of the study season [1 February–13 March 2003; time index 0–1000 (hour)]. Although there are many factors that can contribute to overestimation, one factor is that the value of *T _{b,}*

_{tree}might be too high, which could occur if the trees were snow covered. Another problem, especially in the case of 37 GHz

*T*, is that the grain size estimates by VIC might be too low. On the other hand, the overestimation is not present at 89 GHz (both horizontal and vertical). This can be explained considering that, in general, values of

_{b}*T*at 89 GHz are strongly influenced by the diurnal freeze–thaw cycles at the surface of snowpack. The surface is usually cooler and dryer than deeper layers of snowpack. Note that in Fig. 2, the variability in both estimated and measured

_{b}*T*decreases with the decrease of the frequency. Additionally, 37- and 89-GHz channels are generally more sensitive to the attenuating influence of the snowpack than the 18.7-GHz channel, which is influenced by the soil underlying the snowpack, especially for thin snowpacks. For these two channels, LSMEM estimates of

_{b}*T*exhibit large fluctuations compared to the other two models. This can be attributed to the sensitivity of LSMEM predictions of

_{b}*T*to the variability in the average temperature of the snowpack

_{b}*T*

_{snow}(cf. the plot of VIC predictions of

*T*

_{snow}at FHQ site in Fig. 3 with LSMEM

*T*predictions at 37 and 89 GHz in Fig. 2). Another important aspect in Fig. 2 is that the AMSR-E data have a pronounced early summer snowmelt signature (increasing trend in measured

_{b}*T*). This snowmelt event is not adequately reproduced by SEMs. The reason for that is twofold. First, we restricted our simulations to dry snow conditions, and the brightness temperature of dry snow is much lower than that of the wet snow. Second, as mentioned in section 6a, the VIC prediction of the timing of the snowmelt was delayed. For the FA site (Fig. 2), it is evident that the results are worse than for the FHQ site, especially at 37 and 89 GHz (both horizontal and vertical). In the second half of the study season, there is a U-shaped feature in the predicted

_{b}*T*time series, which is obviously caused by the rise and rapid decrease in grain size, as depicted in the lower panel of Fig. 1. This grain size pattern may be attributed to the overestimated snow depth at FA (see section 6a). To help study the polarization effects, Fig. 4 shows the AMSR-E and model-predicted polarization differences (horizontal minus vertical) for 18.7, 37, and 89 GHz at the FHQ (left) and FA (right) sites. The models provide larger polarization differences than those observed by the AMSR-E sensor for two reasons. First, in this study a single-layer approximation to model snowpack was used (multiple layers tend to reduce the polarization differences). Second, other factors in the scenes observed by the sensor, such as vegetation, might also decrease the magnitude of polarization differences. Another interesting effect in Fig. 4 is that LSMEM-predicted differences are larger than what the other models predict, probably because LSMEM assumes that 96% of scattering occurs in the forward direction. Nevertheless, the polarization difference magnitudes appear to be relatively similar for all three models with the exception of DMRT, which shows a large increase at the FHQ site during the snowmelt event in late spring.

_{b}### c. BMA and multimodel brightness temperature prediction

The differences between observed and modeled snow *T _{b}* can be due to the following: i) the misrepresentation of snow properties (especially gain size) by the models at spatial scales is equivalent to the AMSR-E surface footprint size, ii) the AMSR-E

*T*is actually a combination of

_{b}*T*contributions from the snow, ground, vegetation cover, and atmosphere, and iii) instrumental and input errors. Individual SEMs capture only some of these uncertainties and then probably only partially, depending on the frequency channel used and the adequacy of parameterization of snow emission physics. Accordingly, by trying to select the “best” SEM out the multimodel ensemble, some sources of uncertainty captured by the remaining SEMs might be ignored, thus underestimating the total uncertainty in

_{b}*T*estimates. To tackle this problem, we constructed multimodel AMSR-E

_{b}*T*predictions using the BMA technique described in section 4. Given the difficulties at FA described in the previous section, the BMA analysis is performed only for the FHQ site.

_{b}#### 1) Bootstrap validation for BMA

*B*= 30 independent replicates of the training data (145 points, which is 75% of the total sample size) and the testing data (48 points, which is 25% of the total sample size). Then, we refitted the BMA scheme to each of the training replicates and examined the behavior of the fits using corresponding replicates of the testing data. If

*y**

^{b}

_{val},

*ŷ**

^{b}

_{val}}

^{Nval}

_{tval=1}is the bivariate set of estimates of the BMA predictive mean in (16) and associated AMSR-E verifications in the

*b*th bootstrap testing set, then the estimate of the average bootstrap prediction error for the BMA scheme is

*N*

_{val}denotes the number of points in the

*b*th testing set, and ɛ(·) is the loss function for measuring errors. A typical choice of the loss function is the root mean squared error (RMSE),

#### 2) BMA results

To initialize the EM fitting procedure, for each frequency channel and for each polarization, the variances of the conditional Gaussian pdfs on the right-hand side of (4) were taken as variances of individual SEMs *T _{b}* estimates, respectively. Figure 5 shows an example of the fitted conditionals and predictive BMA pdf of 37-GHz (vertical)

*T*at FHQ site at 0800 UTC on 20 March 2003. Note that these estimates were obtained by running the EM on one of the replicates of the bootstrap training set. There was a disagreement among the raw ensemble member predictions: two of them (LSMEM and MEMLS) were around 260 K, whereas the other one (DMRT) was around 270 K. This difference of 10 K is quite large. After estimating the ensemble member conditional means (hereafter the bias-corrected

_{b}*T*estimates), the raw SEMs predictions were shifted toward verifying the AMSR-E observation (cf. raw SEMs predictions with the means of the Gaussians in Fig. 5). The latter turned out to be outside the raw ensemble range. The resulting BMA predictive mean slightly overestimates AMSR-E

_{b}*T*. However, it is much more accurate than the raw SEMs estimates. Because each MEMLS and DMRT bias-corrected

_{b}*T*underestimates the verifying observation and LSMEM

_{b}*T*overestimates it, the BMA predictive pdf is positively skewed. The values of the weights of the model-conditional components of this pdf were 0.6 for LSMEM, 0.36 for DMRT, and 0.04 for MEMLS. The small weight for the MEMLS component is due to the high correlation (0.87) between LSMEM and MEMLS

_{b}*T*predictions. This implies that if the LSMEM prediction is known, the additional information from the MEMLS prediction is much less than it would be if the two predictions were uncorrelated. Figure 6 shows the time evolution of one bootstrap realization of BMA predictive pdf, its mean, and the standard deviation envelopes at the FHQ site for all analyzed frequencies and polarizations. It is clear that the variability of the AMSR-E data is well described by the BMA standard deviation envelopes. Moreover, the BMA predictive mean approximates AMSR-E

_{b}*T*better than raw SEMs predictions in Fig. 2. This is confirmed in Table 2, which compares the average bootstrap values of

_{b}## 7. Summary and outlook

In this paper, brightness temperatures of snow at 18.7 and 37 GHz were simulated by three SEMs (LSMEM, MEMLS, and DMRT) and at 89 GHz by two SEMs (LSMEM and MEMLS) coupled with the VIC land surface scheme using data from the FHQ and FA sites collected during the CLPX 2003 experiment. The objective of this study was to assess the feasibility of the coupling in terms of the quality of *T _{b}* predictions. Because we simulated hourly time series of

*T*for the period of February–May 2003, the natural way to validate our simulations was to compare them to satellite AMSR-E measurements. All the analyzed SEMs had common inputs predicted by VIC. The latter was driven by a combination of in situ and NLDAS forcings, which introduced a scale discrepancy between the SEMs predictions and the AMSR-E observations. In the beginning of the study season at 18.7 and 37 GHz, all the models overestimated the AMSR-E

_{b}*T*at the FHQ site. This may have been caused either by trees covered by snow in the AMSR-E pixel or by an underestimation of the average snow grain size. Also, the signature of early summer snowmelt event in the AMSR-E measurements was not well captured by the SEMs at all analyzed frequencies. The reason for that is because in our investigation, we only considered dry snow conditions. Thus,

_{b}*T*was underestimated compared to the

_{b}*T*signature of wet snow that would be measured by AMSR-E. In general, except for the 89-GHz channel, the variability range of AMSR-E measurements of

_{b}*T*was not well captured by SEMs. In particular, MEMLs and DMRT produced oversmoothed

_{b}*T*estimates. LSMEM predictions, on the other hand, turned out to be sensitive to the average snow temperature fluctuations. This was particularly pronounced at 37 and 89 GHz. The penetration depth of the latter frequency channel is small, so effective fluctuations in snowpack surface temperature govern the dynamics of

_{b}*T*Our simulations were less successful at the FA site because the spatial variability of snow depth was highly influenced by snowdrift. This phenomenon is not accounted for in the current version of VIC, so the simulation of the microphysical snow properties at FA was of poor quality. As a result, the simulated

_{b}.*T*exhibited some artifacts as explained in section 6a.

_{b}Apart from considering the individual SEMs predictions, we proposed a new multimodel procedure for estimating the AMSR-E brightness temperature of snow based on an ensemble of SEMs (LSMEM, MEMLS, and DMRT) coupled with the VIC land surface scheme. The procedure is based on BMA and offers not only an adequate nonparametric description of the *T _{b}* predictive pdf, but it also improved the

*T*prediction accuracy compared to individual SEMs results. This is because, to some extent, BMA reduces the effects of scaling, model error sources, atmospheric contribution to radiative transfer, and sparseness of AMSR-E measurements—problems that are notorious when comparing satellite snow

_{b}*T*with SEMs predictions. Using a simple bootstrap validation procedure, we have shown that the BMA predictive mean outperformed the predictions of individual SEMs in terms of

_{b}*T*compared to SEMs predictions, our preliminary results have shown that the errors of BMA were still significant.

_{b}There are a number of issues that need further investigation to improve the BMA *T _{b}* estimates to make them useful in hydrometeorological practice. i) Additional research is needed to better understand the influence of atmospheric moisture profiles on

*T*estimates from SEMs. An attractive option here would be to implement the ensemble of SEMs as a microwave surface module of CRTM and run it in the coupled mode with atmospheric absorption and atmospheric scatter modules. ii) It is critical to further understand the influence of the mean grain size (or correlation length in the case of MEMLS) on the SEMs performance. Because the measurements of this parameter are not usually collected systematically, perhaps a better option would be to compute an “effective” grain size by simply calibrating this parameter to produce

_{b}*T*estimates that match AMSR-E

_{b}*T*as closely as possible. Note that this effective value would automatically account for any potential scaling mismatch. Some preliminary results of the parameter calibration approach for LSMEM—applied for soil moisture retrievals though, not for snow

_{b}*T*estimation—are given in Pan et al. (2006). And, (iii) the final issue concerns the use of BMA predictive pdfs in data assimilation. These pdfs could, for example, be applied to construct the nonparametric likelihood operators in particle filters, offering enhanced quality of the updates of variables that describe snowpack evolution in land surface models.

_{b}## Acknowledgments

This research was possible through support from the NOAA Joint Center for Satellite Data Assimilation (resulting in the Development of Improved Forward Models for the Retrieval of Snow Properties using EOS-era Satellites proposal) to Princeton University (Agreement NA04NES4400002) and to the University of Washington (Agreement NA04NES4400003). This support is gratefully acknowledged.

## REFERENCES

Andreadis, K., Liang D. , Tsang L. , Lettenmeier D. , and Josberger E. , 2008: Characterization of errors in a coupled snow hydrology–microwave emission model.

,*J. Hydrometeor.***9****,**149–164.Brodzik, M. J., 2003: CLPX-Satellite: AMSR-E brightness temperature grids, National Snow and Ice Data Center, Boulder, Colorado, digital media. [Available online at http://nsidc.org/data/docs/daac/nsidc0145_clpx_amsre/.].

Cherkauer, K. A., and Lettenmaier D. P. , 2003: Simulation of spatial variability in snow and frozen soil.

,*J. Geophys. Res.***108****.**8858, doi:10.1029/2003JD003575.Dempster, A., Laird N. , and Rubin D. , 1977: Maximum likelihood from incomplete data via the EM algorithm.

,*J. Roy. Stat. Soc., Ser. B***39****,**1–38.Feng, X., Sahoo A. , Arsenault K. , Houser P. , Luo Y. , and Troy T. J. , 2008: The impact of snow model complexity at three CLPX sites.

,*J. Hydrometeor.***9****,**1464–1481.Gao, H., Wood E. , Drusch M. , Crow W. , and Jackson T. , 2004: Using a microwave emission model to estimate soil moisture from ESTAR observations during SGP99.

,*J. Hydrometeor.***5****,**49–63.Hallikainen, M., Ulaby F. , and Abdelrazik M. , 1986: Dielectric properties of snow in the 3 to 37 GHz range.

,*IEEE Trans. Antennas Propag.***34****,**1329–1340.Hallikainen, M., Ulaby F. , and Deventer T. , 1987: Extinction behavior of dry snow in the 18- to 90-GHz range.

,*IEEE Trans. Geosci. Remote Sens.***GE-25****,**737–745.Hedstrom, N. R., and Pomeroy J. W. , 1998: Measurements and modelling of snow interception in the boreal forest.

,*Hydrol. Processes***12****,**1611–1625.Hoeting, J., Madigan D. , Raftery A. E. , and Volinsky C. T. , 1999: Bayesian model averaging: A tutorial.

,*Stat. Sci.***14****,**382–401.Jin, Y-Q., 1993:

*Electromagnetic Scattering Modelling for Quantitative Remote Sensing*. World Scientific, 333 pp.Jordan, R., 1991: A one-dimensional temperature model for a snow cover: Technical documentation for SNTHERM.89. Special Report 91-16, Cold Regions Research and Engineering Laboratory, U.S. Army Corps of Engineers, 61 pp.

Liang, X., Lettenmaier D. , Wood E. , and Burges S. , 1994: A simple hydrologically based model of land surface water and energy fluxes for general circulation models.

,*J. Geophys. Res.***99****,**14415–14428.Lohmann, D., Raschke E. , Nijssen B. , and Lettenmaier D. , 1998: Regional scale hydrology: II. Application of the VIC-2L model to the Weser River, Germany.

,*Hydrol. Sci. J.***43****,**143–158.Mätzler, C., and Wiesmann A. , 1999: Extension of the microwave emission model of layered snowpacks to coarse-grained snow.

,*Remote Sens. Environ.***70****,**307–316.McLachlan, G., and Krishnan T. , 1997:

*The EM Algorithm and Extensions*. Wiley, 274 pp.McLachlan, G., and Peel D. A. , 2000:

*Finite Mixture Models*. Wiley, 419 pp.Pan, M., Ferguson C. , Crow W. , and Wood E. , 2006: Using data assimilation techniques to calibrate soil moisture retrievals: Conference presentation.

,*Eos, Trans. Amer. Geophys. Union,***87**(Fall Meet. Suppl.), Abstract H21I-03.Pulliainen, J., Grandell J. , and Hallikainen M. , 1999: HUT snow emission model and its applicability to snow water equivalent retrieval.

,*IEEE Trans. Geosci. Remote Sens.***37****,**1378–1390.Raftery, A. E., Balabdaoui F. , Gneiting T. , and Polakowski M. , 2003: Using Bayesian model averaging to calibrate forecast ensembles. Department of Statistics Tech. Rep. 440, University of Washington, 32 pp.

Raftery, A. E., Gneiting T. , Balabdaoui F. , and Polakowski M. , 2005: Using Bayesian model averaging to calibrate forecast ensembles.

,*Mon. Wea. Rev.***133****,**1155–1174.Tedesco, M., Kim E. J. , Gasiewski A. , Klein M. , and Stankov B. , 2005: Analysis of multiscale radiometric data collected during the Cold Land Processes Experiment-1 (CLPX-1).

,*Geophys. Res. Lett.***32****.**L18501, doi:10.1029/2005GL023006.Tsang, L., and Kong J. A. , 1981: Scattering of electromagnetic waves from random media with strong permittivity fluctuations.

,*Radio Sci.***3****,**303–320.Tsang, L., Chen C. , Chang A. , Guo J. , and Ding K. , 2000: Dense media radiative transfer theory based on quasi-crystalline approximation with applications to microwave remote sensing of snow.

,*Radio Sci.***35****,**731–749.Weng, F., Yan B. , and Grody N. , 2001: A microwave land emissivity model.

,*J. Geophys. Res.***106****,**20115–20123.Wiesmann, A., and Mätzler C. , 1999: Microwave emission model of layered snowpacks.

,*Remote Sens. Environ.***70****,**307–316.Wiesmann, A., Mätzler C. , and Weise T. , 1998: Radiometric and structural measurements of snow samples.

,*Radio Sci.***33****,**273–289.Wigmosta, M., Vail L. , and Lettenmaier D. , 1994: A distributed hydrology-vegetation model for complex terrain.

,*Water Resour. Res.***30****,**1665–1679.

AMSR-E measured vs simulated brightness temperatures at 18.7, 37, and 89 GHz (V) obtained with LSMEM, MEMLS, and DMRT models using VIC-predicted forcing data at FHQ and FA sites (February–May 2003).

Citation: Journal of Hydrometeorology 9, 6; 10.1175/2008JHM909.1

AMSR-E measured vs simulated brightness temperatures at 18.7, 37, and 89 GHz (V) obtained with LSMEM, MEMLS, and DMRT models using VIC-predicted forcing data at FHQ and FA sites (February–May 2003).

Citation: Journal of Hydrometeorology 9, 6; 10.1175/2008JHM909.1

AMSR-E measured vs simulated brightness temperatures at 18.7, 37, and 89 GHz (V) obtained with LSMEM, MEMLS, and DMRT models using VIC-predicted forcing data at FHQ and FA sites (February–May 2003).

Citation: Journal of Hydrometeorology 9, 6; 10.1175/2008JHM909.1

VIC predictions of average temperature of snowpack at FHQ (black line) and FA (gray line) sites (February–May 2003).

Citation: Journal of Hydrometeorology 9, 6; 10.1175/2008JHM909.1

VIC predictions of average temperature of snowpack at FHQ (black line) and FA (gray line) sites (February–May 2003).

Citation: Journal of Hydrometeorology 9, 6; 10.1175/2008JHM909.1

VIC predictions of average temperature of snowpack at FHQ (black line) and FA (gray line) sites (February–May 2003).

Citation: Journal of Hydrometeorology 9, 6; 10.1175/2008JHM909.1

AMSR-E measured vs simulated polarization differences in brightness temperature (horizontal minus vertical) at 18.7, 37, and 89 GHz obtained with LSMEM, MEMLS, and DMRT models using VIC-predicted forcing data at FHQ and FA sites (February–May 2003).

Citation: Journal of Hydrometeorology 9, 6; 10.1175/2008JHM909.1

AMSR-E measured vs simulated polarization differences in brightness temperature (horizontal minus vertical) at 18.7, 37, and 89 GHz obtained with LSMEM, MEMLS, and DMRT models using VIC-predicted forcing data at FHQ and FA sites (February–May 2003).

Citation: Journal of Hydrometeorology 9, 6; 10.1175/2008JHM909.1

AMSR-E measured vs simulated polarization differences in brightness temperature (horizontal minus vertical) at 18.7, 37, and 89 GHz obtained with LSMEM, MEMLS, and DMRT models using VIC-predicted forcing data at FHQ and FA sites (February–May 2003).

Citation: Journal of Hydrometeorology 9, 6; 10.1175/2008JHM909.1

An example of BMA predictive pdf (thick curve) and its three conditional components (thin curves) for the 37 GHz (V) brightness temperature prediction at FHQ site at 0900 UTC 20 Mar 2003. Also shown are the raw ensemble member predictions (blue, red, and black squares), the BMA predictive mean (yellow square), and the verifying AMSR-E observation (green square).

Citation: Journal of Hydrometeorology 9, 6; 10.1175/2008JHM909.1

An example of BMA predictive pdf (thick curve) and its three conditional components (thin curves) for the 37 GHz (V) brightness temperature prediction at FHQ site at 0900 UTC 20 Mar 2003. Also shown are the raw ensemble member predictions (blue, red, and black squares), the BMA predictive mean (yellow square), and the verifying AMSR-E observation (green square).

Citation: Journal of Hydrometeorology 9, 6; 10.1175/2008JHM909.1

An example of BMA predictive pdf (thick curve) and its three conditional components (thin curves) for the 37 GHz (V) brightness temperature prediction at FHQ site at 0900 UTC 20 Mar 2003. Also shown are the raw ensemble member predictions (blue, red, and black squares), the BMA predictive mean (yellow square), and the verifying AMSR-E observation (green square).

Citation: Journal of Hydrometeorology 9, 6; 10.1175/2008JHM909.1

AMSR-E measured brightness temperature vs a bootstrap realization of BMA predictive pdf of brightness temperature (blue color scale) at 18.7, 37, and 89 GHz (H, V) at FHQ site (February–May 2003). The solid yellow line represents BMA predictive mean, whereas dashed yellow lines represent the BMA predictive standard deviation envelopes.

Citation: Journal of Hydrometeorology 9, 6; 10.1175/2008JHM909.1

AMSR-E measured brightness temperature vs a bootstrap realization of BMA predictive pdf of brightness temperature (blue color scale) at 18.7, 37, and 89 GHz (H, V) at FHQ site (February–May 2003). The solid yellow line represents BMA predictive mean, whereas dashed yellow lines represent the BMA predictive standard deviation envelopes.

Citation: Journal of Hydrometeorology 9, 6; 10.1175/2008JHM909.1

AMSR-E measured brightness temperature vs a bootstrap realization of BMA predictive pdf of brightness temperature (blue color scale) at 18.7, 37, and 89 GHz (H, V) at FHQ site (February–May 2003). The solid yellow line represents BMA predictive mean, whereas dashed yellow lines represent the BMA predictive standard deviation envelopes.

Citation: Journal of Hydrometeorology 9, 6; 10.1175/2008JHM909.1

Summary of SEMs used in this study.

Comparison of mean performance statistics *T _{b}.* The means are taken over 30 bootstrap realizations of training and testing data.

Comparison of standard deviations (K) of the RMSE and MAE for BMA and raw SEMs predictions of *T _{b}.* The estimates are based on 30 bootstrap realizations of training and testing data.

^{1}

Andreadis et al. (2007) compared the grain size from VIC with sparse local scale observation site (LSOS) snowpit measurements. The results suggest that, on average, VIC tends to underestimate the in situ measured grain size.