1. Introduction




Migrating from single process, single variable oriented approaches to synergistic frameworks, like DA, or the hydrological consistency approach of McCabe et al. (2008) that integrate several diverse variables is to exploit fully the information content of available RS observations. Such efforts culminate in value-added level 4 “knowledge” (Asrar and Dokken 1993) data products, of which E is a primary example.
Over land, climate may be viewed as a complex balance of many highly coupled processes that collectively drive the exchanges of moisture and energy between soil, vegetation, and the overlying atmospheric planetary boundary layer (PBL; e.g., Betts et al. 1996; Dirmeyer 2006; Ek and Holtslag 2004; Shuttleworth 1988). Measurement of any number of these states, fluxes, or related variables, either concurrently or independently, at varying time steps and resolution as is often the reality with RS, provides an opportunity to establish confidence in the derived quantities through hydrological consistency. For example, for drought monitoring observations of low soil moisture alone may be insufficient because of the sensing depth (1–3 cm) and sampling (i.e., mixed pixels) limitations. Closely timed observations of zero P, low humidity, high Ts, and low Q, would improve the reliability of such monitoring (Luo and Wood 2007). Likewise, SM combined with P (warm season) or SM combined with P, Ts, SCA, and SWE (cold season) should reduce flood forecasting errors (Bindlish et al. 2009; Crow et al. 2005; Hong et al. 2007; Kumar et al. 2008). Linked, derived products enable independent uncertainty estimates, as demonstrated for P using SM (Crow and Bolten 2007), but the performance can benefit mutually (Crow et al. 2009).
That the land surface and atmospheric processes feedback on one another (i.e., coupling) has important implications for hydrologic intensification (see Huntington 2006) that may occur under a changing climate. Remote sensing offers the only viable method for quantifying through observation at regional to global scales, hydrologic sensitivity to climate change, including the strength of land–atmospheric coupling. The absence of an objective, multiseason, observation-based quantification of large-scale coupling strength has been recognized as a “major obstacle to the evaluation of model performance and development” (Guo et al. 2006). The GEWEX Global Land–Atmosphere Coupling Experiment (GLACE; Guo et al. 2006; Koster et al. 2006) called attention to distinct multimodel coupling “hot spots” located in the Sahel, the central Great Plains, India, and eastern Asia. To what extent are these spatial patterns of simulated coupling strength reflective of reality, and not just an artifact of the models (Koster et al. 2003)?
To truly address any of these issues requires that we first answer the following question: How well do we observe the underlying variables? Our current environment is characterized by an abundance of (RS) data that is of poorly quantified skill, a consequence of which is our present inability to prescribe meaningfully confidence limits to model outputs.
With these issues in mind, the aim of this paper is to validate “indirect” variables, including Ts, 2-m air temperature (Ta), and humidity—both relative (RH) and specific (q), such that confidence in derived coupling diagnostics, E and SM, for example, may be achieved. Historically, these variables have been undervalidated. We focus on the evaluation of AIRS, the most advanced atmospheric sounding system ever deployed in space (Chahine et al. 2006). Our analyses span a greater than 6-yr period (2002–08) over two highly diverse (both in climate and potential applicability) domains: CONUS and Africa. We evaluate the data against observations from more than 2000 ground stations from the National Climatic Data Center’s (NCDC) Integrated Surface Database (ISD; Lott et al. 2001), offline data from the North American Land Data Assimilation System (NLDAS; Mitchell et al. 2004), and estimates of Ts over Africa from the Meteorological Satellites (Meteosat) Second Generation’s (MSG) Spinning Enhanced Visible and Infrared Imager (SEVIRI; Schmetz et al. 2002). We quantify retrieval accuracies over a broad range of surface and atmospheric conditions. Specifically along gradients in Ta or Ts, cloud albedo (αcloud), total column precipitable water vapor/ice (pwv), elevation, topographic (vertical) complexity (vc), land cover (lc) class, leaf area index (LAI), snow cover, and season. Using statistics of bias, random error, and nonparametric correlation, we map the distribution of mean retrieval accuracies and provide a summary of conditions for which we expect retrievals to be most and least skillful.
2. Study domains and period
We focus our analyses on two regions: the CONUS domain (25°–53°N, 125°–67°W) and Africa (40°S–40°N, 20°W–55°E), which we hold to be global end members with respect to ground station network densities, and coincidentally the potential impact of RS data assimilation to weather forecast skill. CONUS boasts the highest-density openly available ground measurement network in the world, making it one of the most suitable domains for validating satellite-based near-surface variables. On the other hand, parts of Africa are the most data sparse in the world. Together, the two domains offer a large variety of retrieval conditions (in terms of both landscapes and climate) for which to evaluate retrieval accuracies. The total study period spans from September 2002 through December 2008, with two exceptions: 1) analyses using CERES-derived αcloud are limited to September 2002–August 2007, and 2) analyses using SEVIRI Ts are limited to July 2005–2009 (see section 3). Only the ascending, or afternoon (∼1:30 p.m. local time) overpass, is considered. Daytime retrievals are both more challenging and valuable than similar nighttime measurements because they capture the effects of differential heating and evaporation, or surface water and energy fluxes, which we aim to help quantify and that may feedback on the atmosphere.
3. Data and methods
Table 1 provides an overview of the 13 data products used in this study. Because of space constraints, we refer the reader to Ferguson et al. (2010) for a description of the following datasets: 1) the National Centers for Environmental Prediction Global Forecast System (NCEP GFS) surface pressure [packaged with AIRS Level 2 Support Product Dataset (AIRX2SUP)], 2) the Global Modeling and Assimilation Office (GMAO/GEOS-3) and the European Centre for Medium-Range Weather Forecasts (ECMWF) near-surface wind fields [packaged with the CERES Single Scanner Footprint (SSF)], 3) the NLDAS NCEP Eta Data Assimilation System (EDAS) forcing (see also Cosgrove et al. 2003), 4) the Variable Infiltration Capacity (VIC) model (Troy et al. 2008) surface temperature, and 5) the MODIS for North American Carbon Program (NACP; Wofsy and Harriss 2002) temporally smoothed and spatially complete leaf area index (F. Gao et al. 2008). All other datasets are briefly summarized herein. Note that NLDAS–VIC Ts will be implied when general reference to “NLDAS Ts” is made.
a. Remote sensing products
1) AIRS
The AIRS on NASA Aqua provides twice-daily global coverage of the atmospheric infrared spectrum (3.7–15.4 μm) in 2378 channels across three windows: 3.74–4.61, 6.20–8.22, and 8.8–15.4 μm. It is accompanied on Aqua by the AMSU, which measures microwave radiances at 15 channels between 23 and 89 GHz. Combined measurements from AIRS and AMSU serve as inputs to the AIRS cloud-clearing algorithm (Chahine 1974, 1977; Chahine et al. 2001, 2006) that enables successful soundings 47.5% and 72.3% of the time over land and ocean, respectively (Susskind et al. 2003).
AIRS was designed to provide sounding data with vertical resolution and accuracies on par with point-based radiosonde observations (RAOBs; 1-K RMS error in 1-km tropospheric layers and 20% RMS error in 2-km tropospheric layers for water vapor), but available globally, for the purpose of improved weather prediction through data assimilation (Susskind et al. 2006). Primary retrievals include atmospheric temperature and humidity profiles, ozone profiles, sea/land surface skin temperature, and cloud properties, including outgoing longwave radiation (Aumann et al. 2003). The observed RMS error can be described as the sum of four terms: 1) instrument noise and forward retrieval model error under perfectly cloud-free and uniform scene conditions, 2) errors introduced by the cloud-clearing algorithm, 3) errors in surface emissivity, and 4) temporal and spatial collocation errors (Chahine et al. 2006).
Since the launch of AIRS in May 2002, significant improvements (version updates) have been made to the original AIRS retrieval algorithm (Susskind et al. 1998)—these are well documented in Susskind et al. (2003) and Susskind et al. (2006). The current state of the AIRS retrieval algorithm, version 5, implements a radiative transfer algorithm (RTA) that accounts for nonlocal thermodynamic equilibrium (NLTE) effects on the shortwave channels, incorporates a new, accurate cloud clearing algorithm using AIRS spectra only, and provides for the first time, case-by-case product error estimates (Susskind et al. 2010).
(i) Preprocessing












(ii) Near-surface air temperature and humidity
Previously, AIRS Ta and q have been evaluated as part of the total vertical profile. Divakarla et al. (2006) compared AIRS temperature and water vapor profiles against collocated RAOBs at 538 stations globally for a 2-yr period (September 2002–December 2004) amounting to more than 82 000 intercomparisons. Over midlatitude land (50°–23°N, 50°–23°S), RMS differences of Ta and q ranged from 1.75 (P = 1000 hPa) to 1.25°C (P = 800 hPa) and 22 (P = 1000 hPa) to 27% (P = 800 hPa), respectively (Divakarla et al. 2006). Tobin et al. (2006), through a collaborative effort between the NASA–EOS project and the Department of Energy Atmospheric Radiation Measurement (ARM) program, performed a more targeted intercomparison over multiyear periods at three ARM sites using Aqua overpass-dedicated RAOBs. At the ARM Southern Great Plains (SGP) facility, they reported mean RMS differences of approximately 2°C and 25% for Ta and q, respectively (Tobin et al. 2006). Other studies have intercompared AIRS profiles with forecast models, such as the NCEP–GFS and ECMWF (Divakarla et al. 2006; Susskind et al. 2006).
The most thorough regional assessment of AIRS Ta to date, to our knowledge, was completed by W. H. Gao et al. (2008) using collocated observations from 540 meteorological stations in China (15°–55°N, 75°–130°E) for 2 months (July 2004 and January 2005). Using data of nominal quality (Qual_Temp_Profile_Bot < 2), mean RMS differences for daytime retrievals in July and January were found to be 3.4° and 2.8°C, respectively.
Beyond these studies, the best approximation of AIRS accuracies may well be error estimates for the Television and Infrared Observation Satellite (TIROS) Operational Vertical Sounder (TOVS), a first generation AIRS if you will. For example, Lakshmi et al. (2001) showed TOVS afternoon Ta retrievals were biased by +2.1°C with RMS differences of 3.8°C relative to 26 stations distributed over the Arkansas–Red River basin.
(iii) Surface temperature
Owing to subfootprint-scale spatial heterogeneity in surface emissivity, Ts products remain the most challenging (along with SM) to validate. Thermal infrared (TIR)-based RS sources including: MODIS, the Advanced Very High Resolution Radiometer (AVHRR), Geostationary Operational Environmental Satellites (GOES), geostationary Meteosat satellites, and the Advanced Spaceborne Thermal Emission and Reflection Radiometer (ASTER), offer at a minimum, the opportunity for independent quality checks of Ts. Knuteson et al. (2008) performed a global intercomparison between average monthly AIRS level 3 version 5 Ts and upscaled MODIS level 3 (MYD11C3) collection 4 Ts for July 2003. Across all land cover classes, excluding cases of snow/ice cover, AIRS and MODIS Ts agreed within 0.5° and 1.5°C for the night and daytime overpasses, respectively. The AIRS Science Team’s current estimate of AIRS Ts accuracy is 2°–3°C, globally.
(iv) Total column precipitable water vapor
We use AIRS pwv to evaluate the impact of water vapor on retrieval performance. Raja et al. (2008) showed strong agreement (0.91 ≤ R ≤ 0.98) between AIRS and Global Positioning System (GPS) pwv, using more than 375 National Oceanic and Atmospheric Administration (NOAA) Earth System Research Laboratory (ESRL; see online at http://gpsmet.noaa.gov) stations over CONUS, for the period of April–October 2004. The monthly mean bias (AIRS minus GPS) and RMS differences exhibited a systematic seasonal trend, ranging from −1.2 mm (July) to +0.75 mm (April), and 4.5 mm (July) to 3.0 mm (April), respectively (Raja et al. 2008).
A special issue of the Journal of Geophysical Research (2006, Vol. 111) details many of the AIRS Science Team’s validation activities (see Fetzer 2006).
2) SEVIRI/Meteosat
The European Organisation for the Exploitation of Meteorological Satellites (EUMETSAT) Land Surface Analysis-Satellite Applications Facility (LSA-SAF) generates, on an operational basis, Ts from the SEVIRI/Meteosat. SEVIRI Ts is available for a period of more than 4 yr (July 2005 to the present) at the full SEVIRI temporal (∼15 min) and spatial (∼3 km) resolution and quality screened for satellite zenith view angle (SZA) >60° and cloud cover. The retrieval algorithm is based on the generalized split-window (GSW) algorithm of Wan and Dozier (1996), initially developed for AVHRR and MODIS, but adapted for SEVIRI’s 10.8 and 12.0 μm channels (Madeira 2002). The algorithm’s parameters are functions of the SZA and pwv (obtained from ECMWF forecasts). The channel surface emissivity is computed as the area-weighted mean of bare soil and vegetation emissivities, where the fractional vegetation cover is a SAF-retrieved product (Garcia-Haro et al. 2005) and emissivities are preassigned by class in a land cover map (Peres and DaCamara 2005; Trigo et al. 2008a). The complete methodology is provided in the SAF Land Surface Temperature (LST) Product User Manual (available online at http://landsaf.meteo.pt/).
Errors in SEVIRI Ts are known to derive from uncertainties in surface emissivity, topographic complexity (vc), pwv, and SZA (Freitas et al. 2010; Trigo et al. 2008b). Theoretically, given an uncertainty of 1% in SAF emissivity, SEVIRI Ts accuracies should fall within a range of ±2°C (Kabsch et al. 2008; Trigo et al. 2008a). The most pronounced errors occur due to the presence of undetected clouds (NWC-SAF 2010). Using matchups at two independent ground radiometers located at the SAF validation station in Mitra, Portugal (38.54°N, 8.00°W) for 61 and 35 days in 2006, Kabsch et al. (2008) found SEVIRI Ts to be systematically biased high (+2.5°C) at the approximate time of Aqua’s daytime overpass.
From its native resolution, SEVIRI Ts was first elevation corrected to sea level and then resampled to 0.125° geographic coordinates using box averaging. Next, the 0.125° values were elevation corrected to GTOPO30 (see below) mean elevation. Finally, the 0.125° SEVIRI Ts record was queried using a temporal window of ±59 min, or in other words ±3 SEVIRI time steps, to determine the nearest temporal match to the coincident AIRS retrieval. The current LSA–SAF Ts retrieval algorithm (version 7.2) has benefited from a series of modifications over the period of record (Trigo et al. 2008b). Unfortunately, a complete reprocessing of the time series has not occurred. Therefore, we use the discontinuous SEVIRI Ts record as is. Beginning on 8 July 2008, the LSA–SAF Ts product provides greater spatial coverage and case-by-case error estimates. We sample this later period using the legacy LSA–SAF “nominal” and “below nominal” accuracy levels, or RMS <2°C, and RMS >2°C, to match the spatial coverage of the earlier record.
3) CERES


AIRX2SUP (see above) does include a TIR cloud fraction (TotCld_4_CCfinal) estimate valid for the AIRS field of regard (45 km) that generally agrees well with αcloud (CONUS: τ = 0.41). However, we found that αcloud was a stronger explanatory variable of accuracy (see supplemental Figs. S4–S5) over arid regions.
4) MODIS land cover classification
The MODIS Terra collection 4, 1-km yearly lc classification product (MOD12Q1; Friedl 1996) 14-class University of Maryland (UMD) scheme (Hansen et al. 2000) was used. Data tiles were first mosaiced and then reprojected from their native sinusoidal projection to 0.01° geographic coordinates using the MODIS Reprojection Tool (MRT) version 4.0. Since, at the time of writing, MOD12Q1 was only available for years 2001–04, year 2004 was used.
5) GTOPO30
The GTOPO30 global 30 arc second (∼1 km) digital elevation model (DEM) was first resampled to 0.125° geographic coordinates using box averaging and then applied to elevation correct the AIRS and SEVIRI temperature products (see above). This ensures topographic consistency, since NLDAS (and NLDAS–VIC) data are also GTOPO30 based.
6) SRTM-90m
The void-filled Shuttle Radar Topography Mission (SRTM) version 4, 90-m DEM (Reuter et al. 2007) was used to generate 0.125° fields of vc, defined as the standard deviation of SRTM 90-m elevation.
7) MODIS snow cover
We use the MODIS Terra (MOD10C1) and MODIS Aqua (MYD10C1; Hall et al. 2002) collection 5 0.05° daily global SCA products. These products were created using the daily MODIS Terra and MODIS Aqua 500-m tile products, MOD10A1 and MYD10A1, respectively. MYD10C1 is currently considered “provisional,” while MOD10C1 has been well validated (Hall and Riggs 2007).
b. In situ data
For the total record, hourly observations of 10-m wind (w), Ta, 2-m dewpoint temperature (Td), and surface pressure (Psurf) or alternatively, altimeter setting (alt), were obtained from the NCDC ISD. The ISD comprises worldwide surface weather observations from approximately 20 000 stations historically (∼12 000 available for 1995–2007), collected and stored from sources such as: the Automated Weather Network (AWN), the Global Telecommunications System (GTS), the Automated Surface Observing System (ASOS), as well as data keyed from paper forms (Lott et al. 2001). ISD data have undergone extensive automated quality control (Lott, 2004). Over CONUS, NCDC maintains an additional, more extensively screened database called the Quality Controlled Local Climatological Database (QCLCD).
For this study, data for Africa were obtained from ISD (available 27 January 2009: available online at ftp://ftp.ncdc.noaa.gov/pub/data/noaa/); data for CONUS were obtained from QCLCD v1.4 (available 28 January 2009: available online at http://cdo.ncdc.noaa.gov/qclcd_ascii/). Only observations satisfying the strictest quality flags were accepted. Furthermore, only stations with more than 50 matched (within ±90 min of the overpass) observations were considered. The median sample size per station over CONUS and Africa was 1695 ± 642 and 642 ± 462, respectively. Stations that satisfied the initial sample size criterion were then screened once more on a per variable basis using the Kendall’s τ correlation metric (hereafter τ). Any station/variable for which AIRS (NLDAS)–NCDC time series failed to correlate at the p = 0.01 significance level were culled from the analyses (see Table 2). Over CONUS and Africa, approximately 1500 and 500 stations were included, respectively (see Table 2).










c. Methods








4. Results
Over CONUS we first evaluate the AIRS and NLDAS data against NCDC in situ observations [section 4a(1)] and then evaluate AIRS relative to NLDAS offline data [section 4a(2)]. Spatially and temporally continuous (hourly) fields of elevation-corrected and validated (Luo et al. 2003; Mitchell et al. 2004) NLDAS meteorology afford the maximum number of matchups with AIRS. We exploit these data characteristics to evaluate the skill of AIRS retrievals over a broad range of surface and atmospheric conditions [section 4a(2)]. Specifically, we take sample subsets along gradients (generally quintiles) of AIRS Ts or Ta, αcloud, pwv, elevation, vc, UMD lc class, LAI, and season, including SCA. The precise break points that we employed are provided in Table 3, listed by domain. Quintiles were used for the pwv (calculated from the January through December 2004 distribution), elevation, vc, and LAI. In the case of pwv, two supplemental levels—the 5th and 95th percentile—were also included. The break points for αcloud and Ts or Ta were selected arbitrarily. The warm and cold seasons correspond to the months of May–September and October–April, respectively. Similar analyses were carried out over Africa between AIRS and NCDC [section 4b(1)] and AIRS and SEVIRI [section 4b(2)]. Quoted statistics are for the total record, unless otherwise noted.
a. Continental United States
1) Comparison with NCDC
Figures 1 and 2 illustrate the spatial distribution of AIRS–NCDC (Fig. 1) and NLDAS–NCDC (Fig. 2) mean bias (Ta) and relative bias (q and RH). Figure 3 shows the spatial distribution of correlation between GFS and NCDC Psurf and GEOS-3 and NCDC w. Figure 4 provides side-by-side comparisons of the frequency distributions of error statistics from AIRS/GFS/GEOS-3–NCDC and NLDAS–NCDC matchups for all variables—the mean statistics of which have been summarized in Table 4.
Figure 1 shows a strong east (cold or neutral) to west (warm) transition in AIRS Ta bias occurs near 97.5°W—right along the hot spot region and typical strong vegetation/soil moisture gradient, not mirrored by NLDAS. The Ta bias is well correlated with elevation (τ = 0.49, Pearson’s R = 0.62), particularly for mean elevations greater than 500 m, with slope of +2.1°C km−1. For a majority of the Midwest, Ta is shown to have an absolute bias of less than 1°C. Over central Iowa (41°–43°N, 94°–92°W, n = 53), Ta is in fact unbiased (0°C, n = 16). Elsewhere, along the Texas–Oklahoma border (34°–37°N, 103°–98°W) the mean bias is +3.0°C (n = 24) and along the Gulf Coast (29°–31.5°N, 95°–85°W) the mean bias is −2.1°C (n = 53).
Figures 1 and 4 show that q is predominantly biased dry (negative). If we consider the same regions (as above), we find q to have a relative bias of +0.7% (Texas–Oklahoma), −17.7% (Gulf Coast), and −19.9% (central Iowa). On the other hand, RH has a slight positive (wet) bias over CONUS. Although, over the same regions, RH has a relative bias of −2.5% (Texas–Oklahoma), +0.2% (Gulf Coast), and −14.2% (central Iowa). Most of the dry bias in RH occurs within a distinct band known as the Midwest “corn belt” (see Fig. 1).
As shown in Fig. 4, there is a bimodal distribution of GFS–NCDC (and NLDAS–NCDC) Psurf correlation and random error. We determined that the area east of 97.5°W was the source of this pattern. For example, 35 of 69 stations in Illinois have a τ less than 0.20 and ubRMS greater than 7.5 hPa. Of the remaining stations, 32 have a τ greater than 0.84 and ubRMS less than 2.0 hPa. Overall, both GFS and NLDAS Psurf and GEOS-3 and NLDAS w datasets are most uncertain west of 104°W (west). The median RMS error for GFS (NLDAS) Psurf is 21.1 hPa (10.1 hPa) in the west (n ∼ 300) and 6.2 hPa (4.6 hPa) in the east (n ∼ 1160). The median RMS error for GEOS-3 (NLDAS) w is 2.2 m s−1 (2.1 m s−1) in the west and 1.7 m s−1 (1.8 m s−1) in the east. GFS Psurf is distinctly biased low in the West (median = −17.9 hPa), but nearly unbiased in the East (median = +0.4 hPa). Area-mean GFS Psurf is biased relatively low in northern New York, New Hampshire, and Vermont (−10.9 hPa, n = 39) and high in the Midwest (Minnesota, Wisconsin, Iowa, and Illinois: +5.0 ± 6.8 hPa, n = 263) and Virginia and North Carolina (+6.1 hPa, n = 96). GEOS-3 w is biased low in Texas, Oklahoma, Kansas, eastern New Mexico, and southern California (−0.73 ± 0.63 m s−1, n = 257) and slightly high along the eastern coast (Delaware, Maryland, Virginia, North Carolina, South Carolina, and Georgia: +0.5 ± 0.5 m s−1, n = 170). Over CONUS, GFS Psurf and GEOS-3 w have mean biases of −2.5 hPa and −0.1 m s−1, respectively (Table 4). Agreement between GFS and NCDC Psurf (τ = −0.54), and to a lesser extent GEOS-3 and NLDAS w (τ = −0.29), was found to be anticorrelated with vc.
Relative to the NCDC stations (truth), NLDAS, and AIRS/GFS/GEOS-3 exhibit different error characteristics (Fig. 4). For all variables, NLDAS forcing is consistently better correlated with less uncertainty relative to AIRS/GFS/GEOS-3. Differences between GFS and NLDAS Psurf and GEOS-3 and NLDAS w, which are both model outputs, may be attributed to issues of scale (GFS Psurf ∼ 45 km, GEOS-3 w ∼20 km, and NLDAS ∼12.5 km).
Relative to NCDC, AIRS, and NLDAS: 1) Ta are similarly biased (±0.5°C) over Iowa, Missouri, southern Wisconsin, and eastern Oklahoma; 2) q are similarly biased (±5%) over northern and western Texas and western Oklahoma; and 3) RH are similarly biased (±5%) over the larger part of CONUS, but disagree in sign over the Midwest (e.g., Minnesota, Wisconsin, Iowa, northern Illinois, northern Indiana, Ohio), and the Southeast (e.g., Alabama, Georgia, South Carolina, North Carolina, and Virginia). However, the domain-averaged bias of AIRS (−10.3%) and NLDAS (+8.8%) q are of opposite sign (Table 4). Over California’s Sacramento River valley (n = 25), both AIRS and NLDAS are shown to be biased warm (AIRS: +1.6°C; NLDAS: +2.3°C) and dry (AIRS q: −14.5%; NLDAS q: −6.8%).
When NLDAS is used as the surrogate truth (next section) for calculating AIRS bias, the true AIRS bias (i.e., AIRS minus NCDC) may be under (subtractive) or over (additive) represented—a function of the inherent NLDAS–NCDC error characteristics. Accordingly, it is feasible that the sign of the true AIRS–NCDC bias differs from the calculated AIRS–NLDAS bias. Such cases are fairly limited, but do occur at 7.1% (q), 9.8% (Ta), and 12.0% (RH) of the NCDC stations.
The representativeness of the NCDC ground station network at the scale of AIRS, especially in the West, where station density falls off and vc increases, is disputable. Station data is also not without error. Biases are introduced through the affect of localized influences due to imperfect siting (i.e., proximity to trees, buildings, and parking lots; Davey and Pielke 2005). We therefore argue that NLDAS forcing, the accuracy of which we have demonstrated, rather than NCDC stations, is more appropriate for comparisons conducted on the scale order of AIRS. Accordingly, in the conditional sampling analyses to follow, we use NLDAS forcing as surrogate truth.
2) Comparison with NLDAS
Figure 5 shows the geographic distribution of AIRS warm season retrieval accuracies relative to NLDAS. Overall, patterns in AIRS–NCDC (Fig. 1) and AIRS–NLDAS (Fig. 5) bias are spatially coherent. The median AIRS–NLDAS biases were found to be +2.1°C (Ts), −0.5°C (Ta), −17.5% (q), and −7.0% (RH). The median random errors are 4.4°C (Ts), 3.2°C (Ta), 20.8% (q), and 22.3% (RH). Agreement between AIRS and NLDAS is generally strong with median τs of 0.62 (Ts), 0.66 (Ta), 0.55 (q), and 0.48 (RH). For Ts, the ubRMS local maxima are centered over the Midwest corn belt and the West Texas high plains. For Ta, q, and RH, the largest uncertainty is observed in the West, but particularly in California and Nevada for q and RH. Local minima in correlation occur over the Southeast (Ts and Ta) and along the Gulf Coast and Pacific Northwest (q). The patterns of RH error and correlation represent the combined effect of Ta and q accuracy characteristics. Across CONUS, q is biased dry (negative) with the exception of the Cascade, Sierra Nevada, and Peninsular mountain ranges of California, which coincidentally, are also local maxima of uncertainty (ubRMSP > 30%). Anomaly correlations computed using a climatological (2002–08) ±5-day moving average of each respective product/variable (AIRS and NLDAS), show that seasonality does impact the error characteristics—much more so for humidity than temperature. [Supplemental Fig. S2 details the seasonal dependence for each of the 13 major river basins in CONUS, as calculated from AIRS footprint (not gridded) retrievals.]
Figure 6 provides maps of AIRS surface air temperature gradient (Ts − Ta) error characteristics, also for the warm season. The bias is positive for all pixels, with a median of +2.1°C and 95th percentile equal to +6.1°C. The median τ and ubRMS differences were 0.2° and 3.4°C, respectively. Over the domain, we observed an anticorrelation between bias and LAI (τ = −0.46) and a positive correlation between elevation and bias (τ = 0.28). Here, LAI is calculated as the 4-yr (2003–06) warm season mean MODIS for NACP product (F. Gao et al. 2008; Jonsson and Eklundh, 2004). The strongest correlation occurs over the Great Plains.
The fact that the AIRS Ts − Ta gradient has a strong (positive) bias relative to NLDAS, particularly over the West, suggests that NLDAS–VIC sensible heat flux (H) may be too high (and evapotranspiration too low). This corroborates the finding of Ferguson et al. (2010), which showed NLDAS upward longwave radiation (LW↑) was biased low relative to International Satellite Cloud Climatology Project (ISCCP; Schiffer and Rossow 1983) LW↑. Specifically, they found mean daily differences of 19.7 and 22.7 W m−2 over the Colorado and Great basins, respectively. At mean temperatures of 15°C, this translates into ∼4°C difference in Ts.
The remainder of this section is devoted to summarizing the comparative shifts in AIRS accuracies (relative to NLDAS) with scene (both atmospheric and surface) conditions (Fig. 7). The underlying statistics (i.e., the 5-number summary) plus the means are provided for all boxplots in supplemental Tables S3–S6. Unless otherwise noted, 1) the difference in means between all histograms for a given condition (i.e., αcloud) are significant at the p = 0.05 level and 2) all cited statistics are medians of their respective distributions across CONUS.
It is important to note that neither the geographic coverage (i.e., the number of pixels) nor the quantity of sample pairs per pixel is constant for each histogram, but in fact varies widely, even between variables. For example, there are very few pixels south of 35°N included in the Ta < 0°C sample or west of 97.5°W (except western Washington and Oregon) included in the αcloud > 0.6 sample. Samples conditioned on lc class are particularly diverse in their coverage, and thus, have been addressed explicitly in Table 5. With the exception of a few land cover classes (i.e., lc03 over CONUS and lc01, lc03, and lc05 over Africa) that were eliminated from our analyses on the basis of insufficient data, there is ample coverage for each of the remaining samples. For the total record, our analyses considered 75 050 (179 380) 0.125° pixels over CONUS (Africa) each with 1700 ± 205 (470 ± 185) data pairs.
(i) Season
The Ta retrievals exhibit a negative bias (−0.5°C) during the warm season and slightly positive bias (+0.1°C) during the cold season. Relative to the cold season, Ta ubRMS differences are reduced from 3.8° to 2.5°C and τ is increased from 0.73 to 0.78, when snow covered-only scenes are considered. This finding is reasonable, considering the presence of snow inhibits the influence of differential surface heating on the lower atmosphere, and hence, the development of thermal gradients. Conversely, snow cover diminishes the correlation between AIRS and NLDAS Ts (0.69 to 0.46) and RH (0.42 to 0.13). The distribution of Ts ubRMS is particularly heavy tailed (5th percentile = 3.0°C, 95th percentile = 6.8°C) for snow covered-only retrievals, which may be the result of snow/no-snow discrepancies between NLDAS and MODIS. The mean values of total and warm season bias in q and of warm and cold season τ do not differ significantly.
(ii) Surface/air temperature
Of all sample subsets, agreement between AIRS and NLDAS Ts reaches its minimum (τ = 0.43) for subzero surface temperatures Centigrade. Under the same conditions, random errors in Ta (4.2°C) and q (32.6%) are at or near study maximums, which may explain the minimum in τ for RH (0.08). Correlation between AIRS and NLDAS Ta is low at both temperature extremes (Ta < 0°C, τ = 0.43; Ta > 30°C, τ = 0.44), while correlation between AIRS and NLDAS q is relatively constant with temperature. Although for warmer scenes (Ta > 20°C), ubRMSP in q is reduced.
(iii) Cloud albedo
The accuracies of Ta, Ts, q, and to a lesser extent RH, are all strongly linked to cloud coverage. Under clear skies (αcloud < 0.05), the AIRS–NLDAS correlation is strongest for all variables: 0.80(Ts), 0.86(Ta), 0.75(q), and 0.49(RH). With increasing clouds, correlation diminishes sharply and random errors also trend upward. Bias in Ts (Ta) changes in sign with cloud cover, from +3.6°C (+1.6°C) for clear skies to −5.5°C (−4.1°C) for overcast conditions (αcloud > 0.6). Bias in q grows in (negative) magnitude with increasing cloud coverage, from −9.7% (clear) to −29.0% (overcast). Traversing the gradient in αcloud, the dry (negative) bias in q and coincident cold (negative) bias in Ta counteract to constrain bias in RH within the range of −7.2% (0.05 ≤ αcloud < 0.2) to −7.5% (0.4 ≤ αcloud < 0.6) for typical cloud conditions. In a separate comparison (not shown), we found that the bias in Ta for mostly cloudy (0.4 ≤ αcloud < 0.6) and overcast scenes differed insignificantly from biases calculated using cold season-only retrievals for corresponding cloud types. We therefore suggest that clouds and not ambient temperature drive the seasonal error characteristics (above). Geographically, the largest differences between clear-sky and mostly cloudy Ts (Ta) bias occur over parts of Nevada, California, and Oregon (approximately 40°–44°N, 121°–116°W), with ranges of 7.6°C (5.3°C). For Ts, an additional local maxima in clear-minus-cloudy sky bias exists over western Texas (30°–34°N, 104°–101°W, +8.2°C).
(iv) Precipitable water vapor
For all variables, the distribution of AIRS–NLDAS τ has the widest spread for dry atmospheres (pwv < 3.18 mm). AIRS–NLDAS agreement is strongest for average pwv conditions. Accuracies for average conditions of pwv (20th–60th percentile) do not differ strongly. The impact of scene pwv on RH bias generally coincides with that of q and not Ta—trending toward positive with increasing pwv. However, the trend in uncertainty of RH corresponds with the general trend in uncertainty of Ta.
Along a gradient of increasing pwv, uncertainty in q is shown to diminish sharply: from 38.4% (pwv < 3.18 mm) to 15.1% (pwv > 34.29 mm). This inverse relationship between random error and pwv seemingly contradicts the trend between random error and clouds (above). Here, we emphasize that αcloud and pwv are not one in the same. While clouds exist at varying (and often multiple) levels of the atmosphere, pwv is driven by the lowest (and most humid) layers in the atmosphere. Therefore, pwv is strongly linked with RH, not necessarily αcloud. High pwv samples should correspond with warm and humid events/regions. Accordingly, the error characteristics should be similar to those from high Ta samples (Ta > 20°C), which Fig. 7 confirms.
(v) Elevation
We previously noted a direct relationship between Ts, Ta, and Ts − Ta bias and mean elevation. In terms of their bias, q and RH only exhibit sensitivity in the uppermost quintiles of elevation. The uncertainty in RH is shown to increase with elevation, coincident with diminished correlation of q and increased (positive) bias in Ta.
In another experiment (not shown), we considered the same intervals in elevation for clear-sky-only retrievals in order to isolate strictly elevation-dependent effects. We found that the trends for all-sky cases (see Fig. 7) remain consistent. However, the magnitude of Ts and Ta (warm) bias increased significantly for all quintiles under clear skies. In terms of Ta, for example, biases for each quintile shifted from: −1.6° to +0.4°C, −1.1° to +0.5°C, −0.6° to +1.2°C, 2.1° to 3.7°C, and 2.9° to 5.1°C. Likewise, for Ts, biases shifted from −0.2° to +2.5°C, 0° to 2.5°C, 0.4° to 2.9°C, 3.8° to 6.2°C, and 5.2° to 8.2°C.
(vi) Vertical complexity
Trends in error with increasing vc, were consistent with those observed for elevation (above). Given the strong correspondence between elevation and vc, this is not surprising. However, the trend is less distinct, with fatter-tailed distributions of bias for each interval of vc relative to elevation quintile counterparts. Therefore, we hold that elevation, and not vc, is the driving variable.
(vii) UMD land cover class
Land cover classes effectively aggregate the influences of regional climatology (atmospheric and radiative forcing), terrain, vegetation thickness (LAI), and heterogeneity (emissivity) influences on AIRS retrievals. Large bias-driven uncertainty in Ts (+3.6° to +7.6°C) and Ta (+1.9° to +3.3°C) is shown for retrievals over shrublands (lc06, lc07), grasslands (lc10), and barren ground (lc16). Over evergreen broadleaf (lc02), deciduous broadleaf (lc04), and mixed forests (lc05), the range of Ts (−0.4° to 0°C) and Ta (−2.2° to −1.3°C) bias is comparatively lower. However, the fact that the range (5th–95th percentile) over evergreen needleleaf forest (lc01) is nearly 4 times broader suggests that factors other than vegetation are dominating retrieval accuracy. For all variables, there are multiple pairs of landcover classes that yield insignificantly different error statistics relative to one another. The RH has the greatest uncertainty over shrubland (lc06, ubRMSP = 39.2%; lc07, ubRMSP = 36.7%) and barren ground (ubRMSP = 40.6%). Over savannas (lc09), the AIRS and NLDAS RH exhibit the best agreement (τ = 0.56), but AIRS and NLDAS q are least correlated (τ = 0.39). Over open shrubland (lc07), q is nearly unbiased (+1.0%).
(viii) LAI
Trends in bias, ubRMS, and τ are shown to be inversely related to LAI. Between the lowest and uppermost quintile of LAI, ubRMS(P) is reduced from 5.6° to 4.2°C, 3.8° to 3.0°C, 30.7% to 18.0%, and 28.7% to 18.6% in the case of Ts, Ta, q, and, RH, respectively. Biases in Ts (Ta) shift from +2.1° to −0.5°C (+1.7° to −1.8°C) along the gradient of LAI. With increasing LAI, it is possible that the respective surfaces for which NLDAS and AIRS Ts and Ta are representative are diverging.
b. Africa
1) Comparisons with NCDC
Figures 8 and 9 are maps of the AIRS/GFS/GEOS-3–NCDC error statistics over Africa. Africa has been divided into four subdomains: Sahara (SAH: 18°–30°N, 20°W–65°E), western Africa (WAF: 12°S–18°N, 20°W–22°E), eastern Africa (EAF: 12°S–18°N, 22°–52°E), and southern Africa (SAF: 35°–12°S, 10°W–52°E) for which the regional mean error statistics are provided. Domain-averaged values are provided in Table 4. Analyses are limited by station coverage and density, which is especially sparse (or nonexistent) over the Sahara Desert (15°–30°N), central Africa (i.e., Angola, Congo, Democratic Republic of the Congo, Equatorial Guinea, Uganda, and Zambia), and Somalia. The foremost limiting factor in our analyses was the availability of Psurf, which is only reported at 75% of the stations (see Table 2).
AIRS temperature and humidity retrievals exhibit relatively higher uncertainties over the SAH and EAF, and SAH and SAF, respectively (see Fig. 8). Over SAH, q (−30.3%) and RH (−25.6%) exhibit substantial negative biases, while Ta is biased negatively over SAH (−1.4°C) and WAF (−1.6°C) and positively over EAF (+4.6°C) and SAF (+3.5°C). Consistent with findings over CONUS, increases in Ta bias were found to correlate (τ = 0.41) with increases in elevation (not shown). Notably, the mean slope of the relationship is of much larger magnitude (+10.7°C km−1 above 500 m). GFS Psurf is correlated (τ = 0.39) with large random errors (4.9 hPa) over the EAF, likely due to the regions’ terrain. The range in Psurf bias and ubRMS across the four subdomains is −4.8 to +2.7 and 3.1 to 4.9 hPa, respectively. Agreement between GEOS-3 and NCDC w is poor for all regions, but at a minimum over EAF (τ = 0.20). The range in w bias and ubRMS across the four subdomains is −0.3 to +0.5 and 2.0 to 2.5 m s−1, respectively.
2) Comparisons with SAF-LST
Figure 10 illustrates results from AIRS-SEVIRI Ts intercomparisons over Africa for the total record. For 98% of the pixels, AIRS is biased warm relative to SEVIRI. The median bias, ubRMS, and τ are +0.5°C, 3.9°C, and 0.68, respectively. The 95th percentile of bias is +6.4°C. For a subdomain of the African Monsoon Multidisciplinary Analysis (AMMA) study region (12.5°–17.5°N, 2.5°W– 2.5°E), Ts is biased by +1.9°C with a ubRMS of 4.6°C, and τ of 0.58. The correlation between Ts bias and elevation is not particularly strong (τ = −0.14). Nevertheless, the largest biases are found above 1000 m in elevation, with the exception of the Sahara.
Figure 11 (which is presented quantitatively in supplemental Fig. S7) shows results from conditional sampling analyses. As with CONUS, the range in geographic coverage of samples conditioned on landcover class warranted disclosure (see Table 5). Seasonality is less straightforward when considering Africa as a whole and therefore replaced with SEVIRI quality levels: “nominal” and “below nominal.” Use of nominal quality-only (RMS < 2°C) SEVIRI Ts data has the primary effect of reducing coverage to 122 100 (0.125°) pixels located south of the 16°N parallel (and the Sahara). Relative to all available matchups (+0.5°C), nominal quality-only sampling yields a marginally lower bias (−0.1°C). However, the mean ubRMS differs by only 0.08°C between sampling approaches; the agreement is actually lower for the nominal quality-only sample (nominal, τ = 0.64; all, τ = 0.68). LSA–SAF applies a cloud mask, which is why there is no data available for intercomparison in overcast conditions (αcloud > 0.6).
The error characteristics derived from RS product–product (this section) and RS product–model [section 4a(2)] intercomparisons were determined to be fairly robust despite the geographic diversity of the domains over which they were computed. For example (positive) bias in Ts is reduced, random error is enhanced, and correlation weakened with increasing αcloud. With increasing LAI, bias, random error, and correlation were also reduced. Conversely, we did not observe similar trends in bias and random error as a function of pwv, elevation, or vc. Bias and ubRMS both increase with pwv over Africa (unlike AIRS-NLDAS) but, correlation diminishes (like AIRS–NLDAS). In fact, the greatest overall consistency is observed in scenes of the lowest pwv (<6.34 mm), for which bias, ubRMS, and τ are −0.9°C, 2.1°C, and 0.80, respectively. For clear-skies, the bias is reduced (−0.12°C), but we observe less overall agreement (ubRMS = 3.7°C, τ = 0.66). With regard to elevation and vc, bias and ubRMS are either reduced or remain constant. The fact that the relative range in τ for each quintile of elevation, vc, and lc type is substantially broader relative to that observed over CONUS is also noteworthy. We hypothesize that both RS retrievals are similarly (and consistently, with respect to one another) affected by factors that vary with altitude (i.e., elevation, vc, and lc), but that SEVIRI is relatively more sensitive to pwv. The strongest and weakest AIRS–SEVIRI correlation occurs over barren ground (0.73) and evergreen broadleaf forest (0.39), respectively.
5. Significance of error characteristics
a. Soil moisture and cloud base height
The error characteristics identified in this study have nontrivial implications to a multitude of applications in land hydrology. In Fig. 12a, we illustrate the sensitivity of Princeton’s remotely sensed X-band soil moisture (SM) retrievals (Gao et al. 2006) to errors in Ts across the vegetation gradient. The envelope of SM sensitivity (0.6%–3.5%°C−1) is bounded by the relative vegetation extremes of Jornada basin, New Mexico (barren) and Little River, Georgia (croplands). Figures 12b and 12c show the sensitivity of cloud base height (LCL; refer to Bolton 1980) to errors in Ta (∼10 hPa °C−1) and q [∼20 hPa (g kg−1)−1]. In the lower atmosphere, LCL dependence on Psurf is negligible (not shown). By construct, RH and LCL are closely linked.
b. Evapotranspiration
Using the AIRS–NLDAS biases calculated for warm season (see Fig. 5), we assessed the impact of applying bias correction on AIRS Ta, Ts, and RH inputs to estimates of 4-yr (2003–06) mean warm season daytime RS–E for 12, 1° × 1° regions of varying land cover and climate, and 7 major hydrological basins in CONUS (Table 6). RS–E was calculated using the modified Penman–Monteith (P–M) formulation (Mu et al. 2007), replacing GMAO meteorological inputs with inputs from AIRS and CERES, as formally described in Ferguson et al. (2010). Included in the set of study sites are five U.S. Department of Agriculture Agricultural Research Service (USDA–ARS) experimental watersheds and four BigFoot (Cohen et al. 2006; Turner et al. 2006) sites. We note here that Lamont and Little Washita, Oklahoma are contained within the North American hot spot for land–atmosphere coupling (Koster et al. 2004). Daytime VIC–E minus the free-water (canopy) evaporation component, which is not accounted for in the P–M framework, is included in Table 6 and serves as our best estimate of truth. With bias correction applied, RS–E is reduced over the semiarid Colorado (−23%) and arid great (−18%) basins and enhanced by 8%–30% over the remaining temperate basins. Since the VIC model uses inputs from NLDAS, it is not surprising that bias-corrected RS–E estimates at basin-scale shift in the direction of correspondent VIC–E estimates. Additionally, we show (Table 6) that biases of ±0.5 and ±1.0 m s−1 in w can contribute an additional −5.8% to +3.9% and −13.0% to +6.4% uncertainty to basin-scale RS–E, respectively. Bias correction did impact the RS–E sampling characteristics because for pixels where net radiation became negative due to the correction of Ts, RS–E was set to missing.
VIC–E is not without uncertainty. Considering the degree of irrigation in Fresno and the greater Sacramento River valley, a warm season E of 15.6 mm month−1 seems unrealistic and is probably due to ignoring the effects of irrigation in this version of VIC. Nevertheless, the scope of this study precludes further investigation.
For Africa, assuming typical atmospheric and radiative conditions [Psurf = 977 hPa, Ts = 30.4°C, w = 1.9 m s−1, Rn = 9.56 MJ m−2 day−1, momentum roughness length (Zom) = 0.015 m, water vapor roughness length = Zom(10 m)−1], potential evaporation (PE) from bare soil was estimated to be 419 cm yr−1 using the P–M equation (Monteith 1964) with surface resistance set to 0. If we apply constant daytime biases in Ts of +1°, +3°, +6°, +9°C, correspondent estimates of PE increase by 1%, 2.9%, 5.5%, and 7.9%, respectively.
6. Summary and conclusions
This study represents a careful assessment of AIRS-derived near-surface meteorology (Ts, Ta, q, RH, and Ts − Ta), GFS Psurf, and GEOS-3 w through intercomparison with ground measurements (NCDC), offline model output (NLDAS), and geostationary RS retrievals (SEVIRI). Our objective was threefold: 1) to quantify the mean accuracies of AIRS retrievals over an extensive domain; 2) to identify the scene conditions (atmospheric and surface) for which retrievals are particularly skillful, or conversely, prone to error; and 3) to quantify the impacts of these uncertainties on land surface hydrology variables E and SM. The nature of the task is that error estimates—and these conclusions—are invariably a product of several factors, beginning foremost with the quality screening protocol that is applied at the footprint level, the spatial interpolation technique, and ultimately, the domain and season that the analysis is conducted.
We focused this study on evaluating the 0.125° gridded AIRS product [section 3a(1)] over CONUS and Africa, but for the benefit of our discussion, also provided summary error statistics at the AIRS footprint (13.5 km) scale (Fig. A1, supplemental Figs. S1–S5) over CONUS. Through conditional sampling, we explicitly investigated retrieval errors across gradients in ambient surface and air temperature, cloud albedo, total column precipitable water vapor/ice, elevation, topographic complexity, land cover class, leaf area index, snow cover, and season (Figs. 7 and 11). Under the best retrieval conditions—clear skies—the interquartile range of AIRS–NLDAS RMS errors were found to be 3.7°–7.7°C (Ts), 2.3°–4.6°C (Ta), 26.9%–27.2% (q), and 26.2%–32.2% (RH). Over Africa, AIRS–SEVIRI Ts intercomparisons yielded an interquartile RMS range of 3.6°–4.8°C for clear skies and significantly lower (2.4°–2.6°C) for atmospherically dry (pwv < 6.34 mm) conditions. Using data matchups at ∼1500 NCDC ground stations, we found mean AIRS/GFS/GEOS-3–NCDC (NLDAS–NCDC) RMS errors of 3.8°C (2.3°C), 32.0% (21.3%), 30.7% (24.4%), 12.1 hPa (9.3 hPa), and 1.9 m s−1 (2.0 m s−1) for Ta, q, RH, Psurf, and w, respectively. Notably, the accuracy of NLDAS forcing has only previously been verified for a 3-yr period (1996–99) over the SGP (Luo et al. 2003; Mitchell et al. 2004). Using ∼500 NCDC stations in Africa, mean AIRS/GFS/GEOS-3–NCDC RMS errors were found to be 4.6°C (Ta), 33.8% (q), 34.3% (RH), 11.0 hPa (Psurf), and 2.4 m s−1 (w).
In general, AIRS q and Ta were shown to be biased dry (CONUS: −10.3%; Africa: −12.4%) and (slightly) warm (CONUS: +0.2°C; Africa: +1.0%). Previously, dry biases were also noted in AIRS–GPS pwv comparisons over CONUS (Raja et al. 2008). The interquartile ranges in AIRS–NLDAS Ta and q RMS (above) encompass prior estimates of 2.8°C (January) and 3.4°C (July) for China (W.H. Gao et al. 2008) and 27% (P = 800 hPa) globally (Divakarla et al. 2006). Biases in Ta were shown to correlate well with elevation (above 500 m), as best illustrated by the east (negative) to west (positive) gradient in bias over CONUS (Fig. 1). This supports similar findings of Jones et al. (2010) and Gao et al. (2008b) with AIRS and Lakshmi et al. (2001) with TOVS. We suggest that this correlation is induced by changes in several variables with altitude, including lc class, LAI, and humidity. Because of the combined effects of errors in Ta and q, RH had a much lower bias (CONUS: −1.4%; Africa: −10.5%). There is a distinctly negative-biased band of RH correspondent with the Midwest “corn belt” (Fig. 1).
Collectively, our findings indicate that the gridded near-surface AIRS variables are not meeting the 2°–3°C (Ts), 1.0°C (Ta) and 20% (q and RH) RMS accuracy criterion set forth by the AIRS Science Team. Of course, within the CONUS and Africa subdomains, there exists substantial spatial diversity in error characteristics. For example, mean errors are clearly affected by the west CONUS and Sahara Africa. Moreover, the NLDAS and NCDC datasets do have inherent uncertainty. Mitchell et al. (2004) determined that NLDAS–VIC Ts had an RMS of 4.5°C relative to the Geostationary Operational Environmental Satellite (GOES) over the north-central CONUS during July 1999. And this study’s findings [section 4a(2)], taken jointly with those of Ferguson et al. (2010), suggest that NLDAS–VIC Ts is biased cold over the west CONUS.
AIRS provides a unique set of variables for land surface hydrological studies that are ideally available coincidentally (in time and space) with radiation and microwave measurements from the CERES and the Advanced Microwave Scanning Radiometer- Earth Observing System (AMSRE-E) sensors, also on board NASAs Aqua. In section 5, we quantified the impact of errors in Ts on microwave soil moisture retrievals and estimates of PE, errors in Ta and q on estimates of LCL, and errors in Ts, Ta, RH, and w on estimates of warm-season daytime RS–E. These limited examples derive from a much larger set of applications for which there is a clear need for confidence/error estimates to quantify input-driven uncertainty. Despite challenges to skillful retrievals, the strength of RS remains to be that it provides accurate patterns of real-life land surface variability, which is currently missing from land hydrology and atmospheric analyses (McCabe et al. 2008).
Throughout this study, an underlying question was whether beyond CONUS, in data-scarce regions such as Africa, could AIRS uncertainty be quantified a priori as a function of scene atmospheric and surface conditions? Our conclusion is that although this may be feasible, multiple explanatory variable are necessary and it will likely require regional and temporally varying regression relationships that may only be obtainable through initial AIRS versus truth intercomparisons.
Only the ascending (∼1330 local time) retrievals have been evaluated. In the future, our framework may be transferred to nighttime retrievals, as well as expanded globally. Given insights gained from this study, we would divide the analysis by river basin, as we have done here for the footprint-scale intercomparisons (supplemental Figs. S2–S5), and vary the search radius of the interpolation scheme as a function of mean pressure difference with distance (see supplemental Table S2).
Acknowledgments
This research was jointly supported through the author’s NASA Earth and Space Science Fellowship NNX08AU28H: “Understanding Hydrologic Sensitivity and Land–Atmosphere Coupling through Space-Based Remote Sensing” and NASA Grant NNG04GQ32G: “A Terrestrial Evaporation Product Using MODIS Data” (Eric F. Wood, PI). We gratefully acknowledge the efforts of Pedro Diegues at LSA-SAF (see online at http://landsaf.meteo.pt/) who provided the SEVIRI/Meteosat data. AIRS level 2 data products were obtained from the NASA Goddard Earth Sciences Data and Information Services Center via FTP (airspar1u.ecs.nasa.gov). CERES SSF products were obtained from the Atmospheric Science Data Center at the NASA Langley Research Center (see online at http://eosweb.larc.nasa.gov/). The SRTM version 4 90m DEM was obtained from the International Centre for Tropical Agriculture (CIAT; see online at http://srtm.csi.cgiar.org). The GTOPO30 DEM was obtained from the U.S. Geological Survey’s Center for Earth Resources Observation and Science (EROS) Data Center (see online at http://edc.usgs.gov/). The MOD12Q1 land cover dataset was obtained from the NASA-developed Earth Observing System (EOS) Clearinghouse (ECHO) through the NASA Warehouse Inventory Search Tool (WIST; see online at https://wist.echo.nasa.gov). MODIS snow products were obtained through the National Snow and Ice Data Center (NSIDC; see online at http://nsidc.org) Distributed Active Archive Center. The MRT is maintained and distributed by the NASA Land Processes Distributed Active Archive Center (LPDAAC; see online at https://lpdaac.usgs.gov).
REFERENCES
Alsdorf, D., Bates P. , Lettenmaier D. , Mognard N. , Moller D. , and Rodriguez E. , 2008: What will SWOT measurements and products look like? Eos, Trans. Amer. Geophys. Union,89 (Fall Meeting Suppl.), Abstract H311-03.
Andersson, E., and Coauthors, 2007: Analysis and forecast impact of the main humidity observing systems. Quart. J. Roy. Meteor. Soc., 133 , 1473–1485.
Andreadis, K. M., and Lettenmaier D. P. , 2006: Assimilating remotely sensed snow observations into a macroscale hydrology model. Adv. Water Resour., 29 , 872–886.
Asrar, G., and Dokken D. J. , 1993: Earth Observing System (EOS) reference handbook. National Aeronautics and Space Administration, Earth Science Support Office, Document Resource Facility, 145 pp.
Aumann, H. H., and Coauthors, 2003: AIRS/AMSU/HSB on the Aqua mission: Design, science objectives, data products, and processing systems. IEEE Trans. Geosci. Remote Sens., 41 , 253–264.
Betts, A. K., Ball J. H. , Beljaars A. C. M. , Miller M. J. , and Viterbo P. A. , 1996: The land surface-atmosphere interaction: A review based on observational and global modeling perspectives. J. Geophys. Res., 101 , 7209–7225.
Bindlish, R., Crow W. T. , and Jackson T. J. , 2009: Role of passive microwave remote sensing in improving flood forecasts. IEEE Geosci. Remote Sens. Lett., 6 , 112–116.
Birkett, C. M., 1998: Contribution of the TOPEX NASA radar altimeter to the global monitoring of large rivers and wetlands. Water Resour. Res., 34 , 1223–1239.
Bolton, D., 1980: The computation of equivalent potential temperature. Mon. Wea. Rev., 108 , 1046–1053.
Bowling, L. C., and Coauthors, 2003: Simulation of high-latitude hydrological processes in the Torne-Kalix basin: PILPS phase 2(e)—1: Experiment description and summary intercomparisons. Global Planet. Change, 38 , 1–30.
Chahine, M. T., 1974: Remote sounding of cloudy atmospheres. Part I: Single cloud layer. J. Atmos. Sci., 31 , 233–243.
Chahine, M. T., 1977: Remote sounding of cloudy atmospheres. Part II: Multiple cloud formations. J. Atmos. Sci., 34 , 744–757.
Chahine, M. T., and Coauthors, 2001: AIRS—Team retrieval for core products and geophysical parameters: Level 2 Algorithm Theoretical Basis Document. JPL D-17006, NASA GSFC, 198 pp. [Available online at http://eospso.gsfc.nasa.gov/eos_homepage/for_scientists/atbd/docs/AIRS/atbd-airs-L2.pdf].
Chahine, M. T., and Coauthors, 2006: Improving weather forecasting and providing new data on greenhouse gases. Bull. Amer. Meteor. Soc., 87 , 911–926.
Cohen, W. B., and Coauthors, 2006: MODIS land cover and LAI collection 4 product quality across nine sites in the western hemisphere. IEEE Trans. Geosci. Remote Sens., 44 , 1843–1857.
Cosgrove, B. A., and Coauthors, 2003: Real-time and retrospective forcing in the North American Land Data Assimilation System (NLDAS) project. J. Geophys. Res., 108 , 8842. doi:10.1029/2002JD003118.
Crow, W. T., and Wood E. F. , 2003: The assimilation of remotely sensed soil brightness temperature imagery into a land surface model using Ensemble Kalman filtering: A case study based on ESTAR measurements during SGP97. Adv. Water Resour., 26 , 137–149.
Crow, W. T., and Bolten J. D. , 2007: Estimating precipitation errors using spaceborne surface soil moisture retrievals. Geophys. Res. Lett., 34 , L08403. doi:10.1029/2007GL029450.
Crow, W. T., Bindlish R. , and Jackson T. J. , 2005: The added value of spaceborne passive microwave soil moisture retrievals for forecasting rainfall-runoff partitioning. Geophys. Res. Lett., 32 , L18401. doi:10.1029/2005GL023543.
Crow, W. T., Huffman G. J. , Bindlish R. , and Jackson T. J. , 2009: Improving satellite-based rainfall accumulation estimates using spaceborne surface soil moisture retrievals. J. Hydrometeor., 10 , 199–212.
Darnell, W. L., Staylor W. F. , Gupta S. K. , Ritchey N. A. , and Wilber A. C. , 1992: Seasonal-variation of surface radiation budget derived from International Satellite Cloud Climatology Project C1 data. J. Geophys. Res., 97 , 15741–15760.
Davey, C. A., and Pielke R. A. , 2005: Microclimate exposures of surface-based weather stations—Implications for the assessment of long-term temperature trends. Bull. Amer. Meteor. Soc., 86 , 497–504.
Dirmeyer, P. A., 2006: The hydrologic feedback pathway for land–climate coupling. J. Hydrometeor., 7 , 857–867.
Divakarla, M. G., Barnet C. D. , Goldberg M. D. , McMillin L. M. , Maddy E. , Wolf W. , Zhou L. H. , and Liu X. P. , 2006: Validation of Atmospheric Infrared Sounder temperature and water vapor retrievals with matched radiosonde measurements and forecasts. J. Geophys. Res., 111 , D09S15. doi:10.1029/2005JD006116.
Ek, M. B., and Holtslag A. A. M. , 2004: Influence of soil moisture on boundary layer cloud development. J. Hydrometeor., 5 , 86–99.
Ferguson, C. R., Sheffield J. , Wood E. F. , and Gao H. , 2010: Quantifying uncertainty in a remote sensing-based estimate of evapotranspiration over continental USA. Int. J. Remote Sens., 14 , 3821–3865.
Fetzer, E. J., 2006: Preface to special section: Validation of Atmospheric Infrared Sounder Observations. J. Geophys. Res., 111 , D09S01. doi:10.1029/2005JD007020.
Frappart, F., Calmant S. , Cauhope M. , Seyler F. , and Cazenave A. , 2006: Preliminary results of ENVISAT RA-2-derived water levels validation over the Amazon basin. Remote Sens. Environ., 100 , 252–264.
Freitas, S. C., Trigo I. F. , Bioucas-Dias J. M. , and Gottsche F. M. , 2010: Quantifying the uncertainty of land surface temperature retrievals from SEVIRI/Meteosat. IEEE Trans. Geosci. Remote Sens., 48 , 523–534.
Friedl, M. A., 1996: Relationships among remotely sensed data, surface energy balance, and area-averaged fluxes over partially vegetated land surfaces. J. Appl. Meteor., 35 , 2091–2103.
Ganguly, S., Samanta A. , Schull M. A. , Shabanov N. V. , Milesi C. , Nemani R. R. , Knyazikhin Y. , and Myneni R. B. , 2008: Generating vegetation leaf area index Earth system data record from multiple sensors. Part 2: Implementation, analysis and validation. Remote Sens. Environ., 112 , 4318–4332.
Gao, F., and Coauthors, 2008: An algorithm to produce temporally and spatially continuous MODIS-LAI time series. IEEE Geosci. Remote Sens. Lett., 5 , 60–64.
Gao, H., Wood E. F. , Jackson T. J. , Drusch M. , and Bindlish R. , 2006: Using TRMM/TMI to retrieve surface soil moisture over the southern United States from 1998 to 2002. J. Hydrometeor., 7 , 23–38.
Gao, W. H., Zhao F. S. , Xu Y. F. , and Feng X. , 2008: Validation of the surface air temperature products retrieved from the atmospheric infrared sounder over China. IEEE Trans. Geosci. Remote Sens., 46 , 1783–1789.
Garcia-Haro, F. J., Sommer S. , and Kemper T. , 2005: A new tool for variable multiple endmember spectral mixture analysis (VMESMA). Int. J. Remote Sens., 26 , 2135–2162.
Guo, Z. C., and Coauthors, 2006: GLACE: The Global Land–Atmosphere Coupling Experiment. Part II: Analysis. J. Hydrometeor., 7 , 611–625.
Gupta, S. K., Kratz D. P. , Stackhouse P. W. , and Wilber A. C. , 2001: The Langley Parameterized Shortwave Algorithm(LPSA) for surface radiation budget studies (Version 1.0). NASA/TP-2001-211272, 31 pp.
Hall, D. K., and Riggs G. A. , 2007: Accuracy assessment of the MODIS snow products. Hydrol. Processes, 21 , 1534–1547.
Hall, D. K., Riggs G. A. , Salomonson V. V. , DiGirolamo N. E. , and Bayr K. J. , 2002: MODIS snow-cover products. Remote Sens. Environ., 83 , 181–194.
Hansen, M. C., Defries R. S. , Townshend J. R. G. , and Sohlberg R. , 2000: Global land cover classification at 1 km spatial resolution using a classification tree approach. Int. J. Remote Sens., 21 , 1331–1364.
Holmes, T., De Jeu R. A. M. , Owe M. , and Dolman A. J. , 2009: Land surface temperature from Ka band (37 GHz) passive microwave observations. J. Geophys. Res., 114 , D04113. doi:10.1029/2008JD010257.
Hong, Y., Adler R. F. , Negri A. , and Huffman G. J. , 2007: Flood and landslide applications of near real-time satellite rainfall products. Nat. Hazards, 43 , 285–294.
Hsu, K. L., Gao X. G. , Sorooshian S. , and Gupta H. V. , 1997: Precipitation estimation from remotely sensed information using artificial neural networks. J. Appl. Meteor., 36 , 1176–1190.
Hsu, K. L., Gupta H. V. , Gao X. G. , and Sorooshian S. , 1999: Estimation of physical variables from multichannel remotely sensed imagery using a neural network: Application to rainfall estimation. Water Resour. Res., 35 , 1605–1618.
Huffman, G. J., Adler R. F. , Stocker E. , Bolvin D. , and Nelkin E. , 2003: Analysis of TRMM 3-hourly multi-satellite precipitation estimates computed in both real and post-real time. Preprints, 12th Conf. on Satellite Meteorology and Oceanography, Long Beach, CA, Amer. Meteor. Soc., P4.11.
Huffman, G. J., and Coauthors, 2007: The TRMM Multisatellite Precipitation Analysis (TMPA): Quasi-global, multiyear, combined-sensor precipitation estimates at fine scales. J. Hydrometeor., 8 , 38–55.
Huntington, T. G., 2006: Evidence for intensification of the global water cycle: Review and synthesis. J. Hydrol., 319 , 83–95.
Jones, L. A., Ferguson C. R. , Kimball J. S. , Zhang Z. , Chan S. K. , McDonald K. C. , Njoku E. G. , and Wood E. F. , 2010: Satellite microwave remote sensing of daily land surface air temperature minima and maxima from AMSR-E: Comparison with AIRS/AMSU. IEEE J. Selected Topics Appl. Earth Observ. Remote Sens., 3 , 111–123.
Jonsson, P., and Eklundh L. , 2004: TIMESAT—A program for analyzing time-series of satellite sensor data. Comput. Geosci., 30 , 833–845.
Joyce, R. J., Janowiak J. E. , Arkin P. A. , and Xie P. P. , 2004: CMORPH: A method that produces global precipitation estimates from passive microwave and infrared data at high spatial and temporal resolution. J. Hydrometeor., 5 , 487–503.
Kabsch, E., Olesen F. S. , and Prata F. , 2008: Initial results of the land surface temperature (LST) validation with the Evora, Portugal ground-truth station measurements. Int. J. Remote Sens., 29 , 5329–5345.
Kelly, R. E., Chang A. T. , Tsang L. , and Foster J. L. , 2003: A prototype AMSR-E global snow area and snow depth algorithm. IEEE Trans. Geosci. Remote Sens., 41 , 230–242.
Knuteson, R. O., Ackerman S. A. , Revercomb H. , and Tobin D. , 2008: Comparison of NASA AIRS and MODIS land surface temperature and infrared emissivity measurements from the EOS Aqua platform. Int. Workshop on the Retrieval and Use of Land Surface Temperature: Bridging the Gaps, Asheville, NC, NOAA/NCDC.
Koster, R. D., Suarez M. J. , Higgins R. W. , and Van den Dool H. M. , 2003: Observational evidence that soil moisture variations affect precipitation. Geophys. Res. Lett., 30 , 1241. doi:10.1029/2002GL016571.
Koster, R. D., and Coauthors, 2004: Regions of strong coupling between soil moisture and precipitation. Science, 305 , 1138–1140.
Koster, R. D., and Coauthors, 2006: GLACE: The Global Land–Atmosphere Coupling Experiment. Part I: Overview. J. Hydrometeor., 7 , 590–610.
Kumar, S. V., Reichle R. H. , Peters-Lidard C. D. , Koster R. D. , Zhan X. W. , Crow W. T. , Eylander J. B. , and Houser P. R. , 2008: A land surface data assimilation framework using the land information system: Description and applications. Adv. Water Resour., 31 , 1419–1432.
Lakshmi, V., Czajkowski K. , Dubayah R. , and Susskind J. , 2001: Land surface air temperature mapping using TOVS and AVHRR. Int. J. Remote Sens., 22 , 643–662.
Lott, N., 2004: The quality control of the integrated surface hourly database. Preprints, 14th Conf. on Applied Climatology, Seattle, WA, Amer. Meteor. Soc., 7.8.
Lott, N., Baldwin R. , and Jones P. , 2001: The FCC integrated surface hourly database: A new resource of global climate data. National Climatic Data Center Tech. Rep. 2001-01, U.S. Department of Commerce, National Environmental Satellite Data and Information Service, National Climatic Data Center, 42 pp.
Lu, H., Koike T. , Fujii H. , Ohta T. , and Tamagawa K. , 2009: Development of a physically-based soil moisture retrieval algorithm for spaceborne passive microwave radiometers and its application to AMSR-E. J. Remote Sens. Soc. Japan, 29 , 253–261.
Luo, L. F., and Wood E. F. , 2007: Monitoring and predicting the 2007 U.S. drought. Geophys. Res. Lett., 34 , L22702. doi:10.1029/2007GL031673.
Luo, L. F., and Coauthors, 2003: Validation of the North American Land Data Assimilation System (NLDAS) retrospective forcing over the southern Great Plains. J. Geophys. Res., 108 , 8843. doi:10.1029/2002JD003246.
Madeira, C., 2002: Generalized split-window algorithm for retrieving land-surface temperature from MSG/SEVIRI data. Proc. Land Surface Analysis SAF Training Workshop, Lisbon, Portugal, European Organisation for the Exploitation of Meteorological Satellites, 42–47.
Margulis, S. A., and Entekhabi D. , 2003: Variational assimilation of radiometric surface temperature and reference-level micrometeorology into a model of the atmospheric boundary layer and land surface. Mon. Wea. Rev., 131 , 1272–1288.
McCabe, M. F., Wood E. F. , Wójcik R. , Pan M. , Sheffield J. , Gao H. , and Su H. , 2008: Hydrological consistency using multi-sensor remote sensing data for water and energy cycle studies. Remote Sens. Environ., 112 , 430–444.
Mitchell, K. E., and Coauthors, 2004: The multi-institution North American Land Data Assimilation System (NLDAS): Utilizing multiple GCIP products and partners in a continental distributed hydrological modeling system. J. Geophys. Res., 109 , D07S90. doi:10.1029/2003JD003823.
Monteith, J. L., 1964: Evaporation and environment: The state of movement of water in living organisms. Symp. Soc. Exp. Biol., 19 , 205–234.
Mu, Q., Heinsch F. A. , Zhao M. , and Running S. W. , 2007: Development of a global evapotranspiration algorithm based on MODIS and global meteorology data. Remote Sens. Environ., 111 , 519–536.
Njoku, E. G., Jackson T. J. , Lakshmi V. , Chan T. K. , and Nghiem S. V. , 2003: Soil moisture retrieval from AMSR-E. IEEE Trans. Geosci. Remote Sens., 41 , 215–229.