A Comparison of Methods for Filling Gaps in Hourly Near-Surface Air Temperature Data

Brian Henn Department of Civil and Environmental Engineering, University of Washington, Seattle, Washington

Search for other papers by Brian Henn in
Current site
Google Scholar
PubMed
Close
,
Mark S. Raleigh Department of Civil and Environmental Engineering, University of Washington, Seattle, Washington

Search for other papers by Mark S. Raleigh in
Current site
Google Scholar
PubMed
Close
,
Alex Fisher Department of Civil and Environmental Engineering, University of Washington, Seattle, Washington

Search for other papers by Alex Fisher in
Current site
Google Scholar
PubMed
Close
, and
Jessica D. Lundquist Department of Civil and Environmental Engineering, University of Washington, Seattle, Washington

Search for other papers by Jessica D. Lundquist in
Current site
Google Scholar
PubMed
Close
Restricted access

Abstract

Near-surface air temperature observations often have periods of missing data, and many applications using these datasets require filling in all missing periods. Multiple methods are available to fill missing data, but the comparative accuracy of these approaches has not been assessed. In this comparative study, five techniques were used to fill in missing temperature data: spatiotemporal correlations in the form of empirical orthogonal functions (EOFs), time series diurnal interpolation, and three variations of lapse rate–based filling. The method validation used sets of hourly surface temperature observations in complex terrain from five regions. The most accurate method for filling missing data depended on the number of available stations and the number of hours of missing data. Spatiotemporal correlations using EOF reconstruction were most accurate provided that at least 16 stations were available. Temporal interpolation was the most accurate method when only one or two stations were available or for 1-h gaps. Lapse rate–based filling was most accurate for intermediate numbers of stations. The accuracy of the lapse rate and EOF methods was found to be sensitive to the vertical separation of stations and the degree of correlation between them, which also explained some of the regional differences in performance. Horizontal distance was less significantly correlated with method performance. From these findings, guidelines are presented for choosing a filling method based on the duration of the missing data and the number of stations.

Current affiliation: Horn Point Laboratory, University of Maryland Center for Environmental Science, Cambridge, Maryland.

Corresponding author address: Brian Henn, Department of Civil and Environmental Engineering, University of Washington, Box 352700, Seattle, WA 98195-2700. E-mail: bhenn@u.washington.edu

Abstract

Near-surface air temperature observations often have periods of missing data, and many applications using these datasets require filling in all missing periods. Multiple methods are available to fill missing data, but the comparative accuracy of these approaches has not been assessed. In this comparative study, five techniques were used to fill in missing temperature data: spatiotemporal correlations in the form of empirical orthogonal functions (EOFs), time series diurnal interpolation, and three variations of lapse rate–based filling. The method validation used sets of hourly surface temperature observations in complex terrain from five regions. The most accurate method for filling missing data depended on the number of available stations and the number of hours of missing data. Spatiotemporal correlations using EOF reconstruction were most accurate provided that at least 16 stations were available. Temporal interpolation was the most accurate method when only one or two stations were available or for 1-h gaps. Lapse rate–based filling was most accurate for intermediate numbers of stations. The accuracy of the lapse rate and EOF methods was found to be sensitive to the vertical separation of stations and the degree of correlation between them, which also explained some of the regional differences in performance. Horizontal distance was less significantly correlated with method performance. From these findings, guidelines are presented for choosing a filling method based on the duration of the missing data and the number of stations.

Current affiliation: Horn Point Laboratory, University of Maryland Center for Environmental Science, Cambridge, Maryland.

Corresponding author address: Brian Henn, Department of Civil and Environmental Engineering, University of Washington, Box 352700, Seattle, WA 98195-2700. E-mail: bhenn@u.washington.edu
Save
  • Beckers, J. M., and Rixen M. , 2003: EOF calculations and data filling from incomplete oceanographic datasets. J. Atmos. Oceanic Technol., 20, 18391856.

    • Search Google Scholar
    • Export Citation
  • Box, G. E. P., and Jenkins G. M. , 1976: Time Series Analysis: Forecasting and Control. Holden-Day, 575 pp.

  • Claridge, D. E., and Chen H. , 2006: Missing data estimation for 1–6 h gaps in energy use and weather data using different statistical methods. Int. J. Energy Res., 30, 10751091, doi:10.1002/er.1207.

    • Search Google Scholar
    • Export Citation
  • Daly, C., Gibson W. P. , Taylor G. H. , Johnson G. L. , and Pasteris P. , 2002: A knowledge-based approach to the statistical mapping of climate. Climate Res., 22, 99113.

    • Search Google Scholar
    • Export Citation
  • Daly, C., Halbleib M. , Smith J. I. , Gibson W. P. , Doggett M. K. , Taylor G. H. , Curtis J. , and Pasteris P. P. , 2008: Physiographically sensitive mapping of climatological temperature and precipitation across the conterminous United States. Int. J. Climatol., 28, 20312064, doi:10.1002/joc.1688.

    • Search Google Scholar
    • Export Citation
  • Daly, S. F., Davis R. , Ochs E. , and Pangburn T. , 2000: An approach to spatially distributed snow modeling of the Sacramento and San Joaquin basins, California. Hydrol. Processes, 14, 32573271.

    • Search Google Scholar
    • Export Citation
  • Dodson, R., and Marks D. , 1997: Daily air temperature interpolated at high spatial resolution over a large mountainous region. Climate Res., 8, 120.

    • Search Google Scholar
    • Export Citation
  • Eischeid, J. K., Pasteris P. A. , Diaz H. F. , Plantico M. S. , and Lott N. J. , 2000: Creating a serially complete, national daily time series of temperature and precipitation for the western United States. J. Appl. Meteor., 39, 15801591.

    • Search Google Scholar
    • Export Citation
  • Garen, D. C., Johnson G. L. , and Hansen C. L. , 1994: Mean areal precipitation for daily hydrologic modeling in mountainous regions. J. Amer. Water Resour. Assoc., 30, 481491.

    • Search Google Scholar
    • Export Citation
  • Gunes, H., Sirisup S. , and Karniadakis G. E. , 2006: Gappy data: To krig or not to krig? J. Comput. Phys., 212, 358382, doi:10.1016/j.jcp.2005.06.023.

    • Search Google Scholar
    • Export Citation
  • Hamlet, A. F., and Lettenmaier D. P. , 2005: Producing temporally consistent daily precipitation and temperature fields for the continental United States. J. Hydrometeor., 6, 330336.

    • Search Google Scholar
    • Export Citation
  • Liston, G. E., and Elder K. , 2006: A meteorological distribution system for high-resolution terrestrial modeling (MicroMet). J. Hydrometeor., 7, 217234.

    • Search Google Scholar
    • Export Citation
  • Lundquist, J. D., and Cayan D. R. , 2007: Surface temperature patterns in complex terrain: Daily variations and long-term change in the central Sierra Nevada, California. J. Geophys. Res., 112, D11124, doi:10.1029/2006JD007561.

    • Search Google Scholar
    • Export Citation
  • Lundquist, J. D., and Huggett B. , 2008: Evergreen trees as inexpensive radiation shields for temperature sensors. Water Resour. Res., 44, W00D04, doi:10.1029/2008WR006979.

    • Search Google Scholar
    • Export Citation
  • Lundquist, J. D., Cayan D. R. , and Dettinger M. D. , 2003: Meteorology and hydrology in Yosemite National Park: A sensor network application. Information Processing in Sensor Networks, F. Zhao and L. Guibas, Eds., Association for Computing Machinery, 518–528.

  • Lundquist, J. D., Pepin N. , and Rochford C. , 2008: Automated algorithm for mapping regions of cold-air pooling in complex terrain. J. Geophys. Res., 113, D22107, doi:10.1029/2008JD009879.

    • Search Google Scholar
    • Export Citation
  • MathWorks, Inc., 2010: MATLAB, version 7.11.0. MathWorks, Inc.

  • Maurer, E. P., Wood A. W. , Adam J. C. , Lettenmaier D. P. , and Nijssen B. , 2002: A long-term hydrologically based dataset of land surface fluxes and states for the conterminous United States. J. Climate, 15, 32373251.

    • Search Google Scholar
    • Export Citation
  • Meek, D. W., and Hatfield J. L. , 1994: Data quality checking for single-station meteorological databases. Agric. For. Meteor., 69, 85109.

    • Search Google Scholar
    • Export Citation
  • Minder, J. R., Mote P. W. , and Lundquist J. D. , 2010: Surface temperature lapse rates over complex terrain: Lessons from the Cascade Mountains. J. Geophys. Res., 115, D14122, doi:10.1029/2009JD013493.

    • Search Google Scholar
    • Export Citation
  • Pape, R., Wundram D. , and Löffler J. , 2009: Modelling near-surface temperature conditions in high mountain environments: An appraisal. Climate Res., 39, 99109, doi:10.3354/cr00795.

    • Search Google Scholar
    • Export Citation
  • Pepin, N., and Kidd D. , 2006: Spatial temperature variation in the eastern Pyrenees. Weather, 61, 300310, doi:10.1256/wea.106.06.

  • Preisendorfer, R. W., 1988: Principal Component Analysis in Meteorology and Oceanography. Elsevier, 425 pp.

  • Raible, C. C., Bischof G. , Fraedrich K. , and Kirk E. , 1999: Statistical single-station short-term forecasting of temperature and probability of precipitation: Area interpolation and NWP combination. Wea. Forecasting, 14, 203214.

    • Search Google Scholar
    • Export Citation
  • Raleigh, M. S., and Lundquist J. D. , 2012: Comparing and combining SWE estimates from the SNOW-17 model using PRISM and SWE reconstruction. Water Resour. Res., 48, W01506, doi:10.1029/2011WR010542.

    • Search Google Scholar
    • Export Citation
  • Ralph, F. M., and Coauthors, 2005: Improving short-term (0–48 h) cool season quantitative precipitation forecasting: Recommendations from a USWRP workshop. Bull. Amer. Meteor. Soc., 86, 16191632.

    • Search Google Scholar
    • Export Citation
  • Rolland, C., 2003: Spatial and seasonal variations of air temperature lapse rates in Alpine regions. J. Climate, 16, 10321046.

  • Serreze, M. C., Clark M. P. , Armstrong R. L. , McGinnis D. A. , and Pulwarty R. S. , 1999: Characteristics of the western United States snowpack from snowpack telemetry (SNOTEL) data. Water Resour. Res., 35, 21452160, doi:10.1029/1999WR900090.

    • Search Google Scholar
    • Export Citation
  • Stahl, K., Moore R. D. , Floyer J. A. , Asplin M. G. , and McKendry I. G. , 2006: Comparison of approaches for spatial interpolation of daily air temperature in a large region with complex topography and highly variable station density. Agric. For. Meteor., 139, 224236, doi:10.1016/j.agrformet.2006.07.004.

    • Search Google Scholar
    • Export Citation
  • Tobin, C., Nicotina L. , Parlange M. B. , Berne A. , and Rinaldo A. , 2011: Improved interpolation of meteorological forcings for hydrologic applications in a Swiss Alpine region. J. Hydrol., 401, 7789, doi:10.1016/j.jhydrol.2011.02.010.

    • Search Google Scholar
    • Export Citation
  • Von Storch, H., and Zwiers F. W. , 1999: Statistical Analysis in Climate Research. Cambridge University Press, 484 pp.

  • Walton, T. L., 1996: Fill-in of missing data in univariate coastal data. J. Appl. Stat., 23, 3140.

  • Webster, R., and Oliver M. A. , 2001: Geostatistics for Environmental Scientists. John Wiley and Sons, 271 pp.

  • Wigmosta, M. S., Vail L. W. , and Lettenmaier D. P. , 1994: A distributed hydrology–vegetation model for complex terrain. Water Resour. Res., 30, 16651679.

    • Search Google Scholar
    • Export Citation
All Time Past Year Past 30 Days
Abstract Views 0 0 0
Full Text Views 2350 1031 52
PDF Downloads 1317 276 17