Evaluation and Statistical Correction of Area-Based Heat Index Forecasts That Drive a Heatwave Warning Service

Nicholas Loveday Bureau of Meteorology, Melbourne, Victoria, Australia

Search for other papers by Nicholas Loveday in
Current site
Google Scholar
PubMed
Close
https://orcid.org/0009-0000-5796-7069
and
Maree Carroll Bureau of Meteorology, Melbourne, Victoria, Australia

Search for other papers by Maree Carroll in
Current site
Google Scholar
PubMed
Close
https://orcid.org/0009-0008-6830-8251
Restricted access

Abstract

This study evaluates the performance of the area-based, district heatwave forecasts that drive the Australian heatwave warning service. The analysis involves using a recently developed approach of scoring multicategorical forecasts using the fixed risk multicategorical (FIRM) scoring framework. Additionally, we quantify the stability of the district forecasts between forecast updates. Notably, at longer lead times, a discernible overforecast bias exists that leads to issuing severe and extreme heatwave district forecasts too frequently. Consequently, at shorter lead times, forecast heatwave categories are frequently downgraded with subsequent revisions. To address these issues, we demonstrate how isotonic regression can be used to conditionally bias correct the district forecasts. Finally, using synthetic experiments, we illustrate that even if an area warning is derived from a perfectly calibrated gridded forecast, the area warning will be biased in most situations. We show how these biases can also be corrected using isotonic regression which could lead to a better warning service. Importantly, the evaluation and bias correction approaches demonstrated in this paper are relevant to forecast parameters other than heat indices.

Significance Statement

Heatwaves have terrible impacts on society. Understanding the performance of heatwave forecasts and warnings is important so that we can understand how they can be improved to reduce negative health impacts. We demonstrate techniques to evaluate the performance, biases, and stability of area-based forecasts that are used by a warning service. We illustrate how area-based forecasts can be improved through conditional bias correction. Additionally, we conduct a synthetic experiment that shows that deriving an area-based warning from a perfectly calibrated gridded forecast may lead to biased warnings.

© 2025 American Meteorological Society. This published article is licensed under the terms of the default AMS reuse license. For information regarding reuse of this content and general copyright information, consult the AMS Copyright Policy (www.ametsoc.org/PUBSReuseLicenses).

Corresponding author: Nicholas Loveday, nicholas.loveday@bom.gov.au

Abstract

This study evaluates the performance of the area-based, district heatwave forecasts that drive the Australian heatwave warning service. The analysis involves using a recently developed approach of scoring multicategorical forecasts using the fixed risk multicategorical (FIRM) scoring framework. Additionally, we quantify the stability of the district forecasts between forecast updates. Notably, at longer lead times, a discernible overforecast bias exists that leads to issuing severe and extreme heatwave district forecasts too frequently. Consequently, at shorter lead times, forecast heatwave categories are frequently downgraded with subsequent revisions. To address these issues, we demonstrate how isotonic regression can be used to conditionally bias correct the district forecasts. Finally, using synthetic experiments, we illustrate that even if an area warning is derived from a perfectly calibrated gridded forecast, the area warning will be biased in most situations. We show how these biases can also be corrected using isotonic regression which could lead to a better warning service. Importantly, the evaluation and bias correction approaches demonstrated in this paper are relevant to forecast parameters other than heat indices.

Significance Statement

Heatwaves have terrible impacts on society. Understanding the performance of heatwave forecasts and warnings is important so that we can understand how they can be improved to reduce negative health impacts. We demonstrate techniques to evaluate the performance, biases, and stability of area-based forecasts that are used by a warning service. We illustrate how area-based forecasts can be improved through conditional bias correction. Additionally, we conduct a synthetic experiment that shows that deriving an area-based warning from a perfectly calibrated gridded forecast may lead to biased warnings.

© 2025 American Meteorological Society. This published article is licensed under the terms of the default AMS reuse license. For information regarding reuse of this content and general copyright information, consult the AMS Copyright Policy (www.ametsoc.org/PUBSReuseLicenses).

Corresponding author: Nicholas Loveday, nicholas.loveday@bom.gov.au
Save
  • Borg, M., M. Nitschke, S. Williams, S. McDonald, J. Nairn, and P. Bi, 2019: Using the excess heat factor to indicate heatwave-related urinary disease: A case study in Adelaide, South Australia. Int. J. Biometeorol., 63, 435447, https://doi.org/10.1007/s00484-019-01674-5.

    • Search Google Scholar
    • Export Citation
  • Cheng, J., and Coauthors, 2019: Cardiorespiratory effects of heatwaves: A systematic review and meta-analysis of global epidemiological evidence. Environ. Res., 177, 108610, https://doi.org/10.1016/j.envres.2019.108610.

    • Search Google Scholar
    • Export Citation
  • Coates, L., J. van Leeuwen, S. Browning, A. Gissing, J. Bratchell, and A. Avci, 2022: Heatwave fatalities in Australia, 2001–2018: An analysis of coronial records. Int. J. Disaster Risk Reduct., 67, 102671, https://doi.org/10.1016/j.ijdrr.2021.102671.

    • Search Google Scholar
    • Export Citation
  • Diebold, F. X., and R. S. Mariano, 1995: Comparing predictive accuracy. J. Bus. Econ. Stat., 13, 253263, https://doi.org/10.1080/07350015.1995.10524599.

    • Search Google Scholar
    • Export Citation
  • Engel, C., and E. E. Ebert, 2012: Gridded operational consensus forecasts of 2-m temperature over Australia. Wea. Forecasting, 27, 301322, https://doi.org/10.1175/WAF-D-11-00069.1.

    • Search Google Scholar
    • Export Citation
  • Fawcett, R., and T. Hume, 2010: The GOCF/AWAP system—Forecasting temperature extremes. IOP Conf. Ser.: Earth Environ. Sci., 11, 012005, https://doi.org/10.1088/1755-1315/11/1/012005.

    • Search Google Scholar
    • Export Citation
  • Ford, T. W., P. A. Dirmeyer, and D. O. Benson, 2018: Evaluation of heat wave forecasts seamlessly across subseasonal timescales. npj Climate Atmos. Sci., 1, 20, https://doi.org/10.1038/s41612-018-0027-7.

    • Search Google Scholar
    • Export Citation
  • Glowacki, T. J., Y. Xiao, and P. Steinle, 2012: Mesoscale Surface Analysis System for the Australian domain: Design issues, development status, and system validation. Wea. Forecasting, 27, 141157, https://doi.org/10.1175/WAF-D-10-05063.1.

    • Search Google Scholar
    • Export Citation
  • Gneiting, T., 2011: Making and evaluating point forecasts. J. Amer. Stat. Assoc., 106, 746762, https://doi.org/10.1198/jasa.2011.r10138.

    • Search Google Scholar
    • Export Citation
  • Gneiting, T., F. Balabdaoui, and A. E. Raftery, 2007: Probabilistic forecasts, calibration and sharpness. J. Roy. Stat. Soc., 69B, 243268, https://doi.org/10.1111/j.1467-9868.2007.00587.x.

    • Search Google Scholar
    • Export Citation
  • Griffiths, D., M. Foley, I. Ioannou, and T. Leeuwenburg, 2019: Flip-Flop Index: Quantifying revision stability for fixed-event forecasts. Meteorol. Appl., 26, 3035, https://doi.org/10.1002/met.1732.

    • Search Google Scholar
    • Export Citation
  • Hansen, A., P. Bi, M. Nitschke, P. Ryan, D. Pisaniello, and G. Tucker, 2008: The effect of heat waves on mental health in a temperate Australian City. Environ. Health Perspect., 116, 13691375, https://doi.org/10.1289/ehp.11339.

    • Search Google Scholar
    • Export Citation
  • Hawkins, M. D., V. Brown, and J. Ferrell, 2017: Assessment of NOAA National Weather Service methods to warn for extreme heat events. Wea. Climate Soc., 9, 513, https://doi.org/10.1175/WCAS-D-15-0037.1.

    • Search Google Scholar
    • Export Citation
  • Heo, S., A. Nori-Sarma, K. Lee, T. Benmarhnia, F. Dominici, and M. L. Bell, 2019: The use of a quasi-experimental study on the mortality effect of a heat wave warning system in Korea. Int. J. Environ. Res. Public Health, 16, 2245, https://doi.org/10.3390/ijerph16122245.

    • Search Google Scholar
    • Export Citation
  • Hering, A. S., and M. G. Genton, 2011: Comparing spatial predictions. Technometrics, 53, 414425, https://doi.org/10.1198/TECH.2011.10136.

    • Search Google Scholar
    • Export Citation
  • Hoffmann, D., E. E. Ebert, C. Mooney, B. Golding, and S. Potter, 2023: Using value chain approaches to evaluate the end-to-end warning chain. Adv. Sci. Res., 20, 7379, https://doi.org/10.5194/asr-20-73-2023.

    • Search Google Scholar
    • Export Citation
  • Hondula, D. M., S. Meltzer, R. C. Balling Jr., and P. Iñiguez, 2022: Spatial analysis of United States National Weather Service excessive heat warnings and heat advisories. Bull. Amer. Meteor. Soc., 103, E2017E2031, https://doi.org/10.1175/BAMS-D-21-0069.1.

    • Search Google Scholar
    • Export Citation
  • Jones, D. A., W. Wang, and R. Fawcett, 2009: High-quality spatial climate data-sets for Australia. Aust. Meteor. Oceanogr. J., 58, 233248, https://doi.org/10.22499/2.5804.003.

    • Search Google Scholar
    • Export Citation
  • Jordan, A. I., A. Mühlemann, and J. F. Ziegel, 2022: Characterizing the optimal solutions to the isotonic regression problem for identifiable functionals. Ann. Inst. Stat. Math., 74, 489514, https://doi.org/10.1007/s10463-021-00808-0.

    • Search Google Scholar
    • Export Citation
  • Just, A., and M. Foley, 2020: Streamlining the graphical forecast process. J. South. Hemisphere Earth Syst. Sci., 70, 108113, https://doi.org/10.1071/ES19047.

    • Search Google Scholar
    • Export Citation
  • Leeuwenburg, T., and Coauthors, 2024: scores: A Python package for verifying and evaluating models and predictions with xarray. J. Open Source Software, 9, 6889, https://doi.org/10.21105/joss.06889.

    • Search Google Scholar
    • Export Citation
  • Loveday, N., and Coauthors, 2024: The Jive verification system and its transformative impact on weather forecasting operations. Bull. Amer. Meteor. Soc., 105, E2047E2063, https://doi.org/10.1175/BAMS-D-23-0267.1.

    • Search Google Scholar
    • Export Citation
  • Lowe, R., M. García-Díez, J. Ballester, J. Creswick, J.-M. Robine, F. R. Herrmann, and X. Rodó, 2016: Evaluation of an early-warning system for heat wave-related mortality in Europe: Implications for sub-seasonal to seasonal forecasting and climate services. Int. J. Environ. Res. Public Health, 13, 206, https://doi.org/10.3390/ijerph13020206.

    • Search Google Scholar
    • Export Citation
  • Mason, H., J. C. King, A. E. Peden, and R. C. Franklin, 2022: Systematic review of the impact of heatwaves on health service demand in Australia. BMC Health Serv. Res., 22, 960, https://doi.org/10.1186/s12913-022-08341-3.

    • Search Google Scholar
    • Export Citation
  • McEvoy, D., I. Ahmed, and J. Mullett, 2012: The impact of the 2009 heat wave on Melbourne’s critical infrastructure. Local Environ., 17, 783796, https://doi.org/10.1080/13549839.2012.678320.

    • Search Google Scholar
    • Export Citation
  • Murphy, A. H., 1977: The value of climatological, categorical and probabilistic forecasts in the cost-loss ratio situation. Mon. Wea. Rev., 105, 803816, https://doi.org/10.1175/1520-0493(1977)105<0803:TVOCCA>2.0.CO;2.

    • Search Google Scholar
    • Export Citation
  • Nairn, J., and R. Fawcett, 2013: Defining heatwaves: Heatwave defined as a heat-impact event servicing all community and business sectors in Australia. CAWCR Tech. Rep. 060, 96 pp., http://www.bom.gov.au/research/publications/cawcrreports/CTR_060.pdf.

  • Nairn, J., and R. Fawcett, 2015: The excess heat factor: A metric for heatwave intensity and its use in classifying heatwave severity. Int. J. Environ. Res. Public Health, 12, 227253, https://doi.org/10.3390/ijerph120100227.

    • Search Google Scholar
    • Export Citation
  • Nitschke, M., G. Tucker, A. Hansen, S. Williams, Y. Zhang, and P. Bi, 2016: Evaluation of a heat warning system in Adelaide, South Australia, using case-series analysis. BMJ Open, 6, e012125, https://doi.org/10.1136/bmjopen-2016-012125.

    • Search Google Scholar
    • Export Citation
  • Perkins, S. E., L. V. Alexander, and J. R. Nairn, 2012: Increasing frequency, intensity and duration of observed global heatwaves and warm spells. Geophys. Res. Lett., 39, L20714, https://doi.org/10.1029/2012GL053361.

    • Search Google Scholar
    • Export Citation
  • Perkins-Kirkpatrick, S. E., and S. C. Lewis, 2020: Increasing trends in regional heatwaves. Nat. Commun., 11, 3357, https://doi.org/10.1038/s41467-020-16970-7.

    • Search Google Scholar
    • Export Citation
  • Polazzo, F., S. K. Roth, M. Hermann, A. Mangold-Döring, A. Rico, A. Sobek, P. J. Van den Brink, and M. C. Jackson, 2022: Combined effects of heatwaves and micropollutants on freshwater ecosystems: Towards an integrated assessment of extreme events in multiple stressors research. Global Change Biol., 28, 12481267, https://doi.org/10.1111/gcb.15971.

    • Search Google Scholar
    • Export Citation
  • Taggart, R., N. Loveday, and D. Griffiths, 2022: A scoring framework for tiered warnings and multicategorical forecasts based on fixed risk measures. Quart. J. Roy. Meteor. Soc., 148, 13891406, https://doi.org/10.1002/qj.4266.

    • Search Google Scholar
    • Export Citation
  • Thompson, J. C., 1952: On the operational deficiences in categorical weather forecasts. Bull. Amer. Meteor. Soc., 33, 223226, https://doi.org/10.1175/1520-0477-33.6.223.

    • Search Google Scholar
    • Export Citation
  • Toloo, G., G. Fitzgerald, P. Aitken, K. Verrall, and S. Tong, 2013: Evaluating the effectiveness of heat warning systems: Systematic review of epidemiological evidence. Int. J. Public Health, 58, 667681, https://doi.org/10.1007/s00038-013-0465-2.

    • Search Google Scholar
    • Export Citation
  • Vitali, A., A. Felici, S. Esposito, U. Bernabucci, L. Bertocchi, C. Maresca, A. Nardone, and N. Lacetera, 2015: The effect of heat waves on dairy cow mortality. J. Dairy Sci., 98, 45724579, https://doi.org/10.3168/jds.2015-9331.

    • Search Google Scholar
    • Export Citation
  • Welbergen, J. A., S. M. Klose, N. Markus, and P. Eby, 2008: Climate change and the effects of temperature extremes on Australian flying-foxes. Proc. Roy. Soc., 275B, 419425, https://doi.org/10.1098/rspb.2007.1385.

    • Search Google Scholar
    • Export Citation
  • Wu, Y., X. Wang, J. Wu, R. Wang, and S. Yang, 2020: Performance of heat-health warning systems in Shanghai evaluated by using local heat-related illness data. Sci. Total Environ., 715, 136883, https://doi.org/10.1016/j.scitotenv.2020.136883.

    • Search Google Scholar
    • Export Citation
All Time Past Year Past 30 Days
Abstract Views 102 102 2
Full Text Views 577 576 543
PDF Downloads 260 260 218