RainForests: a machine-learning approach to calibrating NWP precipitation forecasts

Belinda Trotta aBureau of Meteorology, Melbourne, Australia

Search for other papers by Belinda Trotta in
Current site
Google Scholar
PubMed
Close
,
Benjamin Owen aBureau of Meteorology, Melbourne, Australia

Search for other papers by Benjamin Owen in
Current site
Google Scholar
PubMed
Close
,
Jiaping Liu aBureau of Meteorology, Melbourne, Australia

Search for other papers by Jiaping Liu in
Current site
Google Scholar
PubMed
Close
,
Gary Weymouth bRetired from Bureau of Meteorology, Melbourne, Australia

Search for other papers by Gary Weymouth in
Current site
Google Scholar
PubMed
Close
,
Thomas Gale aBureau of Meteorology, Melbourne, Australia

Search for other papers by Thomas Gale in
Current site
Google Scholar
PubMed
Close
,
Timothy Hume aBureau of Meteorology, Melbourne, Australia

Search for other papers by Timothy Hume in
Current site
Google Scholar
PubMed
Close
,
Anja Schubert aBureau of Meteorology, Melbourne, Australia

Search for other papers by Anja Schubert in
Current site
Google Scholar
PubMed
Close
,
James Canvin aBureau of Meteorology, Melbourne, Australia

Search for other papers by James Canvin in
Current site
Google Scholar
PubMed
Close
,
Daniel Mentiplay aBureau of Meteorology, Melbourne, Australia

Search for other papers by Daniel Mentiplay in
Current site
Google Scholar
PubMed
Close
,
Jennifer Whelan aBureau of Meteorology, Melbourne, Australia

Search for other papers by Jennifer Whelan in
Current site
Google Scholar
PubMed
Close
, and
Robert Johnson aBureau of Meteorology, Melbourne, Australia

Search for other papers by Robert Johnson in
Current site
Google Scholar
PubMed
Close
Open access

Abstract

Probabilistic forecasts derived from ensemble prediction systems (EPS) have become the standard basis for many products and services produced by modern operational forecasting centres. However statistical post-processing is generally required to ensure forecasts have the desired properties expected for probability-based outputs. Precipitation, a core component of any operational forecast, is particularly challenging to calibrate due to its discontinuous nature and the extreme skew in rainfall amounts. A skillful forecasting system must maintain accuracy for low-to-moderate precipitation amounts, but preserve resolvability for high-to-extreme rainfall amounts, which, though rare, are important to forecast accurately in the interest of public safety. Existing statistical and machine-learning approaches to rainfall calibration address this problem, but each has drawbacks in design, training approaches, and/or performance. We describe RainForests, a machine-learning approach for calibrating ensemble rainfall forecasts using gradient-boosted decision trees. The model is based on the ecPoint system recently developed at ECMWF by Hewson and Pillosu (2021), but uses machine-learning models in place of the semi-subjective decision trees of ecPoint, along with some other improvements to the model structure. We evaluate RainForests on the Australian domain against some simple benchmarks, and show that it outperforms standard calibration approaches both in overall skill and in accurately forecasting high rainfall conditions, while being computationally efficient enough to be used in an operational forecasting system.

© 2024 American Meteorological Society. This is an Author Accepted Manuscript distributed under the terms of the default AMS reuse license. For information regarding reuse and general copyright information, consult the AMS Copyright Policy (www.ametsoc.org/PUBSReuseLicenses).

Corresponding author: Belinda Trotta, belinda.trotta@bom.gov.au

Abstract

Probabilistic forecasts derived from ensemble prediction systems (EPS) have become the standard basis for many products and services produced by modern operational forecasting centres. However statistical post-processing is generally required to ensure forecasts have the desired properties expected for probability-based outputs. Precipitation, a core component of any operational forecast, is particularly challenging to calibrate due to its discontinuous nature and the extreme skew in rainfall amounts. A skillful forecasting system must maintain accuracy for low-to-moderate precipitation amounts, but preserve resolvability for high-to-extreme rainfall amounts, which, though rare, are important to forecast accurately in the interest of public safety. Existing statistical and machine-learning approaches to rainfall calibration address this problem, but each has drawbacks in design, training approaches, and/or performance. We describe RainForests, a machine-learning approach for calibrating ensemble rainfall forecasts using gradient-boosted decision trees. The model is based on the ecPoint system recently developed at ECMWF by Hewson and Pillosu (2021), but uses machine-learning models in place of the semi-subjective decision trees of ecPoint, along with some other improvements to the model structure. We evaluate RainForests on the Australian domain against some simple benchmarks, and show that it outperforms standard calibration approaches both in overall skill and in accurately forecasting high rainfall conditions, while being computationally efficient enough to be used in an operational forecasting system.

© 2024 American Meteorological Society. This is an Author Accepted Manuscript distributed under the terms of the default AMS reuse license. For information regarding reuse and general copyright information, consult the AMS Copyright Policy (www.ametsoc.org/PUBSReuseLicenses).

Corresponding author: Belinda Trotta, belinda.trotta@bom.gov.au
Save