Seasonal Forecast of Two-Meter Temperature and Precipitation in Tanzania: A Hybrid Cluster and Point-by-Point Machine Learning Approach

Camille Sansonnet afinres, Paris, France

Search for other papers by Camille Sansonnet in
Current site
Google Scholar
PubMed
Close
https://orcid.org/0009-0000-5974-1071
,
Michiel Schaeffer afinres, Paris, France
cUniversitas Islam Internasional Indonesia, Depok, Indonesia
dCisalak, Kec. Sukmajaya, Kota Depok, Jawa Barat, Indonesia
eUtrecht University, Utrecht, Netherlands
fClimate Analytics, Berlin, Germany

Search for other papers by Michiel Schaeffer in
Current site
Google Scholar
PubMed
Close
, and
Florent Baarsch afinres, Paris, France
bPostdam Institute for Climate Impact Research, Telegrafenberg, Potsdam, Germany

Search for other papers by Florent Baarsch in
Current site
Google Scholar
PubMed
Close
Restricted access

Abstract

In seasonal weather forecasting, the exact location within a study area determines the relationship between local predicted variables and global predictors. Both dynamic models and machine learning approaches that define models for single grid points can determine these relationships with high spatial granularity, but at high computational cost. To avoid the latter, clustering of predicted variables is often used in machine learning approaches, which however sacrifices geographical resolution. In this paper, we present a machine learning approach that is a hybrid between gridpoint and cluster-based approaches (finres_S2S). This approach preserves geographical resolution, but at low computational cost, and is tested for monthly 2-m temperature (2T) and precipitation (TP) in Tanzania and for a lead time of up to 6 months. The finres_S2S approach has a number of advantages over both the cluster and point approaches, including that of obtaining good results compared to a dynamic model. We find that the dominant predictors for the application area are associated with El Niño–Southern Oscillation, the Madden–Julian oscillation, and the Indian Ocean dipole.

Significance Statement

Seasonal forecasting of atmospheric parameters is important in various fields including agriculture. Machine learning makes it possible to forecast at lower computational costs than dynamic models. This advantage is even more important in certain regions of the world. The hybrid model developed here (finres_S2S) allows seasonal forecasting with a spatial resolution equal to that of ERA5 data and a low cost in calculation time. The model is tested over Tanzania, and the seasonal forecast scores outperform those of ECMWF. The development of the use of machine learning in seasonal forecasting seems to provide a complementary response to dynamic models, and research efforts in this direction should be pursued.

© 2024 American Meteorological Society. This published article is licensed under the terms of the default AMS reuse license. For information regarding reuse of this content and general copyright information, consult the AMS Copyright Policy (www.ametsoc.org/PUBSReuseLicenses).

Corresponding author: Camille Sansonnet, camille.sansonnet@finres.dev

Abstract

In seasonal weather forecasting, the exact location within a study area determines the relationship between local predicted variables and global predictors. Both dynamic models and machine learning approaches that define models for single grid points can determine these relationships with high spatial granularity, but at high computational cost. To avoid the latter, clustering of predicted variables is often used in machine learning approaches, which however sacrifices geographical resolution. In this paper, we present a machine learning approach that is a hybrid between gridpoint and cluster-based approaches (finres_S2S). This approach preserves geographical resolution, but at low computational cost, and is tested for monthly 2-m temperature (2T) and precipitation (TP) in Tanzania and for a lead time of up to 6 months. The finres_S2S approach has a number of advantages over both the cluster and point approaches, including that of obtaining good results compared to a dynamic model. We find that the dominant predictors for the application area are associated with El Niño–Southern Oscillation, the Madden–Julian oscillation, and the Indian Ocean dipole.

Significance Statement

Seasonal forecasting of atmospheric parameters is important in various fields including agriculture. Machine learning makes it possible to forecast at lower computational costs than dynamic models. This advantage is even more important in certain regions of the world. The hybrid model developed here (finres_S2S) allows seasonal forecasting with a spatial resolution equal to that of ERA5 data and a low cost in calculation time. The model is tested over Tanzania, and the seasonal forecast scores outperform those of ECMWF. The development of the use of machine learning in seasonal forecasting seems to provide a complementary response to dynamic models, and research efforts in this direction should be pursued.

© 2024 American Meteorological Society. This published article is licensed under the terms of the default AMS reuse license. For information regarding reuse of this content and general copyright information, consult the AMS Copyright Policy (www.ametsoc.org/PUBSReuseLicenses).

Corresponding author: Camille Sansonnet, camille.sansonnet@finres.dev

Supplementary Materials

    • Supplemental Materials (PDF 5.5575 MB)
Save
  • Ahmed, M., R. Seraj, and S. M. S. Islam, 2020: The k-means algorithm: A comprehensive survey and performance evaluation. Electronics, 9, 1295, https://doi.org/10.3390/electronics9081295.

    • Search Google Scholar
    • Export Citation
  • Berhane, F., and B. Zaitchik, 2014: Modulation of daily precipitation over East Africa by the Madden–Julian oscillation. J. Climate, 27, 60166034, https://doi.org/10.1175/JCLI-D-13-00693.1.

    • Search Google Scholar
    • Export Citation
  • Bi, K., L. Xie, H. Zhang, X. Chen, X. Gu, and Q. Tian, 2023: Accurate medium-range global weather forecasting with 3D neural networks. Nature, 619, 533538, https://doi.org/10.1038/s41586-023-06185-3.

    • Search Google Scholar
    • Export Citation
  • Biau, G., and E. Scornet, 2015: A random forest guided tour. arXiv, 1511.05741v1, https://doi.org/10.48550/ARXIV.1511.05741.

  • Bouras, E. H., L. Jarlan, S. Er-Raki, R. Balaghi, A. Amazirh, B. Richard, and S. Khabba, 2021: Cereal yield forecasting with satellite drought-based indices, weather data and regional climate indices using machine learning in Morocco. Remote Sens., 13, 3101, https://doi.org/10.3390/rs13163101.

    • Search Google Scholar
    • Export Citation
  • Breiman, L., 2001: Random forest. Mach. Learn., 45, 532, https://doi.org/10.1023/A:1010933404324.

  • Cohen, J., D. Coumou, J. Hwang, L. Mackey, P. Orenstein, S. Totz, and E. Tziperman, 2019: S2S reboot: An argument for greater inclusion of machine learning in subseasonal to seasonal forecasts. Wiley Interdiscip. Rev.: Climate Change, 10, e00567, https://doi.org/10.1002/wcc.567.

    • Search Google Scholar
    • Export Citation
  • Cucchi, M., G. P. Weedon, A. Amici, N. Bellouin, S. Lange, H. M. Schmied, H. Hersbach, and C. Buontempo, 2020: WFDE5: Bias-adjusted ERA5 reanalysis data for impact studies. Earth Syst. Sci. Data, 12, 20972120, https://doi.org/10.5194/essd-12-2097-2020.

    • Search Google Scholar
    • Export Citation
  • Di Capua, G., and Coauthors, 2019: Long-lead statistical forecasts of the Indian summer monsoon rainfall based on causal precursors. Wea. Forecasting, 34, 13771394, https://doi.org/10.1175/WAF-D-19-0002.1.

    • Search Google Scholar
    • Export Citation
  • Diro, G. T., D. I. F. Grimes, and E. Black, 2011: Teleconnections between Ethiopian summer rainfall and sea surface temperature: Part I—Observation and modelling. Climate Dyn., 37, 103119, https://doi.org/10.1007/s00382-010-0837-8.

    • Search Google Scholar
    • Export Citation
  • Drosdowsky, W., and L. E. Chambers, 2001: Near-global sea surface temperature anomalies as predictors of Australian seasonal rainfall. J. Climate, 14, 16771687, https://doi.org/10.1175/1520-0442(2001)014<1677:NACNGS>2.0.CO;2.

    • Search Google Scholar
    • Export Citation
  • Dueben, P. D., and P. Bauer, 2018: Challenges and design choices for global weather and climate models based on machine learning. Geosci. Model Dev., 11, 39994009, https://doi.org/10.5194/gmd-11-3999-2018.

    • Search Google Scholar
    • Export Citation
  • Emerton, R., and Coauthors, 2018: Developing a global operational seasonal hydro-meteorological forecasting system: GloFAS-seasonal v1.0. Geosci. Model Dev., 11, 33273346, https://doi.org/10.5194/gmd-11-3327-2018.

    • Search Google Scholar
    • Export Citation
  • Fatima, S. S., M. Wooldridge, and N. R. Jennings, 2008: A linear approximation method for the Shapley value. Artif. Intell., 172, 16731699, https://doi.org/10.1016/j.artint.2008.05.003.

    • Search Google Scholar
    • Export Citation
  • Feng, P., B. Wang, D. L. Liu, F. Ji, X. Niu, H. Ruan, L. Shi, and Q. Yu, 2020: Machine learning-based integration of large-scale climate drivers can improve the forecast of seasonal rainfall probability in Australia. Environ Res. Lett., 15, 084051, https://doi.org/10.1088/1748-9326/ab9e98.

    • Search Google Scholar
    • Export Citation
  • Gevaert, A. I., D. G. Miralles, R. A. M. De Jeu, J. Schellekens, and A. J. Dolman, 2018: Soil moisture-temperature coupling in a set of land surface models. J. Geophys. Res. Atmos., 123, 14811498, https://doi.org/10.1002/2017JD027346.

    • Search Google Scholar
    • Export Citation
  • Ghaderpour, E., H. Dadkhah, H. Dabiri, F. Bozzano, G. Scarascia Mugnozza, and P. Mazzanti, 2023: Precipitation time series analysis and forecasting for Italian regions. Eng. Proc., 39, 23, https://doi.org/10.3390/engproc2023039023.

    • Search Google Scholar
    • Export Citation
  • Ghaderpour, E., B. Antonielli, F. Bozzano, G. Scarascia Mugnozza, and P. Mazzanti, 2024: A fast and robust method for detecting trend turning points in InSAR displacement time series. Comput. Geosci., 185, 105546, https://doi.org/10.1016/j.cageo.2024.105546.

    • Search Google Scholar
    • Export Citation
  • Hansen, J. W., S. J. Mason, L. Sun, and A. Tall, 2011: Review of seasonal climate forecasting for agriculture in sub-Saharan Africa. Exp. Agric., 47, 205240, https://doi.org/10.1017/S0014479710000876.

    • Search Google Scholar
    • Export Citation
  • Hersbach, H., and Coauthors, 2020: The ERA5 global reanalysis. Quart. J. Roy. Meteor. Soc., 146, 19992049, https://doi.org/10.1002/qj.3803.

    • Search Google Scholar
    • Export Citation
  • Hill, A. J., and R. S. Schumacher, 2021: Forecasting excessive rainfall with random forests and a deterministic convection-allowing model. Wea. Forecasting, 36, 16931711, https://doi.org/10.1175/WAF-D-21-0026.1.

    • Search Google Scholar
    • Export Citation
  • Johnson, S. J., and Coauthors, 2019: SEAS5: The new ECMWF seasonal forecast system. Geosci. Model Dev., 12, 10871117, https://doi.org/10.5194/gmd-12-1087-2019.

    • Search Google Scholar
    • Export Citation
  • Jones, J. W., J. W. Hansen, F. S. Royce, and C. D. Messina, 2000: Potential benefits of climate forecasting to agriculture. Agric. Ecosyst. Environ., 82, 169184, https://doi.org/10.1016/S0167-8809(00)00225-5.

    • Search Google Scholar
    • Export Citation
  • Ju-Young, S., K. Kyu Rang, and H. Jong-Chul, 2020: Seasonal forecasting of daily mean air temperatures using a coupled global climate model and machine learning algorithm for field-scale agricultural management. Agric. For. Meteor., 281, 107858, https://doi.org/10.1016/j.agrformet.2019.107858.

    • Search Google Scholar
    • Export Citation
  • Kanamitsu, M., and Coauthors, 2002: NCEP dynamical seasonal forecast system 2000. Bull. Amer. Meteor. Soc., 83, 10191038, https://doi.org/10.1175/1520-0477(2002)083<1019:NDSFS>2.3.CO;2.

    • Search Google Scholar
    • Export Citation
  • Kashinath, K., and Coauthors, 2021: Physics-informed machine learning: Case studies for weather and climate modelling. Philos. Trans. Roy. Soc., A379, 20200093, https://doi.org/10.1098/rsta.2020.0093.

    • Search Google Scholar
    • Export Citation
  • Kim, S., and H. Kim, 2016: A new metric of absolute percentage error for intermittent demand forecasts. Int. J. Forecasting, 32, 669679, https://doi.org/10.1016/j.ijforecast.2015.12.003.

    • Search Google Scholar
    • Export Citation
  • King’uza, P. H., and S. L. P. Tilwebwa, 2019: Inter-annual variability of March to May rainfall over Tanzania and its association with atmospheric circulation anomalies. Geogr. Pannonica, 23, 147161, https://doi.org/10.5937/gp23-22430.

    • Search Google Scholar
    • Export Citation
  • Klemm, T., and R. A. McPherson, 2017: The development of seasonal climate forecasting for agricultural producers. Agric. For. Meteor., 232, 384399, https://doi.org/10.1016/j.agrformet.2016.09.005.

    • Search Google Scholar
    • Export Citation
  • Klopper, E., C. H. Vogel, and W. A. Landman, 2006: Seasonal climate forecasts – potential agricultural-risk management tools? Climatic Change, 76, 7390, https://doi.org/10.1007/s10584-005-9019-9.

    • Search Google Scholar
    • Export Citation
  • Loken, E. D., A. J. Clark, and C. D. Karstens, 2020: Generating probabilistic next-day severe weather forecasts from convection-allowing ensembles using random forests. Wea. Forecasting, 35, 16051631, https://doi.org/10.1175/WAF-D-19-0258.1.

    • Search Google Scholar
    • Export Citation
  • Mafuru, K. B., and T. Guirong, 2020: The influence of ENSO on the upper warm temperature anomaly formation associated with the March–May heavy rainfall events in Tanzania. Int. J. Climatol., 40, 27452763, https://doi.org/10.1002/joc.6364.

    • Search Google Scholar
    • Export Citation
  • Makula, E. K., and B. Zhou, 2023: Relationship between Antarctic sea ice and the variability of March to May precipitation in Tanzania. Int. J. Climatol., 43, 13821395, https://doi.org/10.1002/joc.7921.

    • Search Google Scholar
    • Export Citation
  • Marchant, R., C. Mumbi, S. Behera, and T. Yamagata, 2007: The Indian Ocean dipole – the unsung driver of climatic variability in East Africa. Afr. J. Ecol., 45, 416, https://doi.org/10.1111/j.1365-2028.2006.00707.x.

    • Search Google Scholar
    • Export Citation
  • Marutho, D., S. Hendra Handaka, and E. Wijaya, 2018: The determination of cluster number at k-mean using elbow method and purity evaluation on headline news. 2018 Int. Seminar on Application for Technology of Information and Communication, Semarang, Indonesia, Institute of Electrical and Electronics Engineers, 533–538, https://doi.org/10.1109/ISEMANTIC.2018.8549751.

  • Mbigi, D., and Z. Xiao, 2021: Tanzanian rainfall responses to El Niño and positive Indian Ocean dipole events during 1951–2015. Atmos. Oceanic Sci. Lett., 14, 100093, https://doi.org/10.1016/j.aosl.2021.100093.

    • Search Google Scholar
    • Export Citation
  • McGovern, A., R. Lagerquist, D. John Gagne II, G. E. Jergensen, K. L. Elmore, C. R. Homeyer, and T. Smith, 2019: Making the black box more transparent: Understanding the physical implications of machine learning. Bull. Amer. Meteor. Soc., 100, 21752199, https://doi.org/10.1175/BAMS-D-18-0195.1.

    • Search Google Scholar
    • Export Citation
  • Meinshausen, N., and G. Ridgeway, 2006: Quantile regression forest. J. Mach. Learn. Res., 7, 983999.

  • Meza, F. J., J. W. Hansen, and D. Osgood, 2008: Economic value of seasonal climate forecasts for agriculture: Review of ex-ante assessments and recommendations for future research. J. Appl. Meteor. Climatol., 47, 12691286, https://doi.org/10.1175/2007JAMC1540.1.

    • Search Google Scholar
    • Export Citation
  • Peng, J., S. Dadson, G. Leng, Z. Duan, T. Jagdhuber, W. Guo, and R. Ludwig, 2019: The impact of the Madden-Julian Oscillation on hydrological extremes. J. Hydrol., 571, 142149, https://doi.org/10.1016/j.jhydrol.2019.01.055.

    • Search Google Scholar
    • Export Citation
  • Pohl, B., Y. Richard, and N. Fauchereau, 2007: Influence of the Madden–Julian oscillation on southern African summer rainfall. J. Climate, 20, 42274242, https://doi.org/10.1175/JCLI4231.1.

    • Search Google Scholar
    • Export Citation
  • Reichstein, M., G. Camps-Valls, B. Stevens, M. Jung, J. Denzler, N. Carvalhais, and Prabhat, 2019: Deep learning and process understanding for data-driven Earth system science. Nature, 566, 195204, https://doi.org/10.1038/s41586-019-0912-1.

    • Search Google Scholar
    • Export Citation
  • Roudier, P., 2012: Climat et agriculture en afrique de l’ouest: Quantification de l’impact du changement climatique sur les rendements etévaluation de l’utilité des prévisions saisonnières. Ph.D. thesis, EHESS, 190 pp.

  • Shukla, S., A. McNally, G. Husak, and C. Funk, 2014: A seasonal agricultural drought forecast system for food-insecure regions of East Africa. Hydrol. Earth Syst. Sci., 18, 39073921, https://doi.org/10.5194/hess-18-3907-2014.

    • Search Google Scholar
    • Export Citation
  • Syakur, M. A., B. K. Khotimah, E. M. S. Rochman, and B. D. Satoto, 2018: Integration k-means clustering method and elbow method for identification of the best customer profile cluster. IOP Conf. Ser.: Mater. Sci. Eng., 336, 012017, https://doi.org/10.1088/1757-899X/336/1/012017.

    • Search Google Scholar
    • Export Citation
  • Takaya, Y., and Coauthors, 2018: Japan Meteorological Agency/Meteorological Research Institute-Coupled Prediction System version 2 (JMA/MRI-CPS2): Atmosphere–land–ocean–sea ice coupled prediction system for operational seasonal forecasting. Climate Dyn., 50, 751765, https://doi.org/10.1007/s00382-017-3638-5.

    • Search Google Scholar
    • Export Citation
  • van Ogtrop, F., M. Ahmad, and C. Moeller, 2014: Principal components of sea surface temperatures as predictors of seasonal rainfall in rainfed wheat growing areas of Pakistan. Meteor. Appl., 21, 431443, https://doi.org/10.1002/met.1429.

    • Search Google Scholar
    • Export Citation
  • van Straaten, C., K. Whan, D. Coumou, B. van den Hurk, and M. Schmeits, 2022a: Improving sub-seasonal forecasts by correcting missing teleconnections using ANN-based post-processing. 2022 EGU General Assembly, Vienna, Austria, EGU, EGU22-1686, https://doi.org/10.5194/egusphere-egu22-1686.

  • van Straaten, C., K. Whan, D. Coumou, B. van den Hurk, and M. Schmeits, 2022b: Using explainable machine learning forecasts to discover subseasonal drivers of high summer temperatures in western and central Europe. Mon. Wea. Rev., 150, 11151134, https://doi.org/10.1175/MWR-D-21-0201.1.

    • Search Google Scholar
    • Export Citation
  • Vashisht, A., and B. Zaitchik, 2022: Modulation of East African boreal fall rainfall: Combined effects of the Madden–Julian oscillation (MJO) and El Niño–Southern Oscillation (ENSO). J. Climate, 35, 20192034, https://doi.org/10.1175/JCLI-D-21-0583.1.

    • Search Google Scholar
    • Export Citation
  • Vijverberg, S., M. Schmeits, K. van der Wiel, and D. Coumou, 2020: Subseasonal statistical forecasts of eastern U.S. hot temperature events. Mon. Wea. Rev., 148, 47994822, https://doi.org/10.1175/MWR-D-19-0409.1.

    • Search Google Scholar
    • Export Citation
  • Watson-Parris, D., 2021: Machine learning for weather and climate are worlds apart. Philos. Trans. Roy. Soc., A379, 20200098, https://doi.org/10.1098/rsta.2020.0098.

    • Search Google Scholar
    • Export Citation
  • Weyn, J. A., D. R. Durran, and R. Caruana, 2019: Can machines learn to predict weather? Using deep learning to predict gridded 500-hPa geopotential height from historical weather data. J. Adv. Model. Earth Syst., 11, 26802693, https://doi.org/10.1029/2019MS001705.

    • Search Google Scholar
    • Export Citation
All Time Past Year Past 30 Days
Abstract Views 1215 1215 729
Full Text Views 65 65 7
PDF Downloads 79 79 7