Causal Discovery for Climate Research Using Graphical Models

Imme Ebert-Uphoff, Department of Electrical and Computer Engineering, Colorado State University, Fort Collins, Colorado

and
Yi Deng, School of Earth and Atmospheric Sciences, Georgia Institute of Technology, Atlanta, Georgia


Abstract

Causal discovery seeks to recover cause–effect relationships from statistical data using graphical models. The first goal of this paper is to provide an accessible introduction to causal discovery methods for climate scientists, with a focus on constraint-based structure learning. Second, in a detailed case study, constraint-based structure learning is applied to derive hypotheses of causal relationships between four prominent modes of atmospheric low-frequency variability in boreal winter: the Western Pacific Oscillation (WPO), Eastern Pacific Oscillation (EPO), Pacific–North America (PNA) pattern, and North Atlantic Oscillation (NAO). The results are shown in the form of static and temporal independence graphs, also known as Bayesian networks. It is found that WPO and EPO are nearly indistinguishable from the cause–effect perspective, as strong simultaneous coupling is identified between the two. In addition, changes in the state of EPO may cause changes in the state of NAO approximately 18 days later, and changes in the state of NAO may cause changes in the state of PNA approximately 3–6 days later. These results are not only consistent with previous findings on dynamical processes connecting different low-frequency modes (e.g., interaction between synoptic and low-frequency eddies) but also provide the basis for formulating new hypotheses regarding the time scale and temporal sequencing of dynamical processes responsible for these connections. Last, the authors propose to use structure learning for climate networks, which are currently based primarily on correlation analysis. While correlation-based climate networks focus on similarity between nodes, independence graphs would provide an alternative viewpoint by focusing on information flow in the network.
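
The contrast between correlation and conditional independence drawn in the abstract can be illustrated with a minimal sketch. It is not the authors' data or procedure: the three series `a`, `b`, and `c` are synthetic, the chain a → b → c and the coefficient 0.8 are assumptions chosen for illustration, and `partial_corr` is a hypothetical helper that uses linear partial correlation as a stand-in for a conditional independence test.

```python
import numpy as np


def partial_corr(x, y, z):
    """Correlation between x and y after linearly regressing z out of both.

    Hypothetical helper for illustration; a value near zero suggests that
    x and y are conditionally independent given z (under linear-Gaussian
    assumptions).
    """
    design = np.column_stack([np.ones_like(z), z])  # intercept + conditioning variable
    res_x = x - design @ np.linalg.lstsq(design, x, rcond=None)[0]
    res_y = y - design @ np.linalg.lstsq(design, y, rcond=None)[0]
    return np.corrcoef(res_x, res_y)[0, 1]


# Synthetic chain a -> b -> c (assumed structure, not climate data).
rng = np.random.default_rng(0)
n = 5000
a = rng.normal(size=n)
b = 0.8 * a + rng.normal(size=n)
c = 0.8 * b + rng.normal(size=n)

# A correlation-based network would link a and c directly ...
corr_ac = np.corrcoef(a, c)[0, 1]
# ... whereas conditioning on b removes the dependence, so a
# constraint-based method would drop the a-c edge.
pcorr_ac_given_b = partial_corr(a, c, b)

print(f"corr(a, c)         = {corr_ac:.3f}")
print(f"pcorr(a, c | b)    = {pcorr_ac_given_b:.3f}")
```

In this sketch the plain correlation between `a` and `c` is clearly nonzero, while the partial correlation given `b` is close to zero, which is the kind of evidence a constraint-based method uses to keep only the edges that carry information directly.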

Corresponding author address: Yi Deng, School of Earth and Atmospheric Sciences, Georgia Institute of Technology, 311 Ferst Drive, Atlanta, GA 30332-0340. E-mail: yi.deng@eas.gatech.edu.
