Beyond PCA: Additional Dimension Reduction Techniques to Consider in the Development of Climate Fingerprints

Michael Weylandt aDepartment of Statistical Sciences, Sandia National Laboratories, Albuquerque, New Mexico

Search for other papers by Michael Weylandt in
Current site
Google Scholar
PubMed
Close
https://orcid.org/0000-0002-0950-3881
and
Laura P. Swiler bCenter for Computing Research, Sandia National Laboratories, Albuquerque, New Mexico

Search for other papers by Laura P. Swiler in
Current site
Google Scholar
PubMed
Close
Restricted access

Abstract

Dimension reduction techniques are an essential part of the climate analyst’s toolkit. Due to the enormous scale of climate data, dimension reduction methods are used to identify major patterns of variability within climate dynamics, to create compelling and informative visualizations, and to quantify major named modes such as El Niño–Southern Oscillation. Principal components analysis (PCA), also known as the method of empirical orthogonal functions (EOFs), is the most commonly used form of dimension reduction, characterized by a remarkable confluence of attractive mathematical, statistical, and computational properties. Despite its ubiquity, PCA suffers from several difficulties relevant to climate science: high computational burden with large datasets, decreased statistical accuracy in high dimensions, and difficulties comparing across multiple datasets. In this paper, we introduce several variants of PCA that are likely to be of use in climate sciences and address these problems. Specifically, we introduce non-negative, sparse, and tensor PCA and demonstrate how each approach provides superior pattern recognition in climate data. We also discuss approaches to comparing PCA-family results within and across datasets in a domain-relevant manner. We demonstrate these approaches through an analysis of several runs of the E3SM climate model from 1991 to 1995, focusing on the simulated response to the Mt. Pinatubo eruption; our findings are consistent with a recently identified stratospheric warming fingerprint associated with this type of stratospheric aerosol injection.

© 2024 American Meteorological Society. This published article is licensed under the terms of the default AMS reuse license. For information regarding reuse of this content and general copyright information, consult the AMS Copyright Policy (www.ametsoc.org/PUBSReuseLicenses).

M. Weylandt’s current affiliation: Department of Information Systems and Statistics, Zicklin School of Business, Baruch College, CUNY, New York, New York.

Corresponding author: Michael Weylandt, michael.weylandt@baruch.cuny.edu

Abstract

Dimension reduction techniques are an essential part of the climate analyst’s toolkit. Due to the enormous scale of climate data, dimension reduction methods are used to identify major patterns of variability within climate dynamics, to create compelling and informative visualizations, and to quantify major named modes such as El Niño–Southern Oscillation. Principal components analysis (PCA), also known as the method of empirical orthogonal functions (EOFs), is the most commonly used form of dimension reduction, characterized by a remarkable confluence of attractive mathematical, statistical, and computational properties. Despite its ubiquity, PCA suffers from several difficulties relevant to climate science: high computational burden with large datasets, decreased statistical accuracy in high dimensions, and difficulties comparing across multiple datasets. In this paper, we introduce several variants of PCA that are likely to be of use in climate sciences and address these problems. Specifically, we introduce non-negative, sparse, and tensor PCA and demonstrate how each approach provides superior pattern recognition in climate data. We also discuss approaches to comparing PCA-family results within and across datasets in a domain-relevant manner. We demonstrate these approaches through an analysis of several runs of the E3SM climate model from 1991 to 1995, focusing on the simulated response to the Mt. Pinatubo eruption; our findings are consistent with a recently identified stratospheric warming fingerprint associated with this type of stratospheric aerosol injection.

© 2024 American Meteorological Society. This published article is licensed under the terms of the default AMS reuse license. For information regarding reuse of this content and general copyright information, consult the AMS Copyright Policy (www.ametsoc.org/PUBSReuseLicenses).

M. Weylandt’s current affiliation: Department of Information Systems and Statistics, Zicklin School of Business, Baruch College, CUNY, New York, New York.

Corresponding author: Michael Weylandt, michael.weylandt@baruch.cuny.edu

Supplementary Materials

    • Supplemental Materials (PDF 5.3733 MB)
Save
  • Allen, G., 2012: Sparse higher-order principal components analysis. Proceedings of the 15th International Conference on Artificial Intelligence and Statistics, N. D. Lawrence and M. Girolami, Eds., Proceedings of Machine Learning Research, Vol. 22, PMLR, 27–36, http://proceedings.mlr.press/v22/allen12.html.

  • Allen, G., and M. Weylandt, 2019: Sparse and functional principal components analysis. 2019 IEEE Data Science Workshop (DSW), Minneapolis, MN, Institute of Electrical and Electronics Engineers, 11–16, https://doi.org/10.1109/DSW.2019.8755778.

  • Allen, M. R., and P. A. Stott, 2003: Estimating signal amplitudes in optimal fingerprinting, Part I: Theory. Climate Dyn., 21, 477491, https://doi.org/10.1007/s00382-003-0313-9.

    • Search Google Scholar
    • Export Citation
  • Bonfils, C., and Coauthors, 2008: Detection and attribution of temperature changes in the mountainous western United States. J. Climate, 21, 64046424, https://doi.org/10.1175/2008JCLI2397.1.

    • Search Google Scholar
    • Export Citation
  • Falasca, F., A. Bracco, A. Nenes, and I. Fountalis, 2019: Dimensionality reduction and network inference for climate data using δ-MAPS: Application to the CESM large ensemble sea surface temperature. J. Adv. Model. Earth Syst., 11, 14791515, https://doi.org/10.1029/2019MS001654.

    • Search Google Scholar
    • Export Citation
  • Fulton, J. D., and G. C. Hegerl, 2021: Testing methods of pattern extraction for climate data using synthetic modes. J. Climate, 34, 76457660, https://doi.org/10.1175/JCLI-D-20-0871.1.

    • Search Google Scholar
    • Export Citation
  • Glantz, M. H., R. W. Katz, and N. Nicholls, 1991: Teleconnections Linking Worldwide Climate Anomalies. Cambridge University Press, 548 pp.

  • Golaz, J.-C., and Coauthors, 2022: The DOE E3SM model version 2: Overview of the physical model and initial model evaluation. J. Adv. Model. Earth Syst., 14, e2022MS003156, https://doi.org/10.1029/2022MS003156.

    • Search Google Scholar
    • Export Citation
  • Golub, G. H., and C. F. Van Loan, 2013: Matrix Computations. 4th ed. Johns Hopkins University Press, 756 pp.

  • Hannachi, A., I. T. Jolliffe, D. B. Stephenson, and N. Trendafilov, 2006: In search of simple structures in climate: Simplifying EoFs. Int. J. Climatol., 26, 728, https://doi.org/10.1002/joc.1243.

    • Search Google Scholar
    • Export Citation
  • Hannachi, A., I. T. Jolliffe, and D. B. Stephenson, 2007: Empirical orthogonal functions and related techniques in atmospheric science: A review. Int. J. Climatol., 27, 11191152, https://doi.org/10.1002/joc.1499.

    • Search Google Scholar
    • Export Citation
  • Hasselmann, K., 1997: Multi-pattern fingerprint method for detection and attribution of climate change. Climate Dyn., 13, 601611, https://doi.org/10.1007/s003820050185.

    • Search Google Scholar
    • Export Citation
  • Hegerl, G. C., K. Hasselmann, U. Cubasch, J. F. B. Mitchell, E. Roeckner, R. Voss, and J. Waszkewitz, 1997: Multi-fingerprint detection and attribution analysis of greenhouse gas, greenhouse gas-plus-aerosol and solar forced climate change. Climate Dyn., 13, 613634, https://doi.org/10.1007/s003820050186.

    • Search Google Scholar
    • Export Citation
  • Horn, R. A., and C. R. Johnson, 2013: Matrix Analysis. 2nd ed. Cambridge University Press, 662 pp.

  • Jolliffe, I. T., 2002: Principal Component Analysis. 2nd ed. Springer-Verlag, 488 pp., https://doi.org/10.1007/b98835.

  • Jolliffe, I. T., N. T. Trendafilov, and M. Uddin, 2003: A modified principal component technique based on the LASSO. J. Comput. Graph. Stat., 12, 531547, https://doi.org/10.1198/1061860032148.

    • Search Google Scholar
    • Export Citation
  • Kolda, T. G., and B. W. Bader, 2009: Tensor decompositions and applications. SIAM Rev., 51, 455500, https://doi.org/10.1137/07070111X.

    • Search Google Scholar
    • Export Citation
  • Kremser, S., and Coauthors, 2016: Stratospheric aerosol—Observations, processes, and impact on climate. Rev. Geophys., 54, 278335, https://doi.org/10.1002/2015RG000511.

    • Search Google Scholar
    • Export Citation
  • Mitchell, J. F. B., and Coauthors, 2001: Detection of climate change and attribution of causes. Climate Change 2001: The Scientific Basis, IPCC, 695–738.

  • North, G. R., and M. J. Stevens, 1998: Detecting climate signals in the surface temperature record. J. Climate, 11, 563577, https://doi.org/10.1175/1520-0442(1998)011<0563:DCSITS>2.0.CO;2.

    • Search Google Scholar
    • Export Citation
  • Nowack, P., J. Runge, V. Eyring, and J. D. Haigh, 2020: Causal networks for climate model evaluation and constrained projections. Nat. Commun., 11, 1415, https://doi.org/10.1038/s41467-020-15195-y.

    • Search Google Scholar
    • Export Citation
  • Ramsay, J. O., and B. W. Silverman, 2002: Applied Functional Data Analysis: Methods and Case Studies. 1st ed. Springer-Verlag, 191 pp., https://doi.org/10.1007/b98886.

  • Ramsay, J. O., and B. W. Silverman, 2005: Functional Data Analysis. 2nd ed. Springer-Verlag, 429 pp., https://doi.org/10.1007/b98888.

  • Robock, A., 2000: Volcanic eruptions and climate. Rev. Geophys., 38, 191219, https://doi.org/10.1029/1998RG000054.

  • Santer, B. D., and Coauthors, 2007: Identification of human-induced changes in atmospheric moisture content. Proc. Natl. Acad. Sci. USA, 104, 15 24815 253, https://doi.org/10.1073/pnas.0702872104.

    • Search Google Scholar
    • Export Citation
  • Sun, J., K. Ding, Z. Lai, and H. Huang, 2022: Global and regional variations and main drivers of aerosol loadings over land during 1980–2018. Remote Sens., 14, 859, https://doi.org/10.3390/rs14040859.

    • Search Google Scholar
    • Export Citation
  • Thompson, D. W. J., J. M. Wallace, P. D. Jones, and J. J. Kennedy, 2009: Identifying signatures of natural climate variability in time series of global-mean surface temperature: Methodology and insights. J. Climate, 22, 61206141, https://doi.org/10.1175/2009JCLI3089.1.

    • Search Google Scholar
    • Export Citation
  • Thompson, R. C., 1972: Principal submatrices IX: Interlacing inequalities for singular values of submatrices. Linear Algebra Appl., 5 (1), 112, https://doi.org/10.1016/0024-3795(72)90013-4.

    • Search Google Scholar
    • Export Citation
  • Tibshirani, R., M. Saunders, S. Rosset, J. Zhu, and K. Knight, 2005: Sparsity and smoothness via the fused lasso. J. Roy. Stat. Soc., 67B, 91108, https://doi.org/10.1111/j.1467-9868.2005.00490.x.

    • Search Google Scholar
    • Export Citation
  • Trenberth, K. E., and A. Dai, 2007: Effects of Mount Pinatubo volcanic eruption on the hydrological cycle as an analog of geoengineering. Geophys. Res. Lett., 34, L15702, https://doi.org/10.1029/2007GL030524.

    • Search Google Scholar
    • Export Citation
  • Trendafilov, N. T., and I. T. Jolliffe, 2006: Projected gradient approach to the numerical solution of the SCoTLASS. Comput. Stat. Data Anal., 50, 242253, https://doi.org/10.1016/j.csda.2004.07.017.

    • Search Google Scholar
    • Export Citation
  • Wagman, B. M., L. P. Swiler, K. Chowdhary, and B. Hillman, 2021: The fingerprints of stratospheric aerosol injection in E3SM. Tech. Rep. SAND2021-11522R, 18 pp., https://doi.org/10.2172/1821542.

  • Wang, G., and D. Schimel, 2003: Climate change, climate modes, and climate impacts. Annu. Rev. Environ. Resour., 28 (1), 128, https://doi.org/10.1146/annurev.energy.28.050302.105444.

    • Search Google Scholar
    • Export Citation
  • Weylandt, M., 2019: Multi-rank sparse and functional PCA Manifold optimization and iterative deflation techniques. 2019 IEEE Eighth Int. Workshop on Computational Advances in Multi-Sensor Adaptive Processing (CAMSAP), Le Gosier, Guadeloupe, Institute of Electrical and Electronics Engineers, 500–504, https://doi.org/10.1109/CAMSAP45676.2019.9022486.

  • Weylandt, M., and L. P. Swiler, 2023: Replication materials for ‘Beyond PCA: Additional Dimension Reduction Techniques to Consider in the Development of Climate Fingerprints’. Zenodo, accessed 20 December 2023, https://doi.org/10.5281/zenodo.10581710.

  • Wills, R. C. J., D. S. Battisti, K. C. Armour, T. Schneider, and C. Deser, 2020: Pattern recognition methods to separate forced responses from internal variability in climate model ensembles and observations. J. Climate, 33, 86938719, https://doi.org/10.1175/JCLI-D-19-0855.1.

    • Search Google Scholar
    • Export Citation
  • Witten, D. M., R. Tibshirani, and T. Hastie, 2009: A penalized matrix decomposition, with applications to sparse principal components and canonical correlation analysis. Biostatistics, 10, 515534, https://doi.org/10.1093/biostatistics/kxp008.

    • Search Google Scholar
    • Export Citation
  • Yeh, S.-W., and Coauthors, 2018: ENSO atmospheric teleconnections and their response to greenhouse gas forcing. Rev. Geophys., 56, 185206, https://doi.org/10.1002/2017RG000568.

    • Search Google Scholar
    • Export Citation
All Time Past Year Past 30 Days
Abstract Views 882 807 113
Full Text Views 182 170 6
PDF Downloads 237 218 6