A novel latent space data assimilation framework with Autoencoder-Observation to Latent space (AE-O2L) network. Part I: the observation-only analysis method

Hang Fan a Precision Regional Earth Modeling and Information Center, Nanjing University of Information Science and Technology, Nanjing, China
b Collaborative Innovation Center on Forecast and Evaluation of Meteorological Disasters, Nanjing University of Information Science and Technology, Nanjing, China

,
Yubao Liu a Precision Regional Earth Modeling and Information Center, Nanjing University of Information Science and Technology, Nanjing, China
b Collaborative Innovation Center on Forecast and Evaluation of Meteorological Disasters, Nanjing University of Information Science and Technology, Nanjing, China

,
Zhaoyang Huo a Precision Regional Earth Modeling and Information Center, Nanjing University of Information Science and Technology, Nanjing, China
b Collaborative Innovation Center on Forecast and Evaluation of Meteorological Disasters, Nanjing University of Information Science and Technology, Nanjing, China

,
Yuewei Liu c National Science Foundation-National Center for Atmospheric Research (NSF-NCAR), Boulder, Colorado

,
Yueqin Shi d Cloud-Precipitation Physics and Weather Modification Key Laboratory, Weather Modification Centre of China Meteorological Administration, Beijing, China

, and
Yang Li e CMA Key Laboratory of Cloud-Precipitation Physics and Weather Modification, CMA Weather Modification Centre, Beijing, China


Abstract

Data assimilation (DA) aims to produce atmospheric analyses that are consistent with both observations and numerical model forecasts. However, the continuing increase in forecast resolution and observation richness places a growing computational burden on DA. To address this challenge, we develop a novel latent space data assimilation (LSDA) framework that performs efficient DA in a reduced-dimensional latent space learned by an autoencoder (AE) from numerical atmospheric states. Unlike previously reported LSDA methods, our approach introduces an additional neural network, O2Lnet, which is trained on simulated observations derived from model states and maps real observations onto the AE latent space. The observation-derived latent state produced by O2Lnet can then be decoded directly by the decoder component of the AE to obtain the analysis in model space. In Part I, we demonstrate the feasibility of this observation-only analysis method, denoted LSDA-OOA, by inferring 2-m temperature (T2) analyses on 1-km grids from both idealized and real T2 observations.

The idealized experiments demonstrate that, given sufficient observations, LSDA-OOA can yield high-quality analyses while remaining resilient to random observation errors. When applied to real T2 observations, LSDA-OOA produces T2 analyses with accuracy comparable to that of the WRF (Weather Research and Forecasting) four-dimensional data assimilation (FDDA) method. In particular, it greatly outperforms WRF-FDDA in cases where the forecasts (background fields) contain larger errors. Finally, we replace the WRF-FDDA analyses used as training data with forecasts and find that this results in only a small increase in the error of the LSDA-OOA analyses.
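
The observation-only analysis chain described in the abstract (observations mapped to a latent state, then decoded to model space) can be illustrated with a toy example. The sketch below is not the authors' implementation: it assumes synthetic 1-D fields in place of T2 grids, a linear autoencoder obtained by truncated SVD in place of the AE, and a least-squares map in place of O2Lnet. All names (`sample_states`, `observe`, `obs_to_latent`) and all sizes are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical toy setup: smooth 1-D fields stand in for gridded T2 states.
n_grid, n_train, n_latent, n_obs = 64, 500, 6, 16
x = np.linspace(0.0, 2.0 * np.pi, n_grid, endpoint=False)
basis = np.stack([np.sin(k * x) for k in (1, 2, 3)] +
                 [np.cos(k * x) for k in (1, 2, 3)])      # (6, n_grid)


def sample_states(n):
    """Draw n synthetic model states as random mixtures of Fourier modes."""
    return rng.normal(size=(n, basis.shape[0])) @ basis


states = sample_states(n_train)

# Linear "autoencoder" via truncated SVD (PCA); an assumed stand-in for the AE.
mean = states.mean(axis=0)
_, _, vt = np.linalg.svd(states - mean, full_matrices=False)
components = vt[:n_latent]                                # (n_latent, n_grid)


def encode(s):
    return (s - mean) @ components.T


def decode(z):
    return z @ components + mean


# Simulated observations: model states sampled at evenly spaced "stations".
stations = np.arange(0, n_grid, n_grid // n_obs)


def observe(s, noise_std=0.0):
    noise = noise_std * rng.normal(size=(s.shape[0], stations.size))
    return s[:, stations] + noise


# Assumed stand-in for O2Lnet: least-squares map from observations to latents,
# fitted on simulated observations of the training states.
design = np.hstack([observe(states), np.ones((n_train, 1))])  # bias column
# rcond discards the near-zero singular values of the rank-deficient design.
weights, *_ = np.linalg.lstsq(design, encode(states), rcond=1e-10)


def obs_to_latent(obs):
    return np.hstack([obs, np.ones((obs.shape[0], 1))]) @ weights


# Observation-only analysis on unseen states: observe -> latent -> decode.
truth = sample_states(50)
analysis = decode(obs_to_latent(observe(truth, noise_std=0.05)))
rmse = float(np.sqrt(np.mean((analysis - truth) ** 2)))
print(f"analysis RMSE with noisy observations: {rmse:.3f}")
```

In this toy setting the background forecast never enters the analysis step; only the training data come from model states, which mirrors the observation-only idea in a much simplified linear form.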

© 2025 American Meteorological Society. This is an Author Accepted Manuscript distributed under the terms of the default AMS reuse license. For information regarding reuse and general copyright information, consult the AMS Copyright Policy (www.ametsoc.org/PUBSReuseLicenses).

Corresponding author: Yubao Liu, ybliu@nuist.edu.cn
