1. Introduction
The nonhydrostatic effects start to play a significant role when the dimension of both the horizontal and vertical scales of motion become comparable. This happens typically when reaching a horizontal grid size of around 2 km. Aimed at achieving those scales, nonhydrostatic dynamics was introduced in the originally hydrostatic limited-area NWP system ALADIN. The fully compressible nonhydrostatic dynamical kernel of the ALADIN system was designed following the rule that it keeps as many features of its hydrostatic version as possible. The basic choices made are as follows: the spectral technique used for the horizontal spatial discretization, semi-implicit time stepping, and semi-Lagrangian advection [see Bubnová et al. (1995) and Bénard et al. (2010) for details].
For vertical discretization, the finite-difference (FD) scheme of Simmons and Burridge (1981) was applied since the beginning of the project, innovated with the semi-Lagrangian vertical advection according to Ritchie et al. (1995). This scheme is only first-order accurate for nonuniform spacing of vertical levels. An alternative finite-element (FE) vertical discretization was implemented in the hydrostatic core of the ALADIN system and in the global forecast system ARPEGE/IFS by Untch and Hortal (2004). A remarkable fact is that, with the hydrostatic approximation and the semi-Lagrangian advection, the only vertical operations needed are vertical integrals. Therefore, an integral operator was derived based on the Galerkin method using cubic B splines with compact support as basis functions. Also, the use of piecewise linear B-spline basis functions was implemented as an alternative option. It was shown that the FE scheme gives more accurate phase speeds of most of the linear gravity waves than the FD scheme. In addition, the cubic FE scheme has proven to be eighth-order accurate for integrating smooth functions, compared to the first-order accuracy of the FD method. Furthermore, the FE scheme reduces the level of vertical noise in forecasts with the full hydrostatic model of ECMWF, reduces the cold bias in the lower atmosphere, and improves the transport in the stratosphere. Consecutively, the FE vertical discretization has been tested in several hydrostatic applications of the limited-area model ALADIN, with detected positive impact on the objective verification scores. On top of that, the FE method has proven beneficial in 2D vertical plane idealized tests of a resting, hydrostatically balanced state. Finally, but not less important, the computational cost of such an improvement in accuracy is negligible, because the FE integral operator is defined once in the setup of the model, and it is used in the model, otherwise, exactly in the same way as the FD integral operator.
Not surprisingly, a need has emerged to extend the FE method to the vertical discretization of the fully compressible dynamical core of the ALADIN system. This task has shown to be more intricate and troublesome. Contrary to the hydrostatic equations in a mass-based vertical coordinate, where only the vertical integral operator appears, and to the fully compressible system of equations cast in a height-based coordinate, where only the vertical derivative operator occurs (Simarro and Hortal 2012), in the fully compressible Euler equations of the ALADIN system designed for the mass-based vertical coordinate, both the integral and the derivative vertical operators appear (Laprise 1992).
The difficulty lies not in the necessity to define both sets of operators, but in the need to define them consistently in order to assure the stability and accuracy of the model. There are two points in the nonhydrostatic dynamical core of the ALADIN system where the replacement of the FD operators by the high-order FE operators must be done carefully.
The first crucial point is the semi-implicit scheme. As it is explained in Bubnová et al. (1995), in the semi-implicit step of the ALADIN system, an implicit linear system is solved separately for each horizontal spectral eigenfunction. The unknowns are here the amplitudes of the prognostic variables at each model level for the next time step. The procedure is to reduce the system to a Helmholtz equation for the vertical divergence amplitudes. Because of this reduction, and for stability reasons, an analytic relation involving vertical operators, the so-called C1 constraint, must be fulfilled by the discrete version of those operators (see section 5b and appendix B for C1 definition). The FD operators of the nonhydrostatic dynamical core of the ALADIN system are defined in such a way that C1 constraint is satisfied. However, if the vertical operators do not fulfill the C1 constraint, as it is the case for the FE operators defined in this work, an alternative method must be adopted. We stop the reduction of the implicit linear system toward the Helmholtz equation just before the point where the C1 constraint is used. In this way, a linear system involving the vertical divergence and horizontal divergence amplitudes appears, which is solved iteratively. Such a solution converges in all tested idealized and real cases and, because of the choice of a convenient preconditioning of the system, one iteration of the implicit solver is enough to reach satisfying results. The computational price to be paid is very small and does not penalize the whole integration significantly.
The second crucial point is the transformation between the vertical divergence, used in the spectral calculations for stability reasons explained in Bénard et al. (2004, 2005), and the vertical velocity, used in the gridpoint calculations. This transformation must be invertible, and it implies that integral and derivative vertical operators cannot be defined independently. This goal is not fulfilled for FE operators up to now, and the transformations between the vertical divergence and the vertical velocity are still FD ones. The accuracy reached in real forecasts may thus be limited by this fact, and it is foreseen to include invertible high-order transformations between vertical velocity and vertical divergence in the future.
In this paper we follow the notation of the reference papers Bubnová et al. (1995) and Bénard et al. (2010), with small differences. Functions that appear in the paper are mostly space and time dependent. We omit the horizontal space and the time dependence, since we are interested in the vertical space dependence only. Continuous functions are written in lightface italic font (f), discrete functions represented as vectors of values at individual vertical levels are written in boldface roman font (
The paper is organized as follows. In section 2, model variables and the set of equations used are presented. In section 3, the definition of discrete vertical operators based on FE is described, while vertical operator accuracy is discussed in section 4. A semi-implicit time scheme and the consequences that the usage of FE method in vertical has on its design are given in section 5. Results of idealized test cases are shown in section 6, where the comparison with reference cases using the FD method is shown. Real case experiments are presented in section 7 with the average computational time needed for their execution. Section 8 summarizes the results achieved, outlines future directions, and concludes the paper.
2. Model variables and equations
The aim of this work is to develop a set of high-resolution FE vertical operators and to implement it in the nonhydrostatic fully compressible dynamical core of the ALADIN system. Therefore, the model equations are exactly the same as those detailed in Bénard et al. (2010). For completeness, we rewrite the model equations here, with some simplifications in order to make the reading easier.













3. Finite-element scheme
To solve numerically the equations briefly described in the previous section, the spatial domain is discretized horizontally and vertically. In the vertical, the model domain is divided into a number of L layers, using the mass-based vertical coordinate described in Simmons and Burridge (1981). The full model levels are located inside these layers, while the half model levels are located at the material boundaries, that is, at the top and bottom of the atmosphere, and at the interfaces between layers. The model variables are staggered in the vertical direction, being all the variables defined at full levels, with the exception of the vertical velocity, which is located at half levels. See Fig. 1 for vertical staggering illustration.
Staggering of variables among vertical half and full model levels.
Citation: Monthly Weather Review 146, 10; 10.1175/MWR-D-18-0043.1
On the other hand, the horizontal discretization is spectral based on the use of eigenfunctions of the horizontal Laplacian operator, and all horizontal derivatives are calculated in the spectral space (Bubnová et al. 1995). In the following, we omit the horizontal direction, as this work is focused on vertical operators.
The method used for constructing FE operators is similar to the procedure described in Untch and Hortal (2004). However, there are differences, mainly, as it is explained later in this section, the inclusion of boundary conditions that are applied to the input and output functions and the general order of basis functions used. To make a clearer exposition of the method, we first describe how the vertical operators are used in the model.







Boundary conditions for discrete FE operators used in the model. Input and output functions are
Now, we introduce a novelty with respect to the method described in Untch and Hortal (2004). The discrete representation




In the following paragraphs we describe first how the discrete representation of a continuous function known on model η levels is obtained, and then how the vertical operators are constructed.
a. Interpolation with B-spline curve
In the process of discretizing vertical operators, we apply the FE procedure, as described for example in Lynch (2005). We use B-spline functions of a general order as the basis functions. Thus, linear functions and cubic B splines are particular choices that are consistent with the FE method used in the hydrostatic dynamical core of the ALADIN system.












The number of basis functions, that is, the cardinal of the index










b. Vertical operator definition



















In general, the operator matrix
4. Accuracy of vertical operators
We apply the vertical FE operators from Table 1 to the smooth function
The mean absolute error (MAE) of the vertical derivative operators applied on ξ for several resolutions with regularly distributed vertical levels. The accuracy order of these operators is calculated from MAE using linear regression (Calculated order). The analytically estimated accuracy orders (Analytical order), published in Staniforth and Wood (2005) for the first derivative FE operator and derived in section 4 for the second derivative FE operator, are listed in the last column. The FD derivative operator of the kth order is denoted as
The error of the defined FE vertical operators and of their FD counterparts calculated with 100 regularly distributed vertical levels, for (left) the first derivative and (right) the second derivative. We denote the eighth-order FD operator for the first derivative as
Citation: Monthly Weather Review 146, 10; 10.1175/MWR-D-18-0043.1
The mean absolute error of the defined vertical operators in relation to the number of vertical levels. Black lines show ideal line corresponding to the order 4, 6, and 8. The ideal line is followed almost exactly with FD operators of the corresponding order (not shown).
Citation: Monthly Weather Review 146, 10; 10.1175/MWR-D-18-0043.1
When using cubic B splines, more than eighth-order accuracy is achieved for the integral and the first derivative operators
The theoretical accuracy of integral and first derivative operators defined with cubic B splines was mathematically explained in Staniforth and Wood (2005). To the best of our knowledge, the same has not yet been done for a second derivative operator defined with cubic splines. We show here that the theoretical accuracy of
We assume regular η level distribution with distance












5. Time stepping
a. Linear system

























Discretization of the vertical integral operators






b. Implicit problem
Unfortunately, this constraint is not fulfilled for the FE operators used in the vertical discretization. On the other hand, the implicit problem in the discrete form is a linear inversion, and could be performed with two or more variables. If we adopt a solution of the implicit problem for the couple (D, d), then the constraint C1 no longer needs to be fulfilled. Instead of solving a system of L equations for the variable d, the Helmholtz equation, we can solve a system of
Therefore, instead of a direct inversion, we have opted for a preconditioned iterative method, which we outline in appendix B.
c. Nonlinear system













6. Sensitivity in idealized experiments
A set of test cases has been run in the 2D vertical plane version of the ALADIN system, including the nonlinear nonhydrostatic flow over idealized orography according to Bubnová et al. (1995) and the density current test published in Straka et al. (1993).
a. Nonlinear nonhydrostatic flow



We set
Vertical velocity at time
Citation: Monthly Weather Review 146, 10; 10.1175/MWR-D-18-0043.1
b. Density current















The potential temperature field at time
Citation: Monthly Weather Review 146, 10; 10.1175/MWR-D-18-0043.1
All basic features of the solution are kept by both methods. Compared with the reference solution, the details are better captured by the FE method. For a grid spacing of 25 m (Figs. 5b,c) the FE method gives a better shape of all the rotors than the FD method, and for a grid spacing of 50 m (Figs. 5d,e) the depth of the rotors is better resolved by the FE than the FD method. All results are in good agreement with the results shown in Fig. 1 of Straka et al. (1993).
7. Sensitivity in real-case experiments
Two series of forecasts starting from the ALARO1 analysis at 0000 UTC were run in 2-km horizontal resolution over the central Europe domain centered above the Czech Republic and partially covering the Alps (see Fig. 6) with 87 Czech operational vertical levels. One set starts from 28 May 2016 and continues to 6 June 2016 with convection events present frequently during the day over the majority of the domain. The second set covers the time period starting from 21 October to 30 October 2017. The conditions were stable with very strong wind occurring on 29 October 2017 with wind gusts exceeding
Orography in real experiments. The blue rectangle denotes the domain for cumulated precipitation shown in Fig. 7.
Citation: Monthly Weather Review 146, 10; 10.1175/MWR-D-18-0043.1
No sign of instability is apparent in any of the experiments. The iterative semi-implicit solver converges as indicated by the spectral radius test of the iteration matrix calculated in the setup part of the integration. Objective scores of the results with 1 iteration and with 10 iterations coincide in all parameters except for the time evolution of the bias for precipitation cumulated for 24 h in the autumnal series, where a small advantage of further iterations may be observed. We conclude that one iteration of the semi-implicit solver is enough for an accurate solution.
Furthermore, objective score characteristics are neutral to the change of vertical discretization (from FD to FE). The phenomenon that can be identified in the results is an interaction of the vertical discretization with the resolved convection. Just for providing an example, the precipitation cumulated for 3 h between 1100 and 1400 UTC is shown in Fig. 7 for the integration starting at 0000 UTC 1 June 2016. The maxima are decreased slightly with FE discretization, which corresponds better to observations.
Estimation of precipitation cumulated at 1100–1400 UTC 1 Jun 2016 over the Czech republic territory. (top) Combined information from radars and point rain gauges and the corresponding precipitation field forecasted by the ALARO simulation with the vertical discretization realized (middle) through FE and (bottom) through FD. Only one iteration of the semi-implicit solver is applied in both cases.
Citation: Monthly Weather Review 146, 10; 10.1175/MWR-D-18-0043.1
The computational performance depends heavily on the computational platform used. The 3D experiments for this paper were run on NEC LX series HPC cluster with 320 computing nodes, with each node based on two Intel Broadwell CPUs. For any configuration, the FE method is more expensive then the FD method. The computational overhead of the FE method depends nevertheless on the used hybrid parallelization (MPI-OpenMP), nodes usage, and on the cash-blocking mechanism realized through the cash-blocking length parameter. We show in Table 3 the average CPU time needed for one time step of the whole model integration for several cash-blocking lengths. The results obtained are dependent on other computer device parameters as well and the conclusions on the CPU time consumption are only illustrative.
The average CPU time needed for one time step of the 3D simulation for the two vertical discretization methods, the average CPU time overhead of the FE method without iteration of the Helmholz solver, and the average CPU time overhead of the Helmholtz solver iterations over the FD method in percent units. Here “0 + 1” denotes CPU time overhead of the FE solution used in 3D experiments over the FD solution. Different cash-blocking lengths are applied. The 3D experiments were run on 72 computing nodes of the NEC LX series HPC cluster, each node is based on two Intel Broadwell CPUs.
The average CPU time overhead of the FE method over the FD method was calculated using the average CPU time per one time step. Without iteration of the Helmholtz solver, the overhead comes from the matrix multiplication employed in the FE method in place of the difference and division operations used in the FD method. It depends on the matrix multiplication code efficiency. We use FORTRAN routine DGEMM from the Lapack library for matrix multiplication. The magnitude of this overhead is 5%–8%. Moreover, there is an overhead of needed CPU time for the FE method coming from different methods used in the Helmholtz solver and depending on the number of iterations of this solver. The magnitude of this overhead is about 1%–2% per iteration (one iteration is enough to reach satisfactory results).
The overall CPU time needed is thus increased by 8%–9% when the FE method is used (the last column in Table 3 denoted “0 + 1”) depending on the parallelization (MPI-OpenMP) and the optimization method applied. The important fact is that the CPU time needed is independent of the order of B splines used as basis functions. On the other hand, increasing the order of the FD vertical discretization would lead to the increase in CPU time consumption.
8. Summary and discussion
We describe in this work a finite-element vertical discretization used in the nonhydrostatic fully compressible dynamical core of the ALADIN system, which is general in the order of used B splines. The described method is an extension and generalization of the FE method implemented for the vertical discretization of the hydrostatic dynamical core of the ALADIN system and of ARPEGE/IFS global weather prediction system described in Untch and Hortal (2004).
The treatment of boundary conditions was changed with respect to the FE implementation in the hydrostatic core of the ALADIN system. We have included, in the definition of the FE operators, a set of linear boundary conditions that are applied to the input and output functions. We found out that the boundary conditions are crucial to achieve satisfactory stability properties. Nevertheless, the conditions imposed on the input and output functions are fully compatible with physical criteria valid for the corresponding meteorological variable.
In addition, compared to FD spectral space computations, we have implemented a stationary iterative solution of the Helmholtz structure equation. The proposed method appears to be convergent and it was shown that one iteration provides sufficient accuracy.
We performed a set of standard idealized tests, like the density current Straka test and various flow regimes over a bell-shaped mountain. These experiments proved the satisfactory accuracy properties of the proposed FE discretization, and they showed that the nonhydrostatic dynamical core remains as stable as it is with the FD discretization used in the vertical when semi-implicit time stepping is applied. Moreover, 3D diabatic experiments were performed with a 2-km model horizontal resolution over the central Europe domain partially covering the Alps. The objective scores were neutral to the change of vertical discretization in all tested cases. A slight shift of the precipitation amounts to lower intensities was observed with the FE method used, especially in the summer period.
The stability properties of the NH dynamical core require us to keep vertical staggering of the model variables for FE discretization. Hence, unlike all the other prognostic variables, the vertical velocity w is defined on the half model levels. The vertical derivative of w then requires an application of a staggered FE operator. This requirement limits the theoretical accuracy of the proposed FE method, because staggered FE operators are not superconvergent. Moreover, the transformations between w and the vertical divergence variable d needed in the implicit calculations keep the FD approach. This may again limit the possible overall accuracy of the model. This is left for further investigation.
We have implemented the FE method with the general order of B splines. So far all tests were restricted to the cubic B splines only. Nevertheless, we plan to study the influence of the B-spline order on the accuracy and the time stepping stability of the whole system.
Acknowledgments
The authors express their gratitude to all the ALADIN and HIRLAM colleagues involved in the development of the ALADIN-HIRLAM system for their support. We would like to give special thanks to Álvaro Subías for fruitful discussions and Karim Yessad for his work on several parts of the semi-implicit time scheme facilitating our work. We are grateful to RC LACE for the financial support of several research stays devoted to the topic of this paper. We further thank the two anonymous reviewers whose suggestions helped improve and clarify the manuscript.
APPENDIX A
Construction of B-Spline Basis Functions
In this appendix we explain how the B-spline basis functions are constructed. The goal is to construct a B-spline basis function
The input information is the order C of the B splines (



We must define some η points, referred to as the knots, which may be different from the η values at the full model levels. However, once the distribution of the full model levels is given, the choice of knots is not arbitrary, because B-spline basis functions must be distributed in such a way that there is at least one full model level in the support of each B-spline function. Once the knots are set, the B-spline basis functions are constructed from them using de Boor’s algorithm (de Boor 1978).
The number of knots is
A different basis for seven vertical model levels. Black dots denote vertical η levels, while squares indicate positions of knots. The thickness of lines differs for individual basis functions to better distinguish between them. (left) No boundary conditions applied and (right) three boundary conditions applied implicitly:
Citation: Monthly Weather Review 146, 10; 10.1175/MWR-D-18-0043.1
APPENDIX B
Iterative Procedure to Solve the Helmholtz Problem














APPENDIX C
Definition of Full-Level Positions and Layer Depths























REFERENCES
Bénard, P., 2003: Stability of semi-implicit and iterative centered-implicit time discretizations for various equation systems used in NWP. Mon. Wea. Rev., 131, 2479–2491, https://doi.org/10.1175/1520-0493(2003)131<2479:SOSAIC>2.0.CO;2.
Bénard, P., 2004: On the use of a wider class of linear systems for the design of constant-coefficients semi-implicit time schemes in NWP. Mon. Wea. Rev., 132, 1319–1324, https://doi.org/10.1175/1520-0493(2004)132<1319:OTUOAW>2.0.CO;2.
Bénard, P., R. Laprise, J. Vivoda, and P. Smolíková, 2004: Stability of leapfrog constant-coefficients semi-implicit schemes for the fully elastic system of Euler equations: Flat-terrain case. Mon. Wea. Rev., 132, 1306–1318, https://doi.org/10.1175/1520-0493(2004)132<1306:SOLCSS>2.0.CO;2.
Bénard, P., J. Mašek, and P. Smolíková, 2005: Stability of leapfrog constant-coefficients semi-implicit schemes for the fully elastic system of Euler equations: Case with orography. Mon. Wea. Rev., 133, 1065–1075, https://doi.org/10.1175/MWR2907.1.
Bénard, P., J. Vivoda, J. Mašek, P. Smolíková, K. Yessad, C. Smith, R. Brožková, and J. F. Geleyn, 2010: Dynamical kernel of the Aladin–NH spectral limited-area model: Revised formulation and sensitivity experiments. Quart. J. Roy. Meteor. Soc., 136, 155–169, https://doi.org/10.1002/qj.522.
Bubnová, R., G. Hello, P. Bénard, and J. F. Geleyn, 1995: Integration of the fully elastic equations cast in the hydrostatic pressure terrain-following coordinate in the framework of the ARPEGE/Aladin NWP system. Mon. Wea. Rev., 123, 515–535, https://doi.org/10.1175/1520-0493(1995)123<0515:IOTFEE>2.0.CO;2.
Caluwaerts, S., D. Degrauwe, P. Termonia, F. Voitus, P. Bénard, and J.-F. Geleyn, 2015: Importance of temporal symmetry in spatial discretization for geostrophic adjustment in semi-implicit Z-grid schemes. Quart. J. Roy. Meteor. Soc., 141, 128–138, https://doi.org/10.1002/qj.2344.
de Boor, C., 1978: A Practical Guide to Splines. Applied Mathematical Sciences Series, Vol. 27, Springer-Verlag, 348 pp., https://doi.org/10.1137/1022106.
Guerra, J. E., and P. A. Ullrich, 2016: A high-order staggered finite-element vertical discretization for non-hydrostatic atmospheric models. Geosci. Model Dev., 9, 2007–2029, https://doi.org/10.5194/gmd-9-2007-2016.
Hortal, M., 2002: The development and testing of a new two-time-level semi-Lagrangian scheme (SETTLS) in the ECMWF forecast model. Quart. J. Roy. Meteor. Soc., 128, 1671–1687, https://doi.org/10.1002/qj.200212858314.
Laprise, R., 1992: The Euler equations of motion with hydrostatic pressure as an independent variable. Mon. Wea. Rev., 120, 197–207, https://doi.org/10.1175/1520-0493(1992)120<0197:TEEOMW>2.0.CO;2.
Lorenz, E. N., 1960: Energy and numerical weather prediction. Tellus, 12, 364–373, https://doi.org/10.3402/tellusa.v12i4.9420.
Lynch, D. R., 2005: Numerical Partial Differential Equations for Environmental Scientists and Engineers: A First Practical Course. Springer, 388 pp., https://doi.org/10.1007/b102052.
Ritchie, H., C. Temperton, A. Simmons, M. Hortal, T. Davies, D. Dent, and M. Hamrud, 1995: Implementation of the semi-Lagrangian method in a high-resolution version of the ECMWF forecast model. Mon. Wea. Rev., 123, 489–514, https://doi.org/10.1175/1520-0493(1995)123<0489:IOTSLM>2.0.CO;2.
Simarro, J., and M. Hortal, 2012: A semi-implicit non-hydrostatic dynamical kernel using finite elements in the vertical discretization. Quart. J. Roy. Meteor. Soc., 138, 826–839, https://doi.org/10.1002/qj.952.
Simmons, A. J., and D. M. Burridge, 1981: An energy and angular-momentum conserving vertical finite-difference scheme and hybrid vertical coordinates. Mon. Wea. Rev., 109, 758–766, https://doi.org/10.1175/1520-0493(1981)109<0758:AEAAMC>2.0.CO;2.
Staniforth, A., and N. Wood, 2005: Comments on “A finite-element scheme for the vertical discretization in the semi-Lagrangian version of the ECMWF forecast model” by A. Untch and M. Hortal (April B, 2004, 130, 1505–1530). Quart. J. Roy. Meteor. Soc., 131, 765–772, https://doi.org/10.1256/qj.04.10.
Straka, J. M., R. B. Wilhelmson, L. J. Wicker, J. R. Anderson, and K. K. Droegemeier, 1993: Numerical solutions of a non linear density current: A benchmark solution and comparisons. Int. J. Numer. Methods Fluids, 17, 1–22, https://doi.org/10.1002/fld.1650170103.
Termonia, P., and Coauthors, 2018: The ALADIN System and its canonical model configurations AROME CY41T1 and ALARO CY40T1. Geosci. Model Dev., 11, 257–281, https://doi.org/10.5194/gmd-11-257-2018.
Untch, A., and M. Hortal, 2004: A finite-element scheme for the vertical discretization of the semi-Lagrangian version of the ECMWF forecast model. Quart. J. Roy. Meteor. Soc., 130, 1505–1530, https://doi.org/10.1256/qj.03.173.
Voitus, F., 2017: An alternative elimination procedure. Internal Note of Météo-France, Météo-France, Toulouse, France, 2 pp.
Yessad, K., 2006: Semi-implicit spectral computations in the NH version of ARPEGE/ALADIN: Specific problems when the constraint “C1” is relaxed. Météo-France Tech. Rep., Météo-France, Toulouse, France, 14 pp.
ALARO is the canonical model configuration of the ALADIN system, as described in Termonia et al. (2018).