1. Introduction
The new generation of atmospheric spectral radiance sensors has much higher radiometric sensitivity and spatial and spectral resolution than previous sensors and has a much larger number of channels. These enhancements and the growth in the number of high-resolution sensors place increasing demands on the computational aspects of remote sensing. Atmospheric radiative transfer (RT) models are an integral part of the modeling and simulation of scene radiances for use in sensor design and simulation. They are also used for extracting geophysical information from the radiometric data as part of the training for statistical inversion approaches, within physical inversion algorithms, and within direct assimilation of measured radiances into atmospheric numerical models.
This paper describes a fast and accurate radiative transfer modeling technique, optimal spectral sampling (OSS). The OSS approach was designed specifically to address the need for highly accurate real-time monochromatic radiative transfer calculations (including the Jacobians) for any class of multispectral, hyperspectral, or ultraspectral sensor at any spectral region from the microwave through the ultraviolet (Moncet 2000; Moncet et al. 2001a). Validation capabilities and multisensor data assimilation are enhanced with across-the-spectrum capability, allowing the same radiative transfer model to be used with multiple sensors to ensure spectral consistency.
One advantage of the TPTR technique is that it requires the evaluation of only a single exponential to compute the transmittance for each atmospheric level and molecular constituent, thereby minimizing the total number of calculations. While there is a computationally efficient analytic solution to the calculation of the radiometric Jacobians, the overall efficiency of this calculation is affected by the dependence of the optical depths on profile variables above the layer under consideration (Strow et al. 2003). Furthermore, the overall radiometric accuracy of the method can be difficult to control because it depends largely on the choice of predictors (X) that are selected on an empirical basis and lack a direct physical basis. A set of X developed for a downlooking sensor may be inadequate for an uplooking or limb-viewing sensor. Similarly, this method is not practical for use with airborne sensors because the coefficients (a) are obtained from a least squares fit to the path properties and a different set is required for each altitude of the sensor. Additional constraints arise from the fact that the formulation is not directly adaptable to nonpositive instrument line shape (ILS) functions, such as sinc functions produced by (unapodized) Fourier transforms of interferograms sampled over a limited range of optical path difference (McMillin et al. 1998; Barnet et al. 2000).
Similarly, this TPTR parameterization has inherent accuracy limits when applied to cloudy atmospheres with moderate cloud optical depths because it has no means to respond to the cloud-induced changes in the atmospheric spectrum on subchannel spectral scales, from line centers to line wings.
OSS is fundamentally a monochromatic approach, and that property alleviates many of the limitations listed above and makes the model generic and flexible. OSS can accurately treat surface reflectance and spectral variations of the Planck function and surface emissivity within the channel passband, given that the proper training is applied. In addition, the method can maintain its accuracy when applied to multiple scattering calculations—an important factor for treating cloudy radiances—and is directly applicable to nonpositive ILS. One advantage of OSS over existing methods is that its numerical accuracy is selectable such that it is capable of fitting reference transmittance calculations arbitrarily closely.
The OSS method was initially developed for supporting sensor system design trade-off studies and performance evaluations, such as those conducted in the context of the Cross-Track Infrared Sounder (CrIS) and the Conically Scanning Microwave Imager/Sounder (CMIS) payloads of the National Polar-Orbiting Operational Environmental Satellite System (NPOESS) and the Hyperspectral Environmental Suite for the Geostationary Operational Environmental Satellite (GOES) system. OSS is well suited to system development applications because, with only minor configuration changes, it can be tailored to a wide range of remote sensing problems: downlooking (satellite sensors), uplooking (ground-based sensors), and aircraft or balloon (variable viewing and altitude ranges), including limb and line-of-sight measurements. OSS was furthermore adopted for use within the baseline operational retrieval algorithms for CrIS and CMIS (Moncet et al. 2001a, b). OSS is used also by NASA (Liu et al. 2003) in their analysis of the aircraft sensor data such as that obtained by the NPOESS Airborne Sounding Testbed Interferometer.
This paper focuses on application to infrared remote sensing. The scope is limited to the theory and practical implementation of methods for modeling transmittances and radiances over measurement bands. Issues related to selection of a computational pressure grid and subgrid layer parameterizations (e.g., Matricardi 2003; Clough et al. 1992) are not addressed here because the OSS method is general with respect to these issues.
2. The OSS method
The main difficulties encountered in attempting to apply the ESFT technique to vertically inhomogeneous atmospheres directly (i.e., without invoking effective path properties) is in maintaining consistency among the different layers with respect to the choice of the representative k values. Most attempts to extend the application of the ESFT to inhomogeneous atmospheres (e.g., Lacis and Oinas 1991; Goody et al. 1989) rely on the assumption that the absorption coefficients in different altitude regimes are perfectly correlated (the correlated-k approach), an assumption that breaks down, for a single gas, in the presence of lines with different strengths or for highly temperature-dependent line strengths (e.g., West et al. 1990) and, for gas mixtures, when the relative concentration of the absorbers in the mixture differs in the two altitude regimes. Although an efficient mapping algorithm was developed by West et al. (1990) to reduce the errors due to the above assumption in the context of a single gas, none of the proposed approaches deals explicitly with gas mixtures. The problem can usually be avoided either by treating gas mixtures as a single gas (e.g., Mlawer et al. 1997; Sun and Rikus 1999), which may lead to significant errors when the composition of the mixture changes along the path, or by making use of the multiplication property for band transmittances (e.g., Armbruster and Fischer 1996), which is only valid in a few specific regimes. The impact of these approximations on the accuracy of the radiative transfer calculations is, however, limited by the fact that in most problems the radiation originates from a restricted altitude regime with quasi-homogenous thermodynamic properties, making it possible to tune the model parameters for that regime and for the interval considered. A rigorous solution to the inhomogeneous problem requires a full characterization of the multivariate probability distribution, f (k11, k12, . . . , k1m; k21, . . . , klm), of the absorption coefficients in all layers l and for all active constituents m. Simultaneously solving for all the representative k values in this context may not be feasible.
One distinct advantage of OSS is that the error tolerance for fitting (11) is selected a priori by the user, even in the multilayer case. This feature allows the user flexibility to tailor the fitting to balance the radiometric accuracy requirements dictated by the application and the computation time constraints. A tighter threshold will provide a fit with a larger number of nodes, at the cost of more calculations.
3. Search procedure
With the so-called “localized” training described in this paper, we train an OSS model for a given high-spectral-resolution sensor by considering one channel at a time. We defer detailed discussion of an alternative “global” training method, which gains computational economy by treating the channel set as a whole and which is more applicable to broad spectral domains.
We use an automated search to identify the smallest subset ℑN for which the rms difference is less than a prescribed tolerance: that is, εN < εtol. Two algorithms have been implemented for selecting the OSS nodes. The first approach is based on the search method Wiscombe and Evans (1977) applied in the context of ESFT. This approach (subsequently referred to as the W/E approach) starts with N = 1 and performs an exhaustive search among all possible one-member subsets ℑ1 for the spectral location that produces the lowest ε1. Once the first node has been determined, ε1 is compared to εtol. If ε1 is less than the prescribed tolerance, the procedure stops. Otherwise, N is incremented by one and the algorithm proceeds, as above, by successively adding all the elements of ℑM − ℑN−1 to ℑN−1 to form M − (N − 1) new ensembles ℑN, and by finding, among all these candidate combinations, the one that produces the lowest εN. The weights wi are recomputed for each trial combination. This incremental process is repeated until the solution satisfies εN < εtol. When we use training profile sets that are stratified by view angle (section 4), we continue until εN < εtol is satisfied at each of the angles. This approach ensures that the OSS model performance is uniform across the range of air mass paths. The impact of this strategy is most apparent at large incidence angles (60° and above).
With the W/E method, the addition of a new node does not modify the previously selected nodes unless the vector w at a given step (for the selection that produced the lowest εN) contains one or more negative weights, indicating that this solution is of dubious robustness. In the context of positively defined instrument functions, all weights are required to be positive (or no more than slightly negative with an increasing tolerance for negativity as N grows). Large negative coefficients arise when a newly added node carries redundant spectral information, in which case matrix 𝗔 becomes ill conditioned. Such situations arise most frequently when N becomes large (N > ∼10) and the solution path has reached a local minimum. In this case, one of the elements of ℑN−1 associated with a negative weight is dropped and replaced by νN and the step is reinitialized. Wiscombe and Evans (1977) provide a full discussion of this so-called “term-dropping” procedure. We augment this procedure by monitoring the determinant of matrix 𝗔. The positive-w criterion cannot be used when dealing with nonpositive ILS such as interferometric sinc functions, for which we rely solely on the determinant check. It is useful, when term dropping has been done, to attempt to refine the W/E solution once convergence has been reached by applying the entire search procedure a second time while restricting the candidate nodes to the ℑN obtained in the first pass, rather than starting with the full set ℑM. This second pass is aimed at eliminating any redundant elements from the initial selection.
A comparison of the performances of the W/E and MC approaches is provided in Fig. 2. The W/E method has the advantage that it is faster than the MC method, with the number of trial combinations on the order of 2MN and 10MN, respectively. On the other hand, failure to converge toward the optimal solution is not uncommon with the W/E method when N becomes large. The performance of the MC method, for the test cases considered (e.g., Fig. 2), equates that of a global search. The efficiency of the MC approach in relation to a global search is explained by the fact that, for a given N, there are many combinations of nodes that provide equally good fits.
The search procedure (with either approach) becomes time consuming when M exceeds a few thousand (i.e., for Δν ≳ 1 cm−1 in the infrared). In such cases, it is possible to divide Δν into smaller subintervals and apply the OSS selection to each subinterval. The width of the subintervals is then progressively increased until it becomes comparable to or even larger than the width of the instrument function’s primary response mode. At each increase in the size of the subintervals, a new search is applied using the nodes selected in the previous step as the initial set from which to select in the current step. It is only in the final selection step that the desired sensor instrument function is applied. The algorithm backtracks to smaller subintervals if necessary to meet the requirement for error <εtol. When dealing with instruments whose channels extensively overlap, the use of large preliminary subintervals tends to encourage sharing of nodes between neighboring channels, thus reducing the total number of nodes needed for the channel set as a whole. The preliminary node selections can furthermore be saved and reused with different sensors.
4. Training dataset
Atmospheric sets commonly used in remote sensing applications for fast-model training and validation include the diverse 52-profile set developed from the European Centre for Medium-Range Weather Forecasts (ECMWF) global model (http://www.metoffice.gov.uk/research/interproj/nwpsaf/rtm/) and the 48-profile set developed at the University of Maryland, Baltimore County (UMBC) (http://cimss.ssec.wisc.edu/itwg/groups/rtwg/rtwg.html). The atmospheric data are specified on the standard 101-level UMBC grid (Strow et al. 2003), the same grid used for the RT calculations in the current OSS model.
The starting point for building our training dataset was the 52-profile ECMWF set. This set was replicated four times to form a total of five subsets, each of which was assigned a different view angle (or zenith angle) value ranging from 0° and 60° (specifically, 0.00°, 36.87°, 48.19°, 55.15°, and 60.00°). The range of scan angle used for training is not an inherent property of OSS but was tailored here for application to operational cross-track scanning instruments. The total number of scenes in the final training set is 260. Each profile in the set was assigned a random surface emissivity (between 0.7 and 1.0) and a random surface pressure. Water vapor and ozone are the only variable molecules considered in the present study. In recent applications, the OSS training has been expanded to include variability of other trace gases, including synoptic, seasonal, and secular trends associated with climate change.
To ensure robustness of the OSS RT model, the atmospheric dataset used for training must encompass the full range of atmospheric conditions encountered in nature and include realistic vertical structure. Because adequate measurements of temperature and molecular concentrations in the upper atmosphere are lacking, profiles are generally extrapolated from the lower part of the atmosphere, thereby introducing artificial correlations between the lower and upper atmosphere as well as profile-to-profile correlations. In addition, profiles often tend to be smoother than in nature, either because of the vertical coarseness of the radiosonde records or because of the various physical and numerical constraints used in weather or chemistry models. Exploitation of these artificial vertical correlations, if left uncorrected, is inherent to the OSS training process (and other fast RT methods) and results in a reduction in the number of nodes needed to fit the reference radiances within required accuracy threshold. While the reduced node set is sufficient to fit the training data, it is not robust when applying OSS to real atmospheres (or different sources of atmospheric data), so errors may be larger than those obtained in the training.
5. OSS forward model
a. Model structure
The OSS forward model takes, as inputs, profiles of temperature and molecular concentration for the variable constituents, surface temperature, emissivity, and reflectivity, as well as profiles of cloud microphysical properties. The model computes radiances and Jacobians for a set of channels and specified viewing angle, sun angle, and observer level. Input surface emissivity and reflectivity are on a spectral grid that is defined by the user at run time. Surface optical properties are linearly interpolated from the grid points to the node wavenumbers. A separate module contains cloud optical properties tabulated as a function of fixed retrievable thermodynamic and microphysical parameters. The OSS model requires a set of wavenumbers (used for the calculation of the Planck function and interpolation of surface/cloud optical properties) and weights for the selected nodes as well as, for each node, absorption coefficients for “fixed” and variable concentration species tabulated as a function of temperature, pressure and, for water vapor, water vapor itself. The absorption lookup table is produced at the training stage (see section 5b). The reference line-by-line model that we have been using for the infrared is LBLRTM (Clough et al. 1992).
The radiative transfer computations are performed either on the internal OSS pressure grid (provided with the input molecular absorption lookup tables) or on a variable grid specified by the user as a calling argument, in which case a pressure interpolation of the tabulated absorption coefficients is performed. The number of molecules treated as variable, among the available molecular entries in the lookup table, is set once by the user at run time. The remaining molecules are assigned a standard concentration profile (currently stored in the absorption lookup table header) and their optical depths are added to the fixed-gases optical depth for each node. The same applies at any given node to variable molecules whose contribution to total atmospheric optical depth falls below a specified threshold. The contribution of these molecules is not zeroed out; only the impact of variations in their concentrations is neglected. The ability to dynamically select the variable species according to specific application requirements is greatly facilitated by the way molecular absorption is parameterized in the OSS method. Many trace constituents contribute significantly to the observed radiances only over limited spectral intervals. Computer memory is minimized by letting the number and type of variable constituents vary on a node-by-node basis in the lookup tables (the number of such constituents is generally less than five across the spectrum when all HITRAN molecules are considered). The fact that we only process the main active constituents at any given node also results in greater computational efficiency.
The baseline OSS computational structure has the node loop as the outer spectral loop. Because a given node generally contributes to more than one instrument channel, this structure avoids redundancies in the radiative transfer calculations and makes minimal use of computer memory.
For a given node, the OSS model first computes total (nadir) layer optical depths (on the input pressure grid) based on the data stored in the absorption lookup table, as well as the derivatives of these optical depths with respect to atmospheric temperature and concentrations of all variable constituents. A separate RT module uses the optical depths to compute radiances and analytically computes the derivatives of these radiances with respect to all input geophysical parameters. Finally, (11) is implemented in the form of a channel loop embedded into the main node loop in which one applies a weighting to the monochromatic radiances and Jacobians obtained for the current node and add the results to the radiance vector and Jacobian matrix elements corresponding to the channels that receive contribution from this node. Computation of radiances and analytical Jacobians is described in section 5c.
b. Optical depth calculations
For dry gases, the infrared OSS model considers only the pressure and temperature dependences of the absorption coefficient. This treatment does not provide an explicit treatment of the self-broadening in the computation of the line width and (consistent with our reference LBLRTM line-by-line model) makes no provision for the impact of the different collision efficiency of the water molecules (for which data are not available from the HITRAN spectroscopic database). In the infrared model, water vapor affects the optical depth calculations of the dry gases only through its impact on the molecular amounts.
c. Radiance and Jacobian calculations in the clear sky
6. Performance of the method
a. Dependencies
The number of nodes (a driver of computation time) depends on the specified radiometric accuracy of the model. For a given accuracy threshold, the number of nodes for a channel depends also, in a somewhat complex way, on the channel bandwidth, the range of the atmospheric opacity, and the number of active molecular species and the way their absorption overlaps both spectrally and vertically. To illustrate some of these dependencies, we have trained OSS models for an idealized spaceborne scanning radiometer measuring the TOA radiances in contiguous square impulse (“boxcar”) channel response functions of uniform width, covering the 650–850 cm−1 domain.
Figure 3 shows the number of selected nodes for 5-cm−1 boxcar widths and accuracy thresholds 0.05 and 0.01 K with carbon dioxide as the only absorber, with carbon dioxide and water vapor, and with carbon dioxide, water vapor, and ozone. When CO2 alone is considered, the average number of nodes needed to describe the TOA radiances in the 5 cm−1 wide intervals is largest in the 650–750 cm−1 region (up to 10 and 18 for the 0.05-K and 0.01-K accuracy thresholds, respectively) and decreases sharply beyond 750 cm−1. In the center of the CO2 ν2 band (667 cm−1), the absorption spectrum is characterized by strong, well-separated lines—a structure that gives rise to a large range of atmospheric opacities within the individual 5 cm−1 wide channels—and a large distribution of altitudes that contribute to each channel’s radiance (i.e., broad weighting functions). In this situation, a relatively large number of nodes is needed to model the impact on the channel radiances of temperature variations at all altitudes spanned by the weighting function. Other parts of the infrared spectrum where weighting functions are broad or double-peaking (as occurs when two or more gases absorb in distinct altitude regimes), hence showing a local maximum in number of nodes, include the 1042 cm−1 O3 band, the 1310 cm−1 CH4 band, and the CO2 ν3 band (2349 cm−1). In contrast, spectral regions containing single-species absorption features concentrated in the troposphere, such as regions with only water vapor absorption (∼1600 cm−1), require fewer nodes to meet a given radiometric threshold. An additional factor for water vapor is the strong continuum absorption, which reduces the transparency of the atmosphere between lines, tends to sharpen the weighting functions and makes the number of points used by the method relatively small in those regions. The variability in constituent concentration has only a moderate impact on the number of OSS nodes required to meet the accuracy threshold.
It is apparent from Fig. 3 that the addition of H2O and O3 causes a significant increase in the number of nodes selected for the model with 0.01-K accuracy. It is noteworthy that the increase in number of nodes due to O3 in the 700–750 cm−1 region is at least as pronounced as the increase due to H2O beyond 700 cm−1, despite the fact that the impact of variations in O3 concentration on the TOA radiances is smaller than the impact of H2O variations (typically less than 1.5 K compared to up to 15–20 K for H2O, Fig. 4). The relatively large impact of O3 on the number of nodes is attributed to a combination of two factors: (i) the greater complexity of the O3 absorption spectrum, which results in a weaker correlation between O3 and CO2 absorption spectra and (ii) the span of the O3 weighting functions.
The dependence of the number of nodes on the width of the channels is illustrated in Fig. 5. The dependence is most pronounced in the 700–750 cm−1 and 750–800 cm−1 intervals. In the 650–700 cm−1 region, there is a marked increase in the number of nodes when going from 0.1 to 0.5 cm−1 (i.e., up to scales comparable to the spacing between adjacent CO2 lines), but the number of selected nodes reaches a plateau as resolution further degrades. The structure of the CO2 absorption spectrum does not change significantly across the 650–700 cm−1 spectral domain, which makes the probability distribution of the absorption/radiance in a channel relatively independent of the width of the instrument function. In the 700–750 cm−1 and 750–800 cm−1 regions, the rapid changes in the CO2 absorption characteristics in the wing of the ν2 band, combined with changes in the absorption characteristics of O3 and H2O, cause the number of nodes to keep increasing with the width of the channels beyond scales of tens of wavenumbers. For this reason, modeling wideband imaging radiometers in these regions requires more nodes per channel than for high-spectral-resolution sounders. By comparison, the computational requirements for TPTR schemes are independent of the channel bandwidth, so they become more competitive with OSS, with respect to speed, for wide channels, although the accuracy of these schemes is limited by their failure to take account of the Planck function and surface emissivity variations across the bandwidth.
b. Application to the AIRS instrument
Performance data discussed in this section were obtained using H2O and O3 as variable gases and fixed concentrations for the other active molecules. Figure 7 shows the errors obtained for the Atmospheric Infrared Sounder (AIRS) (Aumann et al. 2003) with the 49-profile UMBC set used as independent validation data when training was done with the noisy ECMWF training set for an error threshold of 0.05 K. Note that the rms errors shown in Fig. 7 are generally much smaller than the nominal accuracy, even though lookup table interpolation errors are included and slightly exceed it in some channels. This consistency of performance with independent data demonstrates the effectiveness of the way that we built the training set. Note that the errors are small for low surface emissivities (Fig. 7b) as well as for high emissivities.
Table 1 shows the number of nodes selected for the AIRS channels. Figure 8 illustrates the node counts on a channel-by-channel basis, and shows how the number of nodes required to meet the error threshold (εtol = 0.05 K) varies with the absorption characteristics of the different spectral regions, as discussed in section 6a. The average number of nodes per channel over the whole AIRS instrument bandwidth is 10.5. However, note that the instrument functions for neighboring AIRS channels have significant spectral overlap. Consequently, many adjacent channels have a large fraction of nodes in common even though, with the localized training, there is no explicit mechanism to maximize the reuse of nodes across channels. From the point of view of timing the RT calculations, the relevant number of nodes is the total number of unique nodes. Over the full AIRS bandwidth, the total number of unique nodes is 3157, which corresponds to an average of ∼1.3 RT calculations per channel. The global training method (section 7) is designed to maximize the number of nodes shared among channels and has the potential to further enhance the computational efficiency of the OSS model for high-spectral-resolution instruments.
Saunders et al. (2007) presented a comparison of 14 RT models as applied to AIRS channels, including six line-by-line models (including LBLRTM) and eight fast models designed for remote sensing applications. The results show that OSS tracks very closely with LBLRTM with respect to radiances and Jacobians, and more closely than the other three fast models that also were trained to match LBLRTM, including the widely used OPTRAN model. A comparison of the OSS analytical Jacobians with those obtained from the LBLRTM (by finite difference method, using the double-precision LBRTM version) for selected AIRS channels is provided in the appendix.
Similar results have been found in comparing OSS and OPTRAN in application to infrared and microwave sensors with relatively low spectral resolution [specifically, the High-Resolution Infrared Radiation Sounder (HIRS), the Advanced Microwave Sounding Unit (AMSU), and the Special Sensor Microwave Imager/Sounder (SSM/I) (Han et al. 2005; Garand et al. 2001)].
Table 1 includes the computation time for the AIRS channel set. For the timing of the mapping of Jacobians from OSS nodes to channels, we included a step whereby the dimension of the Jacobians was reduced through EOF transformation of the geophysical variables (temperature, water vapor, ozone profiles, etc.), as would be done when the retrieval inversion operates on principal components of the geophysical variables. The timings were obtained with a single-CPU Intel Pentium-4 2.80-GHz computer.
Radiance computation requires six multiplications per layer and per node to compute the contribution of water vapor and an additional three multiplications for the fixed gases and each variable constituent. By comparison, the TPTR method (3) typically requires ten multiplications per layer, per channel for each constituent. Although it is difficult to compare the computational performance of two methods on that basis, as details of the code implementation largely influence the efficiency, it is apparent that the timing of the two methods for radiance computation should be comparable as long as the average number of nodes remains around two or three. With the OSS method, adding the computation of the Jacobians with respect to temperature and variable constituents approximately doubles the time required by radiance computations alone (Table 1). In the AIRS timing comparisons performed by Han et al. (2005), the OPTRAN-v7 TPTR model was slower than OSS (fit to 0.05-K accuracy) by a factor of 7.1 when Jacobians were included and by a factor of 2.3 when Jacobians were excluded from OPTRAN but retained in OSS.
c. Application to MODIS
The application of OSS to relatively wide imager bands is demonstrated for a selection of infrared channels from the Moderate Resolution Imaging Spectroradiometer (MODIS) (Barnes et al. 1998; Seemann et al. 2003). The number of nodes required to achieve 0.05-K accuracy ranged from 3 to 19 (Table 2). The node requirements for these channels were generally ∼0%–30% higher than for the much finer resolution AIRS channels in the same spectral regions.
7. Conclusions
The OSS method provides a way to approximate channel-average radiances by generalizing the ESFT or k-distribution concept to vertically inhomogeneous atmospheres with multiple absorbers. With this method, one can achieve selectably high fidelity to a reference line-by-line model with orders of magnitude reduction in computational cost.
In the present paper, we focused on the application of the OSS method to infrared sensors in the thermal regime. We demonstrated that only a few monochromatic RT operations, on average, are required to achieve accuracies on the order of a few hundredths of a kelvin for the AIRS high-spectral-resolution sensor. This efficiency, when combined with the efficiency with which our RT model produces the Jacobians required for the radiance inversion, makes the method very well suited to remote sensing applications. Several characteristics of the OSS method that distinguish it from other methods are summarized in Table 3.
Some of the characteristics in Table 3 are particularly relevant for atmospheric data assimilation systems. Algorithm speed and memory usage are often major concerns, and trade-offs against radiometric accuracy may be beneficial. The introduction of new sensor data sources into an assimilation system may be facilitated by the robustness and simplicity of the OSS method, accommodating new spectral and geometric properties without any need to develop new empirical formulations or to validate empirical coefficients. The speed with which the necessary coefficients can be generated is an additional asset of OSS.
The localized training described in this paper minimizes the number of nodes required to reconstruct radiances in each single (isolated) channel. While this criterion provides very good performance for AIRS and near-optimal performance for wideband imaging instruments such as MODIS, modification can be made to the search method to significantly benefit applications to high-spectral-resolution sounders. When ILS from different channels overlap extensively (such as sinc functions), the localized training does not explicitly enforce a maximum reuse of nodes among the channels. Even when the ILSs do not strongly overlap (like AIRS), some further reduction of node requirements can be obtained by treating the channel set as a whole and recognizing that nodes in one spectral region can be used to model radiances for channels in distant spectral locations with similar absorption properties. An alternative global OSS training approach selects nodes for the channel set as a whole, while employing node clustering to efficiently account for spectral correlations, and thus condenses the information of the full channel set into a minimal number of nodes. This alternative will be presented as a follow-up to the current paper.
Recently, the OSS method has been extended to deal accurately and efficiently with cloudy atmospheres in which scattering processes are important. We have also extended the OSS training to include all variable molecular species that are relevant to infrared sounding of the earth’s atmosphere. These developments will be reported separately. Improvements in the handling of the solar term and additional validation work in the near-infrared and visible domains, as well as modeling of nonlocal thermodynamic equilibrium effects in the near-infrared and microwave domain, are left for future work.
Acknowledgments. The work reported here was supported in part by the Integrated Program Office of the National Polar-Orbiting Operational Environmental Satellite System and by the Joint Center for Satellite Data Assimilation through NOAA award NA04NES4400005. Additional support was internal AER funding. We thank Xu Liu, who participated in the testing and validation of versions of the NPOESS/CrIS RT model and of alternate methods for extension of the OSS training process to nonpositive ILS, and Ryan Aschbrenner, who assisted with some computations. We also thank S. Hannon and L. Strow for access to their UMBC training dataset and R. Saunders and M. Matricardi for the profile dataset derived from ECMWF profiles produced by F. Chevallier.
REFERENCES
Anderson, G. P., J. H. Chetwynd, S. A. Clough, E. P. Shettle, and F. X. Kneizys, 1986: AFGL atmospheric constituent profiles (0–120 km). AFGL-TR-86-0110, Environmental Research Papers 954, 43 pp.
Armbruster, W., and J. Fischer, 1996: Improved method of exponential sum fitting of transmissions to describe the absorption of atmospheric gases. Appl. Opt., 35 , 1931–1941.
Aumann, H. H., and Coauthors, 2003: AIRS/AMSU/HSB on the Aqua mission: Design, science objectives, data products, and processing systems. IEEE Trans. Geosci. Remote Sens., 41 , 253–264.
Barnes, W. L., T. S. Pagano, and V. V. Salomonson, 1998: Prelaunch characteristics of the Moderate Resolution Imaging Spectroradiometer (MODIS) on EOS-AM1. IEEE Trans. Geosci. Remote Sens., 36 , 1088–1100.
Barnet, C. D., J. M. Blaisdell, and J. Susskind, 2000: Practical methods for rapid and accurate computation of interferometric spectra for remote sensing applications. IEEE Trans. Geosci. Remote Sens., 38 , 169–183.
Binder, K., and D. Heermann, 1997: Monte Carlo Simulations in Statistical Physics: An Introduction. 3rd ed. Springer, 150 pp.
Chedin, A., N. A. Scott, C. Wahiche, and P. Moulinier, 1985: The improved initialization inversion method: A high-resolution physical method for temperature retrievals from satellites of the TIROS-N series. J. Appl. Meteor, 24 , 124–143.
Clough, S. A., M. J. Iacono, and J-L. Moncet, 1992: Line-by-line calculation of atmospheric fluxes and cooling rates: Application to water vapor. J. Geophys. Res., 97 , 15761–15785.
Eyre, J. R., 1991: A fast radiative transfer model for satellite sounding systems. ECMWF Tech. Memo. 176, 28 pp. [Available online at http://www.ecmwf.int/publications/.].
Eyre, J. R., and H. M. Woolf, 1988: Transmittances of atmospheric gases in the microwave region: A fast model. Appl. Opt., 27 , 3244–3249.
Garand, L., and Coauthors, 2001: Radiance and Jacobian intercomparison of radiative transfer models applied to HIRS and AMSU channels. J. Geophys. Res., 106 , 24017–24031.
Goody, R., and Y. L. Yung, 1989: Atmospheric Radiation: Theoretical Basis. 2nd ed. Oxford University Press, 519 pp.
Goody, R., R. West, L. Chen, and D. Crisp, 1989: The correlated-k method for radiation calculations in nonhomogeneous atmospheres. J. Quant. Spectrosc. Radiat. Transfer, 42 , 539–550.
Han, Y., and Coauthors, 2005: Comparison between OPTRAN and OSS. Proc. Third Annual JCSDA Workshop, Camp Springs, MD, Joint Center for Satellite Data Assimilation. [Available online at http://www.jcsda.noaa.gov/documents/meetings/wkshp2005/Han_ORA.ppt.].
Lacis, A. A., and V. Oinas, 1991: A description of the correlated k distribution method for modeling nongray gaseous absorption, thermal emission, and multiple scattering in vertically inhomogeneous atmospheres. J. Geophys. Res., 96 , 9027–9063.
Landau, D., and K. Binder, 2000: A Guide to Monte Carlo Simulations in Statistical Physics. Cambridge University Press, 398 pp.
Liu, X., J-L. Moncet, D. K. Zhou, and W. L. Smith, 2003: A fast and accurate forward model for NAST-I instrument. Optical Remote Sensing, OSA Technical Digest, Optical Society of America, paper OMB2. [Available online at http://www.opticsinfobase.org/abstract.cfm?URI=ORS-2003-OMB2.].
Matricardi, M., 2003: RTIASI-4, a new version of the ECMWF fast radiative transfer model for the infrared atmospheric sounding interferometer. ECMWF Tech. Memo. 425, 65 pp. [Available online at http://www.ecmwf.int/publications/.].
McMillin, L. M., and H. E. Fleming, 1976: Atmospheric transmittance of an absorbing gas: A computationally fast and accurate transmittance model for absorbing gases with constant mixing ratios in inhomogeneous atmospheres. Appl. Opt., 15 , 358–363.
McMillin, L. M., and H. E. Fleming, 1977: Atmospheric transmittance of an absorbing gas. 2: A computationally fast and accurate transmittance model for slant paths at different zenith angles. Appl. Opt., 16 , 1366–1370.
McMillin, L. M., H. E. Fleming, and M. L. Hill, 1979: Atmospheric transmittance of an absorbing gas. 3: A computationally fast and accurate transmittance model for absorbing gases with variable mixing ratios. Appl. Opt., 18 , 1600–1606.
McMillin, L. M., M. D. Goldberg, H. Ding, J. Susskind, and C. D. Barnet, 1998: Forward calculation for interferometers: Method and validation. Appl. Opt., 37 , 3059–3068.
Mlawer, E. J., S. J. Taubman, P. D. Brown, M. J. Iacono, and S. A. Clough, 1997: Radiative transfer for inhomogeneous atmospheres: RRTM, a validated correlated-k model for the longwave. J. Geophys. Res., 102 , 16663–16682.
Moncet, J-L., 2000: Radiance modeling. U.S. Patent 6 584 405, filed 5 May 2000 and issued 24 June 2003.
Moncet, J-L., and Coauthors, 2001a: Algorithm theoretical basis document for the Cross-Track Infrared Sounder (CrIS) Environmental Data Records (EDR), version 1.2.3. AER Doc. P882-TR-E-1.2.3-ATBD-03-01, 157 pp. [Available online at http://eic.ipo.noaa.gov/IPOarchive/SCI/atbd/cris_atbd_03_09_01.pdf.].
Moncet, J-L., and Coauthors, 2001b: Algorithm theoretical basis document (ATBD) for the Conical-Scanning Microwave Imager/Sounder (CMIS) Environmental Data Records (EDRs), version 1.4. Vol. 2: Core physical inversion module. AER Doc. P757-TR-I-ATBD-CORE-MODULE-20010315, 126 pp. [Available online at http://eic.ipo.noaa.gov/IPOarchive/SCI/atbd/ATBD_V.02CorePhysicalInversionModule.pdf.].
Peytremann, E., 1974: Line-blanketing and model stellar atmospheres. I. Statistical method and calculations of a grid of models. Astron. Astrophys., 33 , 203–214.
Saunders, R., and Coauthors, 2007: A comparison of radiative transfer models for simulating Atmospheric Infrared Sounder (AIRS) radiances. J. Geophys. Res., 112 .D01S90, doi:10.1029/2006JD007088.
Seemann, S. W., J. Li, W. P. Menzel, and L. E. Gumley, 2003: Operational retrieval of atmospheric temperature, moisture, and ozone from MODIS infrared radiances. J. Appl. Meteor., 42 , 1072–1091.
Sneden, C., H. Johnson, and B. Krupp, 1976: A statistical method for treating molecular line opacities. Astrophys. J., 204 , 281–289.
Strow, L. L., S. E. Hannon, S. De Souza-Machado, H. E. Mottler, and D. Tobin, 2003: An overview of the AIRS radiative transfer model. IEEE Trans. Geosci. Remote Sens., 41 , 303–313.
Sun, Z., and L. Rikus, 1999: Improved application of exponential sum fitting transmissions to inhomogeneous atmospheres. J. Geophys. Res., 104 , 6291–6303.
Susskind, J., C. D. Barnet, and J. M. Blaisdell, 2003: Retrieval of atmospheric and surface parameters from AIRS/AMSU/HSB data in the presence of clouds. IEEE Trans. Geosci. Remote Sens., 41 , 309–409.
Thomas, G. E., and K. Stamnes, 1999: Radiative Transfer in the Atmosphere and Ocean. Cambridge University Press, 546 pp.
Tjemkes, S. A., and J. Schmetz, 1997: Synthetic satellite radiances using the radiance sampling method. J. Geophys. Res., 102 , 1807–1818.
West, R., D. Crisp, and L. Chen, 1990: Mapping transformations for broadband atmospheric radiation calculations. J. Quant. Spectrosc. Radiat. Transfer, 43 , 191–199.
Wiscombe, W. J., and J. W. Evans, 1977: Exponential-sum fitting of radiative transmission functions. J. Comput. Phys., 24 , 416–444.
APPENDIX
Examples of Jacobian Performance
Comparisons of analytical Jacobians from OSS with finite difference Jacobians from LBLRTM have shown very good agreement for temperature, water vapor, and ozone (Fig. A1). The accuracy of the OSS temperature Jacobians meets the criterion for “excellent” quality proposed by the International TOVS Study Conference Radiative Transfer Working Group (Garand et al. 2001). Although not apparent in these plots, some minor oscillations in the temperature Jacobians may occur in some circumstances, originating from the fact that, in the OSS parameterization, the Jacobians for a channel are obtained as a superposition of a finite number of monochromatic Jacobians that reach their maximum value at different altitudes. Even when the number of OSS nodes is sufficient for reaching a 0.05-K accuracy threshold, the degree of overlap of the weighting functions associated with individual nodes may be such that small depressions in the profiles of temperature derivatives become apparent between the peaks of those weighting functions. If there is a practical need to do so, the amplitude of these oscillations can be decreased by increasing the number of nodes, by either adopting a stricter accuracy threshold or using a more challenging training set. Similarly, OSS Jacobians with respect to an absorbing gas may have larger relative differences from the reference for channels in which the gas has very little impact on the radiance and other absorbers are dominant. OSS training operates on all the gases at once, which effectively weights each gas according to how much the variability in its concentration impacts the TOA radiances. One simple way to force the OSS model to more closely match the Jacobians associated with minor absorbers is to include, in the training, profiles in which the minor absorbers are the only atmospheric constituents; however, we are have not encountered any circumstances for which this revision would have any practical benefit.
The internal consistency of OSS with respect to its analytical Jacobians and finite differences of its radiances is implicit in the validations of radiances and Jacobians against LBLRTM (Figs. 7 and A1). Nevertheless, we have done additional comparisons against finite differences of OSS radiances and found that the criterion for excellent match is met for temperature and for all significant absorbing gases.
Example of errors in TOA brightness temperature introduced by approximation (5) for a portion of the AIRS channel set (Aumann et al. 2003). These errors include neglect of spectral variations of the Planck function across the channel for AIRS, which accounts for up to ∼0.1 K error, so the error due to the reflected radiation is dominant in this part of the spectrum. Emissivities are from a uniform random distribution in the range 0.8 to 1, representing values for surfaces from deserts to oceans, and were assumed to be constant across each channel bandwidth. No natural surface would have emissivities as low as 0.8 over every channel, so the larger errors would be limited to the channels where the emissivities are lowest for the given surface. The atmospheric conditions were from the 364-profile set described in section 4.
Citation: Journal of the Atmospheric Sciences 65, 12; 10.1175/2008JAS2711.1
Radiance fitting error vs number of nodes for the W/E approach, MC approach, and global search. The experiments used a square impulse (“boxcar”) instrument function of width 10 cm−1 centered at 740 cm−1.
Citation: Journal of the Atmospheric Sciences 65, 12; 10.1175/2008JAS2711.1
The number of OSS nodes required to achieve (a) 0.05-K and (b) 0.01-K accuracies in all channels for a typical cross-track scanning sensor with contiguous 5 cm−1 boxcar response functions spanning the 650–850 cm−1 domain.
Citation: Journal of the Atmospheric Sciences 65, 12; 10.1175/2008JAS2711.1
Impact on nadir TOA radiances of variability in water vapor and ozone concentration for contiguous 5 cm−1 boxcar response functions spanning the 650–850 cm−1 domain.
Citation: Journal of the Atmospheric Sciences 65, 12; 10.1175/2008JAS2711.1
Average number of OSS nodes per boxcar vs boxcar width in the intervals as labeled, for (a) 0.05-K and (b) 0.01-K accuracy thresholds.
Citation: Journal of the Atmospheric Sciences 65, 12; 10.1175/2008JAS2711.1
Brightness temperature errors as a function of average number of nodes (selected spectral points) in contiguous (top) 0.5 cm−1 and (bottom) 5 cm−1 wide boxcar functions covering the interval 740–760 cm−1, for (left) OSS and (right) RSM.
Citation: Journal of the Atmospheric Sciences 65, 12; 10.1175/2008JAS2711.1
Validation of OSS fit for the AIRS channel set in terms of rms error of brightness temperature for profiles with emissivities of (a) 0.98 and (b) 0.80. The errors are from the combined processes of spectral sampling and interpolation of absorption coefficients from a lookup table. The ECMWF profile set from section 4 was used for training and the UMBC profile set was used for validation. The errors without interpolation of absorption coefficients, where coefficients were obtained from the line-by-line model directly for each atmospheric profile, covered very nearly the same range as those plotted here, being lower for some channels and higher for others, but within 0.02 K of these total errors (maximum absolute difference) for every channel.
Citation: Journal of the Atmospheric Sciences 65, 12; 10.1175/2008JAS2711.1
The number of nodes selected for each AIRS channel using localized training, (a) including redundancy and (b) excluding redundancy, such that each selected node is counted as one divided by the number of channels for which the node is used. The sum of all the values from (b) is thus equal to the total number of nodes Ntot to fit the full AIRS channel set. The total atmospheric absorption for the U.S. Standard Atmosphere is plotted for reference.
Citation: Journal of the Atmospheric Sciences 65, 12; 10.1175/2008JAS2711.1
Fig. A1. Comparison between LBLRTM (solid) and OSS (dashed) Jacobians for selected AIRS channels and UMBC profiles, for (a) temperature, (b) water vapor, and (c) ozone. The derivatives were converted from radiance to equivalent brightness temperature and, for (b) and (c), they are with respect to the logarithm of the mixing ratio.
Citation: Journal of the Atmospheric Sciences 65, 12; 10.1175/2008JAS2711.1
Detailed node counts and computation times with localized training for the AIRS instrument with the full channel set and the 281-channel subset used at NCEP (Susskind et al. 2003);
Number of OSS nodes required for 0.05-K accuracy for selected MODIS channels.
Characteristics of the OSS approach.
Note that the dependence of the self-broadening on water vapor concentration should be expressed, strictly speaking, in terms of number concentrations. The use of mass units simply avoids having to include a unit conversion in the OSS model. The impact of the errors introduced by this shortcut was found to be small compared to the other sources of errors even in regions where the self-broadened continuum dominates.