1. Introduction
Finding the rotational center of an existing tropical cyclone (TC) is an essential step in the analysis and forecasting of the event. Studies have shown the importance of accurate initial positioning to intensity estimation methods (Olander et al. 2004; Velden et al. 2006) and operational forecasts (J. L. Franklin 2009, personal communication). In addition, numerical model track predictions can benefit from synthetic (bogus) TC observations (Goerss 2009) and/or initial TC motion vectors (Aberson 2002) that rely on accurate positioning. Given that the majority of TCs are away from sources of direct observation, the primary substantial observations are from satellites [with the exception of the aircraft reconnaissance program in the western North Atlantic Ocean (NATL) TC basin].
Storm positions are routinely estimated and provided by TC analysis and warning agencies around the globe. The primary observing tool derives from visible (Vis) and infrared (IR) imagery provided by operational geostationary satellites. Static images and animated sequences are used to “fix” a TC center position based on the structure and organization of cloud features. Often a clear and well-formed eye is apparent, making the fixes generally reliable. However, the Vis–IR methods can struggle in situations where the eye appears ragged or is partially obscured, or the TC center is totally covered by a central dense overcast. In these circumstances, passive microwave (PMW) observations from polar-orbiting satellites can play a crucial role in revealing convective organization and eyewall structure that would otherwise be obscured by cloud tops (Velden et al. 1989; Hawkins et al. 2001; Patadia et al. 2004). The 85–92-GHz observations are also more frequent and of higher resolution than other PMW observations from various platforms, and they will continue to be more frequent in the future as the current generation of polar-orbiting satellites is replaced (Velden and Hawkins 2010). However, PMW observations can also present a unique set of challenges for applications to TC center positioning. These challenges include the irregular temporal image spacing (ranging from ∼30 min to ∼24 h but averaging 3–6 h for the current set of operational conical scanners), comparatively low resolution (currently 6–16 km for the 85–92-GHz conical scanners), the need for regridding and normalization between varying sensors, and corrections for scan geometries.
In this paper, we present an initial attempt to develop an objective TC center-fixing algorithm that takes advantage of the 85–92-GHz viewing strengths, while attempting to account for the limitations. Measurements from the Defense Meteorological Satellite Program (DMSP) Special Sensor Microwave Imager (SSM/I) and Special Sensor Microwave Imager Sounder (SSMIS), the Aqua Advanced Microwave Scanning Radiometer for Earth Observing System (AMSR-E), and the Tropical Rainfall Measuring Mission (TRMM) Microwave Imager (TMI) are employed in the investigation. Only the 85–92-GHz spectral channels are considered in depth in this phase of the algorithm development because of their superior resolution and observational frequency among the variety of microwave observations. It is recognized that other channels (i.e., 37 GHz and IR) could lend additional information, and these will be briefly explored at the end of the paper.
Most of the recent efforts toward objective center-fixing of TCs have been focused on ground-based radar imagery. Wood (1994) fits the Doppler velocity field to an idealized Rankine vortex to estimate the rotational center. Wong et al. (2004, 2008) present an algorithm to calculate the center of rainband spiral patterns in radar reflectivity. Subsequently, Wong et al. (2007) and Wong and Yip (2009) developed an algorithm that calculates the TC rotational center from a motion field analysis, which is calculated from feature tracking with sequential radar reflectivity images. Chang et al. (2009) describe an algorithm that determines the center of the low-reflectivity region of the TC eye, and uses information from sequential images to insure consistency. Techniques for geostationary IR imagery include Lee and Liu (2001) and Zhang et al. (2005), but these methods apply only to select cases.
Although radar and PMW imagery are not identical, their observed structural patterns are quite similar and lend themselves to similar image processing techniques (Wimmers and Velden 2007). The irregular spacing of sequential PMW images prevents the application of any technique that requires feature tracking and the temporal filtering of results. However, the spiral-band analysis methods hold promise despite two shortcomings. The first shortcoming is theoretical: parallax, vertical shear, and precession in the central vortex can cause a natural offset between the center indicated by elevated spiral bands and the surface-level center. The second shortcoming is practical: when the imagery has no well-defined and organized banding pattern, the algorithm cannot be applied. This problem is largely compensated with Kalman filtering in Wong et al. (2004), but Kalman filtering requires a high temporal resolution lacking in PMW observations. The PMW method described here attempts to account for these shortcomings, and combines the following features: first, it can work independently and objectively on individual images; second, it accounts for both eye patterns and any banded circulation patterns; third, it includes a “default” option to fall back to a predefined first-guess position when the detection signal is determined to be too weak; and fourth, it has been tested and refined in operational applications for several years to insure robustness.
This paper introduces a PMW center-fixing method, which we call the Automated Rotational Center Hurricane Eye Retrieval (ARCHER). Section 2 describes the methodology and components of the algorithm; section 3 details the calibration and optimization of the technique using two Atlantic TC seasons’ worth of data. Section 4 validates the method with an independent dataset of the 2005 Atlantic TC season; section 5 explores the application of ARCHER to other kinds of satellite imagery; and section 6 concludes with highlights and a discussion of the inherent issues with TC center fixing. In addition, appendix A characterizes the operational errors in TC forecasting used to estimate the expected first-guess error corresponding to a given basin; and appendix B presents a method for correcting the algorithm error values to account for uncertainties in the best-track reference dataset.
2. Center-fixing methodology
The ARCHER algorithm finds the position in the image that is centered by the spiral-patterned image edges as well as the edges of a whole or partial eye. To accomplish this, two separate components are applied. A spiral-centering algorithm finds the optimal center of a spiral pattern in the image. Second, a ring-fitting algorithm finds the edges most likely associated with the inner eyewall. The two components are complementary: the spiral component guides the algorithm toward the correct candidate for the eye (and away from false eyes), and the eye-fitting component adds precision to the more holistic spiral estimate. The formulae presented here contain many weights and other parameters. The most sensitive parameters are described and optimized in the next section. However, there are too many parameters to fully optimize independently in this study, and therefore the other less sensitive parameters were determined instead by careful trial and error and are presented in Table 1.
The preferred unit of distance in this study is great circle degrees (1 great circle degree ≈ 111.18 km). However, for brevity, this term is shortened to “degrees.” Thus the reader should note that distance described here in degrees is independent of direction, unlike in ordinary navigation.
a. Image preprocessing
Parallax is corrected by applying a uniform adjustment to the image. The navigation of the image is shifted ∼12 km (with the exact number depending on the instrument’s zenith angle) toward the satellite to compensate for the parallax of features at a height of 10 km. Of course, the height of the 85–92-GHz signal only averages 10 km, and can vary about that height, but the error caused by this difference is on the order of only a few kilometers, and is therefore minor compared to other sources of error. After correcting for parallax, the brightness temperature image is interpolated to a regular grid (evenly spaced latitude and longitude coordinates) for input into the centering algorithm.
b. First-guess position
An initial estimate of the rotational center is a necessary input to the algorithm because the algorithm must operate on a limited-size domain and because a first-guess position can serve as a useful reference point to check the validity of the algorithm result. Operationally, the ARCHER algorithm obtains its first-guess position from the most recent forecast track provided by the National Hurricane Center (NHC) for TCs in the North Atlantic and east Pacific basins, and by the Joint Typhoon Warning Center (JTWC) for TCs in other basins. However, in the calibration and validation of the ARCHER algorithm the first-guess position is set at a controlled offset distance from the NHC best-track position (section 3).
c. Large-scale method: Spiral centering
The spiral centering is performed by application of a two-step process: 1) a large-domain, coarse spiral score (CSS) function, and 2) a relatively smaller-domain fine spiral score (FSS) function. The reason for having two functions operate rather than one is that a large-domain function is effective at determining a rough estimate of the rotational center even with a very poor first-guess estimate of position, whereas a smaller domain function has more accuracy if it is provided with a good initial position estimate (supplied by the coarse step).
(cSS, c0) are coefficients (15, 20) to scale the result to a range of 0–50,
(ϕ, θ) are (longitude, latitude) in the spiral score field,
N is the number of points in the sample disk,
i indicates a single a point inside the sample disk,
disk is the domain of N grid points in a disk centered on the initial guess position (the “sample disk”),
I is the microwave horizontal polarization brightness temperature image (and the log function on I de-emphasizes the extreme gradients),
S is the log-5 spiral vector field,
α is the inclination of the spiral vector field in radians (zero indicates a circular vector field and a log-5 field is inclined 5°, or 0.087 radians, outward from a circular vector field),
± means “+” in the Northern Hemisphere and “−” in the Southern Hemisphere, and ∓ means the opposite,
(x̂, ŷ) are the unit vectors for north and east, respectively, and
(xi, yi) are offset distances in the (x̂, ŷ) directions, respectively, where xi is normalized to be approximately equal in spacing to yi.
This function essentially computes a score that reflects the average alignment of the image boundaries with a spiral centered at a given point (ϕ, θ). This alignment is expressed in terms of the cross product of the image gradient and its collocated element of the spiral field. The vector norm function (‖·‖) is necessary to give equal prominence to gradients directed either radially inward or outward. We use a log-5 spiral, which is less than the inclination generally associated with TC spiral patterns, because the gradients associated with horizontal shear throughout the image are generally more circular than the patterns of spiral bands. (However, the algorithm is not very sensitive to the particular value of the spiral inclination.) In practice, scores from this function are calculated at a spacing of several grid cells over a predetermined domain of points, and a successful result produces a contoured bull’s-eye pattern focused on or near the rotational center (Fig. 1).
d. Small-scale method: Ring centering
RSR is ring score radius, the ring score for each possible radius,
(ϕ, θ) are (longitude, latitude) in the ring score field,
cRSR is a constant (value of 250) to scale the result to a range of 0–100,
r is the length of the radius being evaluated (units: degrees),
N is the number of points in the sample ring,
i is the index for a point on the sample ring,
I1/3 is the microwave horizontal polarization brightness temperature image to the power of ⅓ (to deemphasize the higher gradients),
r̂ is the unit vector pointed radially outward from the center of the ring to a point of index i on the ring, and
max[RSR(ϕ, θ, r), r] is the maximum of the three-dimensional field RSR over the dimension r.
Of course not every TC image contains a well-defined eye. In such events, the ring score is normally quite low. This means that unless the corresponding spiral score is very high, the ARCHER algorithm will default to the first-guess position rather than use the position with the maximum score. This process is discussed further in the next subsection.
e. Optimized combination of spiral and ring centering
By design, the spiral-centering score field and the ring-centering score field combine into a weighted sum that takes advantage of the two methods’ complementary strengths. Specifically, the ring centering yields highly precise results for a well-defined eye, and the spiral-centering method is important both when an eye is not evident and when the ring-centering method finds one or more false eyes (i.e., inner-core, rain-free moat regions caused by convective downdrafts or environmental dry air intrusions). In the latter case, the spiral method provides a critically important additional weight to the feature nearest the rotational center.
One extra component in the combined scores is a distance penalty field that turns spiral score fields into “guided” spiral scores [Eqs. (5) and (6)]. The distance penalty is a field that varies with the square of the distance from the initial guess. Therefore it is minor near the first-guess position and dominant at the edges of the analysis domain. This acts as a check to insure that no matter how unusual the image, the final result will not stray unreasonably far from the first-guess position.
The scores are combined in a multistep algorithm as follows:
1) Step 1: Guided coarse spiral (GCS)
2) Step 2: Guided fine spiral (GFS)
3) Step 3: Ring score
Calculate the RS from Eq. (4).
4) Step 4: Combined score
3. Empirical calibration
The objective of this section is to find the optimum values of two parameters: the relative weight of the spiral score to the ring score and the optimum default threshold for the maximum combined score.
a. Calibration dataset: 2007–08 North Atlantic TC seasons
The calibration dataset is selected from microwave imagery of TCs during the 2007 and 2008 North Atlantic storm seasons (Table 2). Taken together, the data from these two years make up a representative sample of a wide variety of TC structures intensities. The “truth” position of the rotational center for each case is a cubic interpolation of the NHC best-track dataset. To insure that the truth dataset is as accurate as possible, only the best-track values ±3 h from an aircraft reconnaissance position fix are used. This is the most limiting constraint on the sample size. Additional criteria for the microwave observations are the following: the best-track position must lie inside the satellite swath (allowing for partial coverage cases if the coverage of the TC exceeds 50%); all storms are required to be south of 40°N latitude to serve as a coarse exclusion of extratropical transition cases; and all observed TCs must have a surface wind speed greater than or equal to 34 kt (1 kt ≃ 0.5 m s−1) because organized rotation is rarely evident in PMW imagery below tropical storm classifications.
It is desirable to partition the data sample by cases showing the effects of “weak vertical shear” and “strong vertical shear” because the displacement between surface center of rotation and image–height center of rotation can lead to difficulties in the application of the algorithm. Just as importantly, vertical shear can adversely effect the organization of the TC and disrupt the conventional spiral-eye configuration that makes the algorithm successful. Such a disruption naturally complicates the process of center-fixing. Unfortunately, systematic effects of vertical shear on vortex tilt and rotational asymmetries are not well understood or documented to the point that any particular static environmental analysis can serve as an effective measure to adequately stratify the sample. For example, the period of time a shear regime has been acting on a vortex might be just as important as the strength of the shear or the TC intensity at the analysis time. Consequently, several attempts to use objective environmental vertical shear statistics to partition the data did not yield very effective results.
However, it can be shown that the zonal translation velocity of the TC, which is easily extracted from the best track, can serve as a useful proxy for distinguishing TCs that are well-suited to the ARCHER algorithm from those in which vortex tilt and asymmetric structure are more likely to pose problems. Figure 3 shows that algorithm error increases sharply where TC eastward translation speeds (Vx) exceed 5 kt. This increase in error is predominantly caused by recurvature during which stronger westerly wind shear regimes can often quickly transform an organized vortex into more distorted asymmetric and/or tilted structures. Consequently, the calibration and validation steps partition the observations into TCs in which Vx < 5 kt (group A, including all westward-moving storms) and Vx ≥ 5 kt (group B). Only group A TCs are used in the calibration, because our investigation finds that the ARCHER algorithm is effective in only a minority of group B cases. The validation of the ARCHER algorithm in group B cases is handled in a separate analysis.
b. Method
We approximate the real-time distribution of forecast error on the dataset by simulating 12 scenarios per image of a first-guess position displaced to varying offsets from the best-track position: 0.1°, 0.4°, and 0.7° in the four cardinal directions. Thus, the total number of cases (3972) in the calibration is the number of microwave images in the calibration dataset (331) times the number of scenarios per image (12). The statistics from these cases (such as RMS) are then combined into weighted averages to approximate the algorithm performance for the North Atlantic operational environment, which has a certain probability of small (0.1°), medium (0.4°), and large (0.7°) forecast (first guess) position errors. The weights have been determined according to the observed distribution of forecast position error in the North Atlantic operational environment (appendix A).
The guided fine spiral and ring score fields are calculated for each case, and the final positions determined for a range of spiral weights (wGFS) and score threshold values (τ). The errors of the final ARCHER positions form the basis of the performance statistics. With a set of statistics for the algorithm over this range of weights and thresholds, we can then judge the performance in order to identify the optimal weight–threshold combination, based on the following criteria:
RMS: Minimum possible RMS position error between the algorithm centers and the truth (best track) values.
Number of excessive errors: For all the scenarios, no case should have an algorithm position error exceeding the largest first-guess error in the simulation (0.7°).
Target usage: When presented with a range of parameter values that produce similar accuracy statistics, the value with the fewest defaults to first-guess positions will be selected. This is desirable because some amount of uncertainty in the application of the best-track dataset is inevitable; therefore the algorithm error is somewhat overstated by the statistics.
Several iterations of experimentation showed that three groupings of cases based on TC maximum surface wind speed (Vmax) required separate weight–threshold combinations: tropical storm (34 ≤ Vmax < 65 kt), category 1 hurricane (65 ≤ Vmax < 84 kt), and category 2–5 hurricane (Vmax ≥ 84 kt). Counter to our expectations, the data organized itself in this “trimodal” pattern rather than any sort of continuous evolution with increasing Vmax. The final results are organized in this fashion and the weight–threshold combinations optimized accordingly.
c. Results and discussion
Each row addresses one of the three criteria for parameter evaluation, weighted by the relative frequency of 0.1°, 0.4°, and 0.7° displacements in the North Atlantic (Table A1, described below in appendix A). The first row is the RMS error of the ARCHER position (with respect to the best-track position). The second row of Fig. 4 identifies the number of excessive errors, indicated by the (weighted) number of cases in which the errors exceed the largest offset distance used in the simulations (this can be a fractional value lower than 1 because of the weighting scheme). The third row in this figure is the fraction of cases in which the algorithm position was applied, as opposed to defaulting to the first guess. Finally, the fourth row plots the distribution of distances between the best track and the ARCHER result. This distribution is not weighted because a nonweighted version is easier to read and interpret.
For each column, a magenta polygon outlines the area in weight–threshold space that achieves the best compromise between the three criteria: low RMS, few excessive errors, and a high fraction of applied targets. The values in these polygons show that one should expect the algorithm performance to be essentially the same throughout the range of weights and thresholds therein. However, a specific weight and threshold value must be used in the validation, and so a specific point is selected for each group (marked with a white circle), based on the robustness of this point in weight–threshold space. The values corresponding to these points are presented in Table 3.
4. Algorithm validation
The purpose of this section is to characterize the accuracy of the ARCHER algorithm with a dataset that is independent of the calibration. The results are meant to provide a record of performance to help users take the best possible advantage of the algorithm output and assign the proper level of confidence to its results.
a. Method
The validation method is similar to that of the calibration section. First-guess forecast position error is simulated with displacements from the best-track position in the four cardinal directions at 0.1°, 0.4°, 0.7°, and 1.0°. (Although 1.0° is not used to compute basinwide statistics, it is included here just to show algorithm performance in a more extreme condition.) The performance statistics for each of these simulated first-guess displacements are combined in weighted averages to estimate the overall performance in the North Atlantic and other TC basins. As with the algorithm calibration, performance statistics for a particular basin are estimated by weighting a statistic (such as RMS position error) for the relative occurrence of small (0.1°), medium (0.4°), and large (0.7°) offsets in the forecast position corresponding to the distribution of forecast error common to that basin. This approach of simulating first-guess error in a variety of ways was preferable to using the single real-time NHC forecast at the time of each image because it generated more robust results for the sensitivity to forecast error.
The dataset is partitioned into group A and group B cases. Throughout the validation, the empirical difference between the best-track position and the “true” rotational center at the height and navigation of the image is a very important consideration in interpreting the results, and so this issue is analyzed as well.
b. Validation dataset: 2005 North Atlantic hurricane season
The very active 2005 hurricane season of the North Atlantic was chosen for the independent validation dataset, with 241 separate 85–92-GHz PMW observations. The constraints for data quality were the same as with the calibration dataset: all observations must be ±3 h from an aircraft reconnaissance position fix, the best-track position must be inside the satellite swath, TCs must be south of 40°N latitude, and TCs must have maximum surface wind speeds greater than 34 kt. Of these 241 observations, approximately 40% are tropical storms, 20% are category 1 hurricanes, and 40% are category 2–5 hurricanes.
c. Algorithm performance for group A TCs
To interpret the performance of the algorithm, the results are first partitioned into a matrix of histograms (Fig. 5) that breaks the cases down by TC intensity (rows) and first-guess displacement (columns). A few trends and features in Fig. 5 are immediately apparent. The consequence of the algorithm defaulting to the first guess is a peak in the histogram at 0.1°, 0.4°, 0.7°, or 1.0°. The number of defaults decreases with increasing TC intensity, with defaults predominant in tropical-storm-strength TCs and almost nonexistent with category 2–5 hurricanes. Each group of TC intensities (the top three rows) shows a different feature that dominates the distribution of error. For tropical storms, the likelihood of a default is high for all values of first-guess position displacement. For category 1 hurricanes, the number of defaults increases with first-guess displacement (increasing from left to right), showing that the ARCHER score for the target center is more likely to fall below the threshold as the accuracy of the first-guess decreases. For category 2–5 hurricanes, the peak of the error distribution is unchanged with the accuracy of the first guess, and the shape remains similar except that the tail of the histogram grows slightly as the accuracy of the first-guess decreases.
Another significant feature of the category 2–5 hurricane row of histograms is that the error peaks at 0.05° rather than 0, as one might expect in a grouping where a majority of cases have well-defined eyes. This suggests that a portion of the error in the ARCHER product is due to an “intrinsic error” in the correspondence between the PMW images and the best-track dataset, which causes a departure of the best-track positions from the true center indicated by the image. Several factors contribute to the difference between best-track center and the center of rotation at the level of the microwave retrieval in a given image: vortex tilt caused by vertical shear or vortex precession, residuals in parallax correction, satellite navigational error, and best-track error.
This knowledge is important to consider in applications that use the ARCHER result because most applications (such as intensity estimation) require the image-relative center rather than the center of rotation at the surface. Therefore, it is highly useful to quantify this type of intrinsic error between the best track and the image-relative center if at all possible. Appendix B provides an estimation of the intrinsic error and shows that it can be approximated as an additive component of the ARCHER product’s variance with respect to the best track. In the following analysis, this approximation is used to complement the standard RMS error (relative to the best track) with a corresponding image-relative RMS error value.
Table 4 presents these complementary statistics. Results are sorted in columns by first-guess displacement, and then by estimates for the values in a North Atlantic basin environment in the fifth column and all other basins in the sixth column. The statistic “percentage applied” (percentage of ARCHER target positions applied rather than defaulting) varies greatly by TC intensity. For tropical storms, only about 17% of the cases use the ARCHER target position; whereas, about 84% of category 1 cases and 99.9% of the category 2–5 cases use the ARCHER target position. The statistic “percentage worsened” (percentage of cases in which the final position was in error by greater than 0.71° for first-guess displacements of 0.1°–0.7° and greater than 1.1° for first-guess displacements of 1.0°) was very low: less than 1% in all relevant groupings. (The value was 1% for one grouping when the first-guess displacement is 1.0°, which is already a very rare circumstance operationally.)
Trends in image-relative RMS are best explained visually (Fig. 6). The RMS of the ARCHER target positions (left plot) is encouragingly small (∼0.05°) for category 1 and category 2–5 hurricanes when the first-guess displacement is small (0.1°). The RMS for these two groups increases at roughly the same rate with increasing first-guess displacement. This increase with displacement is of the same order of magnitude as the RMS at the lowest first-guess displacement, showing that the sensitivity of the ARCHER algorithm to the first-guess displacement is significant but does not reduce the overall performance appreciably, because first-guess displacements of 0.4° and greater are considerably less common. However, the ARCHER target RMS for tropical-storm-strength TCs is significantly higher. The RMS at 0.1° does not even exceed accuracy of the default displacement. However, Table 4 shows that the tropical storm ARCHER target RMS for the North Atlantic basin (0.171°) and other basins (0.173°) is lower on the whole than if the default value were used every time.
The right side of Fig. 6 shows that the percentage of defaults in the ARCHER algorithm is still significant for tropical storm and category 1 hurricanes. However, it is important to note that the frequency of occurrence of these cases decreases sharply as the displacement increases, and so these errors at displacements of 0.4 and 0.7 have only a small impact on the general algorithm performance.
Table 4 (Vmax ≥ 34, All) provides the aggregate results for the ARCHER algorithm in the right two columns. Overall, the image-relative RMS of the applied ARCHER targets is 0.064° (7.1 km) for the North Atlantic basin and 0.067° (7.4 km) for all other basins. When including the cases in which ARCHER must default to the first-guess, the ARCHER image-relative RMS is 0.146° (16.2 km) for the North Atlantic basin and 0.165° (18.3 km) for all other basins. However, when considering only category 1–5 hurricanes (fourth grouping in Table 4), the overall performance is considerably better, with an image-relative RMS of 0.080° (8.8 km) in the North Atlantic and 0.087° (9.7 km) in the other basins.
d. Algorithm performance for group B TCs
The 2005 North Atlantic observations of group B TCs (eastward translation speed Vx ≥ 5 kt) verified by aircraft reconnaissance do not provide a large enough sample to perform a statistically significant validation (45, 8, and 45 cases for tropical storm, category 1, and category 2–5 hurricanes, respectively). Instead, we use a combined dataset from the 2005, 2007, and 2008 seasons to validate the ARCHER algorithm for these cases. This does not violate the independence of the calibration and validation datasets, because the group B cases for 2007–08 were not included in the calibration.
Figure 7 shows a histogram matrix corresponding to the group B validation; note that the x axes have changed in scale. In this figure, both the category 1 and category 2–5 hurricane rows show peaks in their distributions at 0.15°–0.20°, which is greater than the position of the peaks for group A storms. This suggests two possible factors: 1) a larger intrinsic error, or 2) a systematic error in resolving the center of group B storms caused by possible effects from high-shear–tilt in extratropical transition events to be expected in this grouping. Our examination of all group B images for category 2–5 storms shows that both factors are likely in effect.
Throughout the group B sample we also observe a common, alternative mode of convective organization primarily responsible for the algorithm’s systematic error here, which we will refer to as the “hook” pattern. This systematic pattern appears to contribute significantly the histogram peaks at 0.15°–0.20°. Figure 8 shows a typical example of a hook pattern. In this image, convection is dominant in the north, curving cyclonically inward toward the best-track center of rotation. However, the guided fine spiral algorithm finds a center of the curvature to the south, indicated by low-level gradients and the other banding structure. Furthermore, the absence of any clear eye leads to a weak ring score field. Thus the guided fine spiral score field dominates in the combined result and the final position falls farther away from the best track than does the first guess. This may appear to be easily remedied by weighting the spiral score less, or setting the threshold higher, but each of these options tend to increase the error in other cases—for example, by choosing the wrong eye pattern or unnecessarily defaulting to the first guess.
Table 5 provides the numerical results of the validation for group B TCs. Statistics from this table show that for group B, the ARCHER target position is used less often for tropical storms and category 1 hurricanes than it is used for group A cases of corresponding intensities. Although the target position is used in more than 50% of the cases of the category 1 grouping, the percentage of cases in which the algorithm error exceeds the 0.71° threshold (% worsened) is substantial: ∼10%. Judging by inspection of the whole dataset, the largest displacements between best-track and the ARCHER target positions occur during convective hook patterns, whereas cases with coherent eyewalls are still resolved accurately.
5. Applications with other satellite imagery
Although ARCHER was designed for and calibrated to operate on 85–92-GHz microwave brightness temperatures, the algorithm can be easily adapted to imagery from other satellite instruments. The following examples demonstrate a similarly robust application of the ARCHER algorithm to longwave infrared and 37-GHz passive microwave imagery.
a. Geostationary longwave infrared
It is desirable to investigate the possibility of ARCHER working effectively on geostationary infrared (IR) imagery, because the temporal availability is much greater than that of PMW. TCs often share similar structural characteristics viewed in the IR as with the 85–92-GHz PMW imagery: curved gradients throughout the areas of convective banding, and an eye with a local brightness temperature maximum. The image values also vary in the same direction: convection causes lower brightness temperatures and subsidence causes higher brightness temperatures. The two main differences are that infrared gradients tend to be less sharp but more widespread (because IR gradients generally follow contours in cloud height rather than edges of convection–precipitation) and the eye is more often obscured. Fortunately, the ARCHER algorithm can operate well in these environments because weak gradients can have a significant effect on the spiral scores, and also, the combination of spiral scores and ring scores give the algorithm a measure of resilience to obstructions in the eye.
Figure 9 shows the performance of the ARCHER algorithm on Geostationary Operational Environmental Satellite (GOES-12) IR image during an intensifying stage of Hurricane Katrina (2005) in the Gulf of Mexico. In this scenario, the first-guess position is located 0.7° (77 km) to the west. The guided fine spiral score is affected only by the gradients of the eyewall and the minor gradients caused by rotation in the surrounding cloud cap. The ring score is at a maximum in the center of the eye, and is not adversely affected by the clouds in the southwest portion of the eye. The final result is in excellent agreement with the best-track position (within 0.05°, or 5 km).
The ARCHER performance is also good when applied to major hurricanes with no discernable eye patterns. In Fig. 10, the first-guess position is 0.7° (77 km) to the west of the best-track position, yet the spiral score is still guided to the correct center. The strong spiral score in this scenario clearly dominates the combined score, leading to a center estimate that is 0.16° (18 km) away from the best-track center.
b. The 37-GHz brightness temperature
The imagery from 37-GHz microwave radiometers, particularly at the high resolution of the AMSR-E instrument, has a number feature-tracking characteristics found in both 85–92-GHz imagery and IR imagery. This channel is sensitive primarily to microwave emission by water droplets, so the image shows the distribution of low-level precipitation in the TC. Accordingly, the image is unobstructed by high clouds as in the 85–92-GHz imagery. However, the 37-GHz image also shows a wider distribution of convective events than the 85–92-GHz image and the convective bands are similar in appearance to those in an IR image. To apply ARCHER to the horizontal polarization of 37-GHz imagery, two simple changes are necessary because the gradients are generally of the opposite sign: first, reversing the weights on positive and negative gradient cross products of the guided fine spiral score [Eq. (3)]; and, second, reversing the sign in the ring score summation term [Eq. (4)]. These changes would not be necessary on 37-GHz polarization corrected temperature (PCT) imagery, but we apply the algorithm to the horizontal polarization instead because of its better texture.
Figures 11 and 12 show the ARCHER algorithm applied to two AMSR-E 37-GHz horizontal polarization images of Hurricane Katrina (2005) in the Gulf of Mexico. The first-guess position is located 0.7° (77 km) west of the best-track position. Both cases show ample distributions of spiral-oriented edges in the convective bands to guide the spiral scores to the rotational center. The eye is also evident in both images even though it is not apparent in the corresponding IR images. This leads the algorithm toward an accurate ring score center as well. In the final result, the ARCHER positions are accurate to 0.11° (11 km) in Fig. 11 and 0.05° (5 km) in Fig. 12. Part of the reason for such close agreement is likely the fewer opportunities for intrinsic error because it represents an estimate that is much closer to the surface than that of the 85–92-GHz retrieval.
6. Conclusions
ARCHER is a widely applicable objective TC center–finding algorithm with an ability to distinguish spiral and eye patterns in satellite images and estimate the rotational center with high accuracy. Its ancillary inputs require only the information that is freely available in real time so that it can run in a fully automated mode. The algorithm shows considerable skill for center-fixing most TCs, and accuracy increases with TC intensity. For less organized TCs, the algorithm is not as accurate because of the high proportion of convective patterns that do not have coherent eyewalls and corresponding spiral bands. When using this algorithm operationally, it is important to interpret the result as the estimated rotational center derived from the retrieval height and navigation of the image signal rather than at the surface, although some corrections for parallax are possible. Tables 4 and 5 provide measures of the algorithm accuracy that will aid in this interpretation. When applied to less organized storms, the interpretation of the result depends heavily on the type of convective structure. In particular, eye-spiral structures lead to the most accurate ARCHER target positions, while axially asymmetric hook structures lead to the least accurate ARCHER position estimates.
Overall, the skill of this algorithm is limited by the fraction of TCs that do not conform, even partially, to the spiral-eye mode of convective organization. In such cases the outstanding problem is that the structures in the image do not align with the actual surface rotational center (i.e., highly sheared or tilted vortices). A possible remedy would be to develop an ancillary technique based on other information that identifies these vortex structures in order to apply the ARCHER algorithm with customized parameters and thresholds. Beyond that, a significantly greater improvement in center-fixing accuracy would probably require the management of multisatellite information and perhaps the inclusion of numerical model information as well.
Acknowledgments
This work was sponsored by the Oceanographer of the Navy through the PEO C4I PMW-150 program office and the Naval Research Laboratory (Jeff Hawkins). We thank Derrick Herndon of CIMSS for collating the best-track and flight reconnaissance data, and we thank the three reviewers for their constructive comments.
REFERENCES
Aberson, S., 2002: Two years of operational hurricane synoptic surveillance. Wea. Forecasting, 17 , 1101–1110.
Brown, L. C., and P. M. Berthouex, 2002: Statistics for Environmental Engineers. CRC Press, 489 pp.
Chang, P-L., B. J-D. Jou, and J. Zhang, 2009: An algorithm for tracking eyes of tropical cyclones. Wea. Forecasting, 24 , 245–261.
Dvorak, V. F., 1973: A technique for the analysis and forecasting of tropical cyclone intensities from satellite pictures. NOAA Tech. Memo. NESS 45, 19 pp.
Goerss, J., 2009: Impact of satellite observations on the tropical cyclone track forecasts of the NOGAPS. Mon. Wea. Rev., 137 , 41–50.
Hawkins, J. D., T. F. Lee, F. J. Turk, K. L. Richardson, C. Sampson, and J. Kent, 2001: Mapping tropical cyclone characteristics via passive microwave remote sensing. Preprints, 11th Conf. on Satellite Meteorology and Oceanography, Madison, WI, Amer. Meteor. Soc., 308–311.
Kummerow, C., W. Barnes, T. Kozu, J. Shiue, and J. Simpson, 1998: The tropical rainfall measuring mission (TRMM) sensor package. J. Atmos. Oceanic Technol., 15 , 809–817.
Lee, R. S. T., and J. N. K. Liu, 2001: An elastic contour matching model for tropical cyclone pattern recognition. IEEE Trans. Syst. Man Cybernetics, 31 , 413–417.
NASA MSFC, cited. 2001: AMSR-E data management plan–August 2001. [Available online at http://www.ghcc.msfc.nasa.gov/AMSR/data_management_plan.html].
NOAA OSDPD, cited. 2002: General SSMIS information. [Available online at http://www.osdpd.noaa.gov/PSB/IMAGES/ssmisdoc.htm].
Olander, T., and C. S. Velden, 2007: The Advanced Dvorak Technique (ADT) – continued development of an objective scheme to estimate tropical cyclone intensity using geostationary infrared satellite imager. Wea. Forecasting, 22 , 287–298.
Olander, T., C. Velden, and J. Kossin, 2004: The Advanced Objective Dvorak technique (AODT): Latest upgrades and future directions. Proc. 26th AMS Hurricane and Tropical Meteorology Conf., Amer. Meteor. Soc., P1.19.
Patadia, F., C. M. Kishtawal, P. K. Pal, and P. C. Joshi, 2004: Geolocation of Indian Ocean tropical cyclones using 85 GHz observations from TRMM Microwave Imager. Curr. Sci., 87 , 504–509.
Raytheon, cited. 2000: SSM/I user’s interpretation guide. [Available online at http://www.ncdc.noaa.gov/oa/rsad/ssmi/fnoc-ssmi-manual.pdf].
Velden, C. S., and J. Hawkins, 2010: Satellite observations of tropical cyclones. Global Perspectives of Tropical Cyclones, J. Chan and J. Kepert, Eds., 3rd ed. World Scientific Publishing Company, 201–226.
Velden, C. S., W. Olson, and B. Roth, 1989: Tropical cyclone center fixing using DMSP SSM/I data. Preprints, 18th Conf. Hurricanes and Tropical Meteorology, San Diego, CA, Amer. Meteor. Soc., J36–J39.
Velden, C. S., and Coauthors, 2006: The Dvorak tropical cyclone intensity estimation technique: A satellite-based method that has endured for over 30 years. Bull. Amer. Meteor. Soc., 85 , 353–385.
Wimmers, A., and C. S. Velden, 2007: MIMIC: A new approach to visualizing satellite microwave imagery of tropical cyclones. Bull. Amer. Meteor. Soc., 88 , 1187–1196.
Wong, K. Y., and C. L. Yip, 2009: Identifying centers of circulating and spiral vector field patterns and its applications. Pattern Recognit., 42 , 1371–1387.
Wong, K. Y., C. L. Yip, P. W. Li, and W. W. Tsang, 2004: Automatic template matching method for tropical cyclone eye fix. Proc. 17th Int. Conf. Pattern Recognition (ICPR’04), Cambridge, United Kingdom, Institute of Electrical and Electronics Engineers Computer Society, 650–653.
Wong, K. Y., C. L. Yip, and P. W. Li, 2007: A novel algorithm for automatic tropical cyclone eye fix using Doppler radar data. Meteor. Appl., 14 , 49–59.
Wong, K. Y., C. L. Yip, and P. W. Li, 2008: Automatic tropical cyclone eye fix using genetic algorithm. Expert Syst. Appl., 34 , 643–656.
Wood, V. T., 1994: A technique for detecting a tropical cyclone center using a Doppler radar. J. Atmos. Oceanic Technol., 11 , 1207–1216.
Zhang, Q. P., L. L. Lai, and W. C. Sun, 2005: Location of tropical cyclone center with intelligent image processing technique, ICMLC 2005. Lect. Notes Artif. Intell., 3930 , 898–907.
APPENDIX A
Characterization of Errors in Operational Tropical Cyclone Position Forecasts
The development of a PMW center-fixing technique requires that the algorithm be calibrated using a first-guess position estimate with an error profile matching that of the forecast positions typical in real-time applications. The most widely available and reliable first-guess TC position estimates are from the official forecast track updated regularly by the National Hurricane Center for TCs in the North Atlantic and east Pacific basins, and by the Joint Tropical Warning Center for TCs in the west Pacific, Indian Ocean, and Australia–Fiji basins. (Another good alternative would be track forecasts issued by Japan’s Regional Specialized Meteorological Center.) Applying this to incoming microwave satellite imagery in real time requires an interpolation between positions at forecast times from roughly 3 to 12 h, depending on the latency of the forecast information and the latency of the image processing and/or transmission.
To characterize the accuracy of this forecast position, we have collected several years of NHC and JTWC forecast positions used as real-time first-guess centers for PMW images in the MIMIC product (Wimmers and Velden 2007) since mid-2004. These forecast positions can be easily evaluated against the final best-track TC positions issued by the respective agencies. The forecasts in the North Atlantic basin must be treated separately from those of the other basins because the high number of regular aircraft observations there leads to an improvement in forecast accuracy.
The 2007 and 2008 North Atlantic storm seasons were selected to estimate the North Atlantic real-time first-guess center accuracy, and the 2008 west Pacific storm season was selected to estimate the first-guess center accuracy in the remaining basins. The following filtering criteria were applied to the data: the maximum sustained winds must exceed 34 kt and the best-track center latitude must be south of 40°N. These two criteria reduce the effect of ambiguous centers of rotation during early storm development and extratropical transition, respectively, on the development sample distribution. Also, the data for North Atlantic TCs Chantal (2007), Noel (2007), Dolly (2008), and Nana (2008) were lost from the forecast position archive, and so are not included.
Figures A1 and A2 show the distribution of forecast error in the 2007–08 North Atlantic storm seasons and 2008 west Pacific storm season, respectively. Both figures show a roughly even azimuthal distribution of error, and are bounded within less than one degree. The average error for the North Atlantic (NHC forecast) is lower than the average error for the west Pacific (JTWC forecast), which is to be expected because of the higher density of observations (including aircraft reconnaissance) in the North Atlantic. However, the tail of the distribution for the North Atlantic is greater, which may result from the fact that the precise North Atlantic observations afford more opportunities to correct the forecasts. Three asterisks on the histograms of Figs. A1 and A2 mark the binned averages of the distances at 0° to 0.25°, 0.25° to 0.55°, and >0.55°. These binned values (Table A1) are used in sections 3 and 4 to weight the results at simulated first-guess errors of 0.1°, 0.4°, and 0.7°, respectively.
APPENDIX B
Derivation of Image-Relative Error
The best possible statistic of algorithm accuracy is the error of the algorithm position with respect to the center position at the retrieval height of the image, because this is the center indicated by the information in the image. However, the only available “truth” dataset for the calibration and validation is the NHC best track, which is a position for the TC center of rotation at the surface. The difference between the best track and the image center can be treated as an additional, independent source of error in the validation, shown as follows.
Using the same method for a smaller set of group B cases, we calculate an intrinsic error RMS of 0.17°, which is significantly larger than the intrinsic error for group A cases, as expected. Because of the lack of best cases for estimating intrinsic error for category 1 hurricane and tropical storm groupings, we use the category 2–5 hurricane intrinsic error as a conservative substitute for the same reasons as before.
An example of the coarse spiral score calculation for Hurricane Emily (0823 UTC 19 Jul 2005, AMSR-E 89-GHz, horizontal polarization). The x–y axes are longitude–latitude. Spiral score values [Eq. (7)] are computed at (a) each dot and interpolated to the resolution of the image to reveal a bull’s-eye pattern centered at the maximum score near point 2. (b),(c) Examples of the cross-product magnitude fields contributing to the spiral score at two points. The cross-product magnitude field in (b) clearly has a lower average absolute value than that of (c) because of the poorer alignment of the image gradients with the corresponding spiral vector field.
Citation: Journal of Applied Meteorology and Climatology 49, 9; 10.1175/2010JAMC2490.1
ARCHER diagnostic images in a straightforward application on the image described in Fig. 1. In each image, the triangle is the best-track position, the cross is the first-guess position and the square marker is the location of the maximum score. (left) Guided fine spiral score, contours; (center) ring score, contours; (right) combined score using optimal weights, contours.
Citation: Journal of Applied Meteorology and Climatology 49, 9; 10.1175/2010JAMC2490.1
RMS of ARCHER positions with respect to the best track for all calibration and validation cases (solid line). Number of cases contributing to the RMS value (dashed line).
Citation: Journal of Applied Meteorology and Climatology 49, 9; 10.1175/2010JAMC2490.1
ARCHER performance statistics for the calibration dataset used to determine the optimum spiral weight and threshold parameters; columns are ordered by TC intensity (kt): (left) 34 ≤ Vmax < 65, (middle) 65 ≤ Vmax < 84, and (right) Vmax ≥ 84. (top to bottom) Weighted-average RMS error (°) between ARCHER position and best-track position weighted-average number of cases in which the ARCHER position error exceeded 0.71°, weighted-average percentage of cases in which the ARCHER target position was applied (rather than the default), and histograms of (unweighted) error for all cases in the corresponding TC intensity group. Magenta polygons and white open circles identify the domain of optimum parameter values and are discussed in the text.
Citation: Journal of Applied Meteorology and Climatology 49, 9; 10.1175/2010JAMC2490.1
Distance (centering error) between the ARCHER position and corresponding best-track position for group A cases, partitioned by TC intensity (kt, rows) and first-guess error (°, columns). The (top to bottom) Vmaxs: 34 ≤ Vmax < 65, 65 ≤ Vmax < 84, Vmax > 84, Vmax ≥ 34, and (left to right) first guess error 0.1°, 0.4°, 0.7°, and 1.0°.
Citation: Journal of Applied Meteorology and Climatology 49, 9; 10.1175/2010JAMC2490.1
ARCHER image-relative RMS arranged by first-guess position displacement, group A TCs; (a) applied target (nondefaulting) cases only, and (b) all cases.
Citation: Journal of Applied Meteorology and Climatology 49, 9; 10.1175/2010JAMC2490.1
As in Fig. 5, but for group B.
Citation: Journal of Applied Meteorology and Climatology 49, 9; 10.1175/2010JAMC2490.1
ARCHER diagnostic images for an example of a group B TC: 0709 UTC 2 Nov 2007 case of Hurricane Noel. Background images are from TMI (85 GHz, horizontal polarization). The symbols, contours and scores are as in Fig. 2.
Citation: Journal of Applied Meteorology and Climatology 49, 9; 10.1175/2010JAMC2490.1
ARCHER diagnostic images for a longwave infrared image: 0715 UTC 28 Aug 2005, Hurricane Katrina. Background images are from the GOES-12 10.7-μm channel. The symbols, contours, and scores are as in Fig. 2.
Citation: Journal of Applied Meteorology and Climatology 49, 9; 10.1175/2010JAMC2490.1
As in Fig. 9, but for 0015 UTC 28 Aug 2005.
Citation: Journal of Applied Meteorology and Climatology 49, 9; 10.1175/2010JAMC2490.1
ARCHER diagnostic images for a 37-GHz image: 1842 UTC 26 Aug 2005 case of Hurricane Katrina. Background images are from the AMSR-E conical scanner 37-GHz channel (horizontal polarization). The symbols, contours, and scores are as in Fig. 2.
Citation: Journal of Applied Meteorology and Climatology 49, 9; 10.1175/2010JAMC2490.1
As in Fig. 11, but for 1925 UTC 27 Aug 2005.
Citation: Journal of Applied Meteorology and Climatology 49, 9; 10.1175/2010JAMC2490.1
(a) First-guess TC center error as the difference between real-time available NHC forecast TC center during the 2007–08 North Atlantic seasons and the best track; (b) distribution of error distance. Asterisks in (b) mark average values binned to 0.1°, 0.4°, and 0.7° (described in text).
Citation: Journal of Applied Meteorology and Climatology 49, 9; 10.1175/2010JAMC2490.1
As in Fig. A1, but (for the difference between real-time available JTWC forecast TC center during the 2008 west Pacific season and the best track.
Citation: Journal of Applied Meteorology and Climatology 49, 9; 10.1175/2010JAMC2490.1
Important parameters in the ARCHER algorithm.
Satellite microwave instruments used in calibration and validation of the ARCHER algorithm.
Calibrated weight and threshold values for the ARCHER algorithm.
Validation results of the ARCHER algorithm, group A TCs.
Validation results of the ARCHER algorithm, group B TCs.a
Weights on the first-guess displacements used to approximate the operational error of TC position estimates.
Estimates of the intrinsic error.