1. Introduction
Supercell thunderstorms are perhaps the most violent of all thunderstorm types and are capable of producing severe winds, large hail, and weak-to-violent tornadoes (Dostalek et al. 2004). In general, however, the supercell class of storms is defined by a persistent rotating updraft (i.e., mesocyclone), which promotes the organization, maintenance, and severity of thunderstorm. The weather surveillance radars used by the National Weather Service (Crum and Alberty 1993) provide vivid pictures of the mesocyclonic activities. The interaction between the updrafts and the vertically sheared environment strongly controls the degree of the organization of a storm and the severity of convection. Supercells and tornadoes are associated with moderate-to-strong vertical wind shear and moderate-to-high instability.
A bounded weak-echo region (BWER) is a radar signature within a thunderstorm characterized by local minima in the radar reflectivity at low levels, which extends upward into and is surrounded by the higher reflectivities aloft. This feature is associated with a strong updraft and is almost always found in the inflow region of a thunderstorm (Markowski 2002; Cotton and Anthes 1989). In fact, a BWER is a representative of a local storm that develops in a strongly sheared environment and tends to a steady-state circulation.
The weather surveillance radars used by the National Weather Service (Crum and Alberty 1993) scan through thunderstorms starting at a low elevation angle (0.5°), and after completing a full 360° azimuthal sweep they progressively increase the elevation angle until an upper limit (19.5°) is reached. This is shown in Fig. 1. As the radar scan continues, a BWER first appears as a region of relatively low reflectivities surrounded by higher reflectivities at a lower elevation angle (Krauss and Marwitz 1984). Then, at higher elevation angles it becomes “capped” by a broad region of high reflectivity. The vertical cross section of a BWER is depicted in Fig. 2, and Fig. 3 depicts the horizontal cross section of a strong BWER.
There are several factors that make the radar signature of a BWER rarely appear very clear. As the distance between the radar and the storm increases, the ability of the radar to properly sample small-scale features within the storm, such as a BWER, becomes more difficult. This is so because the radar-sampling volume becomes larger. There is another problem with rapidly moving storms. By the time the radar scans upward through the storm, the higher-altitude capping region of the BWER may move significantly with the storm and may no longer be located over the relative reflectivity minimum detected at a lower altitude (Lakshmanan and Witt 1996). There is also an error associated with the vertical height measured by the radar (Howard and Gourley 1995), and this error varies with weather conditions.
Smalley et al. (1995) have proposed a method based on the 2D structure but did not consider the 3D structure of a BWER in the decision process. The success rate of this method (Smalley et al. 1995) is limited because it detects all local minima in the reflectivity field of the radar data. In this 2D scheme, the false-alarm rate is very high.
Fuzzy logic has been used to improve the performance of meteorological information processing systems in the past, notably in gust front (Delanoy and Troxel 1993) and also in BWER (Lakshmanan and Witt 1996) detection. Because of the various uncertainties associated with the appearance of a BWER in radar images, a fuzzy rule–based classification scheme is expected to work well.
An improved method for the detection of BWERs was proposed by Lakshmanan and Witt (1996). In this method they have considered the 3D structure of the BWER in their decision process. Their detection scheme is designed based on the concept of fuzzy logic. Lakshmanan and Witt (1996) computed 23 features on each suspected region obtained from the radar image, and these features are characterized by a fuzzy scale. They also assigned a weight to each feature. The decision is made based on a weighted average of features. Lakshmanan (2000) have used genetic algorithms (GAs) for tuning of the BWER detection algorithm.
In this paper we propose a fuzzy rule–based model for the detection of BWERs. We develop an automatic scheme for the extraction of fuzzy rules for classification of the suspected regions. We have used the dataset that was extracted and used by Lakshmanan and Witt (1996) and Lakshmanan (2000). Each feature vector in the dataset represents a region or segment suspected to contain a BWER or includes a part of a BWER. [We use the words region, subregion, and segment interchangeably to represent a small portion of a radar image that may contain a BWER. Note that, the word “segment” has a different usage in the Storm Cell Identification and Tracking algorithm for the Weather Surveillance Radar-1988 Doppler (WSR-88D).] We first generate a set of initial fuzzy rules from the dataset using exploratory data analysis techniques to classify each candidate region into one of three classes—strong, marginal, and no BWER. The initial rule set is then tuned using a gradient search for performance improvement. We achieved an improved performance on the training set as well as on the test dataset relative to the results reported by Lakshmanan and Witt (1996) and Lakshmanan (2000).
We describe our proposed model in section 2. The results are shown in section 3. Section 4 concludes the paper.
2. Proposed model
The fuzzy rule–based classification model, a conceptual model to classify objects, is based on approximate reasoning. The fuzzy set theoretical classification framework provides a degree of support to each potential class. A set of fuzzy rules is used to describe a particular class. The rules are defined on some features, which are computed from the radar images. For any new data (test data), the degree of the match of the features with each fuzzy rule is computed. The class label associated with the rule having the strongest match (i.e., the highest firing strength) defines the class of the new data point.
The proposed method consists of the following three steps: 1) generation of the training and test datasets (discussed in section 2a), 2) generation of an initial fuzzy rule base (discussed in section 2b), and 3) refining (tuning) of the rule base (discussed in section 2c).
a. Training and test data
Lakshmanan and Witt (1996) and Lakshmanan (2000) have collected data over 5 days, and we use the same data. The radar scans obtained on 4 days (25 May 1996, 21 April 1996, 2 June 1995, and 7 May 1995) contain 186 strong and 59 marginal BWERs. For the fifth day (1 June 1995) there was no BWER. Figure 4 shows a typical WSR-88D image from the Lubbock, Texas, (KLBB) radar at 2136 UTC 25 May 1996. The polar data were projected on a plain with a 1 km × 1 km uniform resolution Cartesian grid. The plane is tangential to the earth’s surface at the radar location. The Cartesian grid was limited to 256 km in range. Figure 4 has a BWER as is marked on it. The typical time to process a volume scan is about 30 s on a Sun workstation.
These radar images are preprocessed to find segments (subregions) that are suspected candidates for BWERs (Lakshmanan and Witt 1996). A segment is identified by looking for local minima in the radar elevation scans. Contiguous range gates that belong to local minima are connected to form a candidate segment. These segments are then labeled as having either strong, marginal, or no BWER. Each such segment is represented by a set of features, such as the number of pixels in the segment, the maximum and minimum values of the radar reflectivity, and so on. These data are divided into two parts—one for training of our proposed model and the other for testing the performance of the trained model. The training dataset includes 3 days of data (out of 4 days) with BWERs, along with 50% of the fifth day’s data without BWER. The observations of the remaining 1 day with BWERs along with the remaining 50% of data of the fifth day are used for testing. The training dataset contains 1479 strong, 735 marginal, and 4501 no BWER candidate segments, and the test dataset contain 2113 strong, 642 marginal, and 4501 no BWER segments. There are many segments with no BWER, but we have randomly selected 4501 segments for the training set and the same number of segments for the test set. We shall denote the training set as 𝗫Tr and test set as 𝗫Te.
Each segment is represented by 23 features, which can be divided into two major groups: geometric characteristics (contains 6 features) and BWER characteristics (contains 17 features).
We now provide a description of each feature. The six geometric features are cx, cy, minimum_x, minimum_y, maximum_x, and maximum_y, where (cx, cy) is the coordinate of the center of the BWER. Here (minimum_x, minimum_y) and (maximum_x, maximum_y) are the coordinates of the top-left and the bottom-right corners of the smallest rectangle containing the BWER, respectively. The coordinates are specified on a grid that is tangential to the earth’s surface at the radar location. The grid is at the resolution of the radar range gates [1 km for the Next Generation Weather Radar (NEXRAD)]. The radar elevation scans are projected to that tangential grid.
The remaining 17 BWER characteristics are number _of_pixels, maximum_rfl, minimum_rfl, average_rfl, maximum_bound, minimum_bound, average_bound, number_of_bounds, maximum_cap, minimum_cap, average_cap, height_of_the_BWER, best_height, low_VIL, prev_conf, cov_vol, and sweep.
Here, number_of_pixels is the number of pixels in the segment of the radar image. The features maximum_rfl, minimum_rfl, and average_rfl represent the maximum, the minimum, and the average reflectivity values in the segment (measured in dBZ).
The maximum_bound, minimum_bound, and average_bound measure some bounding values of the reflectivities that surround the weak-echo region in the radar elevation scan. The maximum_bound is defined as the highest reflectivity value in the immediate surrounding neighbor of the candidate 2D weak-echo region. Similarly, the minimum_bound (average_bound) is defined as the lowest (average) reflectivity value computed from the immediate neighbors of the candidate 2D weak-echo region. These three features are measured in dBZ. The number_of_bounds is the number of such boundary pixels that surround the weak-echo region in a radar elevation scan.
The cap at a pixel of the weak-echo region is the highest reflectivity value in any of the elevation scans above the weak-echo region. The maximum_cap of a candidate segment is the maximum reflectivity value of the caps of each of its constituent pixels. Similarly, the minimum_cap is the minimum reflectivity of the caps of each of its constituent pixels. The average_cap is the average value of the caps, averaged over all pixels that together form the weak-echo region. These features are also measured in dBZ.
There are three features that capture some 3D properties: height_of_the_BWER, best_height, and low_VIL (vertically integrated liquid). A BWER is a 3D structure that is formed by linking together segments in the 2D radar elevation scans. The height_of_the_BWER is the distance between the lowest candidate and the highest one, that is, the vertical length of the BWER. The best_height is the height of the highest candidate. The heights are computed using a 4/3 earth formula, assuming the standard atmospheric propagation, and are the height above the radar level. The VIL value at a location is the sum of all observed radar reflectivities (converted to liquid water content) in a vertical column above this location. The VIL at a pixel is, thus, an integration over all elevation scans. The low_VIL is the VIL computed from all elevation scans up to the elevation scan that contains the BWER.
The last three features are prev_conf, cov_vol, and sweep. The prev_conf is a measure of confidence of the closest candidate in the previous volume scan. It is computed by taking the current spatial location of the BWER, searching the list of candidates from the previous volume scan, and finding that closest to the BWER. If the distance of the closest match is within a few kilometers, then its confidence is used as prev_conf. The cov_vol and sweep features are the volume scan and sweep in which BWER was found.
These features were used by Lakshmanan and Witt (1996) and Lakshmanan (2000) also. To select a set of useful features, we ranked these features based on standard deviation. Features with very low standard deviations are discarded. In addition to this, we also used the online feature selection method of Pal and Chintalapudi (1997) to select/rank features. The top-ranked features suggested by this method were considered to be useful by experts also. We experimented with different sets of top-ranked features and found that the top seven features as ranked by the online feature selection method (Pal and Chintalapudi 1997) are adequate, and the use of more than seven top-ranked features does not improve the performance. Hence we settled for seven features.
The set of selected features are the minimum_rfl, average_rfl, maximum_bound, average_bound, average_cap, height_of_the_BWER, and low_VIL.
Each feature is then linearly normalized between [0, 1]. In other words, each input x is normalized by the formula (x − xmin)/(xmax − xmin), where xmax and xmin are the maximum and minimum values of x.
b. Generation of fuzzy rule base
We use exploratory data analysis techniques to design the initial rule base (Delgado et al. 1997; Bezdek et al. 2005; Pal et al. 1997, 2002).
In the FCM algorithm a cluster is represented by a centroid and the algorithm finds the centroids minimizing a sum of weighted squared distances of the data points from the centroids. The weights are related to memberships of data points to different clusters.
For each 𝗫TrK, the FCM algorithm produces CK centroids, μKj, j = 1, 2, . . . , CK, μKi ∈ 𝗥p, K = 1, 2, 3.
The ith cluster of 𝗫TrK, can be represented by a fuzzy rule of the form RKj: if x is close to μKj then x belongs to class K. In RKj and μKj, the subscript indicates the cluster number (rule number) and the superscript corresponds to the class label. To illustrate the rule structure, for notational clarity, for the time being we ignore the subscript and superscript. Suppose μ = (μ1, μ2, . . . , μp)T ∈ Rp is a cluster center for the strong BWER class. The associated rule is R: if x is close to μ then the class is strong BWER. The fuzzy set “x is close to μ” can be represented by a membership function, where the membership to the set is inversely related to the distance of x from μ (Bezdek et al. 2005).
Here x = (x1, x2, x3, x4, x5, x6, x7)T is a feature vector computed from a segment of a radar scan image that needs to be classified. The antecedent clause “x is close to μ” can be represented by a set of seven simpler atomic clauses and the rule can be rewritten as follows: if x1 is close to μ1 and x2 is close to μ2 and . . . and x6 is closest to μ6 and x7 is close to μ7, then the class is strong BWER. In this way we can generate the initial set of fuzzy rules for each cluster in 𝗫TrK for K = 1, 2, 3.
We next provide a schematic description of the algorithm for generation of the initial rules: The input is 𝗫 = 𝗫Tr1 ∪ 𝗫Tr2 ∪ 𝗫Tr3. For each of k = 1, 2, and 3, five steps are performed: In step 1, cluster the data 𝗫Trk from class k to get Ck cluster centers μ1, μ2, . . . , μCk. In step 2, convert each cluster center μi into a rule; if “x is close to μi” then the class is k. In step 3, rewrite the rule in step 2 as follows: If x1 is close to μi1 and . . . and x7 is close to μi7, then the class is k. In step 4, model “close to μij” by a Gaussian membership function; for example, x1 is close to μi1 by exp[−(x1 − μi1)2/σ2i1], where σi1 > 0. In step 5, estimate the initial value of σij by computing the standard deviation of the jth feature of the data points falling in the ith cluster of XTrk. At the end of step 5, return to step 1 and repeat for the remaining k values.
c. Tuning of rule base
The tuning of the initial rule base is done by minimizing the training error using a gradient descent technique. In this regard we now derive the appropriate learning mechanism.
The tuning process is continued till the change in error by an epoch becomes less than a predefined small threshold. In this way we get the final centroids (μ) and spreads (σ) of each fuzzy set associated with the rules. This rule set is now ready for classifying new data. The refined rule base is expected to result in a low error rate.
3. Results and discussions
In our investigation we have extracted four rules for each of the three classes. Out of the 4 × 3 = 12 rules, Table 1 shows only three typical rules—one for each class, before and after tuning. In Table 1, the “tuple” (μ, σ) indicates the center (μ) and spread (σ) of the Gaussian membership function.
A careful inspection of the rules before and after tuning in Table 1 reveals that tuning plays a significant role. For example, the rule corresponding to strong BWER (before tuning) is that if the minimum_rfl is close to 0.77 and average_rfl is close to 0.78 and . . . and low_VIL is close to 0.65, then the class is strong BWER.
The analysis of the rule base can also reveal very interesting characteristics about the underlying process. As an example, for the no BWER class the sixth feature, the height_of_the_BWER, is very important because the σ of the associated membership function is very small (0.03). Thus, the specificity of the fuzzy set, “the height_of_the_BWER is close to 0.31” is very high. This suggests that the vertical length (height of the suspected BWER) plays a very significant role in determining the no BWER case. Such useful semantic information is clearly an advantage of a fuzzy rule–based system over other systems, such as neural networks.
For this rule, the centers of Gaussian membership functions corresponding to only features 3, 4, and 5 changed significantly by the tuning; while the spread associated with each feature is changed significantly by the tuning to have a better coverage by the rule.
a. Before rule tuning
Table 2 shows the confusion matrix for the training data before tuning the rules. The first row shows that out of total 1479 strong cases, 818 are classified as strong, 429 as marginal, and 232 as no BWER. We emphasize again, 1479 strong cases do not represent 1479 strong BWERs, but 1479 segments of a radar scan that are strongly suspected as part of some BWER. Similarly, the second row is for the marginal BWER and the third row for no BWER cases.
Using Table 2 we find b = 1486, c = 232, and d = 818.
Similarly, Table 3 shows the confusion matrix with the test data before tuning the rules.
b. After rule tuning
Table 4 shows the confusion matrix with the training data after tuning the rules. In comparing Table 4 with Table 2 we find a significant reduction in the number of mislabeled cases.
4. Conclusions
The detection of a BWER signature using radar scans is often very difficult. The radar scans are first preprocessed to find the suspected regions. To generate the training data these suspected regions are manually labeled by experts. Each such region is represented by a set of features. Using exploratory data analysis an automatic scheme is developed for the extraction of fuzzy rules for classification of the suspected regions. It is a two-step process. In the first step we extracted an initial rule base using clustering. Then, in the second phase, we refined rules with respect to their context. In this regard, we derived the necessary update equations.
The proposed system is tested on a dataset not used in the training. Our system is found to produce an improved detection accuracy over the results reported in the literature on the same dataset. Our system extracts human-interpretable linguistic rules. We have demonstrated that such rules can reveal interesting information about the underlying process. This is a distinct advantage of the proposed system.
Our next step would be to integrate this information both spatially and temporally to find the location of each BWER. A genuine BWER should exhibit a cluster of suspected data points, and such clusters are expected to show up in successive radar scans. We plan to use the firing strength information to detect such clusters. We shall also investigate the use of cluster validity indexes to decide on the number of rules for each class. We expect that use of an appropriate feature set along with the right number of rules for each class will further improve the performance.
Acknowledgments
We sincerely acknowledge the editor and the anonymous reviewers for their valuable suggestions to improve the manuscript.
REFERENCES
Bezdek, J. C., J. Keller, R. Krishnapuram, and N. R. Pal, 2005: Fuzzy Models and Algorithms for Pattern Recognition and Image Processing. Kluwer Academic, 776 pp.
Cotton, W. R., and R. A. Anthes, 1989: Storm and Cloud Dynamics. Academic Press, 883 pp.
Crum, T., and R. Alberty, 1993: The WSR-88D and the WSR-88D operational support facility. Bull. Amer. Meteor. Soc., 74 , 1669–1687.
Delanoy, R. L., and S. W. Troxel, 1993: Machine intelligent gust front detection. Lincoln Lab. J., 6 , 187–212.
Delgado, M., A. F. Gomez-Skorneta, and F. Martin, 1997: A fuzzy clustering-based rapid prototyping for fuzzy rule-based modelling. IEEE Trans. Fuzzy Syst., 5 , 223–233.
Donaldson, R., R. Dyer, and M. Krauss, 1975: An objective evaluator of techniques for predicting severe weather events. Preprints, Ninth Conf. on Severe Local Storms, Norman, OK, Amer. Meteor. Soc., 321–326.
Dostalek, J. F., J. F. Weaver, and G. L. Phillips, 2004: Aspects of a tornadic left-moving thunderstorm of 25 May 1999. Wea. Forecasting, 19 , 614–626.
Howard, K., and J. Gourley, 1995: Evaluating the magnitude of uncertainties in WSR-88D measurements and measuring this impact on storm attribute trends—An initial study. National Severe Storms Laboratory Tech. Rep.
Klir, G. J., and B. Yuan, 1995: Fuzzy Sets and Fuzzy Logic—Theory and Application. Prentice Hall, 592 pp.
Krauss, T., and J. Marwitz, 1984: Precipitation processes within an Alberra supercell hailstorm. J. Atmos. Sci., 41 , 1025–1034.
Lakshmanan, V., 2000: Using a genetic algorithm to tune a bounded weak echo region detection algorithm. J. Appl. Meteor., 39 , 222–230.
Lakshmanan, V., and A. Witt, 1996: Detection of bounded weak echo regions in meteorological images. Proc. IAPR’96, Vienna, Austria, IAPR, 895–899.
Lemon, L. R., 1980: Severe storms radar identification techniques and warning criteria. National Severe Storms Forecast Center Tech. Rep. NWS NSSFC-3, 35 pp.
Markowski, P. M., 2002: Surface thermodynamic characteristics of hook echoes and rear-flank downdrafts. Part I: A review. Mon. Wea. Rev., 130 , 852–876.
Pal, K., R. Mudi, and N. R. Pal, 2002: A new scheme for fuzzy rule based system identification and its application to self-tuning fuzzy controllers. IEEE Trans. Syst. Man. Cybernetics B, 32 , 470–482.
Pal, N. R., and K. K. Chintalapudi, 1997: A connectionist system for feature selection. Neural Parallel Sci. Comput., 5 , 359–382.
Pal, N. R., K. Pal, J. C. Bezdek, and T. A. Runkler, 1997: Some issues in system identification using clustering. Int. Joint Conf. Neural Networks: IJCNN 1997, Piscataway, NJ, IEEE, 2524–2529.
Smalley, D., R. Donaldson, I. Harris, S. L. Tung, and P. Desrochers, 1995: Quantization of severe storm structures. Proc. 27th Conf. on Radar Meteorology, Vail, CO, Amer. Meteor. Soc., 617–619.
Smith, T., 1995: Visualization of WSR-88D data in 3-D using application visualization software. Preprints, 14th Conf. on Weather Analysis and Forecasting, Dallas, TX, Amer. Meteor. Soc., 442–446.
Three typical rules before and after tuning.
Confusion matrix with training data (before tuning).
Confusion matrix with test data (before tuning).
Confusion matrix with training data (after tuning).
Confusion matrix with test data (after tuning).