1. Introduction
The wind power density (WPD) is required for the estimation of power potential from wind turbines. It is a nonlinear function of the wind speed probability density function (PDF). The wind speed PDF is usually estimated from data and then used as a functional in the WPD distribution function, which can be integrated to obtain the WPD (e.g., Çelik 2003a). The wind speed PDF has traditionally been estimated using a parametric model applied to wind speed data at turbine height. These models generally include the Weibull (Stevens and Smulders 1979), Rayleigh (Çelik 2003b), and lognormal functions (Zaharim et al. 2009). The two-parameter Weibull function is the most often used parameteric model for the wind speed PDF (Hennessey 1977; Justus et al. 1978 Pavia and O’Brien 1986; Ramirez and Carta 2005; Monahan 2006). However, it has also been shown that wind speed does not always have a Weibull-like distribution (Jaramillo and Borja 2004; Yilmaz and Çelik 2008). An extreme example of such a case is shown in Fig. 1 using 5-min 10-m height wind speed data from Boise City, Oklahoma, for the year 2000 taken from the Oklahoma Mesonetwork (Brock et al. 1995). The maximum likelihood estimation of the Weibull PDF parameters clearly results in an unacceptable fit to the wind speed histogram.
Given a small sample size, a good argument can usually be made for applying a parametric PDF model to wind data. However, when a robust, smooth histogram of the wind speed distribution can be determined from an abundance of available data like that shown in Fig. 1, nonparametric techniques (Izenman 1991; Silverman 1986) are usually used for PDF estimation because of their flexibility and the very real possibility that the underlying true density may not be adequately represented by a parametric model (Jaramillo and Borja 2004). A commonly used nonparametric method is the kernel method (Silverman 1986; Juban et al. 2007). Kernel techniques are quite flexible and usually provide an acceptable estimate of the wind speed PDF. There are, however, many instances (e.g., for engineering applications such as turbine design) where a functional, as opposed to graphical, representation of the wind power distribution is required. Such a representation of the PDF using the kernel method results in a number of terms equal to the number of unique data points used in the fitting process, resulting in a very unwieldy mathematical function. Thus, the kernel method is a suboptimal technique for providing a functional representation of the distribution of power with wind speed.
Because the WPD is a function of the product of the PDF, the air density and one-half the cubed wind speed, minor errors in the PDF estimation can lead to large errors in the WPD estimate (Çelik 2003a). This is especially true when those errors are made at medium and high wind speeds (e.g., using 5-min measurements), where the sample sizes are usually quite small, leading to an estimation error in the WPD that is likely to be quite large. In this paper, the process of estimating the wind speed PDF and then multiplying the resulting functional by the cubed wind speed times one-half the air density to obtain the WPD distribution is hereafter referred to as the “indirect” method. The present study develops a nonparametric, direct estimate of the WPD distribution function by incorporating the wind speed cubed in a Gauss–Hermite expansion. It will be shown that not only does this “direct” method provide a better estimate of the WPD distribution in terms of estimation bias and variance, but it also produces a tractable and computationally simple mathematical formulation. The paper begins with a description of the basic WPD formulation in section 2. A general description of Gauss–Hermite expansion is given in section 3. The development of the direct WPD estimator using the Gauss–Hermite expansion is described in section 4. The result of a comparison of the direct, indirect, and Weibull maximum likelihood estimators of the WPD using a resampling method is described in section 5. An experiment using wind speed from two sites in Oklahoma is described in section 6. Finally, the conclusions and summary are in section 7.
2. The wind power density formulation

Thus, the error in e(υ) is a cubic function of the wind speed times the estimation error in the wind speed PDF. This can be an especially acute problem as the true wind speed PDF f (υ) is generally positively skewed for wind speed. Hence, there are less data available for estimating f (υ) at the higher wind speed values. This, together with the fact that the estimation error is a function of wind speed cubed, means that the estimation error in e(υ) will be greatest for the larger values of υ. Besides an insufficient number of data (i.e., sampling error), the estimation error is also a function of the quality of the fit of the selected function estimator. This paper will address both of the sources of error in the development of a new WPD distribution estimator.
A function estimator selected for a given task should have certain basic properties similar to that suggested by the data. For example, a Gaussian kernel method is commonly used to estimate probability densities because the weighted sum of these kernels is guaranteed to integrate to one and to be positive over the range of support of the data (Silverman 1986). The method is also flexible enough to mimic a wide range of PDF shapes. Another desired property of density estimators is that they are as parsimonious as possible for both computational reasons and further functional analysis. In this regard, the kernel method is not well suited for the reasons mentioned above. Thus, an optimal estimator of the WPD distribution would be one that has similar functional “shape” properties as the underlying distribution and produces a function that has reasonable mathematical tractability.
Near-Gaussian functions have been shown to be quite well estimated using a Gauss–Hermite expansion (van der Marel and Franx 1993). This is because the expansion is a sum of weighted Hermite polynomials that are derived from the standard Gaussian function. In fact, only the first term in the expansion is required to reproduce a Gaussian-shaped function (see below). van der Marel and Franx (1993) demonstrated that additional terms in the expansion tend to account for any asymmetry in a quasi-Gaussian-like curve. They also showed that for near-Gaussian functions only a relatively small number of terms were required for an excellent approximation. Thus, it would appear reasonable that a Gauss–Hermite expansion fit directly to the WPD distribution function rather than the wind speed PDF may provide a better estimate.
3. The Gauss–Hermite orthogonal series estimator
This paper examines a new way to apply a specific class of nonparametric estimators of the WPD distribution, specifically, orthogonal polynomial expansions. Early work on these estimators was conducted by Cencov (1962), Schwartz (1967), Kronmal and Tarter (1968), Watson (1969), Walter (1977), and Liebscher (1990). All of these works involved designing a PDF estimator. In particular, Hall (1980) studied expansions on the positive half-line using Hermite and Laguerre polynomial series and found that Hermite polynomials were generally the superior estimator based on comparisons of the rate of convergence. Bryukhan and Diab (1993) applied a Laguerre polynomial expansion to South African wind speed data for the PDF estimation and found satisfactory results. Although well established in the mathematical literature, there has been little research on practical applications of orthogonal functions for the estimation of the WPD distribution function. In astronomy and astrophysics, however, different forms of these estimators have been successfully used to estimate the spectral line profiles of galaxies (van der Marel and Franx 1993; Blinnikov and Moessner 1998). Expansions of Hermite polynomials were chosen because the spectral profiles tended to be near Gaussian in shape and are modeled quite well by these expansions. Two common Hermite expansions, the Gram–Charlier Series A and the Gauss–Hermite expansions, have been closely studied for their utility in modeling these spectra (van der Marel and Franx 1993; Blinnikov and Moessner 1998). The study by Blinnikov and Moessner (1998) demonstrated quite convincingly the superior convergence properties of the Gauss–Hermite over the Gram–Charlier expansion. Therefore, this paper exclusively focuses on the Gauss–Hermite form of the expansion.
4. Estimating the wind power distribution directly
Any function that is square integrable (i.e., has a finite variance, E[ f 2(V)] < ∞) can be represented by an orthogonal expansion. Thus, instead of estimating f (υ) and then inserting this in (2) to obtain an estimate of e(υ), it is proposed to expand the wind power distribution function directly without first estimating f (υ). Because the WPD function appears to be near Gaussian, a Gauss–Hermite expansion should provide a good estimate of this function. For simplicity and to focus on the use of this approach for wind modeling, it will be assumed that air density is constant and equal to 1.0 kg m−3. This is not a restrictive assumption because it can be relaxed to incorporate variability in air density in the development.
5. Experiment using resampling


The results of the resampling experiment are shown in Fig. 8. In this situation, the direct method again outperformed the other two indirect methods across all sample sizes used. The Weibull function estimate simply was not flexible enough to capture the overall character of the WPD as a function of wind speed. The indirect Gauss–Hermite expansion method is clearly unacceptable as well.
The resulting three WPD distribution functions were compared with a bar chart of WPD distribution derived from the Boise City wind speed histogram in Fig. 9. The bar chart was constructed by calculating the relative frequency of wind speed in each bin and multiplying this by one-half the cube of the bin center value. Figure 9 illustrates the problems with indirect methods of calculating the WPD distribution. For Boise City the Weibull function provided an inadequate fit to the wind speed histogram (refer to Fig. 1), especially between the values of 3.5 and 9.0 m s−1. Note that, although the Weibull estimator fit the right tail of the PDF reasonably well, the resulting WPD distribution estimates become increasingly poor after 10.0 m s−1 because of the magnification of minor PDF errors by the υ3/2 factor in the WPD formulation. The indirect Gauss–Hermite expansion provided an adequate fit up to approximately 12 m s−1. However, at values higher than this threshold the function oscillates, even becoming negative, indicating that error in the Gauss–Hermite expansion fit of the wind speed PDF produced moderate errors at the higher values, which were then significantly magnified by the υ3/2 factor in the WPD distribution formulation.
To check the consistency and generality of the previous results, the resampling experiments were rerun using 5-min 10-m wind speed data from 1994 through 2006 from a Mesonet site in Weatherford, Oklahoma, which represents 1 358 045 5-min measurements. The result of these experiments is shown in Figs. 10 and 11. As can be observed from these figures, these results also support those found using Boise City data in that the direct method is vastly superior to the indirect method in terms of being a much better estimator of the wind power density.
Obviously, the number of basis functions included in the expansion affects the quality of the fit. Because the objective of this study is to obtain the most mathematically parsimonious estimator, it is of interest to determine if the expansions converge with an increasing number of terms and what that convergence looks like.
6. An examination of the behavior of the orthogonal expansions
The results in the previous section depended upon the underlying shape of the function to be approximated and the number of terms used in the expansions. Thus, different results would most likely be obtained using different wind speed PDFs and a different number of terms. The “best” estimator is that which produces the smallest MISE for a given number of basis functions. To obtain insight into the behavior of the two expansions, the MISE was formed using the more skewed of the two Weibull functions fW(υ|α, β) shown in Fig. 4, that is, the one with parameters α = 1.8 and β = 6.5 m s−1.
7. Summary
An improved WPD distribution function estimator is required for wind resource assessment and turbine-engineering purposes. The aim of this project was to develop a new and improved wind resource modeling tool and examine its efficiency properties compared to commonly used methods. It was important that the estimator be mathematically tractable. Although the popular kernel method has been proven to be quite efficient at PDF estimation, its functional representation is highly unwieldy. This latter requirement eliminated the kernel method as a candidate estimator for this study’s objective. Truncated orthogonal Hermite polynomial series estimators seemed a likely candidate as they can provide a workable function [e.g., Eq. (32)]. An extensive literature review by the authors revealed a plethora of theoretical work on these estimators mostly prior to the year 2000. The theoretical work was mostly confined to the estimator’s convergence properties for PDFs. There is a lack of practical applications of orthogonal series estimators in the literature, especially for wind power estimation. The two most likely reasons for the seeming absence of applications are speculated to be the perception of translation invariance (Izenman 1991) on the abscissa of the Hermite system and the popularity of the kernel method for PDF estimation. An obvious correction to the translation problem is an appropriate data transformation given by van der Marel and Franx (1993) and shown in Eq. (15). Another potential drawback of using an orthogonal polynomial-based series is the possibility of negative values arising where the range of support is only on the positive real line. Although this is quite evident in Fig. 9 using the indirect orthogonal method, it did not appear to be a problem when applying the direct orthogonal estimator. Efromovich (1999) proposes several correction techniques in the case that negative values do arise.
Notwithstanding the lack of utilization of orthogonal series estimators, the results of the study indicated that the Gauss–Hermite expansion, used as an estimator of the WPD distribution function, does not appear to be efficient or convergent if first applied to estimate the wind speed PDF, and then subsequently using this estimate to construct the WPD distribution function (i.e., the indirect method). Alternatively, the use of the Gauss–Hermite expansion as a direct estimator of the WPD distribution results in an efficient, convergent, and mathematically tractable function. It is hypothesized that this results from the capability of Hermite polynomial expansions to naturally model near-Gaussian functions well (because Hermite polynomials are functions of the derivatives of the standard Gaussian), and the fact that estimation error in the wind speed PDF is not enhanced by the υ3/2 factor, which is characteristic of all indirect methods.
Acknowledgments
The authors and the Oklahoma Wind Power Initiative would like to recognize the Oklahoma Economic Development Generating Excellence program and the Oklahoma State Energy Office for providing funding for this project. Also the Oklahoma Climatological Survey is to be thanked for providing the Oklahoma Mesonetwork data.
REFERENCES
Blinnikov, S., and Moessner R. , 1998: Expansion for nearly Gaussian distributions. Astron. Astrophys., 130 , (Suppl.). 193–205.
Brock, F. V., Crawford K. C. , Elliott R. L. , Cuperus G. W. , Stadler S. J. , Johnson H. L. , and Eilts M. D. , 1995: The Oklahoma Mesonet: A technical overview. J. Atmos. Oceanic Technol., 12 , 5–19.
Bryukhan, F. F., and Diab R. D. , 1993: Decomposition of empirical wind speed distributions by Laguerre polynomials. Wind Eng., 17 , 147–151.
Çelik, A. N., 2003a: Assessing the suitability of wind speed probability distribution functions based on wind power density. Renewable Energy, 28 , 1563–1574.
Çelik, A. N., 2003b: A statistical analysis of wind power density based on the Weibull and Rayleigh models at the southern region of Turkey. Renewable Energy, 29 , 593–604.
Cencov, N. N., 1962: Estimation of an unknown distribution density from observations. Soviet Math. Dokl., 3 , 1559–1562.
Efromovich, S., 1999: Nonparametric Curve Estimation, Methods, Theory and Applications. Springer-Verlag, 411 pp.
Hall, P., 1980: Estimating a density of the positive half line by the method of orthogonal series. Ann. Inst. Stat. Math., 32 , 351–362.
Hennessey, J. O., 1977: Some aspects of wind power statistics. J. Appl. Meteor., 16 , 119–128.
Izenman, A. J., 1991: Recent developments in nonparametric density estimation. J. Amer. Stat. Assoc., 86 , 205–224.
Jaramillo, O. A., and Borja M. A. , 2004: Wind speed analysis in La Ventosa, Mexico: A bimodal probability distribution case. Renewable Energy, 29 , 1613–1630.
Juban, J., Siebert N. , and Kariniotakis G. , 2007: Probabilistic short-term wind power forecasting for the optimal management of wind generation. Proc. IEEE PowerTech 2007 Conf., Lausanne, Switzerland, Institute of Electrical and Electronics Engineers, 683–688.
Justus, C. G., Hargraves W. R. , Mikhail A. , and Graber D. , 1978: Methods for estimating wind speed frequency distributions. J. Appl. Meteor., 17 , 350–353.
Kronmal, R., and Tarter M. , 1968: The estimation of probability densities and cumulatives by Fourier series methods. J. Amer. Stat. Assoc., 63 , 925–952.
Li, M., and Li X. , 2005: MEP-type distribution function: A better alternative to Weibull function for wind speed distributions. Renewable Energy, 30 , 1221–1240.
Liebscher, E., 1990: Hermite series estimators for probability densities. Metrika, 37 , 321–343.
Monahan, A. H., 2006: The probability distribution of sea surface wind speeds. Part II: Dataset intercomparison and seasonal variability. J. Climate, 19 , 521–534.
Pavia, E. G., and O’Brien J. J. , 1986: Weibull statistics of wind speed over the ocean. J. Climate Appl. Meteor., 25 , 1324–1332.
Ramirez, P., and Carta J. A. , 2005: Influence of the data sampling interval in the estimation of the parameters of the Weibull wind speed probability density distribution: A case study. Energy Convers. Manage., 46 , 2419–2438.
Schwartz, S. C., 1967: Estimation of a probability density by an orthogonal series. Ann. Math. Stat., 38 , 1262–1265.
Silverman, B. W., 1986: Density Estimation. Chapman and Hall, 175 pp.
Stevens, M. J. M., and Smulders P. T. , 1979: The estimation of the parameters of the Weibull wind speed distribution for wind energy utilization purposes. Wind Eng., 3 , 132–145.
van der Marel, R. P., and Franx M. , 1993: A new method for the identification of non-Gaussian line profiles in elliptical galaxies. Astrophys. J., 407 , 525.
Walter, G. G., 1977: Properties of Hermite series estimation of probability density. Ann. Stat., 5 , 1258–1264.
Watson, G. S., 1969: Density estimation by orthogonal series. Ann. Math. Stat., 38 , 1262–1265.
Yilmaz, V., and Çelik H. E. , 2008: A statistical approach to estimate the wind speed distribution: The case of the Gelibolu region. Doğuş Üniversitesi Derg., 9 , 122–132.
Zaharim, A., Razali A. M. , Abidin R. Z. , and Sopian K. , 2009: Fitting of statistical distributions to wind speed data in Malaysia. European J. Sci. Res., 26 , 6–12.
A histogram overlaid with the maximum likelihood fit of the two-parameter Weibull PDF for Boise City, Oklahoma.
Citation: Journal of Atmospheric and Oceanic Technology 27, 7; 10.1175/2010JTECHA1390.1
Plots of six mixed Weibull probability density functions.
Citation: Journal of Atmospheric and Oceanic Technology 27, 7; 10.1175/2010JTECHA1390.1
The WPD functions corresponding to the Weibull PDFs shown in Fig. 2.
Citation: Journal of Atmospheric and Oceanic Technology 27, 7; 10.1175/2010JTECHA1390.1
Two Weibull functions used to test the direct and indirect methods.
Citation: Journal of Atmospheric and Oceanic Technology 27, 7; 10.1175/2010JTECHA1390.1
Results of the resampling experiment 1 for estimation error in the wind power density as compared with a theoretical value computed using a Weibull probability density function.
Citation: Journal of Atmospheric and Oceanic Technology 27, 7; 10.1175/2010JTECHA1390.1
As in Fig. 5, but for experiment 2.
Citation: Journal of Atmospheric and Oceanic Technology 27, 7; 10.1175/2010JTECHA1390.1
A Gaussian kernel fit to 10-m wind speed data from Boise City, Oklahoma.
Citation: Journal of Atmospheric and Oceanic Technology 27, 7; 10.1175/2010JTECHA1390.1
Results of the resampling experiment for MISE in WPD estimation as compared with the best kernel estimate using wind speed data from Boise City, Oklahoma.
Citation: Journal of Atmospheric and Oceanic Technology 27, 7; 10.1175/2010JTECHA1390.1
A combined plot of the Boise City, Oklahoma, WPD histogram and the three functional WPD estimators using 16 basis functions in each expansion.
Citation: Journal of Atmospheric and Oceanic Technology 27, 7; 10.1175/2010JTECHA1390.1
Results of the resampling experiment for MISE in WPD estimation as compared with the best kernel estimate using wind speed data from Weatherford, Oklahoma.
Citation: Journal of Atmospheric and Oceanic Technology 27, 7; 10.1175/2010JTECHA1390.1
The combined plot of the Weatherford, Oklahoma, WPD histogram and the three functional WPD estimators using 16 basis functions in each expansion.
Citation: Journal of Atmospheric and Oceanic Technology 27, 7; 10.1175/2010JTECHA1390.1
The mean-square error for the indirect and direct method based on a two-parameter Weibull function with parameters α = 1.8 and β = 6.5 m s−1. The number of terms in the expansion is m + 1.
Citation: Journal of Atmospheric and Oceanic Technology 27, 7; 10.1175/2010JTECHA1390.1
Plots of the indirect expansion of the WPD distribution with increasing number of (k = 0, 1, 2, … , m) basis functions. The solid thick curve is the WPD distribution computed from the Weibull function.
Citation: Journal of Atmospheric and Oceanic Technology 27, 7; 10.1175/2010JTECHA1390.1
Plots of the direct expansion of the WPD distribution with an increasing number of basis functions [eD(υ, k), k = 0, 3, 6, 8, 10]. The solid thick curve is the WPD distribution computed from the Weibull function.
Citation: Journal of Atmospheric and Oceanic Technology 27, 7; 10.1175/2010JTECHA1390.1
The MISE for the indirect and direct methods with an increasing number of terms (m = number of terms plus one).