1. Introduction
The exponentially growing needs of energy and degradation of environmental conditions are some of the important factors behind the promotion of renewable and clean sources of energy into the energy mix on national and localized power grids. These renewable sources of energy include wind, solar photovoltaic, solar thermal, geothermal, biomass, and tidal to name the main ones. Of these sources, the wind energy has been commercially developed, accepted, and available for small to very large grid connected applications. The cumulative global wind power installed capacity reached 432,419 MW at the end of 2015 compared to 17,400 MW in year 2000, an increase of about 2485.2% in 16 years period. Compared to 2014 the cumulative installed capacity increased by 17% from 369,695 MW in 2014 to 432,419 MW in 2015 [
1]. It has been established that power of the wind is worth deploying in all parts of the world and Pakistan in particular. For successful and profitable development, the understanding of wind speed variation and its availability for power production is critical. Wind is highly fluctuating meteorological parameters with time and location and hence need to be understood well. The present study aims at analyzing the wind speed data using Maximum Entropy Principle (MEP) and compares it with Weibull distribution function for wind power production at a selected site.
MEP is a widely used technique in various applications ranging from statistical and quantum mechanics to decision theory financial modelling. The MEP concept was presented by Jaynes [
2,
3] such that for any given available information about the distribution, an approach is established for finding the least informative probability distribution. MEP is based on a procedure that provides a link between Shannon’s information theory [
4] and statistical mechanics. The underlying approach is to construct a probability distribution that agrees with the given distribution having a particular set of quantities of the system background investigation. The procedure is based on the idea that among all the infinite number of distributions compatible with data distribution, a distribution which maintains the largest degree of uncertainty is the one which must be selected. In order to express uncertainty in a distribution, Shannon’s entropy [
4] is used. Shannon’s entropy as a measure of uncertainty is maximized to obtain the probability distribution function. As an example of this principle, maximum entropy probability distribution function for a random variable (wind speed) is determined by setting up first few low order moments as constraints on the given data distribution.
The Maximum Entropy Principle and Minimum Cross Entropy [
2,
3,
4,
5,
6,
7] are used to calculate distribution function. Liu and Chang [
8] used probability density function with the maximum entropy and stated that wind regimes have two humps. The maximum entropy distributions outperformed the Weibull function, irrespective of wind speed or power density analyzed. Wu [
9] calculated maximum entropy densities for employee income distributions and compared it with the conventional income distributions. In renewable wind energy sector, Li et al. [
10,
11] used MEP to estimate wind energy potential as an alternative to the Weibull distribution function and proposed a theoretical method of calculating the probability density function for wind speed data distribution. These studies utilized the idea of conservation of wind’s mass that is mass of wind molecules in motion, momentum, and energy to construct family of distributions derived from maximum entropy principle by employing a pre-exponential term. The above studies formulated distribution functions for diurnal, monthly, seasonal and yearly wind speed variations.
Rehman et al. [
12] used Weibull distribution to represent the wind speed data and found an excellent fit between the measured values and the two parameters based model. Bagiorgas et al. [
13] conducted wind power potential assessment for seven locations in the Aegean Sea (Greece) using the Weibull distribution and obtained close agreement between the measured and modeled values. In another study Bagiorgas et al. [
14] four different methods to model the measured wind speed data and estimated most energy carrying wind speed were compared. Sulaiman et al. [
15] examined the measured wind speed data in Oman using the Weibull function and the Kolmogorove Smirnov test and reported the goodness of fit considering 1% and 5% of confidence level. Rehman et al. [
16] studied the wind characteristics using two parameter Weibull distributions.
Ramírez and Carta [
17] investigated the hourly wind speed data recorded at weather stations located in the Canarian Archipelago and constructed a probability distribution function based on maximum entropy principle (MEP). The study concluded that a MEP-based function constrained by three low order moments presents a lower relative error in estimation of wind power density as compared to the power density estimated using the Weibull distribution function. Zhou et al. [
18] presented a comprehensive analysis of wind speed from five sites in North Dakota (USA) using MEP. The study in addition to MEP-based distribution also included Weibull, Rayleigh, Gamma, Lognormal, inverse Gaussian distributions. The study concluded that MEP-based distribution is more flexible as compared to other distribution functions. Zhang et al. [
19] investigated the wind speed data recorded using intertidal anemometer tower located at Rudong in the East China Sea. The recorded wind speed data was fitted to the Weibull function and a probability function using MEP. The study revealed that probability density function based on MEP performed well in fitting the wind speed data.
The present study is devoted to the determination of monthly and yearly maximum entropy probability density function for Baburband, Sindh Pakistan. A comparison with the conventional empirical Weibull distribution function is also made and the suitability of the MEP-based function is discussed for the investigated site.
3. Results and Discussion
Wind speed data for Baburband was acquired from AEDB Government of Pakistan [
29]. The site and wind speed data specifications are given in
Table 1.
The analysis is performed on time-series wind speed data collected in 10 min time interval for a period of one year (2010). The wind speed data was recorded using three-cup shaped anemometer in the range 0 to 30 m/s with a threshold of 0.00345 m/s and has a resolution of 0.056 m/s. The Baburband site is located northeast of Karachi, Pakistan on the Super Highway. Monthly and yearly descriptive statistics of the measured data are given in
Table 2. The monthly mean wind speed varied between a minimum of 4.95 m/s in October to a maximum of 9.59 m/s in May with an increasing trend from January to May and then a decreasing trend towards the end of the year see column 3 of
Table 2. At 80 m height, the overall average wind speed of 6.72 m/s was observed with standard deviation of 3.04 m/s, confidence interval (C.I.) of 0.06 and a confidence level (C.L.) of 95.0% (see
Table 2). The wind power density at 80 m varied between 104.01 W/m
2 and 719.91 W/m
2 corresponding to October and May, as observed from the data shown in column 8 of
Table 2. Wind power density followed almost the same trend like monthly mean wind speed over the year. However, the coefficient of variations (CV) of 45.29% is observed for the total number of data points. This implies that 55% of wind speed data points fall in a C.I. window of 0.06 around mean wind speed in the whole 12 months period on the average. Furthermore, the measured wind speed distribution is positively skewed (S = 0.53). This implies that larger number of wind speed data points is located to the right of the mean wind speed.
Using hourly averaged wind speed data, monthly and overall Weibull and Maximum Entropy probability and cumulative functions (
pdf and
cdf) are determined. The Weibull shape and scale parameters are estimated using Modified Maximum Likelihood Method (MMLM) and are summarized in
Table 3. No definite trend was observed in case of shape parameter; however, the scale parameter followed the monthly mean wind speed trend. The Weibull Mean wind speed or maximum energy carrying wind speed (
Vm) varied from a minimum of 4.50 m/s in October to 9.63 m/s in June as given in column 5 of
Table 3. This simply meant that in the month of June the maximum energy carrying wind speed was 9.63 m/s while in the month of October it was 4.50 m/s. On the other hand, the most probable wind speed (
Vmp) in column 6 of
Table 3 varied between 4.28 m/s and 10.02 m/s in the months of October and June; respectively. Here it can be said that in the month of June the most probable occurring wind speed was 10.02 m/s while 4.28 m/s in the month of October.
The wind power density (
PW) estimated using Weibull parameters is given in column 7 of
Table 3. In general
PW values are found to be lower than
PA given in
Table 2 above. In order to compare the
PW with
PA linear correlations are obtained by plotting the scatter diagrams, shown in
Figure 1 and coefficient of determination values (
R2) are determined for each month and the entire data set and are listed in column 8 of
Table 3. Almost in all the months the
R2 values are found to be around 0.99 while RMSE and χ
2 of approximately 1 × 10
−3 and 1 × 10
−6, respectively. The magnitudes of these performance parameters indicate that the modeled and measured wind speed based wind power densities are in close agreement.
Maximum entropy density functions at low order moments 3, 4, 5, 6, 7, 8 and 9 were determined for monthly and overall periods and are summarized in
Table 4 and
Table 5, respectively. The wind power density values (
PE) obtained using lower order moments are in close agreement with each other. In most of the months the coefficient of determination values
R2 are found to be around 0.9 for all the monthly analysis and also for entire data set. The scatter diagrams between the
PA and
PE values for low order moments 5, 6, 7, and 8 along with the best fit linear line are shown in
Figure 2a–d obtained using the entire data set. The goodness-of-fit test for maximum entropy function is performed using Kolmogorov–Smirnov (KS) test between the wind power density values (
PA and
PE) obtained using measured data and the maximum entropy density function. The test results indicate the acceptance of null hypothesis, as the critical value
Q95 = 0.01457 (for complete data set over the year with
n = 8711 wind speed data points) calculated using Equation (15) is greater than KS test statistics
Q = 0.643, calculated using Equation (14). Also in case of monthly analysis the critical value
Q95 = 0.04987 (for monthly data sets each of
n = 744 wind speed data points) is greater than KS test statistics
Q on monthly bases ranging from 7.5 × 10
−4 to 8.3 × 10
−4.
Figure 3 shows the histogram of measured wind speed data and overall probability plots of Weibull and maximum entropy functions for all four orders. A close look of probability plots for detailed analysis of wind speed data on monthly bases indicated that in general a better correlation is observed between the histograms and MEP-based functions compared to the Weibull function for all months. Furthermore, for majority of months all four orders showed a good agreement between measured data and the fitted function. Specifically, orders 5, 6 and 7 showed a better reliability of fit of MEP-based function to the measured wind speed data compared to orders 3, 4, 8 and 9. In case of a complete one year data set the correlation of Weibull and MEP based function with the histogram is comparable and is due to the fact that in case of a complete wind speed data set, the number of data points is much larger.
The wind power density values
PE (
Table 4 and
Table 5) are in close agreement with
PA values (column 8,
Table 2) obtained using measured wind speed data compared to
PW values (column 7,
Table 3) calculated using Weibull distribution function. For example in the month of May
PE is 719.88 W/m
2 while
PA and
PW values are 719.91 and 634.96 W/m
2, respectively. It is evident that
PE value is almost the same as
PA while
PW is way far from
PA. Similar type of comparisons is found for rest of the monthly and yearly values of wind power densities from two methods see
Table 4 and
Table 5.
Figure 4 shows that the wind power density values computed using Weibull functions are a bit different than the actual wind power density values for each month while those estimated using MEP-based function are comparable with actual power density values. This further proves that MEP-based function is more reliable for the estimation of the measured wind speed data distribution and subsequently the power density values. Although the authors have carried out a detailed analysis of measured wind speed data on monthly bases and have checked for the reliability of fit for the MEP-based function to the actual wind speed data distribution on all orders used (3, 4, 5, 6, 7, 8 and 9) but have found that there is a considerable deviation in the fit function above order 7. This may be attributed to the fact that on a monthly basis the data sample size is smaller compared to the entire year data set. The situation further aggravates as power densities are calculated for higher orders and becomes more pronounced for order 9 which show unrealistic values for some months.
In order to compute the energy generated by the chosen turbine, the continuous probability distribution of wind speed using Weibull and MEP-based function are integrated with the power characteristic curve of the turbine. The energy yield is calculated using NEG Micon NM60/250–1000 kW wind turbine. The technical specifications along with the wind power curve data are given in
Table 6. The power curve of the selected wind turbine is fitted to the Boltzmann Sigmoid function shown in
Figure 5. Fitted parameters for the turbine using Boltzmann Sigmoid function (see Equation (25)) are A
1 = −0.06473, A
2 = 1.07611
, X
0 = 8.8478, d
x = 2.03981. In order to determine a relationship between energy available in the wind and the energy that is converted by a wind turbine, three interconnected factors related to wind turbine needs to be investigated. These factors i.e., wind turbine efficiency (ε), availability (Σ
F) and capacity factor (C
F), in conjunction with wind characteristics, characterize a wind turbine for the location under investigation.
The energy density (
EW), ideal wind energy (
EWTI), and actual wind energy (
EWTA) values calculated for each month and complete one year data set using Weibull function are summarized in
Table 7. The corresponding values of availability factor, wind turbine efficiency, and the capacity factor are also included in this table.
Table 7 summarizes the ideal wind energy (
EETI), and actual wind energy (
EETA) values calculated using MEP-based function for each month at orders 5 and 6 and for a complete one year data set for all four orders 5, 6, 7 and 8. In case of an ideal wind turbine with zero losses the available wind energy can be extracted completely. The energy density values estimated based on an ideal wind turbine are much larger than the values calculated using an actual wind turbine which depends upon the turbine performance curve operating between cut-in and cut-off wind speeds. As seen from the data given in
Table 7 and
Table 8, the values of
EWTI and
EETI are integral multiple of
EWTA and
EETA values.
Furthermore, the data given in
Table 7 and
Table 8 suggests that the actual wind energy produced by a turbine in the month of May and June is higher than in the month of October, but the turbine efficiency for these months for both generating functions lay in the range of 30.85% and 32.06% which is less than the efficiency value in the month of October (40.39% and 41.33%). This apparent conflict between output power and efficiency can be explained by taking into consideration the availability and capacity factors. The availability factor for the months of May and June for both generating function was 97.53% and 99.74% which is much higher than that in the month of October i.e., 78.47% and 84.56%. At the same time the productivity of wind turbine or the capacity factor for May and June for both generating functions was 52.23% and 59.88% which is also higher than that of October i.e., 9.76% and 12.12%. Therefore in order to determine optimum energy output of a turbine and effectiveness of a potential site, all three turbine characteristics must be taken into account. The estimated efficiency values for all months and for the complete year data set are in conformity with the Betz limit.
In the light of above discussion, MEP-based function gives more realistic estimates of wind energy as compared to Weibull based function. Using MEP-based function, the highest estimate of wind energy using the chosen wind turbine is found to be 421,387.76 kWh (see column 5,
Table 8) in the month of May and 90,157.89 kWh which is lowest in the month of October.
4. Conclusions
In this paper the Weibull distribution function and MEP-based functions are fitted to the recorded wind speed data at the Baburband wind site for the year 2010 on a monthly basis. In case of the Weibull function, MMLM is used to estimate the Weibull k and c parameters, whereas in the case of the MEP-based function seven fit orders are used, i.e., orders 3 to 9 for monthly wind speed data and for the entire data. A detailed study of the monthly wind speed data revealed that orders 5, 6 and 7 show good correlation with the measured wind speed data distribution as compared to orders 8 and 9. Additionally for the complete one year data set, MEP-based function using orders 5, 6, 7, 8 and 9 and Weibull function showed good correlation with the recorded wind speed data distribution.
Similarly, for monthly data sets it is observed that using the MEP-based function higher wind energy values were obtained for the fitted turbine as compared to Weibull-based function. However, the estimated values were realistic for orders 5, 6 and 7 for monthly data, whereas at higher orders the estimated values were significantly different compared with the measured values.
The application of the MEP-based function depends on the solution of the set of highly nonlinear equations (see Equation (5)). Furthermore, the presence of exponential and pre-exponential terms in equation leads to additional difficulties in the numerical solution of the problem. Additionally, calculated wind power density value is weighted by a pre-exponential term (see Equations (4), (5) and (16)), which for higher powers diverges and gives unrealistic value. A good fit of a theoretical distribution function to the measured wind speed data requires a good fit of the data as well as a good representation of wind power densities. In this present paper, while applying the MEP-based function to the measured wind speed data, the authors have employed moment orders from ranging from 1 to 9. In the present case MEP-based function with orders 1 and 2 completely failed to fit with the measured data and are not included in the analysis. The results obtained by applying MEP-based function with orders 3 to 9 for measured monthly and yearly wind data are shown in
Table 4 and
Table 5. A criterion for accepting a good fit distribution is set based on the maximum value of the evaluated coefficient of determination (
R2-test) and additionally, the closest agreement of the calculated and actual power densities (
PW and
PA). Based on this criterion, it is observed that for majority of months the application MEP-based function at order 3 performed poorly both in terms of wind speed data (
R2-test results) and wind power density see
Table 4. The application of MEP-based function for the months of January, June, July, September and October using orders 9, 9, 9, 8, and 9, respectively, the
R2-test values are in the range ~0.4–~0.8. For these cases the fitted distribution is said to be not reliable. Except for these months, the
R2-test values at orders 4 to 9 are identical to the first decimal place (~0.9). In majority of months the application MEP-based function at order 7, the wind speed and wind power density representations are consistent with the behavior of the measured wind speed data. In general, moment orders 5 to 7 for the application of MEP-based function are suitable in terms of wind speed and power density, see
Table 4. In case of measured wind speed data over entire year, since the statistically standard deviation decreases as
, where
N is the number of wind speed data points, the
R2-test results are almost identical (see
Table 4) for all orders namely 3 to 9. However, the wind power density representation at orders 3 and 4 are not in good agreement with the measured wind power density. Nevertheless, based on the same criteria the application of MEP based function at order 7 show consistency of the wind speed and wind power density representations with the measured wind speed data.
Although monthly and yearly R2-test result values are higher for the Weibull function as compared to the MEP-based function, for practical wind speed situations where calm spells or zero wind speeds exist at any site, the Weibull distribution always gives a zero value for pdf and fails to predict zero wind speed. Contrary to this using the MEP-based function, the pdf is consistent in predicting zero or calm wind speeds at any site. A complete and reliable fit of the measured wind speed data distribution would result in a reliable estimation of wind power density of a site under study, thereby giving reliable estimates of wind energy extracted using a wind turbine.