1. Introduction
Wind power generation is replacing power generation via extensive gas-flow and uses wind to drive wind turbines. In 2000, to protect the environment, the Taiwanese government actively promoted the use of clean energy to reduce the greenhouse gas emissions generated by traditional power generation methods such as thermal power generation. The Taiwanese government’s expectations for wind power generation are very high. The government has developed an offshore wind power facility, the main goal of which is to generate enough electricity so that renewable energy can replace nuclear power generation. The vision of the Taiwanese government is to build a strong support industry by manufacturing the necessary wind turbine components, towers, and underwater cables for coastal engineering; by building underwater foundation pile; and by installing generators several miles offshore. According to statistics from the International Energy Agency, offshore wind power currently accounts for only 0.3% of global power generation, but experts have noted that wind power generation is expected to rapidly grow in the next 20 years, representing a business opportunity of up to one trillion US dollars. Therefore, the accuracy of wind power forecasting is a very important issue that could help governments to engage in effective policy planning. In recent years, many studies have investigated wind speed and power forecasting and adopted various prediction models to improve wind power generation forecasting. Lu et al. [
1,
2] used the Takagi–Sugeno fuzzy model to predict wind speed and power. Yu et al. [
3] developed hybrid models that combine the wavelet transform (WT) with the support vector machine (SVM), gated recurrent units network (GRU), standard recurrent neural network (RNN), and LSTM models for wind-speed forecasting. Using the WT can decompose the original wind time series into several subseries with better behavior and greater predictability. The results indicated that the hybrid models WT-RNN-SVM, WT-LSTM-SVM, and WT-GRU-SVM obtained the best performance. Zjavka and Mišák [
4] noted that wind output power forecasting entails chaotic large-scale patterns and has a high correlation with atmospheric circulation processes. The authors adopted the polynomial decomposition of the general differential equation, which represents the elementary Laplace transformations of a searched function, to predict the daily wind power. The results showed that their method can obtain lower casual errors due to using the decomposition method. Liu et al. [
5] used combined wavelet packet decomposition with a convolutional neural network and convolutional long short-term memory network to forecast one-day wind speed. In the one-day wind speed time series, the model was able to obtain robust and effective performance. Toubeau et al. [
6] adopted the LSTM to efficiently capture the complex temporal dynamics needed for wind power prediction.
When the RNN is applied to long-term dependence, the processing unit will continue to add and calculate the previously memorized information, causing the neural network to explode or disappear and eventually leading the network to collapse. Recurrent neural networks are weak in terms of learning long-term dependence [
7]. To improve the shortcomings of recurrent neural networks, Hochreiter and Schmidhuber [
8] proposed the long short-term nemory (LSTM) network in 1997. The long short-term memory model was developed on the basis of recurrent neural networks and is of a cyclical type. LSTM is a neural network architecture that adds the forget gate, input gate, and output gate to the processing unit in the hidden layer, the purpose of which is to read more past information. The model will first determine whether the information is useful, and then decide whether to add or delete the information to increase the ability of the neural network to be reliable over a long period.
Comparing the cyclic neural network and the long short-term memory model, it can be seen that the cyclic neural network will only receive the information calculated in the previous pass, while the long short-term memory model not only receives the information calculated in the last iteration but also all past messages. The long short-term memory model not only retains the advantages of the recurrent neural network but is also able to handle short-term dependencies. Therefore, the LSTM model can solve many tasks that the recurrent neural network could not solve in the past [
7]. The development of short-term memory models has thus far provided help in various processing tasks, and they are widely used in various fields such as speech recognition [
9,
10], handwriting recognition [
11], and predictions [
12,
13,
14].
Table 1 summarizes the long and short-term memory models used to make predictions in related literature since 2015. Tian and Pan [
15] used LSTM to predict the time series of car traffic, and the prediction time interval was divided into four types; the prediction accuracy of the five models was then compared and generalized. The study indicated that the prediction accuracy rate of LSTM was the best. This showed that LSTM has a high predictive ability and high generalization ability and that LSTM is better than RNN. Liu et al. [
16] explores the prediction accuracy of LSTM on neonatal brainwave maps with different numbers of neurons. To verify the feasibility of LSTM, the study also compared LSTM with RNN. The results showed that LSTM is better than RNN. Janardhanan and Barrett [
17] used LSTM to predict the computer CPU usage of Google’s data center in time series. Based on the results of multiple computers, the mean absolute percentage error (MAPE) value of the LSTM model was 17–23%, and that of the ARIMA model was 37–42%. Siami-Namini et al. [
18] adopted LSTM to predict six financial indexes and six economic indexes. In the financial and economic average forecast results, the error rates of LSTM forecasting were 87% and 84% lower than that of ARIMA, respectively. Phyo et al. [
19] used LSTM and the Deep Confidence Network (DBN) to predict the power load in Thailand, and the prediction accuracy of the two models was compared. The results showed that LSTM offers better predictive ability. Fan et al. [
20] developed an integrated method that combined the ARIMA model with the LSTM model. Shahid et al. [
21] developed a novel genetic long short-term memory (GLSTM) method; this method improved wind power predictions from 6% to 30% compared to existing techniques. Zhang et al. [
22] developed a convolutional neural network model based on a deep factorization machine and attention mechanism (FA-CNN). The results indicated that FA-CNN obtained better performance than the traditional LSTM.
In this study, the prediction model adopts three LSTMs with fuzzy seasonal indexes to approach the fuzzy set’s upper and lower bounds, respectively, as well as mode prediction values. This is a novel prediction model for wind power output forecasting. The rest of this paper is organized as follows.
Section 2 introduces the fuzzy seasonal LSTM (FSLSTM) in detail, which include fuzzy seasonal decomposition and fuzzy LSTM technology.
Section 3 presents the experimental results of the FSLSTM for wind power output prediction. Finally, we draw conclusions and make suggestions for future research in
Section 4.
3. A Wind Power Output Example and Empirical Results
Energy conservation and decreasing carbon are very important management issues for the global power industry. To demonstrate its concern regarding the global warming issue and to comply with the government’s Sustainable Energy Guidelines, the Taiwanese government is actively promoting the use of clean energy. The Taipower company has built 17 wind energy power stations in Taiwan that record monthly data on the total power output. All experimental data can be download from the National Development Council in Taiwan (
https://data.gov.tw (accessed on 21 July 2020)). In this study, we selected three wind energy power stations: the Shimen, Taichung, and Mailiao wind power plants.
Figure 3 and
Table 2 depict the monthly generated output power (units: kilowatt-hours) from these wind power stations during the period from January 2017 to June 2020 (the total number is 42). In this study, the monthly data were divided into three sets: firstly, a training set was employed to determine the optimum forecasting model during the period from January 2017 to December 2018 (the number of samples in the training set was 24); secondly, a validation set was employed to prevent the overfitting of the different models during the period from January 2019 to December 2019 (the number of samples in the validation set was 24); finally, a testing set was employed to investigate the performance of the different models during the period from January 2020 to June 2020 (the number of samples in the testing set was 24). The percentages of training, validation, and testing sets were 57%, 29%, and 14%, respectively.
Figure 3 clearly indicates that the measured time series feature seasonal data and three types of cycles for the different wind power plants. The Mailiao wind power plant can generate greater power because the Mailiao wind power plant features the largest number of wind-driven generators in Taiwan. Moreover, a larger power output can be obtained during winter in Taiwan as a result of the northeast monsoon.
Table 3 depict the fuzzy seasonality index with
k ranging from 1 to 12 from the selected wind power stations. In addition, the mean absolute percentage error MAPE(%) was used to measure the forecasting accuracy. Equation (12) illustrates the expression of MAPE(%):
where
M is the number of forecasting periods,
Ai is the actual production value at period
i, and
Pi is the forecasting production value at period
i. Moreover, the RMSE is employed to evaluate the training error of FSLSTM, which can be expressed as follows:
Figure 4 shows the training error of the FSLSTM in three wind power stations, adopting the Adam algorithm. We can observe that the FSLSTM can obtain a lower RMSE training error (smaller than 0.2) in three wind power stations.
In this study, the FSLSTM, LSTM [
8], ARIMA(1, 0, 0) [
25], generalized regression neural network (GRNN) [
26], back propagation neural network (BPNN) [
27], least square support vector regression (LSSVR) [
28], and seasonal autoregressive integrated moving average (SARIMA (1, 0, 0) (1, 0, 0)
12) [
25] models were used to forecast the monthly wind power output datasets in selected stations in Taiwan. The construction of LSTM is similar to that of the FSLSTM (see
Section 2.1). The LSTM network also adopted the Adam optimization algorithm to search for optimal parameters. The ARIMA is similar to SARIMA (see
Appendix A), with a difference in seasonal parameters. The construction and parameter (σ) of the GRNN is shown in
Appendix B. The parameter (σ) of the GRNN was set to 1. In this study, a well-known intelligent computing machine, BPNN, is also adopted to compare prediction models. In the BPNN, the input layer has one input neuron to catch the input patterns, the hidden layer has ten neurons to propagate the intermediate signals, and the output layer has one neuron. For more training assignments in the BPNN, the hyperbolic tangent sigmoid function is employed as the activation function in the hidden layer, the pure-line transfer function is employed in the output layer as the activation function, and the gradient training is adopted as the learning algorithm for the BPNN. The LSSVR is a popular prediction model in time series problems. For the main constructs of the LSSVR, readers can be refer to [
28], and the regularization parameter in the experiment was set to 1. The Radial Basis Function (RBF) Kernel Trick was employed in the LSSVR, and the parameter (σ) of the RBF was set to 0.01.
Table 4 depicts the training error with various prediction models. The proposed FSLSTM, LSTM, and GRNN approaches could obtain lower training errors, which means that the training models of the three approach achieved better performance.
Table 5 illustrates the actual values and experimental results of the FSLSTM model with the mode (M) and upper (U) and lower (L) bounds from January 2020 to June 2020 for the Shimen wind power plant.
Figure 5a makes a point-to-point comparison of the actual values and predicted values of FSLSTM. As shown in
Figure 4, the peak power output was in April 2020, which was not easily observed in the training dataset.
Figure 5b shows a comparison of the actual values and predicted values of ARIMA, SARIMA, GRNN, BPNN, LSTM, and FSLSTM-L for the Shimen wind power plant.
Table 5 shows the experimental results and MAPE(%) obtained by various models. The ranking of MAPE(%) is as follows: GRNN < FSLSTM-L < FSLSTM-M < LSTM < ARIMA < SARIMA < FSLSTM-U < BPNN < LSSVR.
Table 4 indicates that the GRNN obtained the smallest MAPE(%), showing the best performance. However,
Figure 5 shows that the predicted value of the GRNN could not capture the trend of power output at the Shimen wind power plant. The FSLSTM-L model was able to efficiently capture the trends of the data by using the fuzzy seasonal index, although the MAPE(%) of the FSLSTM-L was higher than that of the GRNN in the example. Thus, the proposed FSLSTM model is suggested to serve as a prediction model for power output for the Shimen wind power plant.
Table 6 illustrates the actual values and experimental results of the proposed model for the Taichung wind power plant.
Figure 6a makes a point-to-point comparison between the actual values and predicted values of the FSLSTM at the Taichung wind power plant. Downward trends of power output can be observed in
Figure 5.
Figure 6b illustrates a comparison between the actual values and predicted values of the ARIMA, SARIMA, GRNN, BPNN, LSTM, and FSLSTM-M models for the Taichung wind power plant.
Table 6 shows the experimental results and MAPE(%) obtained by various models for the Taichung wind power plant. The ranking of MAPE(%) is FSLSTM-M < SARIMA < FSLSTM-U < GRNN < LSTM < LSSVR < FSLSTM-L < BPNN < ARIMA.
Table 5 indicates that the FSLSTM-M obtained the smallest MAPE(%), which means that FSLSTM-M achieved the best performance in this example. Moreover,
Figure 6 shows that the predicted value of the FSLSTM-M was able to capture the trend of power output at the Taichung wind power plant. Moreover, the two seasonal models, FSLSTM-M and SARIMA, obtained better performance than the other models, possibly because the power output at the Taichung wind power plant has a seasonal influence. Thus, the proposed FSLSTM model is also suggested to serve as a prediction model for power output at the Taichung wind power plant.
Table 7 illustrates the actual values and experimental results of the proposed model for the Mailiao wind power plant.
Figure 7a also makes a point-to-point comparison of the actual values and predicted values of FSLSTM at the Mailiao wind power plant. As with the Taichung wind power plant, downward trends of power output can be observed in
Figure 7 for the Mailiao wind power plant. Because the Mailiao wind power plant provides a larger quantity of wind power generation than the other power plants, Mailiao outputs more power in KWh.
Figure 7b shows a comparison between the actual values and predicted values of ARIMA, SARIMA, GRNN, BPNN, LSTM, and FSLSTM-M models at the Mailiao wind power plant.
Table 7 shows the experimental results and MAPE(%) obtained using various models for the Mailiao wind power plant. The ranking of MAPE(%) is FSLSTM-M < SARIMA < FSLSTM-U < GRNN < LSTM < LSSVR < FSLSTM-L < BPNN < ARIMA.
Table 6 indicates that the FSLSTM-M obtained the smallest MAPE(%), which means that the FSLSTM-M achieved the best performance in this example. Both seasonal models, FSLSTM-M and SARIMA, obtained better performance than the other models and were able to capture the trend of power output for the Mailiao wind power plant, possibly for the same reasons as those of the Taichung wind power plant. Moreover, the Mailiao region is very close to the Taichung region in Taiwan. Thus, the ranking of MAPE(%) in the Mailiao region is the same as that of the Taichung region. Again, the proposed FSLSTM model is suggested to serve as a prediction model for wind power output at the Mailiao wind power plant.
By reviewing the three forecasting examples using the FSLSTM model, some findings can be concluded, as follows: (1) the FSLSTM model can efficiently handle seasonal influence. In the Taichung and Mailiao regions, the seasonal influence of wind power output can be observed. (2) In all examples, the FSLSTM model could obtain better performance and more accurately capture the trends of wind power output. This performance was not observed for the traditional LSTM in the three examples. (3) For the three different types of wind power output, the FSLSTM-M model obtained better performance than almost all other models. The FSLSTM-M model is thus recommended as a prediction model for wind power output.