1. Introduction
The excessive extraction and consumption of fossil fuels have led to dire environmental pollution. Renewable energy sources, encompassing solar power, biomass energy, wind energy, and hydropower, have witnessed extensive development and utilization. Among numerous renewable energy sources, photovoltaic (PV) power generation holds great importance in ensuring the security, stability, and cost-effective functioning of the electricity system. However, PV power generation exhibits strong randomness and fluctuations that have the potential to significantly disrupt the power grid during large-scale grid integration, ultimately affecting the stability and safety of the power system [
1]. Accurate PV power forecasting can mitigate its impact on the electrical grid. Therefore, enhancing the precision of PV forecasting is vital for bolstering the reliability of solar power generation and developing grid scheduling plans.
On the basis of distinct time scales, PV power output forecasting is primarily categorized as long-term, medium-term, and short-term predictions [
2]. The long-term forecast can be utilized to evaluate the quarterly and annual power generation indicators of power plants and the tasks of power generation, transmission, and power system distribution [
3]; medium-term forecasts are mainly used for the maintenance of electrical systems and PV power plants [
4]; and short-term prediction is beneficial for power sector staff to make generation plans quickly and arrange grid dispatching reasonably [
5]. Due to the significant importance of short-term solar PV power prediction in providing daily power generation planning decisions for the power industry, and achieving efficient and economic dispatch, it has emerged as a focal point of current research.
Currently, the primary research methodologies for PV power forecasting can be classified into physical methods, statistical methods, and hybrid methods [
6,
7,
8]. The physical method calculates the process and principle of PV power generation through physical formulas such as the solar radiation transfer equation. It involves building a physical model and utilizing environmental information, component parameters, and solar irradiance of PV power stations to predict PV power. However, the modeling process using a physics-based approach is complex and cost-intensive, making it unsuitable for short-term forecasting [
9]. Compared to physics methods, statistical approaches employ simpler modeling without requiring complex experimental measurements, thereby possessing better accuracy. The statistical method can be classified into two types: traditional statistical models and artificial intelligence approaches. Traditional statistical models comprise time series analysis [
10], grey theory [
11], regression analysis [
12], etc. Prema and Rao used the time series algorithm to forecast solar power generation, tested the data with different durations, and finally compared the error of the experimental results [
13]. Zhong et al. proposed a multidimensional grey prediction algorithm, exhibiting better predictive accuracy compared to conventional grey models [
14]. Reikard successfully employed an autoregressive model to predict PV power generation, achieving remarkable performance [
15]. The aforementioned approach has demonstrated satisfactory performance in predicting stationary time series. However, solar irradiance is influenced by clouds and seasons, resulting in non-stationary behavior in time series data. Therefore, these models fail to accurately capture the nonlinearity present in the data, leading to subpar predictive capabilities [
16].
To address the aforementioned problems, many researchers have commenced employing artificial intelligence approaches [
17], for instance, support vector machines [
18], extreme learning machines [
19], and neural networks [
20] for PV power forecasting. Li et al. employed the SVM model for short-term PV power forecasting [
21]. Nevertheless, the SVM relies on quadratic programming to determine the support vectors, leading to prolonged training time when dealing with a large number of samples. Al-Dahidi et al. utilized the ELM model to predict PV power [
22]. Although this method achieves satisfactory prediction results, the random initialization of weights and biases for hidden layer nodes in the ELM algorithm led to instability and overfitting issues [
23]. Kim et al. utilized LSTM to predict ultra-short-term PV power [
24]. This approach demonstrates excellent prediction accuracy when applied to large-scale temporal data sequences. However, determining the parameters of the LSTM model can be problematic, as it may not achieve the desired results when applied to other real-world prediction problems.
The hybrid method can leverage the advantages of different single prediction models, ultimately resulting in better predictive efficacy when compared to utilizing a single forecasting method [
25,
26]. Liu et al. used LSTM to predict PV power and built a LSTM prediction model combined with the dragonfly algorithm (DA) [
27]. The experimental outcomes demonstrate that the DA–LSTM model exhibits better predictive accuracy compared to both conventional predictive models and the LSTM model. Zheng et al. established a model for PV power prediction [
28]. This innovative approach harnessed particle swarm optimization (PSO) to effectively optimize LSTM networks. The experimental results indicate a noteworthy enhancement in the forecasting precision of the LSTM model after it was optimized with the PSO algorithm. Tuerxun et al. posited an improved condor search (MBES) algorithm to address the issue of selecting the best hyperparameters for LSTM and established an innovative MBES–LSTM model for predicting short-term power [
29]. The empirical findings indicate that the MBES–LSTM model surpasses the LSTM model in prediction precision. These documents primarily combine LSTM models with swarm intelligence optimization algorithms to form hybrid models to enhance the precision of power prediction.
Recently, an escalating multitude of scholars have amalgamated multiple deep learning models into a hybridized model with the intent of augmenting the precision of model predictions. For instance, Lim et al. established a hybrid approach composed of a convolutional neural network (CNN) and LSTM [
30]. The simulation findings demonstrate that the CNN–LSTM model exhibits favorable predictive performance. When the input temporal sequence expands in length, the information in the sequence is prone to loss, resulting in low prediction precision of the model. He et al. contemplated the bidirectional flow of information and employed a bidirectional long short-term memory network (BiLSTM) for prediction [
31]. By integrating the advantages of both the CNN and BiLSTM, a CNN–BiLSTM solar power prediction model is constructed. The CNN was utilized to extract influential factors’ features, while BiLSTM was employed for chronological prediction. The outcomes demonstrate that this approach effectively reduced training time and outperformed traditional forecasting models.
Through a review of the existing literature, it can be found that the current mainstream method is to combine different models to build a hybrid prediction model, but there is a scarcity of literature focusing on leveraging intelligent optimization algorithms to ascertain the optimal parameters of the hybrid model. Taking the CNN–BiLSTM model as an example, this model improves prediction precision, but it has excessive internal parameters and improper selection may lead to potential overfitting issues. The setting of the learning rate, regularization coefficient, and number of hidden layer neurons directly affects the accuracy of PV power prediction results. The learning rate exerts a significant influence on the training effectiveness of the model, while the regularization coefficient is employed to regulate the complexity of the model, thus preventing overfitting. The number of hidden layer neurons plays a pivotal role in the model’s fitting degree, and these parameters have great randomness. Relying solely on human professional knowledge and historical experience to select parameters cannot guarantee the predictive efficacy of the model. Therefore, it is necessary to choose an appropriate optimization algorithm to combine with the CNN–BiLSTM model to acquire the optimal parameters of the CNN–BiLSTM model. Hence, the snake optimization algorithm is introduced to optimize the parameters of the CNN–BiLSTM prediction model, thereby building a novel short-term PV forecasting model.
The snake optimization (SO) algorithm, motivated by principles of biomimetics, was proposed by Hashim and Hussien in 2022 [
32]. The SO algorithm possesses advantages such as fast convergence, strong exploitation capability, and minimal parameter adjustments, making it suitable for optimizing the CNN–BiLSTM model. However, the SO algorithm also suffers from the drawback of getting trapped in local optima, which affects its optimization effectiveness. Therefore, this study proposes a multi-strategy improved snake optimization (MISO) algorithm, aiming to avoid the algorithm getting trapped in local optima, bolstering its exploratory capacity, enhancing solution accuracy, and effectively tackling the drawbacks of the original algorithm. In addition, the MISO algorithm proposed in this article is applied to optimize the parameters of the CNN–BiLSTM model and the application of the MISO–CNN–BiLSTM model for predicting PV power. The main contributions of this study are as follows:
- (1)
K-means clustering is employed to categorize weather patterns into sunny, cloudy, and rainy for the reduction of the impact of data fluctuations on forecasts. Then, a Pearson correlation analysis is conducted on the historical PV data and meteorological factors that exhibit a high correlation with the power sequence are selected as input data for the predictive model.
- (2)
This study proposes a multi-strategy improved snake optimization (MISO) algorithm, which incorporates multiple optimization strategies to overcome the limitations of the original algorithm. The primary innovations of this approach encompass the subsequent elements: firstly, introducing Tent chaotic mapping to augment the initial population quality of the algorithm; secondly, improving the food quantity threshold to enhance the algorithm’s convergence speed; then, introducing the lens imaging backward learning strategy to enable the algorithm to obtain dynamic and inverse solutions in lens backward learning, further augmenting the algorithm’s optimization prowess; and finally, introducing the optimal individual adaptive disturbance strategy to reduce the possibility of the algorithm getting trapped in local optima.
- (3)
The optimization performance of the MISO algorithm is evaluated utilizing six classic test functions and compared with the grey wolf optimizer (GWO), whale optimization algorithm (WOA), and SO algorithms. The simulation findings indicate that the MISO algorithm outperforms other basic algorithms in convergence and solution precision. Next, the MISO algorithm and CNN–BiLSTM model are combined to establish the MISO–CNN–BiLSTM PV prediction model. Validated with real historical data from a specific location in Ningxia, China, the proposed method exhibits good precision under sunny, cloudy, and rainy scenarios.
The remaining sections of this paper are as follows:
Section 2 introduces the PV power prediction model and multi-strategy improved snake optimization algorithm.
Section 3 elucidates the principles of the K-means clustering algorithm, analyzes the factors influencing PV power generation, and identifies model inputs.
Section 4 provides an analysis and discussion of the findings from the simulation experiment. Finally,
Section 5 provides the conclusion of this study.
5. Conclusions
Due to the inherent uncertainty in PV power forecasting, particularly in situations with unpredictable weather changes, the precision of electricity predictions has become a significant technical challenge. This article employs K-means clustering to classify historical PV data, resulting in three distinct subsets: sunny, cloudy, and rainy. Based on these subsets, the corresponding PV power generation for distinct weather scenarios is forecasted. To enhance the precision of PV power prediction under varying weather types, this study utilizes the MISO–CNN–BiLSTM model. The empirical findings evince that the MISO–CNN–BiLSTM model surpasses the SO–CNN–BiLSTM, CNN–BiLSTM, BiLSTM, LSTM, and BP models in predicting performance. The conclusions of this research are as follows:
- (1)
Combining multiple enhancement techniques enhances the optimization performance of SO. The integration of the original SO with the Tent chaotic initialization, lens imaging reverse learning strategy, and optimal individual adaptive perturbation strategy significantly improves the overall performance of MISO.
- (2)
The simulation findings demonstrate that the established model has excellent predictive prowess. In various weather conditions, the MISO–CNN–BiLSTM model demonstrates significantly lower MAE and RMSE values in comparison to the other models presented in this research, providing evidence of its high prediction accuracy. Furthermore, the values of the MISO–CNN–BiLSTM model surpass those of other models mentioned in this paper, substantiating its superiority and reliability.
- (3)
The MISO–CNN–BiLSTM model can accurately forecast PV power, which is helpful for power grid system planning and dispatching and reduces the dispatching cost of the power system.
The MISO–CNN–BiLSTM PV power prediction model proposed by this research can achieve accurate prediction of PV output power under different weather scenarios. This contributes to enhancing the utilization efficiency of renewable energy generation, ensuring the security of renewable energy power systems. Moreover, it plays a decisive role in advancing the growth of the renewable energy sector. In addition to PV prediction, the model can also be used for power prediction of other similar renewable energy sources and may become a universal renewable energy power prediction method, which can promote the wider use of renewable energy.
This study has limitations. Although this study provides forecasts for short-term PV generation across three distinct weather conditions, it overlooks the consideration of numerous extreme weather phenomena such as rainstorms, snowstorms, sandstorms, haze, etc. In the future, research should be conducted on the power prediction of PV generation under inclement meteorological conditions, so as to enhance the dependability of the prediction model.