1. Introduction
Today, photovoltaics stands as one of the most crucial technologies for achieving green energy production goals and advancing towards a sustainable future. However, the inherent variability of solar energy poses a significant challenge when it comes to planning energy consumption. Predicting energy production from photovoltaics for the following day is a critical task that offers two substantial advantages in effective energy management:
- -
Consumer and energy communities can optimize their electricity usage based on forecasted energy availability. For instance, a consumer can strategically time the operation of appliances to reduce reliance on the grid, taking into account dynamic energy prices [
1,
2]. Various control systems are instrumental in scheduling electrical loads to ensure that they stay within the installed power capacity, and this coordination is significantly enhanced when integrated with a production forecasting system.
- -
Photovoltaic electricity production forecasting aids grid operators in planning energy distribution. The erratic nature of energy generated from renewable sources poses a challenge to maintaining grid frequency stability. Having prior knowledge of these fluctuations is increasingly crucial, especially as renewables are expected to contribute a larger share of the energy supply in the near future.
The prediction of energy generation can be categorized into short-term and long-term forecasts, with the prediction for the next day falling into the former category. These forecasting models are grouped based on the methodologies that they employ, with the most prominent categories being statistical, physical, and artificial intelligence (AI) methods.
The statistical approach involves seeking mathematical formulations that establish connections between input variables and electricity production. A widely used method within this category is the autoregressive moving average (ARMA) [
3]. Wang et al. [
4], on the other hand, used partial functional linear regression models. Although other techniques have evolved from ARMA [
5,
6], these approaches often provide less reliable results when addressing sudden changes in solar radiation. The inherent rapid fluctuations are not adequately captured by statistical methods.
Physical models, on the other hand, are based on equations that enable thermal and electrical modeling of photovoltaic panels. Various studies have proposed models capable of predicting both PV energy production and panel temperature [
7]. These models are often built upon energy balance methods [
8,
9], which may utilize one- [
10] or two-dimensional approaches [
11]. Additionally, computational fluid dynamics (CFD) models have been introduced [
12]. However, these methods require real-time access to climatic variables to accurately assess the thermoelectric behavior of PV panels, and such precise data are often unavailable for forecasting purposes, particularly for the following day.
The objective of this study was to employ an artificial intelligence approach to predict photovoltaic production. Artificial intelligence has assumed a pivotal role as a predictive tool in various applications, with a significant focus on solar energy production. Machine learning techniques, particularly diverse neural networks, have garnered extensive attention in the recent literature for forecasting PV production [
13,
14,
15]. For instance, Pedro et al. [
16] conducted a comparative analysis of several forecasting methods to evaluate their accuracy in predicting solar power output from a 1 MWp, single-axis tracking photovoltaic power plant in California. Their findings concluded that artificial neural network (ANN) models outperformed other methods.
In the context of monthly solar power output forecasting, a method employing seasonal decomposition and least-squares support-vector regression has been proposed [
17]. An ANN has been integrated with data processing, input variable selection, and external optimization techniques to forecast PV system power output [
18]. Furthermore, an ANN has been indirectly utilized for PV power prediction through solar irradiance forecasting [
19]. A multilayer perceptron (MLP) model was suggested to forecast 24 h solar irradiance based on daily solar irradiance and air temperature data from an experimental database. The study included a practical application comparing the actual power output from a rooftop PV plant in the Municipality of Trieste with the power calculated using 24-h-ahead solar irradiance forecasts.
In the Republic of Korea, an ANN was employed to model urban energy supply plants and renewable energy availability, integrating energy-related legal regulations, standards, and energy plant facilities into an energy geographic information system database [
20]. Forecasting power generation 24 h in advance using a radial basis function network (RBFN) was proposed in [
21]. This technique directly forecasts PV systems’ power output using historical records and real-time meteorological data. A recurrent neural network was introduced to predict PV power in a peak zone without relying on future meteorological forecasts, solely using PV power outputs and morning meteorological observations [
22]. In another study, a seven-parameter electrical model and a feedforward neural network were cascaded to test multicrystalline PV panels’ performance, achieving mean bias error deviations of less than ±1% [
23]. A recurrent neural network model with long short-term memory was developed to recognize temporal patterns in data collected from 164 PV sites over 63 months, including weather conditions and estimated solar irradiation. The model achieved a normalized root-mean-square error of 7.416% and a mean absolute percentage error of 10.8% [
24]. Almonacid et al. [
25] proposed a methodology for forecasting PV output one hour ahead using a dynamic artificial neural network. This approach employed two ANNs for predicting weather variables (solar irradiance and air temperature) and a third ANN to estimate the output power of a PV module. A fourth ANN incorporated the output of the preceding ANN and the PV configuration to provide final forecast values. Additionally, a combination of a linear regression model and an ANN was utilized to predict the performance of soiled PV modules using solar irradiation and ambient temperature [
26]. Notably, an ANN has also been applied to suggest an active cooling algorithm based on fan cooling for the back surface of PV panels [
27]. At the University of Malaya, an extreme learning machine (ELM) algorithm was developed to forecast the maximum power point tracking (MPPT) of three grid-connected PV plants, considering forecasting horizons of 1 h and 1 day ahead [
28].
The majority of the models mentioned in the previous discussion utilized solar radiation as their primary input data and achieved highly satisfactory results. However, in practice, obtaining accurate solar radiation information as an input in advance using forecast models can be challenging. On the other hand, numerous studies focus on predicting solar radiation and subsequently employ a physical model to estimate photovoltaic output. Undoubtedly, solar irradiance plays a pivotal role in determining the electrical performance of PV panels. Nevertheless, the electricity production is not solely defined by solar irradiance. The conversion efficiency also relies on the cell temperature, which, in turn, is influenced by various boundary conditions. A comprehensive analysis necessitates that the neural network directly provides the electrical output, allowing it to factor in the panel’s conversion efficiency.
In this study, two models are proposed, both based on artificial neural network (ANN) technology, with the goal of predicting the power output of a silicon photovoltaic module. In a departure from the existing literature, these models predict the PV module’s performance without relying on solar radiation data as inputs. The aim is to perform the forecast using the numerical weather prediction (NWP) data that are easily accessible through websites. Specifically, this relies on hourly temperature, relative humidity, and wind speed data. These models have the potential to empower individual energy communities to create their own forecasts. This approach holds significant promise, as it only requires standard meteorological data for each location. Furthermore, this work offers insights into determining the optimal number of neurons for the neural network architecture and ranks the most critical information for short-term forecasting. This information can serve as a foundational reference for future research endeavors aimed at exploring this problem further.
The remainder of this paper is structured as follows:
Section 2 introduces the applied methodology, presenting two distinct models, both starting from the hourly values of three selected quantities. The first model incorporates numerous additional inputs derived from daily processing of these hourly values, while the second model relies solely on the essential information. In
Section 3, the study’s outcomes are revealed. This section outlines the defined network architectures and presents the results of tests conducted using both experimental data and NWP data as inputs. Additionally, it includes a sensitivity analysis aimed at identifying the most critical variables in the models.
4. Conclusions
Photovoltaics is emerging as a pivotal technology in harnessing renewable energy sources, playing a crucial role in the transition toward a decarbonized energy future. The primary objective of this study was to forecast the electricity generated by photovoltaic panels on the following day. This prediction was achieved using easily accessible input data obtained from weather forecast websites (specifically, air temperature, relative humidity, and wind speed).
To accomplish this goal, two forecasting models with hourly resolution were developed based on artificial neural networks. The first model incorporates various data as inputs, including hourly values, and is supplemented with processed daily values to aid in identifying the type of day (i.e., overcast, clear, or partly cloudy). Subsequent analysis enabled the determination of the relative importance of each variable, leading to the elimination of redundant or unhelpful information. Model2 shares the same objective as the initial model but employs a reduced set of input data while maintaining a similar accuracy. The training process utilized experimental data gathered over three years at the University of Calabria. Notably, it was found that only a single hidden layer for the feedforward networks was sufficient, eliminating the need for multiple hidden layers. The key conclusions drawn from this study include the following:
- (1)
The day of the year is not important for the prediction, as similar information is provided by the minimum and maximum daily temperatures.
- (2)
The daily minimum relative humidity correlates with the daily PV energy production, with a good Pearson’s coefficient: −0.88.
- (3)
The models are stable if the input variables have a constant offset error.
- (4)
The most valuable information for the prediction is the hourly temperature trend.
- (5)
The models provide very good estimates when using experimental data as inputs. The coefficient of determination is about 0.95, with an RMSE of about 15 Wh.
- (6)
The accuracy of the forecast slightly decreases when the input information is taken from weather forecast websites. The coefficient of determination was 0.879 in the two weeks analyzed. The RMSE was 24.9 Wh. The accuracy of the forecast is closely linked to the accuracy of the NWP data. The results are dependent on source data, but they are nevertheless appreciable.
- (7)
The good behavior of Model2 implies that it is not necessary to provide too much information. Hourly trends of the three meteorological quantities and the daily minimum temperature are sufficient.
The limitation of this study is that the networks were trained on local climatic conditions. It would be interesting to assess whether they also perform well in different locations. Despite this limitation, our research has yielded valuable insights into electricity generation forecasting, addressing the challenges posed by the variable availability of solar sources—a concern that is gaining significance. The findings derived from experimental measurements offer valuable information for understanding the factors that exert the most influence on forecasting accuracy.
The practical implications of this work extend to utilities, where the economic impact is manifested through energy savings achieved via effective scheduling of electrical loads and an increase in self-consumed energy. The model, reliant on easily accessible data from websites, is usable by everyone. Leveraging public data ensures the seamless expansion and integration of this technology into control systems, facilitating its broader applicability. The incorporation of these models into smart grid frameworks represents a promising trajectory. The models’ hourly resolution aligns seamlessly with the dynamic nature of smart grids, enabling real-time adjustments and fostering an interconnected, responsive energy ecosystem. Microgrid architectures, which often rely on renewable sources, could benefit from the precision of these models in adapting to the fluctuations inherent in distributed energy systems. By facilitating informed decision-making in energy consumption patterns, these models contribute to the broader mission of transitioning towards sustainable and environmentally conscious energy practices. Moreover, as technological landscapes evolve, the adaptability of these models can be explored in conjunction with emerging technologies such as IoT (Internet of things) devices and advanced sensors. In essence, the hourly PV models’ versatility positions them as catalysts for holistic advancements in the realm of renewable energy utilization.