The substitution of renewable energy sources to the electricity grid has been remarkable during the last decade in Europe and elsewhere. European and international legislation for a decrease in greenhouse gas emissions played an important role in this expansion. According to IRENA, at the end of 2021, the capacity of installed global renewable energy reached 3064 GWp. This number comprises about 40% hydropower, 28% solar power, 27% wind power, and 5% other renewable power sources [
1]. Renewable electricity production capacity has also shown a significant increase in Greece, where 41.48% of total electricity was generated from renewable energy sources (RES) in 2021 [
2]. A remarkable number of fossil-fuel power-generation plants have been phased out from the Greek system. This has resulted in excessive electricity prices in the market as it coincided with significant increases in international natural gas prices in 2022, causing a new energy crisis that is yet to be contained. This situation creates challenges for further renewable energy investments, especially in photovoltaic (PV) systems.
1.1. Photovoltaic Systems Applications
An intense interest for investment is observed in this area by citizens, companies, energy communities, and the public sector. There are different modes of photovoltaic application contracts in Greece. Net metering, virtual net metering for energy communities, feed-in tariffs, and feed-in premiums are the main modes for grid-connected systems in both plants and buildings with a typical lifespan of 20–25 years. PV systems mainly support distributed energy generation to achieve effective integration into grids and micro-grids. There are a large number of studies one could cite as examples, which address the combination of ground-coupled heat pumps for upgraded post-COVID-19 ventilation systems [
3], rooftop PV combination with hybrid condensing radiant tubes’ heating systems [
4], and incorporation into residential buildings with air-to-water heat pump systems [
5]. Accelerated vehicle electrification is pushing for further expansion of PV systems. This includes smart office buildings’ energy systems with rooftop PV systems exploiting electric vehicle battery storage [
6], innovative building blocks in Germany with combined heat and power (CHP), battery storage and exploitation of electric vehicles storage [
7], and modular packages of electric vehicle charging stations in China, designed to charge 1000 electric vehicles using PV and battery energy storage systems [
8]. PV systems in combination with large-capacity battery systems is another important application area [
9]. Here, an efficient energy management system that handles on-site PV production with battery energy storage minimizes power exchange with the grid. Interest in classic investments in PV parks is significant and exploits double utilization of land with agrivoltaics, which utilize the land around the PV panels for food-producing crops [
10]. Jamil et al. studied the potential of agrivoltaics in Canada using bifacial PV for single-axis tracking and vertical system configurations [
11]. The combination of photovoltaic systems with hydrogen systems is gaining increased popularity in research. Important technical parameters of an integrated PV–hydrogen system include the PV tracking system coefficient, PV conversion efficiency, electrolyzer efficiency, and electrolyzer degradation coefficient [
12]. Other studies have analyzed a PV-based fuel-cell power system [
13], a CES (composite energy station) combined with a PV power-generation system, fuel cell, hydrogen production, hydrogen storage, hydrogenation, and charging, in order to supply energy for electric vehicles (EVs) and hydrogen fuel-cell vehicles (HFCVs) [
14].
Now, the uncertainty in predicting solar radiation is a major issue affecting the successful forecasting PV power output, which is essential to sizing, control system optimization, and economic analysis of the above-mentioned systems. Exploitation of data from grid-connected photovoltaic systems is a valid approach that provides significant information regarding this problem. Furthermore, performance analysis of grid-connected PV systems supports PV power forecasting. Economic evaluation of this type of project depends on a good understanding and modeling of degradation of photovoltaic systems that should rely on actual performance data. In the real world environment, all kinds of modules exhibit lower efficiency compared to the manufacturers’ specifications [
15]. Performance analysis gives a clear view of systems’ performance under real life conditions. Thus, it is an important task to be tackled with scientifically sound approaches with an important impact on both the design and evaluation stages of new and existing plants.
1.2. Photovoltaic Performance Analysis Approaches
Performance analysis is based on mathematical models, linear regression models, and the use of specialized software and is widely supported by the use of neural networks.
Neural networks (NNs) have proven invaluable to the performance analysis of renewable energy sources, especially photovoltaics and power forecasting of generation and consumption. All the applications differ in various aspects, e.g., input data, data preprocessing, ANN type and structure, parameter configuration, hybrid application, and performance [
16]. The primary neural network types utilized for power forecasting include multi-layer perceptron (MLP), the feed-forward deep neural network (DNN [
17]), long short-term memory networks (LSTMs), and convolutional neural networks (CNNs) [
17]. Power-generation forecasting studies of PV systems use different types of NNs, with a selection from several inputs and outputs. A distinction of the various approaches is based on the type and standardization of available data. López Gómez et al. combined data from a numerical weather prediction model with an artificial neural network (ANN) model in order to forecast power generation from a PV system using actual temperature and solar irradiation data [
18]. Gopi et al. developed a PV system annual yield and performance ratio (PR) forecasting model based on three environmental input parameters: solar irradiance, wind speed, and ambient air temperature. They employed data from a 2 MWp grid-connected PV system and three different machine-learning techniques: an adaptive neuro-fuzzy inference system (ANFIS); response surface methodology (RSM), which is a combination of statistical and mathematical approaches and allows one to determine the independent variable that, when changed, results in the responsible variable having an optimal value; and an ANN [
19]. Kolsi et al. compared various artificial intelligence (AI) models based on daily data and seven seasonal models were employed to predict solar potential: simple average (SA); simple moving average (SMA), nonlinear autoregressive (NAR); support vector machine (SVM); Gaussian process regression (GPR); and NN. The results were evaluated based on the root mean square error (RMSE) and mean absolute percentage error (MAPE [
20]). Lim et al. proposed a hybrid CNN–LSTM model. The CNN classifies weather conditions, while the LSTM is trained to classify power-generation characteristics. Typical results produce an MAPE of 4.58% on a sunny day and 7.06% on a cloudy day [
21]. Suresh et al. proposed a convolutional neural network (CNN) approach consisting of different architectures, namely multi-headed CNN and CNN–LSTM based on data preprocessing techniques to make accurate forecasts using irradiance, module temperature, ambient temperature, and wind speed for short-term forecasting [
22]. Andrade et al. used an MLP, RNN, and LSTM to forecast photovoltaic energy from data collected from the PV system in Brazil. The MLP performed adequately, requiring less training time [
23]. Kim et al. proposed a combination of a two-step NN bi-directional long short-term memory (BD-LSTM) model with an ANN model using exponential moving average (EMA) preprocessing of historical hourly input data of horizontal radiation, ambient temperature, and surface temperature [
24]. Preda et al. proposed an SVM, and data were collected from a cheap data logger and from an API weather station with good prediction results in the estimation of the PV generated power, supporting micro-grid operation [
25]. Meltek et al. proposed a model to predict the effect of the panel electric power of a photovoltaic thermal (PV-T) system using LSTM and MLF. Mean absolute error (MAE), RMSE, MAPE, and R
2 correlation coefficients were used as performance metrics [
26].
Another important aspect of performance analysis is fault detection using neural networks, IR-thermography electroluminescence images, or a combination thereof. Neural network fault diagnosis of PV systems is generally based on historical data, relevant data related to voltage, current, power, and I–V curves. Images are also employed as inputs [
27]. Samara et al. proposed a fault-diagnosis algorithm based on a nonlinear autoregressive exogenous (NARX) neural network that can detect multiple faults, such as open and short-circuit degradation, faulty maximum power point tracking (MPPT), and conditions of partial shading [
28]. Onim et al. proposed a CNN to detect dust accumulation on PV panels using a dataset of images of dusty and clean panels. The results demonstrated high accuracy levels [
29]. Selvaraj et al. proposed a method for accurate diagnosis of environmental faults using CNN and thermal images for classification of these faults [
30]. Lu et al. proposed a fault-diagnosis method to diagnose different PV faults using a proposed dual-channel CNN, which automatically extracts features and weights them to diagnose partial shading conditions and open-circuit faults [
31]. Yu et al. proposed dimension-reduction technology mapping multiple-sequence signals to a sequence of images processed by a CNN. Validation carried out on self-made solar power stations proved to be effective in identifying key operation conditions from historical data with negligible loss of features at the presence of mismatched phenomena [
32]. Dust accumulation is an important factor; thus, many researchers use neural networks in order to study this effect [
33,
34].
Except from ANNs, performance analysis procedures are based on statistical or other performance metrics. Most of these approaches are based on comparative analysis. Iqbal et al. proposed a fault-detection method based on string level comparison of DC power of actual and simulated PV plants with the aid of a statistical tool based on Student’s t-test [
35]. Minai et al. analyzed performance data of a 467.2 kWp grid-connected PV system using array, inverter and system efficiency, performance ratio (PR), and capacity utilization factor (CUF). These parameters are evaluated and compared with similar systems in different regions of the world [
36]. Karahüseyin et al. analyzed the performance of a mid-scale crystalline silicon (c-Si) PV system with different orientations and tilt angles in the same region for four years of outdoor exposure, using statistical methods to calculate PLRs; seasonal and trend decomposition using locally weighted scatterplot smoothing (STL); classical seasonal decomposition; and year-on-year methods coupled with PR, temperature-corrected PR, and weather-corrected PR [
37]. Agyekum et al. used the PR, degradation, energy-loss prediction, and the PVsyst simulation model to study the performance of solar photovoltaic (PV) modules under Russian weather conditions [
38]. Shin et al. proposed a weather-corrected index, linear regression method, temperature-correction equation, estimation error matrix, clearness index and proposed variable index, a one-class support vector machine (SVM) method, and a kernel technique to classify the fault state and anomaly output power of PV plants [
39]. Phuong Truong et al. presented a method to estimate the yield and analyze the performance of a grid-connected PV system in a MATLAB/Simulink environment for a rooftop PV system and a solar farm [
40]. Dhimish et al. presented degradation rates over a 10-year span for seven different PV systems located in England, Scotland, and Ireland using a power-irradiance technique that compares output measured power with a corresponding irradiance level [
41]. Bansal et al. conducted a long-term performance and degradation study based on IEC standard 61724 guidelines from actual data (incident solar irradiation, ambient and module temperature and generated electricity) with annual linear degradation rates found in the range of 0.9 to 1.1% for normal field modules with no visible degradation and 0.97 to 2.9% for visually degraded modules. Mean and median values were 1.8% and 1.6%, respectively, within a six-year operational period [
42].
Despite the significant progress in the performance analysis of PV systems, there exists ample room for further improvements. It is important to deploy data from grid-connected PV systems to support the creation of tools that either predict the energy generation of PV systems or evaluate their performance. Nowadays, PV systems are equipped with advanced monitoring systems that can collect a variety of useful performance data. There exist specific studies that propose methods to collect and systematically process these data, as in [
43] where a procedure for the automatic transfer of recorded data is described.
In the present paper, actual data collected from grid-connected photovoltaic systems in Central Greece are studied by means of several ANN types and statistical analysis. The objectives of this paper concern the performance evaluation of grid-connected PV systems after several years of operation, assisted by five different ANN types. The novelty of this approach lies in the deployment of actual data for 8 years of operation and the careful selection of inputs for ANN training, taking into account the quality of the atmosphere by use of clearness index and air mass.
The structure of this paper is as follows.
Section 2 presents the methodology, consisting of three base steps: (i) statistical and efficiency observations of available data; (ii) data preprocessing; (iii) statistical and efficiency metrics calculations; and (iv) use of five NNs with five inputs.
Section 3 presents the solar potential of Central Greece, observations of important performance metrics, comparative analysis among the five ANNs, and investigation into the PV systems’ degradation during the 8-year period. The results are analyzed and discussed. Conclusions and proposals for future work are presented in
Section 4.