1. Introduction
The emergence of new photovoltaic (PV) technologies and the continuous manufacturing efforts to reduce costs have introduced important challenges related to the lifetime power output and the long-term degradation rate () of weathered systems. Rapid technological progress is driven mainly by improvements in PV cell and module efficiencies, reduction in manufacturing costs and advancements in operation and maintenance (O&M) that scope to reduce costs and increase the outdoor operational performance. In this domain, the potential of PV technologies is only increased once the reduction of capital and operational cost reflects also reliable and high performing systems.
The accurate quantification of the gradual power decline of PV systems is of vital importance for accurate predictions of financial returns and the reduction of financial risks. The field exhibited
of PV systems directly affects the energy yield produced and consequently, the levelized cost of electricity (LCoE) and the return of investment (ROI). More specifically, the
is a decisive factor considered during the design phase of a PV system, since the economic viability is based on the capability of high performance over the expected service lifetime. Any
inaccuracies lead directly to increased financial risk [
1]. This renders degradation an important risk factor that affects the investment risk taken, especially, as currently new PV technologies are emerging in the market and the way ageing mechanisms are affected by different climatic conditions is unexplored.
The degradation of PV technologies is associated with different internal or external mechanisms that occur during the lifetime of the system. PV degradation can be categorized into the light induced degradation (LID), potential induced degradation (PID), the weathering due to environmental effects and the ageing due to material deterioration over time. In particular, degradation due to environmental effects and deterioration over time is a result of aged semiconductive material, broken interconnections, discoloration, delamination, corrosion and weathering of the composite materials. During outdoor operation, sunlight degrades cell performance due to the well-known photodegradation boron–oxygen complex [
2]. More specifically, the exhibited degradation of PV modules may differ amongst installations at low and high irradiance levels since the parametric properties of PV cells are affected. More specifically, series resistance increases at high irradiance conditions while the shunt resistance decreases at low irradiance conditions [
3]. Discoloration of the laminating foil and degradation of polymeric materials is another associated effect of sunlight exposure at locations with high ultraviolet (UV) irradiation levels [
4]. Elevated module operating temperature accelerates cumulative thermal degradation processes [
5], while humidity ingress causes electrochemical corrosion [
6]. Similarly, thermal cycles and wind loads stress materials and cause cell and ribbon fatigues. In addition, high dust content in the air, which can be found in dryer climatic areas, has a negative effect on the transparency of the glass as dust accumulates on the surface of the modules and deteriorates the quality of the cover [
7].
Furthermore, degradation mechanisms are important to understand because they may eventually lead to failure [
8]. Typically, a 10% power output decline is considered a failure, but there is no compromise on the definition of failure [
4], because a high-efficiency degraded PV module may still have a higher performance compared to a non-degraded module from a less efficient technology [
3]. In particular, hotspots that are caused by a variety of degrading mechanisms within cells can cause severe and sudden damage to the system [
9,
10,
11]. However, it is difficult and costly to distinguish between the effects of degradation and resulting failures, without specific on-site investigations that assist in drawing conclusions on the exact cause. This is the main reason why most degradation evaluations focus on the entire PV array or system by applying advanced statistical techniques such as time series analytics to monitored datasets, in order to validate the manufacturer’s performance warranties.
PV module manufacturers guarantee a certain degradation level over a period of 25 years based on indoor cycle tests, regardless of the location where the modules will be installed. Even though, the performance warranties provided by most manufacturers are extending over the lifetime period, field experience has shown variations in reported
values, which may be attributed to the limited number of field studies and the applied methodology variability [
12,
13]. Over the past years, concerted research efforts focused on calculating the
of PV systems by applying different statistical techniques to outdoor acquired performance measurements such as ordinary least squares (OLS), classical series decomposition (CSD), year on year (YOY), autoregressive integrated moving average (ARIMA), seasonal and trend decomposition using loess (STL) and robust principal component analysis (RPCA) [
14,
15,
16,
17,
18,
19,
20,
21]. More specifically, the most commonly employed methods relied on the construction of chronological performance rating time series and the application of OLS to extract the trend from the series [
13,
22,
23,
24]. OLS is a simple technique that has been routinely employed to PV performance time series, however, the trend decomposition simplicity renders the technique prone to errors caused by seasonality, non-constant trends, outliers and missing data. To mitigate limitations of simple detrending techniques, more sophisticated methods such as CSD, STL and seasonal ARIMA, have been applied in order to adjust for seasonal variations and calculate the trend [
15,
16,
21,
25,
26]. The seasonal ARIMA method can effectively deal with seasonal variations, random errors, outliers and level shifts and can therefore be used to specify a model that removes all autocorrelations in the model residuals [
27,
28]. Nevertheless, after calculating the trend with these methods another linear regressive step is commonly applied to calculate the slope of the trend, hence the
. Other robust methods to outliers, such as RPCA, were used to calculate the
of PV systems by reducing the dimensions of the dataset of user-defined principal components (PCs). Subsequently, the PCs are used to reconstruct the performance metrics of the first and last year, which by comparison provide an estimate of the
[
14]. Similarly, the YoY method is also used as a powerful and practical central-difference estimator for assessing the median degradation of PV systems by comparing a multitude of trends from month to month over each year [
7].
Even though there are many studies presenting the degradation and performance loss rates of PV systems worldwide, the lack of a generalized and standardized methodology to accurately quantify the by entirely analyzing outdoor acquired and monitored performance datasets, which in most cases are prone to errors and outliers, remains yet an important challenge. In addition, the question related to the impact of climate on the degradation of PV technologies is yet unexplored and necessary in order to yield information to optimize the location-specific technologies that will lead to even higher profitability and the development of a new accelerated indoor test.
The main aim of the work is to present a unified robust methodology for accurately calculating the of PV systems from outdoor monitored measurements and fill-in the gap of knowledge by evaluating the location dependency of the exhibited gradual performance loss. For this purpose, different crystalline silicon (c-Si) grid-connected PV systems of identical technical specifications were installed both at the outdoor test facility (OTF) of the Research Centre for Sustainable Energy FOSS in Nicosia, Cyprus (Köppen–Geiger: Mid-Latitude Steppe and Desert Climate) and the Institute for Photovoltaics at the University of Stuttgart in Stuttgart, Germany (Köppen–Geiger: Marine West Coast Climate). The infrastructure settings provide the perfect opportunity to study the effects of the location and their climatic differences on the degradation. The obtained long-term results, over a 12-year evaluation period, verified the hypothesis that the choice of the analysis technique and data quality is crucial for the accuracy of the results. Additionally, the initial hypothesis that degradation is location dependent was validated, since most of the investigated PV systems at the moderate climatic conditions in Germany exhibited lower yearly degradation compared to the respective systems at the warm climate of Cyprus.
This paper is organized as follows:
Section 2 presents the methodology including the description of the experimental infrastructure and
algorithm. In
Section 3, the results of the yearly
for all systems are analyzed.
Section 4 and
Section 5 present the overall discussion and conclusions, respectively.
2. Methodology
The methodology followed to develop the robust
algorithm and to evaluate the location-dependency of the exhibited degradation of the c-Si PV systems included the experimental setup at both locations, data quality and filtering criteria step, construction of performance rating time series and application of analytical statistical techniques. The accuracy of the devised
algorithm was verified against indoor maximum power determination measurements according to the IEC 61215 [
29], performed during the 8-year of operation at the OTF in Nicosia, Cyprus. Accordingly, to capture the
location-dependency of the installed c-Si systems, a comparative analysis was performed between the results obtained from the investigated climatic conditions. All calculations were implemented in Python as the executing programming language for data inference and time series analytics.
2.1. Experimental Setup
The PV system OTFs in Nicosia, Cyprus and Stuttgart, Germany were commissioned in May 2006 and include, amongst others, 12 grid-connected PV systems of different technologies. The installed fixed-plane PV systems range from monocrystalline silicon (mono-c-Si) and multicrystalline silicon (multi-c-Si), heterojunction with an intrinsic thin layer (HIT), enhanced film growth (EFG), multicrystalline advanced industrial (MAIN) cells to amorphous silicon (a-Si), cadmium telluride (CdTe) and copper indium gallium selenide (CIGS) technologies [
22,
30]. Each system has a nominal capacity of approximately 1 kW
p and is equipped with the same type of single-string inverter installed in close proximity, behind each respective system. The same inverters are used in order to exclude the influence of different maximum power point tracking (MPPT) methods to the DC yield [
31]. The inverters are also properly sized to ensure that the systems are always working at their maximum power point (MPP) [
31]. The systems at both locations are installed in an open-field arrangement at the optimal yearly energy yield inclination angles of 27.5° in Nicosia and 33° in Stuttgart, as shown in
Figure 1.
In this study, the long-term
location-dependency evaluation was analyzed over a 12-year operational period and focused only on the monitored PV systems that did not show incomprehensible behavior, attributed to overall low data availability (system component breakdowns at both the DC and AC side and sensor measurement, acquisition and communication problems). PV system failures that occurred for some of the systems were identified through the regular on-site O&M inspections carried out (visual inspections, thermal imaging and I–V tracing).
Table 1 provides an overview of the installed technologies considered in this study, at both locations.
The performance of the PV systems and the prevailing weather conditions were recorded according to the requirements set by the IEC 61724 [
32], and stored with the use of an advanced data acquisition (DAQ) monitoring platform. The platform is comprised of meteorological and PV operational measuring sensors connected to the central DAQ system. Specifically, the meteorological measurements included the in-plane solar irradiance (
), relative humidity (
), wind direction (
), wind speed (
) and ambient temperature (
) and module temperature (
). The PV system operational measurements include maximum power point (MPP) current (
), voltage (
) and power (
), as measured at the output of the PV array (DC side). AC energy measurements at the output of the inverter were also acquired using an energy meter but are not used in this investigation since the focus is on PV array degradation (DC side). The systems were continuously monitored and high-quality data (at a resolution of a second and recording interval of 1-, 15-, 30- and 60- min average) were acquired since 2006.
All the installed sensors and datalogging devices along with the associated measurement uncertainties (acquired from manufacturer datasheets and calibration files) are listed in
Table 2. More information on the installed acquisition devices and uncertainty sources can be found in [
31].
The temperature of the PV array was measured with a temperature sensor (Pt100) affixed to the back of a module installed at the middle of the array and at the position of the centrally located PV cell. The solar irradiance was measured with a thermopile pyranometer (spectrally flat Class A) with aperture oriented in parallel to the plane of array (POA) with a field of view of 180°. The spectral range of the pyranometer is 310–2800 nm and the expected daily uncertainty is ±2% (for a secondary standard instrument, the expected daily total error is ±2%, described by the World Meteorological Organization, because some response variations cancel out each other if the integration period is long). In practice, as the expected daily uncertainty of the pyranometer is based on a particular daily profile of irradiance, solar path and ambient temperature variations of a particular location, the application of the sensor in other climatic conditions renders the uncertainty of the pyranometer, a function of many variables such as directional errors in zenith and azimuth directions, cosine response, temperature sensitivity and level of irradiance [
31]. Systematic recalibration of the pyranometers installed at both sites was performed as required by the manufacturer and periodic cross-checks against sister sensors (other pyranometers installed in close proximity) were conducted in order to identify out-of-calibration sensors.
Furthermore, the pyranometer was also ventilated and heated to avoid incorrect measurements caused by dew and snow. All PV arrays and the pyranometer were cleaned periodically to minimize any soiling effects (e.g., seasonal cleaning and after dust events or snow accumulation).
2.2. Data Quality Filtering
Before calculating the of each PV system, the quality of the acquired datasets was ascertained against a set of filtering criteria. Specifically, the filtering criteria sequential steps applied to the acquired meteorological and PV operational 1-minute average datasets are listed below:
Maximum and minimum level—data outside predefined maximum and minimum threshold levels were filtered out.
Difference of time steps (Δt)—data with a time resolution step higher or lower than the DAQ system resolution were filtered out.
Repetitive values—data that is consecutively repeated because of DAQ failure were removed.
Offset values—identified sensor offsets were added to the data to correct the values. For parameters where a value should be physically zero during times with irradiation, the offset was automatically calculated with an average over the first 60 minutes of operation of each day. Additionally, a maximum value offset was defined which invalidated all data of the day.
Timing—isolated datasets without any surrounding data values were filtered out.
Based on the availability after this initial filter step, the entire dataset acquired over a daily period was considered valid or invalid.
To further examine the acquired data quality, feature correlations were performed. In particular, the correlations used were based on the power ratio of the measured DC power (
) and the maximum temperature corrected theoretical power (
), given by:
where
is the in-plane irradiance,
is the overall array area,
is the rated array efficiency and
is the power rating temperature adjustment factor as follows [
32]:
where
is the relative maximum power temperature coefficient (in units of °C
−1) and
is the module temperature (in °C) in time interval
.
Furthermore, the rated array efficiency,
is calculated by [
32]:
where
is the nominal power and
is the reference global solar irradiance of 1000 W/m
2.
An exemplary correlation between the power ratio and the solar irradiance is shown in
Figure 2. It is obvious from the plot that a lot of datasets are not aligned to the theoretical possible optimum. For an accurate analysis of the
, all erroneous dataset points, which may have occurred due to shading, snow and overall outliers, can be filtered by only considering data points around the theoretical optimum and set threshold levels. In particular, to account for measurement uncertainties, the threshold acceptable window was set lower than
and higher than
.
Additionally, a second irradiance filter was introduced after the initial data quality filtering step, in order to filter out data occurring lower than . This criterion was set to reduce the size of the data while at the same time ensure that no useful performance information is lost.
2.3. Performance Ratio
The performance metric adopted in this study was the performance ratio,
, because of its increasing use to characterize system performance and due to the removal of the most important first-order driver of system output, namely, the incident POA irradiance. This performance metric is largely independent of the incident solar irradiance on the PV system and for this reason it is widely used to compare the performance of PV systems at different locations, focusing only on the overall loss factors. In this context, the
PR metric was used in order to fully capture all environmental loss factors and PV system efficiency losses over a predefined recording period. The IEC-61724:2017 [
32], defines different ways to calculate the
. In this work, the
is calculated as follows [
32]:
where
is the measured DC power output per time step,
is the time step,
is the nominal power,
is the in-plane solar irradiance per time step and
is the reference solar irradiance of 1000 W/m
2.
Even though the can be calculated at different time resolutions (daily, weekly or monthly), for this investigation the monthly resolution was selected since it provides the necessary accuracy without adding too much noise compared to a daily resolution. For each PV system the 1-minute resolution datasets extracted from the data quality and filtering step were used to construct monthly time series over the 12-year evaluation period.
2.4. Time Series Reconstruction and Imputation
A data detection routine was subsequently applied to the constructed monthly
time series of each PV system, in order to identify missing values and gap periods attributed to sensor and DAQ system failures. The detected missing values and gaps were treated by either applying linear interpolation in the case that the monthly
gap occurred during the first year whereas in the case that the monthly
gap occurred after the first year, the missing value was calculated based on a naïve method that considered the value as being the same as the respective monthly
value of the previous year. Furthermore, if a gap was detected in the subsequent years, then the historic monthly mean of the previous respective months of the 3 preceding years was used to fill the missing gap. A typical example time series reconstruction and imputation algorithm application to reconstruct missing monthly
values is depicted in
Figure 3. The green plot shows the values used (preceding 3-year monthly
values were used) to calculate the reconstructed datapoints. The orange plot shows the reconstructed monthly
values that were in close agreement to the actual data.
2.5. Time Series Analytical Techniques
In the previous steps, the monthly time series of each PV system were constructed and data quality validated, in order to be used for the analysis. In this part of the process, the yearly of each investigated PV system were calculated by employing data-driven statistical time series techniques, namely the OLS, CSD, STL, ARIMA, RCPA and YOY.
2.5.1. Ordinary Least Squares
The most widely adopted time series analytical typical technique to calculate the
is OLS. The linear regression uses the least-squares method, which minimizes the distance between each datapoint and an assumed straight line (residual). This technique was applied to the monthly
time series of each system according to:
where
is the slope of the linear fit and
is the y axis intercept.
The absolute yearly
was calculated for each PV system using the gradient of the obtained linear fit and multiplying it by the number of months in a year [
16].
2.5.2. Classical Series Decomposition
Time series data can be split into the trend, which shows the basic direction of the analyzed data from which basically the can be derived from, the seasonality factor (in PV, this can be induced due to external factors such as weather and can be observed in fluctuations of the in winter and summer) and the noise or irregular part.
CSD is a time series decomposition technique that is commonly used to extract the underlying trend from the series with the application of a centered moving average [
33,
34]. A two-step centered moving average is used in order to obtain the trend from the time series. Furthermore, for a
moving average, the centred moving average at time
used to find the trend at time
,
, is as follows:
where
is the moving average order and
is equal to the half-width of a moving average.
In this study, this technique was applied to the monthly time series of the PV systems in order to extract the trend, the seasonality and the irregular component. The order of the moving average for each investigated PV system was equal to 12 because of the number of months per year.
Finally, linear regression was applied to the extracted trend in order to calculate the yearly .
2.5.3. Seasonal and Trend Decomposition Using Loess
Similarly to CSD, the STL method decomposes a seasonal time-series into the following components: seasonal, trend and remainder [
33], by performing an inner and an outer loop. The trend and the seasonal components are updated when a run in the inner loop takes place, which are equal to 1 or 2. The outer loop has an inner loop and it is followed by an estimation of accurate weights that works as an input for the next inner loop in order to decrease the impact of transient, abnormal behavior on the trend and seasonal parts [
34].
2.5.4. Autoregressive Integrated Moving Average
The ARIMA method is a generalization of the autoregressive moving average (ARMA) model and consists of two parts, the autoregressive (AR) and the weighted moving average (MA). Both models can be used to better understand the time series data and forecast future developments of data. The integrated part (I), which differs the ARMA model from the ARIMA model, means that an initial differentiating step can be applied one or more times to remove the present non-stationary. The non-stationary share describes the searched as a trend.
ARIMA models essentially consist of 3 main parts. In the first part, certain properties of the time series are characterized using autoregressive approaches. Subsequently, these properties are used in the second part to estimate the seasonality. In the end, the third part is the diagnostic testing performed in order to validate the results.
For this investigation, after the ARIMA model was constructed and the trend was successfully calculated, alike the degradation calculation of CSD and STL, a linear regression was applied and the yearly was calculated for each PV system by using the gradient obtained from each fit.
2.5.5. Robust Principal Component Analysis
RPCA is a method that reduces the dimensionality of multivariate data by creating an optimal superposition of the original variables with respect to the Euclidean norm. This is a linear algebraic tool that describes the geometrical characteristics of multivariate statistical data. The original data matrix is called
and it can be expressed as follows:
where
is a low rank data matrix that indicates the number of characteristic features dominating the
matrix and
is the perturbation matrix that causes the outliers [
14].
RPCA reduces the dimensions of the time-series into the PCs, which were then used to calculate the performance metrics of the first and last year of the evaluation period. The low-rank matrix () could then be used for further calculations of the and the perturbation matrix () could be disregarded.
In the scope of calculating the yearly
of PV systems, the low-rank matrix contains the
of each year in each column of the matrix. The yearly
was calculated by comparing the change of the integral of the first and last year monthly
. More specifically, the integrals describe the area of the respective year one
and the last year
. The change of both areas was then divided by the reference area of the first year
and the number of considered years
, to yield the
:
2.5.6. Year-on-Year
The YoY technique was proposed by the National Renewable Energy Laboratory (NREL) of the U.S. Department of Energy [
7] as a comparative data point method between subsequent years, where the
was calculated as a comparison of the data points of the same month, week or day. This procedure was then repeated for every data point of subsequent years.
After each individual was calculated, the YoY technique appended these values in a list of successive data points from one date to the same date in the following year. The list obtained by this procedure was then sorted in either an ascending or descending order.
Finally, the sorted list was used to obtain the median value, which represents the of the investigated PV system. By using the median value over the mean, the YoY was more robust (less prone) to outliers in comparison to the mean value. The median value was used as the yearly for the investigated PV systems in this analysis.
2.6. Long-Term Degradation Components
In general, the long-term
calculated using techniques applied to constructed performance time series depends on the quality of the components that affect degradation (degradation of PV modules and inverter), maintenance intensity from one year to another (soiling, snow, shading and failure repair duration), variation of climate condition change from one year to another (solar irradiation and ambient temperature) and component failures (PV module hot-spots, cracks, corrosion, delamination, glass breakage, cell interconnection breakage, diode failures and balance of system and inverter component failures). In this study, all necessary measures were taken in order to specifically exclude the influence of maintenance intensity and yearly irradiation variation. In more detail, the
part due to the maintenance intensity was minimized since an initial step before calculating the long-term
was to discard all data that were affected from soiling, shading and failure repairs (initially applied data quality filtering stage). According to a recent survey, the overall yearly soiling loss for modules installed at a tilt angle >15° for the Köppen–Geiger classification of the PV systems installed in Nicosia, Cyprus and Stuttgart, Germany is 4% and 0.5%, respectively [
35]. Assuming therefore that the arrays were only cleaned once a year and by rain, this would have reflected to a maximum soiling contribution of 0.33%/year and 0.04%/year to the long-term
for the systems in Cyprus and Stuttgart, respectively. However, this effect of dust and also snow was mitigated since the arrays were periodically cleaned (seasonal cleaning and after dust events or snow accumulation every year for this investigation).
The impact of climate variations to the long-term was investigated by analyzing the exhibited variation of both the in-plane global irradiation and module temperature of each system over the 12-year evaluation period. Specifically, the analysis included the calculation of the yearly variation of the measured in-plane global irradiation and module temperature at both locations and translating the obtained values to performance loss rate components that could be mapped to the .
Finally, the impact of component failures to the
, was further analyzed since two PV systems (sw165Ni multi-c-Si and bp185Ni mono-c-Si) exhibited failures that were detected through the regular on-site O&M inspections (visual inspections, thermal imaging and I–V tracing). In particular, a hot-spot at a module of the sw165Ni multi-c-Si and the bp185Ni mono-c-Si PV systems was detected in the 4th (September 2010, month 52) and 7th year of operation (June 2013, month 85), respectively. According to a review study on the failures of PV modules performed by the International Energy Agency (IEA) photovoltaic power systems (PVPS) Taskforce 13, the power loss category for a hot-spot failure is A, which denotes power loss below the detection limit of <3% [
36]. The impact assessment of this failure on the long-term
for these systems was calculated using the approach presented in [
37]. First, the
of a specific module with failure
(
) was calculated by [
37]:
where
is the power loss of the specific module given in percentage,
is the date of failure documentation of dataset
and
is the date of system start of dataset
.
The equation for the
failure impact of the whole system (
) is given by [
37]:
where
is the percentage of the system being affected by the failure and
is the investigated system part in percent from the total nominal system power
.
3. Results
3.1. Data Quality Filtering and Time Series Reconstruction
The
location-dependency comparison of both locations is of interest due to their differences in the climatic conditions. The yearly global solar irradiation in Nicosia, Cyprus (2000 kWh/m
2) was approximately 70% higher compared to Stuttgart, Germany (1100 kWh/m
2). Furthermore, the higher irradiation was clearly reflected to the amount of sun hours per year, with 3310 h in Nicosia compared to 1800 h in Stuttgart. The mean yearly ambient temperature in Nicosia, Cyprus was 19.7 °C, which is 110% higher compared to Stuttgart, Germany at 9.4 °C [
38,
39]. The exhibited monthly total irradiation at the POA at both locations over the 12-year evaluation period is provided in
Figure 4. The irradiation plots at both locations exhibited a monotonic trend behavior.
The initial application of the proposed data quality and filtering step to the acquired 1-minute datasets indicated that some of the installed sensors exhibited abnormal behavior (sensor drifts prior to a complete sensor failure, which resulted in erroneous data and outage periods) after several years of outdoor operation, at both location sites.
Figure 5 shows a typical example (ase170St multi-c-Si PV system installed in Stuttgart, Germany) of the acquired 1-minute datasets monthly data availability, after the application of the initial data quality and irradiance filtering steps. Based on the on-site outage log records, the specific system had a measurement sensor failure throughout the third, fourth and fifth operational years. The plots clearly demonstrated that data was available for this system apart from the period between months 32 and 60 (no data availability is represented as zero). Additionally, the application of both filtering steps reduced the percentile availability of the winter season datasets by a higher magnitude compared to the summer due to the high amount of low-irradiance datapoints present during the winter and prolonged night-time. Specifically, the application of the second step irradiance filter further reduced the data by approximately 20%. The fact that a large share of data was filtered out from the initial 12-year dataset provided evidence that the data quality and filtering step is necessary in order to ensure data fidelity for the subsequent analysis.
As observed in
Figure 6, the missing dataset gaps were successfully imputed and the monthly
time series was reconstructed for months 32–60, using the time series reconstruction algorithm. In particular, the data gaps due to the measurement equipment failure were reconstructed with accurate estimates, since the estimated monthly
values were comparable to exhibited values of the prior and posterior years. The imputation results further indicated that 3 consecutive prior years of data were adequate to fill the gap and estimate the
for the ase170St multi-c-Si PV system in Stuttgart, Germany.
The robustness of the time series reconstruction and imputation algorithm was further verified by applying the algorithm to reconstruct the time monthly
series of the multi-c-Si sw165St PV system in Stuttgart, Germany.
Figure 7 shows that the reconstruction algorithm was able to impute the time series missing gaps and sensor drifts identified for months 50–120, by utilizing the prior 3-year values.
Even though in some cases the datasets exhibited huge gaps, the filling gaps algorithm was able to reconstruct the missing values in a very reliable manner. This was proven by comparing the predicted filled gaps performance ratio and the performance ratio, which could be calculated after the sensor change. This led to the finding that the filling gaps algorithm worked properly, even for gaps of up to 4 years. Specifically, filling data gaps of up to 4 years proved to be possible in case the monthly datapoints of the starting years was of high quality and could be proven with ending years of high quality. Longer data gaps led to an overestimation of the performance ratio. Hence, ensuring high measurement quality in the first 3–5 years is of great importance to calculate the degradation of the PV power system.
3.2. Degradation Rates of PV Systems
The techniques of OLS, CSD, STL, ARIMA, RPCA and YoY were applied to the constructed and data quality verified monthly
time series of the investigated PV systems at both locations, in order to calculate the yearly
. The seasonal effects were considered by setting the
to begin from year 3 and extend towards the last year of the investigation. Additionally, the accuracy of each time series analytical technique to calculate the yearly
was benchmarked against the obtained
obtained during the 8th year using indoor standardized test procedures and a solar simulator to determine the maximum power of each system as per IEC 61215 [
29]. The obtained
was assumed to exhibit a linear behaviour and constant over time.
Figure 8 shows the course of the yearly
of the multi-c-Si system ase170Ni in Nicosia, Cyprus calculated with all the time series analytical techniques over the 12-year period. The results demonstrated that all the applied techniques exhibit similar performance since the maximum absolute difference between the obtained results for each investigated duration was <1%/year. Almost all techniques appeared to converge after 7 years of analysis, since the maximum absolute difference between the
values was 0.2%/year. Another important outcome was obtained from the comparative 8-year
analysis obtained indoors at STC and the time series techniques. The obtained results exhibited close agreement and overlap, providing scientific evidence that the adoption of time series technique is only crucial if the quality of data is low, otherwise simple methods such as the OLS yield accurate results and are applicable for
studies. The fact that less sophisticated
techniques (OLS, CSD and STL) exhibited close agreement to the indoor standardized test procedure results renders these techniques applicable for
studies and favourable over other elaborate and computer intensive techniques.
Furthermore,
Figure 9 shows the box-plot of the calculated yearly
over the 12-year evaluation period for the ase170Ni multi-c-Si PV system. The median value is displayed with an orange horizontal bar while the mean is depicted as a green triangle. The results indicated that for the high quality, filtered and reconstructed time series of this investigation, the adoption of which time series analytical technique is irrelevant and the results will be similar, since the standard deviation of the results (width of each box) is narrow.
The long-term yearly
of all the investigated c-Si PV systems at both locations over the 12-year evaluation period are presented in
Figure 10. Overall, the results demonstrated that the a170 mono-c-Si system and all multi-c-Si systems (ase170, ase250 and sw165) presented higher
in Nicosia, Cyprus (>0.1%/year absolute difference) compared to the results obtained in Stuttgart, Germany for all the applied time series de-trending techniques (OLS, CSD, ARIMA and STL). The only technique that showed a higher
in Stuttgart, Germany for systems a170 mono-c-Si and ase170 multi-c-Si was YoY. The YoY technique yielded the lowest
values for almost all of the PV systems. In general, the YoY technique is more prone to erroneous data due to the fact that erroneous monthly
shifts the median. The reason why PV systems such as the sw165Ni multi-c-Si PV system showed wider variations in the exhibited results specifically the YoY technique is attributed to failures such as hot spots that deteriorated in a non-linear manner the performance of the system. Accordingly, the application of RPCA resulted in most cases to higher
values compared to the other applied time series analytical techniques. The RPCA technique is also prone to erroneous data of the first and last year as result of only using the data of those years. Hence, if the data of the last year is affected by a sensor drift or erroneous data, then the
will differ from the
of other methods that also take all the other years into account.
Additionally, the only system with higher in Stuttgart, Germany compared to Nicosia, Cyprus is the bp185 mono-c-Si (0.1%/year absolute difference) for all the time series detrending techniques. The hip205 mono-c-Si system yielded similar results for both locations (<0.1%/year absolute difference).
3.3. Long-Term Degradation Location Dependency
To evaluate the location-dependency of the exhibited degradation, a box-plot of the yearly
over the 12-year evaluation period for each PV system at both locations was constructed and depicted in
Figure 11. The box-plot demonstrates that the median
value, displayed in orange color, of most of the investigated PV systems was slightly higher in Nicosia, Cyprus compared to Stuttgart, Germany. Along this context, an important outcome from this plot was that the calculated degradation for most of the PV systems installed at the warm climate of Nicosia, Cyprus was higher compared to the respective systems at the moderate climate of Stuttgart, Germany. More specifically, by comparing the median value of the obtained results from the applied techniques, all the multi-c-Si technologies (ase170, ase250 and sw165) exhibited higher
(>0.1%/year absolute difference) in Nicosia, Cyprus compared to Stuttgart, Germany. This demonstrated that the
of the investigated multi-c-Si technologies was affected by the installation location. For the mono-c-Si technologies, the a170 system showed a higher degradation in Nicosia, Cyprus compared to Stuttgart, Germany (>0.1%/year absolute difference), while the bp185 mono-c-Si system showed a higher degradation in Stuttgart, Germany (>0.1%/year absolute difference). The hip205 mono-c-Si system exhibited similar results since the absolute difference of median values of both locations were <0.1%/year.
In summary, the obtained results showed that the of the investigated multi-c-Si systems was location dependent since higher absolute differences in the range of 0.1–0.4%/year, were obtained compared to the respective systems in Stuttgart, Germany. For the mono-c-Si systems the results demonstrated that the location-dependency was also affected by the manufacturing technology.
For all the investigated PV systems in this study, apart from the sw165Ni multi-c-Si and the bp185Ni mono-c-Si PV systems, which exhibited failures, the calculated long-term was mainly attributed to the degradation of the system components (ageing due to material deterioration over time, light induced degradation (LID), potential induced degradation (PID) and weathering due to environmental effects). With respect to the impact of the yearly irradiation variation to the long-term , the analysis showed that in Nicosia, Cyprus and Stuttgart, Germany the yearly in-plane global irradiation variation was 99 kWh/m2/year (5% when normalized to the average yearly irradiation of the 12-year period, 1994 kWh/m2) and 95 kWh/m2/year (7% when normalized to the average yearly irradiation of the 12-year period, 1267 kWh/m2), respectively. However, in this investigation the was calculated using the as the time series metric and this provides the advantage that the performance values were normalized to the irradiance, hence not affected by any variation in the monthly or yearly irradiation. Similarly, the variation of the yearly average module temperature on the of each PV system demonstrated that in Nicosia, Cyprus and Stuttgart, Germany the highest yearly module temperature variation was 0.55 °C and 0.84 °C, respectively. These values were then multiplied by the power temperature coefficient of each technology and subsequently divided by the evaluation period (12 years) in order to yield a maximum performance loss rate thermal variation component affecting the , of 0.02%/year and 0.03%/year for the systems in Cyprus and Germany, respectively.
Furthermore, the part due to maintenance intensity from one year to another was minimized in this investigation since before calculating the long-term , an initial step taken was to discard through the filtering data quality stage the effect of soiling, shading and failure repairs. Additionally, the PV arrays were periodically cleaned (seasonal cleaning) and after dust events or snow accumulation in order to minimize these effects.
Finally, regarding, the hot-spot failures detected for the sw165Ni multi-c-Si and the bp185Ni mono-c-Si PV systems, the analysis showed that the impact of these failures on the long-term is 0.13%/year and 0.08%/year, respectively.
5. Conclusions
This paper presents the analysis performed to implement a unified methodology for accurately calculating the of grid-connected PV systems and provides evidence of the location-dependency. The method included the application of data inference and time series analytics to high-resolution 1-minute datasets acquired from different c-Si PV systems, installed and monitored at two different climatic locations (warm climate of Nicosia, Cyprus and moderate climate in Stuttgart, Germany).
The obtained results demonstrated that the application of the data quality and filtering steps ensured data fidelity and proved to be necessary since the results obtained from the inferred data were comparable to indoor applied standardized procedures.
Additionally, the adoption of time series technique proved insignificant once the underlying dataset was of high fidelity. More specifically, the yearly results showed minor differences for almost all techniques, after converging in a time frame of 7 years. This established less sophisticated techniques (OLS, CSD and STL) applicable for studies.
Furthermore, the long-term comparative analysis showed that the multi-c-Si PV systems installed at the warm climatic conditions in Nicosia, Cyprus were degrading at a higher rate compared to their counterparts at the moderate climate of Stuttgart, Germany (absolute difference in the yearly in the range of 0.1%/year to 0.4%/year). This provided adequate evidence and verified the initial hypothesis that degradation mechanisms are location dependent for the investigated multi-c-Si technologies. For the mono-c-Si systems the results demonstrated that the location-dependency was also affected by the manufacturing technology.
Finally, the manufacturers of the PV modules planned a degradation of around 1.0%/year. This value was almost always undershot by all modules, which showed a lower degradation during the considered time period. Hence, a lifetime extension of these modules can be taken into consideration and might also be applicable for current technologies.