1. Introduction
The current geopolitical situation with the Russian invasion of Ukraine has triggered the biggest energy crisis in Europe since the 1970s, in addition to humanitarian disasters. European countries are massively dependent on Russian fossil fuels for electricity generation, and the European Commission has envisaged a series of measures to reduce energy dependence by 2030, summarized in the REPowerEU document [
1]. This document contains three main pillars: (1) identification of alternative energy sources, (2) energy saving, and (3) the acceleration of clean and renewable energy.
With the combination of REPowerEU and the European Green Deal [
1], the EU aims to become the first climate-neutral continent before 2050. In this transition, renewable energies (RE) play a central role, and transmission system operators (TSOs), together with electricity suppliers, need to find appropriate approaches to accommodate the increasing share of RE. Dissimilar to nonrenewable energy sources such as gas, nuclear and coal, the production of wind and solar energy depends strongly on the weather, so that the supply can vary substantially at all time scales (from minutes to days and months). This poses an acute challenge to TSOs, who must guarantee a balance between electric demand and scheduled supply.
Consequently, TSOs require accurate weather forecasts to accommodate differences between user demand for electricity net of the distributed solar/wind generation (netload) and the scheduled supply that must be provided by dispatchable production units (netload prediction). These differences are referred to as an “imbalance”. Depending on the accuracy of the netload predictions, the dispatchable generators should be flexible enough to adjust imbalances at operational spatial scales ranging from local to national [
2,
3,
4]. Indeed, the National Integrated Plan for Energy and Climate [
5] explicitly states that with the planned growth of solar/wind generation, “the high quantity of non-programmable renewable sources will force to keep available a significant portion of thermoelectric generation capacity, in order to guarantee the necessary reserve margins for the safe operation of the system”.
For this reason, the management of the current electric systems in Europe requires substantial improvements in scheduling the supply and in their flexibility resources to respond to the growing solar/wind induced imbalances [
6,
7,
8,
9,
10,
11]. Moreover, even if the netload can be predicted more accurately, responding flexibly to them is a challenge in many countries where the national balancing energy market is divided into several regions, with imbalances and flexibility considered separately in each one [
12,
13]. Zhang et al. [
14,
15] and Pierro et al. [
16,
17] highlighted the value of improving the PV forecast in terms of reducing imbalances and reserves and related costs. On the other hand, Kies et al. [
12], Müller et al. [
13] and Pierro et al. [
18,
19] investigated the benefits of removing transmission grid constraints between different market zones to allow for a better share of renewable and flexible resources, as well as expanding the forecast-controlled area to improve the accuracy of regional solar generation forecasts (known as “forecast smoothing effect”).
Weather forecasting based on numerical weather predictions (NWP) models is currently used to predict both load and RE production. However, NWPs are physically based models in which the state of the atmosphere is described using dynamic and thermodynamic variables and laws and are often subject to systematic errors. It may be possible to improve such predictions by introducing bias corrections [
20,
21] or integrating machine learning (ML) [
22,
23].
The current research on PV power forecasting includes a large variety of approaches, which mainly depends on the forecast time interval, on the observed or modelled variables and, finally, on the forecasting approach [
24]. The PV power forecasting approaches are usually classified under two families: physical method and data-driven methods [
25]. Moreover, most of the forecasting models are tested at the single PV plant level [
26], whereas analyses on large-scale areas, such as the one presented here, are less frequent [
27].
When regional PV power forecasting approaches are considered, we found two main classes: (i) the bottom-up approach, where the PV power generation in the area is predicted by summing the PV power computed for each PV site, and (ii) upscaling approaches, which can follow the so-called Models Output Average or Model Inputs Average methods (as classified in [
28]).
In one of the first study on regional forecasting [
29], a uniform distribution and a distribution reflecting the regional fleet regionalization were investigated using the Models Output Average. Lately, in [
30], the same authors scaled up the aggregated output of PV systems, which were first clustered based on their geographical location, to obtain hourly regional PV output estimates.
According to [
31], it was assumed that a short time series of all PV systems was available, from which representative sites were then selected using a combination of k-means clustering and Principal Component Analysis (PCA), the result of which was then mapped to the total electricity generation of the area using a regression method. In [
31,
32], a probabilistic forecasting model was presented, and typical sites were selected based on a stratified random sample of installed capacities and locations. In [
33], four upscaling strategies were proposed to fit different scenarios of data availability and plant information. In [
34], an upscaling approach based on reference systems was compared with satellite-based estimates. The performance improvement that can be achieved by combining the two data sources was analyzed, and the inverse distance weighting (IDW) method was used to estimate the PV performance based on the performances of nearby plants. Interestingly, upscaling gave similar results to the hybrid approach when 10 or more reference plants were used.
The current research has also shown that data-driven methods that directly predict regional solar/wind generation) show the same accuracy in predicting the output of each system in the region. Although upscaling methods greatly reduce data handling and computation efforts and require little information about the regional solar/wind fleet, they are much more complex than models to predict the generation of a simple plant. Regional generation is affected by many sources of uncertainty: a minimum number of locations on which to calculate NWPs, errors in the VRE power estimation (TSO does not meter the regional generation), missing information on installed PV capacity, locations/technologies/orientation of systems, etc. For this reason, upscaling methods, in addition to physical/data-driven models mapping the NWP in solar/wind generation, usually require specific pre/post-processing to achieve acceptable accuracy [
35,
36,
37].
Finally, recent studies demonstrated a further accuracy improvement (both on a single plant [
38,
39]), as well as by different physical/data-driven techniques [
23,
26] (blending models) or using a chain of prediction models (hybrid models) [
35]. The scientific community is devoting a strong effort to improving the solar radiation forecast accuracy using machine learning approaches optimizing the ANN [
40,
41] and LSTM [
42,
43] models with different flavors and approaches.
In addition, the role of renewable energy in the industrial sector will become increasingly important, and the need to integrate different systems and technologies is crucial. The coexistence of hardware and software in power generation must be properly addressed and requires the development of a specific architecture, such as the one proposed in [
44], that can work with power systems, software, Big Data and artificial intelligence algorithms. At the same time, the role of microgrids and distributed generation is becoming increasingly important [
45], and energy routers and the energy internet will play an important role in the design of future energy networks.
In this paper we have developed three upscaling methods to predict the day-ahead regional load, solar and wind generations based on the combination of different data-driven techniques that ingest high-quality post-processed NWP (thus using the most updated forecast methodology). We assess the performance of our models in Italy, which provides a rigorous testing ground because of its broad range of latitudes, heterogeneous geography with mountains and expansive plains and division into different TSO forecast-controlled areas. Moreover, in this case, only a little bit of information was available: the aggregated regional hourly power data and the annual solar/wind installed capacity, so that the regional solar/wind production was treated as if it had been generated from a single virtual plant. These factors make it particularly challenging to predict imbalances and respond flexibly to them. Upscaling methods that perform well in Italy are likely to perform well in other complex environments.
Next, we analyzed how improving one-day weather forecasting can reduce imbalances, decrease flexibility requirements and improve energy distribution in the grid. We developed a methodology to simultaneously quantify the margin of reduction in imbalance volumes and flexibility requirements that can be achieved by increasing the accuracy of forecasting and strengthening the national transmission grid to remove capacity bottlenecks between the TSO forecast-controlled areas (market zones).
Finally, we estimated the levels of imbalance/flexibility/overgeneration that can be achieved in the future according to the load/PV/wind growth scenario developed by the Italian TSO by 2040, highlighting critical issues associated with reaching high solar/wind penetrations.
This methodology depends on the country’s weather conditions and could help TSOs to quantify the maximum reduction in imbalance and flexibility requirements that can be achieved by advanced forecasting and grid strengthening in their specific controlled areas.
Dissimilar to investigations in which the authors consider only the impact of solar forecast errors and uncertainty on the demand supply imbalance and reserves [
14,
15,
16,
17], we provide complete study considering the effects of the entire variable renewable energy (VRE) generation according to the size of the forecast-controlled areas. While authors typically address the problem of strengthening the transmission grid to enable a better share of renewable energy and reducing congestions/curtailment [
12,
13], our work focuses on the benefit of enlarging controlled areas to reduce imbalance and flexibility requirements. The problems associated with PV forecasting at the regional level and the impact of solar forecast smoothing effect resulting from a larger forecast footprint over Italy have already been evaluated in [
35,
36]. In this paper, we extend these results by considering regional load/PV/wind forecasts and evaluate smoothing effects in terms of reducing the imbalance/flexibility. We further expand these results by also considering future VRE penetration levels.
Section 3 presents the Italian regulatory framework to give the reader an overview of the local and national situation.
Section 4 discusses the methodology applied to quantify grid imbalance in terms of volume and cost.
Section 5 describes the forecasting models we applied, including a presentation of the experimental data used, the weather forecast models and the forecast methodologies developed herein.
Section 6 presents the metrics used in the literature to quantify the accuracy of forecasts.
Section 7 lists the results obtained with the models presented in
Section 5 and shows how the imbalance is reduced through better forecasts and grid reinforcement. These results are then extended to future scenarios where the TSO anticipates higher VRE penetration.
Section 8 concludes the paper and presents some issues for further discussion.
2. Regulatory Framework
Until 2020, Italy was divided into six market zones (In 2021, the Italian TSO add another zone; therefore, currently, the country is divided into seven market areas.) defined by the Italian TSO according to national transmission grid constraints that limit the capacity transit between areas (
Figure 1).
In this paper, we adopt the market zones nomenclature used by the Italian TSO: NORD (north zone), CNOR (center–north zone), CSUD (center–south zone), SUD (south zone), SICI (Sicily island) and SARD (Sardinia Island).
The imbalance between demand and supply in each market zones is defined as:
where the netload (or residual load) is the electricity demand net of the solar and wind generation, while the generation scheduled for the next day (that will have to be supplied by dispatchable generators) is the netload day-ahead forecast.
When the imbalance is negative (less scheduled supply than demand), upward regulation services are needed, while, when the imbalance is positive (more scheduled supply than demand), downward regulation services are required.
The imbalance is currently evaluated at each market zone only when the netload is greater than zero. Negative netload values mean VRE overgeneration that must be re-dispatched out of the zone (if transit limits at the point of interconnection allow) or curtailed.
Day-ahead netload forecast is used by the Italian TSO to size the tertiary reserve and ask, one day in advance, to the Balancing Service Providers the bids for load following and unit-commitment ancillary services. The great portion of Tertiary reserve (“Terziara di sostituzione”) is dimensioned by using a symmetric band (for up/downward regulation) evaluated through the 99.7% order quantile of the forecast errors [
47]. Therefore, the maximum system flexibility (Tertiary reserve) required to resolve the imbalance related to incorrect demand scheduling can be defined by:
Evaluation of the imbalance ( occurred over one year.
It is worth remarking that energy to balance the demand–scheduled supply is negotiated (one day ahead) on the Dispatching Services Market (MSD), while the energy to resolve imbalances due to contingency (unpredictable events) is acquired on the real-time Balancing Market (BM). Equations (1) and (3) refer only to energy exchanged on the Dispatching Services Market, and the MSD and flexibility requirements are related only to the flexible capacity need to compensate forecast errors that is approximatively half of the total imbalance volumes.
3. Methodology
The Italian imbalance volume and cost are defined as:
where
is a Boolean function that is equal to one if
or equal to zero otherwise,
and
are the prices on the MSD of the energy for upward and downward regulation and
is the number of hours of the evaluation period (usually months or year). The energy needed to accommodate the imbalances is purchased/sold by the Italian TSO on the energy markets (MSD and MB) using the “pay as bid”, scheme and the costs are mainly borne by the ratepayers.
Nowadays, the Italian imbalance volume and cost result from the sum of the zonal values. From Equations (1) and (3), it is clear that the simplest way to reduce imbalance and flexibility requirements is to improve the forecast accuracy by advanced forecasting methods. This is also the cheapest solution, since it can be done almost with no additional cost. The second way to reduce imbalance and flexibility is to remove the NTG bottleneck between the market zones enlarging the forecast-controlled area to the entire country. This will provide a further improvement of the prediction accuracy due to the “smoothing effect” (i.e., spatial decorrelation of the forecast error). This imbalance/flexibility mitigation measure requires significant investments, but most of them (8.9 billion EUR in the next 10 years [
48]) are already planned, since the reinforcement of the national transmission grid will be essential for the RES transition: (1) enable the better distribution of VRE generation across the country, (2) allow for a better share of flexible resources and more efficient generation, (3) avoid congestions and the curtailment of VRE through redispatch of zonal VRE overgeneration and (4) enable the integration of energy markets.
The objective of this work is to quantify the margin of reduction of the imbalance and flexibility requirements (at current and future VRE penetration levels) that can be achieved in Italy by improving the accuracy of forecasts and enlarging the forecasting footprint to the entire country through NTG-strengthening projects.
For this aim, we adopted the following methodology:
We developed advanced forecasting methods for the day-ahead prediction of load and VRE generation in each market zone. We evaluated our forecasts by comparing with the most used baseline forecasts (persistence) and by comparing the volume and costs of the imbalance obtained from our advanced forecasting methods with those obtained from the Italian TSO during 2016 (test year).
We computed the imbalance/flexibility requirements of 2016 of each market zone and of all the possible areas obtained by unifying two or more adjacent market areas.
We used the load and VRE capacity growth scenarios (
Figure 2) developed by the Italian TSO (according to the National Integrated Climate and Energy plan and “National Trend” European scenario published by ENTSO-E) to assess the order of magnitude of the imbalance/flexibility/overgeneration that could be achieved in the future using our forecasting methods on both individual market areas and a single national market.
For all the analysis and for the implementation of the neural network, we used MATLAB [
49].
Figure 2.
Load and VRE capacity growth NT-Italy scenarios developed by the Italian TSO [
50].
Figure 2.
Load and VRE capacity growth NT-Italy scenarios developed by the Italian TSO [
50].
6. Results
In this section, we present and discuss the results obtained. In
Section 6.1, we evaluate the accuracy of our load/PV/wind forecasts compared to the accuracy of the reference models. In
Section 6.2, we validate our forecasts by comparing the imbalance volume and costs resulting from our forecasts with the real imbalance volume/costs resulting from netload forecast method developed by the Italian TSO. In
Section 6.3, we show the margins for reducing imbalances and flexibility requirements that can be achieved with our netload forecast compared to those achievable with the persistence baseline forecast, both at the market area and country level. In
Section 6.4, we estimate the imbalance volumes/costs/flexibility and VRE overgeneration that may occur in the future according to the NT scenario.
6.1. Day-Ahead Forecast Accuracy Evaluation (Test Year 2016)
6.1.1. PV Forecast Accuracy
Figure 7 reports, for the northern zone, the scatterplots (observations vs. predictions) of the model chains that make up the hybrid method used for solar energy forecast together with the scatterplot of the persistence model.
The deterministic model provides a strong systematic over-prediction that was corrected by post-processing the prediction errors of the 15 days prior to the day to be predicted, resulting in an 80% improvement in accuracy. The ANN model, which considers the non-uniformity of irradiance and PV capacity along the region, reduces the out layers achieving an additional 9% of accuracy improvement.
At the end of the process, we obtain 56% of skill score with respect the persistence prediction.
Table 3 summarizes the main performance indicators of the forecast for the six Italian macro-regions. For each of them, the regional area is given in the first column. For all regions, the correlation is between 0.95 and 0.97. MBE is very small (as expected) ranging between 0.001% and −0.003%. The MAE is between 2.1% and 3.4%. The RMSE of our forecast (ANN hybrid model) is in the interval between 3.0% and 5.0%, while the RMSE of the persistence forecast (PM) is between 6.9% and 9.4%. The values for rhe Skill Score (SS) range from 48.3 to 56.1 for all regions except CNOR, where the SS = 36.3%.
The regional PV forecast accuracy depends on the persistence of the weather conditions and of the size of the forecast-controlled area. The more the irradiance profile does not change from day to day (low day-ahead variability), the lower the RMSE of the persistence prediction and thus the easier the day-ahead PV generation is to predict. Therefore, RMSE of the persistence model gives an indication of weather variability/predictability. On the other hand, increasing the size of the forecast-controlled area, due to the stochastic behavior of weather conditions, the spatial variability is increasing smoothed as well as the forecast errors from one side to the other of the region are more and more decorrelated thus they decrease in spatial average. Therefore, the larger the area controlled by the forecasts, the easier the predictability of the regional PV generation (with smoothed profile) and the more accurate the forecasts are. This is call “forecast smoothing effect”.
Wentz et al. [
58] compared two machine learning methods: Artificial Neural Networks (ANN) and Long-Term Short Memory (LTSM); they found that in very short prediction (1, 15, 60 min) LSTM outperform ANN. Dolara et al. [
59] used an hybrid model and an ANN method for predicting solar radiation and they found that the hybrid method is more accurate than just the ANN even changing some settings in the neural network. A weather classification approach with support vector machine is used in the work of Shi et al. [
60] Their Mean Relative Error (MRE) range from 12.42% to 4.85%, respectively, for cloudy and sunny models, which is definitively higher than our case, probably because of a shorter time series available. Conversely to the above-mentioned results, our approach is based on combining different models, from physical to machine learning, to maximize the ability of each model to improve the accuracy in forecasting solar radiation. This cascade approach results in a decrease of the uncertainties.
From
Table 3, it can be seen that the RMSE of the northern zone (three to four times larger in size than the other regions), is 1/3 lower than the average RMSE of the other zones (3% compared to 4.5% on average) We can also see that with the exception of the CNOR zone, comparing regions of similar size, the lower the RMSE of persistence the lower the RMSE of our prediction (for instance SARD vs. SICI).
Figure 8 show, by way of example, the hourly values of observed and predicted PV for three days in March 2016, the 3rd, 4th and 5th in the north zone. Interestingly, although the three days have different weather conditions (variable, clear and overcast) due to the large size of the region, the PV generation profiles are all very smooth.
6.1.2. Wind Forecast Accuracy
When we take wind energy into account, both the accuracy of the models and the precision decrease significantly comparing to solar prediction. The reasons for this lie in the nature of the source: wind energy production is much more variable and site dependent than solar. Furthermore, wind farms are not as distributed along regions as solar plants, so forecasting wind generation at the regional level would require more precise information on the location of individual plants. Finally, the distribution of wind speed affects production in nonlinear way with a cut-off and cut-in at high and very low speeds while PV generation is approximatively proportional to the incident irradiance. Therefore, small errors in the NWP of wind speed or uncertainty about wind power plant technology could have a much greater impact on the accuracy of wind generation prediction. In this case, forecasting regional wind generation was extremely challenging because we have no information on either the location or technology of the wind farms, but only know the aggregate output and annual installed capacity.
To predict wind power generation, we optimally blend the results of two upscaling methods based on Analog Ensemble (AE) and Artificial Neural Network (ANN) models, respectively.
Figure 9 reports, for the southern zone, the scatterplots (observations vs. predictions) of each step of the forecast methodology along with the scatterplot of the persistence model. The AE and ANN models provides a systematic under-prediction that was corrected by a post-processing procedure. It is worth noting that while post-processing removes bias, it does not necessarily improve accuracy in terms of RMSE; in fact, the RMSE of the post-processed AE output decreases by 2.5%, while the RMSE of the post-processed ANN output increases by 8%. However, unbiased predictions are essential to build an ANN blending model that has the ability to optimize the outliers and good results of each approach. The optimal blending improves the accuracy the AE and ANN upscaling methods of 27% and 19%, respectively, reaching a skill score over the persistence forecast of 53%. The improvement obtained with this blending approach is highly significant, even without considering all variability effects associated with the wind source.
As summarized in
Table 4, the correlation for wind energy is still high, ranging from 0.81 to 0.87 for all regions, MBE ranges from 0.003% to 0.015%, MAE ranges from 6.0% to 8.1%, RMSE ranges from 8.5% to 12.2%, while RMSE of persistence forecast ranges from 18.1% to 22.4%. The values of SS range from 45.4% to 53.8% with an average value of 48.7%.
First, it should be noted that the RMSE of our wind forecast compared with the one of the PV predictions is more than two time higher (9.2% vs. 4.2% in average). On the other hand, the RMSE of the wind persistence prediction is also more than twice as high as the RMSE of the PV persistence (18.2% vs. 8.3%), indicating the higher variability of wind speed than irradiance (as mentioned above).
The fact that the skill scores of the wind and PV are almost equal (on average: 48.7% vs. 48.8%) means that the two forecasting methods have the same forecasting ability, but the greater variability of the resource underlying the generation decreases the accuracy of the forecast.
Secondly, we point out that, due to the stochastic variability of the resource, also predictions of regional wind generation such as those of solar benefits of the “smoothing effect” (increase of forecast accuracy as the size of the region increases). Indeed, also in this case, the RMSE of the wind prediction of the northern zone is less than half of the average RMSE of the forecast of the all-other zones (4% vs. 10.2%).
Figure 10 shows, by way of example, the hourly values of observed and predicted wind generation for seven days of 2016: from 30 April–5 March in the South zone. Looking at both
Figure 8 and
Figure 10, we can see the greater hourly variability of wind generation compared to solar and the greater difficulties in forecasting.
6.1.3. Load Forecast Accuracy
Figure 11 shows the scatterplots (observations against forecasts) of SARIX and ANN predictions, of the ANN blend and of the smart persistence forecast for the Northern zone (which is the one with higher electric demand). It also reports the accuracy and the skill score of the various forecasting methods. We can see the advantage of using the blending method over the individual predictions (11% and 17%) and the improvement in accuracy over the smart persistence prediction (20%).
Table 5 shows, for the six market zones, the values of the accuracy metrics of our ANN blend forecast, the RMSE of the smart persistence prediction and the skill score over this reference model: correlation ranges between 0.92 and 0.97, MBE between −0.004% and 0.003%, MAE between 2.2% and 3.4%, RMSE between 3.2% and 4.9% while RMSE of the smart persistence ranges between 3.8% and 5.9% and SS between 17% and 21.3%. These values do not differ too much between market zones, and the RMSE of the North zone is even higher than the average RMSE of the other regions (4.7% vs. 3.8%). This is because, dissimilar to solar and wind, the load depends mainly on the statistic behavior of consumers therefore the load profiles do not smooth when the forecast-controlled area increases as well as the errors in the prediction are always strongly correlated inside the area so that the forecast accuracy does not decrease with the increasing of forecast footprint.
It is worth noting that the SP statistical model provides a fairly accurate prediction, which is why, compared with the solar and wind skill scores, our forecast has a moderate improvement in accuracy over the baseline forecast (17.8% on average).
Figure 12 shows, by way of example, the time series of the observed/predicted load for the interval between 5 and 9 March 2016.
We also remark that the greatest forecast errors are related to the prediction of vacation days (which differ greatly from those of the previous week), by using a separate forecast method for these days it would surely be possible to greatly improve the load forecast accuracy.
6.2. Netload Forecast Accuracy Assessment and Validation
While we already proved that our PV generation forecast is at the “state of the art” level by comparing to other predictions (on a different test year) and with other results in literature [
35], this is more difficult for wind. The ongoing wind forecast research is focused on the prediction of the generation of a single turbine or farm [
61], thus, obtained accuracy cannot be directly compared the regional ones. Indeed, regional wind forecasts suffer from the lack of geographical/technical information on the wind fleet, but benefit from the smoothing effect. The main effort on this issue, however, is to reduce forecasting errors using different approaches. Louka et al. [
62] and Lima [
63] are both working to improve weather forecasting by applying a Kalman filtering method to both the input data and the systematic errors, resulting in a reduction of the RMSE. The combination of adaptive methods and integrated approaches improves the estimation of wind energy production, as presented by Vaccaro [
64] who combined different data sources using a supervised learning system that merged data from different sources, such as wind speed observations in Italy and models. Similarly, Pang et al. [
65] combined physical models and ANN approaches to achieve an 80% reduction in RMSE, which is higher than our results (about 53% for wind), but for individual power plants rather than a regional assessment as we did. To improve wind power plant load prediction De Giorgi et al. [
66] included more atmospheric variables and different ANNs demonstrating the importance of considering the surface pressure and temperature in the plant site. With a comparable approach Velazquez [
67] demonstrated the importance of wind direction as input of the ANN models to decrease the forecasting error of about 20%.
For load forecasting, there are not standard way for performance evaluation as in solar and wind, therefore accuracy in not normalize by the peak demand, different KPIs or evaluation period are used etc. [
68], so that benchmark our method is very difficult.
For these reasons, we decide to assess the quality of our prediction directly comparing the imbalance volume and cost obtained by our netload forecast with the 2016 values resulting from netload prediction of the Italian TSO (
Figure 13).
Figure 13a contains the values for the total downward and upward imbalance in the six Italian market zones (grey and black histograms, respectively) and the associated downward and upward prices (grey and black dots, respectively) [
69].
Figure 13b shows the same values but applying the advanced forecasting approach we developed. Comparing the two figure an overall decrease in volumes and costs is observed with an annual reduction of 8% and 38%, respectively. It is worth noting that the TSO systematically slightly under-predicts residual demand, as upward regulation volumes are always larger than downward ones, which explains why we achieve 38% cost savings with 8% accuracy improvement (upward energy prices are much higher than downward). To be fair, given that in the real-time (MB) balancing market, the TSO has acquired more downward than upward regulation services, it is likely that the under-prediction is intentionally dictated to purchase in advance more upward reserves on MSD than real-time upward services on MB (which could be much more expensive). However, our prediction methods can be considered “state of the art” and compete in accuracy with current TSO prediction techniques.
In addition,
Figure 13b shows that, if capacity transit constraints between market zones were removed allowing direct forecasting of the entire Italian VRE generation, our forecasting methods would have resulted in a reduction of imbalance volumes and costs by 33% and 54% respectively. Therefore, upgrading the national transmission grid to remove transmission bottlenecks between market areas not only allows current reserves to be used regardless of the area in which they are located (increasing the effectiveness of regulation), but also, due to the VRE “smoothing effect,” greatly reduces the volumes and costs of imbalance.
6.3. Margins of Reduction of Imbalance and Flexibility Requirements Due to Advanced Forecasting and NTG Reinforcement
Figure 14 depicts the improvement our prediction achieves in terms of imbalance and flexibility for all 6 zones, the different combinations of two or more adjacent zones and the whole country. The x-axis shows the area of the different zones in km
2 and to the right of it the total area of Italy. On the y-axis we give the values for imbalance volumes (
Figure 14a) and flexibility requirement (
Figure 14b). Colors indicate the forecast methodology: blue is for the simple persistence model while black is for our advanced forecasting methods. The blue/black stars represent the Italian imbalance volume resulting from the sum of the imbalances obtained from the netload prediction of each market zone.; red star is the Italian imbalance resulting from a single netload prediction at national level.
Figure 14a reveals that, by using the advanced forecast (black star) instead of the baseline forecast (blue star), the Italian imbalance would be reduced by 50%, while an additional 30% reduction could be achieved by removing NTG constraints between market areas (red star).
However, the results reported above show that the TSO has already reached a netload forecast accuracy near to the “state of the art level” (we obtained just 8% of imbalance reduction) while there is more room for imbalance reduction if advanced forecast is coupled with network strengthening measures (33% of imbalance reduction). Indeed, Italian TSO has already planned to enhancing trade between market areas for improve integration of renewable energy sources [
49].
Figure 14b shows that, if the persistence prediction is used, smoothing effect is not enough to reduce flexibility when the forecast-controlled area increases. This means that although annual imbalance volumes decrease as the forecast footprint increases, this is not the case for the maximum forecast error when using very low-quality prediction (as the persistence model). In contrast, our forecast reduces the flexibility requirement by 38% compared to the value obtained through the persistence model, while reinforcement of the NTG could provide an additional 18% reduction.
These results are obviously dependent on regional weather conditions, so this analysis could be repeated for other countries, helping TSOs quantify the imbalance/flexibility benefits that can be achieved through forecast accuracy and transmission grid improvements.
6.4. Future Scenario of RES Penetration
Using the load/VRE hourly profiles of 2016 and our forecasting methods, we estimated the order of magnitude of imbalance volumes/costs/flexibility/overgenerations that could be reached in future according to load/VRE capacities growth scenario developed by the Italian TSO (
Figure 2).
Figure 15 presents the future scenarios for the imbalance volumes (a) and costs (b) from now to 2040. Due to the increasing VRE penetration, the imbalance volume is expected to grow from the 17.7 TWh/y by 2021 to 21.3–24.9 TWh/y by 2030–2040 (+21% and +40%). This increase can be contained if it would be possible to enlarge the forecast footprint to the entire nation. In this case by 2030 we could have an imbalance of 16.3 TWh/y even lower than 2021 value, while by 2040 of 19.8 TWh/y (+12%).
The imbalance costs on the dispatching market (MSD), under the business-as-usual assumption and without considering the price impact of unexpected events such as the current Ukraine war, are expected to increase of 23–43% by 2030–2040 with respect the 1090 million euro calculated for the current year. We estimated that NTG strengthening measures aimed at market integration could reduce MSD imbalance costs (borne by ratepayers) by 350–400 million euros per year and even greater additional savings could come from reducing the cost of real-time balancing (MB) where upward regulation can reach incredibly high prices.
In addition, markets integration leads to a better share of flexible resources present in the country reducing the systems flexibility requirements. Therefore, other economic save arises from reduction of the auctions on the capacity markets that in Italy are indirectly funding the construction of new 42 GW thermoelectric reserves.
Figure 16 shows that, only for balancing residual demand and scheduled supply, an increase of system flexibility requirements of 27–53% will be needed by 2030–2040: from the current 15.6 GW of tertiary reserves to 19.8–24 GW. Market zones integration would almost entirely avoid the need to increase reserves for load-following and unit commitment ancillary services, as we estimate a flexibility requirement of 14.2 GW by 2030 and 16 GW by 2040.
With the growing VRE penetration, the events during which wind and solar produce more than demand increase. Excess energy produced must be re-dispatched outside the market zone or curtailed when transit limits between zones are violated. This means that overgeneration implies reverse power flow events, as well as possible grid congestions.
Figure 17 shows the ratio between VRE production and the demand (VRE hourly penetration) of the southern zone. In 2016, there have already been instances where wind and solar generated more than 100% of demand (mainly during Sundays), but in 2030 and 2040 the number of such events will grow greatly, with VRE generation summer peaks of more than 200% and 300% of demand.
Figure 18a estimates the zonal/national VRE overgenerated energy according to the TSO scenario. It appears that the overproduction of VRE must be redistributed from southern and island areas and, in the absence of grid congestions, can be consumed almost entirely by the rest of the country (line with empty dots indicate that there is no remarkable overgenerated energy at national level).
However,
Figure 18b shows that, in 2028, there is a 0.3 percent probability that South/Sicily/Sardinia will have to redispatch more than 4.8/1.3/1.3 GW, so the transmission limits of 4.6 GW from south to south–central, 1.2 from Sicily to South, and 1.3 from Sardinia to South Central (
Figure 1) will be violated for at least one hour per year. The figure also shows that from 2033 onward there will also be a need to redispatch overgeneration peaks from the North Central area and from Italy to neighboring countries.
It is worth pointing out that, in addition to the increase in reverse power flow events and congestions, overgeneration involves severe ramps (down and up) and the complete shutdown of all dispatchable production units, which is not always possible. For these reasons, some EU country (not Italy) introduced negative energy prices that means that some generators can choose to pay to remain in operation.
To be certain, the impact of overgeneration from VRE on grid management and dispatch costs are still underestimated and relatively understudied. We will go deeply inside this topic in our future study.
7. Summary and Conclusions
In this paper, we have developed three upscaling methods to predict the day-ahead electricity demand and the wind/solar electricity generation of the market areas in which Italy is divided. The methods, which incorporate the latest techniques currently used to achieve high accuracy of VRE generation forecasts at regional level, are based on a chain or optimal mix of forecasts obtained through different pre/post-processing and data-driven models. We have shown that the accuracy of the PV-wind forecasts increases with the size of the region thanks to the smoothing effect of the forecasts, with an RMSE of 4.5–10.4% for the smaller market zones to 3 to 4% for the larger market area. On average, we achieved an improvement in accuracy compared to the persistence model (which was used as a benchmark) of about 50% for both the PV and wind forecasts. Electricity demand, which is mainly determined by the statistical behavior of consumers and depends less on stochastic weather conditions, is easier to forecast than PV and wind generation, but does not benefit from the smoothing effect. Our day-ahead load forecast achieves an RMSE between 3.2% and 4.9%, depending on the market area, with an average accuracy improvement with respect to a smart persistence forecast of 17.8%.
Our advanced forecasts were used to schedule the supply that dispatchable generators must provide to meet the next day’s residual demand. We have shown that the imbalance between residual demand and scheduled- supply resulting from our forecasts is 8% lower than the imbalance resulting from the TSO forecasts.
We then showed how to quantify the margin of imbalance reduction and flexibility requirements that can be achieved by using advanced forecasting techniques and NTG reinforcement with the aim of market zone integration. We found that the Italian imbalance could be reduced by 50% compared to the persistence forecast using our forecasting methods, while an additional 30% reduction could be achieved through the forecast smoothing effect of the VRE generation if NTG constraints between market areas were fully removed. Thus, the integration of the market areas not only leads to a better share of RES and flexible reserve, but also to a direct reduction of the imbalance volume due to the forecast smoothing effect. Furthermore, we have shown that our forecast reduces the flexibility demand by 38% compared to the value determined with the persistence model, while the strengthening of the NTG could allow for an additional reduction of 18%.
Finally, we used our forecasts to estimate the amount of electricity imbalance/cost/flexibility/overproduction that can be achieved in the future by meeting the European Union’s renewable generation targets. We found that the imbalance volumes will increase by 21% and 40% between 2030 and 2040 compared to current levels, while the integration of market areas could allow for a 7% reduction in imbalance by 2030 and a moderate 12% increase by 2040. Similarly, imbalance costs are expected to increase by 23–43% by 2030–2040. However, market integration could reduce imbalance costs (borne by ratepayers) by €350–400 million per year. We have also shown that a 27–53% increase in the tertiary reserve will be required by 2030–2040, but this flexibility growth can be avoided through a unique national balancing zone. At the end, we estimate the zonal/national VRE overgenerated energy, showing that this overproduction needs to be redistributed from the southern areas and islands and can be almost entirely consumed by the rest of the country. However, we have found that, from 2028 onwards, there will be congestion in the transmission grid, which will reduce the possibility of redistributing the overproduction.