Next Article in Journal
Contribution of the 2010 Maule Megathrust Earthquake to the Heat Flow at the Peru-Chile Trench
Next Article in Special Issue
Discussion of Wind Turbine Performance Based on SCADA Data and Multiple Test Case Analysis
Previous Article in Journal
Evaluation of the Communication Delay in a Hybrid Real-Time Simulator for Weak Grids
Previous Article in Special Issue
Global vs. Local Models for Short-Term Electricity Demand Prediction in a Residential/Lodging Scenario
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Digital Twin Approach for Improving Estimation Accuracy in Dynamic Thermal Rating of Transmission Lines

by
Gian Marco Paldino
1,*,
Fabrizio De Caro
2,
Jacopo De Stefani
1,3,
Alfredo Vaccaro
2,
Domenico Villacci
4 and
Gianluca Bontempi
1
1
Machine Learning Group, Université Libre de Bruxelles, 1050 Bruxelles, Belgium
2
Dipartimento di Ingegneria, Università degli Studi del Sannio, 82100 Benevento, Italy
3
Department Engineering Systems and Services, Faculty Technology Policy and Management, Delft University of Technology, Jaffalaan 5, 2628 BX Delft, The Netherlands
4
Dipartimento di Ingegneria Industriale, Università degli Studi di Napoli Federico II, 80125 Napoli, Italy
*
Author to whom correspondence should be addressed.
Energies 2022, 15(6), 2254; https://doi.org/10.3390/en15062254
Submission received: 31 January 2022 / Revised: 4 March 2022 / Accepted: 11 March 2022 / Published: 19 March 2022

Abstract

:
The limitation of transmission lines thermal capacity plays a crucial role in the safety and reliability of power systems. Dynamic thermal line rating approaches aim to estimate the transmission line’s temperature and assess its compliance with the limitations above. Existing physics-based standards estimate the temperature based on environment and line conditions measured by several sensors. This manuscript shows that estimation accuracy can be improved by adopting a data-driven Digital Twin approach. The proposed method exploits machine learning by learning the input–output relation between the physical sensors data and the actual conductor temperature, serving as a digital equivalent to physics-based standards. An experimental assessment on real data, comparing the proposed approach with the IEEE 738 standard, shows a reduction of 60% of the Root Mean Squared Error and a decrease in the maximum estimation error from above 10 °C to below 7 °C. These preliminary results suggest that the Digital Twin provides more accurate and robust estimations, serving as a complement, or a potential alternative, to traditional methods.

1. Introduction

The transition from fossil fuels to green energy is facilitated by the current technological advancements, which increase the efficiency of clean generators, favoring their large-scale diffusion [1].
Power systems, traditionally built to handle passive and controllable loads, are now put to the test by variable power generators, such as wind, whose profile prediction is still challenging. A large amount of renewable energy generation can lead to high transmitted powers on the lines, causing them to operate near or beyond established loadability margins. Therefore, it is necessary to revise traditional power line operational policies to maintain optimal line management. Transmission lines are characterized by thermal limits expressing their maximum operating temperature. The thermal limits directly affect the load of such lines, playing a crucial role in determining the maximum current intensity of the conductor.
Static Line Rating (SLR) aims at computing the maximum current capacity of the conductor, providing a single, fixed value representative of the worst-case scenario. This leads to conservative operation conditions of the lines, limiting the full employment of line capacity over time.
Differently, dynamic thermal line rating (DTLR) introduces the possibility of adapting the capacity of the transmission line dynamically. The significant advantages of DTLR over SLR include a possible reduction in line congestion caused by the static thermal limit, as well as easier integration of renewable energy sources. The Transmission System Operator (TSO) can adapt the current capacity based on the impact of the current atmospheric condition on the conductor temperature.
Nevertheless, the temperature could vary along bare overhead lines, yielding the necessity of repeating the temperature estimation for several points of the lines. This is particularly relevant when the line covers weather-varying distances of dozens of kilometers. The most relevant temperature among these repeated measurements is the highest, closest to the conductor thermal limit. The necessity of repeated measures significantly increases the costs of sensors, labor, and maintenance.
DTLR is an essential topic in power systems where the research community has produced many contributions. In this review, the DTLR was analyzed by grouping the manuscripts according to existing methods to perform DTLR. Mainly, DTLR includes many tasks such as: (i) estimation of the load capability curve given the current weather conditions, (ii) monitoring of the temperature conductor over the line, (iii) short-term forecasting of conductor temperature to assess transient emergency rating, which is currently an ongoing area of research [2]. Existing approaches for conductor temperature estimation belong to two main categories:
  • Direct methods are characterized by the measurement, through sensors, of the conductor temperature or typical characteristics related to it, such as sag, voltage, ground clearance, and mechanical stress [3]. The temperature measurement is often obtained employing expensive devices applied to a single point of the line [4], to measure the conductor surface temperature, which may differ from its core temperature. A typical direct method is the adoption of the Power Donut™ sensor to monitor the conductor temperature, current, and vibration in a single point of the line [3]. They are considered expensive [5] and measure surface temperature.
  • Indirect methods estimate the conductor temperature without actual measurement. For example, some applications estimate the average line conductor temperature from the Phasor measurement units using line parameters estimation [6]. Differently, the approaches estimating the point line conductor usually rely on real-time measurements of atmospheric conditions [7,8] around the conductor, such as air temperature and sun irradiance, and estimate the conductor temperature by solving an energy balance equation. A widely employed indirect method for estimating the conductor temperature of bare overhead lines given the weather conditions is provided by the IEEE 738 standard [9], presented in Section 2.1.
Since the IEEE 738 indirect approach may fail because of sensor measurements errors, recent research focused on alternative methods. For instance, Refs. [10,11] proposed a Recurrent Neural Network using only temperature and line current. Unfortunately, these approaches have two main limitations: first, they still need the conductor temperature sensor; second, they neglect the weather condition, which may considerably affect the line conductor.
Reference [12] proposes the adoption of Multi-Layer-Perceptron Network to map the relationship between historical samples of conductor and ambient temperature as inputs and convection cooling factor and conductor heat capacitance as outputs, followed by a Parameter Estimation Tester (PET) that maps the network outputs to the conductor temperature and its derivative. However, the IEEE 738 standard is used to prepare the training data, therefore the mapping suffers from sudden changes of input parameters, such as the sun irradiance, which is shown to be a frequent reason for IEEE 738 critical errors (Section 4.3). Furthermore, the experimental comparison is made with the IEEE 738 estimation, therefore not reflecting the ability of the model to follow the real conductor temperature.
Authors in [10] use an Echo State Network (ESN), a novel recurrent neural network (RNN), to learn the non-linear overhead conductor thermal dynamics, and their results show an encouraging match between the ESN and the IEEE 738 model under similar weather conditions. Nevertheless, the match with IEEE 738 does not necessarily mean that the model can follow the actual conductor temperature since both could be inaccurate.
Recent works focus on the security aspect of dynamic thermal rating by proposing specific deep learning architecture with customized cost functions to consider the DTLR security based on the required probability of exceedance [13], as well as DTLR variations improved with current ratio of negative and positive sequences, and voltage criterion for successfully differentiating between faults and unsafe and safe overloading [14].
Finally, the adoption of machine learning techniques has emerged as a key driver for the development of Industry 4.0. Relevant results include the exploitation of sensors data for analysis, monitoring, and security purposes [15], as well as energy management for smart buildings [16].
With these premises and at the best of the authors’ knowledge, it seems that state-of-the-art approaches do not address the most crucial issues: (i) to estimate the conductor temperature without the deployment of a temperature sensor, whose installation and maintenance require interrupting the line operation, and (ii) compensate for diverging temperature estimation over time, which usually affects methods that do not use the actual temperature but rely on the IEEE 738 standard for the training data.
For this reason, the TSOs are interested in deploying a limited number of sensors (only for the weather conditions and accessible without interrupting the line operation) and in the development of low-computational burden models, which can be processed by local-processing units in a network of cooperative sensors.
The advantage of this manuscript is the adoption of real, measured conductor temperature for the mapping, which, therefore, shows a reduction in sensitivity to sudden changes of the input parameters typical of the IEEE 738 standard using machine learning methodologies, which are suitable for the architecture described above.

1.1. Digital Twin

A digital twin (DT) is a virtual model of a physical system characterized by seamless integration between the cyber and physical spaces [17]. Seamless integration is achieved by integrating the physical and virtual data within the DTs [18]. DT approaches are becoming more and more pervasive in different power system research areas: Power Grid Online Analysis [19], where a DT can mirror a large scale power grid in real-time with only a sub-second delay; substations virtualization [20] where a “dynamic connection, two-way transmission” relationship is established between a substation and its DT; DT are receiving increasing attention also from the industrial side: patents have been granted for a system to monitor the state of turbines of a wind farm [21] and for a DT-based cooling process of a power system [22]. However, in the context of DTLR, the notion of DT appears, to the best of the authors’ knowledge, to be still novel.
Machine learning (ML) is often at the core of DT implementations [23,24] because of its capacity to extract meaningful relations from data. Once trained, a ML model can act as an equivalent of a physical system: given a specific input, it produces a corresponding output that ideally coincides with the physical system output. In this sense, a DT can adopt ML to model reality; and a standard ML model can be the enabler of the DT.
A further significant difference lies in the connection with the physical world: while the ML model does not have to interact with the modeled object, a DT does. Its integration with reality is bi-directional and automatic: any change in the physical world is reflected in the DT, and the enhanced decision-making outcomes of the latter are applied to the former. A DT also differs from a traditional physics-based simulation model. Although the latter is generally defined by closed-form mathematical formulations with well-defined hypotheses, a DT often has a non-parametric, data-driven approach (often based on a sensor network).
This manuscript recommends adopting a DT approach for dynamic thermal line rating by showing the performances of the ML models that would be at the core of a corresponding DT architecture. The complete DT architecture, including a simulation interface, a connection with the physical space, and a control mechanism to employ the mentioned ML models, is left for future research. For the sake of simplicity, in the following, the term Digital Twin will refer to the ML models adopted. A high level graphical representation of the proposed approach is shown in Figure 1.

1.2. Authors’ Contribution

The main contributions of this paper are:
  • The proposal of a Digital Twin approach for DTLR based on Machine Learning (ML): by employing a conductor temperature sensor to collect measurements on the line of interest for a limited amount of time, the DT would be trained using a ML model on the measured data. After the training phase, the DT will act as a complete virtual equivalent of the physical system modeled by the IEEE 738 standard;
  • A dimensionality reduction study: as shown in Section 4.2, the proposed approach could suggest which sensors measurements have a significant role in the sensors-temperature mapping through feature selection [25], providing the TSO with meaningful information in terms of sensors’ importance;
  • An investigation about the training phase duration: the DT can be trained with different amounts of historical data points, and the collected performances suggest a minimum duration of the training phase. This is the recommended minimal utilization time of the mentioned temperature sensor;
  • A prediction error analysis: the knowledge of the actual measured temperature allows the study of the areas of severe over/under-estimation by the IEEE-738 standard, utilizing a correlational and graphical analysis, presented in Section 4.3.
The experimental assessment presented in Section 4, shows that relying on a DT approach can significantly improve the accuracy of the estimated temperature with respect to the IEEE 738 standard: the prediction could influence the physical system based by adapting the current load, leading to better optimization of the bare overhead line.

2. Mathematical Formalization

A set of data X [ M , S ] and y [ M ] , respectively the matrix of acquired weather and measured line current data; and the vector of measured conductor temperature, is given. M is the number of total samples, and S is the number of acquired variables. Preliminary, the data above can be split into training and validation sets. Let, X t r n [ N , S ] and y t r n [ N ] , X v a l [ D , S ] , y v a l [ D ] , where N + D = M . X [ M , S ] , where S < S , is the reduced input data matrix, which contains only the non-constant variables for the prediction. y ^ [ M ] indicates the vector of the estimated conductor temperature, which can also be split in training and validation set, y ^ t r n [ N ] and y ^ v a l [ D ] , respectively.
This section provides a formal description of the methodologies adopted in the manuscript. First, Section 2.1 describes the IEEE 738 standard, providing the non-steady-state equation used to predict the conductor temperature. Next, sect ion Section 2.2 introduces the proposed Digital Twin models, namely black-box and grey-box [26]. Finally, Section 2.3 presents the core algorithm of the proposed Digital Twin approaches: the Random Forest Regressor. A graphical summary is illustrated in Figure 2.

2.1. IEEE 738 Standard

The IEEE 738 standard was developed in 1986 to “provide a practical, stable, and uniform (calculation) method for use and reference” [9]. It describes a numerical method for relating the core and surface temperature of a bare overhead electrical conductor to steady or time-varying electrical currents and weather conditions.
The static nature of this model collides with the highly dynamic behavior of the different parameters of the model (electric current and weather conditions, among others). For this reason, the model is run multiple times across short time spans, within which the values of the input parameters can be assumed to be constant.
The change in conductor temperature d T a v g over the time interval d t is calculated using the non-steady-state heat balance (Equation (1)). At the end of the time interval, the temperature is simply the sum of the initial temperature and its change. Then, using a series of such time intervals, the conductor temperature at the end of each interval is calculated to approximate the conductor temperature. In summary, the temperature is a time-varying quantity, depending on the current in the line and the weather conditions.
q c + q r + m · C p · d T a v g d t = q s + I 2 · R T avg
In Equation (1), q c [W/m] is the convection heat loss rate per unit length, q r [W/m] is the radiated heat loss rate per unit length, m C p [J/(m-°C)] is the total heat capacity of conductor, T a v g [°C] is the average conductor temperature, q s [W/m] is the heat gain rate from sun, I [A] is the conductor current, R ( T a v g ) [ Ω /m] is the AC resistance of conductor at temperature T a v g .
Therefore, a model based on IEEE 738, f i e e e , is an iterative model which supplies the prediction at t + 1 as follows:
y ^ v a l , i e e e [ t + 1 ] = f i e e e ( X v a l [ t , ] , y ^ v a l , i e e e [ t ] , Δ t ) t [ 1 , D ]
Equation (2) shows that IEEE 738 does not require any training data since it is just the numerical integration of the first-order differential Equation (1).

2.2. Proposed DT-Based Models

The proposed black-box model needs training data to perform and predicts the future temperature conductor considering only the weather and line current data.
y ^ v a l , b b [ t + 1 ] = f b b ( X v a l [ t , ] ) t [ 1 , D ]
where y ^ v a l , b b is the vector of predicted conductor temperature by using a black-box model and f b b is the black-box model. Hence, the black-box model is trained as follows:
f b b TRAIN ( X t r n , y t r n )
where the TRAIN function represents the training phase and returns a trained ML model.
The grey-box model also needs a set of training data to perform, but its aim is to correct the prediction of IEEE 738 model as follows:
y ^ v a l , g b [ t + 1 ] = y ^ v a l , i e e e [ t + 1 ] + z ^ v a l , g b [ t + 1 ] t [ 1 , D ]
where z ^ v a l , g b [ D ] is vector of the predicted error between the IEEE 738 and the actual temperature conductor, and y ^ v a l , g b [ D ] is the vector of the grey-box model final output. Particularly, the latter is obtained as:
z ^ v a l , g b [ t ] = f g b ( X v a l [ t , ] ) t [ 1 , D ]
where f g b is the grey-box model function, which is trained using the following map in the training step:
f g b TRAIN ( X t r n , e t r n , g b )
where e t r n , g b [ N 1 ] is the vector of the error between the true conductor temperature value and the estimated temperature of the conductor by IEEE 738 and the TRAIN function represents, as in (4), the training phase and returns a trained ML model. Particularly, each n-th element of this vector is linked to the error of the n + 1 -th step. Hence, the generic sample of this vector is equal to:
e t r n , g b [ n ] = y ^ t r n , i e e e [ n + 1 ] y t r n [ n + 1 ] n [ 1 , N 1 ]

2.3. Learning Algorithm

This work is characterized by the adoption of a Random Forest Regressor learning algorithm [27] from the scikit-library [28]. Preliminary experiments showed an outperformance of the considered model with respect to other approaches, such as linear regressor [29], k-NN [30], and SVM [31]. Therefore, in a winner-take-it-all approach, only the top-performing technique in the preliminary experiments has been chosen for the experimental assessment.
A Random Forest Regressor is a learning algorithm that leverages ensemble learning, combining predictions from multiple decision trees regressors to make a more accurate prediction than a single model. Each decision tree is trained with a different random subsample of the original data, and their final predictions are averaged. A random forest regression can effectively model linear and non-linear relationships in several other domains [32].
Constant columns in the original dataset do not provide meaningful information for a Random Forest model, therefore the need for the reduced input data matrix presented in Section 2.2.
Every decision tree is built top-down from a root node, containing all the data points in the corresponding random subsamples of X and its construction partitions the data into homogeneous groups by searching the feature value that maximizes a separation criteria.
A flowchart of a Random Forest algorithm with n trees is presented in Figure 3: the process starts with the collected data, which are split into training-set and testing-set. Next, the training-set is randomly subsampled n times, and each sample is used to build a decision tree. Every decision tree is then used to perform prediction on the unseen testing-set: the final Random Forest prediction is the average of each tree prediction.

3. Case Study Description

This section aims to show that adopting a DT can be of significant help in estimating the internal conductor temperature. In particular, storing the physical sensors data allows data-driven approaches to be adopted. A ML model can learn the relationship between the collected measures and the conductor temperature from historical data. Therefore, given more recent sensor measurements, the model can predict unseen values for the temperature.

3.1. Data

This manuscript employs real data gained from a sensor station installed on a tower of a High Voltage (HV) Overhead line (OHL) [33]. The time resolution is 1 min. The sensor station is installed about 45 m from the ground. It is equipped with a thermopile, which measures the solar radiative flow, an air temperature sensor, with a resolution of 1 °C, and a 3D ultrasonic anemometer, which measures wind speed components with an expected measurement error of 1 % m/s. Since the conductor is placed at a higher height than the sensor station, software estimates wind speed at 65 m according to wind shear equation [34], where the friction coefficients are experimentally computed using a neighbor mast station. The data are broadcast in real-time to a server, which couples the latter with the line current measurement for each t acquired at t + r , which is returned from the TSO’s Energy Management System (EMS). Since r is set equal to 60 s, a specific line current and weather variables stable condition are linked to a resulting conductor temperature reached after 60 s from a precise initial conductor temperature. In addition, to validate the performance of both methodologies, a device to measure the conductor temperature is temporarily installed on the conductor, called Micca™. The latter will not be employed in the final operative configuration.
To set a realistic analysis environment, the DT-based model does not consider—after the initial training phase—any real conductor temperature, which is used only to assess the model performance. However, since the IEEE 738 requires an initial conductor temperature to integrate the heat equation, the previously estimated conductor temperature value is the initial condition for the forward estimation. Under this setting, the IEEE 738 operates as an iterative predictor from a machine learning perspective, exposing it to all the critical issues characterizing this kind of model, e.g., the magnification of error at each iteration.

3.2. Experimental Settings

This manuscript’s experiments can be divided into three sets: prediction accuracy, dimensionality reduction, and error correlation analysis.
The first set of experiments, presented in Section 4.1, encompasses the primary results of this work, comparing the Digital Twin approaches with the IEEE 738 standard. The main parameters of this experiment are the dimensionality of the problem S and the number of trees of a Random Forest n t r e e s considered. The original number of features collected S is 10, while the number of non-constant features S is 5, namely air temperature, sun irradiance, conductor current, wind speed, and wind direction. n t r e e s = 100 , which is the default value of the model. For a proper validation of the performance metrics, 10-fold cross-validation is performed, i.e., the experiments are repeated 10 times by using a different training-set and testing-set. The size of the testing-set is 10% of the total data points: by adopting this technique, the performances over all the available data points can be measured.
The second set of experiments explores possibilities provided by adopting a Digital Twin that goes beyond the forecasting of the conductor temperature. They show that it is possible to utilize a limited number of sensors without significant loss of prediction accuracy and heavily reduce the size of the training-set of the model. The first result is obtained employing Feature Selection, the process of selecting the most informative columns from a table of data. This can be achieved with many methods, one of them being the important features suggestions of a Random Forest Regressor. This particular value is computed by extracting the Gini impurity-based feature importances after fitting all the available data into the model, calculated explicitly as the normalized total reduction in the criterion brought by that feature. The higher, the more important the feature. The second result is obtained performing an iterative run of the black-box experiments by increasing the training-set size, starting with the 5% of the dataset (approximately four days), and increasing by 5% at each iteration.
The last set of experiments aims at providing insights about physics-based model drawbacks. To achieve that, a correlational study was performed by filtering the days where an error greater than 5 °C in absolute value occurred for more than 40 measurements on the same day.

3.3. Metrics

Several metrics have been collected during these experiments. In the following, y ^ t r u t h [ t + 1 ] indicates the estimated value by any method, either y ^ i e e e [ t + 1 ] , y ^ b b [ t + 1 ] , or y ^ g b t + 1 .
The prediction error at time t + 1 , e y ^ [ t + 1 ] , is defined in Equation (9). Traditional metrics, such as the mean squared error (MSE, (10)) and the root mean squared error (RMSE, (11)), do not consider the asymmetrical nature of the described problem. An underestimation of the conductor temperature can result in an excessive increase in the current flow yielding to safety risks: it must be penalized more than an overestimation. For this reason, an asymmetrical variation of the MSE, from [35], is adopted and referred to as AMSE, defined in (12), where α ( 0 , 1 ) represents the desired degree of asymmetry, and 1 condition is a function whose value is 1 if the condition is true, 0 otherwise. For the reason mentioned above of penalizing underestimations, α is set to 0.25 , so that the corresponding MSE is multiplied by 0.75 for an underestimation and by 0.25 for an overestimation.
e y ^ [ t ] = y ^ [ t + 1 ] y t r u t h [ t + 1 ] t [ D 1 ]
MES y ^ = 1 K k = 1 K e y ^ [ k ] 2
RMSE y ^ = 1 K k = 1 K e y ^ [ k ] 2
AMSE y ^ = 1 K k = 1 K α 1 e y ^ [ k ] > 0 · e y ^ [ k ] 2
where K = D 1 .

4. Experimental Results

This section presents the results of the experimental assessment in the following order: the performance of the Digital Twin is shown in Section 4.1, followed by the dimensionality reduction analysis in Section 4.2. Finally, to conclude, it is possible to find the results of a correlation and visual analysis between the error and the input sensors measurements in Section 4.3.

4.1. Digital Twin Performance

The results of the experiments are shown in Figure 4. The plot shows the three error metrics introduced in Section 3.3, comparing the temperature predicted by the proposed Digital Twin approaches, with one estimated by the IEEE 738 standard. The Digital Twin predicts the temperature more accurately. The RMSE metric, having the same unity of measure as the temperature, can provide an intuitive understanding: the proposed grey-box approach has a mean RMSE of around 0.5 °C, while the IEEE 738 standard is slightly under 1.5 °C. The difference between the two approaches increases even further when considering the AMSE metric, meaning that the Digital Twin does fewer underestimations than the IEEE 738 standard. To offer a second point of view, a plot of the distribution of the estimation errors is provided in Figure 5. This confirms the predominance of the proposed approaches.

4.2. Dimensionality Reduction

As mentioned, one additional advantage of the Digital Twin is dramatically reducing the data collected without negatively affecting the model’s performance. For the physical model, the absence of one formula parameter is a blocking preventive of the correct estimation. The Digital Twin does not have such constraints.
Table 1 shows that the features do not share the same importance. By extracting the most relevant ones and iteratively adding them to the training-set, the performances can be computed and the best combination selected. The results clearly show that it is possible to significantly reduce the number of features to be collected without severely impacting the temperature estimation: the Digital Twin black-box model requires much fewer features to be collected than the physical model.
The significant difference of importance suggests the possibility not to consider the least significant features in the training-set. To assess how this would impact the performances of the Digital Twin, a re-execution of the black-box experiments is performed iteratively by increasing the number of features considered in the problem. First, by considering the air temperature and the sun irradiance, and then by adding the current flow, then the wind speed, and, finally, the wind direction. The results of this scenario are presented in Figure 6.
An additional direction to explore for reducing the problem’s dimensionality is the training-set size. This is of particular interest in this context because it indicates the time it is necessary to adopt the actual conductor temperature sensor for data collection. Results are presented in Figure 7: after around a training-set size of a month, there is no significant improvement. This suggests that one month might be a reasonable amount of time to collect the actual conductor temperature measurements.

4.3. Error Correlation Analysis

This section offers a correlational study between the critical errors of the IEEE 738 estimate and the corresponding input sensors values to provide a possible insight for future deeper analysis. The Pearson correlation coefficients between the error of the IEEE 738 standard and the input values are computed considering the hour of maximum error, the 10 h before, and the 10 h after. Specifically, the cell C f , d at row f and column d of Table 2 is the measure of the correlation between the error vector e t r n [ t + 1 ] and the values taken by the feature f in the considered 21h-fraction of day d. It is important to remind that the Pearson correlation coefficient is the ratio between the covariance cov of two vectors and the product of their standard deviations σ ; it takes values between 1 and 1, where 1 indicates total positive correlation, 1 total negative correlation, and 0 the absence of correlation.
In Figure 8 and Figure 9, it is possible to see some of the points analyzed to generate Table 2. This graphical representation aims at further analyzing the possible relationship between the areas of critical errors and the input parameters, providing the reader a visualization of the highest correlation coefficients found in Table 2. A correlational study has no causal implications, but it can give interesting insights and offer possible causal hypotheses. From Figure 8 and Figure 9, it appears that a sudden change in the most correlated feature in that moment, namely sun irradiance or conductor current, might yield to over/under-estimation of the conductor temperature by the IEEE 738 standard, while the DT is less affected by the same events, showing a minor reduction in performance.

5. Conclusions

This work proposes a Digital Twin approach to support the Transmission Line Operators in dynamic thermal rating of overhead transmission lines. A maximal core temperature is established to ensure the safety of transmission lines. Several factors influence the internal conductor temperature, including the amount of current injected in the line, the air temperature, the solar irradiance, and the wind intensity. Because of the expensive nature of direct measurement sensors, it is crucial to accurately estimate the line temperature to adapt the current for optimal line usage.
Physics-based DTLR methods allow the estimation of a line core conductor temperature from weather and line sensors rather than using expensive direct measurements devices. Nevertheless, their adoption requires multiple sensors, and the needed computations are not adequate for real-time estimations. The proposed approach improves the quality of temperature estimation, diminishes the number of required sensors, and reduces general costs. It exploits temporary availability of direct temperature measurement for a training phase, and it is designed with two submodules: a black-box module and a grey-box module. The former tries to learn the mapping from the input sensors parameters to the actual conductor temperature, the latter from the input sensors parameters to the IEEE 738 error. Both modules significantly improve the estimation quality, with better results for the grey-box module. A comparison with the IEEE 738 standard shows a reduction of 60% of the Root Mean Squared Error and a decrease in the maximum estimation error from above 10 °C to below 7 °C with respect to the actual conductor temperature. Furthermore, by not relying on IEEE 738 measurements, the black-box module offers multiple advantages, including a significant dimensionality reduction with respect to the IEEE 738 standard. The experimental results, yet preliminary, suggest the adoption of data-driven approaches in conjunction with physics-based ones. Further studies will focus on the exploration of forecasting strategies by studying techniques to improve the estimation accuracy with a broader prediction horizon, as well as the simplification of the employed models for facilitating the adoption of a data-driven Digital Twin approach. Additionally, future research will focus on adopting transfer learning techniques for exploiting the existing models on new lines, reducing the need for data for modeling, with a further reduction in costs.

Author Contributions

Conceptualization, G.M.P., F.D.C., J.D.S., A.V. and G.B.; methodology, G.M.P., F.D.C. and J.D.S.; software, G.M.P. and F.D.C.; validation, G.M.P., F.D.C. and J.D.S.; formal analysis, G.M.P., F.D.C. and J.D.S.; investigation, G.M.P., F.D.C. and J.D.S.; resources, F.D.C. and A.V.; data curation, F.D.C. and AV; writing—original draft preparation, G.M.P.; writing—review and editing, G.M.P., F.D.C., J.D.S., A.V. and G.B.; visualization, G.M.P.; supervision, A.V. and G.B.; project administration, F.D.C. and D.V.; funding acquisition, G.B. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Service Public de Wallonie Recherche under grant number 2010235–ARIAC by DigitalWallonia4.ai.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data used in this article cannot be made available at this stage of the research because of privacy reasons.

Acknowledgments

Gian Marco Paldino and Gianluca Bontempi are supported by the Service Public de Wallonie Recherche under grant nr 2010235–ARIAC by DigitalWallonia4.ai. Computational resources have been provided by the Consortium des Équipements de Calcul Intensif (CÉCI), funded by the Fonds de la Recherche Scientifique de Belgique (F.R.S.-FNRS) under Grant No. 2.5020.11 and by the Walloon Region.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:
MLMachine Learning
DTDigital Twin
HVHigh Voltage
SLRStatic Line Rating
DTLRDynamic Thermal Line Rating
TSOTransmission System Operator
OHLOverhead Lines
EMSEnergy Management System

References

  1. Alanne, K.; Saari, A. Distributed energy generation and sustainable development. Renew. Sustain. Energy Rev. 2006, 10, 539–558. [Google Scholar] [CrossRef]
  2. Karimi, S.; Musilek, P.; Knight, A.M. Dynamic thermal rating of transmission lines: A review. Renew. Sustain. Energy Rev. 2018, 91, 600–612. [Google Scholar] [CrossRef]
  3. Uski-Joutsenvuo, S.; Pasonen, R.; Rissanen, S. Maximising Power Line Transmission Capability by Employing Dynamic Line Ratings–Technical Survey and Applicability in Finland; Tech. Rep; VTT Technical Research Centre: Espoo, Finland, 2013. [Google Scholar]
  4. Singh, C.; Singh, A.; Pandey, P.; Singh, H. Power donuts in overhead lines for dynamic thermal rating measurement prediction and electric power line monitoring. Int. J. Adv. Res. Electr. Electron. Instrum. Eng. 2014, 3, 93949400. [Google Scholar]
  5. Olsen, R.G.; Edwards, K.S. A new method for real-time monitoring of high-voltage transmission-line conductor sag. IEEE Trans. Power Deliv. 2002, 17, 1142–1152. [Google Scholar] [CrossRef]
  6. Coletta, G.; Vaccaro, A.; Villacci, D. A review of the enabling methodologies for PMUs-based dynamic thermal rating of power transmission lines. Electr. Power Syst. Res. 2017, 152, 257–270. [Google Scholar] [CrossRef]
  7. Soto, F.; Alvira, D.; Martin, L.; Latorre, J.; Lumbreras, J.; Wagensberg, M. Increasing the capacity of overhead lines in the 400 kV Spanish transmission network: Real time thermal ratings. In Proceedings of the CIGRÉ Session, Paris, France, 15–17 September 1998; pp. 22–211. [Google Scholar]
  8. Seppa, T.; Salehian, A. Guide For Selection of Weather Parameters for Bare Overhead Conductor Ratings; CIGRE: Paris, France, 2006; Volume 2. [Google Scholar]
  9. IEEE Standard for Calculating the Current-Temperature Relationship of Bare Overhead Conductors. In IEEE Std 738-2012 (Revision of IEEE Std 738-2006–Incorporates IEEE Std 738-2012 Cor 1-2013); IEEE: Piscataway, NJ, USA, 2013; pp. 1–72. [CrossRef]
  10. Yang, Y.; Harley, R.G.; Divan, D.; Habetler, T.G. Thermal modeling and real time overload capacity prediction of overhead power lines. In Proceedings of the 2009 IEEE International Symposium on Diagnostics for Electric Machines, Power Electronics and Drives, Cargése, France, 31 August–3 September 2009; pp. 1–7. [Google Scholar] [CrossRef]
  11. Yang, Y.; Divan, D.; Harley, R.; Habetler, T. Real-time dynamic thermal rating evaluation of overhead power lines based on online adaptation of Echo State Networks. In Proceedings of the 2010 IEEE Energy Conversion Congress and Exposition, Atlanta, GA, USA, 12–16 September 2010; pp. 3638–3645. [Google Scholar]
  12. Yang, Y.; Harley, R.G.; Divan, D.; Habetler, T.G. MLPN based parameter estimation to evaluate overhead power line dynamic thermal rating. In Proceedings of the IEEE 2009 15th International Conference on Intelligent System Applications to Power Systems, Curitiba, Brazil, 8–12 November 2009; pp. 1–7. [Google Scholar]
  13. Safari, N.; Mazhari, S.M.; Chung, C.Y.; Ko, S.B. Secure Probabilistic Prediction of Dynamic Thermal Line Rating. J. Mod. Power Syst. Clean Energy 2021, 1–11. [Google Scholar] [CrossRef]
  14. Ali, E.; Knight, A.M. An Enhanced Algorithm for Conventional Protection Devices of Transmission Lines with DTLR. IEEE Access 2021, 9, 147295–147305. [Google Scholar] [CrossRef]
  15. Elsisi, M.; Mahmoud, K.; Lehtonen, M.; Darwish, M.M.F. Reliable Industry 4.0 Based on Machine Learning and IoT for Analyzing, Monitoring, and Securing Smart Meters. Sensors 2021, 21, 487. [Google Scholar] [CrossRef]
  16. Elsisi, M.; Tran, M.Q.; Mahmoud, K.; Lehtonen, M.; Darwish, M.M.F. Deep Learning-Based Industry 4.0 and Internet of Things towards Effective Energy Management for Smart Buildings. Sensors 2021, 21, 1038. [Google Scholar] [CrossRef]
  17. Tao, F.; Zhang, H.; Liu, A.; Nee, A.Y. Digital twin in industry: State-of-the-art. IEEE Trans. Ind. Inform. 2018, 15, 2405–2415. [Google Scholar] [CrossRef]
  18. Qi, Q.; Tao, F. Digital twin and big data towards smart manufacturing and industry 4.0: 360 Degree comparison. IEEE Access 2018, 6, 3585–3593. [Google Scholar] [CrossRef]
  19. Zhou, M.; Yan, J.; Feng, D. Digital twin framework and its application to power grid online analysis. CSEE J. Power Energy Syst. 2019, 5, 391–398. [Google Scholar] [CrossRef]
  20. Pan, H.; Dou, Z.; Cai, Y.; Li, W.; Lei, X.; Han, D. Digital Twin and Its Application in Power System. In Proceedings of the 2020 5th International Conference on Power and Renewable Energy (ICPRE), Shanghai, China, 12–14 September 2020; pp. 21–26. [Google Scholar] [CrossRef]
  21. Lund, A.M.; Mochel, K.; Lin, J.W.; Onetto, R.; Srinivasan, J.; Gregg, P.; Bergman, J.E.; Hartling, K.D.; Ahmed, A.; Chotai, S.; et al. Digital Twin Interface for Operating Wind Farms. U.S. Patent US 9,995,278, 12 June 2018. [Google Scholar]
  22. Shah, T.; Govindappa, S.; Nistler, P.; Narayanan, B. Digital Twin System for a Cooling System. U.S. Patent US9881430B1, 30 June 2018. [Google Scholar]
  23. Dröder, K.; Bobka, P.; Germann, T.; Gabriel, F.; Dietrich, F. A Machine Learning-Enhanced Digital Twin Approach for Human-Robot-Collaboration. Procedia CIRP 2018, 76, 187–192. [Google Scholar] [CrossRef]
  24. Min, Q.; Lu, Y.; Liu, Z.; Su, C.; Wang, B. Machine Learning based Digital Twin Framework for Production Optimization in Petrochemical Industry. Int. J. Inf. Manag. 2019, 49, 502–519. [Google Scholar] [CrossRef]
  25. Guyon, I.; Elisseeff, A. An introduction to variable and feature selection. J. Mach. Learn. Res. 2003, 3, 1157–1182. [Google Scholar]
  26. Bontempi, G.; Vaccaro, A.; Villacci, D. Semiphysical modelling architecture for dynamic assessment of power components loading capability. IEE Proc. Gener. Transm. Distrib. 2004, 151, 533. [Google Scholar] [CrossRef]
  27. Biau, G.; Scornet, E. A random forest guided tour. Test 2016, 25, 197–227. [Google Scholar] [CrossRef] [Green Version]
  28. Pedregosa, F.; Varoquaux, G.; Gramfort, A.; Michel, V.; Thirion, B.; Grisel, O.; Blondel, M.; Prettenhofer, P.; Weiss, R.; Dubourg, V.; et al. Scikit-learn: Machine Learning in Python. J. Mach. Learn. Res. 2011, 12, 2825–2830. [Google Scholar]
  29. Seber, G.A.; Lee, A.J. Linear Regression Analysis; John Wiley & Sons: Hoboken, NJ, USA, 2012; Volume 329. [Google Scholar]
  30. Song, Y.; Liang, J.; Lu, J.; Zhao, X. An efficient instance selection algorithm for k nearest neighbor regression. Neurocomputing 2017, 251, 26–34. [Google Scholar] [CrossRef]
  31. Cherkassky, V.; Ma, Y. Practical selection of SVM parameters and noise estimation for SVM regression. Neural Netw. 2004, 17, 113–126. [Google Scholar] [CrossRef] [Green Version]
  32. Howard, J.; Bowles, M. The two most important algorithms in predictive modeling today. In Proceedings of the Strata Conference Presentation, Santa Clara, CA, USA, 28 February–1 March 2012; Volume 28. [Google Scholar]
  33. Villacci, D.; Gasparotto, F.; Orrú, L.; Pelacchi, P.; Poli, D.; Vaccaro, A.; Lisciandrello, G.; Coletta, G. Congestion Management in Italian HV grid using novel Dynamic Thermal Rating methods: First results of the H2020 European project Osmose. In Proceedings of the 2020 AEIT International Annual Conference (AEIT), Catania, Italy, 23–25 September 2020; pp. 1–6. [Google Scholar] [CrossRef]
  34. Irwin, J.S. A theoretical variation of the wind profile power-law exponent as a function of surface roughness and stability. Atmos. Environ. 1979, 13, 191–194. [Google Scholar] [CrossRef]
  35. Tian, J. Forecasting the Unemployment Rate When the Forecast Loss Function Is Asymmetric; School of Economics and Finance, Faculty of Business, University of Tasmania: Hobart, Australia, 2009. [Google Scholar]
Figure 1. High-level overview of the proposed Digital Twin approach.
Figure 1. High-level overview of the proposed Digital Twin approach.
Energies 15 02254 g001
Figure 2. Detailed overview of the proposed Digital Twin approach.
Figure 2. Detailed overview of the proposed Digital Twin approach.
Energies 15 02254 g002
Figure 3. A Random Forest algorithm with n trees. A subsample of the original data are used to build each tree. The final prediction is the average between all the n trees predictions.
Figure 3. A Random Forest algorithm with n trees. A subsample of the original data are used to build each tree. The final prediction is the average between all the n trees predictions.
Energies 15 02254 g003
Figure 4. Performance metrics: the plot represents the mean values of the metrics presented in Section 3.3. The grey-box approach achieves optimal performances, especially in terms of AMSE, appearing to be the optimal approach.
Figure 4. Performance metrics: the plot represents the mean values of the metrics presented in Section 3.3. The grey-box approach achieves optimal performances, especially in terms of AMSE, appearing to be the optimal approach.
Energies 15 02254 g004
Figure 5. Error e y ^ t + 1 Equation (9) distribution without outliers (a) and with outliers (b). Referring with Q i to the i th quartile, an outlier is a value that is above Q3 + 1.5 (Q3 − Q1) or below Q1 − 1.5 (Q3 − Q1). The result is coherent with Figure 4: the grey-box shows less variance, particularly with the outliers. Its error is rarely greater than 5 degrees in absolute value.
Figure 5. Error e y ^ t + 1 Equation (9) distribution without outliers (a) and with outliers (b). Referring with Q i to the i th quartile, an outlier is a value that is above Q3 + 1.5 (Q3 − Q1) or below Q1 − 1.5 (Q3 − Q1). The result is coherent with Figure 4: the grey-box shows less variance, particularly with the outliers. Its error is rarely greater than 5 degrees in absolute value.
Energies 15 02254 g005aEnergies 15 02254 g005b
Figure 6. Variation of the error metrics by increasing the dimensionality of the problem. n indicates the number of features considered, in order of decreasing importance as shown in Table 1. (a) represents the Mean Squared Error, (b) the Root Mean Squared Error and (c) the Asymmetric Mean Squared Error, defined in Section 3.3. It is noticeable that the three most important features (air temperature, sun irradiance, and current flow) already allow to obtain an RMSE of less than 1 °C. In the plot, IEEE 738 is constant because it does not depend on external features.
Figure 6. Variation of the error metrics by increasing the dimensionality of the problem. n indicates the number of features considered, in order of decreasing importance as shown in Table 1. (a) represents the Mean Squared Error, (b) the Root Mean Squared Error and (c) the Asymmetric Mean Squared Error, defined in Section 3.3. It is noticeable that the three most important features (air temperature, sun irradiance, and current flow) already allow to obtain an RMSE of less than 1 °C. In the plot, IEEE 738 is constant because it does not depend on external features.
Energies 15 02254 g006
Figure 7. Variation of the errormetrics by increasing the data size used for the training phase. (a) represents the Mean Squared Error, (b) the Root Mean Squared Error and (c) the Asymmetric Mean Squared Error, defined in Section 3.3. On the x-axis, the number of days of data considered for the training. One month is sufficient to build a robust model, while more time brings no significant improvement.
Figure 7. Variation of the errormetrics by increasing the data size used for the training phase. (a) represents the Mean Squared Error, (b) the Root Mean Squared Error and (c) the Asymmetric Mean Squared Error, defined in Section 3.3. On the x-axis, the number of days of data considered for the training. One month is sufficient to build a robust model, while more time brings no significant improvement.
Energies 15 02254 g007aEnergies 15 02254 g007b
Figure 8. A visualization of the first to third critical areas for the IEEE 738 standard from Table 2. The error Equation (9), is represented for both the IEEE 738 standard and the grey-box proposed approach. Additionally, the most related feature, according to Table 2, is plotted. (a,b) report 2020-12-03, where (b) is a zoom on the area of interest of (a); (c,d) report 2020-12-04, where (d) is a zoom on the area of interest of (c); (e,f) report 2020-12-23, where (e) is a zoom on the area of interest of (f). In all cases, a rapid variation of the measure is most correlated with the error in the moment of its peak.
Figure 8. A visualization of the first to third critical areas for the IEEE 738 standard from Table 2. The error Equation (9), is represented for both the IEEE 738 standard and the grey-box proposed approach. Additionally, the most related feature, according to Table 2, is plotted. (a,b) report 2020-12-03, where (b) is a zoom on the area of interest of (a); (c,d) report 2020-12-04, where (d) is a zoom on the area of interest of (c); (e,f) report 2020-12-23, where (e) is a zoom on the area of interest of (f). In all cases, a rapid variation of the measure is most correlated with the error in the moment of its peak.
Energies 15 02254 g008
Figure 9. A visualization of the fourth and fifth critical areas for the IEEE 738 standard from Table 2. The error, defined by Equation (9), is represented for both the IEEE 738 standard and the grey-box proposed approach. Additionally, the most related feature, according to Table 2, is plotted. (a,b) report 2021-01-05, where (b) is a zoom on the area of interest of (a); (c,d) report 2021-01-26, where (d) is a zoom on the area of interest of (c). In all cases, a rapid variation of the measure is most correlated with the error in the moment of its peak.
Figure 9. A visualization of the fourth and fifth critical areas for the IEEE 738 standard from Table 2. The error, defined by Equation (9), is represented for both the IEEE 738 standard and the grey-box proposed approach. Additionally, the most related feature, according to Table 2, is plotted. (a,b) report 2021-01-05, where (b) is a zoom on the area of interest of (a); (c,d) report 2021-01-26, where (d) is a zoom on the area of interest of (c). In all cases, a rapid variation of the measure is most correlated with the error in the moment of its peak.
Energies 15 02254 g009
Table 1. This table collects values representing the features importance, computed employing the Random Forest Regressor built-in feature importance metric, as the normalized total reduction in the Gini impurity brought by that feature. The table clearly shows the predominant importance of the air temperature followed by the sun irradiance. Together, they account for more than 90% of the relevant information in predicting the conductor temperature.
Table 1. This table collects values representing the features importance, computed employing the Random Forest Regressor built-in feature importance metric, as the normalized total reduction in the Gini impurity brought by that feature. The table clearly shows the predominant importance of the air temperature followed by the sun irradiance. Together, they account for more than 90% of the relevant information in predicting the conductor temperature.
FeatureImportance [ ]
Air temp [°C]0.76
Sun irradiance [W/m]0.18
Current flow [A]0.03
Wind Speed [m/s]0.02
Arranged Wind Dir [°]0.01
Table 2. Pearson correlation coefficient between the error of the IEEE 738 standard and the input values, for period with critical errors. Dates are in the format YYYY/MM/DD.
Table 2. Pearson correlation coefficient between the error of the IEEE 738 standard and the input values, for period with critical errors. Dates are in the format YYYY/MM/DD.
Feature2020/12/032020/12/042020/12/232021/01/052021/01/10
Air temp−0.25−0.86−0.460.27−0.54
Irradiance−0.24−0.69−0.820.540.29
Current flow0.840.31−0.250.760.77
Wind Speed−0.090.390.26−0.02−0.3
Wind Dir−0.09−0.30−0.20−0.18−0.08
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Paldino, G.M.; De Caro, F.; De Stefani, J.; Vaccaro, A.; Villacci, D.; Bontempi, G. A Digital Twin Approach for Improving Estimation Accuracy in Dynamic Thermal Rating of Transmission Lines. Energies 2022, 15, 2254. https://doi.org/10.3390/en15062254

AMA Style

Paldino GM, De Caro F, De Stefani J, Vaccaro A, Villacci D, Bontempi G. A Digital Twin Approach for Improving Estimation Accuracy in Dynamic Thermal Rating of Transmission Lines. Energies. 2022; 15(6):2254. https://doi.org/10.3390/en15062254

Chicago/Turabian Style

Paldino, Gian Marco, Fabrizio De Caro, Jacopo De Stefani, Alfredo Vaccaro, Domenico Villacci, and Gianluca Bontempi. 2022. "A Digital Twin Approach for Improving Estimation Accuracy in Dynamic Thermal Rating of Transmission Lines" Energies 15, no. 6: 2254. https://doi.org/10.3390/en15062254

APA Style

Paldino, G. M., De Caro, F., De Stefani, J., Vaccaro, A., Villacci, D., & Bontempi, G. (2022). A Digital Twin Approach for Improving Estimation Accuracy in Dynamic Thermal Rating of Transmission Lines. Energies, 15(6), 2254. https://doi.org/10.3390/en15062254

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop