1. Introduction
The generation of electricity from renewable sources has been rapidly increasing in recent years, with an estimated growth from
in OECD countries in 2019 to one-third by 2035 [
1]. Among renewable energy technologies, wind energy is considered to be a leading contender for the coming decades, owing to its favorable energy density and conversion efficiency. The global installation of wind turbines nearly doubled from 58 GW in 2019 to 111 GW in 2020 [
2], and this trend has been further accelerated by recent geopolitical trends.
One of the keystones in enhancing the profitability of wind energy is the containment of operation and maintenance (O&M) costs, which can account for the 30% of the total costs over the lifetime of a wind plant [
3], and can dramatically increase for offshore installations. This underscores the increasing attention being given to wind turbine condition monitoring [
4], which is a challenging task due to the complex nature of wind turbines and their exposure to nonstationary operation conditions.
Supervisory control and data acquisition (SCADA) systems have traditionally been utilized for the remote control of wind turbines, but their concept has evolved to become a powerful tool for condition monitoring [
5,
6]. The basic idea is to measure selected channels at an appropriate sampling rate (ranging from one second for wind speed to several seconds for nacelle position) and store them after averaging over a standardized time basis, typically ten minutes. Even with the limited number of measurement channels collected by wind turbines in the 600–850 kW era, the use of SCADA data for condition and performance monitoring has become standard practice, with the power curve analysis even being standardized by the International Electrotechnical Commission (IEC) [
7]. The power curve, which relates the input wind speed to the output power, is widely recognized as the most intuitive representation of wind turbine performance [
8,
9,
10]. In this regard, the IEC has provided guidelines for standardizing, renormalizing, and analyzing power curve data. However, modern wind turbines, with rated powers ranging from 4 to 7 MW, are equipped with hundreds of measurement channels that capture various aspects, including environmental conditions, mechanical response (temperatures and vibrations), electrical behavior, and wind turbine control, among others. While this increases the potential of SCADA-based condition monitoring for wind turbine systems, it also presents new challenges.
The analysis of wind turbine power curves, following the guidelines of the IEC, has its limitations. It can tell us when a turbine is underperforming by comparing it to a reference, but it falls short in explaining the underlying causes. Recent breakthroughs in the field have sparked interest in developing multivariate data-driven models for power curves, aiming to overcome this limitation [
11,
12]. These innovative approaches consider the output power of wind turbines as a function of multiple input variables or covariates to provide a more comprehensive understanding of the factors that influence wind turbine performance. However, as discussed in more detail in
Section 1.2, most of the studies that employ multiple input–single output models are black-box models, which can be opaque, especially as the number of input variables increases. Therefore, there are open issues related to explainability in multivariate power curve models, which can be summarized as follows:
Therefore, the objective of this study is to address the above two open questions by utilizing explainable artificial intelligence (XAI) techniques in the analysis of two real-world test cases. The advancements in XAI [
13,
14] offer promising potential for the proper selection and interpretation of input variables, as XAI methods can provide insights into the relative importance of different sensors in determining a certain output, such as wind turbine power. In light of this, recent related work is reviewed in
Section 1.1.
1.1. Related Work
A multivariate data-driven wind turbine power curve model is a regression with multiple input variables and one output (the power). However, there are several open issues and no consolidated standards regarding the selection of input data and the model structure. As a result, there have been numerous recent studies in the literature addressing these challenges.
In the literature (e.g., [
15]), the use of meteorological mast data, such as air density, humidity, turbulence intensity, and wind shear, as covariates in addition to wind speed is a minority approach, as it has been established that operational variables are more appropriate as additional covariates [
16]. However, a recent study by [
17] utilizes multivariate polynomial regressions to model wind turbine power curves based on wind speed and turbulence measurements obtained from nacelle lidars. Another seminal study in the literature is [
18], where only data collected from the SCADA control system are employed, but additional covariates such as ambient temperature and wind direction are considered. A similar approach is pursued in [
19], where the power of a wind turbine is modeled as a function of various SCADA-collected environmental variables, including wind speed, temperature, turbulence intensity, and wind direction, by comparing several models, such as the Gaussian mixture copula model and regressive and Bayesian artificial neural networks.
A well-established study on the use of SCADA-collected operation variables for multivariate data-driven power curves is presented in [
20], where variables such as wind direction, yaw error, blade pitch, and rotor speed are employed in addition to wind speed. Blade pitch and rotational speed are the most commonly used additional covariates in the literature [
12,
21], as they are fundamental parameters that influence the power coefficient of a wind turbine, which in turn depends on the tip–speed ratio (equivalent to rotational speed) and blade pitch, as dictated by first principles [
22].
Some recent studies have addressed the problem of multivariate modeling of wind turbine power by segmenting the data based on various criteria. For instance, in [
23], a piecewise multivariate Gaussian Process (GP) model is proposed, where different models are created for each wind speed bin, incorporating environmental (e.g., wind speed and direction), operational (e.g., rotational speed and blade pitch angle), and thermal variables (e.g., gear bearing temperature, gear oil temperature) as input variables. A similar approach of segmenting the wind turbine power curve and constructing multivariate models for each segment is also explored in [
11,
24]. Another study by [
25] focuses on accounting for the effect of yaw misalignment on wind turbine power, and therefore, the data are segmented per bin of yaw angle, with a yaw-adjusted additive multivariate kernel (AMK) model established for each segment. For a recent review on data-driven wind turbine power curve modeling, refer to [
26].
A few recent studies, such as [
27,
28], have focused on automatic feature selection from a large set of potential covariates. In particular, they unravel the influence of wind turbine models, specifically focusing on the type of control employed, whether it be electrical or hydraulic blade pitch control. Their findings shed light on the nuanced relationship between the choice of features and the turbine model type, further fueling our quest to push the boundaries of knowledge in this field. Building upon this insight, the objective of our study is to further advance the field by formulating XAI methods for feature selection and interpretation of the constructed model and applying them for anomaly detection in wind turbine power curve modeling.
1.2. Contributions of the Work
The objective of the present work is to advance the state of the art in multivariate wind turbine power curve modeling and its application. This will be achieved through several perspectives, including:
Enhancing model accuracy.
Gaining insight into feature selection and improving explainability.
Applying the model for condition monitoring purposes.
A comprehensive examination of features is conducted using explanatory methods, including a sequential feature selection (SFS) algorithm and the calculation of Shapley coefficients [
29]. The selected regression model is GP, which is widely used in the literature for such applications [
12,
30]. Notably, the methodology proposed in this work provides a replicable criterion for selecting input variables (SFS) and quantifying the impact of each variable on the average error metrics (Shapley coefficients), which is particularly relevant for a black-box model such as GP.
A distinctive feature of this study is the broader set of considered possible covariates compared to the existing literature, including previously unexplored variables. The variable set includes measurements related to rotor control (rotational speed and hydraulic or electrical blade pitch) as well as the mechanical and thermal behavior of the wind turbine, based on the authors’ expertise. A large amount of real data, with different types of technology (hydraulic vs. electrical blade pitch control), are considered, enriching the experimental result section of this manuscript.
In summary, the anticipated innovative results of this work are as follows:
The optimal selection of input variables is found to be nontrivial and differs from the typical set employed in the literature, which usually includes only blade pitch and rotational speed.
Specifically, the role of blade pitch as an input variable should be reconsidered if more explanatory input variables are identified.
The proposed method is demonstrated to be effective for condition monitoring of wind turbine systems, successfully diagnosing three anomalies. Notably, two of these anomalies are related to blade pitch control (hydraulic and electrical, respectively), and one is associated with transverse tower vibrations, likely also influenced by blade pitch behavior.
The proposed application for condition monitoring is particularly advantageous in order to detect hidden anomalies in the operational behavior of wind turbines. Currently, the state of the art for the use of SCADA-collected data for condition monitoring is the normal behavior modeling of certain features, which are selected as targets based on first principles. For example, the temperature increase at meaningful rotating components of the wind turbine is targeted as a symptom of evolving damage. The approach proposed in the present work does not need to know a priori what the target is to monitor for identifying an anomaly. It starts from a set of representative features that is possibly vast, and highlights differences in the operation behavior. This allows for anomalies to be unveiled that are overlooked in the wind farm operation practice and in the scientific literature, such as those highlighted for the test cases of this work.
The rest of the manuscript is structured as follows:
Section 2 illustrates the proposed methodology,
Section 3 describes data and the experimental setting, and
Section 4 describes the conducted experiments. A discussion of the results is presented in
Section 5, while
Section 6 draws the main conclusions.
2. Proposed Methodology
The present study focuses on the construction of a multivariate power curve model using a combination of Gaussian process regression (GPR) and XAI-based methods. The novelty is that the accuracy of the built model, combined with the employment of the Shapley coefficient, allows for the realization of effective condition monitoring systems.
The proposed methodology follows a workflow illustrated in
Figure 1 and can be divided into three main blocks, labeled as (A), (B), and (C).
In block (A), the initial stage involves the collection of data from a reference wind turbine generator (WTG) over a specific time period. To identify the most influential variables from a vast pool of potential inputs, the sequential forward selection (SFS) algorithm is applied. Subsequently, the selected covariates obtained from SFS are utilized to construct a Gaussian process regression (GPR) model. To gain insights into the average error and the contribution of each selected covariate, Shapley coefficients [
31,
32] are computed.
The trained model, together with the selected input features and their rankings based on Shapley coefficients, is deployed to each WTG within the wind farm array, denoted as the c-th WTG. In block (B), the model trained using the reference WTG is utilized to evaluate the performance of the c-th WTG. The input data for the c-th WTG are filtered according to the sequential forward selection (SFS) process, ensuring optimal inputs for performance monitoring (PM). By comparing the measured and estimated outputs, accuracy metrics (AM) are computed. Once the operations in block (B) are concluded, an RMSE collection is obtained, consisting of one sample per WTG, providing insights into the overall performance of the wind farm.
In block (C), leveraging the information gathered from the RMSE distribution, outliers are detected, highlighting potential anomalous operating conditions in specific WTGs. For these identified WTGs, the Shapley coefficients are computed using the reference wind turbine model. The obtained rankings for these target wind turbines are then compared with that of the reference WTG to identify any disparities. Variables exhibiting divergent ranking positions serve as alarm signals for wind farm operators, aiding in WTG condition monitoring. This approach enables the timely detection and investigation of deviations from expected performance, ensuring that prompt corrective actions can be taken to maintain the optimal functioning of the wind farm.
Figure 1.
Workflow of the proposed methodology. (*) The selected features are the same extracted from the target WTG. (**) Iteration over each target WTG.
Figure 1.
Workflow of the proposed methodology. (*) The selected features are the same extracted from the target WTG. (**) Iteration over each target WTG.
The methodology is here discussed in detail, highlighting its key components, before introducing the workflow of the XAI-based methods. Mathematically, the process above leads to the development of a model:
where
is the power output vector of
N samples, and
is the multivariate input matrix with
M variables (or features). The latter includes the variables selected by SFS from a set of
predictors
, and validated by Shapley coefficients. For the generic
nth sample, Equation (
1) is
.
The XAI workflow employed in this work exploits the map of Equation (
1) in several ways (as is visible from the flow diagram in
Figure 1) and optimizes such map using a loss function (i.e., an accuracy metric). Therefore, before describing in detail the XAI methods in
Section 2.3, we describe in
Section 2.1 the particular map
selected in this work (GPR) and we introduce in
Section 2.2 the employed accuracy metrics, indicating at which step of the workflow they are employed.
2.1. Gaussian Process Regression
Gaussian regression is the nonparametric kernel-based probabilistic model used in this study to build the multivariate wind power curve according to the map .
Particularly, the principles of Gaussian process regression are extensively explained in [
33], and we refer to their work for detailed information, as well as to the work in [
12,
34] for wind turbines SCADA applications.
Mathematically, a Gaussian process is a stochastic process defined in terms of mean
and its covariance
function, as in Equation (
2):
where
m is the expected value (Equation (
3)):
and the covariance is defined in terms of its associate probability density function (Equation (
4)):
where
q is the number of input variables (which means the dimension of the vector
) and
is the determinant of the covariance matrix, whose order is equal to the number of available samples in the training set (
). The terms
and
indicate two points of the space, which correspond to two rows of matrix
. The mean
can be assumed without loss of generality to be zero. The meaning of an element
is a measure of how different points
x and
are.
The kernel covariance function selected in this work is the squared exponential, which is given in Equation (
5):
where
(variance),
(length scale), and
(noise variance) are the model hyperparameters. In this study, the hyperparameters
,
, and
are estimated using the Bayesian optimization. A detailed analysis of the covariance function selection for this kind of application is provided in [
34]. The elements of
are, therefore, the variances of each input variable in the diagonal and the covariance (computed according to Equation (
5)) between pairs of input variables in the off-diagonal. Each element of matrix
is computed by applying Equation (
5).
The assumption of the Gaussian process is, therefore, that the vector output
to predict is given by Equation (
6):
where
is the sample GP function, and
is the white noise and
the identity matrix. The model training is based on the log-likelihood maximization of the probability (Equation (
4)) of observing the training data set output
given the input observations matrix
, as in Equation (
7):
where
and
are the training input and output data, respectively.
The trained model can be used to predict the test output given a new set of unseen data
considering that
and
follow a joint normal distribution, as shown in Equation (
8):
Particularly,
is the estimated function, whose weights are returned by using Gaussian likelihood and computing the posterior distribution of the weights with respect to the test data [
35]. For this kind of model, the conditional distribution
is a multivariate normal distribution, with mean and variance given in Equations (
9) and (
10):
where
is the covariance matrix of the test data, whose size is
; and
(
) is the covariance matrix between training/test and test/ training data, whose size is
(
). Equation (
9) returns the corresponding vector of the estimated target
, where the single element is returned by considering the single point of
.
2.2. Accuracy Metrics
The proposed methodologies utilize an iterative approach, where the accuracy of the model is monitored using defined loss functions in block (A) of
Figure 1. The following metrics are therefore used to assess the model’s performance. Let
be the
i-th residual, defined in Equation (
11) as the difference between the
i-th measurement and the estimate provided by the model when the
i-th row of the
is fed as input:
where
and
are the true and estimated value, respectively.
The root-mean-square error (RMSE) is defined as in Equation (
12):
where
is the average residual and
, is the number of samples in the evaluation data set. The mean absolute error (
) is defined in Equation (
13):
Furthermore, the 95th percentile of the vector composed of the absolute value of the residuals is selected as a meaningful accuracy metric (indicated as
). The rationale for this is that hopefully, the constructed method should give high prediction errors with an extremely low probability. The 95th percentile is computed using the T-digest approximation [
36]. These metrics are normalized to the rated power, making them comparable with the literature.
The metrics are employed as follows:
The RMSE is used as a loss function at each step of the sequential feature selection process, block (A) of
Figure 1;
The RMSE, the MAE, and
) are used to evaluate the accuracy of the GPR model trained with the selected features, in relation to the state of the art in the literature. This is performed in block (A) of
Figure 1. For completeness, a comparison is set up against a univariate method, taking as input solely the wind speed, and the same accuracy metrics are employed.
The RMSE of the GPR model trained with the reference wind turbine and employed on the data of the target wind turbines is employed to identify outlier wind turbines that are suspected of anomaly. This is performed in block (B) of
Figure 1.
2.3. XAI Methods
The model described in
Section 2.1 can be executed with any selection of input variables, and its accuracy can be evaluated using the metrics in
Section 2.2. The two XAI methods proposed in this paper aim to provide insights into the importance of each possible input variable. The SFS method starts with an empty set of input variables. It adds them one at a time, identifying at each round of the GPR regression the variable that reduces a certain loss function the most (in this work, root-mean-square error). On the other hand, the calculation of the Shapley coefficients works in the opposite direction. It begins with the set of covariates selected at the output of the SFS algorithm and then computes the difference in model output for input data with or without each feature by marginalizing over subsets that do not contain the feature. The former method ranks input variables based on how much they reduce errors, while the latter ranks them based on how much they explain the error. These methods can be seen as two complementary ways of explaining feature importance.
2.3.1. Sequential Feature Selection
The goal of SFS is to identify the input variables that, when added, lead to an improvement in prediction accuracy. This algorithm is crucial in transitioning from relying solely on expert knowledge to having a clear criterion for selecting the input variables for a power curve model.
The SFS algorithm, applied in block (A) of the workflow of
Figure 1, follows the steps outlined below:
Initialize a matrix with null dimensions to store the most significant predictors from the set , with P input variables, a vector of output , a counter variable M set to 1, and . P represents the number of variables present in the original data sets, where each variable can be considered as a column vector with N samples. As more features related to the operating state of wind turbines are included from scratch, the value of P increases accordingly, but for a certain data set, P is fixed, and is therefore not a parameter of the model.
Repeat until
- (a)
Iterate for each j-th variable , where
Split the samples of set into training and validation sets using K-fold cross-validation.
For each , build a GP-based model using the k-th training set, and merge the variables of with the j-th variable in the set . Test each j-th model on each one of the validation sets.
Compute the average out-of-sample (loss function) across the sample-sets for each j-th model.
Sort the models and select the variable from that corresponds to the model with the lowest .
- (b)
Remove the selected variable from and merge by column with the matrix , which contains the variables previously selected, to create .
- (c)
Increase the counter M of one unit.
End the algorithm when the loss function stops decreasing, and the matrix contains the most informative M input variables.
2.3.2. Shapley Coefficients
The application of Shapley coefficients in machine learning applications is derived from coalition games, where it is interested in assigning the player a payout by assessing its contribution to reaching the total amount [
31]. In this specific case, the aim is to evaluate the role of each feature in the prediction’s quality after constructing the set of best predictors
using the SFS algorithm. Thus, to understand the reasoning behind computing the Shapley coefficients, it is useful to start from the model built in Equation (
1) for the
n-th instance, which is simplified to a linear model for this explanation:
The notation in Equation (
14) indicates the product between the
n-th row of
and the column vector of
coefficients. Equation (
14) can be compactly written as Equation (
15):
In this case, the
j-th Shapley coefficient can be computed as follows in (
16):
which represents the difference between the effect of the
j-th feature and the average effect (calculated as the expectation value
E). The coefficient
explains how much the average error depends on the
j-th feature, and thus ranks the features.
The above concept can be generalized for nonlinear models, as performed, for example, in [
32]. If an output is dependent on
M features, the coefficient given can be defined as in Equation (
17):
where
S is a subset of the
M features extracted by the SFS from the set
and stored in the matrix
, with the
j-th feature excluded. The functions
and
are the expected values of the prediction, with all features and with the exclusion of the
j-the feature, respectively. The functions
are estimated based on the observed input distribution, as in Equation (
18). It can be interpreted as the prediction for the feature values in a specific set S while marginalizing over the features that are not included in set S.
This formulation, which is the one employed in this work, is called the random baseline Shapley [
32]. The feature with the highest Shapley coefficient is considered the most important, as it explains the highest proportion of the residual between the measurement and model prediction. Due to the high computational demands (a sum with
terms should be considered for each coefficient), it is suggested to compute the Shapley coefficients on a representative subset of the data. In this work, 10% of the data are selected. The computational cost is in the order of 24 h for each set of Shapley coefficients, using a laptop with 3.1 GHz Intel Core and
Matlab 2023. In this study, the coefficients are reported as average absolute values, as the mean prediction error over a set is expected to be zero, and using the absolute value helps emphasize the feature importance. These coefficients are computed considering the
M extracted features according to the SFS algorithm.
2.4. Application for Condition Monitoring
Once a workflow like the one described above has been set up, it can be employed in various ways for condition monitoring. The practical implementation depends on the specific test case and the characteristics of the data set. In general, a space–time approach has been discussed in [
16] as helping to monitor wind turbine fleets. This means that a data-driven model can be utilized to analyze the time evolution of a single system or to compare nearby wind turbines installed at the same site, or both. The former approach is more suitable for tracking the evolution of an incoming fault, while the latter is useful for highlighting different operating modes and/or systematic errors between wind turbines that are expected to be equal but may not be in practice.
In this study, the latter approach is advocated and implemented in blocks (B) and (C) of the workflow (
Figure 1). The detailed steps are the following:
The multivariate model, trained with the data from the reference wind turbine, is utilized to simulate the output for all the wind turbines in the farm based on the given input variables.
Error metrics (Equations (
12) and (
13)) are computed for all the wind turbines in the farm to identify outliers.
For the selected outlier wind turbines, the Shapley coefficients are computed using the workflow described in
Section 2.3.2.
The difference between these coefficients and the corresponding values for the reference wind turbine is used as a target for identifying the component suspected of anomaly.
The 3 rule is applied, where the average Shapley coefficients for the target wind turbine are compared to those of the reference wind turbine in terms of standard deviations (). Covariates deviating more than 3 are identified through this procedure.
It should be noted that unlike standard IEC-like power curve models, it is not expected that the error metrics would be of similar magnitude when simulating a different wind turbine from the one used for training. This is because the relationship between the operational parameters and the power output depends on the health status of the system, which can vary among different wind turbines, even within the same plant. However, an outlier in the space–time comparison should still be distinguishable, as supported by the discussion in
Section 3.
The proposed application for condition monitoring represents an innovation with respect to the state of the art, thanks to the distinctive features of explainability. Actually, the vast majority of the approaches to wind turbine condition monitoring through SCADA data analysis implicitly assume a form of supervision, which is the selection of the target to monitor; for example, the temperature measured at meaningful rotating components of the system. The method formulated in this work is completely unsupervised, thanks to explainability. Starting from a set of features, as vast as possible, the differences in the operational behavior between reference and target wind turbines are identified. This approach can benefit the wind energy community in individuating anomalies that are substantially hidden, and thus overlooked.
5. Discussion
The results collected in
Section 4 respond to two challenging issues related to wind turbine systems monitoring:
Selecting what features are most representative for describing a multivariate normal behavior of a system as complex as a wind turbine;
Highlighting anomalies in the system operation behavior, which would not be visible through state-of-the-art scrutiny, and ascribing it to certain subcomponents.
The answer that this work provides to the above two issues is related to explainability. Regarding the feature selection, the criterion adopted in this work is minimizing the loss function in modeling the power of the wind turbine as a function of multiple input variables. This is due to the fact that ultimately, the interest is in attaining maximum efficiency, which requires monitoring the produced power as accurately as possible. The results collected in
Section 4.1.2 and
Section 4.2.2 indicate that the formulated models are substantially more accurate than benchmark models in the literature, and considerably diminish the probability of predicting the expected power with a high error. Furthermore, the analysis contributes in general to the insight into the use of SCADA-collected measurements for wind turbine normal behavior modeling. The results of this work indicate that the wind turbine SCADA systems collect dozens of measurement channels, which should be included in the monitoring algorithm differently with respect to the state of the art in the literature.
Regarding the condition monitoring application, the contribution provided by this work is the possibility of assuming a lower level of supervision with respect to the state of the art, which is compensated by the features of the XAI algorithm. In other words, the state of the art is typically based on the a priori identification of the target measurement channels to monitor, for which a normal behavior model is constructed. The workflow structured in this work instead starts from a vast set of features, and through the comparison between reference and target wind turbines, identifies the features and the machine working regime, which are likely anomalous. The added value of the XAI algorithm consists, therefore, in highlighting anomalies that are not visible through state-of-the-art scrutiny. Indeed, for the test case of interest, these anomalies were not highlighted by standard methods, and the indications coming from this work were useful for planning the maintenance.
6. Conclusions
The present work focuses on the use of XAI techniques for developing multivariate wind turbine power curve models and applying them for condition monitoring in two real-world test cases with different blade pitch control technologies (hydraulic vs. electrical).
The main conclusion drawn from this study is that the use of multivariate models for monitoring wind turbine power is highly recommended. With an appropriately structured workflow, as demonstrated in this work, multivariate models offer significant advantages in terms of accuracy and explainability compared to the state of the art in the literature. The improved accuracy is achieved by using a broader set of covariates and selecting the most relevant ones through sequential feature selection. The achieved model, despite being a black box (e.g., GPR), remains explainable through the use of Shapley coefficients to assess the contribution of each selected covariate to the average error metrics.
Particularly, the experiments in this manuscript highlight the following: (i) The role of the blade pitch as an input variable for wind turbine power modeling should be critically re-evaluated. Variables related to hydraulic (e.g., actuator pressures) or electrical (e.g., motor currents) control of the blade pitch may be more suitable for accurate power modeling; (ii) The potential of condition monitoring applications based on multivariate data-driven models and space–time comparison between nearby wind turbines is promising. In the selected test cases, anomalies related to blade pitch control and tower vibrations were identified in three wind turbines. These anomalies were not highlighted by state-of-the-art methods, which (differently with respect to the method of this work) typically require an a priori identification of the target where the anomaly is expected to occur.
There are several potential future directions of this work. In addition to the space comparison pursued in this study, a deeper analysis could involve monitoring the temporal evolution of the Shapley coefficients for each wind turbine to detect changes related to incoming faults [
44,
45]. Furthermore, a multi-input–multioutput model based on similar principles as in this work could have wider applications in the field of wind turbine system condition monitoring, as demonstrated in recent studies [
46].