1. Introduction
SCIMs are used to power most industrial appliances because of their robust nature and ability to generate sufficient torque to effectively drive much larger machinery at an affordable cost through the process of electromagnetic induction. Injection sea water pumps, air conditioner compressor drives, gas circulators in power generating firms, and oil exporting pumps in the oil and gas drilling industries are only a few of the well-known applications of SCIMs [
1]. SCIMs are often prone to failures and breakdowns as a result of faults and prolonged operation, and, if left unmonitored, often suffer major damage or breakdowns. According to a review done by Bhowmik et al. [
2], severe operating environments, insufficient insulation, purposely overloading the power supply, and factory defects are the most typical causes of breakdowns and failure. The production down-times caused by these flaws have frequently resulted in revenue loss, among other pitfalls. Therefore, early fault detection is important/crucial to avoid these occurrences [
1]. The consequences of these failures have increased the need for SCIM failure diagnosis as a crucial module for overall equipment prognostics and health management (PHM).
State-of-the-art research studies on SCIM FDI and prognostics feature data-driven PHM technologies, with research ongoing; these studies show that data-driven AI-based PHM technologies rely heavily on the quantity and quality of data to train AI-based predictive modeling [
1,
3]. However, the accuracy of these models is also dependent on how suitable the method befits the nature of the available data, which has led to various studies exploring numerous data-driven PHM methodologies. Researchers have used advances in artificial intelligence (AI), machine learning (ML), and deep learning to construct models that exploit current signals, vibration signals, and thermal signals generated by equipment via sensors, examining these signals separately or combining them for FDI [
1,
4,
5]. Fourier analysis has proven to be effective among the different ways of analysis owing to its convenience and nature of application, especially when it comes to current signature analysis and/or vibration signature analysis (VSA) [
5]. Fourier transforms (FTs) are essentially concerned with the decomposition of signals from their time domain to their frequency domain for analysis in both healthy and faulty motors, providing a superior platform for signal interpretation and feature extraction for FDI [
3,
6]. Even though it can decompose signals to their frequency domains, a FT still has limitations, such as its lack of transient information and its nature of providing only the average time of the spectrum content, thereby lacking in providing details on variations in frequency with regard to time of the signals [
7]. Fast Fourier transforms with high computation speed and short-time Fourier transforms that decompose data into the time–frequency domain are frequently used to solve these challenges [
7,
8]. Further, according to [
9], Fourier analysis is one of the highly efficient analytical tools that is compatible with MCSA for a variety of fault detection for SCIMs.
Although variable frequency drives (VFDs) have recently become more popular in industries than direct online starters due to their ability to provide flexible production control and soft motor start-up, variable frequencies, complex control systems, and harmonics generated at the drive output are still some of the major concerns they are associated with [
10]. Harmonics generated by VFDs pose a significant problem for motor bearings and stator windings since they raise their level of stress. Moreover, they have an impact on signal quality in terms of noise ratio, particularly when using stator current signals for FDI [
5,
10].
2. Motivation and Literature Review
As dependency on SCIMs by industries is on the increase due to their robust nature, failures of these machines cannot be accommodated; therefore, the need to find solutions to these failures or to predict possible failure time cannot be overemphasized. Globally, close to 90% of industrial equipment relies on SCIMs as their prime mover [
11]. According to Choudhary et al. [
1], faults in SCIMs are categorized based on the location of occurrence, i.e., internal or external, and these faults are then grouped based on the nature and/or origin of the fault, i.e., mechanical or electrical faults; the severity of each fault type depends on its location. For instance, an external electrical fault is less threatening than an internal electrical issue, but if left unmonitored, it can escalate to an internal electrical fault, which can lead to a total breakdown.
Bearing and stator faults account for more than 70% of general failures in SCIMs [
12]. According to an investigative report, bearing failure, unsurprisingly, has the highest percentage of occurrence in an SCIM (40–45% contribution) [
1], which can be traced to the nature of SCIMs, whose bearings are susceptible to damage when overloaded, misaligned, and/or unbalanced. Moreover, the bearings are subjected to continuous loading at all times when the SCIM is in operation. Stator failures are common as well, owing to large current flows in their winding coils and insulation weakness caused by mechanical and electrical stress and/or deterioration of insulation. The most common defect in a stator is an inter-turn fault, which happens when two turns in a phase become linked as a result of failed insulation and, if left unchecked, can lead to phase damage or more serious stator failures, resulting in substantial maintenance costs [
13]. Although not as common as stator and bearing problems, rotor faults are one of the most commonly occurring faults, accounting for over 8% of all SCIM faults [
1], and can lead to poor performance and/or breakdown of the motor if left unmonitored.
For adequate FDI, signals generated from these machines are employed for condition-based monitoring, which has proven to be useful in past and present research studies [
5,
14,
15]. Based on the nature of SCIMs, which generate vibration and thermal responses while in operation, vibration signals have been one of the most-used measurements thus far due to their efficiency in both time and frequency domains [
16]. On the flip side, due to the simplicity, low cost, and non-intrusive nature of obtaining current signals, MCSA has become very popular in recent times. This can be linked to the unique current signatures from the IM’s supply [
5,
15]. Due to the effectiveness/efficiency of this approach, MCSA has been effectively employed in both stator and rotor monitoring [
1,
11]. Furthermore, MCSA has also proven to be effective and compatible with Fourier analysis for discriminant feature extraction for critical FDI [
6,
8,
12,
14,
17].
In the literature, various studies have been presented for fault diagnosis frameworks and classifications that exploit MCSA and spectral features extracted based on Fourier series transformations of signals from their time to frequency domains. For instance, in [
15], the authors used both MCSA and FFT under different conditions. Two faults were considered in the study to evaluate their proposed algorithm using stationary and non-stationary signals. Studies presented in [
5,
14] employed MCSA for SCIM fault detection in broken rotor bars. In their methodologies, spectral features obtained from applying FFT analysis to MCSA were applied to independent component analysis (ICA) for improved performance; the extracted features were labeled FFT-ICA and were proven to contain a wealth of information for FDI with good outcomes when used for online fault detection. In [
18], the authors capitalized on the effectiveness of current FDI monitoring. Their proposed methodology used an FFT algorithm to interpret current signals for reliable anomaly detection in the IM. The authors also used advanced signal processing techniques in their proposed method for critical FDI at the bearing and rotor bar of SCIMs to improve the interpretation of current signals. Duc Nguyen et al. [
6], trained features extracted from the FFT spectrum of raw current signals using a machine learning algorithm. According to their research, their methodology presents a low-cost, accurate, and robust FDI instrument for SCIMs that uses only current signals and is also applicable to real-world data. In [
17], the authors used a novel methodology that combined two techniques, wavelet and power-spectral-density (PSD), to analyze the FFT spectrum of MCSA. The technique’s effectiveness was been demonstrated in diagnosing short turns and broken rotor bars in non-constant-load-torque SCIM applications, just as it does in constant-load-torque motor applications. In [
8], Yoo proposed a fault detection algorithm for SCIMs using FFT and PCA. He employed FFT to analyze induction motor current in the frequency domain for fault-characteristic spectral components and used PCA for easy extraction of features from the available components.
In reality, SCIMs and induction motors are often operated under varying loads. It is not recommended to meet/exceed the motor’s maximum loading specification; however, production demands may sometimes require increased loading. These situations often induce stress and/or faults in motors, which may gradually evolve into failures [
19]. On the flip side, SCIMs also experience faults that are humanly undetectable under low-load conditions [
20]. At low-load conditions, SCIMs often do not generate noise and/or observable fault symptoms, which makes operation riskier since abrupt failures may occur, leading to interrupted production and/or accidents. This rationale makes it necessary to develop intelligent monitoring for early fault detection in SCIMs at low-load conditions. Bessam et al. [
20] exploited the Hilbert transform (HT) and a neural network for broken rotor bar intelligent diagnosis in induction machines at low load. Beyond the efficiencies and limitations of their stand-alone HT-based diagnostic technique, other signal processing techniques could be integrated for comprehensive fault diagnostic efficiencies. For instance, Das et al. used an extended Park’s vector approach in conjunction with FFT, DWT, and PSD to process and extract features from current signals for distinguishing induction motor inter-turn stator winding faults from unstable supply voltage conditions [
21]. The coefficient of the major peak observed in FFT was used in their methodology to indicate fault severity and load level in the IM. Hussain et al. [
22] proposed a method for implementing and analyzing current signatures from a three-phase SCIM using a combination of three signal processing techniques: FFT, short-time Fourier transform, and continuous wavelet transform. Their method demonstrated that MCSA can detect changes in frequency components by obtaining the FFT-based spectrum that contains the initial information about the fault. In [
23], the authors demonstrated the non-intrusive nature and simplicity of MCSA. They presented a methodology for detecting faults in an SCIM’s stator winding using external flux sensors on a three-phase SCIM. The sensors were placed on the outside of the motor’s body in the X, Y, and Z directions so as not to interfere with the motor’s operation. FFT analysis of the stator currents revealed a short circuit fault in the SCIM stator winding.
For all these successful case studies, one major take-home is the high efficiency of FTs for MCSA and its robustness for isolating fault frequencies on frequency bands in varying magnitudes. This provides a reliable avenue for harnessing the information provided by frequency-magnitude coordinates as representative features for discriminate modeling for FDI. Peak detection offers a cost-effective diagnostic feature-extraction alternative to spectral frequency-domain extraction and was employed in our study. Its diagnostic efficiency has been recorded in [
19,
24]. According to Jena and Panigrahi [
19], peak detection was one of the techniques employed for fault localization in an automatic gear and bearing using vibration and acoustic signals. In their study, peak detection was one of their proposed filtering techniques, which aimed to unify the approach for both acoustic and vibration signals for enabling fault detection. Further, in [
24], using peak detection as one of their techniques, the authors proposed a methodology for distinguishing bearing fault signals from masking signals emitted by drive-train elements. Peaks in the frequency spectrum were used as a discriminating technique for fault classification and separation in their proposed model. In our quest to develop an FT-MCSA-based diagnostic framework for SCIMs, this study makes the following contributions:
A proposal for a three-phase MCSA-based peak detection approach for diagnostic feature extraction. The proposed feature extraction method extracts the coordinates of the highest peaks from the FFT, PSD, and autocorrelation function (ACF) spectra as features.
An extensive comparison of ML-based diagnostic models to provide a generalization paradigm for SCIM diagnosis.
A computational cost assessment of ML-based diagnostic models is presented, and empirical assessments is conducted for improved diagnostic assessment of the models.
The rest of the paper is structured thus:
Section 3 presents the theoretical background of the key modules of our proposed study, while
Section 4 presents an overview of the proposed MCSA-based diagnostic method.
Section 5 presents the experimental study on a physical case study, while
Section 6 and
Section 7 conclude the study.
4. Proposed System Model
The proposed MCSA-based diagnostic framework for three-phase induction motors fundamentally features a Fourier-based peak detection module for discriminative feature extraction, interpreted by ML-based classifiers for diagnosis.
Figure 1 shows the proposed diagnostic framework.
As shown, the model receives current signals from the three phases of the induction motor, which are simultaneously processed via FFT, PSD, and ACF for peak-based feature extraction. The Fourier transform (and its variants) is named after Joseph Fourier (21 March 1768–16 May 1830), and it serves as the foundation for most frequency-domain signal processing techniques. As previously stated, FFT and PSD provide a spectral representation of the constituent periodic components in the current signatures and can be exploited for accurate condition monitoring. In addition, ACF provides the degree of similarity between a discrete signal and its delayed copy as a function of the delay between them.
Feature extraction from FFT, PSD, and ACF exploits peak coordinates from FTT and PSD spectra and from ACF such that for the first l tallest peaks in each of the three functions, their coordinates are concatenated to form the feature set. These labeled features are then received as input by the ML-based classifiers for discriminative modeling, validation, testing, and performance evaluation using standard classification performance evaluation metrics. The subsections below summarize the core modules of the proposed diagnostic framework.
4.1. Fourier-Based Peak Detection for Feature Extraction
Digital signal processing (DSP) has been a reliable condition monitoring paradigm in a variety of applications for decades. Particularly for induction motors, current signatures are often stationary with different periodic components that are affected by changing operating conditions. Most signals are composed of complex synthesis of sine and cosine functions under relaxable assumptions, which provides a reliable avenue for FFT to flourish.
Different parameters can be extracted from the spectra to make the necessary discriminative inferences for diagnosis. Often, there is a change in magnitude of the spectral components as the operating conditions of the induction motor change, and this presents an opportunity to exploit spectral peaks and their coordinate frequency values as representative features. Given a time-recorded signal (one-dimensional digitized current signal)
, the Fourier transform of
is traditionally denoted
and is computed using Equation (
1):
where
is the input signal,
k is the length of the transform, and
is the corresponding frequency-domain output of the signal.
PSD exaggerates the impact of high-energy components while reducing it for lower-energy components by computing the energy densities of the constituent frequencies. Mathematically, PSD generates a spectrum by squaring the magnitude of the FFT outputs from Equation (
1), and is obtained using Equation (
2):
where
is the PSD-domain output from the FFT of the signal.
High autocorrelation (a maximum of 1) implies high similarity between the signal and its delayed component, while the reverse is the case if autocorrelation is close to zero. ACF can be computed via a convolution theorem using Equation (
3):
where * means complex conjugation and
is the inverse FFT.
Feature extraction from FFT, PSD, and ACF exploits peak coordinates such that for the first
l tallest peaks in each of these three functions, their coordinates are concatenated to form the feature set:
4.2. Discriminative Performance Evaluation Metrics
Because every ML model is unique to its architecture, it becomes necessary to exhaustively explore each model’s prowess for diagnostics while also considering other factors such as model complexity, computational costs, parametirization, etc. This presents the need to employ standardized diagnostic/discriminative performance evaluation metrics. These metrics include accuracy, sensitivity, precision, F1-score, and false alarm rate (FAR) [
40]. These criteria are defined, respectively, as (
4)–(
8).
where
and
, respectively, are the number of correctly classified classes, number of incorrectly classified classes, number of incorrectly labeled samples belonging to a class that were correctly classified, and the number of incorrectly labeled samples belonging to a class that were incorrectly classified.
Although these metrics provide a global perspective for evaluating model classification performance, it may become necessary to evaluate each model’s class-specific performance to ensure a more comprehensive performance assessment. For instance, a classifier may return an overall classification accuracy of 90% over a five-class problem set. This high accuracy may emanate from the model’s strengths for correctly classifying three out of the five classes, whereas it may flaw on the remaining two classes. On the contrary, another model may return the same level of accuracy but its class-specific classification performance may be fairly uniform for each class; hence it would be more reliable than the previous model. This is usually the case, and presents the need for the confusion matrix, which provides an avenue for evaluating each model’s class-specific diagnostic performance.
5. Experimental Study
This study proposes an MCSA-based diagnostic framework, which was employed on a physical testbed at the Defense Reliability Laboratory, Kumoh National Institute of Technology, Korea. The testbed consists of different four-pole, 0.25 hp, three-phase squirrel cage (delta connection) induction motors operating at different operating conditions, as summarized in
Table 1.
Three-phase induction motors are often exposed to different failure modes that emanate from sources ranging from environmental, thermal, electrical, and other factors. However, some failure modes are more critical than others, and they are prioritized in this study. Misalignment in a motor drive system is severe and the most frequently occurring condition in motor driven systems and may present itself in the form of angular, parallel/offset and/or a combination of parallel and angular misalignment [
42]. In reality, it may be highly impossible to experience a single type of misalignment in absolute absence of the other. Often, even an acceptably aligned rotor has some level of a combination of angular and parallel misalignment (though insignificant). On the other hand, another frequently occurring (and critical) failure in induction motors is a broken rotor bar, which often occurs due to mechanical stresses emanating from variable operating conditions [
43]. In addition, inter-turn short circuit winding is yet another critical and highly severe failure mode that often results in complete motor breakdown if undetected. This failure occurs as a result of aging and thermal stress to the insulator separating some turns in a particular phase of the motor [
1,
13,
44]. Consequently, our study prioritizes these three critical failure modes, which were replicated on the testbed shown in
Figure 2.
The induction motors were loaded via a DC loading mechanism—a DC power supply was connected to the driven motor to induce a magnetic field between its rotor and stator, which causes resistance in the driver, resulting in a low-load condition. The motors were operated at a constant speed of approximately 1780 RPM (30 Hz), while data were collected via the driver’s terminals using a NI 9246 module connected to a cDAQ-9178 connected to a desktop computer, as shown in
Figure 2 above. The digital current signals were collected via a LabVIEW interface and stored in .csv format at a sampling rate of 50 Hz; the spectral resolution of the signals was 0.0003 Hz.
Figure 3 shows the manually induced fault conditions to replicate rotor misalignment (ARM-1), a broken rotor bar (BRB-2), and inter-turn short circuit winding (ISC-3) failure modes for the experiment.
ARM-1 was achieved by first aligning the motors (driver and driven) using a precision laser alignment kit, and then to misaligned them by 0.3 mm for both parallel and angular misalignment. BRB-2 was imitated by drilling two holes of diameter 5 mm to a depth of 5 mm. ISC-3 was imitated by bridging seven (7) coils in the same phase. For control, a motor with no fault/failure mode (NOM-4) was also employed.
5.1. Signal Processing for Feature Extraction
Current data were collected from the three-phases of the motors at the different operating conditions and cleaned.
Figure 4 shows a visualization of the current signals collected from the induction motors at the different operating conditions, with red, blue, and green representing phases 1, 2, and 3, respectively.
As shown, the signals reveal some similarity in waveforms across the different operating conditions apart from the phase 2 signal (in blue) of
Figure 4c, whose maximum amplitude is about 2 amperes while the rest have a magnitude of 1 ampere. Next, the signals were processed for feature extraction using the method proposed in
Section 4.
Figure 5 shows the respective FFT, PSD, and ACF visualizations of the signals under the different operating conditions.
As shown in
Figure 5a,b, the FFT and PSD for each of the different operating conditions uniquely reflect different spectral bands of different magnitudes and frequency ranges, whereby PSD is more sensitive for the 1SC-3 condition for phase 2 signals. In addition, the ACF results in
Figure 5c reveals differing ACF amplitudes over the time delay for each of the phases. These were concatenated to form the feature set for discriminative modeling for diagnosis. To develop the feature set, the proposed peak detection algorithm extracted the ten (10) tallest peaks from the FFT, PSD, and ACF spectra, respectively, from each of the current signals collected from the different operating conditions.
5.2. Feature Evaluation
Ideally, the efficiency of traditional ML-based classifiers for diagnosis relies on the discriminative power of the input features. Interestingly, Spearman’s correlation provides a reliable avenue for evaluating the discrimnance amongst the features extracted from each of the operating conditions. This tool measures the linear dependence between two continuous variables and returns a value in the range of –1 (negative correlation) and +1 (positive correlation). Apart from serving as an easy-to-use feature selection tool, it fundamentally provides a hint of the level of dsiscriminance between/amongst features, whereby a high positive and/or negative correlation implies poor discriminance in the features and vice versa. Sometimes, it is also desirable to visually assess the features for discrimnance. This is often supported by dimensionality reduction tools such as principal component analysis (PCA), locally linear embedding (LLE), independent component analysis (ICA), etc. These algorithms fundamentally reduce the dimensions of a feature set and have been employed for numerous purposes, including feature selection, feature reduction, health index construction, etc., and are unique in their individual architectures. For ease-of-use and familiarity in the domain, LLE was employed for reducing the features to a three-dimensional vector for visualizing their discriminative potentials. Accordingly,
Figure 6 shows the feature assessment results.
Figure 6a shows the Spearman’s correlation heatmap of the peak features extracted from motor current signals. Overall, the feature set, which formed a 180-dimensional feature matrix, is mostly uncorrelated, with correlation values normally distributed with a mean of zero, as shown in
Figure 6b. This hints that the features are not correlated and are, hence, very useful for discriminative modeling. Further assessment of the features using the LLE-based feature visualization tool shows in
Figure 6c that the dissimilarity between respective feature clusters per operating condition. They are reasonably isolated in space, as shown in the dark blue, light blue, yellow, and red circles corresponding to the ARM-1, BRB-2, ISC-3, and NOM-4 conditions, respectively.
5.3. ML-Based Diagnosis
Empirically, the feature extraction and evaluation processes described in the previous subsections provide the platform upon which traditional ML-based classifiers are deployed for fault classification. Practically, each ML-based classifier is unique in its architecture and learning principle, and this often poses a concern when choosing the most appropriate model for practical use. In addition, their respective cons and computational costs also pose valid concerns during decision making. Consequently, we explore (as much as possible) many popular traditional ML-based classifiers in our study. The classifiers summarized in
Table 2 were employed for training (using the training feature set) and testing (using the test feature set).
For optimal efficiency, each algorithm has its own set of parameters and architecture, as shown in
Table 2. Exhaustive parameter tuning optimized parameters for each algorithm, which are recorded in
Table 2.
Figure 7 illustrates the accuracy, precision, recall, and F1-scores resulting from ten-fold cross-validation of the algorithms on the test data.
In general terms, accuracy measures a model’s ability to predict classes correctly and is represented in
Figure 7 in green circles. F-1 (represented by yellow squares with blue dotted lines) is calculated by taking the average of precision and recall, which determines the proportion of true predictions the model makes among all actual predictions. Precision determines the percentage of classes that are true and is represented by black triangles, while recall determines the percentage of predicted cases that are actually true and is represented by red stars. As observed, RF is the highest performing classifier, with a test accuracy of
. This is followed by NBC, QDA, and GBC, whose test accuracies are
,
, and
, respectively. On the downside, GPC and Gaussian SVM were the worst performing, with test accuracies of
and
, respectively, while the rest of the classifiers ranged between
and
.
It can be observed that the training scores of all the classifiers (except ABC) on the training data are almost
; however, their test performances are not as optimal as anticipated. This hints at the superiority of some classifiers over others. Interestingly, RF remains one of the most reliable ML-based classifiers and has been shown to be the most accurate in the proposed case study. From a different perspective, assessing the computational cost of the classifiers provides a further avenue for assessing their suitability for practical use, especially in cost-sensitive situations where computational power is a concern.
Table 3 summarizes the computational costs (in seconds) of the classifiers for training and testing.
Table 3 reveals that the most accurate classifier, RF, takes approximately 3660 milliseconds (3.66 s), whereas the second most accurate took much less than 1 s for the same process, yet returned a reliable test accuracy. However, its efficiency is limited to the underlying assumption that the input data distribution is Gaussian—which is not often the case for real-life applications. On the high side, GBC revealed itself as a greedy algorithm, as shown by its high computational cost of 31,210 milliseconds (31.2 s); yet it ranks third in the test comparative assessment. This is followed by GPC (whose cost is 5030 milliseconds), which is costlier than RF but still the second least accurate on the test data. Based on the comparison, a choice of classifier can be made according to the metric being assessed. Typically, computational speed is highly considered in most real-world scenarios, but not at the expense of predictive efficiency. Considering such circumstances, NBC may be preferred, since it offers both low computational cost and significantly high test accuracy. However, RF is an appropriate choice when there is abundant computing power available or when accuracy is critical.
Digging deeper into the algorithms, we assessed the class-specific predictive performance of the classifiers using the traditional confusion matrix, which reveals the probability of correct predictions of a classifier for each of the classes of interest, i.e., operating conditions.
Figure 8 illustrates the confusion matrix resulting from ten-fold cross-validation of the algorithms based on the test data.
Close examination of
Figure 8 reveals that GPC and Gaussian SVM returned the highest false positives (FP) per class (except NOM-4), as shown in
Figure 8e,k. However, class-specific efficacy in prediction for RF (see
Figure 8f) is observed in its high true positive values (across the diagonal). Unfortunately, the false-alarm rate for ISC-3 is observed in the
false negative to BRB-2. This implies that the model mostly recognizes data belonging to ISC-3 as belonging to BRB-2, and this reveals itself as a major limitation of our study.
6. Discussion, Open Issues, and Future Work
Despite the robustness and low cost of maintenance for extended use of SCIMs [
1], the increasing industrial reliance on these power machines, with the critical need to assess for efficiency, reliability, and safety, as emphasized by this study’s goals, cannot be overstated. This study focuses on presenting an ML model that can provide a high level of fault classification performance for various types of faults that are common in SCIMs; nevertheless, we consider the following to be some of the study’s inadequacies: Although the findings revealed the cost efficiency and accuracy of the proposed peak-based feature extraction and ML-based diagnosis, the need for improved diagnostic performance is still critical considering that against the limitations of traditional ML-based algorithms (accompanied with hand-crafted feature extraction processes), the superior feature extraction and classification efficiencies of deep learning models have been shown in multiple studies. These deep learning models—convolutional neural networks, recurrent neural networks, etc.—are popular for automated feature learning; however, their mystical defiance from the statistical principles from which they were fundamentally designed has been a major issue for their adoption. In addition, other issues such as trustworthiness, computational cost, increased complexity, over-fitting/under-fitting, high stochasticity in learning, extensive parameterization/optimization, etc., contribute significantly to hesitation towards generalization. On the other hand, although the traditional, hand-crafted feature extraction process for ML-based diagnosis offers comparatively poorer diagnostic efficiencies in relation to deep learning models, it does offer a transparent architecture for ensuring explainability, empirical investigations, and trustworthiness.
Under realistic situations, the occurrence of the SCIM failure modes presented in this study are not mutually exclusive and may occur in little or intense degrees. This presents the issue of accurately identifying the fault type (or combination of fault types) in place. For example, turn-to-turn short circuit may be minimal (just two turns), intense (several turns in the same phase), or may become very intense when it grows to a phase-to-phase short circuit. Although this presents a broad opportunity for more extensive research, it becomes an endless uphill task to replicate all the possible failure modes, their individual degrees of severity, the possible failure combinations, and their respective degrees of combined severity. Nonetheless, our study offers a reliable feature extraction approach that is expected to direct continued research in the domain. On a different note, the frequency-domain approach for SCIM fault diagnosis has some limitations, which are mostly inherited from spectral leakage and lack of transient information. The models often perform poorly in finite-time window situations, require high-frequency resolution for adequate performance, and exhibit unstable variations in side-band frequencies in varying load situations. In addition, they are often flawed at detecting certain faults at no load conditions, especially the broken rotor bar [
21,
45]. On the other hand, deep learning and time-frequency domain signal processing techniques offer better discrimninative feature extraction efficiencies; however, they are often associated with high computational costs. Because current signals are often stationary and do not often exhibit any transient changes, we believe the proposed frequency-domain approach is more beneficial, considering that the efficiencies of its counterparts—time-domain, time-frequency-domain approaches, and deep learning methods—are insignificant, computationally expensive, and highly unexplainable, respectively.
At minimal load conditions (as presented in our study), the peak detection technique proposed herein offers reliable discrimininative efficiencies. Nonetheless, the high false negatives by the best-performing classifiers for ISC-3 poses a strong concern and is currently a major motivation for our continued research. Notwithstanding, our comparative study herein (for the ML classifiers) provides a valid yardstick for assessing the efficiencies of our future work. From a broader perspective, beyond the efficiencies of standalone sensing/monitoring methods, combination of multiple sensing techniques has been reported in several studies [
45]. Such combinations may exploit vibration, temperature, acoustic emissions, and so on in a unified framework for comprehensive monitoring and/or diagnostic MCSA [
29,
30]. Although these sensor fusion techniques offer valid rationale, achieving a standardized approach for their use remains open for continued investigations. However, as part of our continued studies, we intend to explore deeper and more comprehensive approaches.
7. Conclusions
Squirrel cage induction motors are among the most popular industrial electrical motors due to their high motive power generation, durability, and low maintenance costs. The need for condition monitoring presents the opportunity for CS-based fault diagnosis; however, selecting the appropriate signals processing technique(s) for ML-based diagnostics remains open for continued research.
This paper presented a peak detection approach for discriminative feature extraction, which concatenates FFT, PSD, and ACF peak coordinates from the current signals sourced from a three-phase SCIM. These features are received by various ML-based classifiers, whose classification results are also presented in the study. An extensive comparison of ML-based diagnostic models provides a generalization paradigm for SCIM diagnosis. Results show that RF is the most accurate, with an accuracy of 79.25%, followed by NBC and QDA, with accuracies of 73% and 71.4%, respectively. Furthermore, computational cost assessment of the ML-based diagnostic models is conducted for improved diagnostic assessment of the models. Results show that RF’s computational cost of 3.66 s is in an acceptable range, while NBC has the lowest at 0.003 seconds.
The confusion matrix of the best-performing models revealed that the turn-to-turn fault was imprecisely predicted, providing an avenue for future research. Amongst the paper’s limitations, it is believed that the developed easy, nonintrusive, low-cost FDI framework offers a reliable direction for motivating future work.