Next Article in Journal
A Cluster-Based Energy-Efficient Secure Optimal Path-Routing Protocol for Wireless Body-Area Sensor Networks
Previous Article in Journal
An IoT System Proposed for Higher Education: Approaches and Challenges in Economics, Computational Linguistics, and Engineering
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Integrated Edge Deployable Fault Diagnostic Algorithm for the Internet of Things (IoT): A Methane Sensing Application

by
S. Vishnu Kumar
1,
G. Aloy Anuja Mary
1 and
Miroslav Mahdal
2,*
1
Department of Electronics and Communication Engineering, Vel Tech Rangarajan Dr. Sagunthala R&D Institute of Science and Technology, Avadi 600062, India
2
Department of Control Systems and Instrumentation, Faculty of Mechanical Engineering, VSB-Technical University of Ostrava, 17. Listopadu 2172/15, 70800 Ostrava, Czech Republic
*
Author to whom correspondence should be addressed.
Sensors 2023, 23(14), 6266; https://doi.org/10.3390/s23146266
Submission received: 22 May 2023 / Revised: 30 June 2023 / Accepted: 5 July 2023 / Published: 10 July 2023
(This article belongs to the Section Internet of Things)

Abstract

:
The Internet of Things (IoT) is seen as the most viable solution for real-time monitoring applications. But the faults occurring at the perception layer are prone to misleading the data driven system and consume higher bandwidth and power. Thus, the goal of this effort is to provide an edge deployable sensor-fault detection and identification algorithm to reduce the detection, identification, and repair time, save network bandwidth and decrease the computational stress over the Cloud. Towards this, an integrated algorithm is formulated to detect fault at source and to identify the root cause element(s), based on Random Forest (RF) and Fault Tree Analysis (FTA). The RF classifier is employed to detect the fault, while the FTA is utilized to identify the source. A Methane (CH4) sensing application is used as a case-study to test the proposed system in practice. We used data from a healthy CH4 sensing node, which was injected with different forms of faults, such as sensor module faults, processor module faults and communication module faults, to assess the proposed model’s performance. The proposed integrated algorithm provides better algorithm-complexity, execution time and accuracy when compared to FTA or standalone classifiers such as RF, Support Vector Machine (SVM) or K-nearest Neighbor (KNN). Metrics such as Accuracy, True Positive Rate (TPR), Matthews Correlation Coefficient (MCC), False Negative Rate (FNR), Precision and F1-score are used to rank the proposed methodology. From the field experiment, RF produced 97.27% accuracy and outperformed both SVM and KNN. Also, the suggested integrated methodology’s experimental findings demonstrated a 27.73% reduced execution time with correct fault-source and less computational resource, compared to traditional FTA-detection methodology.

Graphical Abstract

1. Introduction

The IoT is a network of physical objects that are rapidly being adapted into environmental monitoring for collecting, sharing and using the data for automation purpose through the Internet [1,2,3]. Based on advancements in fields such as microelectronics, sensing, communication, and distributed systems, it is a rapidly expanding area that is changing how humans interact with the physical world [4]. IoT devices have the ability to be remotely controlled, monitored, and optimized, which can increase productivity, reduce costs, and enable a variety of applications, such as environmental monitoring, healthcare, supply chain, and inventory management [5]. The general four-layer architecture of an IoT system [6] is illustrated in Figure 1. The perception layer is responsible for collecting the parameters of interest from the environment where the Sensing Edge Devices (SEDs) are deployed. The gateway layer is responsible for sharing/collecting information among SEDs and communicating it to the application layer through a service layer. The application layer is where the collected data from the perception layer is processed into meaningful information to direct the data-driven system or applications [7].
Despite being adapted into multiple functional areas, such as healthcare, agriculture, waste treatment and industrial automation, the IoT still face issues related to identifying faulty data being collected and shared [8,9]. Also, due to its heterogenous nature, the security and privacy of the architecture is also not standardized [10]. From the four-layer architecture it is understandable that the data collected from the environment has a greater impact over the entire IoT architecture, as any corruption in the SED-data can result in overall system failure [11]. SED faults occur for a variety of reasons, such as wear and tear [12], calibration error, physical damage, hostile environment due to heat [13], vibration [14], network failure or intentional tampering [15]. A SED may sustain the damage but then the fault must be identified as soon as it originates to save the data-driven system. In addition, the faulty element(s) must be recognized to make timely repair to increase the SED’s availability time. The other major reasons attributed to find faults in an IoT environment are the considerable amount of bandwidth and power that can be saved [16], which are really important in a resource constraint application such as remote gas sensing.
The conventional knowledge-driven, fault-finding approach involves data collection and the application of expertise; however, this approach remains inadequate to diagnose new failures, which can be replaced by data-driven techniques [17]. Physical inspections of SEDs can be challenging as they are frequently deployed in hard-to-reach isolated locations. Thus, remote diagnosis is preferred for fault identification, but the methodology suffers a major bottleneck—faulty connectivity emphasizing edge detection. Consequently, failure detection is a crucial operation and a fundamental requirement of all remotely deployable SEDs to increase the reliability and availability of an IoT system [18].

1.1. Motivation and Contribution

The main objective of this research is to detect, fault at its source with better accuracy and narrow down the faulty element(s). Using fault-detecting algorithm to identify the occurrence has been widely reported, but identifying the responsible element(s) is more important to reduce the repair time and to increase the availability. Thus, a comprehensive fault diagnostic architecture by integrating the classification tree and deductive tree is proposed. Care has been taken to develop the diagnostic algorithm to consume limited resource to facilitate edge deployment and to compress the Cloud’s stress.

1.2. Related Work

Faults are generally defined as a malfunction or deviation from a normal behavior of a system. In an IoT environment, faults may occur in any of the four-layers and may propagate to the other layer(s). From a perception layer perspective, once fault occurs, it requires repair time, affecting the SED’s availability [19]. The reported the literature has a wide range of faults, such as bias, drift, gain and accuracy degradation, but faults that occur in an IoT-SED is much greater [20]. Hard and soft faults are the two categories under which the SED failure can be classified; a soft fault is defined as transmitting an incorrect value, while any physical damage to SED resulting in erroneous value is termed as a hard fault [21]. The authors [22] have recognized that SED faults can be caused by a variety of issues, including calibration errors, hardware failures, hostile environments, or link failures.
In the literature, the recent works proposed by the authors [23] showcases a framework that applies Machine Learning (ML) models such as Decision Tree (DT) and Gaussian naïve Bayes to detect fault. The authors have greatly contributed by designing the algorithm to be edge deployable, but the proposed methodology covers only three types of faults, namely: Spike, Struct, and Bias fault. In addition, there is no reference to identify the element(s) responsible for the failure. The work [24] proposes an edge deployable, parameter transplantation-based, fault detection approach, which the authors developed and deployed in an embedded system to track the status of a bearing in real-time. However, the process has several drawbacks, like the requirement to gather enough tagged data each time a new model is acquired. The authors themselves have emphasized the need for additional study to lessen the volume of data required to train the model and cut costs. A new Deep Learning (DL)-based study was proposed by the authors [25], where an edge-AI and a specially created middleware solution based on FIWARE was used. The proposed architecture was subjected to tests, in an industrial autonomous transfer vehicle as a case study. It is apparent that this system has large hardware requirements and, additionally, the configuration necessitates the Robotic Operating System (ROS) for proper operation. This is inappropriate for resource-constrained applications such as remote gas monitoring with drones. The research work [26] offers an Echo State Networks model that takes advantage of sensor data fusion that extracts the dynamics from a collection of heterogeneous sensor data to detect fault. The authors orchestrated the edge device using cloud resources, and have not explicitly mentioned the minimal hardware requirements for real-time implementation, which is fundamentally relevant in terms of quick resource requirement and power consumption for application-like, drone-based gas sensing. In addition, when the number of sensors increases, the processing power and memory requirements for sensor fusion also increases. This results in a reduced real-time performance and increased latency. Additionally, for sensor data fusion, maintaining sensor compatibility depends heavily on sensor calibration.
The work [27] proposed the use of edge computing for fault diagnosis, in which the authors verified that ML models, such as DT, RF, and SVM, can be implemented over a low-cost, resource-constrained, edge computational board—Arduino Mega, which provided the necessary motivation for our work. Soft computing methods such as RF, K-NN and SVM have been effectively applied in an automatic fault detection system during the past few decades [28,29]. For the purpose of fault detection, each of these techniques has been effectively applied with modification, However, further research is required in identify the problematic element to reduce repair time. In contest to deploying the soft computing algorithm, many successful mechanisms have been proposed by reputed research groups, which are generally categorized as centralized, distributed and hybrid approaches [21,30]. Most of the proposed techniques are based on statistical methods, anomaly detection or based on ML. In a centralized approach, the algorithms are deployed in a central-node, which keeps track of each individual SEDs by diagnosing the trans-received message through algorithms such KNN [31]. But the bandwidth requirement and the cost associated with this is quite expensive. The drawbacks of centralized fault detection can be eliminated through a distributed approach [32], where ML algorithms such as SVM are implemented in a cluster head to identify the faulty SED. But this method suffers from the need of higher dataset. On the other hand, RF is a collective learning technique that finds applications in both classifications and regression [33] applications. The authors [34] proposed a hybrid algorithm based on feed-forward Neural Networks, but it does not identify the faulty element(s). The authors [35] also proposed the usage of a K-NN algorithm for fault identification, but this method suffers to capture the small changes, making them not suitable for instantly changing CH4 concentration. FTA is a deductive-graphical model that can be used in the design and operations phase. In the design phase, FTA is utilized to debug the design process, and in the operations phase, it can be used to identify the fault with the root cause of failure. When FTA is used for fault analysis, the general operational structure of the system is transformed into a structured logic tree diagram made up of logic gates and events [36]. The events reflect problematic conditions, and the gates represent the logical connections between the events [37]. The top event, which represents the overall system failure, is defined as the first step in the deductive process and the potential underlying root-causes are worked out backward. The work [11] shows the use of FTA and Fuzzy Neural Network in identifying sensor fault. An aquaculture application was used to demonstrate the merits of the proposed algorithm, but the work focused only on sensing faults, namely, constant deviation, biased sensor value, numerical mutation and supply voltage fault. In addition, the authors left a research gap with respect to identifying the element responsible for the fault to occur. The study [38] proposed the use of FTA for analyzing unexpected occurrences of events on C-written-embedded control software for industrial appliances. By identifying the root causes of a major occurrence and implementing preventative action, the suggested strategy achieves a secure ECSW. In the work [39], the authors suggested an FTA-based method for assessing the autonomous underwater vehicle’s state of health in real time. Performance, reliability, and defect information were employed as measures to assess the suggested architecture. As per the authors, as the size and complexity of the system rises, the FTA becomes more computationally expensive. To solve the shortcomings of fault identification in complex systems, the author [40] developed a system-phenomenon fault tree (SPF). The authors of work [41] contrasted fuzzy FTA with the traditional approach and demonstrated that different algorithms coupled with FTA can improve end results.
Knowledge-based techniques frequently mimic human intelligence by employing “if-then” principles. IoT technologies like Expert Systems (ES) are frequently used in these procedures. Although ES have been used to diagnose faults in many different fields, they struggle with knowledge acquisition and rule explosion, where there is a tendency for the systems to become more and more complicated. In these cases, multiple technologies are coupled to address these issues [42]. This motivated the current work, where RF and FTA are combined to detect the faults and the source of faults in an IoT-based gas sensing application.
The remaining sections of the article are organized as follows; Section 2 introduces SED as a system, and details the workings, associated faults, and briefs about the proposed integrated algorithm, and the place where it is implemented in a gas sensing architecture. Section 3 accounts the inference made from the experiment and Section 4 discusses the future aspects and conclusion of the work.

2. Materials and Methodology

A CH4 gas sensing application is used as the case study to verify the proposed integrated approach in edge fault finding. The sub-sections are arranged in following order, Section 2.1 describes in detail about the SED under study, Section 2.2 details about the integrated methodology and Section 2.3 explains the implementation of the proposed methodology.

2.1. Methane Sensing Edge Device

A SED is an incorporation of components, such as a sensor, microcontroller, communication and power module, as illustrated in Figure 2. It is deployed to collect the contextual information about the environment; in our case, the concentration of CH4 [43,44]. The SED under consideration is developed in-house, using commercially available components such as MQ-4, NodeMCU, 50 mL syringe, a USB-A- to Micro-B-type cable and a 5 V power bank.

2.1.1. Sensing Module

The sensing module of an edge device is responsible for data acquisition. In the current study context, the MQ-4 sensor is used as a sensing module to sense the CH4 concentration. MQ-4 works by means of generating an equivalent electrical signal to the surrounding CH4 concentration, based on the reaction between the metal oxide semiconductor and the gas [45]. The electrical signal is then converted into parts per million values, programmatically. In the current study, the CH4 produced from coal-waste is measured between 200 ppm to 350 ppm and the same has been used as lower and upper threshold limits in the FTA model. If the SED is subjected to any physical damage, it could result in drawing more current due to short-circuiting or producing an inappropriate signal, which could also result in harming other peripherals connected with it, such as the microcontroller. Additionally, a harsh environment can also cause the sensing element to produce inaccurate signals or no signals, affecting the accuracy.

2.1.2. Processor Module

The processor in the SED’s perspective is responsible for the sensed data and communicating it to the Cloud. In the current study of CH4 sensing, a Tensilica L106 chip is used. A damaged microcontroller could result in a situation where it is not able to process the sensor data or communicate the processed data to the outer world. To identify such situations, “User defined Error Codes” are introduced; an error code “E0” is sent to the Cloud, indicating a software issue happened and watchdog timer-initiated processor-reset. Similarly, error code “E1” is sent to the Cloud when the supply voltage reads below the threshold value, and “E2” is sent when the battery bank temperature is high (this functionality is not in the scope of the current work, but will be useful when the SED is used in a drone environment to sense the CH4 dispersion). Also, the processor is programmed to send an error code “E3” when the current sensor reads a sudden spike in current consumption; a sign of short-circuiting in an SED. When SEDs are deployed in a harsh environment, they are prone to temperature fluctuations, which could affect the sensor readings; thus, the on-chip temperature is also monitored and shared with the Cloud, and in the current study, the lower and upper threshold of the on-chip temperature has been limited between −40 degree Celsius and 125 degrees Celsius; in accordance with ESP8266′s datasheet.

2.1.3. Communication Module

An individual communication module itself is a small processing device, and in the current study, an ESP8266 Wi-Fi chip is used. It is responsible for facilitating communications between the edge sensor and the cloud services. Any physical damage or fault in the software stack can result in a faulty communication. In the FTA, the Round-Trip Time (RTT), followed by a Link Failure event, becomes initiated when the base-station did not receive any payload value. In the current study, 12.32 ms is used as the RTT threshold value.

2.1.4. Faults in SED

It is clear that the processor and communication module have dependencies. Thus, the probability of SED working as a single system can be calculated as the product of the probabilities of each component working while considering the dependencies, as in Equation (1), where p(S) represents a working sensor, p(P) represents a working processor and p(C) represents a working communication module. Similarly, the probability of the SED failing as a system can be calculated as in Equation (2), where q(S) represents a faulty sensor, q(P) represents a faulty processor and q(C) represents a faulty communication module.
P(SyW) = p(S) ×p(P|S) × p(C|P)
P(SyF) = q(S) + [p(S) × q(P|S)] + [p(S) × p(P|S) × q(C|P)
Equations (1) and (2) resemble the probability of the system having only two possible outcomes: either the system is working, or it is not working, thus promoting for a binary classification model.

2.2. Integrated Fault Diagnosis Algorithm

The proposed edge deployable fault detection algorithm is a combination of FTA with a binary classifier. FTA is a widely used methodology for identifying potential causes of system failures, However, it requires expert knowledge to be able to construct appropriately. Also, it consumes higher computational resources, which always becomes increased when new events and logics are added [39]. On the other hand, RF is a powerful classification algorithm to identify faulty sensor data, but fails to identify individual element(s) responsible for fault occurrence [46]. Thus, RF is used to invoke the FTA function on a case-to-case basis. This arrangement reduces the need for higher computational requirements in each cycle. This integration is particularly useful in resource constrained complex systems, where it may be difficult to identify the root causes of failures using traditional FTA. The following section provides a short brief on the technicality of FTA and RF models.

2.2.1. Fault Tree Analysis

FTA is a graphical model with a tree-like structure that uses logic gates and events to represent problematic conditions and the connections between them. The system failure is represented by the top event, and the underlying root causes are determined backwards. FTA follows two approaches for assessing fault, namely: qualitative and quantitative analysis.
The quantitative approach analyzes the potential failure modes through factors such as Reliability R ( t ) , Unreliability F ( t ) , Availability A ( t ) , Unavailability Q(t) and Failure Rate λ ( t ) . If we assume ‘T’ to be the random variable representing the time “t” to failure, then R(t) can be expressed as in Equation (3).
R ( t ) = P ( T t )
Conversely, F(t) can be defined as the probability of failure, before time “t”, as in Equation (4). As the occurrence of fault in a system is random by nature, by means of the law of probability, R ¯ ( t ) can be calculated as equal to Cumulative Distribution Function, i.e., R ¯ ( t ) = F f c ( t ) = 1 R ( t ) .
F ( t ) = R ¯ ( t ) = P ( T < t )
In practice, it is more suitable to consider the probability associated with a small range of values, as the probability of “T” having an exact value of “t” will be significant. Hence, the probability density function f(t) is introduced, as shown in Equation (5),
f ( t ) = d F ( t ) d t = d R ( t ) d t
The probability of a component failing during a specific time period [x, x + ∆t], given that it is operating at time instant [t = x], is described as λ(t) and is closely associated with R(t), as in Equation (6),
λ ( t ) = f ( t ) R ( t )
By integrating Equation (6), the expression for the reliability of a system can be derived as in Equation (7),
R ( t ) = exp ( 0 t λ ( t ) d t )
The FTA of the SED under research considers repeated events and the relationship between individual elements. Thus, R(t) is to be calculated as in Equation (8) for a series system, where “R1”, “R2” and “R3” represents the reliability of the sensing, processing and communication modules of the SED.
R(t) = R1(t) × R2(t) × R3(t)
Qualitative analysis identifies the minimum component list that could cause a system to fail as a whole. The calculation of the minimal cut “Ci” set assumes that the components fail independently of one another, which is defined by Equation (9), where the individual events are represented as X1, X2, … Xn.
P ( C i ) = j 1 n P ( X j )
In an FTA, each component’s significance must be individually identified to reduce the repair time; thus, Birnbaum’s measure IB(i|t), is used to assess a component’s significance, as defined in Equation (10),
I B ( i | t ) = R ( t ) R i ( t )   for   i = 1 ,   2 ,   .   N
The FTA, as shown in Figure 3, is built to capture the CH4-SED’s typical operating environment in a programmatic approach. The top event denotes the overall system failure, and individual nodes comes next, where the base events represent each individual element’s failure.

2.2.2. RF Classifiers

Recent attempts in sensor fault diagnosis have demonstrated the growing usage of RF, due to its benefit in dealing with small sample sizes and complex data formats [46]. RF is an ensemble algorithm that combines multiple weak decision trees grown from a randomly selected subspace, as illustrated in Figure 4. If there are “n” sample-points in a dataset “D” with “m” features, then the RF algorithm involves creating a forest of decision trees; each tree “hk” is individually constructed by recursively splitting the training data into smaller random subsets “Fk”, based on input features “m”, until each subset contains only data points belonging to a single class. This is followed by the subspace randomization scheme, “Dk” from “D” of size “n”, and the bagging scheme to resample with replacements. Once all the trees have been constructed, the algorithm produces the output “y” for the new input “x” using all of the decision trees. The final prediction is decided based on the majority voting scheme [47]. The decision tree node splits are chosen to decrease impurity at each step, which is governed by the Gini Index; it calculates how unclean or unequal the samples are, which are assigned to a node, based on a split at its parent, as represented in Equation (11).
G = i = 1 C p ( i ) ( i p ( i ) )

2.3. Implementation and Working

The overall concept of the proposed algorithm is illustrated in Figure 5; when a sensing node is placed in an environment to collect the contextual information, they are prone to produce faults, due to physicality of logical fault. In an environment where the collected information is shared through the base-station or cluster head, an integrated algorithm can be utilized to scan the payload data before being sent to the Cloud [21]. This enables the identification of the fault at edge level, thus saving repair time, bandwidth consumption and energy, and also reducing stress on the Cloud [32]. The faults were induced manually as follows:
  • Sensing Module
    ∘ 
    The sensing device was exposed to various glass jars with different biogenic-process start dates to induce upper and lower threshold limit exceeding the faulted.
    ∘ 
    Data line of the sensor was disconnected temporarily to induce no payload fault.
  • Processor Module
    ∘ 
    A forever-while (1) loop was used to induce the watchdog timer reset, resulting in the “E0” error.
    ∘ 
    Programmatically, the threshold values were adjusted to be in lower values and the chip was exposed to a light source in order to induce an on-chip temperature error.
    ∘ 
    A Regulated Power Supply system was used to induce a supply voltage error; error code “E1”.
    ∘ 
    External load (resistor-bank) was added to consume more current to induce a short-circuit scenario; error code “E3”.
  • Communication Module
    ∘ 
    The ESP8266 was programmatically disconnected to induce link-failure; changing the SSID/password.
    ∘ 
    The ESP8266 was connected to different access points with varied physical distance to induce RTT time-out.
The working of the proposed algorithm is illustrated in Figure 6 and the pseudocode of the proposed algorithm is given in Algorithm 1.
The sensor data is received and stored temporarily in a database and then checked for availability of payload content (“Sensor Value”, “Error Code”, “On-Chip Temperature”). If the payload content or data packet itself is missing, FTA function is called to test the communication link through Link-Failure and RTT tests. On the contrary, when payload content is present, the RF function is called by passing the payload content to test the faultiness of the senor data. Based on the output of the RF function, the payload is either passed to the Cloud or to the FTA function to identify the source of fault. The FTA function works based on the way it was invoked; RF function or Payload Checking function. If the FTA function has been invoked due to non-availability of payload, and if the links are able to be established and RTT values are within the threshold value, then the processor or sensor is identified to be the faulty element; if not, the communication module is concluded to be faulty. If the FTA function has been invoked from the RF function, then Sensor fault and processor fault tests are carried out as follows; the received payload data is verified against the pre-defined threshold values to identify the element(s) responsible for the fault. The approach is detailed in Algorithm 1.
Algorithm 1: Integrated Fault Detection and Identification Approach
Function ProcessSensorData:
  Input: Excel file path
  data ← ReadExcelData(excelFilePath)//Read sensor data from Excel
  if data is present then
    RFFunction(data)//Call RF function with sensor data
  else
    FTAFunction()//Call FTA function
Function RFFunction(data):
  result ← Trained Random Forest model(data)//Check for faulty data
  if result is not faulty then
    SendToCloud(data)//Send data to cloud
  else
    FTAFunction()//Call FTA function
Function FTAFunction():
  if sensor data is present then
    SensorFault()//Call SensorFault function
    ProcessorFault()//Call ProcessorFault function
  else
    LinkFailure()//Call LinkFailure function
    RTTFunction()//Call RTTFunction
Function SensorFault():
  //Paraphrase the sensor value from the payload and check against threshold value
  …
Function ProcessorFault():
  //Paraphrase the processor value/Error codes from the payload and check against threshold value
  …
Function LinkFailure():
  //Base station tries to ping the particular node
  …
Function RTTFunction():
  //A RTT measurement is done and cross-checked against the threshold value
  …
Function SendToCloud(data):
  //The sensor value or the list of faulty equipment is sent to the Cloud through MQTT protocol.
  …

3. Results and Discussions

To assess the effectiveness of the proposed integrated algorithm, a real-time, CH4-sensing SED was utilized, as shown in Figure 7. From the institute’s Bio-plant, where biogenic CH4 is produced from coal waste washeries, the dataset was generated.

3.1. Datasets

The data were gathered in-house from a MQ4 sensor fixed to a biogenic methane production unit. The readings were noted for 10 min continuously without fault, followed by adjusting different units, such as the sensor, microcontroller and Wi-Fi module to induce faults. The fault induced data were collected for ten minutes each, respectively. In total, 1515 datapoints were collected under five sets of features (“Sensor Value (PPM)”, “Supply Voltage (V)”, “Overall Circuit Current (mA)”, “On-Chip Temperature (°C)”, “Error Code”), where overall-voltage and circuit current were used for error code deduction, and the target column was labeled as “working” and “faulty”. Table 1 shows the sample of the data collected; as the dataset was generated in-house, it was imbalanced—1010 faulty and 405 non-faulty samples. The full dataset was randomly split into 70% (1061 samples) for training and 30% (454 samples) for testing.
The RF and FTA models were simulated using Python-3 language in the Jupyter Notebook through the Anaconda environment on a Windows platform with an Intel (R) Core (TM) i7 CPU @3.40 GHz and 16 GB of RAM-memory. For programming ML models, the Scikit-learn library was used, and the parameters utilized to design the models are as follows: RF model—number of estimators = 50, max depth = 4, max features = ‘sqrt’; KNN model—number of neighbors = 5; SVM model—kernel = ‘poly’, C = 0.4, and random state = 20. The following section details the simulation and implementation outcomes.

3.2. Performance of RF Classifier

Four different measures were used to assess the performance of the proposed algorithm. Accuracy [35] was the first metric used, as defined in Equation (12). The comparative outcomes between the classifiers are shown in Figure 8.
A = TP + TN TP + TN + FP + FN %
The True Positive Rate (TPR), also known as sensitivity [48], was used as the second metric, which was calculated as defined in Equation (13), where “TP” represents True Positive and “FN” represents False Negatives. Figure 9 presents a comparison of the confusion matrix for RF, KNN, and SVM, and Figure 10 shows the TPR comparison between the classifiers.
TPR = TP TP + FN
The third metric used is MCC [49], which ranks fault diagnosis according to accuracy, as in Equation (14), where the False Positive (FP) was descripted as a number of wrongly labeled faulty data. The MCC ranges the algorithms between −1 to +1, and the algorithm that is nearer to +1 indicates a strong correlation between the test and reality. Figure 11 represents the comparison between all the classifiers used; on the basis of their False Negative Rate (FNR) and the performance variation of each algorithm being clearly projected by the bars.
Table 2 lists the order of all the classifiers. In comparison to all of the classifiers, RF is seen to outperform others. RF is seen to be closest to +1, thus rated as the best classifier, and KNN was rated as the second most compatible method.
MCC = ( TP TN ) ( FP FN ) ( TP + FP ) ( TP + FN ) ( TN + FP ) ( TN + FN )
The F1 score is the fourth metric, utilized to assess performance. On the basis of FN and FP, it serves as a statistical metric to evaluate the effectiveness of a specific classifier [50,51]. It is defined as the harmonic mean of recall and precision, as defined in Equation (15). The Precision value is described as the correctness of judgement and the values obtained in the current study are projected in Figure 12. In Figure 13, the F1-score of all the classifiers are compared, and it is inferred that RF has the highest value, showcasing its better performance.
F 1 score = 2   Precision   Recall Precision +   Recall

3.3. Performance of the FTA

The FTA was programmatically implemented as a function based on “if-then” rules. The conditions were implemented using logical operators. The performance of FTA was measured using execution time as the metric, whose results are noted in row three of Table 3. From the table, it is understandable and validates our claim; FTA’s execution time increases with the number of sensor data. Based on the iterations of the test carried out, the following observations have been noted. All three elements are identified to be a minimal cut set element. Also, the reliability of the sensor element was identified to be poor compared to the processor and communication module, but identified to be the significant component in SED.

3.4. Performance of the Integrated Algorithm

The total time required to identify the fault using the FTA model is represented as “Tfta”; the time required to identify the fault using a classifier model, such as RF, KNN and SVM, is represented as “Trf”, “Tknn”, and “Tsvm”, respectively. The percentage of reduced execution time is calculated based on Equation (16), where the time taken by the integrated model is calculated as the sum of time taken by a classifier model multiplied by the number of datapoints, plus time taken by the FTA model, multiplied by the number of known faults:
Reduced   Execution   Time = ( Time   taken   by   FTA   model ) ( Time   Taken   by   Integrated   Model ) Time   taken   by   FTA   model %
A set of 158 sample datapoints with 37 known faults were used to test the integrated algorithm. The Tfta was calculated to be 42.65 microseconds. On the other hand, Trf was calculated as 20.96 ms, Tknn was calculated as 25.30 ms and Tsvm was calculated to be 22.66 ms. Thus, for a set of 158 datapoints, the total time required by FTA was 6.73 ms, whereas the time required by RF-FTA analysis was 3.82 ms (RF (158) + FTA (37)). This is an 27.73% decrease in executional time requirement compared to a traditional FTA approach. Table 3 illustrates the different iterations performed with known datapoints and faults in them, where row four indicates the percentage of time saved on employing the integrated RF-FTA approach.
From the table, it can be inferred that the proposed system consumes less time to scan the datapoints for fault, in comparison to the static time value, in the case of a more traditional FTA approach. Also, its inferred that the SVM model follows a close execution time requirement next to RF, whereas KNN, although producing better results in terms of other performance metrics, only performed next to RF and SVM in terms of execution time. In addition, it is inferred that the number of faulty datapoints increases the overall execution time, irrespective of the classifier model that was used. Thus, the proposed edge fault detection and identification algorithm has advantages over traditional methods, such as:
  • Reduced latency: Edge fault detection algorithms can detect faults much faster than traditional methods, which can help to prevent costly downtime.
  • Improved accuracy: Machine learning algorithms can learn to identify patterns in data, which can lead to improved accuracy in fault detection.

4. Conclusions

SEDs are prone to produce erroneous values due to various factors, and even a modest undetected fault could result in system fail. These faults can affect the SED’s availability and impact the overall data-driven system. The traditional FTA is the better solution to detect and identify the fault, but consumes higher executional time, which is likely to increase with additional problematic events. Thus, an edge deployable fault detection and identification algorithm is proposed, as discussed in Section 2, by integrating RF and FTA. The RF classifier is employed to detect the fault, while the FTA is utilized to identify the source of the fault. We used data from a healthy CH4 sensing node, which was injected with different forms of faults, as discussed in Section 2, to assess the proposed model’s performance. The proposed integrated algorithm provides better algorithm-complexity and accuracy when compared to a standalone FTA or standalone classifiers, such as RF, SVM and KNN, as discussed in Section 3. The performance of the classifiers was assessed using accuracy, TPR, MCC, Precision, FNR and F1-score, as performance metrics. From the field-experiment, RF produced 97.27% accuracy and outperformed both SVM and KNN. Also, the suggested methodology’s experimental findings demonstrated to work well, which is evident from Table 3 and Figure 8, Figure 9, Figure 10, Figure 11, Figure 12 and Figure 13. A 27.73% reduced time-consumption was achieved for 158 sample datapoints with 37 known faults, with correct fault-source and less computational resource, in comparison with a traditional FTA approach. Regarding future research, different algorithms can be experimented to invoke FTA and further reduce the execution time or modify SVM to have a better trade-off compared to the current method used.
In the current study, only one node has been used to verify the integrated approach. Also, the cluster head has been orchestrated in a personal computer setup to replicate the edge environment. In the future, modifications to the overall algorithm to incorporate multiple nodes and implement the algorithm in real hardware, such as a Raspberry Pi, is required to further assess the prosperity of the proposed approach.

Author Contributions

S.V.K.—Conceptualization, methodology, formal analysis, investigation, writing—original draft, G.A.A.M.—Conceptualization, methodology, formal analysis, investigation, writing—original draft, M.M.—Conceptualization, methodology, resources, writing—review and editing. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the European Regional Development Fund in the Research Centre of Advanced Mechatronic Systems project, CZ.02.1.01/0.0/0.0/16_019/0000867 within the Operational Programme Research, Development, and Education and the project SP2023/074 Application of Machine and Process Control Advanced Methods supported by the Ministry of Education, Youth and Sports, Czech Republic.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are available through email upon request to the corresponding author.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Chen, P.; Liu, H.; Xin, R.; Carval, T.; Zhao, J.; Xia, Y.; Zhao, Z. Effectively Detecting Operational Anomalies In Large-Scale IoT Data Infrastructures By Using A GAN-Based Predictive Model. Comput. J. 2022, 65, 2909–2925. [Google Scholar] [CrossRef]
  2. Pamula, A.S.P.; Ravilla, A.; Madiraju, S.V.H. Applications of the Internet of Things (IoT) in Real-Time Monitoring of Contaminants in the Air, Water, and Soil. Eng. Proc. 2022, 27, 26. [Google Scholar] [CrossRef]
  3. Cheng, B.; Zhu, D.; Zhao, S.; Chen, J. Situation-Aware IoT Service Coordination Using the Event-Driven SOA Paradigm. IEEE Trans. Netw. Serv. Manag. 2016, 13, 349–361. [Google Scholar] [CrossRef]
  4. Min, H.; Fang, Y.; Wu, X.; Lei, X.; Chen, S.; Teixeira, R.; Zhu, B.; Zhao, X.; Xu, Z. A fault diagnosis framework for autonomous vehicles with sensor self-diagnosis. Expert Syst. Appl. 2023, 224, 120002. [Google Scholar] [CrossRef]
  5. Ma, X.; Dong, Z.; Quan, W.; Dong, Y.; Tan, Y. Real-time assessment of asphalt pavement moduli and traffic loads using monitoring data from Built-in Sensors: Optimal sensor placement and identification algorithm. Mech. Syst. Signal Process. 2023, 187, 109930. [Google Scholar] [CrossRef]
  6. Burhan, M.; Rehman, R.A.; Khan, B.; Kim, B.S. IoT Elements, Layered Architectures and Security Issues: A Comprehensive Survey. Sensors 2018, 18, 2796. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  7. Lombardi, M.; Pascale, F.; Santaniello, D. Internet of Things: A General Overview between Architectures, Protocols and Applications. Information 2021, 12, 87. [Google Scholar] [CrossRef]
  8. Ma, K.; Li, Z.; Liu, P.; Yang, J.; Geng, Y.; Yang, B.; Guan, X. Reliability-Constrained Throughput Optimization of Industrial Wireless Sensor Networks With Energy Harvesting Relay. IEEE Internet Things J. 2021, 8, 13343–13354. [Google Scholar] [CrossRef]
  9. Xu, S.; Huang, W.; Huang, D.; Chen, H.; Chai, Y.; Ma, M.; Zheng, W.X. A Reduced-Order Observer-Based Method for Simultaneous Diagnosis of Open-Switch and Current Sensor Faults of a Grid-Tied NPC Inverter. IEEE Trans. Power Electron. 2023, 38, 9019–9032. [Google Scholar] [CrossRef]
  10. Ayoub, I.; Balakrichenan, S.; Khawam, K.; Ampeau, B. DNS for IoT: A Survey. Sensors 2023, 23, 4473. [Google Scholar] [CrossRef]
  11. Chen, Y.; Zhen, Z.; Yu, H.; Xu, J. Application of Fault Tree Analysis and Fuzzy Neural Networks to Fault Diagnosis in the Internet of Things (IoT) for Aquaculture. Sensors 2017, 17, 153. [Google Scholar] [CrossRef] [Green Version]
  12. Salunke, R.; Nobahar, M.; Alzeghoul, O.E.; Khan, S.; La Cour, I.; Amini, F. Near-Surface Soil Moisture Characterization in Mississippi’s Highway Slopes Using Machine Learning Methods and UAV-Captured Infrared and Optical Images. Remote Sens. 2023, 15, 1888. [Google Scholar] [CrossRef]
  13. Gu, Y.; Zheng, G. Dynamic Evolution Characteristics of the Gear Meshing Lubrication for Vehicle Transmission System. Processes 2023, 11, 561. [Google Scholar] [CrossRef]
  14. Li, L.; Tan, Y.; Xu, W.; Ni, Y.; Yang, J.; Tan, D. Fluid-induced transport dynamics and vibration patterns of multiphase vortex in the critical transition states. Int. J. Mech. Sci. 2023, 252, 108376. [Google Scholar] [CrossRef]
  15. Lu, M.-C.; Huang, Q.-X.; Chiu, M.-Y.; Tsai, Y.-C.; Sun, H.-M. PSPS: A Step toward Tamper Resistance against Physical Computer Intrusion. Sensors 2022, 22, 1882. [Google Scholar] [CrossRef] [PubMed]
  16. Gaddam, A.; Wilkin, T.; Angelova, M.; Gaddam, J. Detecting Sensor Faults, Anomalies and Outliers in the Internet of Things: A Survey on the Challenges and Solutions. Electronics 2020, 9, 511. [Google Scholar] [CrossRef] [Green Version]
  17. Markulik, S.; Šolc, M.; Petrík, J.; Balážiková, M.; Blaško, P.; Kliment, J.; Bezák, M. Application of FTA Analysis for Calculation of the Probability of the Failure of the Pressure Leaching Process. Appl. Sci. 2021, 11, 6731. [Google Scholar] [CrossRef]
  18. Antonini, M.; Pincheira, M.; Vecchio, M.; Antonelli, F. An Adaptable and Unsupervised TinyML Anomaly Detection System for Extreme Industrial Environments. Sensors 2013, 23, 2344. [Google Scholar] [CrossRef]
  19. Rojek, I.; Jasiulewicz-Kaczmarek, M.; Piechowski, M.; Mikołajewski, D. An Artificial Intelligence Approach for Improving Maintenance to Supervise Machine Failures and Support Their Repair. Appl. Sci. 2023, 13, 4971. [Google Scholar] [CrossRef]
  20. Liu, Y.; Ma, X.; Qiao, W.; Luo, H.; He, P. Human Factor Risk Modeling for Shipyard Operation by Mapping Fuzzy Fault Tree into Bayesian Network. Int. J. Environ. Res. Public Health 2021, 19, 297. [Google Scholar] [CrossRef]
  21. Adday, G.H.; Subramaniam, S.K.; Zukarnain, Z.A.; Samian, N. Fault Tolerance Structures in Wireless Sensor Networks (WSNs): Survey, Classification, and Future Directions. Sensors 2022, 22, 6041. [Google Scholar] [CrossRef]
  22. Soltanali, H.; Khojastehpour, M.; Farinha, J.T.; Pais, J.E.d.A.e. An Integrated Fuzzy Fault Tree Model with Bayesian Network-Based Maintenance Optimization of Complex Equipment in Automotive Manufacturing. Energies 2021, 14, 7758. [Google Scholar] [CrossRef]
  23. Santo, Y.; Immich, R.; Dalmazo, B.L.; Riker, A. Fault Detection on the Edge and Adaptive Communication for State of Alert in Industrial Internet of Things. Sensors 2023, 23, 3544. [Google Scholar] [CrossRef] [PubMed]
  24. Ding, X.; Wang, H.; Cao, Z.; Liu, X.; Liu, Y.; Huang, Z. An Edge Intelligent Method for Bearing Fault Diagnosis Based on a Parameter Transplantation Convolutional Neural Network. Electronics 2023, 12, 1816. [Google Scholar] [CrossRef]
  25. Gültekin, Ö.; Cinar, E.; Özkan, K.; Yazıcı, A. Real-Time Fault Detection and Condition Monitoring for Industrial Autonomous Transfer Vehicles Utilizing Edge Artificial Intelligence. Sensors 2022, 22, 3208. [Google Scholar] [CrossRef]
  26. Bruneo, D.; De Vita, F. Detecting Faults at the Edge via Sensor Data Fusion Echo State Networks. Sensors 2022, 22, 2858. [Google Scholar] [CrossRef]
  27. Morenas, J.d.L.; Moya-Fernández, F.; López-Gómez, J.A. The Edge Application of Machine Learning Techniques for Fault Diagnosis in Electrical Machines. Sensors 2023, 23, 2649. [Google Scholar] [CrossRef]
  28. Cao, B.; Zhao, J.; Gu, Y.; Fan, S.; Yang, P. Security-Aware Industrial Wireless Sensor Network Deployment Optimization. IEEE Trans. Ind. Inform. 2019, 16, 5309–5316. [Google Scholar] [CrossRef]
  29. Liu, X.; He, J.; Liu, M.; Yin, Z.; Yin, L.; Zheng, W. A Scenario-Generic Neural Machine Translation Data Augmentation Method. Electronics 2023, 12, 2320. [Google Scholar] [CrossRef]
  30. Raposo, D.; Rodrigues, A.; Silva, J.S.; Boavida, F. A Taxonomy of Faults for Wireless Sensor Networks. J. Netw. Syst. Manag. 2017, 25, 591–611. [Google Scholar] [CrossRef]
  31. Welcer, M.; Szczepański, C.; Krawczyk, M. The Impact of Sensor Errors on Flight Stability. Aerospace 2022, 9, 169. [Google Scholar] [CrossRef]
  32. Meng, Y.; Wu, X.; Oladejo, J.; Dong, X.; Zhang, Z.; Deng, J.; Yan, Y.; Zhao, H.; Lester, E.; Wu, T.; et al. Application of Machine Learning in Industrial Boilers: Fault Detection, Diagnosis, and Prognosis. ChemBioEng Rev. 2021, 8, 535–544. [Google Scholar] [CrossRef]
  33. Zhang, Z.; Mehmood, A.; Shu, L.; Huo, Z.; Zhang, Y.; Mukherjee, M. A Survey on Fault Diagnosis in Wireless Sensor Networks. IEEE Access 2018, 6, 11349–11364. [Google Scholar] [CrossRef]
  34. Lau, B.C.; Ma, E.W.; Chow, T.W. Probabilistic fault detector for Wireless Sensor Network. Expert Syst. Appl. 2014, 41, 3703–3711. [Google Scholar] [CrossRef]
  35. Zidi, S.; Moulahi, T.; Alaya, B. Fault Detection in Wireless Sensor Networks Through SVM Classifier. IEEE Sens. J. 2017, 18, 340–347. [Google Scholar] [CrossRef]
  36. Javaid, A.; Javaid, N.; Wadud, Z.; Saba, T.; Sheta, O.E.; Saleem, M.Q.; Alzahrani, M.E. Machine Learning Algorithms and Fault Detection for Improved Belief Function Based Decision Fusion in Wireless Sensor Networks. Sensors 2019, 19, 1334. [Google Scholar] [CrossRef] [Green Version]
  37. Haque, S.A.; Rahman, M.; Aziz, S.M. Sensor Anomaly Detection in Wireless Sensor Networks for Healthcare. Sensors 2015, 15, 8764–8786. [Google Scholar] [CrossRef] [Green Version]
  38. Takahashi, M.; Anang, Y.; Watanabe, Y. A Proposal of Fault Tree Analysis for Embedded Control Software. Information 2020, 11, 402. [Google Scholar] [CrossRef]
  39. Byun, S.; Papaelias, M.; Márquez, F.P.G.; Lee, D. Fault-Tree-Analysis-Based Health Monitoring for Autonomous Underwater Vehicle. J. Mar. Sci. Eng. 2022, 10, 1855. [Google Scholar] [CrossRef]
  40. Chen, G. Fault Diagnosis Method Based on System-phenomenon-fault Tree. Chin. J. Mech. Eng. 2011, 24, 466–473. [Google Scholar] [CrossRef]
  41. Mentes, A.; Helvacioglu, I.H. An application of fuzzy fault tree analysis for spread mooring systems. Ocean Eng. 2011, 38, 285–294. [Google Scholar] [CrossRef]
  42. Wang, Y.; Li, Q.; Chang, M.; Chen, H.; Zang, G. Research on Fault Diagnosis Expert System Based on the Neural Network and the Fault Tree Technology. Procedia Eng. 2012, 31, 1206–1210. [Google Scholar] [CrossRef] [Green Version]
  43. Zhao, M.; Tian, Z.; Chow, T.W.S. Fault diagnosis on wireless sensor network using the neighborhood kernel density estimation. Neural Comput. Appl. 2018, 31, 4019–4030. [Google Scholar] [CrossRef]
  44. Swain, R.R.; Khilar, P.M.; Dash, T. Multifault diagnosis in WSN using a hybrid metaheuristic trained neural network. Digit. Commun. Netw. 2020, 6, 86–100. [Google Scholar] [CrossRef]
  45. Aldhafeeri, T.; Tran, M.-K.; Vrolyk, R.; Pope, M.; Fowler, M. A Review of Methane Gas Detection Sensors: Recent Developments and Future Perspectives. Inventions 2020, 5, 28. [Google Scholar] [CrossRef]
  46. Shanmugasundar, G.; Vanitha, M.; Čep, R.; Kumar, V.; Kalita, K.; Ramachandran, M. A Comparative Study of Linear, Random Forest and AdaBoost Regressions for Modeling Non-Traditional Machining. Processes 2021, 9, 2015. [Google Scholar] [CrossRef]
  47. Liu, Y.; Wang, Y.; Zhang, J. New machine learning algorithm: Random forest. In Information Computing and Applications; Lecture Notes in Computer Science; Springer: Berlin/Heidelberg, Germany, 2012; Volume 7473, pp. 246–252. [Google Scholar] [CrossRef]
  48. Shaik, K.; Ramesh, J.V.N.; Mahdal, M.; Rahman, M.Z.U.; Khasim, S.; Kalita, K. Big Data Analytics Framework Using Squirrel Search Optimized Gradient Boosted Decision Tree for Heart Disease Diagnosis. Appl. Sci. 2023, 13, 5236. [Google Scholar] [CrossRef]
  49. Ganesh, N.; Shankar, R.; Čep, R.; Chakraborty, S.; Kalita, K. Efficient Feature Selection Using Weighted Superposition Attraction Optimization Algorithm. Appl. Sci. 2023, 13, 3223. [Google Scholar] [CrossRef]
  50. Li, R.; Wu, X.; Tian, H.; Yu, N.; Wang, C. Hybrid Memetic Pretrained Factor Analysis-Based Deep Belief Networks for Transient Electromagnetic Inversion. IEEE Trans. Geosci. Remote Sens. 2022, 60, 1–20. [Google Scholar] [CrossRef]
  51. Li, J.; Deng, Y.; Sun, W.; Li, W.; Li, R.; Li, Q.; Liu, Z. Resource Orchestration of Cloud-Edge–based Smart Grid Fault Detection. ACM Trans. Sens. Netw. 2022, 18, 1–26. [Google Scholar] [CrossRef]
Figure 1. Key Building Blocks of an IoT Architecture.
Figure 1. Key Building Blocks of an IoT Architecture.
Sensors 23 06266 g001
Figure 2. General Architecture of a Sensing Edge Device.
Figure 2. General Architecture of a Sensing Edge Device.
Sensors 23 06266 g002
Figure 3. CH4-SED Fault Tree Architecture.
Figure 3. CH4-SED Fault Tree Architecture.
Sensors 23 06266 g003
Figure 4. RF Architecture.
Figure 4. RF Architecture.
Sensors 23 06266 g004
Figure 5. Illustration of the overall concept proposed.
Figure 5. Illustration of the overall concept proposed.
Sensors 23 06266 g005
Figure 6. Block Diagram of the proposed integrated system.
Figure 6. Block Diagram of the proposed integrated system.
Sensors 23 06266 g006
Figure 7. Sensing CH4 in a controlled environment.
Figure 7. Sensing CH4 in a controlled environment.
Sensors 23 06266 g007
Figure 8. Accuracy comparison between RF, KNN and SVM.
Figure 8. Accuracy comparison between RF, KNN and SVM.
Sensors 23 06266 g008
Figure 9. Confusion Matrix of (a) RF-Training, (b) KNN-Training, (c) SVM-Training, (d) RF-Testing, (e) KNN-Testing and (f) SVM-Testing.
Figure 9. Confusion Matrix of (a) RF-Training, (b) KNN-Training, (c) SVM-Training, (d) RF-Testing, (e) KNN-Testing and (f) SVM-Testing.
Sensors 23 06266 g009
Figure 10. True Positive Rate comparison between RF, KNN and SVM.
Figure 10. True Positive Rate comparison between RF, KNN and SVM.
Sensors 23 06266 g010
Figure 11. False Negative Rate comparison between RF, KNN and SVM.
Figure 11. False Negative Rate comparison between RF, KNN and SVM.
Sensors 23 06266 g011
Figure 12. Precision—Comparison between RF, KNN and SVM.
Figure 12. Precision—Comparison between RF, KNN and SVM.
Sensors 23 06266 g012
Figure 13. F1-score comparison between RF, KNN and SVM.
Figure 13. F1-score comparison between RF, KNN and SVM.
Sensors 23 06266 g013
Table 1. Structure of data collected.
Table 1. Structure of data collected.
Sensor Value (PPM)Battery Voltage (V)Overall Circuit Current (mA)On-Chip Temperature (°C)User Defined Error CodeLabel
3253.219937.9 Working
245320134.3E0Faulty
3123.120038.3 Working
2122.919833.1E1Faulty
3413.320239.8 Working
2222.819833.5E3Faulty
265320036.3 Working
3053.220338 Working
Table 2. Comparison of various performance measures of RF, SVM & KNN.
Table 2. Comparison of various performance measures of RF, SVM & KNN.
MeasureTrainingTesting
RFKNNSVMRFKNNSVM
Sensitivity 0.95930.87150.6880.97280.85820.625
Specificity10.8315-10.8943-
Precision10.9389110.9491
Negative Predictive Value0.90630.685200.95240.73330
False Positive Rate00.1685-00.1057-
False Discovery Rate00.0611000.0510
False Negative Rate0.04070.12850.3120.02720.14180.375
Accuracy0.97080.86150.6880.98240.86910.625
F1 Score0.97920.9040.81520.98620.90130.7692
Matthews Correlation Coefficient0.93240.6624-0.96250.7166-
Table 3. Time saved on employing the integrated approach- Different iterations.
Table 3. Time saved on employing the integrated approach- Different iterations.
DescriptionIteration-1Iteration-2Iteration-3Iteration-4Iteration-5Iteration-6
No. of Sample Datapoints5063728095158
No. of known fault10111081237
Time (ms)—FTA2.132.693.073.414.056.74
Time (ms)—RF-FTA1.471.791.942.022.504.89
Time (ms)—KNN-FTA1.692.062.252.372.925.58
Time (ms)—SVM-FTA1.561.902.062.152.665.16
Percentage reduction execution time between FTA vs. RF-FTA30.86%33.40%36.97%40.86%38.22%27.73%
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Kumar, S.V.; Mary, G.A.A.; Mahdal, M. Integrated Edge Deployable Fault Diagnostic Algorithm for the Internet of Things (IoT): A Methane Sensing Application. Sensors 2023, 23, 6266. https://doi.org/10.3390/s23146266

AMA Style

Kumar SV, Mary GAA, Mahdal M. Integrated Edge Deployable Fault Diagnostic Algorithm for the Internet of Things (IoT): A Methane Sensing Application. Sensors. 2023; 23(14):6266. https://doi.org/10.3390/s23146266

Chicago/Turabian Style

Kumar, S. Vishnu, G. Aloy Anuja Mary, and Miroslav Mahdal. 2023. "Integrated Edge Deployable Fault Diagnostic Algorithm for the Internet of Things (IoT): A Methane Sensing Application" Sensors 23, no. 14: 6266. https://doi.org/10.3390/s23146266

APA Style

Kumar, S. V., Mary, G. A. A., & Mahdal, M. (2023). Integrated Edge Deployable Fault Diagnostic Algorithm for the Internet of Things (IoT): A Methane Sensing Application. Sensors, 23(14), 6266. https://doi.org/10.3390/s23146266

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop