1. Introduction
Nowadays, automated systems play a crucial role in industrial production [
1]. They not only significantly improve production efficiency but also effectively reduce production costs, providing strong support for corporate sustainable development [
2]. However, as the scale and complexity of automation systems continue to expand, equipment defect issues become more prominent [
3]. If these issues are not properly managed, they can severely affect the stable operation and production efficiency of automated systems. The main characteristics of defects in automated equipment can be summarized as follows: high failure rates, diverse defect types, and complex maintenance and repair procedures.
Due to the above characteristics of defects in automated equipment, scientific and efficient defect management is vital for power grid regulation [
4]. In general, defect management for power grid automation equipment is a comprehensive system. Currently, the defect management methods adopted mainly focus on a closed-loop approach [
5], which can be summarized into four stages: “defect detecting, defect recording, defect rectifying, and operational verification”. Among these, the “defect recording” stage provides data support for subsequent defect rectification and is the most critical part of the entire automated equipment defect management system.
The common method for recording defects in automated equipment is manual handling by personnel. However, this manual management method has several issues. First of all, manual updating of defect reports can result in delays, making it difficult to update data in a timely manner. Aside from this, the efficiency of manually recording defect data is low, affecting the ability to quickly respond to and address defects. Moerover, manually entered information is often inconsistent, and data omissions or losses may occur.
Due to these shortcomings, numerous studies have been conducted in the power grid field to automate the defect recording process. Qing [
6] proposed a quick response (QR) code-based defect management system, capitalizing on its large data capacity, high security, and robust anti-counterfeiting capabilities to enhance the safety, stability, and economic efficiency of equipment operations. Gao et al. [
7] developed a big data analysis algorithm to create an automatic identification model for suspected familial defects in automated equipment, incorporating physical “ID” tracking to monitor the distribution of the same equipment, thereby improving defect identification accuracy. Zhang et al. [
8] employed modern information platforms to enable equipment status evaluation and batch defect diagnosis through multidimensional analysis and multi-platform interaction, thus improving the quality and efficiency of defect analysis. Huang et al. [
9] introduced an intelligent factory framework which integrates deep learning algorithms with IoT technology into defect detection systems, offering a more comprehensive solution for defect detection.
Knowledge graphs (KGs), as a structured knowledge representation method, organize complex information in the form of graphs, enabling computers to store, retrieve, and infer knowledge more effectively [
10]. With the rapid development of artificial intelligence technologies, the application of KGs in the power industry has become a research hotspot [
11]. The power system is a highly complex and dynamically changing system, involving a vast amount of equipment, operational procedures, safety standards, and real-time data. Traditional data processing methods face many challenges in handling this information, whereas knowledge graph technology, with its advantages in information organization, retrieval, and reasoning, provides a new solution to these problems. Meng et al. [
12] developed a novel method to recognize power equipment entities based on bidirectional encoder representation from transformers (BERT), facilitating the construction of a power equipment fault knowledge graph. Tang et al. [
13] introduced a method for constructing a power equipment KG by combining heterogeneous and multi-source data, thus enhancing the dispatching efficiency of power equipment. Cui [
14] integrated KGs with the Internet of Things (IoT) to enable real-time monitoring of a power grid and its surrounding environment.
By applying KGs to the recording of automated equipment defects, the graph structure of KGs can be effectively utilized to analyze and process data, improving the accuracy of defect diagnosis and addressing the shortcomings of manual recording. However, existing electric power KGs primarily focus on the construction of KGs themselves, lacking further processing and refinement of the KGs.
To address these issues, we built upon the construction of automated equipment defect KGs by incorporating knowledge graph completion to enhance the practical usability of the constructed KG. To improve the effectiveness of the constructed knowledge graph, we applied AI-based methods. AI technologies provide several enhancements throughout the knowledge graph construction process. First, AI can increase the efficiency and accuracy of information extraction, accelerating the construction process. Secondly, by leveraging deep learning and graph inference techniques, AI can automatically identify the root causes of equipment defects, predict potential failure modes, and generate new knowledge relationships, further enhancing the intelligence of a knowledge graph. Furthermore, AI can utilize real-time equipment data and maintenance records to autonomously update and refine the knowledge graph, ensuring that it continuously reflects the equipment’s current operational status and potential risks and thereby addressing the timeliness issues associated with static knowledge graphs.
Based on the above discussion, the main contributions of this paper can be summarized as follows:
A framework of the construction of automated equipment defect KGs os designed and implemented, effectively solving the problem of storing defect knowledge for automated equipment.
In the construction process, the RoBERTa-BiLSTM model is used for named entity recognition (NER), and the ALBERT-BiGRU model is used for relation extraction (RE), with both achieving performance improvements in their respective tasks.
We completed the constructed KG using the KBGAT algorithm [
15] and stored and visualized it in the Neo4j graph database, resulting in a complete equipment automated defect KG.
4. Knowledge Processing
4.1. Knowledge Graph Completion
Knowledge processing refers to the process of collecting, organizing, analyzing, transforming, and applying knowledge, aimed at enhancing its value and usability. Knowledge graph completion (KGC) is a technique in knowledge processing which fills in missing triples in a knowledge graph, thereby making it more complete. In constructing knowledge graphs for automated equipment defects, many rules are predefined manually, which can lead to gaps and omissions. Therefore, incorporating a knowledge completion step during the graph construction process is highly significant.
There are various methods for knowledge graph completion, including rule-based methods, tensor decomposition-based methods, and translation model-based methods (such as the Trans series). We adopted deep learning-based methods, planning to experiment with a series of deep learning models to identify the most effective one through comparison.
4.2. Model Parameter Settings
The experimental environment in this section is consistent with that described in
Section 3, with some of the training hyperparameters provided in
Table 8.
We used HITS@3, HITS@10, and MRR as evaluation metrics for the KGC task. HITS@n represents the average proportion of triples ranked at or below
n in link prediction, where higher values are preferable. The mean reciprocal rank (MRR) measures the average reciprocal rank of correctly predicted triples in the overall prediction results. A higher MRR indicates better model performance. MRR can be expressed as follows:
The parameter denotes the number of triples, while refers to the predicted ranking of the ith triplet link.
4.3. Experiments and Results
To evaluate the effectiveness of KGC models for the equipment defect knowledge graph discussed in this paper, several mainstream deep learning models were compared, yielding the experimental results displayed in
Table 9.
The comparative analysis revealed that the HAKE and KBGAT models each excelled in different performance metrics. After careful consideration, the and metrics were deemed more influential for the KGC task, leading to the selection of the KBGAT model for completing the equipment defect knowledge graph.
The KBGAT model leverages graph neural networks to effectively model entities and relationships while utilizing an attention mechanism to emphasize crucial information within the graph. By adjusting the weights based on the importance of different relationships, the attention mechanism enhances the model’s completion performance, particularly in complex knowledge graphs. Additionally, the use of graph neural networks allows KBGAT to efficiently propagate information throughout the knowledge graph, capturing intricate dependencies between entities and relationships. In KGC tasks, KBGAT can predict missing triples, fill in gaps, and uncover new relationships hidden within a graph.
6. Discussion
The results of our study offer valuable insights into the construction and application of automated equipment defect knowledge graphs for power grid regulation. In this discussion, we examine the implications of our findings, their alignment with existing research, and the broader impact of our work.
The effectiveness of our knowledge graph construction framework, which integrates a top-down schema design with a bottom-up data layer approach, provides a novel solution for managing complex and unstructured defect data. This hybrid methodology bridges the gap between theoretical knowledge representation and practical data management needs. Furthermore, the application scenarios we explored, including intelligent fault diagnosis and industrial equipment maintenance, underscore the practical value of our knowledge graph in improving operational efficiency and safety in power grid systems.
We also acknowledge the limitations of our study. While the dataset used was comprehensive for our purposes, it may not encompass the full diversity of defect scenarios across all types of automated equipment. Future work could focus on expanding the dataset to include a broader range of equipment and defect types. Additionally, while our knowledge graph completion using KBGAT yielded promising results, further optimization and testing with different algorithms could improve the graph’s accuracy and completeness.