Anomaly Detection for Asynchronous Multivariate Time Series of Nuclear Power Plants Using a Temporal-Spatial Transformer
Abstract
:1. Introduction
2. Related Works
3. Problem Setup
3.1. Reconstruction-Based Method
3.2. Issues of Asynchronous Correlations
- (1)
- Differences in the locations of sensors. Sensors of various types are placed in different locations to measure corresponding physical quantities, leading to differences in response times when changes occur. For example, during the process of power ramp-up, the temperature of the coolant within the reactor core rises rapidly. Temperature sensors located in the Reactor Coolant Pump (RCP) system promptly detect this change. However, sensors situated in the auxiliary cooling system or somewhere at further distances might only register the corresponding temperature variations after a delay of several minutes or even hours. Thus, although these variables are strongly correlated due to their underlying mechanisms, there is a significant time lag in their responses.
- (2)
- Differences in response rates among variables. For instance, the current and vibration signals of the main pump are transient variables, with changes occurring almost instantaneously during pump shutdown, whereas temperature is a gradual variable that changes slowly. This leads to the time-lagged correlations between variables with different response rates.
- (3)
- Differences in noise levels. Different levels of noise also exert a certain influence on the correlation among variables. In nuclear power data, the noise levels associated with various variables can vary significantly. For example, signals such as vibration and flow rate often exhibit higher noise levels and greater fluctuations, while signals like temperature are almost devoid of noise. These variations in noise levels can impact the analysis of correlations between variables.
4. Proposed Approach
4.1. Anomaly Detection Strategy
4.2. Temporal-Spatial Transformer Model
- (1)
- Channel-Independent Patch-Wise Embedding. As previously mentioned, the model proposed in this article employs channel-independent patch-wise embedding instead of the traditional embedding representation method utilized in the original transformer model. By this means of embedding, the input MTS segments are embedded into a vector through a linear projection and then added with position encoding. This embedding method offers several advantages. Primarily, compared to point-wise embedding, patch-wise embedding can capture richer and more stable local association information. Furthermore, channel-independent embedding representations align better with the use of temporal attention mechanisms in the initial phase.
- (2)
- Two-Stage Temporal-Spatial Attention Mechanism. In the vanilla transformer, the self-attention mechanism is only used to capture temporal dynamics over time, and the spatial correlations are not fully exploited. The two-stage temporal-spatial attention mechanism involves first calculating the temporal attention scores for each channel’s data, followed by using the temporally associated channel data as input for the second-stage spatial attention mechanism to establish cross-channel spatial correlations. This approach facilitates a more comprehensive capture of deep temporal-spatial associations, including the aforementioned asynchronous associations. With the help of a two-stage temporal-spatial attention mechanism, we can model the correlations among observations of different channels and time steps within the MTS data, which is key to addressing the issue of asynchronous time-delay correlations.
- (3)
- Multi-scale feature fusion strategy. As is widely known, NPPs’ operational data encompass a diverse array of variables, such as current, temperature, flow rate, pressure, etc. Due to differences in the data characteristics of these variables, it is challenging to determine a fixed window size that can be universally applicable to all variables. Considering this, we have adopted a multi-scale feature fusion approach in our research. By employing sliding windows of varying sizes, our model is capable of extracting features at different scales. These features are then integrated to enhance the model’s performance. The use of a multi-scale mechanism allows the model to extract and integrate features of MTS data from different scales, thereby obtaining more stable multi-scale feature representations. In the process of signal reconstruction, features of various scales are fused, as shown in Figure 4.
- (4)
- Averaged Window Reconstruction. As is widely known, the input MTS data are transformed into a multiple-window sequence as the model’s input through a sliding-window process. Therefore, the output of the temporal-spatial transformer model is still a sequence of reconstructed windows. To obtain the reconstruction signal of the original MTS data, a typical resolution involves using the mean value or a specific value of the window (the first or the last value) to represent the reconstructed signal corresponding to that time step. Here, we propose a new approach called Averaged Window Reconstruction (AWR). Utilizing AWR, the sliding step is set as 1, and the sliding window size is K. Therefore, every single time step is reconstructed by a series of windows of number K. In the proposed AWR method, we take the average of the corresponding data points from all reconstructed windows that contain a particular timestamp t as the reconstruction data , as shown in the equation below:
5. Experiments and Results Discussion
5.1. Experiments on Simulation Dataset
5.1.1. Simulation Dataset and Metrics
5.1.2. Results and Analysis
5.2. Experiments on Real NPP Dataset
5.2.1. Real NPP Dataset
5.2.2. Results and Discussion
- All five models successfully detected the occurrence of an anomalous event.
- The anomaly onset points identified by the models, except for TranAD, occurred after the true anomaly onset point and before the pump shutdown.
- Among these four models, the proposed TS-Trans model detected the system state deviations at an earlier stage.
6. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
Appendix A
Index | Abbreviated Name | Corresponding Variable |
---|---|---|
1 | TAVG | Coolant average temperature |
2 | THA | Branch A hot-leg temperature |
3 | THB | Branch B hot-leg temperature |
4 | TCA | Branch A cold-leg temperature |
5 | TCB | Branch B cold-leg temperature |
6 | WRCA | Branch A coolant flow rate |
7 | WRCB | Branch B coolant flow rate |
8 | PSGA | Branch A steam generator pressure |
9 | PSGB | Branch B steam generator pressure |
10 | VOL | Coolant volume |
11 | LVPZ | Pressurizer liquid level |
12 | WECS | Emergency core cooling system flow rate |
13 | QMWT | Thermal power |
14 | QMGA | Branch A steam generator extraction thermal power |
15 | QMGB | Branch B steam generator extraction thermal power |
16 | TSAT | Pressurizer saturation temperature |
17 | LVCR | Core water level |
18 | PWR | Core thermal power |
19 | WCHG | Top-up flow rate |
20 | P | Primary side pressure of the reactor coolant system |
References
- Stamford, L.; Azapagic, A. Sustainability indicators for the assessment of nuclear power. Energy 2011, 36, 6037–6057. [Google Scholar] [CrossRef]
- Karakosta, C.; Pappas, C.; Marinakis, V.; Psarras, J. Renewable energy and nuclear power towards sustainable development: Characteristics and prospects. Renew. Sustain. Energy Rev. 2013, 22, 187–197. [Google Scholar] [CrossRef]
- Adamantiades, A.; Kessides, I. Nuclear power for sustainable development: Current status and future prospects. Energy Policy 2009, 37, 5149–5166. [Google Scholar] [CrossRef]
- Ramana, M.V. Nuclear power and the public. Bull. At. Sci. 2011, 67, 43–51. [Google Scholar] [CrossRef]
- Hashemian, H.M. On-line monitoring applications in nuclear power plants. Prog. Nucl. Energy 2011, 53, 167–181. [Google Scholar] [CrossRef]
- Ayo-Imoru, R.M.; Cilliers, A.C. A survey of the state of condition-based maintenance (CBM) in the nuclear power industry. Ann. Nucl. Energy 2018, 112, 177–188. [Google Scholar] [CrossRef]
- Ma, J.; Jiang, J. Applications of fault detection and diagnosis methods in nuclear power plants: A review. Prog. Nucl. Energy 2011, 53, 255–266. [Google Scholar] [CrossRef]
- Arunthavanathan, R.; Khan, F.; Ahmed, S.; Imtiaz, S.; Rusli, R. Fault detection and diagnosis in process system using artificial intelligence-based cognitive technique. Comput. Chem. Eng. 2020, 134, 106697. [Google Scholar] [CrossRef]
- Pang, G.; Shen, C.; Cao, L.; Hengel, A.V.D. Deep learning for anomaly detection: A review. ACM Comput. Surv. CSUR 2021, 54, 1–38. [Google Scholar] [CrossRef]
- Khentout, N.; Magrotti, G. Fault supervision of nuclear research reactor systems using artificial neural networks: A review with results. Ann. Nucl. Energy 2023, 185, 109684. [Google Scholar] [CrossRef]
- Dong, F.; Chen, S.; Demachi, K.; Yoshikawa, M.; Seki, A.; Takaya, S. Attention-based time series analysis for data-driven anomaly detection in nuclear power plants. Nucl. Eng. Des. 2023, 404, 112161. [Google Scholar] [CrossRef]
- Deng, A.; Hooi, B. Graph neural network-based anomaly detection in multivariate time series. In Proceedings of the AAAI Conference on Artificial Intelligence, Virtually, 2–9 February 2021; Volume 35, pp. 4027–4035. [Google Scholar]
- Zong, B.; Song, Q.; Min, M.R.; Cheng, W.; Lumezanu, C.; Cho, D.; Chen, H. Deep autoencoding gaussian mixture model for unsupervised anomaly detection. In Proceedings of the International Conference on Learning Representations, Vancouver, BC, Canada, 30 April–3 May 2018. [Google Scholar]
- He, Y.; Zhao, J. Temporal convolutional networks for anomaly detection in time series. J. Phys. Conf. Ser. 2019, 1213, 042050. [Google Scholar] [CrossRef]
- Hundman, K.; Constantinou, V.; Laporte, C.; Colwell, I.; Soderstrom, T. Detecting spacecraft anomalies using lstms and nonparametric dynamic thresholding. In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, London, UK, 19–23 August 2018; pp. 387–395. [Google Scholar]
- Park, D.; Hoshi, Y.; Kemp, C.C. A multimodal anomaly detector for robot-assisted feeding using an LSTM-based variational autoencoder. IEEE Robot. Autom. Lett. 2018, 3, 1544–1551. [Google Scholar] [CrossRef]
- Tuli, S.; Casale, G.; Jennings, N.R. Tranad: Deep transformer networks for anomaly detection in multivariate time series data. arXiv 2022, arXiv:2201.07284. [Google Scholar] [CrossRef]
- Vaswani, A.; Shazeer, N.; Parmar, N.; Uszkoreit, J.; Jones, L.; Gomez, A.N.; Kaiser, Ł.; Polosukhin, I. Attention is all you need. Adv. Neural Inf. Process. Syst. 2017, 30, 5998–6008. [Google Scholar]
- Islam, S.; Elmekki, H.; Elsebai, A.; Bentahar, J.; Drawel, N.; Rjoub, G.; Pedrycz, W. A comprehensive survey on applications of transformers for deep learning tasks. Expert Syst. Appl. 2023, 241, 122666. [Google Scholar] [CrossRef]
- Abdulaal, A.; Liu, Z.; Lancewicki, T. Practical approach to asynchronous multivariate time series anomaly detection and localization. In Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining, Singapore, 14–18 August 2021; pp. 2485–2494. [Google Scholar]
- Gamboa, J.C.B. Deep learning for time-series analysis. arXiv 2017, arXiv:1701.01887. [Google Scholar]
- Choi, K.; Yi, J.; Park, C.; Yoon, S. Deep learning for anomaly detection in time-series data: Review, analysis, and guidelines. IEEE Access 2021, 9, 120043–120065. [Google Scholar] [CrossRef]
- Nie, Y.; Nguyen, N.H.; Sinthong, P.; Kalagnanam, J. A Time Series is Worth 64 Words: Long-term Forecasting with Transformers. In Proceedings of the Eleventh International Conference on Learning Representations, Virtual Event, 25–29 April 2022. [Google Scholar]
- Zhang, Y.; Yan, J. Crossformer: Transformer utilizing cross-dimension dependency for multivariate time series forecasting. In Proceedings of the Eleventh International Conference on Learning Representations, Kigali, Rwanda, 1–5 May 2023. [Google Scholar]
- Liu, Y.; Hu, T.; Zhang, H.; Wu, H.; Wang, S.; Ma, L.; Long, M. itransformer: Inverted transformers are effective for time series forecasting. arXiv 2023, arXiv:2310.06625. [Google Scholar]
- Yang, C.; Cai, B.P.; Wu, Q.B.; Wang, C.Y.S.; Ge, W.F.; Hu, Z.M.; Zhu, W.; Zhang, L.; Wang, L.T. Digital twin-driven fault diagnosis method for composite faults by combining virtual and real data. J. Ind. Inf. Integr. 2023, 33, 100469. [Google Scholar] [CrossRef]
- Song, Q.; Wang, M.S.; Lai, W.X.; Zhao, S.F. On Bayesian optimization-based residual CNN for estimation of inter-turn short circuit fault in PMSM. IEEE Trans. Power Electron. 2022, 38, 2456–2468. [Google Scholar] [CrossRef]
- Nechibvute, A.; Mudzingwa, C. Wireless sensor networks for scada and industrial control systems. Int. J. Eng. Technol. 2013, 3, 1025–1035. [Google Scholar]
- McLachlan, G.J. Mahalanobis distance. Resonance 1999, 4, 20–26. [Google Scholar] [CrossRef]
- De Maesschalck, R.; Jouan-Rimbaud, D.; Massart, D.L. The mahalanobis distance. Chemom. Intell. Lab. Syst. 2000, 50, 1–18. [Google Scholar] [CrossRef]
- Qi, B.; Xiao, X.; Liang, J.; Po, L.-C.C.; Zhang, L.; Tong, J. An open time-series simulated dataset covering various accidents for nuclear power plants. Sci. Data 2022, 9, 766. [Google Scholar] [CrossRef]
- Cheng, Y.-H.; Shih, C.; Chiang, S.-C.; Weng, T.-L. Introducing PCTRAN as an evaluation tool for nuclear power plant emergency responses. Ann. Nucl. Energy 2012, 40, 122–129. [Google Scholar] [CrossRef]
- Jin, Y.; Qiu, C.; Sun, L.; Peng, X.; Zhou, J. Anomaly detection in time series via robust PCA. In Proceedings of the 2017 2nd IEEE International Conference on Intelligent Transportation Engineering (ICITE), Singapore, 1–3 September 2017; pp. 352–355. [Google Scholar]
Dataset | Basic Operational Status | Accident Type | Accident Severity | Total Size | Anomalies |
---|---|---|---|---|---|
Training Set | 100% FP | Normal | / | 1800 | 0 |
Test Set | 100% FP | LOCA | 0.01 | 300 | 150 |
LOCAC | 0.01 | 300 | 150 | ||
LR | 0.01 | 300 | 150 | ||
SGTR | 0.01 | 300 | 150 | ||
SLBIC | 0.01 | 300 | 150 | ||
SLBOC | 0.01 | 300 | 150 |
Model | LOCA | LOCAC | LR | ||||||
F1 | P | R | F1 | P | R | F1 | P | R | |
DAGMM | 0.983 | 0.966 | 1 | 0.966 | 0.934 | 1 | 0.977 | 0.955 | 1 |
LSTM-VAE | 0.955 | 0.913 | 1 | 0.977 | 0.955 | 1 | 0.961 | 0.926 | 1 |
PCA | 0.988 | 0.985 | 0.99 | 0.965 | 0.968 | 1 | 0.802 | 0.963 | 0.687 |
TranAD | 0.982 | 0.965 | 1 | 0.956 | 0.916 | 1 | 0.966 | 0.934 | 1 |
TS-Trans | 0.997 | 1 | 0.995 | 0.984 | 0.968 | 1 | 0.987 | 0.974 | 1 |
Model | SGTR | SLBIC | SLBOC | ||||||
F1 | P | R | F1 | P | R | F1 | P | R | |
DAGMM | 0.946 | 0.898 | 1 | 0.977 | 0.955 | 1 | 0.983 | 0.966 | 1 |
LSTM-VAE | 0.937 | 0.882 | 1 | 0.971 | 0.943 | 1 | 0.959 | 0.922 | 1 |
PCA | 0.987 | 0.974 | 1 | 0.748 | 0.958 | 0.613 | 0.982 | 0.99 | 0.975 |
TranAD | 0.966 | 0.934 | 1 | 0.922 | 0.855 | 1 | 0.974 | 0.95 | 1 |
TS-Trans | 0.993 | 0.987 | 1 | 0.984 | 0.968 | 1 | 0.988 | 0.976 | 1 |
Dataset | Basic Operational Status | Dimensions | Total Size | Anomalies | Anomaly Rate |
---|---|---|---|---|---|
Training Set | 85% FP Steady-State | 30 | 30,000 | / | / |
Test Set | 85% FP Steady-State | 30 | 18,000 | 859 | 4.77% |
Model | Window Size | F1 | P | R |
---|---|---|---|---|
DAGMM | 60 | 0.711 | 0.6 | 0.873 |
LSTM-VAE | 60 | 0.907 | 0.877 | 0.939 |
PCA | / | 0.877 | 0.867 | 0.886 |
TranAD | 60 | 0.768 | 0.624 | 0.999 |
TS-Trans | 60 | 0.962 | 0.928 | 0.998 |
Model | F1 | P | R |
---|---|---|---|
DAGMM | 0.913923 | 0.97775 | 0.857918 |
LSTM-VAE | 0.915176 | 0.968254 | 0.867615 |
PCA | 0.920541 | 0.984887 | 0.864088 |
TranAD | 0.854501 | 0.966527 | 0.765746 |
TS-Trans | 0.979798 | 0.995439 | 0.964641 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Yi, S.; Zheng, S.; Yang, S.; Zhou, G.; Cai, J. Anomaly Detection for Asynchronous Multivariate Time Series of Nuclear Power Plants Using a Temporal-Spatial Transformer. Sensors 2024, 24, 2845. https://doi.org/10.3390/s24092845
Yi S, Zheng S, Yang S, Zhou G, Cai J. Anomaly Detection for Asynchronous Multivariate Time Series of Nuclear Power Plants Using a Temporal-Spatial Transformer. Sensors. 2024; 24(9):2845. https://doi.org/10.3390/s24092845
Chicago/Turabian StyleYi, Shuang, Sheng Zheng, Senquan Yang, Guangrong Zhou, and Jiajun Cai. 2024. "Anomaly Detection for Asynchronous Multivariate Time Series of Nuclear Power Plants Using a Temporal-Spatial Transformer" Sensors 24, no. 9: 2845. https://doi.org/10.3390/s24092845
APA StyleYi, S., Zheng, S., Yang, S., Zhou, G., & Cai, J. (2024). Anomaly Detection for Asynchronous Multivariate Time Series of Nuclear Power Plants Using a Temporal-Spatial Transformer. Sensors, 24(9), 2845. https://doi.org/10.3390/s24092845