Adaptive Switching Redundant-Mode Multi-Core System for Photovoltaic Power Generation
Abstract
:1. Introduction
- A hardware-based dynamic switching mechanism that analyzes the performance and error rate of the system in the current scenario is used to adaptively transition between different scenarios. This mechanism utilizes the hardware architecture to statistically analyze the error rate and performance, which serve as the switching metrics.
- A redundancy-mode switchable multi-core(RMSM) processor system that can switch between three modes: balanced (using DMR and single-core parallelism), high-reliability (based on TMR), and high-performance (using three-core parallelism) is designed and constructed.
- Checkpoint backup, pipeline rollback techniques, and fault isolation mechanisms are optimized to rectify errors in processors that go beyond redundant fault tolerance while also enhancing system reliability in high-performance modes.
- A soft error probability model is constructed based on the mechanism and patterns of soft error occurrences, and soft error injection techniques are implemented to validate system reliability, which is an efficient method for validating redundant systems.
2. Analysis
2.1. Challenge 1: Selection of Switching Indicators
2.2. Challenge 2: Balance Between Reliability and Overhead
- Output comparison is necessary for TMR and DMR designs. TMR is capable of identifying the origin of a mistake by means of a majority voting circuit. It then proceeds to rectify the problem by moving forward with the recovery process, guided by the outcome of the voting circuits. However, DMR has the capability to detect the outcome but lacks the ability to accurately identify the proper outcome [23].
- Mistakes like branch prediction failures, aside from the impact of soft errors, cannot be resolved by redundant computations. For error recovery, these mistakes need extra rollback technology [24].
- The suggested adaptive dynamic switching system aims to address the issue of soft errors by alternating between different modes. Lock-stepping several CPUs imposes requirements on the structure.
2.3. Challenge 3: A Method of Efficient Evaluation
3. Dynamic Switching Strategy
3.1. Switching Metrics Based on Error Rate and Performance
3.2. Joint Statistical Dynamic Switching Mechanism Based on Error Rate and Performance
3.3. Dynamic Mode Switching
3.4. Cache Mode Switching
4. Redundancy Mode Switchable Multi-Core Processor System
- High-reliability ModeThe high-reliability mode is a fault-tolerant system that employs TMR. In this mode, the data stream of Processor 1 is fed into Processors 2 and 3. All three processors execute the same program, resulting in a performance equivalent to that of a single-core processor. TMR mode is a type of forward error correction that can mask error outputs caused by soft errors without requiring a checkpoint backup mechanism. It is built on a dual-core lockstep that features triple modular redundancy and adds a multiple-input voting circuit for correct status output.System faults can be classified as repairable or non-repairable. If only one processor has an error, it can be restored to normal operation. However, if two or more processors have errors, the system will enter an unrecoverable state, necessitating a reset. The TMR mode, with its robust architecture, offers a high degree of reliability and fault tolerance for mission-critical applications.
- Balanced ModeThe balanced mode of operation involves Processor 1 and Processor 2 working in a dual-core redundancy mode, while Processor 3 operates independently. This mode of operation aims to balance reliability and performance. The input data stream of Processor 1 is bypassed to Processor 2 after passing through mode selection logic. The output data streams of Processor 1 and Processor 2 are synchronized through a dual-core lock-step before being released. Processor 3 operates with independent input and output data streams. The dual-core lock-step redundancy fault-tolerance method typically employs fault detection and fault recovery techniques. This paper utilizes hardware pipeline lock-step redundancy technology, which encompasses fault detection, fault recovery, and fault isolation at the hardware level. Fault detection combines processor replication with checksum circuits, while fault recovery employs fully hardware-based checkpoint backup and pipeline rollback. Fault isolation prevents errors from propagating to external storage.
- High-performance ModeWhen operating in high-performance mode, the system foregoes fault-tolerance capabilities. Specifically, Processor 1, Processor 2, and Processor 3 operate concurrently to process programs. All three processors maintain independent input and output data streams, each with its own distinct bus, data storage, and instruction storage. Communication between multiple cores is facilitated through shared RAM, allowing each core to trigger an interruption for the others. This communication occurs via data preparation and triggered interruptions, rendering processors in this mode comparable to a standard tri-core processor.
4.1. Optimizing for a System with Enhanced Reliability
4.1.1. Software-Hardware Coordinated Checkpoint Backup Method
4.1.2. Pipeline Structure for Lock-Step Fast Rollback
4.1.3. Fault Isolation Method for Pipeline Processors Based on Read-Write Cache Flag
5. Evaluation
5.1. Set Up
5.2. Soft Error Injection
5.3. Evaluation Framework of TMR
5.4. Results and Analysis
5.4.1. Validation of the Intelligent MPPT Algorithm on the RMSM System
5.4.2. Single-Mode Reliability Analysis
5.4.3. Dynamic Switching State System Reliability and Performance Comparison
5.4.4. Benchmark-Based Simulation of PV System Control
- 531.deepsjeng_r benchmark involves extensive use of search algorithms and decision-making logic. Intelligent MPPT algorithms need to explore multiple scenarios to find the maximum power point under various conditions. The search-heavy nature of 531.deepsjeng_r is analogous to the optimization process in MPPT, where it continuously searches for the optimal power output.
- 541.leela_r Benchmark uses Monte Carlo Tree Search and neural network evaluations. Intelligent MPPT algorithms take advantage of machine learning for optimization. The decision-making and search processes in 541.leela_r mimic the MPPT algorithms’ need to adjust to changing environmental conditions to maximize power output, making this model suitable for evaluating AI-based MPPT algorithms.
- 505.mcf_r involves heavy integer computations and optimization algorithms. Similarly to how 505.mcf_r optimizes transportation networks, MPPT algorithms optimize the power output of PV systems. This model tests the system’s ability to handle dynamic optimization and resource allocation, which aligns well with the continuous adjustment needed in MPPT to maximize efficiency.
- 519.lbm_r benchmark requires intensive numerical computations and high memory bandwidth. Intelligent algorithms require high-frequency numerical calculations to adapt to rapidly changing environmental inputs (current, voltage, and irradiance). The dense computational workload of 519.lbm_r is effective in simulating the high computational demands of MPPT algorithms, especially for evaluating the performance of the RMSM system under high-load conditions.
6. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Babu, N.; Guerrero, J.M.; Siano, P.; Peesapati, R.; Panda, G. A Novel Modified Control Scheme in Grid-Tied Photovoltaic System for Power Quality Enhancement. IEEE Trans. Ind. Electron. 2021, 68, 11100–11110. [Google Scholar] [CrossRef]
- Esram, T.; Chapman, P.L. Comparison of Photovoltaic Array Maximum Power Point Tracking Techniques. IEEE Trans. Energy Convers. 2007, 22, 439–449. [Google Scholar] [CrossRef]
- Bendib, B.; Belmili, H.; Krim, F. A survey of the most used MPPT methods: Conventional and advanced algorithms applied for photovoltaic systems. Renew. Sustain. Energy Rev. 2015, 45, 637–648. [Google Scholar] [CrossRef]
- Colli, A. Failure Mode and Effect Analysis for Photovoltaic Systems. Renew. Sustain. Energy Rev. 2015, 50, 804–809. [Google Scholar] [CrossRef]
- Tsoumanis, P.; Paliaroutis, G.I.; Evmorfopoulos, N.; Stamoulis, G. Analysis of the Impact of Electrical and Timing Masking on Soft Error Rate Estimation in VLSI Circuits. Technologies 2022, 10, 23. [Google Scholar] [CrossRef]
- Parashar, A.; Sivasubramaniam, A.; Gurumurthi, S. SlicK: Slice-based locality exploitation for efficient redundant multithreading. ACM SIGOPS Oper. Syst. Rev. 2006, 40, 95–105. [Google Scholar] [CrossRef]
- de Oliveira, A.B.; Kastensmidt, F.L. Evaluating Fault-Tolerant Techniques on COTS RISC-V NOEL-V Processor in Zynq UltraScale+ FPGA Under Proton Testing. IEEE Trans. Nucl. Sci. 2023, 70, 1708–1715. [Google Scholar] [CrossRef]
- Wilson, A.E.; Wirthlin, M.; Baker, N.G. Neutron Radiation Testing of RISC-V TMR Soft Processors on SRAM-Based FPGAs. IEEE Trans. Nucl. Sci. 2023, 70, 603–610. [Google Scholar] [CrossRef]
- Berg, M.; LaBel, K.; Campola, M.; Xapsos, M. Analyzing System on A Chip Single Event Upset Responses Using Single Event Upset Data, Classical Reliability Models, and Space Environment Data. In Proceedings of the 2017 Radiation Effects on Components and Systems (RADECS) Conference, Geneva, Switzerland, 2–6 October 2017. [Google Scholar]
- Dos Santos, F.F.; Kritikakou, A.; Sentieys, O. Experimental Evaluation of Neutron-Induced Errors on a Multicore RISC-V Platform. In Proceedings of the 2022 IEEE 28th International Symposium on On-Line Testing and Robust System Design (IOLTS), Torino, Italy, 12–14 September 2022; pp. 1–7. [Google Scholar] [CrossRef]
- dos Santos, A.F.; Tambara, L.A.; Benevenuti, F.; Tonfat, J.; Kastensmidt, F.L. Applying TMR in Hardware Accelerators Generated by High-Level Synthesis Design Flow for Mitigating Multiple Bit Upsets in SRAM-Based FPGAs. In Proceedings of the Applied Reconfigurable Computing; Wong, S., Beck, A.C., Bertels, K., Carro, L., Eds.; Lecture Notes in Computer Science; Springer: Cham, Switzerland, 2017; pp. 202–213. [Google Scholar] [CrossRef]
- Vadlamani, R.; Zhao, J.; Burleson, W.; Tessier, R. Multicore soft error rate stabilization using adaptive dual modular redundancy. In Proceedings of the 2010 Design, Automation & Test in Europe Conference & Exhibition (DATE 2010), Dresden, Germany, 8–12 March 2010; pp. 27–32. [Google Scholar]
- Vera, X.; Abella, J.; Carretero, J.; González, A. Selective Replication: A Lightweight Technique for Soft Errors. ACM Trans. Comput. Syst. 2010, 27, 1–30. [Google Scholar] [CrossRef]
- Andjelkovic, M.; Chen, J.; Simevski, A.; Stamenkovic, Z.; Krstic, M.; Kraemer, R. A Review of Particle Detectors for Space-Borne Self-Adaptive Fault-Tolerant Systems. In Proceedings of the 2020 IEEE East-West Design & Test Symposium (EWDTS), Varna, Bulgaria, 4–7 September 2020; pp. 1–8. [Google Scholar] [CrossRef]
- Baharvand, F.; Miremadi, S.G. ARMOR: Adaptive Reliability Management by On-the-Fly Redundancy in Multicore Embedded Processors. In Proceedings of the 2015 IEEE 21st Pacific Rim International Symposium on Dependable Computing (PRDC), Zhangjiajie, China, 18–20 November 2015; pp. 215–224. [Google Scholar] [CrossRef]
- Kottke, T.; Steininger, A. A Reconfigurable Generic Dual-Core Architecture. In Proceedings of the International Conference on Dependable Systems and Networks (DSN’06), Philadelphia, PA, USA, 25–28 June 2006; pp. 45–54. [Google Scholar] [CrossRef]
- Asghari, S.A.; Binesh Marvasti, M.; Rahmani, A.M. Enhancing transient fault tolerance in embedded systems through an OS task level redundancy approach. Future Gener. Comput. Syst. 2018, 87, 58–65. [Google Scholar] [CrossRef]
- Subramanyan, P.; Singh, V.; Saluja, K.K.; Larsson, E. Multiplexed redundant execution: A technique for efficient fault tolerance in chip multiprocessors. In Proceedings of the 2010 Design, Automation & Test in Europe Conference & Exhibition (DATE 2010), Dresden, Germany, 8–12 March 2010; pp. 1572–1577. [Google Scholar] [CrossRef]
- Bolchini, C.; Carminati, M.; Miele, A. Self-Adaptive Fault Tolerance in Multi-/Many-Core Systems. J. Electron. Test. 2013, 29, 159–175. [Google Scholar] [CrossRef]
- Chen, J.; Lu, L.; Andjelkovic, M.; Ulbricht, M.; Krstic, M. Adaptive Lock-Step System for Resilient Multiprocessing Architectures. In Proceedings of the 2023 IEEE Nordic Circuits and Systems Conference (NorCAS), Aalborg, Denmark, 31 October–1 November 2023; pp. 1–6. [Google Scholar] [CrossRef]
- Krstić, M.; Andjelković, M.; Chen, J.; Lu, L.; Veronesi, A.; Ulbricht, M. Resiliency in Digital Processing Systems. In Proceedings of the 2023 IEEE 33rd International Conference on Microelectronics (MIEL), Nis, Serbia, 16–18 October 2023; pp. 1–8. [Google Scholar] [CrossRef]
- de Oliveira, Á.B.; Rodrigues, G.S.; Kastensmidt, F.L.; Added, N.; Macchione, E.L.A.; Aguiar, V.A.P.; Medina, N.H.; Silveira, M.A.G. Lockstep Dual-Core ARM A9: Implementation and Resilience Analysis Under Heavy Ion-Induced Soft Errors. IEEE Trans. Nucl. Sci. 2018, 65, 1783–1790. [Google Scholar] [CrossRef]
- Nedeljković, J.N.; Đošić, S.M.; Nikolić, G.S. A Survey of Hardware Fault Tolerance Techniques. In Proceedings of the 2023 58th International Scientific Conference on Information, Communication and Energy Systems and Technologies (ICEST), Nis, Serbia, 29 June–1 July 2023; pp. 223–226. [Google Scholar] [CrossRef]
- Aung, T.; Min, H.Y.; Maw, A.H. Coordinate Checkpoint Mechanism on Real-Time Messaging System in Kafka Pipeline Architecture. In Proceedings of the 2019 International Conference on Advanced Information Technologies (ICAIT), Yangon, Myanmar, 6–7 November 2019; pp. 37–42. [Google Scholar] [CrossRef]
- Eslami, M.; Ghavami, B.; Raji, M.; Mahani, A. A Survey on Fault Injection Methods of Digital Integrated Circuits. Integration 2020, 71, 154–163. [Google Scholar] [CrossRef]
- Violante, M.; Meinhardt, C.; Reis, R.; Sonza Reorda, M. A Low-Cost Solution for Deploying Processor Cores in Harsh Environments. IEEE Trans. Ind. Electron. 2011, 58, 2617–2626. [Google Scholar] [CrossRef]
Resources | Resources Occupation |
---|---|
LUT | 43,115 |
LUTRAM | 218 |
FF | 32,155 |
BRAM | 558 |
DSP | 12 |
MMCM | 3 |
Operation Mode | Invalid | Success | Fail | IFR | FRR | |
---|---|---|---|---|---|---|
Our Work | High-Performance | 4114 | 756 | 130 | 82.28% | 85.33% |
Balanced | 4129 | 864 | 7 | 85.28% | 99.20% | |
High-Reliability | 4163 | 834 | 3 | 83.26% | 99.64% | |
Chen et al. [20] | Single-Core | - | 3843 | 517 | - | 87.08% |
DCLS | - | 7062 | 938 | - | 88.28% | |
TMR | - | 8964 | 36 | - | 99.60% |
The Bit Width per Injection | Invalid | Success | Fail | IFR | FRR | |
---|---|---|---|---|---|---|
Chen et al. [20] | 3 | - | 8964 | 36 | - | 99.60% |
Our Work | 1 | 4163 | 834 | 3 | 83.26% | 99.64% |
2 | 3547 | 1450 | 3 | 70.94% | 99.79% | |
4 | 784 | 4178 | 39 | 15.68% | 99.09% | |
8 | 126 | 4831 | 43 | 2.52% | 99.11% |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Liu, L.; Zhang, X.; Zhou, J.; Niu, K.; Guo, Z.; Zhao, Y.; Zhang, M. Adaptive Switching Redundant-Mode Multi-Core System for Photovoltaic Power Generation. Sensors 2024, 24, 7561. https://doi.org/10.3390/s24237561
Liu L, Zhang X, Zhou J, Niu K, Guo Z, Zhao Y, Zhang M. Adaptive Switching Redundant-Mode Multi-Core System for Photovoltaic Power Generation. Sensors. 2024; 24(23):7561. https://doi.org/10.3390/s24237561
Chicago/Turabian StyleLiu, Liang, Xige Zhang, Jiahui Zhou, Kai Niu, Zixuan Guo, Yawen Zhao, and Meng Zhang. 2024. "Adaptive Switching Redundant-Mode Multi-Core System for Photovoltaic Power Generation" Sensors 24, no. 23: 7561. https://doi.org/10.3390/s24237561
APA StyleLiu, L., Zhang, X., Zhou, J., Niu, K., Guo, Z., Zhao, Y., & Zhang, M. (2024). Adaptive Switching Redundant-Mode Multi-Core System for Photovoltaic Power Generation. Sensors, 24(23), 7561. https://doi.org/10.3390/s24237561