REFINE: REalistic Fault INjEction using compiler-based instrumentation
Compiler-based fault injection (FI) has become a popular technique for resilience studies to understand the impact of soft errors in supercomputing systems. Compiler-based FI frameworks inject faults at a high intermediate-representation level. However, they are less accurate than machine code, binary-level FI because they lack access to all dynamic instructions, thus they fail to mimic certain fault manifestations. REFINE, a novel framework that addresses these limitations, performs FI in a compiler backend. This approach provides the portability and efficiency of compiler-based FI while keeping accuracy comparable to binary-level FI methods.
Related Publications
- Giorgis Georgakoudis, Ignacio Laguna, Dimitrios S. Nikolopoulos, Martin Schulz. REFINE: Realistic Fault Injection via Compiler-based Instrumentation for Accuracy, Portability and Speed. In Proceedings of the 30th IEEE/ACM International Conference on High Performance Computing, Networking, Storage and Analysis (SC) 2017, Denver, CO, USA, November 12-17, 2017. IEEE Computer Society, Los Alamitos, CA, USA. Acceptance rate 18.7% (61/327). (Paper)
Source Code
GitHub: https://github.com/ggeorgakoudis/REFINE