REFINE: REalistic Fault INjEction using compiler-based instrumentation
Compiler-based fault injection (FI) has become a popular technique for resilience studies to understand the impact of soft errors in supercomputing systems. Compiler-based FI frameworks inject faults at a high intermediate-representation level. However, they are less accurate than machine code, binary-level FI because they lack access to all dynamic instructions, thus they fail to mimic certain fault manifestations. REFINE, a novel framework that addresses these limitations, performs FI in a compiler backend. This approach provides the portability and efficiency of compiler-based FI while keeping accuracy comparable to binary-level FI methods.
GitHub: https://github.com/ggeorgakoudis/REFINE