Major Contributions
A concurrent error detection scheme that offers low latency and low overhead using architectural features.
A Comprehensive Recovery Protocol (CReP) that exploits characteristics of the concurrent error detection to improve rollback performance.
An Object-oriented Testbed for Checkpointing and Recovery Systems (OTEC) for evaluating error detection, checkpointing and recovery schemes.