Replication-Based Fault Tolerance for MPI Applications. Walters, J. & Chaudhary, V. Parallel and Distributed Systems, IEEE Transactions on, 20(7):997–1010, IEEE, 2009.
Replication-Based Fault Tolerance for MPI Applications [pdf]Paper  bibtex   

Downloads: 0