Int J Performability Eng ›› 2011, Vol. 7 ›› Issue (2): 195-197.doi: 10.23940/ijpe.11.2.p195.mag

• Short Communications • Previous Articles     Next Articles

Recovering Lagging Replicas in a Fault Tolerant System


  1. Department of Electrical and Computing Engineering, Cleveland State University
    2121 Euclid Ave., Cleveland, OH 44115


In this paper, we discuss an often-ignored, but very important issue, i.e., how to recover slow replicas quickly in a fault tolerant system. Despite the fact that the replicas are deployed in identically-equipped computing nodes, under heavy load, some replicas would lag behind due to various reasons. Quickly recovering slow replicas is important because not doing so could result in reduced throughput, high jitters in end-to-end latency, and reduced replication degree.
Received on July 14, 2010, revised on November 10, 2010
References: 5