%A Shengji Yu Xiwei Qiu %T Performability Analysis of a Parallel Service Considering Multiple Types of Failures %0 Journal Article %D 2017 %J Int J Performability Eng %R 10.23940/ijpe.17.03.p9.330333 %P 330-333 %V 13 %N 3 %U {https://www.ijpe-online.com/CN/abstract/article_3892.shtml} %8 2017-05-01 %X

Parallel computing is an important approach to achieve a high throughput of serving user requests, which has significant influence on improving performance. Parallel computing service can be realized by hosting multiple copies of the software that performs the same service tasks on different physical machines running in parallel. However, the execution of the software may be interrupted by various kinds of failures, including software failures, hardware failures, and common cause failures (CCF) of co-located copies of the software caused by the failures of the host machine. To analyze the performability of a parallel service, unexpected change of performance caused by random failures and subsequent process of recovery should be counted. This paper presents a theoretical modeling approach encompassing Markov reward models to analyze the performability of a parallel service, which considers software failures, hardware failures, and common cause failures to ensure high fidelity. Simulation results are illustrated to verify the new model.


Submitted on January 22, 2017; First Revised on March 1, 2017; Final Rivised on April 18, 2017; Accepted on April 19, 2017
References: 5