Int J Performability Eng ›› 2018, Vol. 14 ›› Issue (9): 2015-2020.doi: 10.23940/ijpe.18.09.p9.20152020

Previous Articles     Next Articles

Reliability Simulation in Cloud Computing System

Sa Meng, Xiwei Qiu, Liang Luo*, Han Xu, and Meilian Lei   

  1. School of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu, 611731, China
  • Revised on ; Accepted on
  • Contact: * E-mail address: luolianng@gmail.com

Abstract: With the large-scale increase of users, the reliability of the cloud system has become a challenging issue in the industry and academia. Many researchers have studied the reliability mechanism of cloud computing systems and proposed reliability awareness methods to achieve resource integration and improve system reliability. However, various hardware and software failures occur inevitably and cannot be accurately found and repaired in a timely manner. Moreover, since most of the studies cannot determine the background operation mechanism of the cloud system, this brings significant problems to the research of cloud computing reliability. To solve this problem, we first extract the key features that can be used to increase system reliability in cloud computing architectures. Secondly, we present an architecture framework for reliability simulation and analyze four types of common system failures: hardware failures, virtual machine failures, data inconsistency failures, and service timeout failures. Finally, experiments and verification based on a set of realistic configurations and operation runtimes are implemented as an extension of a well-known cloud simulation tool, CloudSim, to illustrate how these failures affect the reliability of cloud computing systems and how different resource scheduling algorithms handle these failures.

Key words: cloud computing, reliability simulation, failures, virtual machine