Int J Performability Eng ›› 2020, Vol. 16 ›› Issue (10): 1627-1636.doi: 10.23940/ijpe.20.10.p14.16271636

Previous Articles     Next Articles

Fast Pruning Algorithm and Task Scheduling under Map/Reduce

Shujun Pei*, Yu Zhang, and Chao Liang   

  1. School of Computer Science and Technology, Harbin University of Science and Technology, Harbin, 150080, China
  • Submitted on ; Revised on ; Accepted on
  • Contact: * E-mail address: peisj@hrbust.edu.cn

Abstract: In the cloud environment, the resource utilization and overall efficiency of traditional task allocation and scheduling algorithms are very low. In order to improve the overall efficiency of Map/Reduce processing task allocation and improve the resource utilization efficiency of nodes, a pruning algorithm based on task processing time is proposed. In the pruning algorithm, the processing time of each task in each node is used for quantitative modeling, the task and processing time measurement matrix is established, and the rank pruning method is used to reduce the size of the assigned tasks. Only by assigning (N-1) tasks can the allocation problem of N nodes processing N tasks be solved, and the optimal solution of the task can be obtained. This article uses MATLAB to simulate the cloud environment and compares the pruning algorithm with traditional algorithms (including FIFO scheduling and capacity scheduling algorithms). Simulation results show that the pruning algorithm can significantly improve the overall efficiency of task scheduling under a large amount of data testing, and make full use of the computing power of nodes to improve the efficiency of Map/Reduce scheduling.

Key words: pruning optimization, task scheduling, cloud environment, Hadoop, Map/Reduce