Int J Performability Eng ›› 2019, Vol. 15 ›› Issue (8): 2237-2248.doi: 10.23940/ijpe.19.08.p24.22372248

Previous Articles     Next Articles

Deep Walk Algorithm based on Improved Random Walk with Equal Probability

Zhonglin Yea,b,c,d, Haixing Zhaoa,b,c,d,*, Ke Zhanga,c,d, Yanlin Yanga,c,d, and Lei Menga,c,d   

  1. a College of Computer, Qinghai Normal University, Xining, 810008, China
    b College of Computer Science, Shaanxi Normal University, Xi'an, 710119, China
    c Tibetan Information Processing and Machine Translation Key Laboratory of Qinghai Province, Xining, 810008, China
    d Key Laboratory of Tibetan Information Procedureing, Ministry of Education, Xining, 810008, China
  • Submitted on ;
  • Contact: * E-mail address: haixing_zhao@163.com
  • About author:Zhonglin Ye is a graduate student in the School of Computer Science in Shaanxi Normal University. His research interests include data mining, knowledge discovery, and representation learning and understanding. Haixing Zhao is a professor and Ph.D. supervisor. He received his Doctor of Engineering Degree from the School of Computer Science at Northwestern Polytechnical University in 2004. He also received his Doctor of Science Degree from Twente University in Holland. His research interests include complex networks, semantic networks, machine translation, hypergraph theory and databases, and network reliability. Ke Zhang is a Ph.D. candidate. His research interests include complex networks. Yanlin Yang is an M.S. candidate. Her research interests include complex networks and link prediction. Lei Meng is an M.S. candidate. His research interests include data mining and hyper networks.

Abstract: Most of the existing network representation learning algorithms are mainly based on the DeepWalk algorithm. The main improvement is to modify DeepWalk's three-layer neural network to multi-layer neural network or to introduce the network's external attributes into the DeepWalk algorithm for joint representation learning. In network representation learning, the random walk strategy can be considered as network data preprocessing of network representation learning tasks. Learning the random walk sequences, including most of the network structure features, is very important for the network representation learning algorithm, because the subsequent training procedure of neural networks is to continuously adjust the representation of each node in the network based on the co-occurrence of node pairs. Therefore, the purpose of this paper is to improve the random walk strategy of the DeepWalk algorithm. We propose a novel DeepWalk algorithm based on the improved random walk with equal probability (EPDW). Although the next node in the random walk of the DeepWalk algorithm is chosen with equal probability, one of the neighbouring nodes of the current node is chosen as the next node of the random walk by the pseudo-random number. The improvement of EPDW is to choose the next hop node in the random walk using the roulette method of cumulative probability sum of random walk nodes. Although this method also chooses the next hop node with equal probability, it can choose the next hop random walk node more reasonably and effectively.

Key words: network representation learning, network representation, network embedding, feature learning