Int J Performability Eng ›› 2018, Vol. 14 ›› Issue (11): 2864-2876.doi: 10.23940/ijpe.18.11.p32.28642876

Previous Articles     Next Articles

An Online HDP Mixture Model for Video Mining

Lin Tanga, Lin Liub, Mingjing Tangc, and Yu Sunb, *   

  1. a Key Laboratory of Educational Informatization for Nationalities Ministry of Education, Yunnan Normal University, Kunming, 650500, China;
    b School of Information, Yunnan Normal University, Kunming, 650500, China;
    c President Office, Yunnan Normal University, Kunming, 650500, China
  • Submitted on ;
  • Contact: * E-mail address: speed8101@sina.com
  • About author:Lin Tang graduated from Shanghai Jiaotong University with a bachelor's degree and Chinese Academy of Armament Sciences with a Ph.D. He worked as an engineer at China North Industries Group Corporation Limited from 2004 to 2015. He is currently a senior engineer at Yunnan Normal University. His main research interests include video mining and probabilistic graphical models.Lin Liu graduated from the School of Information at Yunnan University with a bachelor's degree, Master's degree, and Ph.D. She is currently a lecturer at Yunnan Normal University. Her main research interests include bioinformatics and machine learning.Mingjing Tang is a Ph.D. candidate at Yunnan University and a lecturer at Yunnan Normal University. His research focuses on software engineering, process mining, and machine learning.Yu Sun received her bachelor's degree from East China Normal University, Master's degree from Yunnan Normal University, and Ph.D. from Chinese Academy of Sciences. She is currently a professor at Yunnan Normal University. Her current research interests include intelligent education.

Abstract: In this paper, we address two problems in video mining: real-time inference and the automatic decision of the number of activities in videos. To solve these problems, we present a real-time Bayesian non-parametric model that is able to discover activities and interactions of videos in real-time. In this model, there are two layers modeled in each scene, which are activities and interactions. An activity is represented as the distribution over visual words, and an interaction is represented as the distribution over activities. Then, the Hierarchical Dirichlet Process (HDP) model connects these two layers of video and automatically decides the number of clusters. Moreover, we developed a hybrid stochastic variational Gibbs sampling algorithm for inferring the parameters of the HDP mixture model. This online inference algorithm has the capacity to process the massive video stream dataset. Finally, the detailed experimental results in a crowded traffic scene and a simulated dataset are described and reveal that our online HDP mixture model achieves superior performance in real-time anomaly activity detection.

Key words: video mining, non-parametric model, inference algorithm