Int J Performability Eng ›› 2017, Vol. 13 ›› Issue (4): 511-518.doi: 10.23940/ijpe.17.04.p18.511518

• Original articles • Previous Articles     Next Articles

Active Learning Method for Chinese Spam Filtering

Guanglu Suna, b, Shaobo Lia, Teng Chena, Xuhang Lia, and Suxia Zhua, b   

  1. aSchool of Computer Science and Technology, Harbin University of Science and Technology, Harbin 150080, China
    bResearch Center of Information Security & Intelligent Technology, Harbin University of Science and Technology, Harbin, 150080, China

Abstract:

An active learning method is put forward to filter Chinese spam. In terms of training the filtering model, labeling all of the emails seems to be costly and time-consuming, while unlabeled emails can be easily accessed. Misclassification and a low-certainty method is proposed to reduce the number of labeled emails. The ROSVM model is also utilized as the online filtering model. The experimental results show that the proposed method not only decreases the number of training emails and the computational cost of spam filter, but also improves the accuracy of the filter.


Submitted on February 20, 2017; Revised on May 11, 2017; Accepted on June 15, 2017
References: 21