|
E. J. Chen, E. B. Jiang. "Review of Studies on Text Similarity Measures," [J]. Data Analysis and Knowledge Discovery,2017,1(06):1-11.
|
|
F. J. Feng, J. P. Yao, X. S. Li, J. C. Ma. "Research on the Data Cleaning Framework," [J]. COMPUTER ENGINEERING & SOFTWARE,2017,38(12):193-196.
|
|
Gartner IT Glossary. "Dark Data," [EB/OL]. [2015-03-16]. http://www.gartner.com/it-glossary/dark-data.
|
|
G. M. Hu, L. Zhou, L. X. Ke. "Research on Hadoop-base Network Log Analysis System," [J]. Computer Knowledge and Technology,2010,6(22):6163-6164+6185.
|
|
D. X. Li. "Web Log Analysis Based on Data Mining," [J]. Computer Knowledge and Technology,2011,7(25):6074-6075+6078.
|
|
N. Kumar. "Approximate String Matching Algorithm," [J]. International Journal on Computer Science and Engineering,2010,2(3):641-644.
|
|
K. L. Shen, B. Shao, J. Du. "The Realization of Digital Resource Monitoring System Based on Network Log Analysis," [J]. RESEARCH ON LIBRARY SCIENCE,2015(16):21-25.
|
|
S. M. Xie. "Forum Log Analysis Based on The Big Data Processing Technology Hadoop," [D]. Jiangxi Agricultural University,2014.
|
|
Z. M. Xia, X. Liu. "A Similarity Algorithm for Chinese Text Based on Semantics," [J]. JI SUAN JI YU XIAN DAI HUA,2015(04):6-9.
|
|
X. J. Xiang, Y. Gao, L. Shang, Y. B. Yang. "Parallel Text Categorization of Massive Text Based on Hadoop," [J]. Computer Science,2011,38(10):184-188.
|
|
D. H. Yang, N. N. Li, H. Z. Wang, J. Z. Li, H. Gao. "The Optimization of the Big Data Cleaning Based on Task Merging," [J]. CHINESE JOURNAL OF COMPUTER,2016,39(01):97-108.
|
|
F. Y. Yang, H. C. Liu. "Research on Hadoop Base Online Network Log Analysis System," [J]. Computer Application and Software,2014,31(08):311-316.
|
|
Q. L. Yang. "Internet User Behavior Analysis Based on Web Log," [D]. Huazhong University of Science & Technology,2013.
|
|
L. L. Zhang. "Research and Implementation of Chinese Text Categorization Based on Hadoop and SVM Algorithms," [D]. Kunming University of Science and Technology,2015.
|
|
P. Z. Zou. "Website Evaluation Index and Construction Status Analysis," [J]. Computer CD Software and Application,2012,20:151-155.
|