[1] M. S. Chen, J. Han,P. S. Yu, “Data Mining: An Overview from a Database Perspective,” IEEE Transactions on Knowledge and Data Engineering, Vol. 8, No. 6, pp. 866-883, June 2002 [2] M. H. Dunham, “Data Mining Introductory and Advanced Topics,” Tsinghua University Press, Beijing, China, 2005 [3] Y. T.Zhang and L. Gong, “Data Mining Principles and Technology,” Electronic Industry Press, Beijing, China, 2004 [4] Q. J.Wang and L. F. Xue, “Data Mining Technology and Its Application in Geoscience,” World Geology, Vol. 19, No. 3, pp. 235-239, March 2000 [5] Z. Y. Li, H. Z. Wang, W. Shao, J. Z. Li,H. Gao, “Repairing Data through Regular Expressions,” PVLDB, Vol. 9, No. 5, pp. 432-443, 2016 [6] R. Agrawal, T. Imielinski,A. Swami, “Database Mining: A Performance Perspective,” IEEE Transactions on Knowledge and Data Engineering, Vol. 5, No. 6, pp. 914-925, June 2002 [7] “The Apache Software Foundation. Apache Hadoop 2.9.2,” (http://hadoop.apache.org/docs/stable/, accessed November 13, 2018 [8] Q. L. Han, S. Liang,H. L. Zhang, “Mobile Cloud Sensing, Big Data, and 5G Networks Make an Intelligent and Smart World,” IEEE Network, Vol. 29, No. 2, pp. 40-45, 2015 [9] Y. H. Huang, “In-Depth Understanding of Big Data: Big Data Processing and Programming Practice,” Machinery Industry Press, Beijing, China, 2014 [10] G. Holmes, A. Donkin,I. H. Witten, “WEKA: A Machine Learning Workbench,” inProceedings of the 2nd Australian and New Zealand Conference on Intelligent Information Systems, pp. 357-361, Brisbane, Australia, December 1994 [11] D. Talia, P. Trunfio,O. Verta, “Weka4WS: A WSRF-Enabled Weka Toolkit for Distributed Data Mining on Grids,” inProceedings of European Conference on Principles and Practice of Knowledge Discovery in Databases, pp. 309-320, Berlin, Germany, September 2005 [12] G. Liu, B. Hou,Z. W. Zhai, “Hadoop Open Source Cloud Computing Platform,” Beijing University of Posts and Telecommunications Press, Beijing, China, 2011 [13] Q. He, F. Z. Zhuang, L. Zeng, W. Z. Zhao,Q. Tan, “PDMiner: A Cloud Computing based Parallel and Distributed Data Mining Toolkit Platform,” Scientia Sinica, Vol. 44, No. 7, pp. 871-885, July 2014 [14] L. J. Yang, “Application of Data Mining Technology in Weather Data based on Hadoop Cloud Platform,” Beijing University of Posts and Telecommunications, 2015 [15] W. B. Pan, “Research and Application of Parallel K-Means Meteorological Data Mining based on Cloud Computing,” Nanjing University of Information Science and Technology, 2013 [16] L. M. Bao, “Research on Parallelization of Dynamic K-Means Algorithm in Remote Sensing Image Mining,” Nanjing University of Posts and Telecommunications, 2017 [17] Y. W. Li, “Research on File Copy Storage Improvement and Small File Merge and Access Optimization on Hadoop Platform,” Wuhan University of Technology, 2015 |