Int J Performability Eng ›› 2019, Vol. 15 ›› Issue (11): 2860-2870.doi: 10.23940/ijpe.19.11.p4.28602870

Previous Articles     Next Articles

Hadoop-based Parallel Algorithm for Data Mining in Remote Sensing Images

Yanhua Wang, Yaqiu Liu*, and Weipeng Jing   

  1. School of Information and Computer Engineering, Northeast Forestry University, Harbin, 150040, China
  • Submitted on ; Revised on ; Accepted on
  • Contact: * E-mail address: Yaqiuliu@126.com
  • About author:Yanhua Wang is a Ph.D. student in the School of Information and Computer Engineering at Northeast Forestry University. Her research interests include cloud computing and software engineering. Yaqiu Liu is a professor in the School of Information and Computer Engineering at Northeast Forestry University. His research interests include cloud computing and software engineering.Weipeng Jing is an associate professor in the School of Information and Computer Engineering at Northeast Forestry University. His research interests include cloud computing and software engineering.

Abstract: As a typical distributed parallel computing model, cloud computing can greatly reduce the execution time of computing tasks. Remote sensing image data mining, an important part of data mining, plays a significant role in meteorological analysis and earthquake prediction. By constructing a Hadoop cloud computing platform, this paper studies the Hadoop-based parallel algorithm for remote sensing image data mining. In accordance with the Hadoop distributed computing framework, the parallel algorithm for remote sensing image data mining is realized through data preprocessing, image feature extraction, and clustering analysis. The main work of this paper includes image preprocessing, Hadoop-based parallelization of remote sensing image feature extraction, and a Hadoop-based parallel algorithm for remote sensing image data mining.

Key words: cloud computing, data mining, feature extraction