Int J Performability Eng ›› 2018, Vol. 14 ›› Issue (11): 2808-2819.doi: 10.23940/ijpe.18.11.p27.28082819

Previous Articles     Next Articles

Prediction of Daily Pollen Concentration using Support Vector Machine and Particle Swarm Optimization Algorithm

Wenfang Zhaoa, b, Jingli Wanga, Dongchang Yub, *, and Ge Zhangc   

  1. a Institute of Urban Meteorology, China Meteorological Administration, Beijing, 100089, China;
    b Beijing Meteorological Information Center, Beijing Meteorological Bureau, Beijing, 100089, China;
    c Beijing Space Eye Innovation Technology Company, Beijing, 100089, China
  • Submitted on ;
  • Contact: * E-mail address: 67932323@qq.com
  • About author:Wenfang Zhao is a senior engineer at the Beijing Meteorological Information Center, Beijing Meteorological Bureau, Beijing, China. Her research interests include big data, cloud computers, machine learning, and meteorological big data analysis.Jingli Wang is a professor at the Institute of Urban Meteorology, China Meteorological Administration, Beijing, China. She is also a member of the Urban Meteorological Committee of China Meteorological Society. Her main research interests include atmospheric sounding, instrument development, and atmospheric environments.Dongchang Yu is a senior engineer at the Beijing Meteorological Information Center, Beijing Meteorological Bureau, Beijing, China. His current research interests include data mining and cloud computing.Ge Zhang is an advanced software engineer at Beijing Space Eye Innovation Technology Company. He received his Master's degree in GIS from Wuhan University in 2014. His current research interests include computer vision, data mining, machine learning, and artificial intelligence.

Abstract: In this paper, a support vector regression model for daily pollen concentration forecasting combined with the particle swarm optimization algorithm was proposed. Firstly, feature vector extraction was carried out by using the correlation analysis technique from meteorological data such as temperature, wind, relative humidity, precipitation, sunshine hours, and atmospheric pressure. Secondly, a support vector regression prediction model based on these vectors and pollen concentration observation data were established. Based on the Spark framework, a parallel particle swarm optimization algorithm was designed to optimize the parameters in the support vector regression algorithm, and then the optimal parameters were used to construct the daily pollen concentration prediction model. Finally, daily prediction of pollen concentration was made by using the optimized support vector regression model. The comparison among the accuracy of this optimized support vector regression model, the multiple linear regression (MLR) model, and the back propagation neural network (BPNN) model is performed to evaluate their performance. The results show that the proposed support vector regression model performs better than the MLR and BPNN models. Meanwhile, they also indicate that SVM provides promising results for prediction of daily pollen concentration.

Key words: pollen concentration, support vector regression, particle swarm optimization algorithm, Spark, pollen concentration forecast