Int J Performability Eng ›› 2024, Vol. 20 ›› Issue (3): 139-148.doi: 10.23940/ijpe.24.03.p2.139148

Previous Articles     Next Articles

Hybrid Technique of Topic Modelling and Text Summarization: A Case Study on Predicting Trends in Green Computing

Mansi Pandeya,b, Chetan Sharmab,*, Shamneesh Sharmab, and Trapty Aggarwala   

  1. aSchool of Data Science, Maharishi University of Information Technology, Uttar Pradesh, India;
    bupGrad Campus, upGrad Education Private Limited, Karnataka, India
  • Submitted on ; Revised on ; Accepted on
  • Contact: *E-mail address: chetanshekhu@gmail.com

Abstract: Text mining techniques are used for trend prediction using keyword analysis; however, these processes result in the formation of relevant and irrelevant keywords. Based on the keywords, various clusters are formed, resulting in various topics. Due to the presence of irrelevant keywords, there are chances of the formation of wrong topics. To overcome this problem, this research contributes to developing an algorithm that deals with topic prediction using a noble technique wherein text summarization is inculcated into topic modeling algorithms. This research focuses on implementing text summarizing to generate summaries of published publications by diverse researchers using the Genism library with an extractive text summarization approach and then applying text mining to it to predict trends in various fields. The current approach was compared with existing techniques based on the parameters used in automatic and semi-automatic text mining techniques.

Key words: natural language processing (NLP), text summarization, latent semantic analysis (LSA), latent dirichlet allocation (LDA), extractive text summarization (ETS), abstractive text summarization (ATS)