Int J Performability Eng ›› 2025, Vol. 21 ›› Issue (3): 131-140.doi: 10.23940/ijpe.25.03.p2.131140

Previous Articles     Next Articles

Optimizing Latent Dirichlet Allocation using Metaheuristic Technique: A Comparative Study

Sneh Prabha* and Neetu Sardana   

  1. Department of CSE and IT, Jaypee Institute of Information Technology, Noida, India
  • Submitted on ; Revised on ; Accepted on
  • Contact: * E-mail address: 19403032@mail.jiit.ac.in

Abstract: Community websites offer specialized online platforms for people to connect and share their knowledge about specific topics or objectives, encouraging deep engagement. A significant amount of unstructured text data can be analyzed to uncover valuable insights and trends. Latent Dirichlet Allocation (LDA) is a commonly used technique in topic modeling. However, many people use LDA with default parameters, leading to inaccurate and less cohesive topics. The selection of hyperparameters impacts the effectiveness of LDA models, and we seek to investigate the influence of the varied metaheuristic approaches on enhancing LDA's performance. In this study, we aim to analyze and compare five metaheuristic optimization algorithms for tuning the hyperparameters of the LDA model. We compare the effectiveness of Genetic Algorithms (GA), Particle Swarm Optimization algorithm (PSO), Grey Wolf Optimizer algorithm (GWO), Firefly Algorithm (FA), and Whale Optimization Algorithm (WOA). It has been found that GA is a superior metaheuristic in producing enhanced results from LDA. LDA+GA has achieved an improvement of 12.4 % over the baseline LDA technique. It has demonstrated a notable improvement in perplexity score compared to GA.

Key words: LDA, GA, PSO, GWO, topic modeling