A Novel Attention-Based BiLSTM-CNN Model in Valence-Arousal Space

doi:10.23940/ijpe.22.12.p1.833843

Abstract

Abstract: This paper focuses on analyzing the text sentiment tendency based on the deep learning model and starts with improving the neural network model based on public corpora to provide fine-grained analysis of text sentiment tendency and more accurate predictions. In the existing research, the extraction and utilization of text emotional features are usually based on Valence-Arousal space (VA space), but they do not pay attention to some subjective text details with emotional tendencies, such as the punctuation marks or emotional words, which could in turn to decrease the prediction accuracies made by models. Aiming at this issue, this paper proposes a hybrid Bidirectional Long Short-term Memory (BiLSTM) and Convolution Neural Network (CNN) model with an attention mechanism. Notably, in order to make our models easier to be applied to some light-weight products, we adopt the most basic components of nature language process (NLP) models. Firstly, BiLSTM is used to extract bidirectional context dependency information, and an attention mechanism is exploited to assign different weights to words that play different roles in sentiment judgment. Further, CNN is used to extract the local features of the upper layer’s output to ensure the robustness of feature extraction. The experiment shows that the combination of the methods with this order, BiLSTM first and CNN later, can achieve results which are significantly better than the existing baseline models reported in the literature.

Key words: text sentiment intensity prediction, valence-arousal space, bidirectional long short-term memory, convolution neural network, attention mechanism

Guilan Dai, Jie Zhang, and Xu Han. A Novel Attention-Based BiLSTM-CNN Model in Valence-Arousal Space [J]. Int J Performability Eng, 2022, 18(12): 833-843.

Add to citation manager EndNote|Reference Manager|ProCite|BibTeX|RefWorks

References

1. X. Tan, M. Zhuang, X. Lu,T. Mao, “An analysis of the emotional evolution of large-scale internet public opinion events based on the bertlda hybrid model,” IEEE Access, vol. 9, pp. 15860-15871, January 2021.
2. K. S. Kumar, J. Desai,J. Majumdar, “Opinion mining and sentiment analysis on online customer review,” in 2016 IEEE International Conference on Computational Intelligence and Computing Research (ICCIC), IEEE, pp. 1-4, May 2016.
3. L. B.Batista and S. Ratte, “A multi-classifier system for sentiment´ analysis and opinion mining,” in Proceedings of the 2012 International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2012), pp. 96-100, August 2012.
4. Z. Xu, K. Dong,H. Zhu, “Text sentiment analysis method based on attention word vector,” in 2020 International Conference on Modern Education and Information Management (ICMEIM), pp. 500-504, September 2020.
5. W. Yue and L. Li, “Sentiment analysis using word2vec-cnn-bilstm classification,” in 2020 Seventh International Conference on Social Networks Analysis, Management and Security (SNAMS), pp. 1-5, December 2020.
6. S. R.Safavian and D. Landgrebe, “A survey of decision tree classifier methodology,” IEEE Transactions on Systems, Man, and Cybernetics, vol. 21, no. 3, pp. 660-674, June 1991.
7. D. Gokc¸ay, E.¨ ˙Is¸bilir,G. Yildirim, “Predicting the sentiment in sentences based on words: An exploratory study on anew and anet,” in 2012 IEEE 3rd International Conference on Cognitive Infocommunications (CogInfoCom). IEEE, pp. 715-718, December 2012.
8. S. Thavareesan and S. Mahesan, “Sentiment lexicon expansion using word2vec and fasttext for sentiment prediction in tamil texts,” in 2020 Moratuwa Engineering Research Conference (MERCon), pp. 272- 276, July 2020.
9. T. Xu, Q. Peng,Y. Cheng, “Identifying the semantic orientation of terms using s-hal for sentiment analysis,” Knowledge-Based Systems, vol. 35, pp. 279-289, September 2012.
10. Y. Kim, “Convolutional neural networks for sentence classification,” in Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 1746-1751, October 2014.
11. Z. Yang, D. Yang, C. Dyer, X. He, A. Smola,E. Hovy, “Hierarchical attention networks for document classification,” in Proceedings of the 2016 conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 1480-1489, January 2016.
12. J. Wang, L.-C. Yu, K. R. Lai, and X. Zhang, “Tree-structured regional cnn-lstm model for dimensional sentiment analysis,” IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 28, pp. 581- 591, January 2020.
13. J. A. Russell, “A circumplex model of affect.” Journal of Personality and Social Psychology, vol. 39, no. 6, pp. 1161-1178, December 1980.
14. W. J, “Sentiment analysis for chinese text in continuous valence-arousal space,” Ph.D. dissertation, Yunnan University, 2016.
15. C. Gao, X. Zhang,H. Liu, “Data and knowledge-driven named entity recognition for cyber security,” Cybersecurity, vol. 4, no. 1, pp. 1-13, December 2021.
16. R. Socher, A. Perelygin, J. Wu, J. Chuang, C. D. Manning, A. Y. Ng,C. Potts, “Recursive deep models for semantic compositionality over a sentiment treebank,” in Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 1631-1642, January 2013.
17. S. Hochreiter and J. Schmidhuber, “Long short-term memory,” Neural Computation, vol. 9, no. 8, pp. 1735-1780, December 1997.
18. H. Meisheri, R. Saha, P. Sinha,L. Dey, “Textmining at emoint2017: A deep learning approach to sentiment intensity scoring of english tweets,” in Proceedings of the 8th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis, pp. 193-199, September 2017.
19. R. Wang, Z. Li, J. Cao, T. Chen,L. Wang, “Convolutional recurrent neural networks for text classification,” in 2019 International Joint Conference on Neural Networks (IJCNN). IEEE, pp. 1-6, July 2019.
20. J. P.Chiu and E. Nichols, “Named entity recognition with bidirectional lstm-cnns,” Transactions of the association for computational linguistics, vol. 4, pp. 357-370, November 2016.
21. I. Priyadarshini and C. Cotton, “A novel lstm-cnn-grid search-based deep neural network for sentiment analysis,” The Journal of Supercomputing, vol. 77, no. 12, pp. 13911-13932, May 2021.
22. M. M.Bradley and P. J. Lang, “Affective norms for english words (anew): Instruction manual and affective ratings,” Technical report C-1, the center for research in psychophysiology, University of Florida., vol. 30, no. 1, pp. 25-36, January 1999.
23. G. Paltoglou, M. Theunis, A. Kappas,M. Thelwall, “Predicting emotional responses to long informal text,” IEEE Transactions on Affective Computing, vol. 4, no. 1, pp. 106-115, January 2012.
24. R. Wijayanti and A. Arisal, “Automatic indonesian sentiment lexicon curation with sentiment valence tuning for social media sentiment analysis,” ACM Transactions on Asian and Low-Resource Language Information Processing, vol. 20, no. 1, pp. 1-16, March 2021.
25. W. Jin, L. C. Yu, K. R. Lai,X. Zhang, “Community-based weighted graph model for valence-arousal prediction of affective words,” IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 24, no. 11, pp. 1957-1968, November 2016.
26. N. Malandrakis, A. Potamianos, E. Iosif,S. Narayanan, “Distributional semantic models for affective text analysis,” IEEE Transactions on Audio, Speech, and Language Processing, vol. 21, no. 11, pp. 2379- 2392, November 2013.
27. G. Paltoglou and M. Thelwall, “Seeing stars of valence and arousal in blog posts,” IEEE Transactions on Affective Computing, vol. 4, no. 1, pp. 116-123, January 2012.
28. T. Mikolov, I. Sutskever, K. Chen, G. Corrado,J. Dean, “Distributed representations of words and phrases and their compositionality,” arXiv preprint arXiv:1310.4546, October 2013.
29. Q. Le and T. Mikolov, “Distributed representations of sentences and documents,” in International Conference on Machine Learning. PMLR, pp. 1188-1196, May 2014.
30. J. Pennington, R. Socher,C. D. Manning, “Glove: Global vectors for word representation,” in Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 1532-1543, October 2014.
31. Y.-Y. Cheng, Y.-M. Chen, W.-C. Yeh, and Y.-C. Chang, “Valence and arousal-infused bi-directional lstm for sentiment analysis of government social media management,” Applied Sciences, vol. 11, no. 2, p. 880, January 2021.
32. U. Naseem, I. Razzak, K. Musial,M. Imran, “Transformer based deep intelligent contextual embedding for twitter sentiment analysis,” Future Generation Computer Systems, vol. 113, pp. 58-69, July 2020.
33. A. Meb, A. Sn, B. Ma, C. Ec,D. Ura, “Abcdm: An attentionbased bidirectional cnn-rnn deep model for sentiment analysis,” Future Generation Computer Systems, vol. 115, pp. 279-294, October 2021.
34. J. Zhang, C. Chen, P. Liu, C. He,W. K. Leung, “Target-guided structured attention network for target-dependent sentiment analysis,” Transactions of the Association for Computational Linguistics, vol. 8, no. 1, pp. 172-182, April 2020.
35. G. E. Hinton, N. Srivastava, A. Krizhevsky, I. Sutskever,R. R. Salakhutdinov, “Improving neural networks by preventing co-adaptation of feature detectors,” arXiv preprint arXiv:1207.0580, July 2012.
36. S. Ioffe and C. Szegedy, “Batch normalization: Accelerating deep network training by reducing internal covariate shift,” in International Conference on Machine Learning. PMLR, pp. 448-456, February 2015.
37. B. Liu, M. Hu,J. Cheng, “Opinion observer: analyzing and comparing opinions on the web,” in Proceedings of the 14th International Conference on World Wide Web, pp. 342-351, May 2005.
38. C. Hutto and E. Gilbert, “Vader: A parsimonious rule-based model for sentiment analysis of social media text,” in Proceedings of the International AAAI Conference on Web and Social Media, vol. 8, no. 1, pp.216-225, January 2014.
39. S. Buechel and U. Hahn, “Emobank: Studying the impact of annotation perspective and representation format on dimensional emotion analysis,” in Proceedings of European Association of Computational Linguistics, January 2017, pp. 778-785.
40. S. Buechel and U. Hahn, “Readers vs. writers vs. texts: Coping with different perspectives of text understanding in emotion annotation,” in Proceedings of the 11th Linguistic Annotation Workshop, pp. 1-12, April 2017.
41. C. Strapparava and R. Mihalcea, “Semeval-2007 task 14: Affective text,” in Proceedings of the Fourth International Workshop on Semantic Evaluations (SemEval-2007), pp. 70-74, June 2007.
42. N. Ide, C. Baker, C. Fellbaum,R. J. Passonneau, “The manually annotated sub-corpus: A community resource for and by the people,” in Proceedings of the ACL 2010 Conference Short Papers, pp. 68-73, January 2010.
43. J. H. Friedman, “On bias, variance, 0/1—loss, and the curse-of dimensionality,” Data Mining and Knowledge Discovery, vol. 1, no. 1, pp. 55-77, March 1997.
44. P. B. Nemenyi, “Distribution-free multiple comparisons,” Princeton University, 1963.

[1]	Shreshtha Singh and Arun Sharma. State of the Art Convolutional Neural Networks [J]. Int J Performability Eng, 2023, 19(5): 342-349.
[2]	Shalaka Prasad Deore. SongRec: A Facial Expression Recognition System for Song Recommendation using CNN [J]. Int J Performability Eng, 2023, 19(2): 115-121.
[3]	Sagnik Pal, Rutvik Patel, Vijayasherly V., and Ramani Selvanambi. Hashtag Recommendation System for Instagram Posts using Transfer Learning with EfficientNet and ALS Model [J]. Int J Performability Eng, 2022, 18(8): 552-558.
[4]	Pawan Wawage and Yogesh Deshpande. Real-Time Prediction of Car Driver’s Emotions using Facial Expression with a Convolutional Neural Network-based Intelligent System [J]. Int J Performability Eng, 2022, 18(11): 791-797.
[5]	J Akilandeswaria, G. Jothib, A Naveenkumara, R. S. Sabeenianc, P. Iyyanara, and M. E Paramasivamc . Detecting Pulmonary Embolism using Deep Neural Networks [J]. Int J Performability Eng, 2021, 17(3): 322-332.
[6]	Abdul Ghafoor Etemad, Ali Imam Abidi, and Megha Chhabra. Fine-Tuned T5 for Abstractive Summarization [J]. Int J Performability Eng, 2021, 17(10): 900-906.
[7]	Guanyu Chen, Zhihua Cai, and Xiang Li. Recognition and Classification of High Resolution Remote Sensing Image based on Convolutional Neural Network [J]. Int J Performability Eng, 2018, 14(11): 2852-2863.