Int J Performability Eng ›› 2025, Vol. 21 ›› Issue (2): 65-73.doi: 10.23940/ijpe.25.02.p1.6573

• Original article •     Next Articles

Modeling Discourse for Dialogue Systems using Spectral Learning

Akanksha Mehndiratta* and Krishna Asawa   

  1. Department of Computer Science & Engineering and Information Technology, Jaypee Institute of Information Technology, Uttar Pradesh, India
  • Submitted on ; Revised on ; Accepted on
  • Contact: *E-mail address: akanksha.mehndiratta@mail.jiit.ac.in

Abstract: Technological advancements in deep-learning based models have resulted in the utilization of sophisticated architectures to model discourse in dialogue systems. However, despite their effectiveness, these systems lack transparency in the discourse modeling process, making them difficult to adapt and interpret. Recently, spectral-learning based algorithms have gained interest in data-driven approaches. This study proposes two spectral algorithms using canonical correlation analysis to develop two discourse modeling frameworks for a multi-turn retrieval-based dialogue system. The proposed spectral-learning based models learn discourse units that capture the intent/hidden associations in a conversation. Both models utilize the multi-view nature of textual data, i.e. each turn in a turn-taking conversation. The proposed variants aim to construct an attention state: the first develops a global attention state consisting of global discourse units abstracted to encapsulate the long-term dependencies in a conversation. On the other hand, the second model establishes a local attention state consisting of local discourse units conceptualized as the focus of attention for each utterance in a conversation. Subsequently, the models employ Word Movers Distance to measure the semantic distance between the established attention state and the candidate dialogues to retrieve the top-k-ranked candidate responses. The models are simple and adaptable, eliminating the need for a large labeled corpus. Qualitative and quantitative evaluations on the UBUNTU dataset demonstrate the efficacy of the proposed models.

Key words: canonical correlation analysis, discourse modeling, discourse units, intent, low resource language, dialog system