Skip to main content
Log in

Integrating breakdown detection into dialogue systems to improve knowledge management: encoding temporal utterances with memory attention

  • Published:
Information Technology and Management Aims and scope Submit manuscript

Abstract

With the increasing pervasiveness of smart phones and smart devices, dialogue systems are gaining ever growing attention from both academic and industry. These systems can be broadly classified into two categories, one that is aimed at helping user to gain new knowledge and one that can chat with users without completing any specific tasks. Although dialogue systems are improving substantially, the user experience of such systems are still unsatisfactory as there are no specific rules covering all possible situations of real human–machine dialogue, resulting in breakdowns. There are two technical issues affecting the detection of dialogue breakdown in an open domain conversation: human resources to prepare and annotate a large chunk of conversation data and dialogue histories containing words that don’t appear directly in training data. To tackle these issues, we propose a novel encoding method for temporal utterances with memory attention based on end-to-end dialogue breakdown detection. Specifically, long short-term memory (LSTM) is employed to encode each word of all previous user and system utterances. Encoded vectors from LSTM (user and system utterances), along with system and user utterances from sentence embedding, are then stored in memory wherein an attention mechanism is applied to select the most relevant piece of words from system and user utterances for breakdown detection. An evaluation of the proposed approach on a breakdown detection task (DBDC3) showed that the model for single-labeled breakdown detection outperforms other state-of-the-art methods in a classification task. In conclusion, a more effective knowledge gain and management can be achieved by integration of our proposed breakdown detection into dialogue systems.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1

Similar content being viewed by others

Notes

  1. We use the evaluation code provided at https://github.com/dbd-challenge/dbdc3/tree/master/prog.

References

  1. Nevo D, Furneaux B, Wand Y (2008) Towards an evaluation framework for knowledge management systems. Inf Technol Manag 9(4):233–249

    Article  Google Scholar 

  2. Piccoli G, Ahmad R, Ives B (2000) Knowledge management in academia: a proposed framework. Inf Technol Manag 1(4):229–245

    Article  Google Scholar 

  3. Martinovsky B, Traum D (2006) The error is the clue: breakdown in human–machine interaction. Technical report, University of Southern California Marina Del Reyca Inst for Creative Technologies

  4. Bickmore T, Cassell J (2001) Relational agents: a model and implementation of building user trust. In: Proceedings of the SIGCHI conference on human factors in computing systems. ACM, New York, pp 396–403

  5. Williams J, Young S (2007) Partially observable markov decision processes for spoken dialog systems. Comput Speech Lang 21(2):393–422

    Article  Google Scholar 

  6. Zhou X, Dong D, Wu H, Zhao S, Yu D, Tian H, Liu X, Yan R (2016) Multi-view response selection for human–computer conversation. In: Proceedings of the 2016 conference on empirical methods in natural language processing, pp 372–381

  7. Saito A, Iki T (2017) End-to-end character-level dialogue breakdown detection with external memory models. In: Proceedings of the dialog system technology challenges workshop (DSTC6)

  8. Bahdanau D, Cho K, Bengio Y (2014) Neural machine translation by jointly learning to align and translate. Preprint arXiv:1409.0473

  9. Serban IV, Sordoni A, Bengio Y, Courville AC, Pineau J (2016) Building end-to-end dialogue systems using generative hierarchical neural network models. AAAI 16:3776–3784

    Google Scholar 

  10. Hochreite S, Schmidhuber J (1997) Long short-term memory. Neural Comput 9(8):1735–1780

    Article  Google Scholar 

  11. Walker MA, Geary IL, Hastie HW, Wright J, Gorin A (2002) Automatically training a problematic dialogue predictor for a spoken dialogue system. J Artif Intell Res 16:293–319

    Article  Google Scholar 

  12. Higashinaka R, Funakoshi K, Kobayashi Y, Inaba M (2016) The dialogue breakdown detection challenge: task description, datasets, and evaluation metrics. In: LREC

  13. Higashinaka R, Funakoshi K, Araki M, Tsukahara H, Kobayashi Y, Mizukami M (2015) Towards taxonomy of errors in chat-oriented dialogue systems. In: Proceedings of the 16th annual meeting of the special interest group on discourse and dialogue, pp 87–95

  14. Bulyko I, Kirchhoff K, Ostendorf M, Goldberg J (2005) Error-correction detection and response generation in a spoken dialogue system. Speech Commun 45(3):271–288

    Article  Google Scholar 

  15. Black AW, Eskenazi M (2009) The spoken dialogue challenge. In: Proceedings of the SIGDIAL 2009 conference: the 10th annual meeting of the special interest group on discourse and dialogue. Association for Computational Linguistics, pp 337–340

  16. Kobayashi S, Unno Y, Fukuda M (2015) Multitask learning of recurrent neural network for detecting breakdowns of dialog and language modeling. JSAI technical report (SIG-SLUD-75-B502), pp 41–46

  17. Taniguchi R, Kano Y (2015) Construction of automatic detector for dialogue breakdowns based on rules with keywords extraction. JSAI technical report (SIG-SLUD-75-B502), pp 37–40

  18. Sugiyama H (2017) Dialogue breakdown detection based on estimating appropriateness of topic transition. In: Dialog system technology challenges (DSTC6)

  19. Piramuthu S (2005) Feature selection for reduction of tabular knowledge-based systems. Inf Technol Manag 6(4):351–362

    Article  Google Scholar 

  20. Park C, Kim K, Kim S (2017) Attention-based dialog embedding for dialog breakdown detection. In: Proceedings of the dialog system technology challenges workshop (DSTC6)

  21. Al-Rfou R, Perozzi B, Skiena S (2013) Polyglot: distributed word representations for multilingual NLP. Preprint arXiv:1307.1662

  22. Luong T, Pham H, Manning CD (2015) Effective approaches to attention-based neural machine translation. In: EMNLP. The Association for Computational Linguistics, pp 1412–1421

  23. Higashinaka R, Funakoshi K, Inaba M, Tsunomori Y, Takahashi T, Kaji N (2017) Overview of dialogue breakdown detection challenge 3. In: Proceedings of dialog system technology challenge, p 6

  24. Zeiler MD (2012) ADADELTA: an adaptive learning rate method. Preprint arXiv:1212.5701

  25. Lopes J (2017) “How generic can dialogue breakdown detection be?” the kth entry to DBDC3. In: Proceedings of dialog system technology challenge, vol 6

  26. Kato S, Sakai T (2017) RSL17BD at DBDC3: computing utterance similarities based on term frequency and word embedding vectors. In: Proceedings of DSTC6. http://workshop.colips. org/dstc6/papers/track3_paper13_kato.pdf

  27. Lee S, Hooshyar D, Ji H, Nam K, Lim H (2018) Mining biometric data to predict programmer expertise and task difficulty. Clust Comput 21(1):1097–1107

    Article  Google Scholar 

  28. Hooshyar D, Yousefi M, Lim H (2018) Data-driven approaches to game player modeling: a systematic literature review. ACM Comput Surv (CSUR) 50(6):90

    Article  Google Scholar 

Download references

Acknowledgements

This work was supported by a Ministry of Culture, Sports and Tourism (No. R2017030045).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Heuiseok Lim.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Lee, S., Lee, D., Hooshyar, D. et al. Integrating breakdown detection into dialogue systems to improve knowledge management: encoding temporal utterances with memory attention. Inf Technol Manag 21, 51–59 (2020). https://doi.org/10.1007/s10799-019-00308-x

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10799-019-00308-x

Keywords

Navigation