Skip to main content

Incremental Human-Machine Dialogue Simulation

  • Chapter
  • First Online:
Dialogues with Social Robots

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 427))

Abstract

This chapter introduces a simulator for incremental human-machine dialogue in order to generate artificial dialogue datasets that can be used to train and test data-driven methods. We review the various simulator components in detail, including an unstable speech recognizer, and their differences with non-incremental approaches. Then, as an illustration of its capacities, an incremental strategy based on hand-crafted rules is implemented and compared to several non-incremental baselines. Their performances in terms of dialogue efficiency are presented under different noise conditions and prove that the simulator is able to handle several configurations which are representative of real usages.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 149.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 199.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    This rule is kept as an indication, it is the French way of telling time and it does not apply to English.

  2. 2.

    Here, the value of the priority variable designate its importance. Therefore, the higher the priority, the more important the task is.

  3. 3.

    We do not claim that the Intent Manager algorithm solves the task in an optimal way. Moreover, there are some pathological examples that are not handled at all. Yet it complies with our objective to have a simple algorithm able to run realistic dialogues to study turn-taking mechanisms.

  4. 4.

    This N-Best corresponds to the last input word only. It is important to make the distinction between this N-Best and the one corresponding to the last partial utterance as a whole. In Fig. 3, the block New word N-Best is a word N-Best whereas the other three blocks are partial utterances N-Best.

  5. 5.

    Here, we only use the best hypothesis of the N-Best. However, the others are indirectly used through the boost mechanism.

References

  1. Plátek, O., Jurčíček, F.: Free on-line speech recogniser based on Kaldi ASR toolkit producing word posterior lattices. In: Proceedings of the 15th Annual Meeting of the Special Interest Group on Discourse and Dialogue (SIGDIAL) (2014)

    Google Scholar 

  2. Allen, J., Ferguson, G., Stent, A.: An architecture for more realistic conversational systems. In: Proceedings of the 6th International Conference on Intelligent User Interfaces (2001)

    Google Scholar 

  3. Dohsaka, K., Shimazu, A.: A system architecture for spoken utterance production in collaborative dialogue. In: Working Notes of IJCAI 1997 Workshop on Collaboration, Cooperation and Conflict in Dialogue Systems (1997)

    Google Scholar 

  4. Skantze, G., Schlangen, D.: Incremental dialogue processing in a micro-domain. In: Proceedings of the 12th Conference of the European Chapter of the ACL (EACL) (2009)

    Google Scholar 

  5. Tanenhaus, M.K., Spivey-Knowlton, M.J., Eberhard, K.M., Sedivy, J.C.: Integration of visual and linguistic information in spoken language comprehension. Science 268, 1632–1634 (1995)

    Article  Google Scholar 

  6. Schlangen, D., Skantze, G.: A general, abstract model of incremental dialogue processing. Dialogue and Discourse 2, 83–111 (2011)

    Google Scholar 

  7. Sutton, R.S., Barto, A.G.: Reinforcement Learning, An Introduction. The MIT Press, Cambridge (1998)

    Google Scholar 

  8. Levin, E., Pieraccini, R.: A stochastic model of computer-human interaction for learning dialogue strategies. In: Proceedings of the 5th Biennial European Conference on Speech Communication and Technology (Eurospeech) (1997)

    Google Scholar 

  9. Lemon, O., Pietquin, O.: Data-Driven Methods for Adaptive Spoken Dialogue Systems. Springer Publishing Company, Incorporated (2012)

    Book  MATH  Google Scholar 

  10. Eckert, W., Levin, E., Pieraccini, R.: User modeling for spoken dialogue system evaluation. In: Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding (1997)

    Google Scholar 

  11. Pietquin, O., Hastie, H.: A survey on metrics for the evaluation of user simulations. Knowl. Eng. Rev. (2013)

    Google Scholar 

  12. Taylor, M.E., Stone, P.: Transfer learning for reinforcement learning domains: a survey. J. Mach. Learn. Res. 10, 1633–1685 (2009)

    Google Scholar 

  13. Selfridge, E.O., Heeman, P.A.: A temporal simulator for developing turn-taking methods for spoken dialogue systems. In: Proceedings of the 13th Annual Meeting of the Special Interest Group on Discourse and Dialogue (2012)

    Google Scholar 

  14. McGraw, I., Gruenstein, A.: Estimating word-stability during incremental speech recognition. In: Proceedings of the 12th Annual Conference of the International Speech Communication Association (Interspeech) (2012)

    Google Scholar 

  15. Selfridge, E.O., Arizmendi, I., Heeman, P.A., Williams, J.D.: Stability and accuracy in incremental speech recognition. In: Proceedings of the 12th Annual Meeting of the Special Interest Group on Discourse and Dialogue (SIGDIAL) (2011)

    Google Scholar 

  16. Khouzaimi, H., Laroche, R., Lefèvre, F.: An easy method to make dialogue systems incremental. In: Proceedings of the 15th Annual Meeting of the Special Interest Group on Discourse and Dialogue (SIGDIAL) (2014)

    Google Scholar 

  17. Khouzaimi, H., Laroche, R., Lefèvre, F.: Optimising turn-taking strategies with reinforcement learning. In: Proceedings of the 16th Annual Meeting of the Special Interest Group on Discourse and Dialogue (SIGDIAL) (2015)

    Google Scholar 

  18. Ghigi, F., Eskenazi, M., Torres, M.I., Lee, S.: Incremental dialog processing in a task-oriented dialog. In: Proceedings of the 15th Annual Conference of the International Speech Communication Association (Interspeech) (2014)

    Google Scholar 

  19. Yuan, J., Liberman, M., Cieri, C.: Towards an integrated understanding of speaking rate in conversation. In: Proceedings of the 9th International Conference on Spoken Language Processing (Interspeech-ICLSP) (2006)

    Google Scholar 

  20. Pietquin, O., Beaufort, R.: Comparing asr modeling methods for spoken dialogue simulation and optimal strategy learning. In: Proceedings of the 9th European Conference on Speech Communication and Technology (Eurospeech/Interspeech) (2005)

    Google Scholar 

  21. Jiang, H.: Confidence measures for speech recognition: a survey. Speech Commun. 45, 455–470 (2005)

    Article  Google Scholar 

  22. Seigel, M.S., Woodland, P.C.: Combining information sources for confidence estimation with crf models. In: Proceedings of the 11th Annual Conference of the International Speech Communication Association (Interspeech) (2011)

    Google Scholar 

  23. Nakano, M., Miyazaki, N., Hirasawa, J.I., Dohsaka, K., Kawabata, T.: Understanding unsegmented user utterances in real-time spoken dialogue systems. In: Proceedings of the 37th Annual Meeting of the Association for Computational Linguistics (ACL) (1999)

    Google Scholar 

  24. Clark, H.H.: Using Language. Cambridge University Press (1996)

    Google Scholar 

  25. Sacks, H., Schegloff, E.A., Jefferson, G.: A simplest systematics for the organization of turn-taking for conversation. Language 50, 696–735 (1974)

    Article  Google Scholar 

  26. Khouzaimi, H., Laroche, R., Lefèvre, F.: Turn-taking phenomena in incremental dialogue systems. In: Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing (2015)

    Google Scholar 

  27. DeVault, D., Sagae, K., Traum, D.: Incremental interpretation and prediction of utterance meaning for interactive dialogue. Dialogue and Discourse 2, 143–170 (2011)

    Google Scholar 

  28. El Asri, L., Lemonnier, R., Laroche, R., Pietquin, O., Khouzaimi, H.: NASTIA: Negotiating Appointment Setting Interface. In: Proceedings of the 9th International Conference on Language Resources and Evaluation (LREC) (2014)

    Google Scholar 

  29. Selfridge, E.O., Arizmendi, I., Heeman, P.A., Williams, J.D.: Continuously predicting and processing barge-in during a live spoken dialogue task. In: Proceedings of the 14th Annual Meeting of the Special Interest Group on Discourse and Dialogue (SIGDIAL) (2013)

    Google Scholar 

Download references

Acknowledgements

This work is part of the FUI project VoiceHome.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hatim Khouzaimi .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer Science+Business Media Singapore

About this chapter

Cite this chapter

Khouzaimi, H., Laroche, R., Lefèvre, F. (2017). Incremental Human-Machine Dialogue Simulation. In: Jokinen, K., Wilcock, G. (eds) Dialogues with Social Robots. Lecture Notes in Electrical Engineering, vol 427. Springer, Singapore. https://doi.org/10.1007/978-981-10-2585-3_4

Download citation

  • DOI: https://doi.org/10.1007/978-981-10-2585-3_4

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-10-2584-6

  • Online ISBN: 978-981-10-2585-3

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics