Abstract
What should dialogue systems do while looking for information or planning their next utterance? We conducted a study in which participants listened to (constructed) conversations between a user and an information system. In one condition, the system remained silent while preparing a reply, whereas in the other, it “bought time” conversationally, using strategies from previously recorded human interactions. Participants perceived the second system as better at responding within an appropriate amount of time. Additionally, we varied between mid- and high-quality voices, and found that the high-quality voice time-buying system was also seen as more willing to help, better at understanding and more human-like than the silent system. We speculate that participants may have perceived this voice as a better match for the more human-like behavior of the second system.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
- 2.
- 3.
The customers’ utterances were taken from the DSG-Travel corpus [9].
- 4.
We considered 12 seconds to be a realistic waiting period a relatively lengthy lookup might take, yet not so long that the WAIT strategy would obviously be disadvantaged.
- 5.
In this study, information about duration of the wait did not make perceived waiting time shorter than actual waiting time, but it did reduce overestimation of its length in comparison to other experimental conditions.
References
Antonides G, Verhoef P, van Aalst M (2002) Consumer perception and evaluation of waiting time: a field experiment. J Consum Psychol 12(3):193–202
Baumann T, Schlangen D (2013) Open-ended, extensible system utterances are preferred, even if they require filled pauses. In: Proceedings of short papers at SIGdial 2013
Betz S, Carlmeyer B, Wagner P, Wrede B (2017) Interactive hesitation synthesis and its evaluation. https://www.preprints.org/manuscript/201712.0058/v1
Buschmeier H, Baumann T, Dosch B, Kopp S, Schlangen D (2012) Combining incremental language generation and incremental speech synthesis for adaptive information presentation. In: Proceedings of the 13th annual meeting of the special interest group on discourse and dialogue, pp 295–303
Byron D, Heeman P (1997) Discourse marker use in task-oriented spoken dialog. In: Proceedings of Euro speech 97
Clark H, Fox Tree J (2002) Using uh and um in spontaneous speaking. Cognition 84(1):73–111
Edlund J, Gustafson J, Heldner M, Hjalmarsson A (2008) Towards human-like spoken dialogue systems. Speech Commun 50:630–645
Hirsch I, Bilger R, Heatherage B (1950) The effect of auditory and visual background on apparent duration. Am J Psychol, 69
Lopez Gambino S, Zarrieß S, Schlangen D (2017) Beyond on-hold messages: conversational time-buying in task-oriented dialogue. In: Proceedings of SIGdial 2017
Munichor N, Rafaeli A (2007) Numbers or apologies? customer reactions to telephone waiting time fillers. J Appl Psychol 92(2):511–518
Schlangen D, Skantze G (2011) A general, abstract model of incremental dialogue processing. Dialogue Discourse 2(1):83–111
Schröder M, Trouvain J (2003) The German text-to-speech synthesis system MARY: a tool for research, development and teaching. Int J Speech Technol 6:365–377
Skantze G, Hjalmarsson A (2010) Towards incremental speech generation in dialogue systems. In: Proceedings of the 11th annual meeting of the special interest group on discourse and dialogue, SIGDIAL ’10. Association for Computational Linguistics, Stroudsburg, PA, USA , pp 1–8
Tom G, Burns M, Zeng Y (1997) Your life on hold: the effect of telephone waiting time on customer perception. J Direct Mark 11(3):25–31
Walker M, Kamm C, Litman D (2000) Towards developing general models of usability with PARADISE. Nat Lang Eng 6:3–4
Whittaker S, Walker M (2005) Evaluating dialogue strategies in multimodal dialogue systems. In: Minker W, Bühler D (eds) Spoken multimodal human-computer dialogue in mobile environments. text, speech and language technology, vol 28
Acknowledgements
This work was supported by the Cluster of Excellence Cognitive Interaction Technology ‘CITEC’ (EXC 277) at Bielefeld University, which is funded by the German Research Foundation (DFG).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
López Gambino, S., Zarrieß, S., Schlangen, D. (2019). Testing Strategies For Bridging Time-To-Content In Spoken Dialogue Systems. In: D'Haro, L., Banchs, R., Li, H. (eds) 9th International Workshop on Spoken Dialogue System Technology. Lecture Notes in Electrical Engineering, vol 579. Springer, Singapore. https://doi.org/10.1007/978-981-13-9443-0_9
Download citation
DOI: https://doi.org/10.1007/978-981-13-9443-0_9
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-9442-3
Online ISBN: 978-981-13-9443-0
eBook Packages: Literature, Cultural and Media StudiesLiterature, Cultural and Media Studies (R0)