Testing Strategies For Bridging Time-To-Content In Spoken Dialogue Systems

López Gambino, Soledad; Zarrieß, Sina; Schlangen, David

doi:10.1007/978-981-13-9443-0_9

Part of the book series: Lecture Notes in Electrical Engineering (LNEE, volume 579)

Abstract

What should dialogue systems do while looking for information or planning their next utterance? We conducted a study in which participants listened to (constructed) conversations between a user and an information system. In one condition, the system remained silent while preparing a reply, whereas in the other, it “bought time” conversationally, using strategies from previously recorded human interactions. Participants perceived the second system as better at responding within an appropriate amount of time. Additionally, we varied between mid- and high-quality voices, and found that the high-quality voice time-buying system was also seen as more willing to help, better at understanding and more human-like than the silent system. We speculate that participants may have perceived this voice as a better match for the more human-like behavior of the second system.

Notes

1. URLs: https://www.mturk.com/, https://www.crowdflower.com, https://www.soscisurvey.de/.
2. http://mary.dfki.de/, https://www.cereproc.com/.
3. The customers’ utterances were taken from the DSG-Travel corpus [9].
4. We considered 12 seconds to be a realistic waiting period for a relatively lengthy lookup, yet not so long that the WAIT strategy would obviously be disadvantaged.
5. In this study, information about duration of the wait did not make perceived waiting time shorter than actual waiting time, but it did reduce overestimation of its length in comparison to other experimental conditions.

References

1. Antonides G, Verhoef P, van Aalst M (2002) Consumer perception and evaluation of waiting time: a field experiment. J Consum Psychol 12(3):193–202
2. Baumann T, Schlangen D (2013) Open-ended, extensible system utterances are preferred, even if they require filled pauses. In: Proceedings of short papers at SIGdial 2013
3. Betz S, Carlmeyer B, Wagner P, Wrede B (2017) Interactive hesitation synthesis and its evaluation. https://www.preprints.org/manuscript/201712.0058/v1
4. Buschmeier H, Baumann T, Dosch B, Kopp S, Schlangen D (2012) Combining incremental language generation and incremental speech synthesis for adaptive information presentation. In: Proceedings of the 13th annual meeting of the special interest group on discourse and dialogue, pp 295–303
5. Byron D, Heeman P (1997) Discourse marker use in task-oriented spoken dialog. In: Proceedings of Eurospeech 97
6. Clark H, Fox Tree J (2002) Using uh and um in spontaneous speaking. Cognition 84(1):73–111
7. Edlund J, Gustafson J, Heldner M, Hjalmarsson A (2008) Towards human-like spoken dialogue systems. Speech Commun 50:630–645
8. Hirsch I, Bilger R, Heatherage B (1950) The effect of auditory and visual background on apparent duration. Am J Psychol 69
9. López Gambino S, Zarrieß S, Schlangen D (2017) Beyond on-hold messages: conversational time-buying in task-oriented dialogue. In: Proceedings of SIGdial 2017
10. Munichor N, Rafaeli A (2007) Numbers or apologies? Customer reactions to telephone waiting time fillers. J Appl Psychol 92(2):511–518
11. Schlangen D, Skantze G (2011) A general, abstract model of incremental dialogue processing. Dialogue Discourse 2(1):83–111
12. Schröder M, Trouvain J (2003) The German text-to-speech synthesis system MARY: a tool for research, development and teaching. Int J Speech Technol 6:365–377
13. Skantze G, Hjalmarsson A (2010) Towards incremental speech generation in dialogue systems. In: Proceedings of the 11th annual meeting of the special interest group on discourse and dialogue, SIGDIAL ’10. Association for Computational Linguistics, Stroudsburg, PA, USA, pp 1–8
14. Tom G, Burns M, Zeng Y (1997) Your life on hold: the effect of telephone waiting time on customer perception. J Direct Mark 11(3):25–31
15. Walker M, Kamm C, Litman D (2000) Towards developing general models of usability with PARADISE. Nat Lang Eng 6:3–4
16. Whittaker S, Walker M (2005) Evaluating dialogue strategies in multimodal dialogue systems. In: Minker W, Bühler D (eds) Spoken multimodal human-computer dialogue in mobile environments. Text, speech and language technology, vol 28

Acknowledgements

This work was supported by the Cluster of Excellence Cognitive Interaction Technology ‘CITEC’ (EXC 277) at Bielefeld University, which is funded by the German Research Foundation (DFG).

Author information

Authors and Affiliations

CITEC Bielefeld University, Universitätsstraße 25, 33615, Bielefeld, Germany
Soledad López Gambino, Sina Zarrieß & David Schlangen

Corresponding author

Correspondence to Soledad López Gambino.

Editor information

Editors and Affiliations

Universidad Politécnica de Madrid, Madrid, Spain
Luis Fernando D'Haro
Nanyang Technological University, Singapore, Singapore
Rafael E. Banchs
Department of Electrical and Computer Engineering, National University of Singapore, Singapore, Singapore
Haizhou Li

Cite this paper

López Gambino, S., Zarrieß, S., Schlangen, D. (2019). Testing Strategies For Bridging Time-To-Content In Spoken Dialogue Systems. In: D'Haro, L., Banchs, R., Li, H. (eds) 9th International Workshop on Spoken Dialogue System Technology. Lecture Notes in Electrical Engineering, vol 579. Springer, Singapore. https://doi.org/10.1007/978-981-13-9443-0_9

DOI: https://doi.org/10.1007/978-981-13-9443-0_9
Published: 25 September 2019
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-9442-3
Online ISBN: 978-981-13-9443-0
eBook Packages: Literature, Cultural and Media Studies (R0)
