Skip to main content

Part of the book series: Springer Handbooks ((SHB))

Abstract

Spoken dialogue systems are a new breed of interfaces that enable humans to communicate with machines naturally and efficiently using a conversational paradigm. Such a system makes use of many human language technology (HLT) components, including speech recognition and synthesis, natural language understanding and generation, discourse modeling, and dialogue management. In this contribution, we introduce the nature of these interfaces, describe the underlying HLTs on which they are based, and discuss some of the development issues. After providing a historical perspective, we outline some new research directions.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 579.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 729.00
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Abbreviations

ARISE:

Automatic Railway Information Systems for Europe

ASR:

automatic speech recognition

ATN:

augmented transition networks

BBN:

Bolt, Beranek and Newman

DARPA:

Defense Advanced Research Projects Agency

FST:

finite state transducer

HLT:

human language technologies

ISU:

information state update

IVR:

interactive voice response

MDP:

Markov decision process

NLG:

natural language generation

NLU:

natural language understanding

SLS:

spoken language system

SUNDIAL:

speech understanding and dialog

TTS:

text-to-speech

WER:

word error rate

References

  1. E. Barnard, A. Halberstadt, C. Kotelly, M. Phillips: A consistent approach to designing spoken-dialog systems, Proc. ASRU Workshop (ASRU, Keystone 1999)

    Google Scholar 

  2. S.J. Boyce: Natural spoken dialogue systems for telephony applications, Commun. ACM 43(9), 29-34 (2000)

    Article  Google Scholar 

  3. A. Raux, B. Langner, D. Bohus, A. Black, M. Eskenazi: Letʼs Go public! Taking a spoken dialog system to the real world, Proc. Interspeech (2005) pp. 885-888

    Google Scholar 

  4. A. Raux, B. Langner, A. Black, M. Eskenazi: Letʼs Go: Improving spoken dialog systems for the elderly and non-natives, Proc. Interspeech (2003) pp. 753-756

    Google Scholar 

  5. G. Flammia: Discourse Segmentation of Spoken Dialogue: An Empirical Approach. Ph.D. Thesis (MIT, Cambridge 1998)

    Google Scholar 

  6. S. Seneff, R. Lau, J. Glass, J. Polifroni: The mercury system for flight browsing and pricing, MIT Spoken Language System Group Annual Progress Report (1999) pp. 23-28

    Google Scholar 

  7. J. Glass, G. Flammia, D. Goodine, M. Phillips, J. Polifroni, S. Sakai, S. Seneff, V. Zue: Multilingual spoken-language understanding in the MIT voyager system, Speech Commun. 17, 1-18 (1995)

    Article  Google Scholar 

  8. R. Lau, G. Flammia, C. Pao, V. Zue: WebGalaxy - Integrating spoken language and hypertext navigation, Proc. Eurospeech (1997) pp. 883-886

    Google Scholar 

  9. H. Meng, S. Busayapongchai, J. Glass, D. Goddeau, L. Hetherington, E. Hurley, C. Pao, J. Polifroni, S. Seneff, V. Zue: A conversational system in the automobile classifieds domain, Proc. ICSLP (1996) pp. 542-545

    Google Scholar 

  10. S. Seneff, V. Zue, J. Polifroni, C. Pao, L. Hetherington, D. Goddeau, J. Glass: The preliminary development of a displayless Pegasus system, Proc. ARPA Spoken Language Technology Workshop (1995) pp. 212-217

    Google Scholar 

  11. A. Gruenstein, S. Seneff, C. Wang: Scalable and portable web-based multimodal dialogue interaction with geographical databases, Proc. Interspeech (2006)

    Google Scholar 

  12. W. Wang, J. Glass, H. Meng, J. Polifroni, S. Seneff, V. Zue: Yinhe: A Mandarin Chinese version of the galaxy system, Proc. Eurospeech (1997) pp. 351-354

    Google Scholar 

  13. V. Zue, S. Seneff, J. Polifroni, M. Phillips, C. Pao, D. Goddeau, J. Glass, E. Brill: Pegasus: A spoken language interface for on-line air travel planning, Speech Commun. 15, 331-340 (1994)

    Article  Google Scholar 

  14. V. Zue, S. Seneff, J. Glass, J. Polifroni, C. Pao, T. Hazen, L. Hetherington: JUPITER: A telephone-based conversational interface for weather information, IEEE Trans. Speech Audio. Process. 8(1), 85-96 (2000)

    Article  Google Scholar 

  15. P. Dalsgaard, L. Larsen, I. Thomsen (Eds.): Proc. ESCA Tutorial and Research Workshop on Spoken Dialogue Systems: Theory and Application (1995)

    Google Scholar 

  16. P. Cohen, M. Johnson, D. McGee, S. Oviatt, J. Clow, I. Smith: The efficiency of multimodal interaction: A case study, Proc. ICSLP (1998) pp. 249-252

    Google Scholar 

  17. S. Seneff, D. Goddeau, C. Pao, J. Polifroni: Multimodal discourse modelling in a multi-user multi-domain environment, Proc. ICSLP (1996) pp. 188-191

    Google Scholar 

  18. D. Massaro: Perceiving Talking Faces: From Speech Perception to a Behavioral Principle (MIT Press, Cambridge 1997)

    Google Scholar 

  19. W. Ward: Modelling non-verbal sounds for speech recognition, Proc. DARPA Workshop on Speech and Natural Language (1989) pp. 47-50

    Google Scholar 

  20. J. Butzberger, H. Murveit, M. Weintraub: Spontaneous speech effects in large vocabulary speech recognition applications, Proc. ARPA Workshop on Speech and Natural Language (1992) pp. 339-344

    Google Scholar 

  21. L. Hetherington, V. Zue: New words: Implications for continuous speech recognition, Proc. Eurospeech (1991) pp. 475-931

    Google Scholar 

  22. A. Asadi, R. Schwartz, J. Makhoul: Automatic modelling for adding new words to a large vocabulary continuous speech recognition system, Proc. ICASSP (1991) pp. 305-308

    Google Scholar 

  23. G. Chung, S. Seneff, C. Wang: Automatic acquisition of names using speak and spell mode in spoken dialogue systems, Proc. HLT-NAACL (2003) pp. 197-200

    Google Scholar 

  24. C. Pao, P. Schmid, J. Glass: Confidence scoring for speech understanding systems, Proc. ICSLP (1998) pp. 815-818

    Google Scholar 

  25. M. Gabsdil, O. Lemon: Combining acoustic and pragmatic features to predict recognition performance in spoken dialogue systems, Proc. ACL (2004)

    Google Scholar 

  26. A. Kellner, B. Rueber, H. Schramm: Using combined decisions and confidence measures for name recognition in automatic directory assistance systems, Proc. ICSLP (1998) pp. 2859-2862

    Google Scholar 

  27. D. Bohus, A. Rudnicky: Constructing accurate beliefs in spoken dialog systems, Proc. ASRU (2005) pp. 272-277

    Google Scholar 

  28. V. Souvignier, A. Kellner, B. Rueber, H. Schramm, F. Seide: The thoughtful elephant: Strategies for spoken dialogue systems, IEEE T. Speech Audi. P. 8(1), 51-62 (2000)

    Article  Google Scholar 

  29. R. Billi, R. Canavesio, C. Rullent: Automation of Telecom Italia Directory Assistance Service: Field trial results, Proc. IVTTA (1998) pp. 11-16

    Google Scholar 

  30. R.J. Lippmann: Speech recognition by humans and machines, Speech Commun. 22(1), 1-15 (1997)

    Article  Google Scholar 

  31. R. Bobrow, R. Ingria, D. Stallard: Syntactic and semantic knowledge in the DELPHI unification grammar, Proc. DARPA Speech and Natural Language Workshop (1990) pp. 230-236

    Google Scholar 

  32. S. Seneff: TINA: A natural language system for spoken language applications, Comput. Linguist. 18(1), 61-86 (1992)

    Google Scholar 

  33. J. Dowding, J. Gawron, D. Appelt, J. Bear, L. Cherny, R. Moore, D. Moran: Gemini: A natural language system for spoken language understanding, Proc. ARPA Workshop on Human Language Technology (1993) pp. 21-24

    Google Scholar 

  34. W. Ward: The CMU air travel information service: Understanding spontaneous speech, Proc. ARPA Workshop on Speech and Natural Language (1990) pp. 127-129

    Google Scholar 

  35. E. Jackson, D. Appelt, J. Bear, R. Moore, A. Podlozny: A template matcher for robust NL interpretation, Proc. DARPA Speech and Natural Language Workshop (1991) pp. 190-194

    Google Scholar 

  36. S. Seneff: Robust parsing for spoken language systems, Proc. ICASSP (1992) pp. 189-192

    Google Scholar 

  37. D. Stallard, R. Bobrow: Fragment processing in the DELPHI system, Proc. DARPA Speech and Natural Language Workshop (1992) pp. 305-310

    Google Scholar 

  38. A. Gorin, G. Riccardi, J. Wright: How may I help you?, Speech Commun. 23, 113-127 (1997)

    Article  MATH  Google Scholar 

  39. S. Miller, R. Schwartz, R. Bobrow, R. Ingria: Statistical language processing using hidden understanding models, Proc. ARPA Speech and Natural Language Workshop (1994) pp. 278-282

    Google Scholar 

  40. Y. Chow, R. Schwartz: The N-Best Algorithm: An efficient procedure for finding top N sentence hypotheses, Proc. ARPA Workshop on Speech and Natural Language (1989) pp. 199-202

    Google Scholar 

  41. L. Hetherington, M. Phillips, J. Glass, V. Zue: A word network search for continuous speech recognition, Proc. Eurospeech (1993) pp. 1533-1536

    Google Scholar 

  42. L. Mangu, E. Brill, A. Stolcke: Finding consensus among words: Lattice-based word error minimization, Proc. Eurospeech (1999)

    Google Scholar 

  43. D. Goodine, S. Seneff, L. Hirschman, M. Phillips: Full integration of speech and language understanding in the MIT spoken language system, Proc. Eurospeech (1991) pp. 845-848

    Google Scholar 

  44. D. Goddeau: Using probabilistic shift-reduce parsing in speech recognition systems, Proc. ICSLP (1992) pp. 321-324

    Google Scholar 

  45. W. Ward: Integrating semantic constraints into the SPHINX-II recognition search, Proc. ICASSP (1994) pp. 17-20

    Google Scholar 

  46. R. Moore, D. Appelt, J. Dowding, J. Gawron, D. Moran: Combining linguistic and statistical knowledge sources in natural-language processing for ATIS, Proc. ARPA Spoken Language Systems Workshop (1995) pp. 261-264

    Google Scholar 

  47. E. Reiter, R. Dale: Building Natural Language Generation Systems (Cambridge Univ. Press, Cambridge 2000)

    Book  Google Scholar 

  48. D. McDonald, L. Bolc (Eds.): Natural Language Generation Systems (Symbolic Computation Artificial Intelligence) (Springer, Heidelberg, Berlin 1998)

    Google Scholar 

  49. J. Glass, J. Polifroni, S. Seneff: Multilingual language generation across multiple domains, Proc. ICSLP (1994)

    Google Scholar 

  50. A. Oh: Stochastic Natural Language Generation for Spoken Dialog Systems. M.S. Thesis (CMU, Mt. Pleasant 2000)

    Google Scholar 

  51. V. Demberg, J. Moore: Information presentation in spoken dialogue systems, Proc. EACL (2006)

    Google Scholar 

  52. D. Klatt: Review of text-to-speech conversion for English, J. Acoust. Soc. Am. 82(3), 737-793 (1987)

    Article  Google Scholar 

  53. Y. Sagisaka, N. Kaiki, N. Iwahashi, K. Mimura: ATR ν -talk speech synthesis system, Proc. ICSLP (1992) pp. 483-486

    Google Scholar 

  54. M. Beutnagel, A. Conkie, J. Schroeter, Y. Stylianou, A. Syrdal: The AT&T next-gen TTS system, Proc. ASA (ASA, Berlin 1999)

    Google Scholar 

  55. X. Huang, A. Acero, J. Adcock, H.W. Hon, J. Goldsmith, J. Liu, M. Plumpe: Whistler: A trainable text-to-speech system, Proc. ICSLP (1996) pp. 2387-2390

    Google Scholar 

  56. J. van Santen, L. Pols, M. Abe, D. Kahn, E. Keller, J. Vonwiller: Report on the 3rd ESCA TTS workshop evaluation procedure, Proc. 3rd ESCA Workshop on Speech Synthesis (1998) pp. 329-332

    Google Scholar 

  57. K. McKeown, S. Pan, J. Shaw, D. Jordan, B. Allen: Language generation for multimedia healthcare briefings, Proc. Applied Natural Language Proc (1997)

    Google Scholar 

  58. A. Black, K. Lenzo: Limited domain synthesis, Proc. ICSLP (2000)

    Google Scholar 

  59. R. Sproat, A. Hunt, M. Ostendorf, P. Taylor, A. Black, K. Lenzo, M. Edgington: Sable: A standard for TTS markup, Proc. ICSLP (1998) pp. 1719-1722

    Google Scholar 

  60. B. Grosz, C. Sidner: Plans for discourse, Intentions in Communication (MIT Press, 1990)

    Google Scholar 

  61. D. Thomson, J. Wisowaty: User confusion in natural language services, Proc. ESCA Workshop on Interactive Dialogue in Multi-Modal Systems (1999) pp. 189-196

    Google Scholar 

  62. D. Sadek: Design considerations on dialogue systems: From theory to technology - The case of Artimis, Proc. ESCA Workshop on Interactive Dialogue in Multi-Modal Systems (1999) pp. 173-188

    Google Scholar 

  63. L. Boves, E. den Os: Applications of speech technology: Designing for usability, Proc. IEEE Worshop on ASR and Understanding (1999) pp. 353-362

    Google Scholar 

  64. J. Allen, L. Schubert, G. Ferguson, P. Heeman, C. Hwang, T. Kato, W. Light, N. Martin, B. Miller, M. Poesio, D. Traum: The TRAINS project: A case study in defining a conversational planning agent, J. Exp. Theor. Artif. Intell. 7, 7-48 (1995)

    Article  MATH  Google Scholar 

  65. M. Denecke, A. Waibel: Dialogue strategies guiding users to their communicative goals, Proc. Eurospeech (1997) pp. 2227-2230

    Google Scholar 

  66. L. Devillers, H. Bonneau-Maynard: Evaluation of dialog strategies for a tourist information retrieval system, Proc. ICSLP (1998) pp. 1187-1190

    Google Scholar 

  67. S. Rosset, S. Bennacef, L. Lamel: Design strategies for spoken language dialog systems, Proc. Eurospeech (1999) pp. 1535-1538

    Google Scholar 

  68. A. Rudnicky, E. Thayer, P. Constantinides, C. Tchou, R. Shern, K. Lenzo, W. Xu, A. Oh: Creating natural dialogs in the Carnegie Mellon communicator system, Proc. Eurospeech (1999) pp. 1531-1534

    Google Scholar 

  69. E. Filisko, S. Seneff: Error detection and recovery in spoken dialogue systems, Proc. Workshop on Spoken Language Understanding for Conversational Systems and Higher Level Linguistic Knowledge for Speech Processing (2004)

    Google Scholar 

  70. G. Di Fabbrizio, C. Lewis: Florence: A dialogue manager framework for spoken dialogue systems, Proc. ICSLP (2004)

    Google Scholar 

  71. D. Goddeau, H. Meng, J. Polifroni, S. Seneff, S. Busayapongchai: A form-based dialogue manager for spoken language applications, Proc. ICSLP (1996) pp. 701-704

    Google Scholar 

  72. R. Pieraccini, E. Levin, W. Eckert: AMICA: The AT&T mixed initiative conversational architecture, Proc. Eurospeech (1997) pp. 1875-1879

    Google Scholar 

  73. S. Seneff: Response planning and generation in the MERCURY flight reservation system, Comput. Speech Lang., Vol. 16 (2002) pp. 283-312

    Google Scholar 

  74. P. Constantinides, S. Hansma, A. Rudnicky: A schema-based approach to dialog control, Proc. ICSLP (1998) pp. 409-412

    Google Scholar 

  75. X. Wei, A. Rudnicky: Task-based dialog management using an agenda, Proc. ANLP/NAACL Workshop on Conversational Systems (2000)

    Google Scholar 

  76. J. Allen, D. Byron, M. Dzikovska, G. Ferguson, L. Galescu, A. Stent: An architecture for a generic dialogue shell, Nat. Lang. Eng. 6(3), 1-16 (2000)

    Google Scholar 

  77. E. den Os, L. Boves, L. Lamel, P. Baggia: Overview of the ARISE project, Proc. Eurospeech (1999) pp. 1527-1530

    Google Scholar 

  78. T. Kemp, A. Waibel: Unsupervised training of a speech recognizer: Recent experiments, Proc. Eurospeech (1999) pp. 2725-2728

    Google Scholar 

  79. A. Rudnicky, M. Sakamoto, J. Polifroni: Evaluating spoken language interaction, Proc. DARPA Speech and Natural Language Workshop (1989) pp. 150-159

    Google Scholar 

  80. L. Lamel, S. Bennacef, J.L. Gauvain, H. Dartigues, J. Temem: User evaluation of the mask kiosk, Proc. ICSLP (1998) pp. 2875-2878

    Google Scholar 

  81. M. Walker, D. Litman, C. Kamm, A. Abella: PARADISE: A general framework for evaluating spoken dialogue agents, Proc. ACL/EACL (1997) pp. 271-280

    Google Scholar 

  82. M. Walker, J. Boland, C. Kamm: The utility of elapsed time as a usability metric for spoken dialogue systems, Proc. ASRU Workshop (ASRU, Keystone 1999) pp. 1167-1170

    Google Scholar 

  83. L. Hirschman: Multi-site data collection for a spoken language corpus, Proc. DARPA Workshop on Speech and Natural Language (1992) pp. 7-14

    Google Scholar 

  84. J. Polifroni, S. Seneff, J. Glass, T. Hazen: Evaluation methodology for a telephone-based conversational system, Proc. Int. Conf. on Lang. Resources and Evaluation (1998) pp. 42-50

    Google Scholar 

  85. D. Pallett, J. Fiscus, W. Fisher, J. Garafolo, B. Lund, A. Martin, M. Pryzbocki: Benchmark tests for the ARPA spoken language program, Proc. ARPA Spoken Language Systems Technology Workshop (1995) pp. 5-36

    Google Scholar 

  86. J. Peckham: A new generation of spoken dialogue systems: Results and lessons from the SUNDIAL project, Proc. Eurospeech (1993) pp. 33-40

    Google Scholar 

  87. M. Walker, J. Aberdeen, J. Boland, E. Bratt, J. Garofolo, L. Hirschman, A. Le, S. Lee, S. Narayanan, K. Papineni, B. Pellom, J. Polifroni, A. Potamianos, P. Prabhu, A. Rudnicky, G. Sanders, S. Seneff, D. Stallard, S. Whittaker: DARPA communicator dialog travel planning systems: The June 2000 data collection, Proc. Eurospeech (2001)

    Google Scholar 

  88. M. Walker, A. Rudnicky, R. Prasad, J. Aberdeen, E.O. Bratt, J. Garofolo, H. Hastie, A. Le, B. Pellom, A. Potamianos, R. Passonneau, S. Roukos, G. Sanders, S. Seneff, D. Stallard: DARPA communicator: Cross-system results for the 2001 evaluation, Proc. ICSLP (2002) pp. 273-276

    Google Scholar 

  89. S. Seneff, E. Hurley, R. Lau, C. Pao, P. Schmid, V. Zue: GALAXY-II: A reference architecture for conversational system development, Proc. ICSLP (1998) pp. 931-934

    Google Scholar 

  90. M. Eskenazi, A. Rudnicky, K. Gregory, P. Constantinides, R. Brennan, C. Bennett, J. Allen: Data collection and processing in the Carnegie Mellon communicator, Proc. Eurospeech (1999) pp. 2695-2698

    Google Scholar 

  91. D. Jurafsky, C. Wooters, G. Tajchman, J. Segal, A. Stolcke, E. Fosler, N. Morgan: The Berkeley restaurant project, Proc. ICSLP (1994) pp. 2139-2142

    Google Scholar 

  92. M. Blomberg, R. Carlson, K. Elenius, B. Granstrom, J. Gustafson, S. Hunnicutt, R. Lindell, L. Neovius: An experimental dialogue system: Waxholm, Proc. Eurospeech (1993) pp. 1867-1870

    Google Scholar 

  93. H. Aust, M. Oerder, F. Seide, V. Steinbiss: The Philips automatic train timetable information system, Speech Commun. 17, 249-262 (1995)

    Article  MATH  Google Scholar 

  94. V. Zue, J. Glass: Conversational interfaces: Advances and challenges, Proc. IEEE, Vol. 88 (2000), Special Issue on Spoken Language Processing

    Google Scholar 

  95. J. Kowtko, P. Price: Data collection and analysis in the air travel planning domain, Proc. DARPA Speech and Natural Language Workshop (1989)

    Google Scholar 

  96. G. Castagnieri, P. Baggia, M. Danieli: Field trials of the Italian ARISE train timetable system, Proc. IVTTA (1998) pp. 97-102

    Google Scholar 

  97. M. Cohen, Z. Rivlin, H. Bratt: Speech recognition in the ATIS domain using multiple knowledge sources, Proc. DARPA Spoken Language Systems Technology Workshop (1995) pp. 257-260

    Google Scholar 

  98. L. Lamel, S. Rosset, J.L. Gauvain, S. Bennacef, M. Garnier-Rizet, B. Prouts: The LIMSI ARISE system, Proc. IVTTA (1998) pp. 209-214

    Google Scholar 

  99. A. Sanderman, J. Sturm, E. den Os, L. Boves, A. Cremers: Evaluation of the Dutch train timetable information system developed in the ARISE project, Proc. IVTTA (1998) pp. 91-96

    Google Scholar 

  100. J. Sturm, E. den Os, L. Boves: Dialogue management in the Dutch ARISE train timetable information system, Proc. Eurospeech (1999) pp. 1419-1422

    Google Scholar 

  101. R. Carlson, S. Hunnicutt: Generic and domain- specific aspects of the Waxholm NLP and dialogue modules, Proc. ICSLP (1996) pp. 677-680

    Google Scholar 

  102. B. Buntschuh: VPQ: A spoken language interface to large scale directory information, Proc. ICSLP (1998) pp. 2863-2866

    Google Scholar 

  103. M. Johnston, S. Bangalore, G. Vasireddy, A. Stent, P. Ehlen, M. Walker, S. Whattaker, P. Maloor: MATCH: An architecture for multimodal dialogue systems, Proc. ACL (2002) pp. 376-383

    Google Scholar 

  104. S. Oviatt: Multimodal interfaces for dynamic interactive maps, Proc. CHI (1996) pp. 95-102

    Google Scholar 

  105. E. Levin, R. Pieraccini, W. Eckert: A stochastic model of human-machine interaction for learning dialogue strategies, IEEE T. Speech Audi. P. 8(1), 11-23 (2000)

    Article  Google Scholar 

  106. W. Eckert, E. Levin, R. Pieraccini: User modelling for spoken dialogue system evaluation, Proc. ASRU (1997) pp. 80-87

    Google Scholar 

  107. R. Lopez-Cozar, A. De la Torre, J.C. Segura, A.J. Rubio: Assessment of dialogue systems by means of a new simulation technique, Speech Commun. 40, 387-407 (2003)

    Article  Google Scholar 

  108. G. Chung: Developing a flexible spoken dialog system using simulation, Proc. ACL (2004) pp. 63-70

    Google Scholar 

  109. G. Chung, S. Seneff, C. Wang: Automatic induction of language model data for a spoken dialogue system, Proc. 6th SIGdial Workshop on Discourse and Dialogue Lisbon (2005)

    Google Scholar 

  110. E. Filisko, S. Seneff: Learning decision models in spoken dialogue systems via user simulation, Proc. AAAI Workshop: Statistical and Empirical Approaches for Spoken Dialog Systems (2006)

    Google Scholar 

  111. J. Schatzmann, K. Georgila, S. Young: Quantitative evaluation of user simulation techniques for spoken dialogue systems, Proc. Workshop on Discourse and Dialogue (2005)

    Google Scholar 

  112. P. Poupart, S. Seneff, J. Williams, S. Young: Cochairs. In: Statistical and Empirical Approaches for Spoken Dialogue Systems, ed. by AAAI (AAAI, Menlo Park 2006), Tech. Rep. WS-06-14

    Google Scholar 

  113. E. Levin, R. Pieraccini, W. Eckert: Using Markov decision process for learning dialogue strategies, Proc. ICASSP (1998) pp. 201-204

    Google Scholar 

  114. H. Meng, C. Wai, R. Pieraccini: The use of belief networks for mixed-initiative dialog modeling, IEEE T. Speech Audi. P. 11(6), 757-773 (2003)

    Article  Google Scholar 

  115. J. Williams, S. Young: Scaling POMDPs for dialog management with composite summary point-based value iteration (CSPBVI), Proc. AAAI Workshop on Statistical and Empirical Approaches for Spoken Dialogue Systems (2006) pp. 7-12

    Google Scholar 

  116. W. Cohen: Fast effective rule induction, Proc. 12th Int. Conf. Machine Learning (1995) pp. 115-123

    Google Scholar 

  117. M. Johnston, S. Bangalore: Learning edit machines for robust multimodal understanding, Proc. ICASSP (2006) pp. I617-I620

    Google Scholar 

  118. G. Tur: Multitask learning for spoken language understanding, Proc. ICASSP (2006) pp. I585-I588

    Google Scholar 

  119. S. Sutton: Universal speech tools: The CSLU toolkit, Proc. ICSLP (1998) pp. 3221-3224

    Google Scholar 

  120. J. Glass, S. Seneff: Flexible and personalizable mixed-initiative dialogue systems, HLT-NAACL 2003 Workshop on Research Directions in Dialogue Processing (2003)

    Google Scholar 

  121. S. Sutton, E. Kaiser, A. Cronk, R. Cole: Bringing spoken language systems to the classroom, Proc. Eurospeech (1997) pp. 709-712

    Google Scholar 

  122. J. Glass, E. Weinstein, S. Cyphers, J. Polifroni, G. Chung, M. Nakano: A framework for developing conversational user interfaces, Proc. CADUI (2004) pp. 354-365

    Google Scholar 

  123. T. Harris, R. Rosenfeld: A universal speech interface for appliances, Proc. ICSLP (2004)

    Google Scholar 

  124. M. Johnston, S. Bangalore: Finite-state multimodal parsing and understanding, Proc. 18th Intrenational Conference on Computational Linguistics (2000) pp. 369-375

    Google Scholar 

  125. O. Lemon, A. Gruenstein, S. Peters: Collaborative activities and multitasking in dialogue systems, Traitement Automatique des Langues 43(2), 131-154 (2002)

    Google Scholar 

  126. M. Turunen, E.-P. Salonen, M. Hartikainen, J. Hakulinen, W. Black, A. Ramsay, A. Funk, A. Conroy, P. Thompson, M. Stairmand, L.K. Jokenen, J. Rissanen, K. Kanto, A. Kerminen, B. Gamback, M. Cheadle, F. Olsson, M. Sahlgren: AthosMail - A multilingual adaptive spoken dialogue system for e-mail domain, Proc. COLING Workshop (2004)

    Google Scholar 

  127. S. Seneff, R. Lau, J. Polifroni: Organization, communication, and control in the GALAXY-II conversational system, Proc. Eurospeech (1999) pp. 1271-1274

    Google Scholar 

  128. W.-T. Hsu, H.-M. Wang, Y.-C. Lin: The design of a multi-domain Chinese dialogue system, Proc. ISCSLP (2002)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Victor Zue Prof. or Stephanie Seneff Dr. .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2008 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Zue, V., Seneff, S. (2008). Spoken Dialogue Systems. In: Benesty, J., Sondhi, M.M., Huang, Y.A. (eds) Springer Handbook of Speech Processing. Springer Handbooks. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-49127-9_35

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-49127-9_35

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-49125-5

  • Online ISBN: 978-3-540-49127-9

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics