Spoken Dialogue Systems

Zue, Victor; Seneff, Stephanie

doi:10.1007/978-3-540-49127-9_35

Victor Zue Prof.⁴ &
Stephanie Seneff Dr.⁵

Part of the book series: Springer Handbooks ((SHB))

8064 Accesses
1 Citations

Abstract

Spoken dialogue systems are a new breed of interfaces that enable humans to communicate with machines naturally and efficiently using a conversational paradigm. Such a system makes use of many human language technology (HLT) components, including speech recognition and synthesis, natural language understanding and generation, discourse modeling, and dialogue management. In this contribution, we introduce the nature of these interfaces, describe the underlying HLTs on which they are based, and discuss some of the development issues. After providing a historical perspective, we outline some new research directions.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 579.00; Price excludes VAT (USA)

Hardcover Book: USD 729.00; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Abbreviations

ARISE:: Automatic Railway Information Systems for Europe
ASR:: automatic speech recognition
ATN:: augmented transition networks
BBN:: Bolt, Beranek and Newman
DARPA:: Defense Advanced Research Projects Agency
FST:: finite state transducer
HLT:: human language technologies
ISU:: information state update
IVR:: interactive voice response
MDP:: Markov decision process
NLG:: natural language generation
NLU:: natural language understanding
SLS:: spoken language system
SUNDIAL:: speech understanding and dialog
TTS:: text-to-speech
WER:: word error rate

References

E. Barnard, A. Halberstadt, C. Kotelly, M. Phillips: A consistent approach to designing spoken-dialog systems, Proc. ASRU Workshop (ASRU, Keystone 1999)
Google Scholar
S.J. Boyce: Natural spoken dialogue systems for telephony applications, Commun. ACM 43(9), 29-34 (2000)
Article Google Scholar
A. Raux, B. Langner, D. Bohus, A. Black, M. Eskenazi: Letʼs Go public! Taking a spoken dialog system to the real world, Proc. Interspeech (2005) pp. 885-888
Google Scholar
A. Raux, B. Langner, A. Black, M. Eskenazi: Letʼs Go: Improving spoken dialog systems for the elderly and non-natives, Proc. Interspeech (2003) pp. 753-756
Google Scholar
G. Flammia: Discourse Segmentation of Spoken Dialogue: An Empirical Approach. Ph.D. Thesis (MIT, Cambridge 1998)
Google Scholar
S. Seneff, R. Lau, J. Glass, J. Polifroni: The mercury system for flight browsing and pricing, MIT Spoken Language System Group Annual Progress Report (1999) pp. 23-28
Google Scholar
J. Glass, G. Flammia, D. Goodine, M. Phillips, J. Polifroni, S. Sakai, S. Seneff, V. Zue: Multilingual spoken-language understanding in the MIT voyager system, Speech Commun. 17, 1-18 (1995)
Article Google Scholar
R. Lau, G. Flammia, C. Pao, V. Zue: WebGalaxy - Integrating spoken language and hypertext navigation, Proc. Eurospeech (1997) pp. 883-886
Google Scholar
H. Meng, S. Busayapongchai, J. Glass, D. Goddeau, L. Hetherington, E. Hurley, C. Pao, J. Polifroni, S. Seneff, V. Zue: A conversational system in the automobile classifieds domain, Proc. ICSLP (1996) pp. 542-545
Google Scholar
S. Seneff, V. Zue, J. Polifroni, C. Pao, L. Hetherington, D. Goddeau, J. Glass: The preliminary development of a displayless Pegasus system, Proc. ARPA Spoken Language Technology Workshop (1995) pp. 212-217
Google Scholar
A. Gruenstein, S. Seneff, C. Wang: Scalable and portable web-based multimodal dialogue interaction with geographical databases, Proc. Interspeech (2006)
Google Scholar
W. Wang, J. Glass, H. Meng, J. Polifroni, S. Seneff, V. Zue: Yinhe: A Mandarin Chinese version of the galaxy system, Proc. Eurospeech (1997) pp. 351-354
Google Scholar
V. Zue, S. Seneff, J. Polifroni, M. Phillips, C. Pao, D. Goddeau, J. Glass, E. Brill: Pegasus: A spoken language interface for on-line air travel planning, Speech Commun. 15, 331-340 (1994)
Article Google Scholar
V. Zue, S. Seneff, J. Glass, J. Polifroni, C. Pao, T. Hazen, L. Hetherington: JUPITER: A telephone-based conversational interface for weather information, IEEE Trans. Speech Audio. Process. 8(1), 85-96 (2000)
Article Google Scholar
P. Dalsgaard, L. Larsen, I. Thomsen (Eds.): Proc. ESCA Tutorial and Research Workshop on Spoken Dialogue Systems: Theory and Application (1995)
Google Scholar
P. Cohen, M. Johnson, D. McGee, S. Oviatt, J. Clow, I. Smith: The efficiency of multimodal interaction: A case study, Proc. ICSLP (1998) pp. 249-252
Google Scholar
S. Seneff, D. Goddeau, C. Pao, J. Polifroni: Multimodal discourse modelling in a multi-user multi-domain environment, Proc. ICSLP (1996) pp. 188-191
Google Scholar
D. Massaro: Perceiving Talking Faces: From Speech Perception to a Behavioral Principle (MIT Press, Cambridge 1997)
Google Scholar
W. Ward: Modelling non-verbal sounds for speech recognition, Proc. DARPA Workshop on Speech and Natural Language (1989) pp. 47-50
Google Scholar
J. Butzberger, H. Murveit, M. Weintraub: Spontaneous speech effects in large vocabulary speech recognition applications, Proc. ARPA Workshop on Speech and Natural Language (1992) pp. 339-344
Google Scholar
L. Hetherington, V. Zue: New words: Implications for continuous speech recognition, Proc. Eurospeech (1991) pp. 475-931
Google Scholar
A. Asadi, R. Schwartz, J. Makhoul: Automatic modelling for adding new words to a large vocabulary continuous speech recognition system, Proc. ICASSP (1991) pp. 305-308
Google Scholar
G. Chung, S. Seneff, C. Wang: Automatic acquisition of names using speak and spell mode in spoken dialogue systems, Proc. HLT-NAACL (2003) pp. 197-200
Google Scholar
C. Pao, P. Schmid, J. Glass: Confidence scoring for speech understanding systems, Proc. ICSLP (1998) pp. 815-818
Google Scholar
M. Gabsdil, O. Lemon: Combining acoustic and pragmatic features to predict recognition performance in spoken dialogue systems, Proc. ACL (2004)
Google Scholar
A. Kellner, B. Rueber, H. Schramm: Using combined decisions and confidence measures for name recognition in automatic directory assistance systems, Proc. ICSLP (1998) pp. 2859-2862
Google Scholar
D. Bohus, A. Rudnicky: Constructing accurate beliefs in spoken dialog systems, Proc. ASRU (2005) pp. 272-277
Google Scholar
V. Souvignier, A. Kellner, B. Rueber, H. Schramm, F. Seide: The thoughtful elephant: Strategies for spoken dialogue systems, IEEE T. Speech Audi. P. 8(1), 51-62 (2000)
Article Google Scholar
R. Billi, R. Canavesio, C. Rullent: Automation of Telecom Italia Directory Assistance Service: Field trial results, Proc. IVTTA (1998) pp. 11-16
Google Scholar
R.J. Lippmann: Speech recognition by humans and machines, Speech Commun. 22(1), 1-15 (1997)
Article Google Scholar
R. Bobrow, R. Ingria, D. Stallard: Syntactic and semantic knowledge in the DELPHI unification grammar, Proc. DARPA Speech and Natural Language Workshop (1990) pp. 230-236
Google Scholar
S. Seneff: TINA: A natural language system for spoken language applications, Comput. Linguist. 18(1), 61-86 (1992)
Google Scholar
J. Dowding, J. Gawron, D. Appelt, J. Bear, L. Cherny, R. Moore, D. Moran: Gemini: A natural language system for spoken language understanding, Proc. ARPA Workshop on Human Language Technology (1993) pp. 21-24
Google Scholar
W. Ward: The CMU air travel information service: Understanding spontaneous speech, Proc. ARPA Workshop on Speech and Natural Language (1990) pp. 127-129
Google Scholar
E. Jackson, D. Appelt, J. Bear, R. Moore, A. Podlozny: A template matcher for robust NL interpretation, Proc. DARPA Speech and Natural Language Workshop (1991) pp. 190-194
Google Scholar
S. Seneff: Robust parsing for spoken language systems, Proc. ICASSP (1992) pp. 189-192
Google Scholar
D. Stallard, R. Bobrow: Fragment processing in the DELPHI system, Proc. DARPA Speech and Natural Language Workshop (1992) pp. 305-310
Google Scholar
A. Gorin, G. Riccardi, J. Wright: How may I help you?, Speech Commun. 23, 113-127 (1997)
Article MATH Google Scholar
S. Miller, R. Schwartz, R. Bobrow, R. Ingria: Statistical language processing using hidden understanding models, Proc. ARPA Speech and Natural Language Workshop (1994) pp. 278-282
Google Scholar
Y. Chow, R. Schwartz: The N-Best Algorithm: An efficient procedure for finding top N sentence hypotheses, Proc. ARPA Workshop on Speech and Natural Language (1989) pp. 199-202
Google Scholar
L. Hetherington, M. Phillips, J. Glass, V. Zue: A word network search for continuous speech recognition, Proc. Eurospeech (1993) pp. 1533-1536
Google Scholar
L. Mangu, E. Brill, A. Stolcke: Finding consensus among words: Lattice-based word error minimization, Proc. Eurospeech (1999)
Google Scholar
D. Goodine, S. Seneff, L. Hirschman, M. Phillips: Full integration of speech and language understanding in the MIT spoken language system, Proc. Eurospeech (1991) pp. 845-848
Google Scholar
D. Goddeau: Using probabilistic shift-reduce parsing in speech recognition systems, Proc. ICSLP (1992) pp. 321-324
Google Scholar
W. Ward: Integrating semantic constraints into the SPHINX-II recognition search, Proc. ICASSP (1994) pp. 17-20
Google Scholar
R. Moore, D. Appelt, J. Dowding, J. Gawron, D. Moran: Combining linguistic and statistical knowledge sources in natural-language processing for ATIS, Proc. ARPA Spoken Language Systems Workshop (1995) pp. 261-264
Google Scholar
E. Reiter, R. Dale: Building Natural Language Generation Systems (Cambridge Univ. Press, Cambridge 2000)
Book Google Scholar
D. McDonald, L. Bolc (Eds.): Natural Language Generation Systems (Symbolic Computation Artificial Intelligence) (Springer, Heidelberg, Berlin 1998)
Google Scholar
J. Glass, J. Polifroni, S. Seneff: Multilingual language generation across multiple domains, Proc. ICSLP (1994)
Google Scholar
A. Oh: Stochastic Natural Language Generation for Spoken Dialog Systems. M.S. Thesis (CMU, Mt. Pleasant 2000)
Google Scholar
V. Demberg, J. Moore: Information presentation in spoken dialogue systems, Proc. EACL (2006)
Google Scholar
D. Klatt: Review of text-to-speech conversion for English, J. Acoust. Soc. Am. 82(3), 737-793 (1987)
Article Google Scholar
Y. Sagisaka, N. Kaiki, N. Iwahashi, K. Mimura: ATR ν -talk speech synthesis system, Proc. ICSLP (1992) pp. 483-486
Google Scholar
M. Beutnagel, A. Conkie, J. Schroeter, Y. Stylianou, A. Syrdal: The AT&T next-gen TTS system, Proc. ASA (ASA, Berlin 1999)
Google Scholar
X. Huang, A. Acero, J. Adcock, H.W. Hon, J. Goldsmith, J. Liu, M. Plumpe: Whistler: A trainable text-to-speech system, Proc. ICSLP (1996) pp. 2387-2390
Google Scholar
J. van Santen, L. Pols, M. Abe, D. Kahn, E. Keller, J. Vonwiller: Report on the 3rd ESCA TTS workshop evaluation procedure, Proc. 3rd ESCA Workshop on Speech Synthesis (1998) pp. 329-332
Google Scholar
K. McKeown, S. Pan, J. Shaw, D. Jordan, B. Allen: Language generation for multimedia healthcare briefings, Proc. Applied Natural Language Proc (1997)
Google Scholar
A. Black, K. Lenzo: Limited domain synthesis, Proc. ICSLP (2000)
Google Scholar
R. Sproat, A. Hunt, M. Ostendorf, P. Taylor, A. Black, K. Lenzo, M. Edgington: Sable: A standard for TTS markup, Proc. ICSLP (1998) pp. 1719-1722
Google Scholar
B. Grosz, C. Sidner: Plans for discourse, Intentions in Communication (MIT Press, 1990)
Google Scholar
D. Thomson, J. Wisowaty: User confusion in natural language services, Proc. ESCA Workshop on Interactive Dialogue in Multi-Modal Systems (1999) pp. 189-196
Google Scholar
D. Sadek: Design considerations on dialogue systems: From theory to technology - The case of Artimis, Proc. ESCA Workshop on Interactive Dialogue in Multi-Modal Systems (1999) pp. 173-188
Google Scholar
L. Boves, E. den Os: Applications of speech technology: Designing for usability, Proc. IEEE Worshop on ASR and Understanding (1999) pp. 353-362
Google Scholar
J. Allen, L. Schubert, G. Ferguson, P. Heeman, C. Hwang, T. Kato, W. Light, N. Martin, B. Miller, M. Poesio, D. Traum: The TRAINS project: A case study in defining a conversational planning agent, J. Exp. Theor. Artif. Intell. 7, 7-48 (1995)
Article MATH Google Scholar
M. Denecke, A. Waibel: Dialogue strategies guiding users to their communicative goals, Proc. Eurospeech (1997) pp. 2227-2230
Google Scholar
L. Devillers, H. Bonneau-Maynard: Evaluation of dialog strategies for a tourist information retrieval system, Proc. ICSLP (1998) pp. 1187-1190
Google Scholar
S. Rosset, S. Bennacef, L. Lamel: Design strategies for spoken language dialog systems, Proc. Eurospeech (1999) pp. 1535-1538
Google Scholar
A. Rudnicky, E. Thayer, P. Constantinides, C. Tchou, R. Shern, K. Lenzo, W. Xu, A. Oh: Creating natural dialogs in the Carnegie Mellon communicator system, Proc. Eurospeech (1999) pp. 1531-1534
Google Scholar
E. Filisko, S. Seneff: Error detection and recovery in spoken dialogue systems, Proc. Workshop on Spoken Language Understanding for Conversational Systems and Higher Level Linguistic Knowledge for Speech Processing (2004)
Google Scholar
G. Di Fabbrizio, C. Lewis: Florence: A dialogue manager framework for spoken dialogue systems, Proc. ICSLP (2004)
Google Scholar
D. Goddeau, H. Meng, J. Polifroni, S. Seneff, S. Busayapongchai: A form-based dialogue manager for spoken language applications, Proc. ICSLP (1996) pp. 701-704
Google Scholar
R. Pieraccini, E. Levin, W. Eckert: AMICA: The AT&T mixed initiative conversational architecture, Proc. Eurospeech (1997) pp. 1875-1879
Google Scholar
S. Seneff: Response planning and generation in the MERCURY flight reservation system, Comput. Speech Lang., Vol. 16 (2002) pp. 283-312
Google Scholar
P. Constantinides, S. Hansma, A. Rudnicky: A schema-based approach to dialog control, Proc. ICSLP (1998) pp. 409-412
Google Scholar
X. Wei, A. Rudnicky: Task-based dialog management using an agenda, Proc. ANLP/NAACL Workshop on Conversational Systems (2000)
Google Scholar
J. Allen, D. Byron, M. Dzikovska, G. Ferguson, L. Galescu, A. Stent: An architecture for a generic dialogue shell, Nat. Lang. Eng. 6(3), 1-16 (2000)
Google Scholar
E. den Os, L. Boves, L. Lamel, P. Baggia: Overview of the ARISE project, Proc. Eurospeech (1999) pp. 1527-1530
Google Scholar
T. Kemp, A. Waibel: Unsupervised training of a speech recognizer: Recent experiments, Proc. Eurospeech (1999) pp. 2725-2728
Google Scholar
A. Rudnicky, M. Sakamoto, J. Polifroni: Evaluating spoken language interaction, Proc. DARPA Speech and Natural Language Workshop (1989) pp. 150-159
Google Scholar
L. Lamel, S. Bennacef, J.L. Gauvain, H. Dartigues, J. Temem: User evaluation of the mask kiosk, Proc. ICSLP (1998) pp. 2875-2878
Google Scholar
M. Walker, D. Litman, C. Kamm, A. Abella: PARADISE: A general framework for evaluating spoken dialogue agents, Proc. ACL/EACL (1997) pp. 271-280
Google Scholar
M. Walker, J. Boland, C. Kamm: The utility of elapsed time as a usability metric for spoken dialogue systems, Proc. ASRU Workshop (ASRU, Keystone 1999) pp. 1167-1170
Google Scholar
L. Hirschman: Multi-site data collection for a spoken language corpus, Proc. DARPA Workshop on Speech and Natural Language (1992) pp. 7-14
Google Scholar
J. Polifroni, S. Seneff, J. Glass, T. Hazen: Evaluation methodology for a telephone-based conversational system, Proc. Int. Conf. on Lang. Resources and Evaluation (1998) pp. 42-50
Google Scholar
D. Pallett, J. Fiscus, W. Fisher, J. Garafolo, B. Lund, A. Martin, M. Pryzbocki: Benchmark tests for the ARPA spoken language program, Proc. ARPA Spoken Language Systems Technology Workshop (1995) pp. 5-36
Google Scholar
J. Peckham: A new generation of spoken dialogue systems: Results and lessons from the SUNDIAL project, Proc. Eurospeech (1993) pp. 33-40
Google Scholar
M. Walker, J. Aberdeen, J. Boland, E. Bratt, J. Garofolo, L. Hirschman, A. Le, S. Lee, S. Narayanan, K. Papineni, B. Pellom, J. Polifroni, A. Potamianos, P. Prabhu, A. Rudnicky, G. Sanders, S. Seneff, D. Stallard, S. Whittaker: DARPA communicator dialog travel planning systems: The June 2000 data collection, Proc. Eurospeech (2001)
Google Scholar
M. Walker, A. Rudnicky, R. Prasad, J. Aberdeen, E.O. Bratt, J. Garofolo, H. Hastie, A. Le, B. Pellom, A. Potamianos, R. Passonneau, S. Roukos, G. Sanders, S. Seneff, D. Stallard: DARPA communicator: Cross-system results for the 2001 evaluation, Proc. ICSLP (2002) pp. 273-276
Google Scholar
S. Seneff, E. Hurley, R. Lau, C. Pao, P. Schmid, V. Zue: GALAXY-II: A reference architecture for conversational system development, Proc. ICSLP (1998) pp. 931-934
Google Scholar
M. Eskenazi, A. Rudnicky, K. Gregory, P. Constantinides, R. Brennan, C. Bennett, J. Allen: Data collection and processing in the Carnegie Mellon communicator, Proc. Eurospeech (1999) pp. 2695-2698
Google Scholar
D. Jurafsky, C. Wooters, G. Tajchman, J. Segal, A. Stolcke, E. Fosler, N. Morgan: The Berkeley restaurant project, Proc. ICSLP (1994) pp. 2139-2142
Google Scholar
M. Blomberg, R. Carlson, K. Elenius, B. Granstrom, J. Gustafson, S. Hunnicutt, R. Lindell, L. Neovius: An experimental dialogue system: Waxholm, Proc. Eurospeech (1993) pp. 1867-1870
Google Scholar
H. Aust, M. Oerder, F. Seide, V. Steinbiss: The Philips automatic train timetable information system, Speech Commun. 17, 249-262 (1995)
Article MATH Google Scholar
V. Zue, J. Glass: Conversational interfaces: Advances and challenges, Proc. IEEE, Vol. 88 (2000), Special Issue on Spoken Language Processing
Google Scholar
J. Kowtko, P. Price: Data collection and analysis in the air travel planning domain, Proc. DARPA Speech and Natural Language Workshop (1989)
Google Scholar
G. Castagnieri, P. Baggia, M. Danieli: Field trials of the Italian ARISE train timetable system, Proc. IVTTA (1998) pp. 97-102
Google Scholar
M. Cohen, Z. Rivlin, H. Bratt: Speech recognition in the ATIS domain using multiple knowledge sources, Proc. DARPA Spoken Language Systems Technology Workshop (1995) pp. 257-260
Google Scholar
L. Lamel, S. Rosset, J.L. Gauvain, S. Bennacef, M. Garnier-Rizet, B. Prouts: The LIMSI ARISE system, Proc. IVTTA (1998) pp. 209-214
Google Scholar
A. Sanderman, J. Sturm, E. den Os, L. Boves, A. Cremers: Evaluation of the Dutch train timetable information system developed in the ARISE project, Proc. IVTTA (1998) pp. 91-96
Google Scholar
J. Sturm, E. den Os, L. Boves: Dialogue management in the Dutch ARISE train timetable information system, Proc. Eurospeech (1999) pp. 1419-1422
Google Scholar
R. Carlson, S. Hunnicutt: Generic and domain- specific aspects of the Waxholm NLP and dialogue modules, Proc. ICSLP (1996) pp. 677-680
Google Scholar
B. Buntschuh: VPQ: A spoken language interface to large scale directory information, Proc. ICSLP (1998) pp. 2863-2866
Google Scholar
M. Johnston, S. Bangalore, G. Vasireddy, A. Stent, P. Ehlen, M. Walker, S. Whattaker, P. Maloor: MATCH: An architecture for multimodal dialogue systems, Proc. ACL (2002) pp. 376-383
Google Scholar
S. Oviatt: Multimodal interfaces for dynamic interactive maps, Proc. CHI (1996) pp. 95-102
Google Scholar
E. Levin, R. Pieraccini, W. Eckert: A stochastic model of human-machine interaction for learning dialogue strategies, IEEE T. Speech Audi. P. 8(1), 11-23 (2000)
Article Google Scholar
W. Eckert, E. Levin, R. Pieraccini: User modelling for spoken dialogue system evaluation, Proc. ASRU (1997) pp. 80-87
Google Scholar
R. Lopez-Cozar, A. De la Torre, J.C. Segura, A.J. Rubio: Assessment of dialogue systems by means of a new simulation technique, Speech Commun. 40, 387-407 (2003)
Article Google Scholar
G. Chung: Developing a flexible spoken dialog system using simulation, Proc. ACL (2004) pp. 63-70
Google Scholar
G. Chung, S. Seneff, C. Wang: Automatic induction of language model data for a spoken dialogue system, Proc. 6th SIGdial Workshop on Discourse and Dialogue Lisbon (2005)
Google Scholar
E. Filisko, S. Seneff: Learning decision models in spoken dialogue systems via user simulation, Proc. AAAI Workshop: Statistical and Empirical Approaches for Spoken Dialog Systems (2006)
Google Scholar
J. Schatzmann, K. Georgila, S. Young: Quantitative evaluation of user simulation techniques for spoken dialogue systems, Proc. Workshop on Discourse and Dialogue (2005)
Google Scholar
P. Poupart, S. Seneff, J. Williams, S. Young: Cochairs. In: Statistical and Empirical Approaches for Spoken Dialogue Systems, ed. by AAAI (AAAI, Menlo Park 2006), Tech. Rep. WS-06-14
Google Scholar
E. Levin, R. Pieraccini, W. Eckert: Using Markov decision process for learning dialogue strategies, Proc. ICASSP (1998) pp. 201-204
Google Scholar
H. Meng, C. Wai, R. Pieraccini: The use of belief networks for mixed-initiative dialog modeling, IEEE T. Speech Audi. P. 11(6), 757-773 (2003)
Article Google Scholar
J. Williams, S. Young: Scaling POMDPs for dialog management with composite summary point-based value iteration (CSPBVI), Proc. AAAI Workshop on Statistical and Empirical Approaches for Spoken Dialogue Systems (2006) pp. 7-12
Google Scholar
W. Cohen: Fast effective rule induction, Proc. 12th Int. Conf. Machine Learning (1995) pp. 115-123
Google Scholar
M. Johnston, S. Bangalore: Learning edit machines for robust multimodal understanding, Proc. ICASSP (2006) pp. I617-I620
Google Scholar
G. Tur: Multitask learning for spoken language understanding, Proc. ICASSP (2006) pp. I585-I588
Google Scholar
S. Sutton: Universal speech tools: The CSLU toolkit, Proc. ICSLP (1998) pp. 3221-3224
Google Scholar
J. Glass, S. Seneff: Flexible and personalizable mixed-initiative dialogue systems, HLT-NAACL 2003 Workshop on Research Directions in Dialogue Processing (2003)
Google Scholar
S. Sutton, E. Kaiser, A. Cronk, R. Cole: Bringing spoken language systems to the classroom, Proc. Eurospeech (1997) pp. 709-712
Google Scholar
J. Glass, E. Weinstein, S. Cyphers, J. Polifroni, G. Chung, M. Nakano: A framework for developing conversational user interfaces, Proc. CADUI (2004) pp. 354-365
Google Scholar
T. Harris, R. Rosenfeld: A universal speech interface for appliances, Proc. ICSLP (2004)
Google Scholar
M. Johnston, S. Bangalore: Finite-state multimodal parsing and understanding, Proc. 18th Intrenational Conference on Computational Linguistics (2000) pp. 369-375
Google Scholar
O. Lemon, A. Gruenstein, S. Peters: Collaborative activities and multitasking in dialogue systems, Traitement Automatique des Langues 43(2), 131-154 (2002)
Google Scholar
M. Turunen, E.-P. Salonen, M. Hartikainen, J. Hakulinen, W. Black, A. Ramsay, A. Funk, A. Conroy, P. Thompson, M. Stairmand, L.K. Jokenen, J. Rissanen, K. Kanto, A. Kerminen, B. Gamback, M. Cheadle, F. Olsson, M. Sahlgren: AthosMail - A multilingual adaptive spoken dialogue system for e-mail domain, Proc. COLING Workshop (2004)
Google Scholar
S. Seneff, R. Lau, J. Polifroni: Organization, communication, and control in the GALAXY-II conversational system, Proc. Eurospeech (1999) pp. 1271-1274
Google Scholar
W.-T. Hsu, H.-M. Wang, Y.-C. Lin: The design of a multi-domain Chinese dialogue system, Proc. ISCSLP (2002)
Google Scholar

Download references

Author information

Authors and Affiliations

Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, 32 Vassar Street, 02139, Cambridge, MA, USA
Victor Zue Prof.
Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, 32 Vassar Street, 02139, Cambridge, MA, USA
Stephanie Seneff Dr.

Authors

Victor Zue Prof.
View author publications
You can also search for this author in PubMed Google Scholar
Stephanie Seneff Dr.
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Victor Zue Prof. or Stephanie Seneff Dr. .

Editor information

Editors and Affiliations

INRS-EMT, University of Quebec, 800 de la Gauchetiere Ouest, H5A 1K6, Montreal, Quebec, Canada
Jacob Benesty Dr.
Avayalabs Research, 233 Mount Airy Road, 07920, Basking Ridge, NJ, USA
M. Mohan Sondhi Ph.D.
Alcatel-Lucent, Bell Laboratories, 600 Mountain Avenue, 07974, Murray Hill, NJ, USA
Yiteng Arden Huang Dr.

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Zue, V., Seneff, S. (2008). Spoken Dialogue Systems. In: Benesty, J., Sondhi, M.M., Huang, Y.A. (eds) Springer Handbook of Speech Processing. Springer Handbooks. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-49127-9_35

Download citation

DOI: https://doi.org/10.1007/978-3-540-49127-9_35
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-49125-5
Online ISBN: 978-3-540-49127-9
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics