Learning Adaptive Referring Expression Generation Policies for Spoken Dialogue Systems

Janarthanam, Srinivasan; Lemon, Oliver

doi:10.1007/978-3-642-15573-4_4

Srinivasan Janarthanam²¹ &
Oliver Lemon²²

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 5790))

Included in the following conference series:

1193 Accesses
7 Citations

Abstract

We address the problem that different users have different lexical knowledge about problem domains, so that automated dialogue systems need to adapt their generation choices online to the users’ domain knowledge as it encounters them. We approach this problem using Reinforcement Learning in Markov Decision Processes (MDP). We present a reinforcement learning framework to learn adaptive referring expression generation (REG) policies that can adapt dynamically to users with different domain knowledge levels. In contrast to related work we also propose a new statistical user model which incorporates the lexical knowledge of different users. We evaluate this framework by showing that it allows us to learn dialogue policies that automatically adapt their choice of referring expressions online to different users, and that these policies are significantly better than hand-coded adaptive policies for this problem. The learned policies are consistently between 2 and 8 turns shorter than a range of different hand-coded but adaptive baseline REG policies.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Bell, A.: Language style as audience design. Language in Society 13(2), 145–204 (1984)
Article Google Scholar
Belz, A., Varges, S.: Generation of repeated references to discourse entities. In: Proc. ENLG 2007 (2007)
Google Scholar
Boye, J.: Dialogue management for automatic troubleshooting and other problem-solving applications. In: Proc. SIGDial 2007 (2007)
Google Scholar
Branigan, H.P., Pickering, M.J., Pearson, J., McLean, J.F.: Linguistic alignment between people and computers. Journal of Pragmatics (in Press)
Google Scholar
Brennan, S.E.: Conversation with and through computers. User Modeling and User-Adaptive Interaction 1(1), 67–86 (1991)
Article Google Scholar
Bromme, R., Jucks, R., Wagner, T.: How to refer to diabetes? Language in online health advice. Applied Cognitive Psychology 19, 569–586 (2005)
Article Google Scholar
Buschmeier, H., Bergmann, K., Kopp, S.: Modelling and evaluation of lexical and syntactic alignment with a priming-based microplanner. In: Krahmer, E., Theune, M. (eds.) Empirical Methods in NLG. LNCS (LNAI), vol. 5790, pp. 85–104. Springer, Heidelberg (2010)
Google Scholar
Clark, H.H.: Using Language. Cambridge University Press, Cambridge (1996)
Book Google Scholar
Clark, H.H., Murphy, G.L.: Audience design in meaning and reference. In: Le Ny, J.F., Kintsch, W. (eds.) Language and comprehension. North-Holland Publishing Company, Amsterdam (1982)
Google Scholar
Dale, R.: Cooking up referring expressions. In: Proc. ACL 1989(1989)
Google Scholar
van Deemter, K.: Generating referring expressions: Boolean extensions of the Incremental Algorithm. Computational Linguistics 28(1), 37–52 (2002)
Article MATH Google Scholar
van Deemter, K.: What game theory can do for NLG: the case of vague language. In: Proc. ENLG 2009 (2009)
Google Scholar
Gatt, A., Belz, A.: Attribute selection for referring expression generation: New algorithms and evaluation methods. In: Proc. INLG 2008 (2008)
Google Scholar
Georgila, K., Henderson, J., Lemon, O.: Learning user simulations for information state update dialogue systems. In: Proc. Eurospeech/Interspeech (2005)
Google Scholar
Heller, D., Skovbroten, K., Tanenhaus, M.K.: Experimental evidence for speakers sensitivity to common vs. privileged ground in the production of names. In: Proc. PRE-CogSci 2009 (2009)
Google Scholar
Hinds, P.: The curse of expertise: The effects of expertise and debiasing methods on predictions of novice performance. Experimental Psychology: Applied 5(2), 205–221 (1999)
Google Scholar
Issacs, E.A., Clark, H.H.: References in conversations between experts and novices. Journal of Experimental Psychology: General 116, 26–37 (1987)
Article Google Scholar
Janarthanam, S., Lemon, O.: User simulations for online adaptation and knowledge-alignment in troubleshooting dialogue systems. In: Proc. SEMdial 2008 (2008)
Google Scholar
Janarthanam, S., Lemon, O.: A two-tier user simulation model for reinforcement learning of adaptive referring expression generation policies. In: Proc. SIGDial 2009 (2009)
Google Scholar
Janarthanam, S., Lemon, O.: A wizard-of-oz environment to study referring expression generation in a situated spoken dialogue task. In: Proc. ENLG 2009 (2009)
Google Scholar
Komatani, K., Ueno, S., Kawahara, T., Okuno, H.G.: Flexible guidance generation using user model in spoken dialogue systems. In: Proc. ACL 2003 (2003)
Google Scholar
Komatani, K., Ueno, S., Kawahara, T., Okuno, H.G.: User modeling in spoken dialogue systems to generate flexible guidance. User Modeling and User-Adapted Interaction 15(1), 169–183 (2005)
Article Google Scholar
Krahmer, E., van Erk, S., Verleg, A.: Graph-based generation of referring expressions. Computational Linguistics 29(1), 53–72 (2003)
Article MATH Google Scholar
Lemon, O.: Adaptive natural language generation in dialogue using reinforcement learning. In: Proc. SEMdial 2008 (2008)
Google Scholar
Levin, E., Pieraccini, R., Eckert, W.: Learning dialogue strategies within the markov decision process framework. In: Proc. ASRU 1997 (1997)
Google Scholar
McKeown, K., Robin, J., Tanenblatt, M.: Tailoring lexical choice to the user’s vocabulary in multimedia explanation generation. In: Proc. ACL 1993 (1993)
Google Scholar
Molich, R., Nielsen, J.: Improving a human-computer dialogue. Communications of the ACM 33(3), 338–348 (1990)
Article Google Scholar
Pickering, M.J., Garrod, S.: Toward a mechanistic psychology of dialogue. Behavioral and Brain Sciences 27, 169–225 (2004)
Google Scholar
Porzel, R., Scheffler, A., Malaka, R.: How entrainment increases dialogical efficiency. In: Proc. Workshop on Effective Multimodal Dialogue Interfaces, Sydney (2006)
Google Scholar
Reiter, E., Dale, R.: Computational interpretations of the Gricean maxims in the generation of referring expressions. Cognitive Science 18, 233–263 (1995)
Google Scholar
Rieser, V., Lemon, O.: Natural language generation as planning under uncertainty for spoken dialogue systems. In: Proc. EACL 2009 (2009)
Google Scholar
Rieser, V., Lemon, O.: Learning effective multimodal dialogue strategies from wizard-of-oz data: Bootstrapping and evaluation. In: Proc. ACL 2008 (2008)
Google Scholar
Rieser, V., Lemon, O.: Natural Language Generation as Planning Under Uncertainty for Spoken Dialogue Systems. In: Krahmer, E., Theune, M. (eds.) Empirical Methods in NLG. LNCS (LNAI), vol. 5790, pp. 105–120. Springer, Heidelberg (2010)
Google Scholar
Schatzmann, J., Thomson, B., Weilhammer, K., Ye, H., Young, S.J.: Agenda-based user simulation for bootstrapping a POMDP dialogue system. In: Proc. HLT/NAACL 2007 (2007)
Google Scholar
Schatzmann, J., Weilhammer, K., Stuttle, M.N., Young, S.J.: A survey of statistical user simulation techniques for reinforcement learning of dialogue management strategies. Knowledge Engineering Review, 97–126 (2006)
Google Scholar
Schlangen, D.: Causes and strategies for requesting clarification in dialogue. In: Proc. SIGDial 2004 (2004)
Google Scholar
Shapiro, D., Langley, P.: Separating skills from preference: Using learning to program by reward. In: Proc. ICML 2002 (2002)
Google Scholar
Siddharthan, A., Copestake, A.: Generating referring expressions in open domains. In: Proc. ACL 2004 (2004)
Google Scholar
Stoia, L., Shockley, D.M., Byron, D.K., Fosler-Lussier, E.: Noun phrase generation for situated dialogs. In: Proc. INLG 2006, pp. 81–88 (July 2006)
Google Scholar
Sutton, R., Barto, A.: Reinforcement Learning. MIT Press, Cambridge (1998)
Google Scholar
Williams, J.: Applying POMDPs to dialog systems in the troubleshooting domain. In: Proc. HLT/NAACL Workshop on Bridging the Gap: Academic and Industrial Research in Dialog Technology (2007)
Google Scholar
Wittwer, J., Nckles, M., Renkl, A.: What happens when experts over- or underestimate a laypersons knowledge in communication? Effects on learning and question asking. In: Proc. CogSci 2005 (2005)
Google Scholar

Download references

Author information

Authors and Affiliations

School of Informatics, University of Edinburgh, UK
Srinivasan Janarthanam
School of Mathematical and Computer Sciences, Heriot Watt University, UK
Oliver Lemon

Authors

Srinivasan Janarthanam
View author publications
You can also search for this author in PubMed Google Scholar
Oliver Lemon
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Faculty of Humanities, Department of Communication and Information Sciences (DCI), Tilburg University, P.O.Box 90153, 5000 LE, Tilburg, The Netherlands
Emiel Krahmer
Human Media Interaction (HMI), Department of Electrical Engineering, Mathematics and Computer Science (EEMCS), University of Twente, P.O. Box 217, 7500 AE, Enschede, The Netherlands
Mariët Theune

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Janarthanam, S., Lemon, O. (2010). Learning Adaptive Referring Expression Generation Policies for Spoken Dialogue Systems. In: Krahmer, E., Theune, M. (eds) Empirical Methods in Natural Language Generation. EACL ENLG 2009 2009. Lecture Notes in Computer Science(), vol 5790. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15573-4_4

Download citation

DOI: https://doi.org/10.1007/978-3-642-15573-4_4
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-15572-7
Online ISBN: 978-3-642-15573-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics