Skip to main content

Learning Adaptive Referring Expression Generation Policies for Spoken Dialogue Systems

  • Chapter
Empirical Methods in Natural Language Generation (EACL 2009, ENLG 2009)

Abstract

We address the problem that different users have different lexical knowledge about problem domains, so that automated dialogue systems need to adapt their generation choices online to the users’ domain knowledge as it encounters them. We approach this problem using Reinforcement Learning in Markov Decision Processes (MDP). We present a reinforcement learning framework to learn adaptive referring expression generation (REG) policies that can adapt dynamically to users with different domain knowledge levels. In contrast to related work we also propose a new statistical user model which incorporates the lexical knowledge of different users. We evaluate this framework by showing that it allows us to learn dialogue policies that automatically adapt their choice of referring expressions online to different users, and that these policies are significantly better than hand-coded adaptive policies for this problem. The learned policies are consistently between 2 and 8 turns shorter than a range of different hand-coded but adaptive baseline REG policies.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Bell, A.: Language style as audience design. Language in Society 13(2), 145–204 (1984)

    Article  Google Scholar 

  2. Belz, A., Varges, S.: Generation of repeated references to discourse entities. In: Proc. ENLG 2007 (2007)

    Google Scholar 

  3. Boye, J.: Dialogue management for automatic troubleshooting and other problem-solving applications. In: Proc. SIGDial 2007 (2007)

    Google Scholar 

  4. Branigan, H.P., Pickering, M.J., Pearson, J., McLean, J.F.: Linguistic alignment between people and computers. Journal of Pragmatics (in Press)

    Google Scholar 

  5. Brennan, S.E.: Conversation with and through computers. User Modeling and User-Adaptive Interaction 1(1), 67–86 (1991)

    Article  Google Scholar 

  6. Bromme, R., Jucks, R., Wagner, T.: How to refer to diabetes? Language in online health advice. Applied Cognitive Psychology 19, 569–586 (2005)

    Article  Google Scholar 

  7. Buschmeier, H., Bergmann, K., Kopp, S.: Modelling and evaluation of lexical and syntactic alignment with a priming-based microplanner. In: Krahmer, E., Theune, M. (eds.) Empirical Methods in NLG. LNCS (LNAI), vol. 5790, pp. 85–104. Springer, Heidelberg (2010)

    Google Scholar 

  8. Clark, H.H.: Using Language. Cambridge University Press, Cambridge (1996)

    Book  Google Scholar 

  9. Clark, H.H., Murphy, G.L.: Audience design in meaning and reference. In: Le Ny, J.F., Kintsch, W. (eds.) Language and comprehension. North-Holland Publishing Company, Amsterdam (1982)

    Google Scholar 

  10. Dale, R.: Cooking up referring expressions. In: Proc. ACL 1989(1989)

    Google Scholar 

  11. van Deemter, K.: Generating referring expressions: Boolean extensions of the Incremental Algorithm. Computational Linguistics 28(1), 37–52 (2002)

    Article  MATH  Google Scholar 

  12. van Deemter, K.: What game theory can do for NLG: the case of vague language. In: Proc. ENLG 2009 (2009)

    Google Scholar 

  13. Gatt, A., Belz, A.: Attribute selection for referring expression generation: New algorithms and evaluation methods. In: Proc. INLG 2008 (2008)

    Google Scholar 

  14. Georgila, K., Henderson, J., Lemon, O.: Learning user simulations for information state update dialogue systems. In: Proc. Eurospeech/Interspeech (2005)

    Google Scholar 

  15. Heller, D., Skovbroten, K., Tanenhaus, M.K.: Experimental evidence for speakers sensitivity to common vs. privileged ground in the production of names. In: Proc. PRE-CogSci 2009 (2009)

    Google Scholar 

  16. Hinds, P.: The curse of expertise: The effects of expertise and debiasing methods on predictions of novice performance. Experimental Psychology: Applied 5(2), 205–221 (1999)

    Google Scholar 

  17. Issacs, E.A., Clark, H.H.: References in conversations between experts and novices. Journal of Experimental Psychology: General 116, 26–37 (1987)

    Article  Google Scholar 

  18. Janarthanam, S., Lemon, O.: User simulations for online adaptation and knowledge-alignment in troubleshooting dialogue systems. In: Proc. SEMdial 2008 (2008)

    Google Scholar 

  19. Janarthanam, S., Lemon, O.: A two-tier user simulation model for reinforcement learning of adaptive referring expression generation policies. In: Proc. SIGDial 2009 (2009)

    Google Scholar 

  20. Janarthanam, S., Lemon, O.: A wizard-of-oz environment to study referring expression generation in a situated spoken dialogue task. In: Proc. ENLG 2009 (2009)

    Google Scholar 

  21. Komatani, K., Ueno, S., Kawahara, T., Okuno, H.G.: Flexible guidance generation using user model in spoken dialogue systems. In: Proc. ACL 2003 (2003)

    Google Scholar 

  22. Komatani, K., Ueno, S., Kawahara, T., Okuno, H.G.: User modeling in spoken dialogue systems to generate flexible guidance. User Modeling and User-Adapted Interaction 15(1), 169–183 (2005)

    Article  Google Scholar 

  23. Krahmer, E., van Erk, S., Verleg, A.: Graph-based generation of referring expressions. Computational Linguistics 29(1), 53–72 (2003)

    Article  MATH  Google Scholar 

  24. Lemon, O.: Adaptive natural language generation in dialogue using reinforcement learning. In: Proc. SEMdial 2008 (2008)

    Google Scholar 

  25. Levin, E., Pieraccini, R., Eckert, W.: Learning dialogue strategies within the markov decision process framework. In: Proc. ASRU 1997 (1997)

    Google Scholar 

  26. McKeown, K., Robin, J., Tanenblatt, M.: Tailoring lexical choice to the user’s vocabulary in multimedia explanation generation. In: Proc. ACL 1993 (1993)

    Google Scholar 

  27. Molich, R., Nielsen, J.: Improving a human-computer dialogue. Communications of the ACM 33(3), 338–348 (1990)

    Article  Google Scholar 

  28. Pickering, M.J., Garrod, S.: Toward a mechanistic psychology of dialogue. Behavioral and Brain Sciences 27, 169–225 (2004)

    Google Scholar 

  29. Porzel, R., Scheffler, A., Malaka, R.: How entrainment increases dialogical efficiency. In: Proc. Workshop on Effective Multimodal Dialogue Interfaces, Sydney (2006)

    Google Scholar 

  30. Reiter, E., Dale, R.: Computational interpretations of the Gricean maxims in the generation of referring expressions. Cognitive Science 18, 233–263 (1995)

    Google Scholar 

  31. Rieser, V., Lemon, O.: Natural language generation as planning under uncertainty for spoken dialogue systems. In: Proc. EACL 2009 (2009)

    Google Scholar 

  32. Rieser, V., Lemon, O.: Learning effective multimodal dialogue strategies from wizard-of-oz data: Bootstrapping and evaluation. In: Proc. ACL 2008 (2008)

    Google Scholar 

  33. Rieser, V., Lemon, O.: Natural Language Generation as Planning Under Uncertainty for Spoken Dialogue Systems. In: Krahmer, E., Theune, M. (eds.) Empirical Methods in NLG. LNCS (LNAI), vol. 5790, pp. 105–120. Springer, Heidelberg (2010)

    Google Scholar 

  34. Schatzmann, J., Thomson, B., Weilhammer, K., Ye, H., Young, S.J.: Agenda-based user simulation for bootstrapping a POMDP dialogue system. In: Proc. HLT/NAACL 2007 (2007)

    Google Scholar 

  35. Schatzmann, J., Weilhammer, K., Stuttle, M.N., Young, S.J.: A survey of statistical user simulation techniques for reinforcement learning of dialogue management strategies. Knowledge Engineering Review, 97–126 (2006)

    Google Scholar 

  36. Schlangen, D.: Causes and strategies for requesting clarification in dialogue. In: Proc. SIGDial 2004 (2004)

    Google Scholar 

  37. Shapiro, D., Langley, P.: Separating skills from preference: Using learning to program by reward. In: Proc. ICML 2002 (2002)

    Google Scholar 

  38. Siddharthan, A., Copestake, A.: Generating referring expressions in open domains. In: Proc. ACL 2004 (2004)

    Google Scholar 

  39. Stoia, L., Shockley, D.M., Byron, D.K., Fosler-Lussier, E.: Noun phrase generation for situated dialogs. In: Proc. INLG 2006, pp. 81–88 (July 2006)

    Google Scholar 

  40. Sutton, R., Barto, A.: Reinforcement Learning. MIT Press, Cambridge (1998)

    Google Scholar 

  41. Williams, J.: Applying POMDPs to dialog systems in the troubleshooting domain. In: Proc. HLT/NAACL Workshop on Bridging the Gap: Academic and Industrial Research in Dialog Technology (2007)

    Google Scholar 

  42. Wittwer, J., Nckles, M., Renkl, A.: What happens when experts over- or underestimate a laypersons knowledge in communication? Effects on learning and question asking. In: Proc. CogSci 2005 (2005)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Janarthanam, S., Lemon, O. (2010). Learning Adaptive Referring Expression Generation Policies for Spoken Dialogue Systems. In: Krahmer, E., Theune, M. (eds) Empirical Methods in Natural Language Generation. EACL ENLG 2009 2009. Lecture Notes in Computer Science(), vol 5790. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15573-4_4

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-15573-4_4

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-15572-7

  • Online ISBN: 978-3-642-15573-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics