Domain Complexity and Policy Learning in Task-Oriented Dialogue Systems

Papangelis, Alexandros; Ultes, Stefan; Stylianou, Yannis

doi:10.1007/978-3-319-92108-2_8

Alexandros Papangelis³⁵,
Stefan Ultes³⁶ &
Yannis Stylianou³⁵

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 510))

934 Accesses

Abstract

In the present paper, we conduct a comparative evaluation of a multitude of information-seeking domains, using two well-known but fundamentally different algorithms for policy learning: GP-SARSA and DQN. Our goal is to gain an understanding of how the nature of such domains influences performance. Our results indicate several main domain characteristics that play an important role in policy learning performance in terms of task success rates.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Hardcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Gašić M, Mrkšić N, Rojas-Barahona LM, Su P-H, Ultes S, Vandyke D, Wen T-H, Young S (2016) Dialogue manager domain adaptation using gaussian process reinforcement learning. Comput Speech Lang
Google Scholar
Cuayáhuitl H, Yu S, Williamson A, Carse J (2016) Deep reinforcement learning for multi-domain dialogue systems. arXiv preprint arXiv:1611.08675
Papangelis A, Stylianou, Y (2016) Multi-domain spoken dialogue systems using domain-independent parameterisation. In: Domain adaptation for dialogue agents
Google Scholar
Engel Y, Mannor S, Meir R (2005) Reinforcement learning with gaussian processes. In: Proceedings of the 22nd ICML. ACM, pp 201–208
Google Scholar
Mnih V, Kavukcuoglu K, Silver D, Rusu AA, Veness J, Bellemare MG, Graves A, Riedmiller M, Fidjeland AK, Ostrovski G et al (2015) Human-level control through deep reinforcement learning. Nature 518(7540):529–533
Article Google Scholar
Remus R (2012) Domain adaptation using domain similarity- and domain complexity-based instance selection for cross-domain sentiment analysis. In: 2012 IEEE 12th international conference on data mining workshops, Dec 2012, pp 717–723
Google Scholar
Freitas A, Sales JE, Handschuh S, Curry E (2015) How hard is this query? Measuring the semantic complexity of schema-agnostic queries. In: IWCS 2015, p 294
Google Scholar
Grubinger M, Leung C, Clough P (2005) Linguistic estimation of topic difficulty in cross-language image retrieval. In: Workshop of the cross-language evaluation forum for European languages. Springer, pp 558–566
Google Scholar
Sebastiani F (1994) A probabilistic terminological logic for modelling information retrieval. In: SIGIR94. Springer, pp 122–130
Google Scholar
Bagga A, Biermann AW (1997) Analyzing the complexity of a domain with respect to an information extraction task. In: Proceedings of the tenth international conference on research on computational linguistics (ROCLING X), pp 175–194
Google Scholar
Pollard S, Biermann AW (2000) A measure of semantic complexity for natural language systems. In: Proceedings of the NAACL SSCNLPS, Stroudsburg, PA, USA, pp 42–46
Google Scholar
Gašić M, Breslin C, Henderson M, Kim D, Szummer M, Thomson B, Tsiakoulis P, Young S (2013) On-line policy optimisation of bayesian spoken dialogue systems via human interaction. In: 2013 IEEE international conference on acoustics, speech and signal processing (ICASSP). IEEE, pp 8367–8371
Google Scholar
Ultes S, Rojas-Barahona L, Su PH, Vandyke D, Kim D, Casanueva I, Budzianowski P, Mrkšić N, Wen TH, Gašić M, Young S (2017) Pydial: a multi-domain statistical dialogue system toolkit. In: ACL 2017 Demo, Vancouver. ACL
Google Scholar
Schatzmann J, Young SJ (2009) The hidden agenda user simulation model. IEEE Trans Audio Speech Lang Process 17(4):733–747
Google Scholar

Download references

Author information

Authors and Affiliations

Toshiba Research Europe, Cambridge, UK
Alexandros Papangelis & Yannis Stylianou
Department of Engineering, University of Cambridge, Cambridge, UK
Stefan Ultes

Authors

Alexandros Papangelis
View author publications
You can also search for this author in PubMed Google Scholar
Stefan Ultes
View author publications
You can also search for this author in PubMed Google Scholar
Yannis Stylianou
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Alexandros Papangelis .

Editor information

Editors and Affiliations

Language Technologies Institute, Carnegie Mellon University, Pittsburgh, Pennsylvania, USA
Maxine Eskenazi
LIMSI-CNRS, Sorbonne University, Paris, France
Laurence Devillers
LIMSI-CNRS, Paris-Saclay University, Orsay, France
Joseph Mariani

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Papangelis, A., Ultes, S., Stylianou, Y. (2019). Domain Complexity and Policy Learning in Task-Oriented Dialogue Systems. In: Eskenazi, M., Devillers, L., Mariani, J. (eds) Advanced Social Interaction with Agents . Lecture Notes in Electrical Engineering, vol 510. Springer, Cham. https://doi.org/10.1007/978-3-319-92108-2_8

Download citation

DOI: https://doi.org/10.1007/978-3-319-92108-2_8
Published: 02 August 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-92107-5
Online ISBN: 978-3-319-92108-2
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics