Minimizing Expected Termination Time in One-Counter Markov Decision Processes

Brázdil, Tomáš; Kučera, Antonín; Novotný, Petr; Wojtczak, Dominik

doi:10.1007/978-3-642-31585-5_16

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 7392)

Included in the following conference series:

International Colloquium on Automata, Languages, and Programming

Abstract

We consider the problem of computing the value and an optimal strategy for minimizing the expected termination time in one-counter Markov decision processes. Since the value may be irrational and an optimal strategy may be rather complicated, we concentrate on the problems of approximating the value up to a given error ε > 0 and computing a finite representation of an ε-optimal strategy. We show that these problems are solvable in exponential time for a given configuration, and we also show that they are computationally hard in the sense that a polynomial-time approximation algorithm cannot exist unless P=NP.
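To make the quantity in the abstract concrete, the following is a minimal sketch of a bounded-horizon value iteration on a toy one-counter MDP. It is not the exponential-time algorithm of the paper. It assumes that termination means the counter reaching zero, that every step costs one time unit, and that each transition changes the counter by -1, 0, or +1. The dictionary encoding, the function name `bounded_horizon_value`, and the instance `toy` are hypothetical and introduced only for illustration. The function returns the minimal expected value of min(T, k) for termination time T and horizon k, which is a lower bound on the value and not a certified ε-approximation.

```python
from typing import Dict, List, Tuple

# Hypothetical encoding (not from the paper): an action is a list of
# (probability, counter_change, next_state) outcomes, counter_change in {-1, 0, +1}.
Outcome = Tuple[float, int, str]
OCMDP = Dict[str, Dict[str, List[Outcome]]]


def bounded_horizon_value(mdp: OCMDP, state: str, counter: int, horizon: int) -> float:
    """Minimal expected value of min(T, horizon), where T is the number of
    steps until the counter first reaches zero (assumed termination condition)."""
    # Within `horizon` steps the counter cannot exceed counter + horizon.
    max_counter = counter + horizon
    # values[s][c] is the optimal expected min(T, k) after k iterations; k = 0 is all zeros.
    # One extra slot lets c + 1 be indexed at the top; entries near the top are
    # not exact, but they cannot influence the requested configuration.
    values = {s: [0.0] * (max_counter + 2) for s in mdp}
    for _ in range(horizon):
        new_values = {s: [0.0] * (max_counter + 2) for s in mdp}
        for s, actions in mdp.items():
            for c in range(1, max_counter + 1):  # counter 0 has terminated: cost 0
                new_values[s][c] = 1.0 + min(
                    sum(p * values[t][c + d] for p, d, t in outcomes)
                    for outcomes in actions.values()
                )
        values = new_values
    return values[state][counter]


# Toy instance with one control state and two actions (illustrative only).
toy: OCMDP = {
    "s": {
        "slow": [(0.6, -1, "s"), (0.4, +1, "s")],
        "fast": [(0.9, -1, "s"), (0.1, +1, "s")],
    }
}

if __name__ == "__main__":
    for k in (1, 5, 20, 200):
        print(k, bounded_horizon_value(toy, "s", 1, k))
```

In this toy instance the strategy that always plays "fast" has expected termination time 1/(0.9 - 0.1) = 1.25 from counter value 1, and the computed bound approaches that number as the horizon grows. In general the sketch gives no error guarantee for a fixed horizon, which is the gap that the approximation results of the paper address.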

The full version of this paper can be found at http://arxiv.org/abs/1205.1473.

Author information

Authors and Affiliations

Faculty of Informatics, Masaryk University, Czech Republic
Tomáš Brázdil, Antonín Kučera & Petr Novotný
Department of Computer Science, University of Liverpool, UK
Dominik Wojtczak

Editor information

Editors and Affiliations

Department of Computer Science and Centre for Discrete Mathematics and its Applications, University of Warwick, Warwick, UK
Artur Czumaj
Max-Planck-Institut für Informatik, Saarbrücken, Germany
Kurt Mehlhorn
Computer Laboratory, University of Cambridge, UK
Andrew Pitts
ETH Zurich, Switzerland
Roger Wattenhofer

Cite this paper

Brázdil, T., Kučera, A., Novotný, P., Wojtczak, D. (2012). Minimizing Expected Termination Time in One-Counter Markov Decision Processes. In: Czumaj, A., Mehlhorn, K., Pitts, A., Wattenhofer, R. (eds) Automata, Languages, and Programming. ICALP 2012. Lecture Notes in Computer Science, vol 7392. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-31585-5_16

DOI: https://doi.org/10.1007/978-3-642-31585-5_16
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-31584-8
Online ISBN: 978-3-642-31585-5
eBook Packages: Computer Science, Computer Science (R0)
