Two Person Zero-Sum Semi-Markov Games with Unknown Holding Times Distribution on One Side: A Discounted Payoff Criterion

Minjárez-Sosa, J. Adolfo; Luque-Vásquez, Fernando

doi:10.1007/s00245-007-9016-7

Two Person Zero-Sum Semi-Markov Games with Unknown Holding Times Distribution on One Side: A Discounted Payoff Criterion

Published: 05 September 2007

Volume 57, pages 289–305, (2008)
Cite this article

Applied Mathematics and Optimization Submit manuscript

J. Adolfo Minjárez-Sosa¹ &
Fernando Luque-Vásquez¹

100 Accesses
13 Citations
Explore all metrics

Abstract

This paper deals with two person zero-sum semi-Markov games with a possibly unbounded payoff function, under a discounted payoff criterion. Assuming that the distribution of the holding times H is unknown for one of the players, we combine suitable methods of statistical estimation of H with control procedures to construct an asymptotically discount optimal pair of strategies.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Zero-sum semi-Markov games with state-action-dependent discount factors

Article 05 November 2022

Zero-Sum Markov Games with Random State-Actions-Dependent Discount Factors: Existence of Optimal Strategies

Article 03 March 2018

Semi-stationary Equilibrium Strategies in Non-cooperative N-person Semi-Markov Games

References

Bhattacharya, R.N., Majumdar, M.: Controlled semi-Markov models—the discounted case. J. Stat. Plann. Inference 21, 365–381 (1989)
Article MathSciNet MATH Google Scholar
Gordienko, E.I., Minjárez-Sosa, J.A.: Adaptive control for discrete-time Markov processes with unbounded costs: discounted criterion. Kybernetika 34, 217–234 (1998)
MathSciNet Google Scholar
Guo, X.P., Hernández-Lerma, O.: Zero-sum games for continuous-time Markov chains with unbounded transition and average payoff rates. J. Appl. Probab. 40, 327–345 (2003)
Article MathSciNet MATH Google Scholar
Guo, X.P., Hernández-Lerma, O.: Zero-sum continuous-time Markov games with unbounded transition and discounted payoffs. Bernoulli 11, 1009–1029 (2005)
Article MathSciNet MATH Google Scholar
Guo, X.P., Hernández-Lerma, O.: Nonzero-sum games for continuous-time Markov chains with unbounded payoffs. J. Appl. Probab. 42, 303–320 (2005)
Article MathSciNet MATH Google Scholar
Hernández-Lerma, O., Lasserre, J.B.: Discrete-Time Markov Control Processes: Basic Optimality Criteria. Springer, New York (1996)
Google Scholar
Hernández-Lerma, O., Lasserre, J.B.: Further Topics on Discrete-Time Markov Control Processes. Springer, New York (1999)
MATH Google Scholar
Hasminskii, R., Ibragimov, I.: On density estimation in the view of Kolmogorov’s ideas in approximation theory. Ann. Stat. 18, 999–1010 (1990)
Article MathSciNet Google Scholar
Hilgert, N., Minjárez-Sosa, J.A.: Adaptive policies for time-varying stochastic systems under discounted criterion. Math. Methods Oper. Res. 54, 491–505 (2001)
Article MathSciNet MATH Google Scholar
Jaskiewicz, A.: Zero-sum semi-Markov games. SIAM J. Control Optim. 41, 723–739 (2002)
Article MathSciNet MATH Google Scholar
Lal, A.K., Sinha, S.: Zero-sum two person semi-Markov games. J. Appl. Probab. 29, 56–72 (1992)
Article MathSciNet MATH Google Scholar
Luque-Vásquez, F., Robles-Alcaraz, M.T.: Controlled semi-Markov models with discounted unbounded costs. Bol. Soc. Mat. Mexicana 39, 51–68 (1994)
MathSciNet MATH Google Scholar
Lippman, S.A.: Semi-Markov decision processes with unbounded rewards. Manag. Sci. 19, 717–731 (1973)
Article MathSciNet MATH Google Scholar
Lippman, S.A.: On dynamic programming with unbounded rewards. Manag. Sci. 21, 1225–1233 (1975)
MathSciNet MATH Google Scholar
Luque-Vásquez, F.: Zero-sum semi-Markov games in Borel spaces: discounted and average payoff. Bol. Soc. Mat. Mexicana 8, 227–241 (2002)
MathSciNet MATH Google Scholar
Luque-Vásquez, F., Minjárez-Sosa, J.A.: Semi-Markov control processes with unknown holding times distribution under a discounted criterion. Math. Methods Oper. Res. 61, 455–468 (2005)
Article MathSciNet MATH Google Scholar
Nowak, A.S.: Some remarks on equilibria in semi-Markov games. Appl. Math. (Warsaw) 27-4, 385–394 (2000)
Google Scholar
Polowczuk, W.: Nonzero semi-Markov games with countable state spaces. Appl. Math. (Warsaw) 27-4, 395–402 (2000)
MathSciNet Google Scholar
Rieder, U.: Measurable selection theorems for optimization problems. Manuscr. Math. 24, 115–131 (1978)
Article MathSciNet MATH Google Scholar
Ross, S.M.: Applied Probability Models with Optimization Applications. Holden-Day, San Francisco (1970)
MATH Google Scholar
Schäl, M.: Estimation and control in discounted stochastic dynamic programming. Stochastics 20, 51–131 (1987)
MathSciNet MATH Google Scholar
Shapley, L.: Stochastic games. Proc. Natl. Acad. Sci. U.S.A. 39, 1095–1100 (1953)
Article MathSciNet MATH Google Scholar
Vega-Amaya, O.: Average optimality in semi-Markov control models on Borel spaces: unbounded costs and controls. Bol. Soc. Mat. Mexicana 38, 47–60 (1993)
MathSciNet MATH Google Scholar
Vega-Amaya, O.: Zero-sum semi-Markov games: fixed point solutions of the Shapley equation. SIAM J. Control Optim. 42-5, 1876–1894 (2003)
Article MathSciNet Google Scholar

Download references

Author information

Authors and Affiliations

Departamento de Matemáticas, Universidad de Sonora, Rosales s/n, Centro, 83000, Hermosillo, Sonora, Mexico
J. Adolfo Minjárez-Sosa & Fernando Luque-Vásquez

Authors

J. Adolfo Minjárez-Sosa
View author publications
You can also search for this author in PubMed Google Scholar
Fernando Luque-Vásquez
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to J. Adolfo Minjárez-Sosa.

Additional information

Work supported partially by Consejo Nacional de Ciencia y Tecnología (CONACyT) under Grant 46633-F.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Minjárez-Sosa, J.A., Luque-Vásquez, F. Two Person Zero-Sum Semi-Markov Games with Unknown Holding Times Distribution on One Side: A Discounted Payoff Criterion. Appl Math Optim 57, 289–305 (2008). https://doi.org/10.1007/s00245-007-9016-7

Download citation

Published: 05 September 2007
Issue Date: June 2008
DOI: https://doi.org/10.1007/s00245-007-9016-7

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Two Person Zero-Sum Semi-Markov Games with Unknown Holding Times Distribution on One Side: A Discounted Payoff Criterion

Abstract

Access this article

Similar content being viewed by others

Zero-sum semi-Markov games with state-action-dependent discount factors

Zero-Sum Markov Games with Random State-Actions-Dependent Discount Factors: Existence of Optimal Strategies

Semi-stationary Equilibrium Strategies in Non-cooperative N-person Semi-Markov Games

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Two Person Zero-Sum Semi-Markov Games with Unknown Holding Times Distribution on One Side: A Discounted Payoff Criterion

Abstract

Access this article

Similar content being viewed by others

Zero-sum semi-Markov games with state-action-dependent discount factors

Zero-Sum Markov Games with Random State-Actions-Dependent Discount Factors: Existence of Optimal Strategies

Semi-stationary Equilibrium Strategies in Non-cooperative N-person Semi-Markov Games

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation