Abstract
An adaptation of the temporal difference method TD(λ > 0) to reinforcement learning algorithms with fuzzy approximation of the action-value function is proposed. Eligibility traces are updated using the normalized degrees of activation of the fuzzy rules. Two types of fuzzy reinforcement learning algorithm are formulated: one with discrete and one with continuous action values. The new algorithms are tested in the control of two typical continuous plants, the ball-and-beam and the cart-pole system. The results are compared with two popular reinforcement learning algorithms that use CMAC and table approximation of the action-value function.
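The core idea — eligibility traces that grow by the normalized activation degree of each fuzzy rule — can be illustrated with a minimal sketch for the discrete-action case. The triangular membership functions, rule centers, learning parameters, and function names below are illustrative assumptions, not the paper's actual setup:

```python
import numpy as np

# Hypothetical 1-D state space covered by triangular fuzzy sets
CENTERS = np.array([0.0, 0.5, 1.0])   # rule centers (illustrative)
WIDTH = 0.5                           # half-width of each triangular set
N_ACTIONS = 2

def activations(s):
    """Normalized degrees of activation of the fuzzy rules for state s."""
    mu = np.maximum(0.0, 1.0 - np.abs(s - CENTERS) / WIDTH)
    return mu / mu.sum()

def fuzzy_q(q, s):
    """Action values: activation-weighted sum of per-rule action values."""
    return activations(s) @ q

def qlambda_step(q, e, s, a, r, s_next, alpha=0.1, gamma=0.9, lam=0.8):
    """One fuzzy Q(lambda) update: traces grow by normalized activations."""
    phi = activations(s)
    # TD error with a greedy backup (Q-learning style)
    delta = r + gamma * fuzzy_q(q, s_next).max() - fuzzy_q(q, s)[a]
    e *= gamma * lam          # decay all eligibility traces
    e[:, a] += phi            # rules that fired accumulate trace for action a
    q += alpha * delta * e    # TD(lambda) update of per-rule action values
    return q, e

q = np.zeros((len(CENTERS), N_ACTIONS))   # per-rule action values
e = np.zeros_like(q)                      # eligibility traces
q, e = qlambda_step(q, e, s=0.3, a=0, r=1.0, s_next=0.6)
```

Because the activations are normalized before being added to the traces, rules that contribute more to the current fuzzy inference receive proportionally more of the TD-error credit.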
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Zajdel, R. (2010). Fuzzy Q(λ)-Learning Algorithm. In: Rutkowski, L., Scherer, R., Tadeusiewicz, R., Zadeh, L.A., Zurada, J.M. (eds) Artificial Intelligence and Soft Computing. ICAISC 2010. Lecture Notes in Computer Science, vol 6113. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-13208-7_33
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-13207-0
Online ISBN: 978-3-642-13208-7
eBook Packages: Computer Science (R0)