Adaptive Value Function Approximations in Classifier Systems

Booker, Lashon B.

doi:10.1007/978-3-540-71231-2_15

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 4399))

Abstract

Previous work [1] introduced a new approach to value function approximation in classifier systems called hyperplane coding. Hyperplane coding is a closely related variation of tile coding [13] in which classifier rule conditions fill the role of tiles, and there are few restrictions on the way those “tiles” are organized. Experiments with hyperplane coding have shown that, given a relatively small population of random classifiers, it computes much better approximations than more conventional classifier system methods in which individual rules compute approximations independently. The obvious next step in this line of research is to use the approximation resources available in a random population as a starting point for a more refined approach to approximation that re-allocates resources adaptively to gain greater precision in those regions of the input space where it is needed. This paper shows how to compute such an adaptive function approximation.
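The tile-coding analogy in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation: it assumes binary input strings, ternary rule conditions over {0, 1, #} where # matches either bit, a prediction formed by summing the weights of all matching classifiers, and a simple delta-rule update shared equally among them. The names `random_condition`, `matches`, and `HyperplaneApproximator` are hypothetical, and the adaptive re-allocation step the paper proposes is not shown.

```python
import random


def random_condition(n_bits, p_wild, rng):
    """Draw a ternary condition over {'0', '1', '#'}; '#' is a wildcard (assumed encoding)."""
    return "".join(
        "#" if rng.random() < p_wild else rng.choice("01") for _ in range(n_bits)
    )


def matches(condition, state):
    """A condition matches a state when every non-wildcard position agrees."""
    return all(c == "#" or c == s for c, s in zip(condition, state))


class HyperplaneApproximator:
    """Sketch: classifier conditions act as overlapping 'tiles' with one weight each."""

    def __init__(self, conditions, step_size=0.1):
        self.conditions = list(conditions)
        self.weights = [0.0] * len(self.conditions)
        self.step_size = step_size

    def active(self, state):
        """Indices of the classifiers whose conditions match the state."""
        return [i for i, c in enumerate(self.conditions) if matches(c, state)]

    def predict(self, state):
        """The approximation is the sum of the weights of all matching classifiers."""
        return sum(self.weights[i] for i in self.active(state))

    def update(self, state, target):
        """Delta-rule step (an assumption): split the correction among matching rules."""
        idx = self.active(state)
        if not idx:
            return 0.0  # no classifier covers this state, so nothing to adjust
        error = target - self.predict(state)
        delta = self.step_size * error / len(idx)
        for i in idx:
            self.weights[i] += delta
        return error


if __name__ == "__main__":
    rng = random.Random(0)
    # A small random population, as in the setting the abstract describes.
    population = [random_condition(6, 0.5, rng) for _ in range(40)]
    approx = HyperplaneApproximator(population, step_size=0.2)
    # Toy target (illustrative only): the number of 1 bits in the input.
    for _ in range(5000):
        state = "".join(rng.choice("01") for _ in range(6))
        approx.update(state, float(state.count("1")))
    print(approx.predict("111000"), approx.predict("000000"))
```

Because every matching classifier contributes to the prediction, rules with many wildcards generalize broadly while more specific rules refine the estimate locally. This contrasts with the conventional arrangement mentioned in the abstract, in which each rule computes its approximation independently.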

References

1. Booker, L.B.: Approximating value functions in classifier systems. In: Bull, L., Kovacs, T. (eds.) Foundations of Learning Classifier Systems, Springer, Heidelberg (2005)
2. Booker, L.B., Goldberg, D.E., Holland, J.H.: Classifier Systems and Genetic Algorithms. Artificial Intelligence 40, 235–282 (1989)
3. Bull, L., O’Hara, T.: Accuracy-based neuro and neuro-fuzzy classifier systems. In: Langdon, W.B., et al. (eds.) GECCO 2002: Proceedings of the Genetic and Evolutionary Computation Conference, 9-13 July 2002, pp. 905–911. Morgan Kaufmann, San Francisco (2002)
4. Butz, M.V., Wilson, S.W.: An Algorithmic Description of XCS. In: Lanzi, P.L., Stolzmann, W., Wilson, S.W. (eds.) IWLCS 2000. LNCS (LNAI), vol. 1996, pp. 253–272. Springer, Heidelberg (2001)
5. Chen, F., Lambert, D., Pinheiro, J.C.: Incremental quantile estimation for massive tracking. In: Proceedings of ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 516–522. ACM Press, New York (2000)
6. Donoho, D.L., Johnstone, I.M.: Ideal spatial adaptation by wavelet shrinkage. Biometrika 81, 425–455 (1994)
7. Faloutsos, C.: Gray codes for partial match and range queries. IEEE Transactions on Software Engineering 14(10), 1381–1393 (1988)
8. Hinton, G.E., McClelland, J.L., Rumelhart, D.E.: Distributed representations. In: Rumelhart, D.E., McClelland, J.L., PDP Research Group (eds.) Parallel distributed processing: explorations in the microstructure of cognition, vol. 1: foundations, pp. 77–109. MIT Press, Cambridge (1986)
9. Iglewicz, B., Hoaglin, D.C.: How to Detect and Handle Outliers. In: American Society for Quality Control Basic References in Quality Control: Statistical Techniques (vol. 16), ASQC Quality Press, Milwaukee (1993)
10. Miller, W.T., Glanz, F.H., Kraft, L.G.: CMAC: An associative neural network alternative to backpropagation. Proceedings of the IEEE 78(10), 1561–1567 (1990)
11. Singh, S.P., Jaakkola, T., Jordan, M.I.: Reinforcement learning with soft state aggregation. In: Tesauro, G., Touretzky, D., Leen, T. (eds.) Advances in Neural Information Processing Systems, vol. 7, pp. 361–368. MIT Press, Cambridge (1995)
12. Sutton, R.S.: Adapting bias by gradient descent: An incremental version of delta-bar-delta. In: Proceedings of the Tenth National Conference on Artificial Intelligence, pp. 171–176 (1992)
13. Sutton, R.S., Barto, A.G.: Introduction to Reinforcement Learning. MIT Press, Cambridge (1998)
14. Sutton, R.S., Whitehead, S.D.: Online Learning with Random Representations. In: Machine Learning: Proceedings of the Tenth International Conference, pp. 314–321. Morgan Kaufmann, San Mateo (1993)
15. Venturini, G.: Apprentissage Adaptatif et Apprentissage Supervisé par Algorithme Génétique. PhD thesis, Université de Paris-Sud (1994)
16. Wilson, S.W.: Classifiers that approximate functions. Natural Computing 1(2-3), 211–234 (2002)

Author information

Authors and Affiliations

The MITRE Corporation, 7515 Colshire Drive, McLean, VA 22102-7508, USA
Lashon B. Booker

Editor information

Tim Kovacs, Xavier Llorà, Keiki Takadama, Pier Luca Lanzi, Wolfgang Stolzmann, Stewart W. Wilson

About this paper

Cite this paper

Booker, L.B. (2007). Adaptive Value Function Approximations in Classifier Systems. In: Kovacs, T., Llorà, X., Takadama, K., Lanzi, P.L., Stolzmann, W., Wilson, S.W. (eds) Learning Classifier Systems. IWLCS 2003, IWLCS 2004, IWLCS 2005. Lecture Notes in Computer Science, vol 4399. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-71231-2_15

DOI: https://doi.org/10.1007/978-3-540-71231-2_15
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-71230-5
Online ISBN: 978-3-540-71231-2
eBook Packages: Computer Science, Computer Science (R0)
