YACS: Combining Dynamic Programming with Generalization in Classifier Systems

Gérard, Pierre; Sigaud, Olivier

doi:10.1007/3-540-44640-0_5

Pierre Gérard^4,5 &
Olivier Sigaud⁴

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 1996))

Included in the following conference series:

International Workshop on Learning Classifier Systems

542 Accesses
17 Citations

Abstract

This paper describes our work on the use of anticipation in Learning Classifier Systems (LCS) applied to Markov problems. We present YACS¹, a new kind of Anticipatory Classifier System. It calls upon classifiers with a [Condition], an [Action] and an [Effect] part. As in the traditional LCS framework, the classifier discovery process relies on a selection and a creation mechanism. As in the Anticipatory Classifier System (ACS), YACS looks for classifiers which anticipate well rather than for classifiers which propose an optimal action. The creation mechanism does not rely on classical genetic operators but on a specialization operator, which is explicitly driven by experience. Likewise, the action qualities of the classifiers are not computed by a classical bucket-brigade algorithm, but by a variety of the value iteration algorithm that takes advantage of the effect part of the classifiers.

This paper presents the latent learning process of YACS. The description of the reinforcement learning process is focussed on the problem induced by the joint use of generalization and dynamic programming methods.

YACS stands for “Yet Another Classifier System”

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Bellman, R. E. (1957). Dynamic Programming. Princeton University Press, Princeton, NJ.
Google Scholar
Booker, L., Goldberg, D. E., and Holland, J. H. (1989). Classifier systems and genetic algorithms. Artificial Intelligence, 40(1–3):235–282.
Article Google Scholar
Butz, M. V., Goldberg, D. E., and Stolzmann, W. (2000a). Introducing a genetic generalization pressure to the anticipatory classifier system part i: Theoretical approach. In Proceedings of the 2000 Genetic and Evolutionary Computation Conference (GECCO 2000).
Google Scholar
Butz, M. V., Goldberg, D. E., and Stolzmann, W. (2000b). Investigating generalization in the anticipatory classifier system. In Proceedings of the Sixth International Conference on Parallel Problem Solving from Nature.
Google Scholar
Butz, M. V. and Stolzmann, W. (1999). Action-planning in anticipatory classifier sytems. In Proceedings of the 1999 Genetic and Evolutionary Computation Conference Workshop Program.
Google Scholar
Cliff, D. and Ross, S. (1994). Adding memory to ZCS. Adaptive Behavior, 3(2):101–150.
Article Google Scholar
Dorigo, M. (1994). Genetic and non-genetic operators in alecsys. Evolutionary Computation, 1(2):151–164.
Article Google Scholar
Goldberg, D. E. (1989). Genetic Algorithms in Search, Optimization, and Machine Learning. Addison Wesley.
Google Scholar
Holland, J. H., Holyoak, K. J., Nisbett, R. E., and Thagard, P. R. (1986). Induction. MIT Press.
Google Scholar
Lanzi, P. L. (1998). Adding memory to XCS. In Proceedings of the IEEE Conference on Evolutionary Computation (ICEC98). IEEE Press.
Google Scholar
Lanzi, P. L. (1999). An analysis of generalization in the XCS classifier system. Evolutionary Computation, 2(7):125–149.
Article Google Scholar
Lanzi, P. L. (2000). Toward optimal performance in classifier systems. Evolutionary Computation Journal. in print.
Google Scholar
McCallum, R. A. (1996). Learning to use selective attention and short-term memory. In Maes, P., Mataric, M., Meyer, J.-A., Pollack, J., and Wilson, S. W., (Eds.), Proceedings of the Fourth International Conference on Simulation of Adaptive Behavior, pages 315–324, Cambridge, MA. MIT Press.
Google Scholar
Riolo, R. L. (1991). Lookahead planning and latent learning in a classifier system. In Meyer, J.-A. and Wilson, S. W., (Eds.), From annimals to animats: Proceedings of the First International Conference on Simulation of Adaptative Behavior, pages 316–326, Cambridge, MA. MIT Press.
Google Scholar
Sigaud, O. (2000). Using classifier systems as adaptive expert systems for control. In Stolzmann, W., Lanzi, P.-L., and Wilson, S. W., (Eds.), LNCS: New trends in Classifier Systems. Springer-Verlag.
Google Scholar
Stolzmann, W. (1998). Anticipatory classifier systems. In Koza, J., Banzhaf, W., Chellapilla, K., Deb, K., Dorigo, M., Fogel, D., Garzon, M., Goldberg, D., Iba, H., and Riolo, R., (Eds.), Genetic Programming. Morgan Kaufmann Publishers, Inc., San Francisco, CA.
Google Scholar
Stolzmann, W. (1999). Latent learning in khepera robots with anticipatory classifier systems. In Proceedings of the 1999 Genetic and Evolutionary Computation Conference Workshop Program.
Google Scholar
Sutton, R. S. and Barto, A. (1998). Reinforcement Learning: An Introduction. MIT Press.
Google Scholar
Watkins, C. J. (1989). Learning with delayed rewards. PhD thesis, Psychology Department, University of Cambridge, England.
Google Scholar
Wilson, S. W. (1994). ZCS, a zeroth level classifier system. Evolutionary Computation, 2(1):1–18.
Article Google Scholar
Wilson, S. W. (1995). Classifier fitness based on accuracy. Evolutionary Computation, 3(2):149–175.
Article Google Scholar
Witkowski, C. M. (1999). Integrating unsupervised learning, motivation and action selection in an a-life agent. In Floreano, D., Mondada, F., and Nicoud, J.-D., (Eds.), 5th European Conference on Artificial Life (ECAL-99), pages 355–364, Lausanne. Springer.
Google Scholar

Download references

Author information

Authors and Affiliations

Dassault Aviation, DGT/DPR/ESA, 78, Quai Marcel Dassault, 92552, St-Cloud Cedex
Pierre Gérard & Olivier Sigaud
AnimatLab (LIP6), 8, rue du capitaine Scott, 75015, PARIS
Pierre Gérard

Authors

Pierre Gérard
View author publications
You can also search for this author in PubMed Google Scholar
Olivier Sigaud
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Politecnico di Milano Dipartimento di Elettronica e Informazione, Artificial Intelligence and Robotics Laboratory, Piazza Leonardo da Vinci 32, 20133, Milan, Italy
Pier Luca Lanzi
DaimlerChrysler AG Research and Technology,Cognition and Robotics, Alt-Moabit 96A, 10559, Berlin, Germany
Wolfgang Stolzmann
Prediction Dynamics, Concord, MA 01742, USA
Stewart W. Wilson

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Gérard, P., Sigaud, O. (2001). YACS: Combining Dynamic Programming with Generalization in Classifier Systems. In: Luca Lanzi, P., Stolzmann, W., Wilson, S.W. (eds) Advances in Learning Classifier Systems. IWLCS 2000. Lecture Notes in Computer Science(), vol 1996. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44640-0_5

Download citation

DOI: https://doi.org/10.1007/3-540-44640-0_5
Published: 24 August 2001
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-42437-6
Online ISBN: 978-3-540-44640-8
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics