Crossbar Adaptive Array: The first connectionist network that solved the delayed reinforcement learning problem

Bozinovski, S.

doi:10.1007/978-3-7091-6384-9_54

S. Bozinovski⁵

630 Accesses
8 Citations

Abstract

The paper discusses important issues for reinforcement learning agents, the issue of delayed reinforcement learning (DRL). It points out that an early agent, the Crossbar Adaptive Array (CAA) architecture, not widely known in connectionist and reinforcement learning community, was the first to solve the DRL problem among connectionist agents. The work contributes toward understanding the initial neuron-like computational efforts to solve the DRL problem, giving a comparison between CAA and the well-known Actor/Critic (AC) architecture. It also points out relevant contemporary issues of autonomous agents, the issue of genetic/behavioral environment and the issue of emotion based learning architectures.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Keller, F., Schoenfeld, W.: Principles of Psychology. Appleton-Century-Croffts, 1950.
Google Scholar
Bellman R.: Dynamic Programming. Princeton University Press 1957
Google Scholar
Minsky, M.: Steps toward artificial intelligence, Proceedings of the IRE, pp. 8–30, 1961
Google Scholar
Michie, D., Chambers, R.: BOXES: An experiment in adaptive control. In E. Dale, and D. Michie, eds. Machine Intelligence 2: pp. 137–152. Oliver and Boyd, 1968
Google Scholar
Bozinovski S.: Inverted pendulum learning control. ANW Memo, December 10, COINS Department, University of Massachusetts, Amherst, 1981
Google Scholar
Bozinovski S. A self-learning system using secondary reinforcement. Published Abstracts of the Sixth European Meeting on Cybernetics and Systems, Vienna, April 1982a
Google Scholar
Bozinovski, S. A self-learning system using secondary reinforcement. In R. Trappl, ed. Cybernetics and Systems Research, pp. 397–402, North Holland. 1982b
Google Scholar
Bozinovski, S.; Anderson C: Associative memory as a controller of an unstable system: Simulation of a learning control. In Proceedings of the IEEE Mediterranean Electrotechnical Conference, C5.11, Athens, Greece, 1983
Google Scholar
Barto, A.; Sutton, R.; Anderson, C: Neuronlike elements that can solve difficult learning control problems. IEEE Trans. Systems, Man, and Cybernetics 13: pp. 834–846, 1983.
Google Scholar
Rumelhart D., McClelland J., and the PDP group: Parallel Distributed Processing, MIT Press, 1986
Google Scholar
Watkins, C: Learning from Delayed Rewards. Ph. D. Thesis, Kings College, Cambridge, England, 1989
Google Scholar
Bozinovski, S.: Consequence Driven Systems, Go-cmar Press 1995
Google Scholar
Barto A.: Reinforcement learning, In O. Omidvar and D. Elliot (Eds.) Neural Systems for Control, pp. 7–29, Academic Press 1997
Google Scholar

Download references

Author information

Authors and Affiliations

Electrical Engineering Faculty, University of Skopje, Karpos II, 91000, Skopje, Macedonia
S. Bozinovski

Authors

S. Bozinovski
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Bozinovski, S. (1999). Crossbar Adaptive Array: The first connectionist network that solved the delayed reinforcement learning problem. In: Artificial Neural Nets and Genetic Algorithms. Springer, Vienna. https://doi.org/10.1007/978-3-7091-6384-9_54

Download citation

DOI: https://doi.org/10.1007/978-3-7091-6384-9_54
Publisher Name: Springer, Vienna
Print ISBN: 978-3-211-83364-3
Online ISBN: 978-3-7091-6384-9
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics