Abstract
The paper discusses important issues for reinforcement learning agents, the issue of delayed reinforcement learning (DRL). It points out that an early agent, the Crossbar Adaptive Array (CAA) architecture, not widely known in connectionist and reinforcement learning community, was the first to solve the DRL problem among connectionist agents. The work contributes toward understanding the initial neuron-like computational efforts to solve the DRL problem, giving a comparison between CAA and the well-known Actor/Critic (AC) architecture. It also points out relevant contemporary issues of autonomous agents, the issue of genetic/behavioral environment and the issue of emotion based learning architectures.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Keller, F., Schoenfeld, W.: Principles of Psychology. Appleton-Century-Croffts, 1950.
Bellman R.: Dynamic Programming. Princeton University Press 1957
Minsky, M.: Steps toward artificial intelligence, Proceedings of the IRE, pp. 8–30, 1961
Michie, D., Chambers, R.: BOXES: An experiment in adaptive control. In E. Dale, and D. Michie, eds. Machine Intelligence 2: pp. 137–152. Oliver and Boyd, 1968
Bozinovski S.: Inverted pendulum learning control. ANW Memo, December 10, COINS Department, University of Massachusetts, Amherst, 1981
Bozinovski S. A self-learning system using secondary reinforcement. Published Abstracts of the Sixth European Meeting on Cybernetics and Systems, Vienna, April 1982a
Bozinovski, S. A self-learning system using secondary reinforcement. In R. Trappl, ed. Cybernetics and Systems Research, pp. 397–402, North Holland. 1982b
Bozinovski, S.; Anderson C: Associative memory as a controller of an unstable system: Simulation of a learning control. In Proceedings of the IEEE Mediterranean Electrotechnical Conference, C5.11, Athens, Greece, 1983
Barto, A.; Sutton, R.; Anderson, C: Neuronlike elements that can solve difficult learning control problems. IEEE Trans. Systems, Man, and Cybernetics 13: pp. 834–846, 1983.
Rumelhart D., McClelland J., and the PDP group: Parallel Distributed Processing, MIT Press, 1986
Watkins, C: Learning from Delayed Rewards. Ph. D. Thesis, Kings College, Cambridge, England, 1989
Bozinovski, S.: Consequence Driven Systems, Go-cmar Press 1995
Barto A.: Reinforcement learning, In O. Omidvar and D. Elliot (Eds.) Neural Systems for Control, pp. 7–29, Academic Press 1997
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 1999 Springer-Verlag Wien
About this paper
Cite this paper
Bozinovski, S. (1999). Crossbar Adaptive Array: The first connectionist network that solved the delayed reinforcement learning problem. In: Artificial Neural Nets and Genetic Algorithms. Springer, Vienna. https://doi.org/10.1007/978-3-7091-6384-9_54
Download citation
DOI: https://doi.org/10.1007/978-3-7091-6384-9_54
Publisher Name: Springer, Vienna
Print ISBN: 978-3-211-83364-3
Online ISBN: 978-3-7091-6384-9
eBook Packages: Springer Book Archive