Non-analytical Reasoning Assisted Deep Reinforcement Learning

Schonefeld, John; Karim, Md

doi:10.1007/978-3-031-06527-9_32

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 13259)

Included in the following conference series:

International Work-Conference on the Interplay Between Natural and Artificial Computation

Abstract

Using human-supplied external knowledge or reasoning to address the sparse reward problem in Deep Reinforcement Learning (DRL) is common practice. Such external knowledge and reasoning should not be so complete that the DRL agent needs almost no exploration, which would call its utility into question. Non-analytical Reasoning could shape an agent’s actions sufficiently while taking minimal credit away from the DRL exploration process. We generalize the solution approaches to Non-analytical Reasoning Assisted Deep Reinforcement Learning and present an example solution to “Montezuma’s Revenge,” a notorious Atari game, applying such reasoning.

This material is based upon work supported by the National Science Foundation under Award No. OIA-1946391. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the National Science Foundation.

Author information

Authors and Affiliations

Southern Arkansas University, Magnolia, AR, 71753, USA
John Schonefeld & Md Karim

Corresponding author

Correspondence to John Schonefeld.

Editor information

Editors and Affiliations

Universidad Politécnica de Cartagena, Cartagena, Spain
José Manuel Ferrández Vicente
Universidad Nacional de Educación a Distancia, Madrid, Spain
José Ramón Álvarez-Sánchez
Universidad Nacional de Educación a Distancia, Madrid, Spain
Félix de la Paz López
Ohio State University, Columbus, OH, USA
Hojjat Adeli

About this paper

Cite this paper

Schonefeld, J., Karim, M. (2022). Non-analytical Reasoning Assisted Deep Reinforcement Learning. In: Ferrández Vicente, J.M., Álvarez-Sánchez, J.R., de la Paz López, F., Adeli, H. (eds) Bio-inspired Systems and Applications: from Robotics to Ambient Intelligence. IWINAC 2022. Lecture Notes in Computer Science, vol 13259. Springer, Cham. https://doi.org/10.1007/978-3-031-06527-9_32

DOI: https://doi.org/10.1007/978-3-031-06527-9_32
Published: 24 May 2022
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-06526-2
Online ISBN: 978-3-031-06527-9
eBook Packages: Computer Science, Computer Science (R0)
