Curiosity and Boredom Based on Prediction Error as Novel Internal Rewards

Yamamoto, Naoyuki; Ishikawa, Masumi

doi:10.1007/978-3-642-04025-2_8


Part of the book series: Studies in Computational Intelligence (SCI, volume 266)


Abstract

This paper proposes two internal reward models, curiosity and boredom. Experiments on a maze navigation task demonstrated that, with appropriate parameter values, the proposed rewards simultaneously improved the performance of the predictor of the environment and increased the external rewards compared with conventional reinforcement learning. The conclusions also discuss the relation between the proposed method and active learning, diversive curiosity, and specific curiosity.
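The abstract does not state the reward formulas, so the following is only an illustrative sketch of how prediction-error-based internal rewards might be combined with an external reward. It is not the authors' formulation. The function names, the linear curiosity term, the thresholded boredom penalty, and all parameter values are assumptions made for illustration.

```python
def internal_reward(pred_error, curiosity_gain=1.0,
                    boredom_threshold=0.05, boredom_penalty=0.1):
    """Hypothetical internal reward computed from a predictor's error.

    Assumed form (not taken from the chapter):
      - curiosity: reward grows linearly with the prediction error
      - boredom: a fixed penalty when the error falls below a threshold,
        i.e. when the situation has become too predictable
    """
    curiosity = curiosity_gain * pred_error
    boredom = -boredom_penalty if pred_error < boredom_threshold else 0.0
    return curiosity + boredom


def total_reward(external_reward, pred_error, **params):
    """Reward passed to the learner: external plus assumed internal reward."""
    return external_reward + internal_reward(pred_error, **params)
```

Under these assumptions, an agent in a well-predicted state receives a small negative internal reward and is nudged toward states where the predictor still makes errors, which is one way the parameter values mentioned in the abstract could trade off predictor improvement against external reward.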



Author information

Authors and Affiliations

Department of Brain Science and Engineering, Graduate School of Life Science and Systems Engineering, Kyushu Institute of Technology, 2-4 Hibikino, Wakamatsu, Kitakyushu, 808-0196, Japan
Naoyuki Yamamoto & Masumi Ishikawa


Editor information

Editors and Affiliations

Department of Brain Science and Engineering, Graduate School of Life Science and System Engineering, Kyushu Institute of Technology, Hibikino 2-4, Wakamatsu, 808-0196, Kitakyushu, Fukuoka, Japan
Akitoshi Hanazawa, Tsutom Miki & Keiichi Horio

About this chapter

Cite this chapter

Yamamoto, N., Ishikawa, M. (2010). Curiosity and Boredom Based on Prediction Error as Novel Internal Rewards. In: Hanazawa, A., Miki, T., Horio, K. (eds) Brain-Inspired Information Technology. Studies in Computational Intelligence, vol 266. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-04025-2_8

DOI: https://doi.org/10.1007/978-3-642-04025-2_8
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-04024-5
Online ISBN: 978-3-642-04025-2
eBook Packages: Engineering, Engineering (R0)
