Trajectory Generation for a Mobile Robot by Reinforcement Learning

Shimizu, Masaki; Fujita, Makoto; Miyamoto, Hiroyuki

doi:10.1007/3-540-29344-2_18

Masaki Shimizu⁴,
Makoto Fujita⁴ &
Hiroyuki Miyamoto⁴

1038 Accesses

Summary

Q-learning in the Reinforcement Learning (RL) field is the powerful and attractive tool to make robots generate autonomous behavior. But it needs large amount of computational cost because of its discrete state and action. To generated smooth trajectory with less computational cost, we propose two ingredients for Q-learning. We applied Q-learning to the simulated two wheeled robot to generate trajectory for Ball-To-Goal task in robot soccer. …

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Hardcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

R.S. Sutton, A. Barto, “Reinforcement Learning:An Introduction”, MIT Press, 1998.
Google Scholar
J. Peng, R.J. Williams, “Incremental Multi-Step Q-Learning”, Machine Learning, Vol.22,pp283–290,1996.
Google Scholar
A. Sherstov, P. Stone, “On Continuous-Action Q-Learning via Tile Coding Function Approximation”, In Under Review., June 2004.
Google Scholar
S. Hagen, B. Kröse, “Neural Q-learning”, In Neural Computing &, Applications, 12(2), pages 81–88, November 2003.
Article Google Scholar
A. Barto, S. Mahadevan, “Recent Advances in Hierarchical Reinforcement Learning”, Discrete Event Dynamic Systems:Theory and Applications, 13,41–77,2003.
Article MathSciNet Google Scholar
Y. Takahashi, M. Asada, “Multi-Layered Learning Systems for Vision-Based Behavior Acquision of A Real Mobile Robot”, Proceedings of SICE Annual Conference 2003 in Fukui, Vol. CD-ROM, pp. 2937–2942, 2003.
Google Scholar

Download references

Author information

Authors and Affiliations

Kyushu Institute of Technology, Kitakyushu, Japan
Masaki Shimizu, Makoto Fujita & Hiroyuki Miyamoto

Authors

Masaki Shimizu
View author publications
You can also search for this author in PubMed Google Scholar
Makoto Fujita
View author publications
You can also search for this author in PubMed Google Scholar
Hiroyuki Miyamoto
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Dept. of Human and Artificial Intelligence Systems, University of Fukui, Bunkyo 3-9-1, 910-8507, Fukui, Japan
Kazuyuki Murase , Kosuke Sekiyama & Tomohide Naniwa , &
Dept. of Mechanical Engineering, Tokyo Metropolitan University, Minami-Osawa 1-1, 192-0397, Hachioji Tokyo, Japan
Naoyuki Kubota
Faculty of Information Technology School of Software Engineering & Data Communication, Queensland University of Technology, GPO Box 2434, 001 Brisbane, Australia
Joaquin Sitte

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Shimizu, M., Fujita, M., Miyamoto, H. (2006). Trajectory Generation for a Mobile Robot by Reinforcement Learning. In: Murase, K., Sekiyama, K., Naniwa, T., Kubota, N., Sitte, J. (eds) Proceedings of the 3rd International Symposium on Autonomous Minirobots for Research and Edutainment (AMiRE 2005). Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-29344-2_18

Download citation

DOI: https://doi.org/10.1007/3-540-29344-2_18
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-28496-3
Online ISBN: 978-3-540-29344-6
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics