Control of a Free-Falling Cat by Policy-Based Reinforcement Learning

Nakano, Daichi; Maeda, Shin-ichi; Ishii, Shin

doi:10.1007/978-3-642-33266-1_15

Daichi Nakano²¹,
Shin-ichi Maeda²¹ &
Shin Ishii²¹

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 7553))

Included in the following conference series:

International Conference on Artificial Neural Networks

3197 Accesses
2 Citations

Abstract

Autonomous control of nonholonomic systems is one big challenge, because there is no unified control method that can handle any nonholonomic systems even if the dynamics are known. To this challenge, in this study, we propose a reinforcement learning (RL) approach which enables the controller to acquire an appropriate control policy even without knowing the detailed dynamics. In particular, we focus on the control problem of a free-falling cat system whose dynamics are highly-nonlinear and nonholonomic. To accelerate the learning, we take the policy gradient method that exploits the basic knowledge of the system, and present an appropriate policy representation for the task. It is shown that this RL method achieves remarkably faster learning than that by the existing genetic algorithm-based method.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Nakamura, Y.: Nonholonomic robot systems, Part 1: what’s a nonholonomic robot? Journal of RSJ 11, 521–528 (1993)
Google Scholar
Brockett, R.W.: Asymptotic stability and feedback stabilization. Progress in Mathematics 27, 181–208 (1983)
MathSciNet Google Scholar
Mita, T.: Introduction to nonlinear control Theory-Skill control of underactuated robots. SHOKODO Co., Ltd. (2000) (in Japanese)
Google Scholar
Murray, R.M., Sastry, S.S.: Nonholonomic motion planning: steering using sinusoids. IEEE Transactions on Automatic Control 38, 700–716 (1993)
Article MATH MathSciNet Google Scholar
Holamoto, S., Funasako, T.: Feedback control of a planar space robot using a moving manifold. Journal of RSJ 25, 745–751 (1993)
Google Scholar
Peters, J., Schaal, S.: Reinforcement learning of motor skills with policy gradients. Neural Networks 21, 682–697 (2008)
Article Google Scholar
Miyamae, A., et al.: Instance-based policy learning by real-coded genetic algorithms and its application to control of nonholonomic systems. Transactions of the Japanese Society for Artificial Intelligence 24, 104–115 (2009)
Article Google Scholar
Tsuchiya, C., et al.: SLIP: A sophisticated learner for instance-based policy using hybrid GA. Transactions of SICE 42, 1344–1352 (2006)
Google Scholar
Nakamura, Y., Mukherjee, R.: Nonholonomic path planning of space robots via a bidirectional approach. IEEE Transactions on Robotics and Automation 7, 500–514 (1991)
Article Google Scholar
Baxter, J., Bartlett, P.L.: Infinite-horizon policy-gradient estimation. Journal of Artificial Intelligence Research 15, 319–350 (2001)
MATH MathSciNet Google Scholar
Ge, X., Chen, L.: Optimal control of nonholonomic motion planning for a free-falling cat. Applied Mathematics and Mechanics 28, 601–607 (2007)
Article MATH MathSciNet Google Scholar

Download references

Author information

Authors and Affiliations

Graduate School of Informatics, Kyoto University, Gokasho, Uji, Kyoto, 611-0011, Japan
Daichi Nakano, Shin-ichi Maeda & Shin Ishii

Authors

Daichi Nakano
View author publications
You can also search for this author in PubMed Google Scholar
Shin-ichi Maeda
View author publications
You can also search for this author in PubMed Google Scholar
Shin Ishii
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Neuro Heuristic Research Group, University of Lausanne, 1015, Lausanne, Switzerland
Alessandro E. P. Villa
Department of Informatics, Nicolaus Copernicus University, 87-100, Toruń, Poland
Włodzisław Duch
Center for Complex Systems Studies, Kalamazoo College, 49006, Kalamazoo, MI, USA
Péter Érdi
Dipartimento di Informatica e Scienze dell’Informazione, Università di Genova, 16146, Genoa, Italy
Francesco Masulli
Institut für Neuroinformatik, Universität Ulm, 89069, Ulm, Germany
Günther Palm

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Nakano, D., Maeda, Si., Ishii, S. (2012). Control of a Free-Falling Cat by Policy-Based Reinforcement Learning. In: Villa, A.E.P., Duch, W., Érdi, P., Masulli, F., Palm, G. (eds) Artificial Neural Networks and Machine Learning – ICANN 2012. ICANN 2012. Lecture Notes in Computer Science, vol 7553. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-33266-1_15

Download citation

DOI: https://doi.org/10.1007/978-3-642-33266-1_15
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-33265-4
Online ISBN: 978-3-642-33266-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics