Skip to main content

Control of a Free-Falling Cat by Policy-Based Reinforcement Learning

  • Conference paper
Artificial Neural Networks and Machine Learning – ICANN 2012 (ICANN 2012)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 7553))

Included in the following conference series:

Abstract

Autonomous control of nonholonomic systems is one big challenge, because there is no unified control method that can handle any nonholonomic systems even if the dynamics are known. To this challenge, in this study, we propose a reinforcement learning (RL) approach which enables the controller to acquire an appropriate control policy even without knowing the detailed dynamics. In particular, we focus on the control problem of a free-falling cat system whose dynamics are highly-nonlinear and nonholonomic. To accelerate the learning, we take the policy gradient method that exploits the basic knowledge of the system, and present an appropriate policy representation for the task. It is shown that this RL method achieves remarkably faster learning than that by the existing genetic algorithm-based method.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Nakamura, Y.: Nonholonomic robot systems, Part 1: what’s a nonholonomic robot? Journal of RSJ 11, 521–528 (1993)

    Google Scholar 

  2. Brockett, R.W.: Asymptotic stability and feedback stabilization. Progress in Mathematics 27, 181–208 (1983)

    MathSciNet  Google Scholar 

  3. Mita, T.: Introduction to nonlinear control Theory-Skill control of underactuated robots. SHOKODO Co., Ltd. (2000) (in Japanese)

    Google Scholar 

  4. Murray, R.M., Sastry, S.S.: Nonholonomic motion planning: steering using sinusoids. IEEE Transactions on Automatic Control 38, 700–716 (1993)

    Article  MATH  MathSciNet  Google Scholar 

  5. Holamoto, S., Funasako, T.: Feedback control of a planar space robot using a moving manifold. Journal of RSJ 25, 745–751 (1993)

    Google Scholar 

  6. Peters, J., Schaal, S.: Reinforcement learning of motor skills with policy gradients. Neural Networks 21, 682–697 (2008)

    Article  Google Scholar 

  7. Miyamae, A., et al.: Instance-based policy learning by real-coded genetic algorithms and its application to control of nonholonomic systems. Transactions of the Japanese Society for Artificial Intelligence 24, 104–115 (2009)

    Article  Google Scholar 

  8. Tsuchiya, C., et al.: SLIP: A sophisticated learner for instance-based policy using hybrid GA. Transactions of SICE 42, 1344–1352 (2006)

    Google Scholar 

  9. Nakamura, Y., Mukherjee, R.: Nonholonomic path planning of space robots via a bidirectional approach. IEEE Transactions on Robotics and Automation 7, 500–514 (1991)

    Article  Google Scholar 

  10. Baxter, J., Bartlett, P.L.: Infinite-horizon policy-gradient estimation. Journal of Artificial Intelligence Research 15, 319–350 (2001)

    MATH  MathSciNet  Google Scholar 

  11. Ge, X., Chen, L.: Optimal control of nonholonomic motion planning for a free-falling cat. Applied Mathematics and Mechanics 28, 601–607 (2007)

    Article  MATH  MathSciNet  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Nakano, D., Maeda, Si., Ishii, S. (2012). Control of a Free-Falling Cat by Policy-Based Reinforcement Learning. In: Villa, A.E.P., Duch, W., Érdi, P., Masulli, F., Palm, G. (eds) Artificial Neural Networks and Machine Learning – ICANN 2012. ICANN 2012. Lecture Notes in Computer Science, vol 7553. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-33266-1_15

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-33266-1_15

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-33265-4

  • Online ISBN: 978-3-642-33266-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics