Adaptation of Motor Primitives to the Environment Through Learning and Statistical Generalization

Deniša, Miha; Ude, Aleš; Gams, Andrej

doi:10.1007/978-3-319-21290-6_45

Part of the book series: Advances in Intelligent Systems and Computing (AISC, volume 371)

Abstract

In this paper we propose a method for adapting motion to the environment based on force feedback. The method combines two approaches to motor primitive adaptation. Starting from a single demonstration of motion, we use iterative learning control to adapt the motion to different conditions of the environment, for example, the height of a table. The adaptation is realized through coupling terms at the velocity level of a dynamic movement primitive and acts as a feedforward component, predetermined for the given external condition. Because adaptation to each condition takes several iterations, we combine this method with statistical generalization based on Gaussian process regression. After generating a small database of coupling terms through iterative learning, we adapt to a new condition by generalizing between the coupling terms in the database. This either directly yields an appropriate coupling term for the demonstrated trajectory or provides an initial estimate for further adaptation. Consequently, learning does not need to be executed for every condition of the environment, but only for a small set. We provide the details of the method and evaluate it in a simulated setting for the use case of placing a glass on a table.
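
The generalization step described above can be illustrated with a minimal sketch. It rests on assumptions not stated in the abstract: each coupling term is stored as a sampled time series, the environment condition is a single scalar (a table height), and a squared-exponential kernel with hand-picked hyperparameters is used. The names `se_kernel`, `gpr_predict`, `heights` and `database` are hypothetical, and the database is filled with toy data rather than with results of iterative learning control.

```python
import numpy as np

def se_kernel(a, b, length_scale=0.1, signal_var=1.0):
    """Squared-exponential kernel between two 1-D arrays of conditions."""
    d = a[:, None] - b[None, :]
    return signal_var * np.exp(-0.5 * (d / length_scale) ** 2)

def gpr_predict(q_train, C_train, q_new, noise_var=1e-6, **kernel_args):
    """Gaussian process posterior mean, computed for all time samples at once.

    q_train: (K,) conditions stored in the database
    C_train: (K, T) one sampled coupling term per condition
    q_new:   (M,) conditions to generalize to
    Returns an (M, T) array of estimated coupling terms.
    """
    mean = C_train.mean(axis=0)  # constant prior mean (an assumption)
    K = se_kernel(q_train, q_train, **kernel_args)
    K += noise_var * np.eye(len(q_train))
    K_star = se_kernel(q_new, q_train, **kernel_args)
    alpha = np.linalg.solve(K, C_train - mean)
    return mean + K_star @ alpha

# Toy database: a stand-in for coupling terms learned at five table heights.
t = np.linspace(0.0, 1.0, 50)
heights = np.array([0.00, 0.05, 0.10, 0.15, 0.20])
database = np.array([h * np.sin(np.pi * t) for h in heights])

# Estimate the coupling term for a height that is not in the database.
estimate = gpr_predict(heights, database, np.array([0.075]))[0]
```

In the setting of the abstract, such an estimate would either be used directly or serve as the starting point for a few further learning iterations.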

References

1. Dillmann, R.: Teaching and learning of robot tasks via observation of human performance. Robot. Auton. Syst. 47(2–3), 109–116 (2004)
2. Kober, J., Bagnell, D., Peters, J.: Reinforcement learning in robotics: a survey. Int. J. Robot. Res. 32(11), 1238–1274 (2013)
3. Stulp, F., Sigaud, O.: Robot skill learning: from reinforcement learning to evolution strategies. Paladyn. J. Behav. Robot. 4(1), 49–61 (2013)
4. Ijspeert, A., Nakanishi, J., Pastor, P., Hoffmann, H., Schaal, S.: Dynamical movement primitives: learning attractor models for motor behaviors. Neural Comput. 25(2), 328–373 (2013)
5. Tamosiunaite, M., Nemec, B., Ude, A., Woergoetter, F.: Learning to pour with a robot arm combining goal and shape learning for dynamic movement primitives. Robot. Auton. Syst. 59(11), 910–922 (2011)
6. Bristow, D., Tharayil, M., Alleyne, A.: A survey of iterative learning control. IEEE Control Syst. Mag. 26(3), 96–114 (2006)
7. Gams, A., Nemec, B., Ijspeert, A., Ude, A.: Coupling movement primitives: interaction with the environment and bimanual tasks. IEEE Trans. Rob. 30(4), 816–830 (2014)
8. Ude, A., Gams, A., Asfour, T., Morimoto, J.: Task-specific generalization of discrete and periodic dynamic movement primitives. IEEE Trans. Rob. 26(5), 800–815 (2010)
9. Forte, D., Gams, A., Morimoto, J., Ude, A.: On-line motion synthesis and adaptation using a trajectory database. Robot. Auton. Syst. 60(10), 1327–1339 (2012)
10. Deniša, M., Ude, A.: Discovering new motor primitives in transition graphs. In: Intelligent Autonomous Systems, vol. 12, pp. 219–230. Springer, Heidelberg (2013)
11. Pastor, P., Righetti, L., Kalakrishnan, M., Schaal, S.: Online movement adaptation based on previous sensor experiences. In: 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp. 365–371. IEEE Press, San Francisco (2011)
12. Rasmussen, C., Williams, C.: Gaussian processes for machine learning. MIT Press, Cambridge (2006)

Author information

Authors and Affiliations

Humanoid and Cognitive Robotics Lab, Department of Automatics, Biocybernetics and Robotics, Jožef Stefan Institute, Jamova 39, Ljubljana, Slovenia
Miha Deniša, Aleš Ude & Andrej Gams

Corresponding author

Correspondence to Miha Deniša.

Editor information

Editors and Affiliations

Fac of Auto Ctrl & Comp Sci (Automatica), University Politehnica of Bucharest, Bucharest, Romania
Theodor Borangiu

Appendix

Dynamic movement primitives (DMPs) have been studied extensively in robotics. For completeness, we provide a short summary based on [8]. A DMP for discrete movements is defined by the nonlinear system of differential equations

$$\tau \dot{z} = \alpha_{z} (\beta_{z} (g - y) - z) + f(x), \tag{13}$$

$$\tau \dot{y} = z. \tag{14}$$

The function $f(x)$ is defined as a linear combination of nonlinear radial basis functions

$$f(x) = \frac{\sum_{i=1}^{N} w_{i} \Psi_{i}(x)}{\sum_{i=1}^{N} \Psi_{i}(x)}\, x, \tag{15}$$

$$\Psi_{i}(x) = \exp\left( -h_{i} (x - c_{i})^{2} \right), \tag{16}$$

where $c_{i}$ are the centres of the radial basis functions distributed along the trajectory and $h_{i} > 0$ are their widths. If $\alpha_{z}, \beta_{z}, \tau > 0$ and $\alpha_{z} = 4\beta_{z}$, the linear part of the system (13)–(14) is critically damped and has a unique attractor point at $y = g$, $z = 0$. The phase variable $x$ is used in (15) and (16) to avoid a direct dependency of $f(x)$ on time. Its dynamics are defined by

$$\tau \dot{x} = -\alpha_{x} x, \tag{17}$$

with initial value $x(0) = 1$, where $\alpha_{x}$ is a positive constant. The weight vector $w$, composed of the weights $w_{i}$, defines the shape of the encoded trajectory. The learning of the weight vector is described in [8]. Multiple DOFs are realized by maintaining a separate set of (13)–(16) for each DOF, while a single canonical system given by (17) is used for synchronization.
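
To make the summary concrete, the following is a minimal sketch of integrating (13)–(17) for a single DOF. Several choices here are assumptions rather than part of the paper: simple Euler integration, the gain values `alpha_z = 48` and `alpha_x = 2`, the placement of the centres $c_{i}$ and the heuristic for the widths $h_{i}$. The function name `integrate_dmp` is hypothetical.

```python
import numpy as np

def integrate_dmp(w, c, h, y0, g, tau=1.0, alpha_z=48.0, alpha_x=2.0,
                  dt=0.001, duration=2.0):
    """Euler integration of one DMP; returns the sampled positions y."""
    beta_z = alpha_z / 4.0           # alpha_z = 4 * beta_z: critically damped
    y, z, x = float(y0), 0.0, 1.0    # start at rest, phase x(0) = 1
    ys = [y]
    for _ in range(int(round(duration / dt))):
        psi = np.exp(-h * (x - c) ** 2)                          # (16)
        f = (psi @ w) / (psi.sum() + 1e-12) * x                  # (15)
        z += dt / tau * (alpha_z * (beta_z * (g - y) - z) + f)   # (13)
        y += dt / tau * z                                        # (14), uses updated z
        x += dt / tau * (-alpha_x * x)                           # (17)
        ys.append(y)
    return np.array(ys)

# Assumed basis layout: centres follow the decay of the phase variable,
# widths are set from the spacing between neighbouring centres.
N = 10
c = np.exp(-2.0 * np.linspace(0.0, 1.0, N))
d = np.diff(c)
h = 1.0 / np.append(d, d[-1]) ** 2

# With zero weights only the linear part acts and y moves from y0 to g.
plain = integrate_dmp(np.zeros(N), c, h, y0=0.0, g=1.0)

# Non-zero weights reshape the path; f(x) fades as x decays, so y still
# settles near g.
shaped = integrate_dmp(np.full(N, 100.0), c, h, y0=0.0, g=1.0)
```

Because $f(x)$ is multiplied by the decaying phase $x$, its influence vanishes towards the end of the movement, which is why both trajectories approach the attractor point $y = g$.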

About this paper

Cite this paper

Deniša, M., Ude, A., Gams, A. (2016). Adaptation of Motor Primitives to the Environment Through Learning and Statistical Generalization. In: Borangiu, T. (eds) Advances in Robot Design and Intelligent Control. Advances in Intelligent Systems and Computing, vol 371. Springer, Cham. https://doi.org/10.1007/978-3-319-21290-6_45

DOI: https://doi.org/10.1007/978-3-319-21290-6_45
Published: 08 August 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-21289-0
Online ISBN: 978-3-319-21290-6
eBook Packages: Engineering, Engineering (R0)
