Abstract
In this paper we propose a method of adapting motion to the environment based on force feedback. Our method combines two approaches of motor primitive adaptation. Starting from a single demonstration of motion, we use iterative learning control to adapt the motion to different conditions of the environment, for example, the height of the table. The adaptation is realized through coupling terms at the velocity level of a dynamic movement primitive, and acts as a feedforward component, predetermined for the given external condition. As adaptation to each condition takes several iterations, we combine this method with statistical generalization, employing Gaussian process regression. By generating a small database of coupling terms through iterative learning, we adapt to the environment by generalizing between the coupling terms in the database, thus either already achieving an appropriate coupling term for our demonstration trajectory or providing an initial estimate for the adaptation. Consequently, the learning doesn’t need to be executed for every condition of the environment, but only for a small set. In the paper we provide the details of the method and evaluate it in a simulated setting for the use case of placing a glass on a table.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Dillmann, R.: Teaching and learning of robot tasks via observation of human performance. Robot. Auton. Syst. 47(2–3), 109–116 (2004)
Kober, J., Bagnell, D., Peters, J.: Reinforcement learning in robotics: a survey. Int. J. Robot. Res. 32(11), 1238–1274 (2013)
Stulp, F., Sigaud, O.: Robot skill learning: from reinforcement learning to evolution strategies. Paladyn. J. Behav. Robot. 4(1), 49–61 (2013)
Ijspeert, A., Nakanishi, J., Pastor, P., Hoffmann, H., Schaal, S.: Dynamical movement primitives: learning attractor models for motor behaviors. Neural Comput. 25(2), 328–373 (2013)
Tamosiunaite, M., Nemec, B., Ude, A., Woergoetter, F.: Learning to pour with a robot arm combining goal and shape learning for dynamic movement primitives. Robot. Auton. Syst. 59(11), 910–922 (2011)
Bristow, D., Tharayil, M., Alleyne, A.: A survey of iterative learning control. IEEE Control Syst. Mag. 26(3), 96–114 (2006)
Gams, A., Nemec, B., Ijspeert, A., Ude, A.: Coupling movement primitives: interaction with the environment and bimanual tasks. IEEE Trans. Rob. 30(4), 816–830 (2014)
Ude, A., Gams, A., Asfour, T., Morimoto, J.: Task-specific generalization of discrete and periodic dynamic movement primitives. IEEE Trans. Rob. 26(5), 800–815 (2010)
Forte, D., Gams, A., Morimoto, J., Ude, A.: On-line motion synthesis and adaptation using a trajectory database. Robot. Auton. Syst. 60(10), 1327–1339 (2012)
Deniša, M., Ude, A.: Discovering new motor primitives in transition graphs. In: Intelligent Autonomous Systems, vol. 12, pp. 219–230. Springer, Heidelberg (2013)
Pastor, P., Righetti, L., Kalakrishnan, M., Schaal, S.: Online movement adaptation based on previous sensor experiences. In: 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp. 365–371. IEEE Press, San Francisco (2011)
Rasmussen, C., Williams, C.: Gaussian processes for machine learning. MIT Press, Cambridge (2006)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Appendix
Appendix
Dynamic movement primitives have been extensively studied in robotics. For the completeness of the paper we provide a short summary based on [8]. A nonlinear system of differential equations defines DMP for discrete movements
f(x) is defined as a linear combination of nonlinear radial basis functions
where c i are the centres of radial basis functions distributed along the trajectory and h_i > 0 their widths. If \(\alpha_{\text{z}} ,\,\upbeta_{\text{z}} ,\,\,\uptau > 0,\upalpha_{\text{z}} = 4\upbeta_{\text{z}}\) the linear part of the system (13) and (14) is critically damped and has a unique attractor point at y = g, z = 0. A phase variable x is used in (15) and (16). It is utilized to avoid direct dependency of f(x) on time. Its dynamics is defined by
with initial value at x(0) = 1. α x is a positive constant. The weight vector w composed of w i defines the shape of the encoded trajectory. The learning of the weight vector is described in [8]. Multiple DOFs are realized by maintaining separate sets of (13)–(16), while a single canonical system given by (17) is used for synchronization.
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Deniša, M., Ude, A., Gams, A. (2016). Adaptation of Motor Primitives to the Environment Through Learning and Statistical Generalization. In: Borangiu, T. (eds) Advances in Robot Design and Intelligent Control. Advances in Intelligent Systems and Computing, vol 371. Springer, Cham. https://doi.org/10.1007/978-3-319-21290-6_45
Download citation
DOI: https://doi.org/10.1007/978-3-319-21290-6_45
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-21289-0
Online ISBN: 978-3-319-21290-6
eBook Packages: EngineeringEngineering (R0)