16
Inverse Dynamics Daniel Hennes Maastricht University December 21, 2010

Inverse Dynamics - ROS.org · Learning Inverse Dynamics. Applications of Inverse Dynamics. • Feed forward control • Optimal control (ILQR) D. Mitrovic, S. Klanke, and S. Vijayakumar

  • Upload
    others

  • View
    39

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Inverse Dynamics - ROS.org · Learning Inverse Dynamics. Applications of Inverse Dynamics. • Feed forward control • Optimal control (ILQR) D. Mitrovic, S. Klanke, and S. Vijayakumar

Inverse Dynamics Daniel Hennes Maastricht University December 21, 2010

Page 2: Inverse Dynamics - ROS.org · Learning Inverse Dynamics. Applications of Inverse Dynamics. • Feed forward control • Optimal control (ILQR) D. Mitrovic, S. Klanke, and S. Vijayakumar

Outline •  Learning Dynamic Tasks •  Gaussian Process Regression •  Learning Inverse Dynamics •  Packages • What’s next?

Page 3: Inverse Dynamics - ROS.org · Learning Inverse Dynamics. Applications of Inverse Dynamics. • Feed forward control • Optimal control (ILQR) D. Mitrovic, S. Klanke, and S. Vijayakumar

Learning Dynamic Tasks •  Policy Gradient RL •  Pole balancing and

pendulum swing-up •  Works in simulation •  Problems during

transition to PR2 –  Image pipeline delays – Arm dynamics –  ...

Page 4: Inverse Dynamics - ROS.org · Learning Inverse Dynamics. Applications of Inverse Dynamics. • Feed forward control • Optimal control (ILQR) D. Mitrovic, S. Klanke, and S. Vijayakumar

Learning Inverse Dynamics

Page 5: Inverse Dynamics - ROS.org · Learning Inverse Dynamics. Applications of Inverse Dynamics. • Feed forward control • Optimal control (ILQR) D. Mitrovic, S. Klanke, and S. Vijayakumar

Applications of Inverse Dynamics •  Feed forward control • Optimal control (ILQR)

D. Mitrovic, S. Klanke, and S. Vijayakumar. Adaptive Optimal Feedback Control with Learned Internal Dynamics Model. 2010.

•  Collision detection A. De Luca et al. Collision Detection and Safe Reaction with the DLR-III

Lightweight Manipulator Arm. 2006 (- 2009).

•  Haptic feedback

Page 6: Inverse Dynamics - ROS.org · Learning Inverse Dynamics. Applications of Inverse Dynamics. • Feed forward control • Optimal control (ILQR) D. Mitrovic, S. Klanke, and S. Vijayakumar

Gaussian process regression •  How does it work?

1.  Provide training data 2.  Select kernal function (easy: Gaussian kernel)

3.  Tune hyperparameters (easy: Quasi-Newton minimization)

4.  Inference (complexity O(n3))

5.  Prediciton (complexity O(n))

Page 7: Inverse Dynamics - ROS.org · Learning Inverse Dynamics. Applications of Inverse Dynamics. • Feed forward control • Optimal control (ILQR) D. Mitrovic, S. Klanke, and S. Vijayakumar

Motion data

Page 8: Inverse Dynamics - ROS.org · Learning Inverse Dynamics. Applications of Inverse Dynamics. • Feed forward control • Optimal control (ILQR) D. Mitrovic, S. Klanke, and S. Vijayakumar

Avg. nMSE performance (GPR)

0

0.05

0.1

0.15

0.2

0.25

circle figure8 reaching sinoid

shoulder_pan

shoulder_li8

upper_arm_roll

elbow_flex

forearm_roll

wrist_flex

wrist_roll

Page 9: Inverse Dynamics - ROS.org · Learning Inverse Dynamics. Applications of Inverse Dynamics. • Feed forward control • Optimal control (ILQR) D. Mitrovic, S. Klanke, and S. Vijayakumar

nMSE performance (sinoid motion)

0

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

GPR IDM

shoulder_pan

shoulder_li8

upper_arm_roll

elbow_flex

forearm_roll

wrist_flex

wrist_roll

Page 10: Inverse Dynamics - ROS.org · Learning Inverse Dynamics. Applications of Inverse Dynamics. • Feed forward control • Optimal control (ILQR) D. Mitrovic, S. Klanke, and S. Vijayakumar

GPR interpolation

Page 11: Inverse Dynamics - ROS.org · Learning Inverse Dynamics. Applications of Inverse Dynamics. • Feed forward control • Optimal control (ILQR) D. Mitrovic, S. Klanke, and S. Vijayakumar

GPR extrapolation

Page 12: Inverse Dynamics - ROS.org · Learning Inverse Dynamics. Applications of Inverse Dynamics. • Feed forward control • Optimal control (ILQR) D. Mitrovic, S. Klanke, and S. Vijayakumar

Packages •  : : inverse_dynamics – GPR model – Newton-Euler model (KDL or WBC) – Motion generation + other tools

•  : : dynamics_markers – Joint torque visualization in rviz

Page 13: Inverse Dynamics - ROS.org · Learning Inverse Dynamics. Applications of Inverse Dynamics. • Feed forward control • Optimal control (ILQR) D. Mitrovic, S. Klanke, and S. Vijayakumar

: : dynamics_markers

Page 14: Inverse Dynamics - ROS.org · Learning Inverse Dynamics. Applications of Inverse Dynamics. • Feed forward control • Optimal control (ILQR) D. Mitrovic, S. Klanke, and S. Vijayakumar

Conclusion •  GPR model – extrapolation is limited – suffers from curse of dimensionality – can still be used for repetitive tasks

•  IDM – performance strongly depends on clean signals

Page 15: Inverse Dynamics - ROS.org · Learning Inverse Dynamics. Applications of Inverse Dynamics. • Feed forward control • Optimal control (ILQR) D. Mitrovic, S. Klanke, and S. Vijayakumar

What’s next? •  Future work

– NN or SVR extrapolation performance – Predicting residuals – Dimensionality reduction

•  NLPCA, elastic maps – Online learning

•  Local GPR •  Projection based sampling

–  IDM Identification •  a lot to improve, e.g. modeling friction, estimating inertia

parameters

Page 16: Inverse Dynamics - ROS.org · Learning Inverse Dynamics. Applications of Inverse Dynamics. • Feed forward control • Optimal control (ILQR) D. Mitrovic, S. Klanke, and S. Vijayakumar