[ieee 2008 ieee international conference on robotics and automation (icra) - pasadena, ca, usa...

1
Learning Capture Points for Bipedal Push Recovery John R. Rebula, Fabian Canas, Jerry E Pratt, Ambarish Goswami Researchers at IHMC and Honda Research Institute are de- veloping techniques for Learning Capture Points for Bipedal Push Recovery. A Capture Point is a point on the ground where a biped can step to in order to stop. Humans are very adept at stepping to Capture Points, while most bipedal robots cannot recover from significant pushes. To calculate approximate Capture Point locations, we use the Linear Inverted Pendulum Model introduced by Kajita and Tani. For a point mass biped walking at a constant height, this model exactly predicts the Capture Point. However, for a distributed mass biped, it is only an approximation. In order to better predict Capture Points, we learn a correction function to the Linear Inverted Pendulum Model. A Linear Inverted Pendulum maintains a conserved quantity called the orbital energy throughout the stance phase, which traces out trajectories on the phase plot. The equation for orbital energy can be used to calculate a Capture Point by calculating the current energy, predicting the velocity at the support foot transfer, and then setting the next swing orbital energy to zero and solving for the body position. We used two learning methods, one online and one offline, to improve Capture Point prediction. In the offline learning method, the robot is pushed multiple times with a given force magnitude and direction. During each learning trial, the robot starts balanced upright and stationary. Pushes, represented in the video as projectiles, are applied as a constant force for 0.05 seconds. We vary the magnitude of the force from 150 to 550 Newtons, and its direction in the horizontal plane. When the Center of Mass reaches a set horizontal distance from the support foot, the robot predicts a Capture Point and starts to step to that location. For each applied force, several simulation runs are performed, each one resulting in the robot taking a step within a grid centered around the Capture Point predicted by the Linear Inverted Pendulum Model. We find multiple capture points because the actuated feet allow for a margin of error. These points define a capture region. For failed points we calculate the center of mass position and velocity vectors at the top of swing. These are then used to calculate the residual Orbital Energy. For trials that result in successfully stopping, the residual Orbital Energy is zero. This process is repeated for five different forces at 10 different angles around the circle. For each of these pushes we calculate the centroid of the capture region and the J. Rebula, J. Pratt, and F. Canas are with the Florida Institute for Human and Machine Cognition, Pensacola, Florida 32502, USA. [email protected] A. Goswami is with the Honda Research Institute Mountain View, CA 94041, USA [email protected] offset vector from the Capture Point predicted by the Linear Inverted Pendulum model. We then fit a curve to the angle and magnitude of the offset vector as a function of the angle of the center of mass velocity. Using this corrected capture point, we find significant reduction in residual energy error when compared to using only the Linear Inverted Pendulum Model. In the online learning technique, we use a Radial Basis Function to represent the learned offsets from the Capture Point predicted by the Linear Inverted Pendulum Model. After each push, the robot predicts where to step to, steps there, and determines the residual orbital energy error. We use the residual energy to estimate the capture point. This estimate is then used to update the weights of the Radial Basis Function. After approximately 500 learning trials, the Radial Basis Function converges and the residual energy error is significantly reduced in further trials. References J. Rebula, F. Canas, J. Pratt, and A. Goswami. Learning Capture Points for Bipedal Push Recovery. Humanoids 2007 S. Kajita, F. Kanehiro, K. Kaneko, K. Yokoi, and H. Hirukawa. The 3d linear inverted pendulum mode: a simple modeling for a biped walking pattern generation. Proceed- ings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, pages 239-246, 2001. Jerry E. Pratt and Sergey V. Drakunov. Derivation and application of a conserved orbital energy for the inverted pendulum bipedal walking model. Proceedings of the IEEE International Conference on Robotics and Automation, pages 4653-4660, 2007. Jerry E. Pratt, John Carff, Sergey Drakunov, and Am- barish Goswami. Capture point: A step toward humanoid push recovery. Proceedings of the IEEE-RAS International Conference on Humanoid Robots, 2006. Russ Tedrake, Teresa Weirui Zhang, and H.Sebastian Seung. Stochastic policy gradient reinforcement learning on a simple 3d biped. Proceedings of the IEEE International Conference on Intelligent Robots and Systems, pages 2849- 2854, 2004. 2008 IEEE International Conference on Robotics and Automation Pasadena, CA, USA, May 19-23, 2008 978-1-4244-1647-9/08/$25.00 ©2008 IEEE. 1774

Upload: ambarish

Post on 24-Feb-2017

213 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: [IEEE 2008 IEEE International Conference on Robotics and Automation (ICRA) - Pasadena, CA, USA (2008.05.19-2008.05.23)] 2008 IEEE International Conference on Robotics and Automation

Learning Capture Points for Bipedal Push Recovery

John R. Rebula, Fabian Canas, Jerry E Pratt, Ambarish Goswami

Researchers at IHMC and Honda Research Institute are de-veloping techniques for Learning Capture Points for BipedalPush Recovery. A Capture Point is a point on the groundwhere a biped can step to in order to stop. Humans arevery adept at stepping to Capture Points, while most bipedalrobots cannot recover from significant pushes.

To calculate approximate Capture Point locations, we usethe Linear Inverted Pendulum Model introduced by Kajitaand Tani. For a point mass biped walking at a constant height,this model exactly predicts the Capture Point. However, fora distributed mass biped, it is only an approximation. Inorder to better predict Capture Points, we learn a correctionfunction to the Linear Inverted Pendulum Model. A LinearInverted Pendulum maintains a conserved quantity called theorbital energy throughout the stance phase, which traces outtrajectories on the phase plot. The equation for orbital energycan be used to calculate a Capture Point by calculatingthe current energy, predicting the velocity at the supportfoot transfer, and then setting the next swing orbital energyto zero and solving for the body position. We used twolearning methods, one online and one offline, to improveCapture Point prediction. In the offline learning method, therobot is pushed multiple times with a given force magnitudeand direction. During each learning trial, the robot startsbalanced upright and stationary. Pushes, represented in thevideo as projectiles, are applied as a constant force for 0.05seconds. We vary the magnitude of the force from 150 to550 Newtons, and its direction in the horizontal plane. Whenthe Center of Mass reaches a set horizontal distance fromthe support foot, the robot predicts a Capture Point andstarts to step to that location. For each applied force, severalsimulation runs are performed, each one resulting in the robottaking a step within a grid centered around the Capture Pointpredicted by the Linear Inverted Pendulum Model.

We find multiple capture points because the actuated feetallow for a margin of error. These points define a captureregion. For failed points we calculate the center of massposition and velocity vectors at the top of swing. Theseare then used to calculate the residual Orbital Energy. Fortrials that result in successfully stopping, the residual OrbitalEnergy is zero.

This process is repeated for five different forces at 10different angles around the circle. For each of these pusheswe calculate the centroid of the capture region and the

J. Rebula, J. Pratt, and F. Canas are with the Florida Institutefor Human and Machine Cognition, Pensacola, Florida 32502, [email protected]

A. Goswami is with the Honda Research Institute Mountain View, CA94041, USA [email protected]

offset vector from the Capture Point predicted by the LinearInverted Pendulum model.

We then fit a curve to the angle and magnitude of theoffset vector as a function of the angle of the center ofmass velocity. Using this corrected capture point, we findsignificant reduction in residual energy error when comparedto using only the Linear Inverted Pendulum Model.

In the online learning technique, we use a Radial BasisFunction to represent the learned offsets from the CapturePoint predicted by the Linear Inverted Pendulum Model.After each push, the robot predicts where to step to, stepsthere, and determines the residual orbital energy error. Weuse the residual energy to estimate the capture point. Thisestimate is then used to update the weights of the RadialBasis Function. After approximately 500 learning trials, theRadial Basis Function converges and the residual energyerror is significantly reduced in further trials.

ReferencesJ. Rebula, F. Canas, J. Pratt, and A. Goswami. Learning

Capture Points for Bipedal Push Recovery. Humanoids 2007S. Kajita, F. Kanehiro, K. Kaneko, K. Yokoi, and H.

Hirukawa. The 3d linear inverted pendulum mode: a simplemodeling for a biped walking pattern generation. Proceed-ings of the IEEE/RSJ International Conference on IntelligentRobots and Systems, pages 239-246, 2001.

Jerry E. Pratt and Sergey V. Drakunov. Derivation andapplication of a conserved orbital energy for the invertedpendulum bipedal walking model. Proceedings of the IEEEInternational Conference on Robotics and Automation, pages4653-4660, 2007.

Jerry E. Pratt, John Carff, Sergey Drakunov, and Am-barish Goswami. Capture point: A step toward humanoidpush recovery. Proceedings of the IEEE-RAS InternationalConference on Humanoid Robots, 2006.

Russ Tedrake, Teresa Weirui Zhang, and H.SebastianSeung. Stochastic policy gradient reinforcement learning ona simple 3d biped. Proceedings of the IEEE InternationalConference on Intelligent Robots and Systems, pages 2849-2854, 2004.

2008 IEEE International Conference onRobotics and AutomationPasadena, CA, USA, May 19-23, 2008

978-1-4244-1647-9/08/$25.00 ©2008 IEEE. 1774