projection operator strategies for trajectory optimization: application...

42
1 / 34 Projection Operator Strategies for Trajectory Optimization: Application to Cooperative Planning John Hauser CU Boulder

Upload: others

Post on 12-Mar-2020

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Projection Operator Strategies for Trajectory Optimization: Application ...users.isr.ist.utl.pt/~pedro/ifac2011workshop/Hauser.pdf · Projection Operator Strategies for Trajectory

1 / 34

Projection Operator Strategies for Trajectory Optimizati on:Application to Cooperative Planning

John HauserCU Boulder

Page 2: Projection Operator Strategies for Trajectory Optimization: Application ...users.isr.ist.utl.pt/~pedro/ifac2011workshop/Hauser.pdf · Projection Operator Strategies for Trajectory

Introduction

Introduction❖ Minimization ofTrajectory Functionals

❖ Why do TrajectoryOptimization?

The Projection Operatorapproach to TrajectoryFunctional Minimization

A Sampling of SystemInvestigations

Positive Definiteness

Constraints

2 / 34

Page 3: Projection Operator Strategies for Trajectory Optimization: Application ...users.isr.ist.utl.pt/~pedro/ifac2011workshop/Hauser.pdf · Projection Operator Strategies for Trajectory

Minimization of Trajectory Functionals

Introduction❖ Minimization ofTrajectory Functionals

❖ Why do TrajectoryOptimization?

The Projection Operatorapproach to TrajectoryFunctional Minimization

A Sampling of SystemInvestigations

Positive Definiteness

Constraints

3 / 34

Consider the problem of minimizing a functional

h(x(·), u(·)) :=

∫ T

0

l(x(τ), u(τ), τ) dτ +m(x(T ))

over the set T of bounded trajectories of the nonlinear system

x = f(x, u) , x(0) = x0

Here, x(t) ∈ Rn and u(t) ∈ R

m.

Page 4: Projection Operator Strategies for Trajectory Optimization: Application ...users.isr.ist.utl.pt/~pedro/ifac2011workshop/Hauser.pdf · Projection Operator Strategies for Trajectory

Minimization of Trajectory Functionals

Introduction❖ Minimization ofTrajectory Functionals

❖ Why do TrajectoryOptimization?

The Projection Operatorapproach to TrajectoryFunctional Minimization

A Sampling of SystemInvestigations

Positive Definiteness

Constraints

3 / 34

Consider the problem of minimizing a functional

h(x(·), u(·)) :=

∫ T

0

l(x(τ), u(τ), τ) dτ +m(x(T ))

over the set T of bounded trajectories of the nonlinear system

x = f(x, u) , x(0) = x0

Here, x(t) ∈ Rn and u(t) ∈ R

m.

We write this constrained problem as

minξ∈T

h(ξ)

where• ξ = (α(·), µ(·)) is a bounded curve with α(·) continuous• ξ ∈ T means α(t) = f(α(t), u(t)) and α(0) = x0

Page 5: Projection Operator Strategies for Trajectory Optimization: Application ...users.isr.ist.utl.pt/~pedro/ifac2011workshop/Hauser.pdf · Projection Operator Strategies for Trajectory

Why do Trajectory Optimization?

Introduction❖ Minimization ofTrajectory Functionals

❖ Why do TrajectoryOptimization?

The Projection Operatorapproach to TrajectoryFunctional Minimization

A Sampling of SystemInvestigations

Positive Definiteness

Constraints

4 / 34

Well known:● Optimal control may be used to provide stabilization, tracking, etc., for

nonlinear systems(trajs are characteristics of HJB soln)

● Model predictive/receding horizon strategies have been usedsuccessful for a number of nonlinear systems with constraints

Page 6: Projection Operator Strategies for Trajectory Optimization: Application ...users.isr.ist.utl.pt/~pedro/ifac2011workshop/Hauser.pdf · Projection Operator Strategies for Trajectory

Why do Trajectory Optimization?

Introduction❖ Minimization ofTrajectory Functionals

❖ Why do TrajectoryOptimization?

The Projection Operatorapproach to TrajectoryFunctional Minimization

A Sampling of SystemInvestigations

Positive Definiteness

Constraints

4 / 34

Well known:● Optimal control may be used to provide stabilization, tracking, etc., for

nonlinear systems(trajs are characteristics of HJB soln)

● Model predictive/receding horizon strategies have been usedsuccessful for a number of nonlinear systems with constraints

Also:● Trajectory exploration : What cool stuff can this system do?

✦ capabilities✦ limitations

● Trajectory modeling : Can the trajectories of a (complex) system bemodeled by those of a simpler system?

● Systems analysis : investigate system structure, e.g., controllability

Page 7: Projection Operator Strategies for Trajectory Optimization: Application ...users.isr.ist.utl.pt/~pedro/ifac2011workshop/Hauser.pdf · Projection Operator Strategies for Trajectory

The Projection Operator approach to TrajectoryFunctional Minimization

Introduction

The Projection Operatorapproach to TrajectoryFunctional Minimization❖ Unconstrained (?)Optimal Control

❖ Projection OperatorApproach

❖ Projection Operator

❖ Projection OperatorProperties

❖ Trajectory Manifold

❖ EquivalentOptimization Problems

❖ Projection operatorNewton method

A Sampling of SystemInvestigations

Positive Definiteness

Constraints

5 / 34

Page 8: Projection Operator Strategies for Trajectory Optimization: Application ...users.isr.ist.utl.pt/~pedro/ifac2011workshop/Hauser.pdf · Projection Operator Strategies for Trajectory

Unconstrained (?) Optimal Control

Introduction

The Projection Operatorapproach to TrajectoryFunctional Minimization❖ Unconstrained (?)Optimal Control

❖ Projection OperatorApproach

❖ Projection Operator

❖ Projection OperatorProperties

❖ Trajectory Manifold

❖ EquivalentOptimization Problems

❖ Projection operatorNewton method

A Sampling of SystemInvestigations

Positive Definiteness

Constraints

6 / 34

● The choice of a control trajectory u(·) determines the state trajectoryx(·) (recall that x0 has been specified).

● With such a trajectory parametrization , one obtains so-calledunconstrained optimal control problem

minu(·)

h(x(·;x0, u(·)), u(·))

● Why not just search over control trajectories u(·)?If the system described by f is sufficiently stable, then such a shootingmethod may be effective.

● Unfortunately, the modulus of continuity of the mapu(·) 7→ (x(·), u(·)) is often so large that such shooting iscomputationally useless :

small changes in u(·) may give LARGE changes in x(·)

Page 9: Projection Operator Strategies for Trajectory Optimization: Application ...users.isr.ist.utl.pt/~pedro/ifac2011workshop/Hauser.pdf · Projection Operator Strategies for Trajectory

Projection Operator Approach

Introduction

The Projection Operatorapproach to TrajectoryFunctional Minimization❖ Unconstrained (?)Optimal Control

❖ Projection OperatorApproach

❖ Projection Operator

❖ Projection OperatorProperties

❖ Trajectory Manifold

❖ EquivalentOptimization Problems

❖ Projection operatorNewton method

A Sampling of SystemInvestigations

Positive Definiteness

Constraints

7 / 34

Key idea: a trajectory tracking controller may be used to minimize theeffects of system instabilities, providing anumerically effective , redundant trajectory parametrization .

● Let ξ(t) = (α(t), µ(t)), t ≥ 0, be a bounded curve● Let η(t) = (x(t), u(t)), t ≥ 0, be the trajectory of f determined by

the nonlinear feedback system

x = f(x, u), x(0) = x0,

u = µ(t) +K(t)(α(t)− x) .

● The map

P : ξ = (α(·), µ(·)) 7→ η = (x(·), u(·))

is a continuous, Nonlinear Projection Operator .

Page 10: Projection Operator Strategies for Trajectory Optimization: Application ...users.isr.ist.utl.pt/~pedro/ifac2011workshop/Hauser.pdf · Projection Operator Strategies for Trajectory

Projection Operator Approach

Introduction

The Projection Operatorapproach to TrajectoryFunctional Minimization❖ Unconstrained (?)Optimal Control

❖ Projection OperatorApproach

❖ Projection Operator

❖ Projection OperatorProperties

❖ Trajectory Manifold

❖ EquivalentOptimization Problems

❖ Projection operatorNewton method

A Sampling of SystemInvestigations

Positive Definiteness

Constraints

7 / 34

Key idea: a trajectory tracking controller may be used to minimize theeffects of system instabilities, providing anumerically effective , redundant trajectory parametrization .

● Let ξ(t) = (α(t), µ(t)), t ≥ 0, be a bounded curve● Let η(t) = (x(t), u(t)), t ≥ 0, be the trajectory of f determined by

the nonlinear feedback system

x = f(x, u), x(0) = x0,

u = µ(t) +K(t)(α(t)− x) .

● The map

P : ξ = (α(·), µ(·)) 7→ η = (x(·), u(·))

is a continuous, Nonlinear Projection Operator .● For each ξ ∈ domP, the curve η = P(ξ) is a trajectory.

Note: the trajectory contains both state and control curves.

Page 11: Projection Operator Strategies for Trajectory Optimization: Application ...users.isr.ist.utl.pt/~pedro/ifac2011workshop/Hauser.pdf · Projection Operator Strategies for Trajectory

Projection Operator

Introduction

The Projection Operatorapproach to TrajectoryFunctional Minimization❖ Unconstrained (?)Optimal Control

❖ Projection OperatorApproach

❖ Projection Operator

❖ Projection OperatorProperties

❖ Trajectory Manifold

❖ EquivalentOptimization Problems

❖ Projection operatorNewton method

A Sampling of SystemInvestigations

Positive Definiteness

Constraints

8 / 34

η = P(ξ)

ξ

Page 12: Projection Operator Strategies for Trajectory Optimization: Application ...users.isr.ist.utl.pt/~pedro/ifac2011workshop/Hauser.pdf · Projection Operator Strategies for Trajectory

Projection Operator Properties

Introduction

The Projection Operatorapproach to TrajectoryFunctional Minimization❖ Unconstrained (?)Optimal Control

❖ Projection OperatorApproach

❖ Projection Operator

❖ Projection OperatorProperties

❖ Trajectory Manifold

❖ EquivalentOptimization Problems

❖ Projection operatorNewton method

A Sampling of SystemInvestigations

Positive Definiteness

Constraints

9 / 34

Suppose that f is Cr and that K is bounded andexponentially stabilizes ξ0 ∈ T . Then

● P is well defined on an L∞ neighborhood of ξ0● P is Cr (Frechet diff wrt L∞ norm)● ξ ∈ T if and only if ξ = P(ξ)● P = P ◦ P (projection )

Page 13: Projection Operator Strategies for Trajectory Optimization: Application ...users.isr.ist.utl.pt/~pedro/ifac2011workshop/Hauser.pdf · Projection Operator Strategies for Trajectory

Projection Operator Properties

Introduction

The Projection Operatorapproach to TrajectoryFunctional Minimization❖ Unconstrained (?)Optimal Control

❖ Projection OperatorApproach

❖ Projection Operator

❖ Projection OperatorProperties

❖ Trajectory Manifold

❖ EquivalentOptimization Problems

❖ Projection operatorNewton method

A Sampling of SystemInvestigations

Positive Definiteness

Constraints

9 / 34

Suppose that f is Cr and that K is bounded andexponentially stabilizes ξ0 ∈ T . Then

● P is well defined on an L∞ neighborhood of ξ0● P is Cr (Frechet diff wrt L∞ norm)● ξ ∈ T if and only if ξ = P(ξ)● P = P ◦ P (projection )

● On the finite interval [0, T ], choose K(·) to obtain stability-likeproperties so that the modulus of continuity of P is relatively small .

● On the infinite horizon, instabilities must be stabilized in order toobtain a projection operator; consider x = x+ u.

Page 14: Projection Operator Strategies for Trajectory Optimization: Application ...users.isr.ist.utl.pt/~pedro/ifac2011workshop/Hauser.pdf · Projection Operator Strategies for Trajectory

Trajectory Manifold

Introduction

The Projection Operatorapproach to TrajectoryFunctional Minimization❖ Unconstrained (?)Optimal Control

❖ Projection OperatorApproach

❖ Projection Operator

❖ Projection OperatorProperties

❖ Trajectory Manifold

❖ EquivalentOptimization Problems

❖ Projection operatorNewton method

A Sampling of SystemInvestigations

Positive Definiteness

Constraints

10 / 34

ξ

ξ+ζ

η = P(ξ+ζ)

Theorem: T is a Banach manifold : Every η ∈ T near ξ ∈ T can beuniquely represented as

η = P(ξ + ζ), ζ ∈ TξT

Key: the projection operator DP(ξ) provides the required subspacesplitting . Note: ζ ∈ TξT if and only if ζ = DP(ξ) · ζ

Page 15: Projection Operator Strategies for Trajectory Optimization: Application ...users.isr.ist.utl.pt/~pedro/ifac2011workshop/Hauser.pdf · Projection Operator Strategies for Trajectory

Equivalent Optimization Problems

Introduction

The Projection Operatorapproach to TrajectoryFunctional Minimization❖ Unconstrained (?)Optimal Control

❖ Projection OperatorApproach

❖ Projection Operator

❖ Projection OperatorProperties

❖ Trajectory Manifold

❖ EquivalentOptimization Problems

❖ Projection operatorNewton method

A Sampling of SystemInvestigations

Positive Definiteness

Constraints

11 / 34

Using the projection operator , we see that

minξ∈T

h(ξ) = minξ=P(ξ)

h(ξ)

where

h(x(·), u(·)) =

∫ T

0

l(x(τ), u(τ), τ) dτ +m(x(T ))

Furthermore, definingg(ξ) = h(P(ξ))

for ξ ∈ U with P(U) ⊂ U ⊂ domP, we see that

minξ∈T

h(ξ)

︸ ︷︷ ︸

constrained

and minξ∈U

g(ξ)

︸ ︷︷ ︸

unconstrained

are equivalent in the sense that

● if ξ∗ ∈ T ∩ U is a constrained local minimum of h,then it is an unconstrained local minimum of g;

● if ξ+ ∈ U is an unconstrained local minimum of g in U ,then ξ∗ = P(ξ+) is a constrained local minimum of h.

Page 16: Projection Operator Strategies for Trajectory Optimization: Application ...users.isr.ist.utl.pt/~pedro/ifac2011workshop/Hauser.pdf · Projection Operator Strategies for Trajectory

Projection operator Newton method

Introduction

The Projection Operatorapproach to TrajectoryFunctional Minimization❖ Unconstrained (?)Optimal Control

❖ Projection OperatorApproach

❖ Projection Operator

❖ Projection OperatorProperties

❖ Trajectory Manifold

❖ EquivalentOptimization Problems

❖ Projection operatorNewton method

A Sampling of SystemInvestigations

Positive Definiteness

Constraints

12 / 34

given initial trajectory ξ0 ∈ T

for i = 0, 1, 2, . . .

redesign feedback K(·) if desired/needed

descent direction ζi = arg minζ∈Tξi

TDh(ξi)·ζ +

12D2g(ξi)·(ζ, ζ) (LQ)

line search γi = arg minγ∈(0,1]

h(P(ξi + γζi))

update ξi+1 = P(ξi + γiζi)

end

Page 17: Projection Operator Strategies for Trajectory Optimization: Application ...users.isr.ist.utl.pt/~pedro/ifac2011workshop/Hauser.pdf · Projection Operator Strategies for Trajectory

Projection operator Newton method

Introduction

The Projection Operatorapproach to TrajectoryFunctional Minimization❖ Unconstrained (?)Optimal Control

❖ Projection OperatorApproach

❖ Projection Operator

❖ Projection OperatorProperties

❖ Trajectory Manifold

❖ EquivalentOptimization Problems

❖ Projection operatorNewton method

A Sampling of SystemInvestigations

Positive Definiteness

Constraints

12 / 34

given initial trajectory ξ0 ∈ T

for i = 0, 1, 2, . . .

redesign feedback K(·) if desired/needed

descent direction ζi = arg minζ∈Tξi

TDh(ξi)·ζ +

12D2g(ξi)·(ζ, ζ) (LQ)

line search γi = arg minγ∈(0,1]

h(P(ξi + γζi))

update ξi+1 = P(ξi + γiζi)

end

This direct method generates a descending trajectory sequence inBanach space , with quadratic convergence to SSC local minimizers.

Page 18: Projection Operator Strategies for Trajectory Optimization: Application ...users.isr.ist.utl.pt/~pedro/ifac2011workshop/Hauser.pdf · Projection Operator Strategies for Trajectory

Projection operator Newton method

Introduction

The Projection Operatorapproach to TrajectoryFunctional Minimization❖ Unconstrained (?)Optimal Control

❖ Projection OperatorApproach

❖ Projection Operator

❖ Projection OperatorProperties

❖ Trajectory Manifold

❖ EquivalentOptimization Problems

❖ Projection operatorNewton method

A Sampling of SystemInvestigations

Positive Definiteness

Constraints

12 / 34

given initial trajectory ξ0 ∈ T

for i = 0, 1, 2, . . .

redesign feedback K(·) if desired/needed

descent direction ζi = arg minζ∈Tξi

TDh(ξi)·ζ +

12D2g(ξi)·(ζ, ζ) (LQ)

line search γi = arg minγ∈(0,1]

h(P(ξi + γζi))

update ξi+1 = P(ξi + γiζi)

end

This direct method generates a descending trajectory sequence inBanach space , with quadratic convergence to SSC local minimizers.

When D2g(ξi) is not positive definite on TξiT , one may obtain aquasi-Newton descent direction by solving

ζi = arg minζ∈Tξi

TDh(ξi)·ζ +

12q(ξi) · (ζ, ζ)

where q(ξi) is positive definite on TξiT (e.g., an approximation to D2g(ξi))

Page 19: Projection Operator Strategies for Trajectory Optimization: Application ...users.isr.ist.utl.pt/~pedro/ifac2011workshop/Hauser.pdf · Projection Operator Strategies for Trajectory

A Sampling of System Investigations

Introduction

The Projection Operatorapproach to TrajectoryFunctional Minimization

A Sampling of SystemInvestigations

❖ Inverted Pendulum❖ Heisenberg system -Brockett’s nonholonomicintegrator

❖ Raceline optimization

❖ Collision freemaneuvering

❖ PVTOL❖ Optimal control on Liegroups

❖ Submarinemaneuvering

Positive Definiteness

Constraints

13 / 34

Page 20: Projection Operator Strategies for Trajectory Optimization: Application ...users.isr.ist.utl.pt/~pedro/ifac2011workshop/Hauser.pdf · Projection Operator Strategies for Trajectory

Inverted Pendulum

Introduction

The Projection Operatorapproach to TrajectoryFunctional Minimization

A Sampling of SystemInvestigations

❖ Inverted Pendulum❖ Heisenberg system -Brockett’s nonholonomicintegrator

❖ Raceline optimization

❖ Collision freemaneuvering

❖ PVTOL❖ Optimal control on Liegroups

❖ Submarinemaneuvering

Positive Definiteness

Constraints

14 / 34

al t

L2 trajectory exploration: minimize

h(ξ) =

∫ T

0

‖x(τ)−xd(τ)‖2Q/2+‖u(τ)−ud(τ)‖

2R/2 dτ+‖x(T )−xd(T )‖

2P1

/2

over trajectories ξ = (x(·), u(·)) ∈ T .

Driven inverted pendulum system is

ϕ = (g/l) sinϕ− (1/l) cosϕ u

where the control u is taken to be the pivot point lateral acceleration al.Extensions to pendubot.

Page 21: Projection Operator Strategies for Trajectory Optimization: Application ...users.isr.ist.utl.pt/~pedro/ifac2011workshop/Hauser.pdf · Projection Operator Strategies for Trajectory

Heisenberg system - Brockett’s nonholonomic integrator

Introduction

The Projection Operatorapproach to TrajectoryFunctional Minimization

A Sampling of SystemInvestigations

❖ Inverted Pendulum❖ Heisenberg system -Brockett’s nonholonomicintegrator

❖ Raceline optimization

❖ Collision freemaneuvering

❖ PVTOL❖ Optimal control on Liegroups

❖ Submarinemaneuvering

Positive Definiteness

Constraints

15 / 34

min

∫ 1

0

‖u(τ)‖2/2 dτ + ‖x(T )‖2P1/2

x1 = u1

x2 = u2

x3 = x2u1 − x1u2

P1 = diag([10 10 100])

Page 22: Projection Operator Strategies for Trajectory Optimization: Application ...users.isr.ist.utl.pt/~pedro/ifac2011workshop/Hauser.pdf · Projection Operator Strategies for Trajectory

Raceline optimization

Introduction

The Projection Operatorapproach to TrajectoryFunctional Minimization

A Sampling of SystemInvestigations

❖ Inverted Pendulum❖ Heisenberg system -Brockett’s nonholonomicintegrator

❖ Raceline optimization

❖ Collision freemaneuvering

❖ PVTOL❖ Optimal control on Liegroups

❖ Submarinemaneuvering

Positive Definiteness

Constraints

16 / 34

−150 −100 −50 0 50 100 150−80

−60

−40

−20

0

20

40

60

80track

−10

−8

−6

−4

−2

0

2

4

6

8

10

Page 23: Projection Operator Strategies for Trajectory Optimization: Application ...users.isr.ist.utl.pt/~pedro/ifac2011workshop/Hauser.pdf · Projection Operator Strategies for Trajectory

Collision free maneuvering

Introduction

The Projection Operatorapproach to TrajectoryFunctional Minimization

A Sampling of SystemInvestigations

❖ Inverted Pendulum❖ Heisenberg system -Brockett’s nonholonomicintegrator

❖ Raceline optimization

❖ Collision freemaneuvering

❖ PVTOL❖ Optimal control on Liegroups

❖ Submarinemaneuvering

Positive Definiteness

Constraints

17 / 34

Page 24: Projection Operator Strategies for Trajectory Optimization: Application ...users.isr.ist.utl.pt/~pedro/ifac2011workshop/Hauser.pdf · Projection Operator Strategies for Trajectory

PVTOL

Introduction

The Projection Operatorapproach to TrajectoryFunctional Minimization

A Sampling of SystemInvestigations

❖ Inverted Pendulum❖ Heisenberg system -Brockett’s nonholonomicintegrator

❖ Raceline optimization

❖ Collision freemaneuvering

❖ PVTOL❖ Optimal control on Liegroups

❖ Submarinemaneuvering

Positive Definiteness

Constraints

18 / 34

0 5 10 15 20 25 30 35

0

5

10

15

20

desired vs feasible path

desiredρ = 4ρ = 1

y = u1 sinϕ− ǫu2 cosϕz = −u1 cosϕ− ǫu2 sinϕ+ gϕ = u2.

Page 25: Projection Operator Strategies for Trajectory Optimization: Application ...users.isr.ist.utl.pt/~pedro/ifac2011workshop/Hauser.pdf · Projection Operator Strategies for Trajectory

Optimal control on Lie groups

Introduction

The Projection Operatorapproach to TrajectoryFunctional Minimization

A Sampling of SystemInvestigations

❖ Inverted Pendulum❖ Heisenberg system -Brockett’s nonholonomicintegrator

❖ Raceline optimization

❖ Collision freemaneuvering

❖ PVTOL❖ Optimal control on Liegroups

❖ Submarinemaneuvering

Positive Definiteness

Constraints

19 / 34

x17 x18 x19 x20 x21 x22 x23 x24 x25 x26 x27

to exploit the structure of Lie groups and retain second order properties, anew second order geometric derivative was developed

applicable to quantum spin systems and many classical mechanicalsystems including spacecraft (and aircraft)

Page 26: Projection Operator Strategies for Trajectory Optimization: Application ...users.isr.ist.utl.pt/~pedro/ifac2011workshop/Hauser.pdf · Projection Operator Strategies for Trajectory

Submarine maneuvering

Introduction

The Projection Operatorapproach to TrajectoryFunctional Minimization

A Sampling of SystemInvestigations

❖ Inverted Pendulum❖ Heisenberg system -Brockett’s nonholonomicintegrator

❖ Raceline optimization

❖ Collision freemaneuvering

❖ PVTOL❖ Optimal control on Liegroups

❖ Submarinemaneuvering

Positive Definiteness

Constraints

20 / 34

turn to turn maneuverscritical velocity (loss of linear controllability)

Page 27: Projection Operator Strategies for Trajectory Optimization: Application ...users.isr.ist.utl.pt/~pedro/ifac2011workshop/Hauser.pdf · Projection Operator Strategies for Trajectory

Positive Definiteness

Introduction

The Projection Operatorapproach to TrajectoryFunctional Minimization

A Sampling of SystemInvestigations

Positive Definiteness❖ Descent directioncalculation

❖ Derivatives

❖ Computation of D2P

❖D2g LagrangeMultiplier

❖D2g

❖ descent direction LQOCP

Constraints

21 / 34

Page 28: Projection Operator Strategies for Trajectory Optimization: Application ...users.isr.ist.utl.pt/~pedro/ifac2011workshop/Hauser.pdf · Projection Operator Strategies for Trajectory

Descent direction calculation

Introduction

The Projection Operatorapproach to TrajectoryFunctional Minimization

A Sampling of SystemInvestigations

Positive Definiteness❖ Descent directioncalculation

❖ Derivatives

❖ Computation of D2P

❖D2g LagrangeMultiplier

❖D2g

❖ descent direction LQOCP

Constraints

22 / 34

Recall that a descent direction may be found using a quadraticapproximation of the cost functional.

Making use of the representation theorem, the trajectory functional is fullydescribed locally by

h(P(ξi + ζ)) = h(ξi) +Dh(ξi) · ζ +1

2D2g(ξi) · (ζ, ζ) + h.o.t.

where ζ ranges over the space of tangent trajectories TξiT .

We thus seek a (Newton) descent direction as the solution of a linearquadratic optimal control problem:

ζi = arg minζ∈Tξi

TDh(ξi)·ζ +

12D2g(ξi)·(ζ, ζ)

Page 29: Projection Operator Strategies for Trajectory Optimization: Application ...users.isr.ist.utl.pt/~pedro/ifac2011workshop/Hauser.pdf · Projection Operator Strategies for Trajectory

Derivatives

Introduction

The Projection Operatorapproach to TrajectoryFunctional Minimization

A Sampling of SystemInvestigations

Positive Definiteness❖ Descent directioncalculation

❖ Derivatives

❖ Computation of D2P

❖D2g LagrangeMultiplier

❖D2g

❖ descent direction LQOCP

Constraints

23 / 34

First and second derivatives of g(ξ) = h(P(ξ)) are given by

Dg(ξ) · ζ = Dh(P(ξ)) ·DP(ξ) · ζ

D2g(ξ) · (ζ1, ζ2) =

D2h(P(ξ)) · (DP(ξ) · ζ1, DP(ξ) · ζ2)

+Dh(P(ξ)) ·D2P(ξ) · (ζ1, ζ2)

Page 30: Projection Operator Strategies for Trajectory Optimization: Application ...users.isr.ist.utl.pt/~pedro/ifac2011workshop/Hauser.pdf · Projection Operator Strategies for Trajectory

Derivatives

Introduction

The Projection Operatorapproach to TrajectoryFunctional Minimization

A Sampling of SystemInvestigations

Positive Definiteness❖ Descent directioncalculation

❖ Derivatives

❖ Computation of D2P

❖D2g LagrangeMultiplier

❖D2g

❖ descent direction LQOCP

Constraints

23 / 34

First and second derivatives of g(ξ) = h(P(ξ)) are given by

Dg(ξ) · ζ = Dh(P(ξ)) ·DP(ξ) · ζ

D2g(ξ) · (ζ1, ζ2) =

D2h(P(ξ)) · (DP(ξ) · ζ1, DP(ξ) · ζ2)

+Dh(P(ξ)) ·D2P(ξ) · (ζ1, ζ2)

When ξ ∈ T and ζi ∈ TξT , they specialize to

Dg(ξ) · ζ = Dh(ξ) · ζ

D2g(ξ) · (ζ1, ζ2) = D2h(ξ) · (ζ1, ζ2) + Dh(ξ) ·D2P(ξ) · (ζ1, ζ2)︸ ︷︷ ︸generalizes Lagrange multiplier

Page 31: Projection Operator Strategies for Trajectory Optimization: Application ...users.isr.ist.utl.pt/~pedro/ifac2011workshop/Hauser.pdf · Projection Operator Strategies for Trajectory

Derivatives

Introduction

The Projection Operatorapproach to TrajectoryFunctional Minimization

A Sampling of SystemInvestigations

Positive Definiteness❖ Descent directioncalculation

❖ Derivatives

❖ Computation of D2P

❖D2g LagrangeMultiplier

❖D2g

❖ descent direction LQOCP

Constraints

23 / 34

First and second derivatives of g(ξ) = h(P(ξ)) are given by

Dg(ξ) · ζ = Dh(P(ξ)) ·DP(ξ) · ζ

D2g(ξ) · (ζ1, ζ2) =

D2h(P(ξ)) · (DP(ξ) · ζ1, DP(ξ) · ζ2)

+Dh(P(ξ)) ·D2P(ξ) · (ζ1, ζ2)

When ξ ∈ T and ζi ∈ TξT , they specialize to

Dg(ξ) · ζ = Dh(ξ) · ζ

D2g(ξ) · (ζ1, ζ2) = D2h(ξ) · (ζ1, ζ2) + Dh(ξ) ·D2P(ξ) · (ζ1, ζ2)︸ ︷︷ ︸generalizes Lagrange multiplier

How to compute D2P(ξ) · (ζ1, ζ2) ?

Page 32: Projection Operator Strategies for Trajectory Optimization: Application ...users.isr.ist.utl.pt/~pedro/ifac2011workshop/Hauser.pdf · Projection Operator Strategies for Trajectory

Computation of D2P

Introduction

The Projection Operatorapproach to TrajectoryFunctional Minimization

A Sampling of SystemInvestigations

Positive Definiteness❖ Descent directioncalculation

❖ Derivatives

❖ Computation of D2P

❖D2g LagrangeMultiplier

❖D2g

❖ descent direction LQOCP

Constraints

24 / 34

We may use ODEs to calculate D2P(ξ) · (ζ1, ζ2):

η = (x, u) = P(ξ) = P(α, µ)γi = (zi, vi) = DP(ξ) · ζi = DP(ξ) · (βi, νi)ω = (y, w) = D2P(ξ) · (ζ1, ζ2)

η(t) : x(t) = f(x(t), u(t)), x(0) = x0

u(t) = µ(t) +K(t)(α(t)− x(t))

γi(t) : zi(t) = A(η(t))zi(t) +B(η(t))vi(t), zi(0) = 0vi(t) = νi(t) +K(t)(βi(t)− zi(t))

ω(t) : y(t) = A(η(t))y(t) +B(η(t))w(t) +D2f(η(t)) · (γ1(t), γ2(t))w(t) = −K(t)y(t), y(0) = 0

● The derivatives are about the trajectory η = P(ξ)● The feedback K(·) stabilizes the state at each level

Page 33: Projection Operator Strategies for Trajectory Optimization: Application ...users.isr.ist.utl.pt/~pedro/ifac2011workshop/Hauser.pdf · Projection Operator Strategies for Trajectory

D2g Lagrange Multiplier

Introduction

The Projection Operatorapproach to TrajectoryFunctional Minimization

A Sampling of SystemInvestigations

Positive Definiteness❖ Descent directioncalculation

❖ Derivatives

❖ Computation of D2P

❖D2g LagrangeMultiplier

❖D2g

❖ descent direction LQOCP

Constraints

25 / 34

Dh(ξ) ·D2P(ξ) · (ζ, ζ) =

∫ T

0

D2l(τ, ξ(τ)) · (D2P(ξ) · (ζ, ζ))(τ) dτ

=

∫ T

0

D2l(τ, ξ(τ)) ·

[I

−K(τ)

] ∫ τ

0

Φc(τ, s)D2f(ξ(s)) · (ζ(s), ζ(s)) ds dτ

=

∫ T

0

∫ T

s

D2l(τ, ξ(τ)) ·

[I

−K(τ)

]Φc(τ, s) dτ D2f(ξ(s)) · (ζ(s), ζ(s)) ds

=

∫ T

0

q(s)T D2f(ξ(s)) · (ζ(s), ζ(s)) ds

where

q(t) = −[A(ξ(t))−B(ξ(t))K(t)]T q(t)− lTx (t) +K(t)T lTu (t), q(T ) = 0

We obtain a stabilized adjoint variable, independent of stationaryconsiderations!

(for nonzero terminal cost, q(T ) = mTx (x(T )))

Page 34: Projection Operator Strategies for Trajectory Optimization: Application ...users.isr.ist.utl.pt/~pedro/ifac2011workshop/Hauser.pdf · Projection Operator Strategies for Trajectory

D2g

Introduction

The Projection Operatorapproach to TrajectoryFunctional Minimization

A Sampling of SystemInvestigations

Positive Definiteness❖ Descent directioncalculation

❖ Derivatives

❖ Computation of D2P

❖D2g LagrangeMultiplier

❖D2g

❖ descent direction LQOCP

Constraints

26 / 34

For ξ ∈ T and ζ ∈ TξP, D2g(ξ) · (ζ, ζ) has the form

∫ T

0

(z(τ)v(τ)

)T [Q(τ) S(τ)S(τ)T R(τ)

](z(τ)v(τ)

)dτ + z(T )TP1z(T )

where

W (t) =

[Q(τ) S(τ)S(τ)T R(τ)

]

has elements

wij(t) =∂2l

∂ξi∂ξj(t, ξ(t)) +

n∑

k=1

qk(t)∂2fk∂ξi∂ξj

(ξ(t))

and P1 = ∂2m∂x2 (x(T )).

In fact, W (·) is just the second derivative matrix of the Hamiltonian

H(t, x, u, q) = l(t, x, u) + qT f(x, u)

Again, no stationary considerations.

Page 35: Projection Operator Strategies for Trajectory Optimization: Application ...users.isr.ist.utl.pt/~pedro/ifac2011workshop/Hauser.pdf · Projection Operator Strategies for Trajectory

descent direction LQ OCP

Introduction

The Projection Operatorapproach to TrajectoryFunctional Minimization

A Sampling of SystemInvestigations

Positive Definiteness❖ Descent directioncalculation

❖ Derivatives

❖ Computation of D2P

❖D2g LagrangeMultiplier

❖D2g

❖ descent direction LQOCP

Constraints

27 / 34

The descent direction problem is a linear quadratic optimal control problem

min∫ T

0

(a(τ)b(τ)

)T(z(τ)v(τ)

)+

1

2

(z(τ)v(τ)

)T[Q(τ) S(τ)S(τ)T R(τ)

](z(τ)v(τ)

)dτ

+ rT1 z(T ) + z(T )TP1z(T )/2

subj to z = A(t)z +B(t)v, z(0) = 0,

where the cost is, in general, non-convex.

This LQ OCP (with PD R(·)) has a unique solution if and only if

P + ATP + PA− PBR−1BTP + Q = 0, P (T ) = P1

has a bounded solution on [0, T ].[ A = A−BR−1ST , Q = Q− SR−1ST ]

Page 36: Projection Operator Strategies for Trajectory Optimization: Application ...users.isr.ist.utl.pt/~pedro/ifac2011workshop/Hauser.pdf · Projection Operator Strategies for Trajectory

Constraints

Introduction

The Projection Operatorapproach to TrajectoryFunctional Minimization

A Sampling of SystemInvestigations

Positive Definiteness

Constraints

❖ Constraints❖ Barrier FunctionApproach n

❖ Barrier FunctionApproach ∞

❖ Approximate BarrierFunction❖ Approximate BarrierFunctional

❖ Collision Avoidance

28 / 34

Page 37: Projection Operator Strategies for Trajectory Optimization: Application ...users.isr.ist.utl.pt/~pedro/ifac2011workshop/Hauser.pdf · Projection Operator Strategies for Trajectory

Constraints

Introduction

The Projection Operatorapproach to TrajectoryFunctional Minimization

A Sampling of SystemInvestigations

Positive Definiteness

Constraints

❖ Constraints❖ Barrier FunctionApproach n

❖ Barrier FunctionApproach ∞

❖ Approximate BarrierFunction❖ Approximate BarrierFunctional

❖ Collision Avoidance

29 / 34

Use a barrier function approach to approximate the (local) solution ofconstrained optimal control problems of the form

minimize∫ T

0

l(τ, x(τ), u(τ)) dτ +m(x(T ))

subject to x(t) = f(x(t), u(t)), x(0) = x0

cj(t, x(t), u(t)) ≤ 0, t ∈ [0, T ], a.e.j = 1, . . . , k,

where the data satisfies some reasonable smoothness and convexityproperties.

Approximating OCPs will be unconstrained .

Page 38: Projection Operator Strategies for Trajectory Optimization: Application ...users.isr.ist.utl.pt/~pedro/ifac2011workshop/Hauser.pdf · Projection Operator Strategies for Trajectory

Barrier Function Approach n

Introduction

The Projection Operatorapproach to TrajectoryFunctional Minimization

A Sampling of SystemInvestigations

Positive Definiteness

Constraints

❖ Constraints❖ Barrier FunctionApproach n

❖ Barrier FunctionApproach ∞

❖ Approximate BarrierFunction❖ Approximate BarrierFunctional

❖ Collision Avoidance

30 / 34

In finite dimensions, a solution to a C2 convex problem

min f(x)s.t. cj(x) ≤ 0, j = 1, . . . , k

is found by solving a sequence of convex problems

minx∈C

f(x)− ǫ∑

j log(−cj(x))

where C = {x ∈ Rn : cj(x) < 0} is the open strictly feasible set.

Page 39: Projection Operator Strategies for Trajectory Optimization: Application ...users.isr.ist.utl.pt/~pedro/ifac2011workshop/Hauser.pdf · Projection Operator Strategies for Trajectory

Barrier Function Approach ∞

Introduction

The Projection Operatorapproach to TrajectoryFunctional Minimization

A Sampling of SystemInvestigations

Positive Definiteness

Constraints

❖ Constraints❖ Barrier FunctionApproach n

❖ Barrier FunctionApproach ∞

❖ Approximate BarrierFunction❖ Approximate BarrierFunctional

❖ Collision Avoidance

31 / 34

The direct OCP translation is

min

∫ T

0

l(τ, x(τ), u(τ))− ǫ∑

j log(−cj(τ, x(τ), u(τ))) dτ

+ m(x(T ))

s.t. x(t) = f(x(t), u(t)), x(0) = x0

Suppose that at some ǫ0 > 0, this problem possesses a locally optimaltrajectory ξ∗ǫ0 = (x∗

ǫ0(·), u∗ǫ0(·)) that is SSC and that the Hamiltonian is

strongly convex in u.

Then ξ∗ǫ0 is a strictly feasible trajectory (of constrained problem) and theIFT indicates nice dependence on ǫ.

Looks promising ... but guaranteeing strict feasibility during optimizationprocess may be difficult .

Page 40: Projection Operator Strategies for Trajectory Optimization: Application ...users.isr.ist.utl.pt/~pedro/ifac2011workshop/Hauser.pdf · Projection Operator Strategies for Trajectory

Approximate Barrier Function

Introduction

The Projection Operatorapproach to TrajectoryFunctional Minimization

A Sampling of SystemInvestigations

Positive Definiteness

Constraints

❖ Constraints❖ Barrier FunctionApproach n

❖ Barrier FunctionApproach ∞

❖ Approximate BarrierFunction❖ Approximate BarrierFunctional

❖ Collision Avoidance

32 / 34

For 0 < δ ≤ 1, define the C2 approximate log barrier function

βδ : (−∞,∞) → (0,∞)

βδ(z) =

− log z z > δ

k − 1

k

[(z − kδ

(k − 1)δ

)k

− 1

]− log δ z ≤ δ

where k > 1 is an even integer, e.g., k = 2.

βδ(·) retains many of the important properties of the log barrier function

Similar to z 7→ − log z: for strictly convex proper c : R → R, z 7→ βδ(−c(z))is also strictly convex so that

minx∈C

f(x) + ǫ∑

j βδ(−cj(x))

is a convex problem that has the same solution (x∗ǫ ) provided δ < cj(x

∗ǫ ) for

all j.

Page 41: Projection Operator Strategies for Trajectory Optimization: Application ...users.isr.ist.utl.pt/~pedro/ifac2011workshop/Hauser.pdf · Projection Operator Strategies for Trajectory

Approximate Barrier Functional

Introduction

The Projection Operatorapproach to TrajectoryFunctional Minimization

A Sampling of SystemInvestigations

Positive Definiteness

Constraints

❖ Constraints❖ Barrier FunctionApproach n

❖ Barrier FunctionApproach ∞

❖ Approximate BarrierFunction❖ Approximate BarrierFunctional

❖ Collision Avoidance

33 / 34

Returning to infinite dimensions, define, for ξ = (α(·), µ(·))

bδ(ξ) =

∫ T

0

∑jβδ(−cj(τ, α(τ), µ(τ))) dτ

and consider unconstrained approximation (to constrained OCP)

minξ∈T

h(ξ) + ǫbδ(ξ)

Note: h(·) + ǫbδ(·) can be evaluated on any curve ξ in X.

Page 42: Projection Operator Strategies for Trajectory Optimization: Application ...users.isr.ist.utl.pt/~pedro/ifac2011workshop/Hauser.pdf · Projection Operator Strategies for Trajectory

Collision Avoidance

Introduction

The Projection Operatorapproach to TrajectoryFunctional Minimization

A Sampling of SystemInvestigations

Positive Definiteness

Constraints

❖ Constraints❖ Barrier FunctionApproach n

❖ Barrier FunctionApproach ∞

❖ Approximate BarrierFunction❖ Approximate BarrierFunctional

❖ Collision Avoidance

34 / 34

For collision avoidance, the constraint function is not convex.

‖(xi, yi)− (xj , yj)‖2 > r2

Nevertheless, this approach seems to work due since, in radial directions,local convexity does hold.To take into account the notion that being far away is not really better thanmerely being feasible, we replace − log−z by

− log tanh−z

for z < 0. For z ≥ 0, we use −z in place of tanh−z, and use the usualapproximation.