introduction to the mathematical theory of control, lecture 5motta/lectures valona...

26
logoslides Introduction to the Mathematical Theory of Control, Lecture 5 Monica Motta Dipartimento di Matematica Università di Padova Valona, September 15, 2017 M. Motta (Padua Un.) Control Theory Valona, September 15, 2017 1 / 19

Upload: others

Post on 11-Oct-2020

0 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Introduction to the Mathematical Theory of Control, Lecture 5motta/Lectures Valona 2017/ControlThe… · the differential equation p˙ (t)= )A , (8) ... LECTURE 5: the PMP, proof

logoslides

Introduction to the Mathematical Theory of Control,Lecture 5

Monica Motta

Dipartimento di MatematicaUniversità di Padova

Valona, September 15, 2017

M. Motta (Padua Un.) Control Theory Valona, September 15, 2017 1 / 19

Page 2: Introduction to the Mathematical Theory of Control, Lecture 5motta/Lectures Valona 2017/ControlThe… · the differential equation p˙ (t)= )A , (8) ... LECTURE 5: the PMP, proof

Table of contents

1 Necessary optimality conditions: Pontryagin Maximum Principle (PMP)

2 Examples on the use of the PMP

3 Proof of the Pontryagin Maximum Principle

4 Some sufficient optimality conditions

M. Motta (Padua Un.) Control Theory Valona, September 15, 2017 2 / 19

Page 3: Introduction to the Mathematical Theory of Control, Lecture 5motta/Lectures Valona 2017/ControlThe… · the differential equation p˙ (t)= )A , (8) ... LECTURE 5: the PMP, proof

PMP without final constraints

Given T > 0, an initial state x , and a cost function : Rn ! R weconsider the maximization problem

(P)

8>><

>>:

maxu2U

(x(T , u))

over the pairs (x , u)(·) verifying

x(t) = f (t , x(t), u(t)) u 2 U , x(0) = x .

We shall assume:

(HP

) The set ⌦ ⇢ Rn is open, f is continuous in ⌦⇥ U and C

1 in x . Thepayoff : Rn ! R is differentiable.

Notice that here we are not making any assumption on the set U ofadmissible control values. In particular, we may well have U = Rm.(We deal with optimization problems without endpoint constraints, since this is a case where the key

ideas can be more clearly described)M. Motta (Padua Un.) Control Theory Valona, September 15, 2017 3 / 19

Page 4: Introduction to the Mathematical Theory of Control, Lecture 5motta/Lectures Valona 2017/ControlThe… · the differential equation p˙ (t)= )A , (8) ... LECTURE 5: the PMP, proof

Theorem 1 (PMP without endpoint constraints).Assume (H

P

). Let u

⇤(·) be a bounded, admissible optimal control for

problem (P), whose corresponding trajectory x

⇤(·) is optimal. Denote

p : [0,T ] ! Rn

the solution of the adjoint equation

p(t) = �p(t)D

x

f (t , x⇤(t), u⇤(t)), p(T ) = D (x⇤(T )). (1)

Then the maximality condition

p(t) · f (t , x⇤(t), u⇤(t)) = maxw2U

p(t) · f (t , x⇤(t),w), (2)

holds for a.e. t 2 [0,T ].

M. Motta (Padua Un.) Control Theory Valona, September 15, 2017 4 / 19

Page 5: Introduction to the Mathematical Theory of Control, Lecture 5motta/Lectures Valona 2017/ControlThe… · the differential equation p˙ (t)= )A , (8) ... LECTURE 5: the PMP, proof

In coordinates, the adjoint equation can be rewritten as

p

i

(t) = �nX

j=1

p

j

(t)@f

j

@x

i

(t , x⇤(t), u⇤(t)), p

i

(T ) =@

@x

i

(x⇤(T )) (3)

(i = 1, . . . , n), and the maximality condition as

nX

i=1

p

i

(t) f

i

(t , x⇤(t), u⇤(t)) = maxw2U

(nX

i=1

p

i

(t) f

i

(t , x⇤(t),w)

), (4)

for a.e. t 2 [0,T ].

M. Motta (Padua Un.) Control Theory Valona, September 15, 2017 5 / 19

Page 6: Introduction to the Mathematical Theory of Control, Lecture 5motta/Lectures Valona 2017/ControlThe… · the differential equation p˙ (t)= )A , (8) ... LECTURE 5: the PMP, proof

Geometric meaning of the maximality condition

’At each time ⌧ , among the possible attainable speedsx(⌧) 2 {f (⌧, x⇤(⌧),w) : w 2 U} , we should choose the one thatmaximizes the scalar product p(⌧) · x(⌧)’.

M. Motta (Padua Un.) Control Theory Valona, September 15, 2017 6 / 19

Page 7: Introduction to the Mathematical Theory of Control, Lecture 5motta/Lectures Valona 2017/ControlThe… · the differential equation p˙ (t)= )A , (8) ... LECTURE 5: the PMP, proof

The Pontryagin Maximum Principle motivates a practical method forfinding optimal solutions.

1 Define the function u = u(t , x , u) 2 U, in terms of the maximalitycondition:

p · f (t , x , u(t , x , p)) = maxw2U

p · f (t , x ,w)

2 solve the system of 2n differential equations in the variables x , p:(

x = f (t , x , u(t , x , p))

p = �D

x

f (t , x , u(t , x , p))

with boundary conditions

x(0) = x , p(T ) = D (x(T )).

M. Motta (Padua Un.) Control Theory Valona, September 15, 2017 7 / 19

Page 8: Introduction to the Mathematical Theory of Control, Lecture 5motta/Lectures Valona 2017/ControlThe… · the differential equation p˙ (t)= )A , (8) ... LECTURE 5: the PMP, proof

In general, this method encounters two main difficulties:

1 The map (t , x , p) 7! u(t , x , p) may be multivalued and/ordiscontinuous.

2 The above equations in (x , p)(·) do not constitute a Cauchy problemon R2n, but a (usually harder) two–point boundary value problem.

In particular cases, however, the equations for p and x can beuncoupled, and a solution is more easily found.

M. Motta (Padua Un.) Control Theory Valona, September 15, 2017 8 / 19

Page 9: Introduction to the Mathematical Theory of Control, Lecture 5motta/Lectures Valona 2017/ControlThe… · the differential equation p˙ (t)= )A , (8) ... LECTURE 5: the PMP, proof

Examples

Example 2 (Linear pendulum with external force).Let q be the position of a linearized pendulum controlled by an externalforce u, with magnitude constraint

|u(t)| 1 8t .

Assume that the motion is determined by the equations

q(t) + q(t) = u(t), q(0) = q(0) = 0.

We wish to maximize the displacement q(T ) at a fixed terminal time T .If x1 := q, x2 := q, the optimization problem can be formulated as

(P)

8>><

>>:

maxu

x1(T , u)

over the pairs (x , u)(·) verifying u(t) 2 [�1, 1] and

(x1, x2) = (x2,�x1 + u), (x1, x2)(0) = (0, 0).

M. Motta (Padua Un.) Control Theory Valona, September 15, 2017 9 / 19

Page 10: Introduction to the Mathematical Theory of Control, Lecture 5motta/Lectures Valona 2017/ControlThe… · the differential equation p˙ (t)= )A , (8) ... LECTURE 5: the PMP, proof

Example 3.To appreciate the effect of the map u = u(t , x , p) being multivalued,consider the problem

(P)

8>><

>>:

maxu

x3(T , u)

over the pairs (x , u)(·) verifying u(t) 2 [�1, 1] and

(x1, x2, x3) = (u,�x1, x2 � x

21 ), (x1, x2, x3)(0) = (0, 0, 0).

M. Motta (Padua Un.) Control Theory Valona, September 15, 2017 10 / 19

Page 11: Introduction to the Mathematical Theory of Control, Lecture 5motta/Lectures Valona 2017/ControlThe… · the differential equation p˙ (t)= )A , (8) ... LECTURE 5: the PMP, proof

Example 4.To see that the conditions of the PMP are not sufficient for optimality,consider the problem

(P)

8>><

>>:

maxu

x2(T , u)

over the pairs (x , u)(·) verifying u(t) 2 [�1, 1] and

(x1, x2) = (u, x21 ), (x1, x2)(0) = (0, 0),

and notice that the constant control u

⇤ ⌘ 0 yields the constant trajectoryx

⇤ ⌘ 0 and they satisfy the conditions of the PMP (verify!). Howeveru

⇤ ⌘ 0 is NOT optimal.

M. Motta (Padua Un.) Control Theory Valona, September 15, 2017 11 / 19

Page 12: Introduction to the Mathematical Theory of Control, Lecture 5motta/Lectures Valona 2017/ControlThe… · the differential equation p˙ (t)= )A , (8) ... LECTURE 5: the PMP, proof

Basic steps of the proof of the PMPTo understand the derivation of the Pontryagin Maximum Principle (PMP), it is best to assume first theoptimal control u

⇤(·) is continuous. In this case the maximality condition holds for all t 2 [0, T ].

1 Fix any time ⌧ > 0 and any value w 2 U. For any " > 0, consider a perturbed control function

u"(t) =

(w if t 2 [⌧ � ", ⌧ ]

u

⇤(t) otherwise

called a needle variation of u.2 Let x"(·) := x(·, u"). Evaluate for any t � ⌧ :

v(t) :=dx"

d"(t)

���"=0+

. Observe that v(⌧) = f (⌧, x

⇤(⌧),w)� f (⌧, x

⇤(⌧), u

⇤(⌧)).

3 Since u

⇤(·) is optimal, (x⇤(T )) � (x"(T )) for any " > 0. Hence, differentiating w.r.t. ",

0 � lim"!0+

(x"(T ))� (x⇤(T ))

"= D (x⇤(T )) ·

dx"

d"(T )

���"=0+

= D (x⇤(T )) · v(T )

4 Prove that the the scalar product p(t) · v(t) is constant, so that the condition at T can be"transported" at ⌧ and the maximality condition follows:

0 � D (x⇤(T ))·v(T ) = p(T )·v(T )=p(⌧)·v(⌧) = p(⌧)·[f (⌧, x

⇤(⌧),w)�f (⌧, x

⇤(⌧), u

⇤(⌧))].

M. Motta (Padua Un.) Control Theory Valona, September 15, 2017 12 / 19

Page 13: Introduction to the Mathematical Theory of Control, Lecture 5motta/Lectures Valona 2017/ControlThe… · the differential equation p˙ (t)= )A , (8) ... LECTURE 5: the PMP, proof

Some pictures:

Needle variation of u

Perturbed trajectories and variation v

M. Motta (Padua Un.) Control Theory Valona, September 15, 2017 13 / 19

Page 14: Introduction to the Mathematical Theory of Control, Lecture 5motta/Lectures Valona 2017/ControlThe… · the differential equation p˙ (t)= )A , (8) ... LECTURE 5: the PMP, proof

Tools:

Theorem 5 (Differentiability respect to initial conditions).

Let g : R⇥ Rn ! Rn

be a function, measurable in t , C

1in x and such that

there exists constants C, L such that

|g(t , x)| C, |g(t , x)� g(t , y)| L|x � y | 8(t , x , y). (5)

Let x(·) := x(·, t0, x0) be the solution of

x(t) = g(t , x(t)), x(t0) = x0. (6)

For a vector v0 2 Rn, call v(·) the solution of the linear Cauchy problem

v(t) = D

x

g(t , x(t))v(t), v(t0) = v0.

Then for any t > t0,

lim"!0+

x(t , t0, x0 + "v0)� x(t)

"= v(t).

M. Motta (Padua Un.) Control Theory Valona, September 15, 2017 14 / 19

Page 15: Introduction to the Mathematical Theory of Control, Lecture 5motta/Lectures Valona 2017/ControlThe… · the differential equation p˙ (t)= )A , (8) ... LECTURE 5: the PMP, proof

Set y"(·) := x(·) + " v(·). The picture describes the first order variationof a solution of a ODE, as the initial point x0 is changed to x0 + "v0.

M. Motta (Padua Un.) Control Theory Valona, September 15, 2017 15 / 19

Page 16: Introduction to the Mathematical Theory of Control, Lecture 5motta/Lectures Valona 2017/ControlThe… · the differential equation p˙ (t)= )A , (8) ... LECTURE 5: the PMP, proof

Adjoint systemsWe call adjoint system of the linear system

x(t) = A(t) x(t) (7)

the differential equationp(t) = �p(t)A(t), (8)

that is,

(p1(t), . . . , p

n

(t)) = � (p1(t), . . . , p

n

(t))

0

B@

a11(t) . . . a1n

(t)...

...a

n1(t) . . . a

nn

(t)

1

CA

Theorem 6 (Property of the adjoint system).Let x(·), p(·) two solutions of (7), (8) respectively, defined on the same

time interval [a, b]. Then their scalar product p(t) · x(t) is constant.

Proof. This is verified by the direct computation

d

dt

(x · p) = x · p + x · p = p · A(t)x � pA(t) · x = 0.

M. Motta (Padua Un.) Control Theory Valona, September 15, 2017 16 / 19

Page 17: Introduction to the Mathematical Theory of Control, Lecture 5motta/Lectures Valona 2017/ControlThe… · the differential equation p˙ (t)= )A , (8) ... LECTURE 5: the PMP, proof

For the general case, of possibly discontinuous optimal control u

⇤(·), weapply to t 7! h(t) = f (t , x⇤(t), u⇤(t)) the following

Theorem 7 (Lebesgue Differentiation Theorem).Let h : [a, b] ! Rn

be integrable. Then, almost every ⌧ 2 [a, b] is a

Lebesgue point, i.e.

lim"!0+

12"

Z ⌧+"

⌧�"|h(⌧)� h(s)| ds = 0.

M. Motta (Padua Un.) Control Theory Valona, September 15, 2017 17 / 19

Page 18: Introduction to the Mathematical Theory of Control, Lecture 5motta/Lectures Valona 2017/ControlThe… · the differential equation p˙ (t)= )A , (8) ... LECTURE 5: the PMP, proof

Some sufficient optimality conditions

The simplest sufficient optimality conditions can be obtained combiningthe conditions for the existence of optimal controls with the PMP.

Theorem 8 (Existence + PMP).Assume (H) and let be continuously differentiable. Assume that an

optimal solution for problem (P) exists.

If there is a finite number of admissible controls u1, . . . , uk

that satisfy the

PMP, then, among them, the one which yields the lowest value of the cost

is optimal.

Less trivial results can be obtained combining convexity assumptions withthe PMP.

Let me mention a different approach to derive sufficient optimalityconditions, NOT based on the PMP, known as Dynamic Programming.

M. Motta (Padua Un.) Control Theory Valona, September 15, 2017 18 / 19

Page 19: Introduction to the Mathematical Theory of Control, Lecture 5motta/Lectures Valona 2017/ControlThe… · the differential equation p˙ (t)= )A , (8) ... LECTURE 5: the PMP, proof

Thank you for your attention!

WEB page: http://www.math.unipd.it/⇠ motta/

E-Mail: [email protected]

M. Motta (Padua Un.) Control Theory Valona, September 15, 2017 19 / 19

Page 20: Introduction to the Mathematical Theory of Control, Lecture 5motta/Lectures Valona 2017/ControlThe… · the differential equation p˙ (t)= )A , (8) ... LECTURE 5: the PMP, proof

LECTURE 5: the PMP, proofgiovedì 14 settembre 2017 16:01

Nuova sezione 1 Pagina 1

Page 21: Introduction to the Mathematical Theory of Control, Lecture 5motta/Lectures Valona 2017/ControlThe… · the differential equation p˙ (t)= )A , (8) ... LECTURE 5: the PMP, proof

Nuova sezione 1 Pagina 2

Page 22: Introduction to the Mathematical Theory of Control, Lecture 5motta/Lectures Valona 2017/ControlThe… · the differential equation p˙ (t)= )A , (8) ... LECTURE 5: the PMP, proof

Nuova sezione 1 Pagina 3

Page 23: Introduction to the Mathematical Theory of Control, Lecture 5motta/Lectures Valona 2017/ControlThe… · the differential equation p˙ (t)= )A , (8) ... LECTURE 5: the PMP, proof

Nuova sezione 1 Pagina 4

Page 24: Introduction to the Mathematical Theory of Control, Lecture 5motta/Lectures Valona 2017/ControlThe… · the differential equation p˙ (t)= )A , (8) ... LECTURE 5: the PMP, proof

Nuova sezione 1 Pagina 5

Page 25: Introduction to the Mathematical Theory of Control, Lecture 5motta/Lectures Valona 2017/ControlThe… · the differential equation p˙ (t)= )A , (8) ... LECTURE 5: the PMP, proof

Nuova sezione 1 Pagina 6

Page 26: Introduction to the Mathematical Theory of Control, Lecture 5motta/Lectures Valona 2017/ControlThe… · the differential equation p˙ (t)= )A , (8) ... LECTURE 5: the PMP, proof

Nuova sezione 1 Pagina 7