ellipsoid 4
TRANSCRIPT
-
8/2/2019 Ellipsoid 4
1/26
T he E llipsoid m ethod
R.M. Freund/ C. Roos (MIT/TUD)e-mail: [email protected]
URL: http://www.isa.ewi.tudelft.nl/roos
WI 4218March 21, A.D. 2007
Optimization Group 1/26
-
8/2/2019 Ellipsoid 4
2/26
Outline
Sherman-Morrison formula
Ellipsoids
Ellipsoid method
Basic construction Prototypical iteration
Iteration bound
Two theorems Two more theorems
Optimization with the ellipsoid method
Convexity of the homogeneous problem The Key Proposition
Optimization Group 2/26
-
8/2/2019 Ellipsoid 4
3/26
In memory of George Danzig and Leonid Khachyan (1)
Leonid Khachiyan (1952-2005) passed away Friday April 29 at the age of 52. He died in hissleep, apparently of a heart attack, colleagues said. Khachiyan was best known for his 1979
use of the ellipsoid algorithm, originally developed for convex programming, to give the first
polynomial-time algorithm to solve linear programming problems. While the simplex algorithm
solved linear programs well in practice, Khachiyan gave the first formal proof of an efficientalgorithm in the worst case.
He was among the worlds most famous computer scientists, said Haym Hirsh, chairman of
the computer science department at Rutgers.
George B. Dantzig, the father of Linear Programming and the creator of the Simplex Method,
died at the age of 90 on May 13. In a statement INFORMS President Richard C. Larsonmourned his death (see http://www.informs.org/Press/dantzigobit.htm). The tributes in-
cluded obituaries in the Washington Post, the Francisco Chronicle, Mercury News and New
York Times. National Public Radio commentator Keith Devlin remembered George Dantzig
in a broadcast on Saturday, May 21.
Optimization Group 3/26
-
8/2/2019 Ellipsoid 4
4/26
In memory of George Danzig and Leonid Khachyan (2)
Optimization Group 4/26
-
8/2/2019 Ellipsoid 4
5/26
Leonid Khachyan in Shanghai (with Bai, Peng and Terlaky, 2002)
Optimization Group 5/26
-
8/2/2019 Ellipsoid 4
6/26
Sherman-Morrison formulaLet Q,R,S,T be matrices such that
Q and Q + RST are nonsingular.
R and S are n
k matrices of rank k
n.
Then (Q + RST)1 = Q1 Q1R(I + STQ1R)1STQ1
Lemma 1 Let U be such that QU = R. Then I + STU is invertible.
Proof: Suppose w Rk satisfies (I + STU)w = 0. Then,(Q + RST)U w = (QU)w + R(STU w) = Rw Rw = 0.
Q + RST being nonsingular, this gives U w = 0. Since rank (U) = k this implies w = 0. Theorem 1 If Qx0 = q and (I + STU)y = STx0 then x = x0
U y satisfies (Q + RST)x = q.
Proof: (Q + RST)(x0 U y) = Qx0 + RSTx0 QU y RSTU y= q + R(I + STU)y Ry RSTU y = q. 2
The solution x of (Q + RST)x = q is given by
x = x0 U y = Q1q Q1R(I + STU)1STx0= (Q1 Q1R(I + STQ1R)1STQ1)q.
Since Theorem 1 holds for all q Rn the Sherman-Morrison formula follows.Reference:W. W. Hager. Updating the inverse of a matrix. SIAM Rev., 31(2):221239, June 1989.
Optimization Group 6/26
-
8/2/2019 Ellipsoid 4
7/26
Ellipsoids
Let M be a (symmetric) positive definite n n matrix and z Rn. Then
EM,z :=
x : (x z)T
M(x z) 1denotes an ellipsoid centered at z. Note that
x EM,z
M1
2 (x z)
1.
Hence, putting u = M
1
2
(x z), we have x = z + M1
2
u and u 1. Therefore,EM,z =
x = z + M
1
2 u : uTu 1
.
In other words,
EM,z = z + M1
2 B(0, 1),
where B(0, 1) denotes the unit sphere, centered at the origin. The volume of B(0, 1) is given by
(n) =
n
2
n2
+ 1,
and the volume of EM,z by
vol
EM,z
=(n)
det(M).
Hence
ln volEM,z = ln (n) 1
2
ln det(M).
Optimization Group 7/26
-
8/2/2019 Ellipsoid 4
8/26
Ellipsoid method
z
a
Given is a set convex S. We want to find s
S. We assume
that an ellipsoid EM,z is given such that S EM,z.Step 1 : k = 0; Mk = M; zk = z;
Step 2 : if zk S: STOP;Step 3 : Find nonzero vector a such that
aTx aTzk, x S;
Step 4 : Construct smallest volume ellipsoid that contains
EM,z x Rn : aT(x zk) 0
;
Let this ellipsoid have matrix Mk+1 and centerzk+1.
Step 5 : k = k + 1;
Step 6 : Goto Step 2.
Optimization Group 8/26
-
8/2/2019 Ellipsoid 4
9/26
Basic construction (1)
Find m = (z, 0, . . . , 0), M = diag (a1, a2, . . . , an), such that the ellipsoid
EM,m = x Rn : (x m)TM(x m) 1contains B(0, 1) {x : x1 0} and has minimal volume. Note that
(x m)TM(x m) = a1(x1 z)2 +n
i=2
aix2i .
The unit vectors e1,
e2, . . . ,
en are on the boundary of B(0, 1)
{x : x1
0
}. We require them to lie
on the boundary of the ellipsoid. This gives:
a1(1 z)2 = 1a1z2 + ai = 1, 2 i n.
From this we obtain
a1 =1
(1 z)2 , ai = 1 a1z2 = 1 z
2
(1 z)2 =1 2z
(1 z)2 , i 2.
Recall that vol
EM,m
is minimal if det M is maximal. Since
det M = (1 2z)n
1
(1 z)2n ,
one may easily verify that this occurs if z = 1n+1
. Substituting this value we obtain
a1 = 1 + 1n
2
, ai = 1
1
n2, i
2.
Optimization Group 9/26
-
8/2/2019 Ellipsoid 4
10/26
Illustration for n = 2
-1 0 1
-1
0
1
Optimization Group 10/26
-
8/2/2019 Ellipsoid 4
11/26
Basic construction (2)
a1 =(n + 1)2
n2, ai =
n2 1n2
, i 2.
B =
y : yTy 1 , H = {y : y1 0}M = n
21n2
I + 2
n1e1eT1
, z = 1
n+1e1
E = EM ,z
Theorem 2 (B H) E.Proof: One has y E if and only if
y e1
n + 1
Tn2 1
n2
I + 2e1eT1
n 1
y e1n + 1
1.
This is equivalent to
n2 1n
2
n
i=1
y2i +1
n2
+ y1 (y1
1)
2n + 2
n2
1.
Now let y B H. Then 0 y1 1 implies y1 (y1 1) 0. Alson
i=1 y2i 1. Since
n2 1n2
+1
n2= 1,
we obtain y E.
Optimization Group 11/26
-
8/2/2019 Ellipsoid 4
12/26
Basic construction (3)
B = y : yTy 1M = n
21n2
I + 2n1e1e
T1
E = EM,z
Theorem 3 vol (E) < vol (B) e1
2(n+1) .
Proof: One has vol(E)vol(B)
=
det I
det M= 1
det M. Moreover,
det M =
n21
n2
n 1 + 2n1
=
n21
n2
n1 n+1
n
2.
Hence, using 1 + x
ex we get
1det M
=
n2
n21n1
nn+1
2=
1 + 1
n21n1
1 1n+12
en1
n21 e2
n+1 = e1
n+1 .
This implies the theorem. Optimization Group 12/26
-
8/2/2019 Ellipsoid 4
13/26
Prototypical iteration
EM,z := x : (x z)T M(x
z)
1 = z + M
12B(0, 1)We know M, z, and a nonzero vector a. Define
M = n21n2
M + 2n1
aaT
aTM1a
, z = z + 1n+1
M1aaTM1a
By the Sherman-Morrison formula, one has
M1 = n2
n2 1
M1 2
n + 1
M1aaTM1aTM1a
Theorem 4
EM,z
x : aTx aTz
EM,z.
Theorem 5
vol EM,z < vol EM,z e1
2(n+1) .
Optimization Group 13/26
P f f Th 5
-
8/2/2019 Ellipsoid 4
14/26
Proof of Theorem 5
u := M1
2 a, b := u e1, R := 2(u + b)(u + b)T
u + b2 I
One has RT = R, Ru = b, R2 = I.Hence
M = n21n2
M + 2
n1aaT
aTM1a
= n
21n2
M1
2 I + 2n1M
12 aaTM
12
aTM1a M12= n
21n2
M1
2 R
I + 2n1
RM12 aaTM
12 R
aTM1a
RM
1
2
= n21n2
M1
2 R
I + 2n1
RM12 aaTM
12 R
aTM12 RRM
12 a
RM
1
2
= n21n2
M1
2 RI +2
n1e1eT1RM
1
2 .
Therefore,
det M =
n21n2
ndet M(det R)2 det
I + 2
n1e1eT1
=
n21n2
n 1 + 2
n1
det M,
and
det M
det M =n2 1n2
n 1 +
2
n 1
> e
1
n+1 .
Hence
vol
EM ,z
vol EM,z=
det Mdet M
< e1
2(n+1)
This proves Theorem 5.
Optimization Group 14/26
P f f Th 4
-
8/2/2019 Ellipsoid 4
15/26
Proof of Theorem 4
EM,z
x : aTx
aTz
EM,z.
Proof: Suppose (x z)TM(x z) 1 and aTx aTz. Putu = M
1
2 a, Ru = b =
u
e1, y := RM
1
2 (x
z)
Then
yTy = (x z)TM12 R2M12 (x z) = (x z)TM(x z) 1.Moreover,
u
y1 =
u
eT1 y = bTy = uTRu
= aTM1
2 RRM1
2 (x z) = aT(x z) 0.Now
(x z)TM(x z) =
(x z 1
n+1M1a
aTM1a)T
M(x z 1
n+1M1a
aTM1a) =(x z 1
n+1M1aaTM1a
)TM1
2 R
n21n2
I + 2
n1e1eT1
RM
1
2
x z M1a
aTM1a
=
y 1n+1
ue1u
I + 2
n1e1eT1
y 1
n+1ue1u
1.
The inequality follows just as in the proof of Theorem 1, since yTy 1 and y1 0.
Optimization Group 15/26
T th
-
8/2/2019 Ellipsoid 4
16/26
Two theorems
Theorem 6 Suppose we want to find a point in the set S, with S EM0,z0 and vol (S) > 0. Then thealgorithm will find a point in S after at most
2(n + 1) ln
vol
EM0,z0
vol (S)
iterations.
Proof: After k iterations, we havevol
EMk,zk
vol EM0,z0 e k2(n+1) .Since S EMk,zk for each k we obtain
vol (S)
vol EM0,z0 e k2(n+1) .
Hence
ln vol (S) ln vol
EM0,z0 k
2(n + 1),
which implies
k 2(n + 1) ln
vol
EM0,z0
vol (S)
.
This proves the theorem.
Optimization Group 16/26
Two theorems (cont )
-
8/2/2019 Ellipsoid 4
17/26
Two theorems (cont.)
Let
B(c, ) :=
{x :
x
c
}.
Theorem 7 Suppose we knowR such thatS B(0, R) and thatS contains a ballB (x, r) for somex andr > 0. Then the algorithm will find a point in S after at most
2n(n + 1 ) l n
Rr
iterations.
Proof: After k iterations, we have
vol (B (x, r)) vol (S) vol (B(0, R)) ek
2(n+1) .Taking logarithms again we get
ln vol (B (x, r)) ln vol (B(0, R)) k2(n + 1)
,
and hence
k 2(n + 1) ln
vol(B(0,R))vol(B(x,r))
= 2(n + 1) ln
(n)Rn
(n)rn
= 2(n + 1)n ln
Rr
.
This proves the theorem.
Optimization Group 17/26
Two more theorems
-
8/2/2019 Ellipsoid 4
18/26
Two more theorems
Actually, we can improve the previous two results a lot in terms of what we need to know.
Theorem 8 Suppose vol
S EM0,z0
> 0. Then the algorithm will find a point in S
after at most
2(n + 1) lnvol
EM0,z0
vol
S EM0,z0iterations.
Proof: Obvious.
Theorem 9 Suppose we know R and that S B(0, R) contains a ball B (x, r) for somex and r > 0. Then the algorithm will find a point in S after at most
2n(n + 1) ln
Rr
iterations.
Proof: Obvious. Optimization Group 18/26
Optimization with the ellipsoid method
-
8/2/2019 Ellipsoid 4
19/26
Optimization with the ellipsoid method
Consider the minimization problem
z = mincTx : x S .
Let
P :=
x : x S, cTx z + .Theorem 10 Suppose we know EM0,z0 such that EM0,z0
P
=
. Then the algorithm will find a point in
P after at most 2(n + 1) ln
vol
EM0,z0
vol
EM0,z0
P
iterations.
Proof: Obvious. Theorem 11 Let us know R and B(0, R) P contains a ball B (x, r) for some x and r > 0. Then thealgorithm will find a point in P after at most
2(n + 1)n ln
Rr
iterations.
Proof: Obvious.
Optimization Group 19/26
Optimization (cont )
-
8/2/2019 Ellipsoid 4
20/26
Optimization (cont.)
Unfortunately we must know R, and this is most unfortunate. Let us fix this.
min
cTx : x S (H) mincTy
: y
S, > 0 .Note that the problem (H) at the right is homogeneous: if (y, ) is feasible (or optimal)
then so is (y,) for any > 0. Taking = 1/ we get a feasible (or optimal) solution
of (P).
The objective function in (H) is not convex. However, the level sets are convex and this is
enough for the ellipsoid method.
As far as y concerns, the feasible region is the cone generated by the convex set S, without
the origin (since > 0). Hence the (convex!) feasible region H is not closed, but also thisdoes not hurt the ellipsoid method.
Finally, by adding the constraint
(y, ) 1
we lose nothing, and the whole feasible region lies in B((0, 0), 1)!
Optimization Group 20/26
Convexity of the homogeneous problem
-
8/2/2019 Ellipsoid 4
21/26
Convexity of the homogeneous problem
Proposition 1 The set
(y, ) :
y
S, > 0is convex.
Proof: Let (y1, 1), (y2, 2) belong to the set. For 0
1, let (y, ) = (y1, 1) +
(1 )(y2, 2).y
=
y1 + (1 )y21 + (1 )2
=1
y11
+ (1 )2 y221 + (1 )2
= y11
+ (1 )y22
S,
where = 11+(1)2 .
Proposition 2 The level sets of cTy are convex.
Proof: For any z R one hascTy
z cTy z 0.
Here we used > 0. This implies the result. Optimization Group 21/26
The Key Proposition
-
8/2/2019 Ellipsoid 4
22/26
The Key Proposition
We have seen that we can apply the ellipsoid method to the homogeneous problem (H), with
R = 1.
Bn denotes the unit sphere in Rn.
Proposition 3 Suppose the level set P contains a ball B(x, r). Let
HP :=
(y, ) :
y
P, > 0
.
Then
vol
Bn+1
vol
HP Bn+1 (n + 1)(n + 1) 1 + (r + x)
2n+12(n)rn
.
Theorem 12 Solving (P) via (H), starting at the unit ball Bn+1, the algorithm will findan -solution of (H) (and hence of (P)) after at most(n + 1)(n + 2) ln
1
r2+
1 +
xr
2+ 2(n + 2) ln
r
n
iterations.
Optimization Group 22/26
Graphical illustration
-
8/2/2019 Ellipsoid 4
23/26
G p c st t o
x
Bn+1
Optimization Group 23/26
Proof of Proposition 3
-
8/2/2019 Ellipsoid 4
24/26
p
HB(x, r) :=
(y, ) :y
B(x, r), > 0
.
The longest vector in the set
T :=
(y, ) :y
B(x, r), 1
HB(x, r)
is the vector
x1
+ r xx0 =
1 + r
x
x1 and has length
:= 1 + 1 +r
x2
x
2 = 1 + (r +
x
)
2.
Hence
1T Bn+1 HB(x, r) Bn+1 HP.Therefore,
vol (Bn+1 HP) vol
1T
= 1n+1
vol (T) = 1n+1
10
(n)nrnd
=(n)rn
(n + 1)n+1=
(n)rn
(n + 1) 1 + (r + x)2
n+1
2
.
Hence Proposition 3 follows. Optimization Group 24/26
-
8/2/2019 Ellipsoid 4
25/26
Proof of Theorem 12
-
8/2/2019 Ellipsoid 4
26/26
By Theorem 10 the algorithm needs at most
2(n + 2) ln
vol (Bn+1)
vol (HP Bn+1)iterations. Due to Proposition 3 this is
2(n + 2) ln
(n + 1)(n + 1)
1 + (r + x)2
n+1
2
(n)rn .
Since
(n + 1)(n + 1)
(n)=
(n + 1)n+1
2
n2
+ 1
n
2 n+1
2+ 1
n,
we obtain (omitting the brackets) the bound
2(n + 2) ln
n(1+(r+x)2)n+1
2
rn
= 2(n + 2) ln rn 1r
2 + 1 + xr2
n+1
2
= (n + 1)(n + 2) ln
1r2
+
1 +x
r
2+ 2(n + 2) ln
r
n
.
This completes the proof.
Optimization Group 26/26