beyond amls: domain decomposition with rational filteringxiaj/fastsolvers2018/ka... ·...

40
Beyond AMLS: Domain decomposition with rational filtering Vassilis Kalantzis (joint work with Yuanzhe Xi and Yousef Saad) IBM Research Conference on Fast Direct Solvers 11-10-2018 VK (IBM) Conference on Fast Direct Solvers 11-10-2018 1 / 22

Upload: others

Post on 10-Mar-2020

3 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Beyond AMLS: Domain decomposition with rational filteringxiaj/FastSolvers2018/ka... · 2018-11-10 · Beyond AMLS: Domain decomposition with rational ltering Vassilis Kalantzis (joint

Beyond AMLS: Domain decomposition with rational filtering

Vassilis Kalantzis (joint work with Yuanzhe Xi and Yousef Saad)

IBM Research

Conference on Fast Direct Solvers11-10-2018

VK (IBM) Conference on Fast Direct Solvers 11-10-2018 1 / 22

Page 2: Beyond AMLS: Domain decomposition with rational filteringxiaj/FastSolvers2018/ka... · 2018-11-10 · Beyond AMLS: Domain decomposition with rational ltering Vassilis Kalantzis (joint

Acknowledgments

The computational results featured in this talk were performed using resources of the University ofMinnesota Supercomputing Institute.

VK (IBM) Conference on Fast Direct Solvers 11-10-2018 2 / 22

Page 3: Beyond AMLS: Domain decomposition with rational filteringxiaj/FastSolvers2018/ka... · 2018-11-10 · Beyond AMLS: Domain decomposition with rational ltering Vassilis Kalantzis (joint

Introduction and preliminary discussion

Contents

1 Introduction and preliminary discussion

2 The domain decomposition (DD) framework

3 Combining domain decomposition with rational filtering

4 Numerical experiments

VK (IBM) Conference on Fast Direct Solvers 11-10-2018 3 / 22

Page 4: Beyond AMLS: Domain decomposition with rational filteringxiaj/FastSolvers2018/ka... · 2018-11-10 · Beyond AMLS: Domain decomposition with rational ltering Vassilis Kalantzis (joint

Introduction and preliminary discussion

The algebraic generalized eigenvalue problem

The symmetric generalized eigenvalue problem is formally defined as

Ax = λMx .

Matrices A and M are assumed sparse and symmetric, while M is also SPD.

The pencil (A,M) has n eigenpairs which we will denote by(λi , x

(i)), i = 1, . . . , n.

We are only interested in computing those eigenpairs(λi , x

(i))

for which λi ∈ [α, β].

We will denote the number of eigenvalues which satisfy the above property by ’nev ’.

0.5 1 1.5 2 2.5 3 3.5

Location of the eigenvalues

[α,β]=[1,2.5], nev = 7

VK (IBM) Conference on Fast Direct Solvers 11-10-2018 4 / 22

Page 5: Beyond AMLS: Domain decomposition with rational filteringxiaj/FastSolvers2018/ka... · 2018-11-10 · Beyond AMLS: Domain decomposition with rational ltering Vassilis Kalantzis (joint

The domain decomposition (DD) framework

Contents

1 Introduction and preliminary discussion

2 The domain decomposition (DD) framework

3 Combining domain decomposition with rational filtering

4 Numerical experiments

VK (IBM) Conference on Fast Direct Solvers 11-10-2018 5 / 22

Page 6: Beyond AMLS: Domain decomposition with rational filteringxiaj/FastSolvers2018/ka... · 2018-11-10 · Beyond AMLS: Domain decomposition with rational ltering Vassilis Kalantzis (joint

The domain decomposition (DD) framework

Reordering equations/unknowns (p ≥ 2 subdomains)

A =

B1 E1

B2 E2

. . ....

Bp Ep

ET1 ET

2 · · · ETp C

,

M =

M(1)B M

(1)E

M(2)B M

(2)E

. . ....

M(p)B M

(p)E(

M(1)E

)T (M

(2)E

)T. . .

(M

(p)E

)TMC

.

VK (IBM) Conference on Fast Direct Solvers 11-10-2018 6 / 22

Page 7: Beyond AMLS: Domain decomposition with rational filteringxiaj/FastSolvers2018/ka... · 2018-11-10 · Beyond AMLS: Domain decomposition with rational ltering Vassilis Kalantzis (joint

The domain decomposition (DD) framework

Reordering equations/unknowns (p ≥ 2 subdomains)

A =

B1 E1

B2 E2

. . ....

Bp Ep

ET1 ET

2 · · · ETp C

,

M =

M(1)B M

(1)E

M(2)B M

(2)E

. . ....

M(p)B M

(p)E(

M(1)E

)T (M

(2)E

)T. . .

(M

(p)E

)TMC

.

Notation: write as

A =

(B E

ET C

),M =

(MB ME

MTE MC

),

x (i) =

(u(i)

y (i)

)=

u(i)1

...

u(i)p

y(i)1

...

y(i)p

.

VK (IBM) Conference on Fast Direct Solvers 11-10-2018 7 / 22

Page 8: Beyond AMLS: Domain decomposition with rational filteringxiaj/FastSolvers2018/ka... · 2018-11-10 · Beyond AMLS: Domain decomposition with rational ltering Vassilis Kalantzis (joint

The domain decomposition (DD) framework

An example of the sparsity pattern of A and M for p = 4

A =

(B E

ET C

)=

B1 E1

B2 E2

. . ....

Bp Ep

ET1 ET

2 · · · ETp C

Sparsity pattern of matrix |A|+ |M |

VK (IBM) Conference on Fast Direct Solvers 11-10-2018 8 / 22

Page 9: Beyond AMLS: Domain decomposition with rational filteringxiaj/FastSolvers2018/ka... · 2018-11-10 · Beyond AMLS: Domain decomposition with rational ltering Vassilis Kalantzis (joint

The domain decomposition (DD) framework

Invariant subspaces from a Schur complement viewpoint

(A− λiM)x (i) =

(B − λiMB E − λiME

ET − λiMTE C − λiMC

)(u(i)

y (i)

)= 0.

Eliminating u(i) from the first block of rows gives:C − λiMC − (E − λiME )T (B − λiMB)−1(E − λiME )︸ ︷︷ ︸block-diagonal

y (i) = 0,

u(i) = −(B − λiMB)−1(E − λiME )︸ ︷︷ ︸block-diagonal

y (i).

To compute the eigenpairs(λi , x (i)

)i=1,...,nev

Perform a Rayleigh-Ritz projection onto Z = U ⊕ Y:

Y = span{y (i)}i=1,...,nev

,

U = span{−(B − λiMB)−1(E − λiME )y (i)

}i=1,...,nev

.

VK (IBM) Conference on Fast Direct Solvers 11-10-2018 9 / 22

Page 10: Beyond AMLS: Domain decomposition with rational filteringxiaj/FastSolvers2018/ka... · 2018-11-10 · Beyond AMLS: Domain decomposition with rational ltering Vassilis Kalantzis (joint

The domain decomposition (DD) framework

Invariant subspaces from a Schur complement viewpoint

(A− λiM)x (i) =

(B − λiMB E − λiME

ET − λiMTE C − λiMC

)(u(i)

y (i)

)= 0.

Eliminating u(i) from the first block of rows gives:

C − λiMC − (E − λiME )T (B − λiMB)−1(E − λiME )︸ ︷︷ ︸block-diagonal

y (i) = 0,

u(i) = −(B − λiMB)−1(E − λiME )︸ ︷︷ ︸block-diagonal

y (i).

To compute the eigenpairs(λi , x (i)

)i=1,...,nev

Perform a Rayleigh-Ritz projection onto Z = U ⊕ Y:

Y = span{y (i)}i=1,...,nev

,

U = span{−(B − λiMB)−1(E − λiME )y (i)

}i=1,...,nev

.

VK (IBM) Conference on Fast Direct Solvers 11-10-2018 9 / 22

Page 11: Beyond AMLS: Domain decomposition with rational filteringxiaj/FastSolvers2018/ka... · 2018-11-10 · Beyond AMLS: Domain decomposition with rational ltering Vassilis Kalantzis (joint

The domain decomposition (DD) framework

Invariant subspaces from a Schur complement viewpoint

(A− λiM)x (i) =

(B − λiMB E − λiME

ET − λiMTE C − λiMC

)(u(i)

y (i)

)= 0.

Eliminating u(i) from the first block of rows gives:C − λiMC − (E − λiME )T (B − λiMB)−1(E − λiME )︸ ︷︷ ︸block-diagonal

y (i) = 0,

u(i) = −(B − λiMB)−1(E − λiME )︸ ︷︷ ︸block-diagonal

y (i).

To compute the eigenpairs(λi , x (i)

)i=1,...,nev

Perform a Rayleigh-Ritz projection onto Z = U ⊕ Y:

Y = span{y (i)}i=1,...,nev

,

U = span{−(B − λiMB)−1(E − λiME )y (i)

}i=1,...,nev

.

VK (IBM) Conference on Fast Direct Solvers 11-10-2018 9 / 22

Page 12: Beyond AMLS: Domain decomposition with rational filteringxiaj/FastSolvers2018/ka... · 2018-11-10 · Beyond AMLS: Domain decomposition with rational ltering Vassilis Kalantzis (joint

The domain decomposition (DD) framework

Invariant subspaces from a Schur complement viewpoint

(A− λiM)x (i) =

(B − λiMB E − λiME

ET − λiMTE C − λiMC

)(u(i)

y (i)

)= 0.

Eliminating u(i) from the first block of rows gives:C − λiMC − (E − λiME )T (B − λiMB)−1(E − λiME )︸ ︷︷ ︸block-diagonal

y (i) = 0,

u(i) = −(B − λiMB)−1(E − λiME )︸ ︷︷ ︸block-diagonal

y (i).

To compute the eigenpairs(λi , x (i)

)i=1,...,nev

Perform a Rayleigh-Ritz projection onto Z = U ⊕ Y:

Y = span{y (i)}i=1,...,nev

,

U = span{−(B − λiMB)−1(E − λiME )y (i)

}i=1,...,nev

.

VK (IBM) Conference on Fast Direct Solvers 11-10-2018 9 / 22

Page 13: Beyond AMLS: Domain decomposition with rational filteringxiaj/FastSolvers2018/ka... · 2018-11-10 · Beyond AMLS: Domain decomposition with rational ltering Vassilis Kalantzis (joint

The domain decomposition (DD) framework

Invariant subspaces from a Schur complement viewpoint

(A− λiM)x (i) =

(B − λiMB E − λiME

ET − λiMTE C − λiMC

)(u(i)

y (i)

)= 0.

Eliminating u(i) from the first block of rows gives:C − λiMC − (E − λiME )T (B − λiMB)−1(E − λiME )︸ ︷︷ ︸block-diagonal

y (i) = 0,

u(i) = −(B − λiMB)−1(E − λiME )︸ ︷︷ ︸block-diagonal

y (i).

To compute the eigenpairs(λi , x (i)

)i=1,...,nev

Perform a Rayleigh-Ritz projection onto Z = U ⊕ Y:

Y = span{y (i)}i=1,...,nev

,

U = span{−(B − λiMB)−1(E − λiME )y (i)

}i=1,...,nev

.

VK (IBM) Conference on Fast Direct Solvers 11-10-2018 9 / 22

Page 14: Beyond AMLS: Domain decomposition with rational filteringxiaj/FastSolvers2018/ka... · 2018-11-10 · Beyond AMLS: Domain decomposition with rational ltering Vassilis Kalantzis (joint

Combining domain decomposition with rational filtering

Contents

1 Introduction and preliminary discussion

2 The domain decomposition (DD) framework

3 Combining domain decomposition with rational filtering

4 Numerical experiments

VK (IBM) Conference on Fast Direct Solvers 11-10-2018 10 / 22

Page 15: Beyond AMLS: Domain decomposition with rational filteringxiaj/FastSolvers2018/ka... · 2018-11-10 · Beyond AMLS: Domain decomposition with rational ltering Vassilis Kalantzis (joint

Combining domain decomposition with rational filtering

Rational filtering

We consider the following rational filter

ρ(ζ) =

2Nc∑`=1

ω`

ζ − ζ`≈

1

2πi

∫Γ[α,β]

1

ν − ζdν︸ ︷︷ ︸

I[α,β](ζ)

= 2<e

{Nc∑`=1

ω`

ζ − ζ`

}.

-2 0 2

ζ

10-10

10-5

100

|ρ(ζ)|

Midpoint

Nc = 4Nc = 8Nc = 12Nc = 16

It is possible to apply ρ(.) to (A,M):

ρ(M−1A) = 2<e

{Nc∑`=1

ω`(A− ζ`M)−1M

}.

Examples: FEAST (Subspace Iteration),Sakurai-Sugiura (Moments-based).Krylov projection schemes are also possible(RF-KRYLOV).

Our idea: Decouple application of ρ(ζ)to interior/interface variables.

Potential advantages:1 Reduced use of complex arithmetic.2 Orthonormalization of shorter vectors

(interface variables).3 Faster convergence.

VK (IBM) Conference on Fast Direct Solvers 11-10-2018 11 / 22

Page 16: Beyond AMLS: Domain decomposition with rational filteringxiaj/FastSolvers2018/ka... · 2018-11-10 · Beyond AMLS: Domain decomposition with rational ltering Vassilis Kalantzis (joint

Combining domain decomposition with rational filtering

Rational filtering

We consider the following rational filter

ρ(ζ) =

2Nc∑`=1

ω`

ζ − ζ`≈

1

2πi

∫Γ[α,β]

1

ν − ζdν︸ ︷︷ ︸

I[α,β](ζ)

= 2<e

{Nc∑`=1

ω`

ζ − ζ`

}.

-2 0 2

ζ

10-10

10-5

100

|ρ(ζ)|

Midpoint

Nc = 4Nc = 8Nc = 12Nc = 16

It is possible to apply ρ(.) to (A,M):

ρ(M−1A) = 2<e

{Nc∑`=1

ω`(A− ζ`M)−1M

}.

Examples: FEAST (Subspace Iteration),Sakurai-Sugiura (Moments-based).Krylov projection schemes are also possible(RF-KRYLOV).

Our idea: Decouple application of ρ(ζ)to interior/interface variables.

Potential advantages:1 Reduced use of complex arithmetic.2 Orthonormalization of shorter vectors

(interface variables).3 Faster convergence.

VK (IBM) Conference on Fast Direct Solvers 11-10-2018 11 / 22

Page 17: Beyond AMLS: Domain decomposition with rational filteringxiaj/FastSolvers2018/ka... · 2018-11-10 · Beyond AMLS: Domain decomposition with rational ltering Vassilis Kalantzis (joint

Combining domain decomposition with rational filtering

Rational filtering

We consider the following rational filter

ρ(ζ) =

2Nc∑`=1

ω`

ζ − ζ`≈

1

2πi

∫Γ[α,β]

1

ν − ζdν︸ ︷︷ ︸

I[α,β](ζ)

= 2<e

{Nc∑`=1

ω`

ζ − ζ`

}.

-2 0 2

ζ

10-10

10-5

100

|ρ(ζ)|

Midpoint

Nc = 4Nc = 8Nc = 12Nc = 16

It is possible to apply ρ(.) to (A,M):

ρ(M−1A) = 2<e

{Nc∑`=1

ω`(A− ζ`M)−1M

}.

Examples: FEAST (Subspace Iteration),Sakurai-Sugiura (Moments-based).Krylov projection schemes are also possible(RF-KRYLOV).

Our idea: Decouple application of ρ(ζ)to interior/interface variables.

Potential advantages:1 Reduced use of complex arithmetic.2 Orthonormalization of shorter vectors

(interface variables).3 Faster convergence.

VK (IBM) Conference on Fast Direct Solvers 11-10-2018 11 / 22

Page 18: Beyond AMLS: Domain decomposition with rational filteringxiaj/FastSolvers2018/ka... · 2018-11-10 · Beyond AMLS: Domain decomposition with rational ltering Vassilis Kalantzis (joint

Combining domain decomposition with rational filtering

Rational filtering

We consider the following rational filter

ρ(ζ) =

2Nc∑`=1

ω`

ζ − ζ`≈

1

2πi

∫Γ[α,β]

1

ν − ζdν︸ ︷︷ ︸

I[α,β](ζ)

= 2<e

{Nc∑`=1

ω`

ζ − ζ`

}.

-2 0 2

ζ

10-10

10-5

100

|ρ(ζ)|

Midpoint

Nc = 4Nc = 8Nc = 12Nc = 16

It is possible to apply ρ(.) to (A,M):

ρ(M−1A) = 2<e

{Nc∑`=1

ω`(A− ζ`M)−1M

}.

Examples: FEAST (Subspace Iteration),Sakurai-Sugiura (Moments-based).Krylov projection schemes are also possible(RF-KRYLOV).

Our idea: Decouple application of ρ(ζ)to interior/interface variables.

Potential advantages:1 Reduced use of complex arithmetic.2 Orthonormalization of shorter vectors

(interface variables).3 Faster convergence.

VK (IBM) Conference on Fast Direct Solvers 11-10-2018 11 / 22

Page 19: Beyond AMLS: Domain decomposition with rational filteringxiaj/FastSolvers2018/ka... · 2018-11-10 · Beyond AMLS: Domain decomposition with rational ltering Vassilis Kalantzis (joint

Combining domain decomposition with rational filtering

Rational filtering

We consider the following rational filter

ρ(ζ) =

2Nc∑`=1

ω`

ζ − ζ`≈

1

2πi

∫Γ[α,β]

1

ν − ζdν︸ ︷︷ ︸

I[α,β](ζ)

= 2<e

{Nc∑`=1

ω`

ζ − ζ`

}.

-2 0 2

ζ

10-10

10-5

100

|ρ(ζ)|

Midpoint

Nc = 4Nc = 8Nc = 12Nc = 16

It is possible to apply ρ(.) to (A,M):

ρ(M−1A) = 2<e

{Nc∑`=1

ω`(A− ζ`M)−1M

}.

Examples: FEAST (Subspace Iteration),Sakurai-Sugiura (Moments-based).Krylov projection schemes are also possible(RF-KRYLOV).

Our idea: Decouple application of ρ(ζ)to interior/interface variables.

Potential advantages:1 Reduced use of complex arithmetic.2 Orthonormalization of shorter vectors

(interface variables).3 Faster convergence.

VK (IBM) Conference on Fast Direct Solvers 11-10-2018 11 / 22

Page 20: Beyond AMLS: Domain decomposition with rational filteringxiaj/FastSolvers2018/ka... · 2018-11-10 · Beyond AMLS: Domain decomposition with rational ltering Vassilis Kalantzis (joint

Combining domain decomposition with rational filtering

Summary of the proposed technique

Our goal is to construct a subspace Z = U ⊕ Y to perform a Rayleigh-Ritz projection onto.

Recall that, ideally,

Y = span{y (i)}i=1,...,nev

,

U = span{−(B − λiMB)−1(E − λiME )y (i)

}i=1,...,nev

.

The technique proposed in this talk:

1 Constructs Y by applying the rational filter ρ(ζ) to the interface region (Schur complementmatrices).

2 Uses the above subspace to construct U . This step is performed in real arithmetic and isembarrassingly parallel.

VK (IBM) Conference on Fast Direct Solvers 11-10-2018 12 / 22

Page 21: Beyond AMLS: Domain decomposition with rational filteringxiaj/FastSolvers2018/ka... · 2018-11-10 · Beyond AMLS: Domain decomposition with rational ltering Vassilis Kalantzis (joint

Combining domain decomposition with rational filtering

Summary of the proposed technique

Our goal is to construct a subspace Z = U ⊕ Y to perform a Rayleigh-Ritz projection onto.

Recall that, ideally,

Y = span{y (i)}i=1,...,nev

,

U = span{−(B − λiMB)−1(E − λiME )y (i)

}i=1,...,nev

.

The technique proposed in this talk:

1 Constructs Y by applying the rational filter ρ(ζ) to the interface region (Schur complementmatrices).

2 Uses the above subspace to construct U . This step is performed in real arithmetic and isembarrassingly parallel.

VK (IBM) Conference on Fast Direct Solvers 11-10-2018 12 / 22

Page 22: Beyond AMLS: Domain decomposition with rational filteringxiaj/FastSolvers2018/ka... · 2018-11-10 · Beyond AMLS: Domain decomposition with rational ltering Vassilis Kalantzis (joint

Combining domain decomposition with rational filtering

How to approximate span{y (1), . . . , y (nev)

}(I)

Let ζ ∈ C and define

Bζ = B − ζMB , Eζ = E − ζME , Cζ = C − ζMC ,

S(ζ) = Cζ − ETζ B−1

ζ Eζ .

Then,

(A− ζM)−1 =

(B−1ζ + B−1

ζ EζS(ζ)−1ETζ B−1

ζ −B−1ζ EζS(ζ)−1

−S(ζ)−1ETζ B−1

ζ S(ζ)−1

).

The matrix inverse (A− ζM)−1 can be also written as:

(A− ζM)−1 =n∑

i=1

1

λi − ζx (i)

(x (i))T

=n∑

i=1

1

λi − ζ

u(i)(u(i))T

u(i)(y (i))T

y (i)(u(i))T

y (i)(y (i))T .

VK (IBM) Conference on Fast Direct Solvers 11-10-2018 13 / 22

Page 23: Beyond AMLS: Domain decomposition with rational filteringxiaj/FastSolvers2018/ka... · 2018-11-10 · Beyond AMLS: Domain decomposition with rational ltering Vassilis Kalantzis (joint

Combining domain decomposition with rational filtering

How to approximate span{y (1), . . . , y (nev)

}(I)

Let ζ ∈ C and define

Bζ = B − ζMB , Eζ = E − ζME , Cζ = C − ζMC ,

S(ζ) = Cζ − ETζ B−1

ζ Eζ .

Then,

(A− ζM)−1 =

(B−1ζ + B−1

ζ EζS(ζ)−1ETζ B−1

ζ −B−1ζ EζS(ζ)−1

−S(ζ)−1ETζ B−1

ζ S(ζ)−1

).

The matrix inverse (A− ζM)−1 can be also written as:

(A− ζM)−1 =n∑

i=1

1

λi − ζx (i)

(x (i))T

=n∑

i=1

1

λi − ζ

u(i)(u(i))T

u(i)(y (i))T

y (i)(u(i))T

y (i)(y (i))T .

VK (IBM) Conference on Fast Direct Solvers 11-10-2018 13 / 22

Page 24: Beyond AMLS: Domain decomposition with rational filteringxiaj/FastSolvers2018/ka... · 2018-11-10 · Beyond AMLS: Domain decomposition with rational ltering Vassilis Kalantzis (joint

Combining domain decomposition with rational filtering

How to approximate span{y (1), . . . , y (nev)

}(I)

Let ζ ∈ C and define

Bζ = B − ζMB , Eζ = E − ζME , Cζ = C − ζMC ,

S(ζ) = Cζ − ETζ B−1

ζ Eζ .

Then,

(A− ζM)−1 =

(B−1ζ + B−1

ζ EζS(ζ)−1ETζ B−1

ζ −B−1ζ EζS(ζ)−1

−S(ζ)−1ETζ B−1

ζ S(ζ)−1

).

The matrix inverse (A− ζM)−1 can be also written as:

(A− ζM)−1 =n∑

i=1

1

λi − ζx (i)

(x (i))T

=n∑

i=1

1

λi − ζ

u(i)(u(i))T

u(i)(y (i))T

y (i)(u(i))T

y (i)(y (i))T .

VK (IBM) Conference on Fast Direct Solvers 11-10-2018 13 / 22

Page 25: Beyond AMLS: Domain decomposition with rational filteringxiaj/FastSolvers2018/ka... · 2018-11-10 · Beyond AMLS: Domain decomposition with rational ltering Vassilis Kalantzis (joint

Combining domain decomposition with rational filtering

How to approximate span{y (1), . . . , y (nev)

}(II)

Recall that

ρ(M−1A) = 2<e

{Nc∑`=1

ω`(A− ζ`M)−1M

}.

Combining alltogether we get:

ρ(M−1A) = 2<e

Nc∑`=1

ω`

B−1ζ`

+ B−1ζ`

Eζ`S(ζ`)−1ET

ζ`B−1ζ`

−B−1ζ`

Eζ`S(ζ`)−1

−S(ζ`)−1ET

ζ`B−1ζ`

S(ζ`)−1

M

=n∑

i=1

ρ(λi)

u(i)(u(i))T

u(i)(y (i))T

y (i)(u(i))T

y(i)(y (i))T

M.

(ρ(λi) = 2<e

{Nc∑`=1

ω`λi − ζ`

})

VK (IBM) Conference on Fast Direct Solvers 11-10-2018 14 / 22

Page 26: Beyond AMLS: Domain decomposition with rational filteringxiaj/FastSolvers2018/ka... · 2018-11-10 · Beyond AMLS: Domain decomposition with rational ltering Vassilis Kalantzis (joint

Combining domain decomposition with rational filtering

How to approximate span{y (1), . . . , y (nev)

}(II)

Recall that

ρ(M−1A) = 2<e

{Nc∑`=1

ω`(A− ζ`M)−1M

}.

Combining alltogether we get:

ρ(M−1A) = 2<e

Nc∑`=1

ω`

B−1ζ`

+ B−1ζ`

Eζ`S(ζ`)−1ET

ζ`B−1ζ`

−B−1ζ`

Eζ`S(ζ`)−1

−S(ζ`)−1ET

ζ`B−1ζ`

S(ζ`)−1

M

=n∑

i=1

ρ(λi)

u(i)(u(i))T

u(i)(y (i))T

y (i)(u(i))T

y(i)(y (i))T

M.

(ρ(λi) = 2<e

{Nc∑`=1

ω`λi − ζ`

})

VK (IBM) Conference on Fast Direct Solvers 11-10-2018 14 / 22

Page 27: Beyond AMLS: Domain decomposition with rational filteringxiaj/FastSolvers2018/ka... · 2018-11-10 · Beyond AMLS: Domain decomposition with rational ltering Vassilis Kalantzis (joint

Combining domain decomposition with rational filtering

How to approximate span{y (1), . . . , y (nev)

}(III)

Equating blocks leads to:

2<e

{Nc∑`=1

ω`S(ζ`)−1

}=

n∑i=1

ρ(λi)y(i)(y (i))T

.

Since ρ(λ1), . . . , ρ(λnev ) 6= 0:

span{y (1), . . . , y (nev)

}⊆ range

(2<e

{Nc∑`=1

ω`S(ζ`)−1

}).

Capture range(<e{∑Nc

`=1 ω`S(ζ`)−1})

by a Krylov projection scheme.

VK (IBM) Conference on Fast Direct Solvers 11-10-2018 15 / 22

Page 28: Beyond AMLS: Domain decomposition with rational filteringxiaj/FastSolvers2018/ka... · 2018-11-10 · Beyond AMLS: Domain decomposition with rational ltering Vassilis Kalantzis (joint

Combining domain decomposition with rational filtering

How to approximate span{y (1), . . . , y (nev)

}(III)

Equating blocks leads to:

2<e

{Nc∑`=1

ω`S(ζ`)−1

}=

n∑i=1

ρ(λi)y(i)(y (i))T

.

Since ρ(λ1), . . . , ρ(λnev ) 6= 0:

span{y (1), . . . , y (nev)

}⊆ range

(2<e

{Nc∑`=1

ω`S(ζ`)−1

}).

Capture range(<e{∑Nc

`=1 ω`S(ζ`)−1})

by a Krylov projection scheme.

VK (IBM) Conference on Fast Direct Solvers 11-10-2018 15 / 22

Page 29: Beyond AMLS: Domain decomposition with rational filteringxiaj/FastSolvers2018/ka... · 2018-11-10 · Beyond AMLS: Domain decomposition with rational ltering Vassilis Kalantzis (joint

Combining domain decomposition with rational filtering

How to approximate span{y (1), . . . , y (nev)

}(IV)

0 50 100 150

Singular values index

10-15

10-10

10-5

100

Magnitude

bcsst24

Nc=4

Nc=8

Nc=12

Nc=16

0 20 40 60 80 100

Singular values index

10-20

10-15

10-10

10-5

100

Ma

gn

itu

de

FDmesh1

Nc=4

Nc=8

Nc=12

Nc=16

Figure: Leading singular values of 2<e{∑Nc

`=1 ω`S(ζ`)−1}

=∑n

i=1 ρ(λi )y (i)(y (i))T, ([α, β] = [λ1, λ100]).

What if rank([

y (1), . . . , y (nev)])

< nev?

VK (IBM) Conference on Fast Direct Solvers 11-10-2018 16 / 22

Page 30: Beyond AMLS: Domain decomposition with rational filteringxiaj/FastSolvers2018/ka... · 2018-11-10 · Beyond AMLS: Domain decomposition with rational ltering Vassilis Kalantzis (joint

Combining domain decomposition with rational filtering

How to approximate span{y (1), . . . , y (nev)

}(IV)

0 50 100 150

Singular values index

10-15

10-10

10-5

100

Magnitude

bcsst24

Nc=4

Nc=8

Nc=12

Nc=16

0 20 40 60 80 100

Singular values index

10-20

10-15

10-10

10-5

100

Ma

gn

itu

de

FDmesh1

Nc=4

Nc=8

Nc=12

Nc=16

Figure: Leading singular values of 2<e{∑Nc

`=1 ω`S(ζ`)−1}

=∑n

i=1 ρ(λi )y (i)(y (i))T, ([α, β] = [λ1, λ100]).

What if rank([

y (1), . . . , y (nev)])

< nev?

VK (IBM) Conference on Fast Direct Solvers 11-10-2018 16 / 22

Page 31: Beyond AMLS: Domain decomposition with rational filteringxiaj/FastSolvers2018/ka... · 2018-11-10 · Beyond AMLS: Domain decomposition with rational ltering Vassilis Kalantzis (joint

Combining domain decomposition with rational filtering

Finalizing the proposed scheme (RF-DDES)

Ideally, U ={u(1), . . . , u(nev)

}, where

u(i) = −B−1λi

Eλiy (i)

= −(B−1λi

Eσ + (λi − σ)B−1λi

ME

)y (i).

Set

B−1λi≈ B−1

σ

ψ−1∑k=0

(λi − σ)MBB−1σ .

We finally set U = span([V ,U1,U2]) where

U1 = −[B−1σ EσY , . . . , (B

−1σ MB)ψ−1B−1

σ EσY],

U2 =[B−1σ MEY , . . . , (B

−1σ MB)ψ−1B−1

σ MEY],

V includes the eigenvectors associated with the nevBp smallest eigenvalues of (Bσ,MB).∥∥∥u(i) − u(i)∥∥∥MB

≤ max`≥(nevBp)+1 O

((λi − σ)ψ+1

(δ` − λi)(δ` − σ)ψ

).

VK (IBM) Conference on Fast Direct Solvers 11-10-2018 17 / 22

Page 32: Beyond AMLS: Domain decomposition with rational filteringxiaj/FastSolvers2018/ka... · 2018-11-10 · Beyond AMLS: Domain decomposition with rational ltering Vassilis Kalantzis (joint

Combining domain decomposition with rational filtering

Finalizing the proposed scheme (RF-DDES)

Ideally, U ={u(1), . . . , u(nev)

}, where

u(i) = −B−1λi

Eλiy (i)

= −(B−1λi

Eσ + (λi − σ)B−1λi

ME

)y (i).

Set

B−1λi≈ B−1

σ

ψ−1∑k=0

(λi − σ)MBB−1σ .

We finally set U = span([V ,U1,U2]) where

U1 = −[B−1σ EσY , . . . , (B

−1σ MB)ψ−1B−1

σ EσY],

U2 =[B−1σ MEY , . . . , (B

−1σ MB)ψ−1B−1

σ MEY],

V includes the eigenvectors associated with the nevBp smallest eigenvalues of (Bσ,MB).∥∥∥u(i) − u(i)∥∥∥MB

≤ max`≥(nevBp)+1 O

((λi − σ)ψ+1

(δ` − λi)(δ` − σ)ψ

).

VK (IBM) Conference on Fast Direct Solvers 11-10-2018 17 / 22

Page 33: Beyond AMLS: Domain decomposition with rational filteringxiaj/FastSolvers2018/ka... · 2018-11-10 · Beyond AMLS: Domain decomposition with rational ltering Vassilis Kalantzis (joint

Combining domain decomposition with rational filtering

Finalizing the proposed scheme (RF-DDES)

Ideally, U ={u(1), . . . , u(nev)

}, where

u(i) = −B−1λi

Eλiy (i)

= −(B−1λi

Eσ + (λi − σ)B−1λi

ME

)y (i).

Set

B−1λi≈ B−1

σ

ψ−1∑k=0

(λi − σ)MBB−1σ .

We finally set U = span([V ,U1,U2]) where

U1 = −[B−1σ EσY , . . . , (B

−1σ MB)ψ−1B−1

σ EσY],

U2 =[B−1σ MEY , . . . , (B

−1σ MB)ψ−1B−1

σ MEY],

V includes the eigenvectors associated with the nevBp smallest eigenvalues of (Bσ,MB).

∥∥∥u(i) − u(i)∥∥∥MB

≤ max`≥(nevBp)+1 O

((λi − σ)ψ+1

(δ` − λi)(δ` − σ)ψ

).

VK (IBM) Conference on Fast Direct Solvers 11-10-2018 17 / 22

Page 34: Beyond AMLS: Domain decomposition with rational filteringxiaj/FastSolvers2018/ka... · 2018-11-10 · Beyond AMLS: Domain decomposition with rational ltering Vassilis Kalantzis (joint

Combining domain decomposition with rational filtering

Finalizing the proposed scheme (RF-DDES)

Ideally, U ={u(1), . . . , u(nev)

}, where

u(i) = −B−1λi

Eλiy (i)

= −(B−1λi

Eσ + (λi − σ)B−1λi

ME

)y (i).

Set

B−1λi≈ B−1

σ

ψ−1∑k=0

(λi − σ)MBB−1σ .

We finally set U = span([V ,U1,U2]) where

U1 = −[B−1σ EσY , . . . , (B

−1σ MB)ψ−1B−1

σ EσY],

U2 =[B−1σ MEY , . . . , (B

−1σ MB)ψ−1B−1

σ MEY],

V includes the eigenvectors associated with the nevBp smallest eigenvalues of (Bσ,MB).∥∥∥u(i) − u(i)∥∥∥MB

≤ max`≥(nevBp)+1 O

((λi − σ)ψ+1

(δ` − λi)(δ` − σ)ψ

).

VK (IBM) Conference on Fast Direct Solvers 11-10-2018 17 / 22

Page 35: Beyond AMLS: Domain decomposition with rational filteringxiaj/FastSolvers2018/ka... · 2018-11-10 · Beyond AMLS: Domain decomposition with rational ltering Vassilis Kalantzis (joint

Numerical experiments

Contents

1 Introduction and preliminary discussion

2 The domain decomposition (DD) framework

3 Combining domain decomposition with rational filtering

4 Numerical experiments

VK (IBM) Conference on Fast Direct Solvers 11-10-2018 18 / 22

Page 36: Beyond AMLS: Domain decomposition with rational filteringxiaj/FastSolvers2018/ka... · 2018-11-10 · Beyond AMLS: Domain decomposition with rational ltering Vassilis Kalantzis (joint

Numerical experiments

Approximation of the nev = 100 algebraically smallest eigenvalues ofpencil qa8fk/qa8fm

0 50 100

Eigenvalue index

10 -14

10 -12

10 -10

10 -8

10 -6

10 -4

10 -2

10 0

Rela

tive e

rror

nevB

= 50

RF-DDES(1)

RF-DDES(2)

RF-DDES(3)

0 50 100

Eigenvalue index

10 -14

10 -12

10 -10

10 -8

10 -6

10 -4

10 -2

10 0

Rela

tive e

rror

nevB

=100

RF-DDES(1)

RF-DDES(2)

RF-DDES(3)

0 50 100

Eigenvalue index

10 -15

10 -10

10 -5

10 0

Rela

tive e

rror

nevB

=200

RF-DDES(1)

RF-DDES(2)

RF-DDES(3)

VK (IBM) Conference on Fast Direct Solvers 11-10-2018 19 / 22

Page 37: Beyond AMLS: Domain decomposition with rational filteringxiaj/FastSolvers2018/ka... · 2018-11-10 · Beyond AMLS: Domain decomposition with rational ltering Vassilis Kalantzis (joint

Numerical experiments

A comparison of RF-KRYLOV and RF-DDES (I)

Table: Wall-clock times of RF-KRYLOV and RF-DDES using τ = 2, 4, 8, 16 and τ = 32 computational cores.RFD(2) and RFD(4) denote RF-DDES with p = 2 and p = 4 subdomains, respectively.

nev = 100 nev = 200 nev = 300Matrix RFK RFD(2) RFD(4) RFK RFD(2) RFD(4) RFK RFD(2) RFD(4)shipsec8(τ = 2) 114 195 - 195 207 - 279 213 -

(τ = 4) 76 129 93 123 133 103 168 139 107(τ = 8) 65 74 56 90 75 62 127 79 68(τ = 16) 40 51 36 66 55 41 92 57 45(τ = 32) 40 36 28 62 41 30 75 43 34

boneS01(τ = 2) 94 292 - 194 356 - 260 424 -(τ = 4) 68 182 162 131 230 213 179 277 260(τ = 8) 49 115 113 94 148 152 121 180 187(τ = 16) 44 86 82 80 112 109 93 137 132(τ = 32) 51 66 60 74 86 71 89 105 79

VK (IBM) Conference on Fast Direct Solvers 11-10-2018 20 / 22

Page 38: Beyond AMLS: Domain decomposition with rational filteringxiaj/FastSolvers2018/ka... · 2018-11-10 · Beyond AMLS: Domain decomposition with rational ltering Vassilis Kalantzis (joint

Numerical experiments

A comparison of RF-KRYLOV and RF-DDES (II)

Table: Wall-clock times of RF-KRYLOV and RF-DDES using τ = 2, 4, 8, 16 and τ = 32 computational cores.RFD(2) and RFD(4) denote RF-DDES with p = 2 and p = 4 subdomains, respectively.

nev = 100 nev = 200 nev = 300Matrix RFK RFD(2) RFD(4) RFK RFD(2) RFD(4) RFK RFD(2) RFD(4)FDmesh2(τ = 2) 241 85 - 480 99 - 731 116 -

(τ = 4) 159 34 63 305 37 78 473 43 85(τ = 8) 126 22 23 228 24 27 358 27 31(τ = 16) 89 16 15 171 17 18 256 20 21(τ = 32) 51 12 12 94 13 14 138 15 20

FDmesh3(τ = 2) 1021 446 - 2062 502 - 3328 564 -(τ = 4) 718 201 281 1281 217 338 1844 237 362(τ = 8) 423 119 111 825 132 126 1250 143 141(τ = 16) 355 70 66 684 77 81 1038 88 93(τ = 32) 177 47 49 343 51 58 706 62 82

VK (IBM) Conference on Fast Direct Solvers 11-10-2018 21 / 22

Page 39: Beyond AMLS: Domain decomposition with rational filteringxiaj/FastSolvers2018/ka... · 2018-11-10 · Beyond AMLS: Domain decomposition with rational ltering Vassilis Kalantzis (joint

Numerical experiments

Amount of time spent on orthonormalization

100

101

102

# of MPI processes

10-1

100

101

102

Tim

e (

s)

FDmesh2

RF-KRYLOV, nev=100

RF-KRYLOV, nev=200

RF-KRYLOV, nev=300

RF-DDES, max

100

101

102

# of MPI processes

10-1

100

101

102

103

Tim

e (

s)

FDmesh3

RF-KRYLOV, nev=100

RF-KRYLOV, nev=200

RF-KRYLOV, nev=300

RF-DDES, max

Figure: Left: “FDmesh2” (n = 250, 000). Right: “FDmesh3” (n = 1, 000, 000).

VK (IBM) Conference on Fast Direct Solvers 11-10-2018 22 / 22

Page 40: Beyond AMLS: Domain decomposition with rational filteringxiaj/FastSolvers2018/ka... · 2018-11-10 · Beyond AMLS: Domain decomposition with rational ltering Vassilis Kalantzis (joint

Numerical experiments

Thank you

Questions?

VK (IBM) Conference on Fast Direct Solvers 11-10-2018 23 / 22