a revised 4d-var algorithm yannick trémolet ecmwf ima w/s 29 april, 2002 mike fisher, lars isaksen...

36
A revised 4D-Var algorithm Yannick Trémolet ECMWF IMA W/S 29 April, 2002 Mike Fisher, Lars Isaksen and Erik Andersson For increased efficiency and improved accuracy.

Upload: aron-carter

Post on 29-Jan-2016

216 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: A revised 4D-Var algorithm Yannick Trémolet ECMWF IMA W/S 29 April, 2002 Mike Fisher, Lars Isaksen and Erik Andersson For increased efficiency and improved

A revised 4D-Var algorithm

Yannick TrémoletECMWF

IMA W/S 29 April, 2002

Mike Fisher, Lars Isaksen and Erik Andersson

For increased efficiency and improved accuracy.

Page 2: A revised 4D-Var algorithm Yannick Trémolet ECMWF IMA W/S 29 April, 2002 Mike Fisher, Lars Isaksen and Erik Andersson For increased efficiency and improved

2

Outline

Introduction

Current 4D-Var with 12-hourly cyclingMain characteristics, used data types.The incremental formulation - and associated

approximations and limitations.

The revised algorithmQuadratic inner iterations - using Conjugate

Gradient.Hessian eigen-vector preconditioning.Trajectory interpolation from T511 to T42/T95/T159.Evaluation.

Prospects and Conclusions

Page 3: A revised 4D-Var algorithm Yannick Trémolet ECMWF IMA W/S 29 April, 2002 Mike Fisher, Lars Isaksen and Erik Andersson For increased efficiency and improved

3

Used Data

SYNOPSurf.Press, Wind-10m, RH-2m

AIREPWind, Temperature

SATOBCloud drift winds

DRIBUSurf.Press, Wind-10m

TEMPWind, Temp, Humidity

profiles DROPSONDE

Wind and Temp profiles PILOT/Am+Eu Profilers

Wind profiles PAOB

Surface pressure proxy

ATOVSHIRS, MSU and AMSU-A

radiances SSM/I

TCWV, Wind speed METEOSAT

Water Vapour channel QuikScat

Ambiguous winds SBUV

Layer ozone GOME

Total ozone

Conventional Satellite

Page 4: A revised 4D-Var algorithm Yannick Trémolet ECMWF IMA W/S 29 April, 2002 Mike Fisher, Lars Isaksen and Erik Andersson For increased efficiency and improved

4

ECMWF forecast model geometry

Vertical resolution60 levels

1000

200

50

10

1

0.1

60

30

22

14

5

1

12 levels below 850 hPa

Horizontal resolutionTL511 ~ 40 km

Observations arecompared against

a short-range3-15 hour forecast

Page 5: A revised 4D-Var algorithm Yannick Trémolet ECMWF IMA W/S 29 April, 2002 Mike Fisher, Lars Isaksen and Erik Andersson For increased efficiency and improved

5

The current operational 4D-Var system

Forecast model at T511 (40 km) resolution

Observation minus Background departures are computed using the full model at full resolution at the observed time.

Analysis increments are computed at coarser T159 resolution (125 km), using a tangent linear forecast model and its adjoint.

All observations are analysed simultaneously.

12 hours worth of global obser-vations are used in one go.

Around 1 000 000 data are used, in total, per 12-hour cycle.

Satellite radiances are the most numerous data source

Page 6: A revised 4D-Var algorithm Yannick Trémolet ECMWF IMA W/S 29 April, 2002 Mike Fisher, Lars Isaksen and Erik Andersson For increased efficiency and improved

6

4D-Var finds the 12-hour forecast evolution that best fits the available observations

It does so by adjusting 1) surface pressure, and the upper-air fields of 2) temperature, 3) wind, 4) specific humidity and 5) ozone

A few 4D-Var Characteristics

All data within a 12-hour period are used simultaneously, in one global (iterative) estimation problem

Page 7: A revised 4D-Var algorithm Yannick Trémolet ECMWF IMA W/S 29 April, 2002 Mike Fisher, Lars Isaksen and Erik Andersson For increased efficiency and improved

7

The i-summation is over 1h or ½h-long sub-divisions (or time slots) of the 12-hour assimilation period.

The incremental formulation of 4D-Var

In the incremental formulation (Courtier et al. 1994) the cost function is expressed in terms of increments with respect to the background state, with and linearized around .

The innovations are calculated using the non-linear operators, and :

This ensures the highest possible accuracy for the calculation of the innovations, which are the primary input to the assimilation!

Page 8: A revised 4D-Var algorithm Yannick Trémolet ECMWF IMA W/S 29 April, 2002 Mike Fisher, Lars Isaksen and Erik Andersson For increased efficiency and improved

8

Approximations at inner iterations

1) The tangent-linear approximation:

and

2) Approximations to reduce the cost: this involves degrading the tangent-linear (and its adjoint) with respect to the full model.

Lower resolution (T159 instead of T511),

Simplified physics (some processes ignored),

Simpler dynamics (e.g. spectral instead of grid-point humidity).

This results in a shorter control vector, and cheaper TL and AD model during the minimisation - i.e. the inner iterations.

Page 9: A revised 4D-Var algorithm Yannick Trémolet ECMWF IMA W/S 29 April, 2002 Mike Fisher, Lars Isaksen and Erik Andersson For increased efficiency and improved

9

The outer iterations

After each minimisation at inner level;

is updated: ,

and are re-linearized around .

Innovations are re-calculated using the full non-linear model :

Superscript represents the outer iterations.

The full model remains at T511 throughout.

Page 10: A revised 4D-Var algorithm Yannick Trémolet ECMWF IMA W/S 29 April, 2002 Mike Fisher, Lars Isaksen and Erik Andersson For increased efficiency and improved

10

TL testing Test of linear model based on Taylor series:

Valid for any perturbation (in practice a set of random vectors).

TL and NL run withthe same setup:

Resolution Physics Time step Simpler dynamics Configuration (IFS)

Page 11: A revised 4D-Var algorithm Yannick Trémolet ECMWF IMA W/S 29 April, 2002 Mike Fisher, Lars Isaksen and Erik Andersson For increased efficiency and improved

11

Test of incremental approximation

In 4D-VAR the perturbation is not any vector, it is an analysis increment. It is not random and it is the result of a algorithm which involves the linear model.

The linear and non-linear models are used at different resolutions (T511/T159).

The non-linear model uses more physics.

Humidity is represented in spectral space in the linear model, in grid point space in the non-linear model.

Relative error: vs.

Page 12: A revised 4D-Var algorithm Yannick Trémolet ECMWF IMA W/S 29 April, 2002 Mike Fisher, Lars Isaksen and Erik Andersson For increased efficiency and improved

12

Test of incremental approximation

Compare TL output with finite difference in 4D-VAR setup (resolution, physics, …).

All the necessary information is present during the minimisation:

All the components are used exactly in their 4D-VAR configuration.

High resolution non-linear updateLow resolution non-linear trajectoryLow resolution TL (cost function)Minimisation (TL)Low resolution TL (diagnostic)

High resolution non-linear update

Page 13: A revised 4D-Var algorithm Yannick Trémolet ECMWF IMA W/S 29 April, 2002 Mike Fisher, Lars Isaksen and Erik Andersson For increased efficiency and improved

13

Evolution of TL model error

Operational configuration.

The error is large.

It grows very rapidly in the first hours.

It is not the case in the adiabatic test.

Page 14: A revised 4D-Var algorithm Yannick Trémolet ECMWF IMA W/S 29 April, 2002 Mike Fisher, Lars Isaksen and Erik Andersson For increased efficiency and improved

14

Impact of TL model resolution

T511 outer loop.

Varying inner loop resolution.

The resolution of the inner loop may have reached a limit.

Page 15: A revised 4D-Var algorithm Yannick Trémolet ECMWF IMA W/S 29 April, 2002 Mike Fisher, Lars Isaksen and Erik Andersson For increased efficiency and improved

15

Small scales

•TL at T255,•12h forecast,•Spectral norms

Page 16: A revised 4D-Var algorithm Yannick Trémolet ECMWF IMA W/S 29 April, 2002 Mike Fisher, Lars Isaksen and Erik Andersson For increased efficiency and improved

16

Impact of TL model resolution (adiab.)

Adiabatic test.

Better linear physics is needed.

It is expensive both in development work and CPU.

Page 17: A revised 4D-Var algorithm Yannick Trémolet ECMWF IMA W/S 29 April, 2002 Mike Fisher, Lars Isaksen and Erik Andersson For increased efficiency and improved

17

Hessian eigenvector preconditioning

The optimal pre-conditioner for the 4D-Var minimisation problem is the Hessian of the cost function, .

The full 4D-Var Hessian is not known.

So far has been used as an approximate preconditioner, neglecting the observation term.

The consequence is that patches of very dense or particularly accurate observations may deteriorate the conditioning and slow down the rate-of-convergence.

Page 18: A revised 4D-Var algorithm Yannick Trémolet ECMWF IMA W/S 29 April, 2002 Mike Fisher, Lars Isaksen and Erik Andersson For increased efficiency and improved

18

Trajectory (in)consistency

T319-T63 T511-T159

The discrepancies between the high resolution non-linear update and the low resolution non-linear trajectory runs can be important.

Page 19: A revised 4D-Var algorithm Yannick Trémolet ECMWF IMA W/S 29 April, 2002 Mike Fisher, Lars Isaksen and Erik Andersson For increased efficiency and improved

19

The revised 4D-Var algorithm: Motivation

Improved efficiency

To offset some the cost of planned 1) higher resolution, 2) improved TL physics and 3) increased numbers of satellite data.

Increased TL accuracy

Discrepancies between and can introduce errors which grow quickly over the 12-hour assimilation window, especially affecting the analysis of small-scale phenomena and humidity.

Preparation for new high density satellite data

Coping with large numbers of observational data without deterioration of the rate-of-convergence.

Preparation for cloud and rain assimilation

Requires more extensive use of TL physics, and a good agreement between at inner iterations and at outer.

Page 20: A revised 4D-Var algorithm Yannick Trémolet ECMWF IMA W/S 29 April, 2002 Mike Fisher, Lars Isaksen and Erik Andersson For increased efficiency and improved

20

The revised 4D-Var algorithm: Specification

Quadratic inner iterations. Variational quality control and SCAT ambiguity removal moved to outer level.

Conjugate Gradient minimisation. With objective stopping-criterion based on the gradient-norm reduction.

Hessian eigenvector pre-conditioning. Updated after each inner minimisation.

Multi-Incremental, T42/T95/T159. With some tests at T255.

Interpolation of the trajectory. From T511 to T42/T95/T159.

TL physics. Used during all inner iterations.

Page 21: A revised 4D-Var algorithm Yannick Trémolet ECMWF IMA W/S 29 April, 2002 Mike Fisher, Lars Isaksen and Erik Andersson For increased efficiency and improved

21

The conjugate gradient algorithm minimizes a quadratic function with a symmetric positive-definite Hessian:

Conjugate Gradients and Lanczos Algorithms

The algorithm is:step to the line minimum

recalculate the gradient

calculate a new direction

where:

Eliminate to get the 3-term recurrence (Lanczos):

Page 22: A revised 4D-Var algorithm Yannick Trémolet ECMWF IMA W/S 29 April, 2002 Mike Fisher, Lars Isaksen and Erik Andersson For increased efficiency and improved

22

Conjugate Gradients and Lanczos Algorithms

The gradient vectors in conjugate gradients are orthogonal.

Let be the matrix whose columns are . Then

where is tri-diagonal and The residual term becomes small during the

minimization as the gradient decreases. After iterations, we get . i.e. has the same eigenvalues as . Intermediate matrices have interleaving eigenvalues:

Even for , some eigenvalues are well approximated.

Page 23: A revised 4D-Var algorithm Yannick Trémolet ECMWF IMA W/S 29 April, 2002 Mike Fisher, Lars Isaksen and Erik Andersson For increased efficiency and improved

23

Preconditioning Write the analysis cost function as:

Preconditioning replaces by:

The Hessian of this new function is The trick is to choose so that has a small

condition number. Eigenvector preconditioning sets:

Writing , gives:

If we choose so that , then the condition number of is .

with

Page 24: A revised 4D-Var algorithm Yannick Trémolet ECMWF IMA W/S 29 April, 2002 Mike Fisher, Lars Isaksen and Erik Andersson For increased efficiency and improved

24

Preconditioning

Eig

enva

lue

N=1

1=3105.4

26=492.75

Preconditioning reduces the condition number k=1/N from 3105.4 to 492.75

Page 25: A revised 4D-Var algorithm Yannick Trémolet ECMWF IMA W/S 29 April, 2002 Mike Fisher, Lars Isaksen and Erik Andersson For increased efficiency and improved

25

Preconditioning

Variational Quality Control

Page 26: A revised 4D-Var algorithm Yannick Trémolet ECMWF IMA W/S 29 April, 2002 Mike Fisher, Lars Isaksen and Erik Andersson For increased efficiency and improved

26

Preconditioning

Convergence is roughly twice as fast with Hessian preconditioning.

Page 27: A revised 4D-Var algorithm Yannick Trémolet ECMWF IMA W/S 29 April, 2002 Mike Fisher, Lars Isaksen and Erik Andersson For increased efficiency and improved

27

Preconditioning: Spectrum of Hessian

0

200

400

600

800

1000

1200

1 3 5 7 9 11 13 15 17 19 21 23

Min_42

Min_95

Min_255

The leading eigenvectors of the Hessian are large-scale. It is very effective and cost-efficient to calculate them at

low resolution (T42/T95), They can be used as pre-conditioner to reduce the number

of iterations at higher resolutions (T95/T159 or T255). This naturally leads to a multi-incremental setup.

Page 28: A revised 4D-Var algorithm Yannick Trémolet ECMWF IMA W/S 29 April, 2002 Mike Fisher, Lars Isaksen and Erik Andersson For increased efficiency and improved

28

Multi-incremental: RMS of T analysis increments

Most of the total An-increment is formed at T42. There is a clear scale-separation between successive minimisation. The rapid decrease beyond ~T100 is due to the filtering properties of Jb, and the lack of observational information on smallest scales.

Page 29: A revised 4D-Var algorithm Yannick Trémolet ECMWF IMA W/S 29 April, 2002 Mike Fisher, Lars Isaksen and Erik Andersson For increased efficiency and improved

29

Conjugate-gradient: Reduction of Norm of gradient

With C.G. minimisation the gradient norm reduces nearly monotonically with iteration. It is therefore possible to introduce an objective stopping-criterion based on its ratio. We have chosen a value =0.05.

0.05

Page 30: A revised 4D-Var algorithm Yannick Trémolet ECMWF IMA W/S 29 April, 2002 Mike Fisher, Lars Isaksen and Erik Andersson For increased efficiency and improved

30

C.G. and Lanczos Summary

The close connection between conjugate gradients and the Lanczos algorithm allows us to simultaneously:

Minimize the cost function.

Calculate the eigenvectors and eigenvalues of the Hessian.

The extra computational effort required to calculate the eigenpairs is negligible.

The connection can be exploited to improve the minimization.

The consequence is a more efficient and more robust 4D-Var minimisation – w. r. t. observation amounts and distribution.

Page 31: A revised 4D-Var algorithm Yannick Trémolet ECMWF IMA W/S 29 April, 2002 Mike Fisher, Lars Isaksen and Erik Andersson For increased efficiency and improved

31

Interpolated Trajectory

The increments which are appropriate for the low resolution situation are not always suitable for the high resolution situation.

The trajectory can be interpolated:

This algorithm could not be tested with the traditional tangent linear test because of the two resolutions involved.

High resolution non-linear updateLow resolution non-linear trajectoryLow resolution TL (cost function)Minimisation (TL)Low resolution TL (diagnostic)

High resolution non-linear update

Interpolation

Page 32: A revised 4D-Var algorithm Yannick Trémolet ECMWF IMA W/S 29 April, 2002 Mike Fisher, Lars Isaksen and Erik Andersson For increased efficiency and improved

32

Interpolated trajectory

Page 33: A revised 4D-Var algorithm Yannick Trémolet ECMWF IMA W/S 29 April, 2002 Mike Fisher, Lars Isaksen and Erik Andersson For increased efficiency and improved

33

Performance: Jo cost function

0.E+00

1.E+05

2.E+05

3.E+05

4.E+05

5.E+05

6.E+05

7.E+05

Traj0

Min_

42

Min_

42Tr

aj1

Min_

95

Min_

95Tr

aj2

Min_

255

Min_

T255

Traj3

Traj_Jo

Min_Jo

Min_Jb

0.E+00

1.E+05

2.E+05

3.E+05

4.E+05

5.E+05

6.E+05

7.E+05

Traj0

Min_

42

Min_

42

Min_

159

Min_

159

Traj1

Min_

159

Min_

159

Traj2

Current 4D-Var Revised 4D-Var

Page 34: A revised 4D-Var algorithm Yannick Trémolet ECMWF IMA W/S 29 April, 2002 Mike Fisher, Lars Isaksen and Erik Andersson For increased efficiency and improved

34

Scores

Page 35: A revised 4D-Var algorithm Yannick Trémolet ECMWF IMA W/S 29 April, 2002 Mike Fisher, Lars Isaksen and Erik Andersson For increased efficiency and improved

35

Performance: CPU cost

Operational setup: 1h20min. Revised algorithm: 1h13min. On Fujitsu VPP5000, 16 processors. Elapsed time, I/O not fully optimised.

0

500

1000

1500

2000

2500

Traj0

Min_

42

Min_

42Tr

aj1

Min_

95

Min_

95Tr

aj2

Min_

255

Min_

T255

Traj3

Traj CPU

Min CPU

Page 36: A revised 4D-Var algorithm Yannick Trémolet ECMWF IMA W/S 29 April, 2002 Mike Fisher, Lars Isaksen and Erik Andersson For increased efficiency and improved

36

Conclusions

ECMWF’s 4D-Var has been improved:

Conj. Gradient minimisation Hessian pre-conditioning Inner/outer iteration algorithm Improved TL approximations Multi-incremental T42/T95/T159

These developments will help facilitate:

Use of higher density data Higher resolution Enhanced use of (relatively

costly) TL physics Cloud and rain assimilation

Prospects

The minimisation of the cost function has been improved. More work is needed to improve the representation of

the small scales in the inner loop. Efficiency gains will pay for improved inner loop physics

and resolution.