improving prediction skill of imperfect turbulent models through statistical response...

69
Improving Prediction Skill of Imperfect Turbulent Models through Statistical Response and Information Theory Di Qi, and Andrew J. Majda Courant Institute of Mathematical Sciences Fall 2016 Advanced Topics in Applied Math Di Qi, and Andrew J. Majda (CIMS) Improving Prediction Skill of Imperfect Models Dec. 8, 2016 1 / 61

Upload: others

Post on 01-Feb-2021

2 views

Category:

Documents


0 download

TRANSCRIPT

  • Improving Prediction Skill of Imperfect TurbulentModels through Statistical Response and Information

    Theory

    Di Qi, and Andrew J. Majda

    Courant Institute of Mathematical Sciences

    Fall 2016 Advanced Topics in Applied Math

    Di Qi, and Andrew J. Majda (CIMS) Improving Prediction Skill of Imperfect Models Dec. 8, 2016 1 / 61

  • Challenges for turbulent dynamical systems

    Uncertainty quantification (UQ) deals with the probabilistic characterization of all thepossible evolutions of a dynamical system given an initial set of possible states as well asthe random forcing or parameters.

    Turbulent dynamical systems are characterized by a large dimensional phasespace and high degrees of internal instability.

    Instabilities through energy-conserving nonlinear interactions result in astatistical steady state that is usually non-Gaussian.

    Accurate quantification for the statistical variability to general externalperturbations is important in climate change sciences.

    Major Task of this talk:Investigate a concise systematic framework for measuring and optimizingconsistency and sensitivity of imperfect dynamical models.

    Di Qi, and Andrew J. Majda (CIMS) Improving Prediction Skill of Imperfect Models Dec. 8, 2016 2 / 61

  • General framework for statistical modelingThe system setup will be a finite-dimensional system of, u ∈ RN , with lineardynamics and an energy preserving quadratic part

    du

    dt= L [u] = (L + D) u + B (u,u) + F (t) + σk (t) Ẇk (t;ω) , (1)

    L∗ = −L; D ≤ 0; u · B (u,u) ≡ 0.NEW STRATEGIES FOR REDUCED-ORDER MODELS FOR PREDICTING COMPLEX TURBULENT DYNAMICAL SYSTEMS 5

    Modeling Stages Math. & Computational Tools

    Model SeclectionErgodic Theory

    Statistical Measures in Equilibrium

    Model CalibrationEmpirical Information Theory

    Linear Response TheoryTotal Statistical Energy Equations

    Model PredictionAccurate and Efficient Schemes

    Numerical Stability Analysis

    Rigorous Math Theories

    Computational Methods

    Figure 1.1. The general strategy for the development of reduced-order statistical models. Threesequential stages are required to carry out the reduced-order statistical model, and rigorous math-ematical theories are combined with numerical analysis to calibrate model errors and improve theimperfect model prediction skill.

    The model calibration procedure is usually carried out in a training phase before the prediction, so that theoptimal imperfect model parameters can be achieved through a careful calibration about the true higher-orderstatistics. The ideal way is to find a unified systematic strategy where various external perturbations can be predictedfrom the same set of optimal parameters through this training phase. To achieve this, various statistical theoriesand numerical strategies need to be consulted. Most importantly, we need to consider the linear response theoryto calibrate the model responses in mean and variances; and use empirical information theory to get a balancedmeasure for the error in the leading order moments. In the final model prediction stage, the optimal imperfectmodel parameters are applied for the forecast of various model responses to perturbations. In the constructionabout numerical models, numerical issues also need taking into account to make sure numerical stability andaccuracy. Especially, proper schemes with accuracy order consistent with the reduced model approximation errorshould be proposed to ensure optimal performance.

    2. Statistical Theory Toolkits for Improving Model Prediction Skill

    In this section we introduce the general theoretical toolkits that are useful for capturing the key statisticalfeatures in turbulent systems like (1.1) and improving imperfect model prediction skill. Despite the complexmodel statistical responses in each component as the turbulent dynamical system gets perturbed, there exists asimple and exact statistical energy conservation principle for the total statistical energy of the system describingthe overall (inhomogeneous) statistical structure in the system through a simple scalar dynamical equation. Thetheory is briefly described in Section 2.1. Then the construction about the imperfect reduced-order models concernsabout the consistency in equilibrium (climate fidelity) and the responses to perturbations (model sensitivity).Equilibrium fidelity should be guaranteed in the first place so that the reduced-order model will converge to thetrue equilibrium statistics. To further calibrate the detailed model sensitivity to perturbations in each statisticalcomponent, the linear response theory can offer useful quantities to measure for quantifying the crucial statistics inthe model structure. Combining with the relative entropy under empirical information theory, a general information-theoretical framework can be proposed to tune the imperfect model parameters in a training phase, thus optimalmodel parameters can be used for model prediction in various dynamical regimes. We will describe the basic ideasfirst in this section.

    2.1. A statistical energy conservation principle. Despite the fact that the exact equations for the statisticalmean (1.4) and the covariance fluctuations (1.5) are not closed equations, there is suitable statistical symmetry sothat the energy of the mean plus the trace of the covariance matrix satisfies an energy conservation principle evenwith general deterministic and random forcing. Here we briefly introduce the theory developed in [13] about a totalstatistical energy dynamics for the abstract system (1.1).

    Di Qi, and Andrew J. Majda (CIMS) Improving Prediction Skill of Imperfect Models Dec. 8, 2016 3 / 61

  • Outline

    1 Turbulent systems with quadratic nonlinearitiesGeneral setups for turbulent systems with quadratic nonlinearities

    2 Statistical dynamical models and closure methodsThe Gaussian closure method for statistical predictionSingle point statistics and information barrierImperfect model calibration and empirical information theoryForecast skill of closure methods for forced responses in L-96 model

    3 Strategies for reduced order models in low dimensional subspaceLow order models in reduced subspaceForecast skill of reduced models for forced responses

    4 Predicting statistical response in two-layer baroclinic turbulenceTwo-layer baroclinic turbulence in ocean and atmosphere regimesLow-dimensional reduced-order models for the two-layer systemImperfect model prediction skill with 2× 102 resolved modes for originalsystem with 2× 2562 dimension

    Di Qi, and Andrew J. Majda (CIMS) Improving Prediction Skill of Imperfect Models Dec. 8, 2016 4 / 61

  • Outline

    1 Turbulent systems with quadratic nonlinearitiesGeneral setups for turbulent systems with quadratic nonlinearities

    2 Statistical dynamical models and closure methodsThe Gaussian closure method for statistical predictionSingle point statistics and information barrierImperfect model calibration and empirical information theoryForecast skill of closure methods for forced responses in L-96 model

    3 Strategies for reduced order models in low dimensional subspaceLow order models in reduced subspaceForecast skill of reduced models for forced responses

    4 Predicting statistical response in two-layer baroclinic turbulenceTwo-layer baroclinic turbulence in ocean and atmosphere regimesLow-dimensional reduced-order models for the two-layer systemImperfect model prediction skill with 2× 102 resolved modes for originalsystem with 2× 2562 dimension

    Di Qi, and Andrew J. Majda (CIMS) Improving Prediction Skill of Imperfect Models Dec. 8, 2016 5 / 61

  • General setup of turbulent systems with quadraticnonlinearities

    The system setup will be a finite-dimensional system of, u ∈ RN , with lineardynamics and an energy preserving quadratic part

    du

    dt= L [u] = (L + D) u + B (u,u) + F (t) + σk (t) Ẇk (t;ω) . (2)

    L being a skew-symmetric linear operator L∗ = −L, representing the β-effect ofEarth’s curvature, topography etc.

    D being a negative definite symmetric operator D∗ = D, representing dissipativeprocesses such as surface drag, radiative damping, viscosity etc.

    B (u, u) being the quadratic operator which conserves the energy by itself so that itsatisfies B (u, u) · u = 0.F (t) + σk (t) Ẇk (t;ω) being the effects of external forcing, i.e. solar forcing, whichcan be split into a mean component F (t) and a stochastic component with whitenoise characteristics.

    Di Qi, and Andrew J. Majda (CIMS) Improving Prediction Skill of Imperfect Models Dec. 8, 2016 6 / 61

  • Exact statistical moment equationsStatistical mean and covariance dynamics, u = ū + Zi vi , Rij =

    〈Zi Z∗j

    〉,

    d ū

    dt= (L + D) ū + B (ū, ū) + Rij B

    (vi , vj

    )+ F (t) ,

    dR

    dt= Lv R + RL

    ∗v + QF + Qσ .

    the linear dynamics operator Lv expressing energy transfers between the mean field and thestochastic modes (B), as well as energy dissipation (D), and non-normal dynamics (L)

    {Lv}ij =[(L + D) vj + B

    (ū, vj

    )+ B

    (vj , ū

    )]· vi .

    the positive definite operator Qσ expressing energy transfer due to external stochasticforcing

    {Qσ}ij = v∗i σ∗k σk vj .

    the third-order moments expressing the energy flux between different modes due tonon-linear terms

    QF =〈ZmZnZj

    〉B (vm, vn) · vi + 〈ZmZnZi 〉B (vm, vn) · vj .

    note that energy is still conserved in this nonlinear interaction part

    Tr [QF ] = 2 〈ZmZnZi 〉B (vm, vn) · vi= 2 〈B (Zmvm,Znvn) · Zi vi 〉 = 2

    〈B(u′, u′

    )· u′〉

    = 0.

    Di Qi, and Andrew J. Majda (CIMS) Improving Prediction Skill of Imperfect Models Dec. 8, 2016 7 / 61

  • Outline

    1 Turbulent systems with quadratic nonlinearitiesGeneral setups for turbulent systems with quadratic nonlinearities

    2 Statistical dynamical models and closure methodsThe Gaussian closure method for statistical predictionSingle point statistics and information barrierImperfect model calibration and empirical information theoryForecast skill of closure methods for forced responses in L-96 model

    3 Strategies for reduced order models in low dimensional subspaceLow order models in reduced subspaceForecast skill of reduced models for forced responses

    4 Predicting statistical response in two-layer baroclinic turbulenceTwo-layer baroclinic turbulence in ocean and atmosphere regimesLow-dimensional reduced-order models for the two-layer systemImperfect model prediction skill with 2× 102 resolved modes for originalsystem with 2× 2562 dimension

    Di Qi, and Andrew J. Majda (CIMS) Improving Prediction Skill of Imperfect Models Dec. 8, 2016 8 / 61

  • Statistical Accurate Closure ModelsThe true statistical model

    d ū

    dt= (L + D) ū + B (ū, ū) + Rij B (vi , vj ) + F (t) , ū ∈ RN ,

    dR

    dt= Lv (ū) R + RL

    ∗v (ū) + QF + Qσ, R ∈ RN×N .

    QF ,ij = 〈ZmZnZj〉B (vm, vn) · vi + 〈ZmZnZi 〉B (vm, vn) · vj , trQF = 0.

    Basic Idea:The hierarchical moment equations will never be closed by adding higherorder momentsEquations for the stochastic coefficients

    dZidt

    = Zj [(L + D) vj + B (ū, vj ) + B (vj , ū)] · vi+ [B (u′,u′)− Rjk B (vj , vk )] · vi + Ẇkσk · vi .

    Replace nonlinear interaction term by linear damping and Gaussian noise

    Statistical closure models

    B (u′,u′) · vi =∑

    Bij Zi Zj −→ −dM,i (R) Zi + σM,i Ẇi .Di Qi, and Andrew J. Majda (CIMS) Improving Prediction Skill of Imperfect Models Dec. 8, 2016 9 / 61

  • Statistical Accurate Closure ModelsThe reduced-order approximation ūM ∈ RM , M � N

    d ūMdt

    = (L + D) ūM + B (ūM , ūM ) + RM,ij B (vi , vj ) + F,

    dRMdt

    = Lv RM + RM L∗v + Q

    MF + Qσ,

    Basic Idea:The hierarchical moment equations will never be closed by adding higherorder momentsEquations for the stochastic coefficients

    dZidt

    = Zj [(L + D) vj + B (ū, vj ) + B (vj , ū)] · vi+ [B (u′,u′)− Rjk B (vj , vk )] · vi + Ẇkσk · vi .

    Replace nonlinear interaction term by linear damping and Gaussian noise

    Statistical closure models

    B (u′,u′) · vi =∑

    Bij Zi Zj −→ −dM,i (R) Zi + σM,i Ẇi .

    Di Qi, and Andrew J. Majda (CIMS) Improving Prediction Skill of Imperfect Models Dec. 8, 2016 9 / 61

  • Quasilinear Gaussian (QG) closure

    The simplest closure scheme for the moment problem is to completely neglect thethird-order moments, i.e. set QF = 0.

    d ū

    dt= (L + D) ū + B (ū, ū) + Rij B (vi , vj ) + F (t) , (3)

    dR

    dt= Lv R + RL

    ∗v +��QF + Qσ. (4)

    persistent instabilities that would cause uncontrollable growth of the unstablemodes.

    for L-96 system the QG closure scheme avoids blow-up.Consider homogeneous statistical solutions of QG closure for the L-96 system: the meanū (t) is a time varying constant; the covariance multiplier is diagonal in Fourier spaceR = rjδij ; the linear operator is diagonal Fourier multiplier Lv = diag

    (lj),

    drj

    dt= 2Relj (ū) rj , lj (ū) = (exp (2πij/J)− exp (−4πij/J)) ū − 1

    dū

    dt= −ū + F +

    20∑j=0

    rjB(vj , vj

    )· v0 + B

    (vj , vj

    )∗ · v02

    .

    Di Qi, and Andrew J. Majda (CIMS) Improving Prediction Skill of Imperfect Models Dec. 8, 2016 10 / 61

  • The statistical steady state solution requires rj,∞ = 0 unless Relj (ū) = 0, that is,

    maxj

    Relj (ū) = 0

    ⇒ ū(

    cos2πj

    J− cos 4πj

    J

    )− 1 = 0

    ⇒ ūcr =1

    cos 2πjcrJ − cos4πjcr

    J

    ≈ 0.8944, jcr = 8

    ⇒ rj 6= 0, j = 8; rj = 0, ∀j 6= 8.

    Correspondingly

    ūcr = F + r8B (v8, v8) · v0 + B (v8, v8)∗ · v0

    2

    = F + r8

    (cos

    4πjcrJ− cos 2πjcr

    J

    )< F .

    Di Qi, and Andrew J. Majda (CIMS) Improving Prediction Skill of Imperfect Models Dec. 8, 2016 11 / 61

  • Number of positive eigenvalues of Lv (ū) for L-96 system with respect to themagnitude of the mean field ū. No matter how large the external forcing F is, theQG closure method always converges to the unique critical value ūcr , satisfyingmarginal linear stability.

    0 0.5 1 1.5 2 2.5 3 3.5 40

    1

    2

    3

    4

    5

    6

    7

    8

    9

    10

    Num

    ber o

    f pos

    itive

    eig

    enva

    lues

    F=6 F=8 F=16F=5 F=32

    Equilibrium point for QG models

    Figure 2: Number of positive eigenvalues of Lv (ū) for L-96 with respect to the magnitude of themean field ū. The red dashed lines indicate exact equilibrium points for di§erent value of the forcingparameter F. The green solid lines indicate equilibrium points for the DO UQ scheme for N = 10.

    linearized dynamics - described by Lv (ū) - which will magnify the initial uncertainty. In Figure 2we present the number of unstable wavenumbers, i.e. the number of eigenvalue pairs with positivereal part for the linearized matrix Lv (ū) , with respect to the value of the steady state mean field(note that spatial homogeneity implies spatially constant mean field). In the same plot we showthe with dashed lines the steady state value of the mean field for specific values of the forcingparameter F. Based on the presence of persistent positive eigenvalues in the steady-state we have(for su¢ciently large F ) the following energy cycle (Figure 3):

    1. Energy from the external excitation F leads to the growth of the mean field energy 12 ū.ū(equation (2)).

    2. The important magnitude of ū leads to the activation of positive eigenvalues of Lv (ū) (seeFigure 2) that essentially absorb energy from the mean field and transform it to variance forthe stochastic modes that are associated with this process.

    3. The nonlinear conservative term B(u0,u0) absorbs part of this energy, transferring it to thestable stochastic modes. It acts as dissipative mechanism for the unstable modes (balancingtheir positive eigenvalues) and external noise for the stable modes bringing all of them into astatistical equilibrium.

    6

    Di Qi, and Andrew J. Majda (CIMS) Improving Prediction Skill of Imperfect Models Dec. 8, 2016 12 / 61

  • Energy flow in the quadratic systemsStatistical mean and covariance dynamics

    d ū

    dt= (L + D) ū + B (ū, ū) + Rij B

    (vi , vj

    )+ F (t) ,

    dR

    dt= Lv R + RL

    ∗v + QF + Qσ .

    In imperfect closure models, replace

    QF =〈ZmZnZj

    〉B (vm, vn) · vi + c.c.

    −→ QMF = QMF− + Q

    MF + = −2dM,k (R) rk + σ

    2M,k (R)

    Figure 3: Energy flow in the L-96 system. Energy flows from the mean field to the linearly unstablemodes and then redistributed through nonlinear, conservative terms to the stable modes. Redarrows denotes dissipation, while the dashed box indicates terms that conserve energy.

    4. The stable modes receive energy from the unstable ones through the nonlinear conservativeterms. A portion of this energy is dissipated and the rest is subsequently returned to the meanthrough the modes with negative eigenvalues. All modes including the mean flow dissipateenergy through the negative definite part of the linearized dynamics.

    This cycle of energy in the L-96 model is representative of any general model that contains i)unstable linearized modes whose stability depends on the mean field energy level (i.e. that theyabsorb energy from the mean field), ii) stable modes, and iii) nonlinear conservative terms thattransfer energy between the modes and through this transfer the system is reaching an equilibrium.This structure is ubiquitous in turbulent systems in the atmosphere and ocean with forcing anddissipation [26, 22, 1, 2] as well as in fluid flows with lower dimensional attractors [23]. However,there are also examples of idealized truncated geophysical flows without dissipation and forcingwith a Gaussian statistical equilibrium where the linear operator at the climate mean state is stablewhile the system has many positive Lyapunov exponents [19].

    7

    Figure 4: Energy flow in the MQG UQ scheme. Energy flows from the mean field to the linearlyunstable modes and then redistributed through empirical, conservative fluxes to the stable modes.Red arrows denotes dissipation, while the dashed box indicates terms that conserve energy.

    we substitute the nonlinear conservative mechanism by a conservative pair of positive and negativeenergy fluxes having the form of additional damping for the unstable modes and additive noisefor the stable modes (Figure 4). Note that this additional damping is chosen so that the unstableeigenvalues of the original linearized dynamics are guaranteed to have zero real part in the statisticalsteady state. In that sense this is the minimal amount of additional damping and noise required toachieve marginal stability (non-positive eigenvalues) of the correct steady state statistics. Thus, wehave a minimally changed model compared to the original equation that admits the correct steadystate statistics as a stable equilibrium stage. In the next subsections we will see that for numericalreasons it is required to add a small amount of additional damping (and noise) so that the correctstatistical steady state is not just marginally stable but it is associated with eigenvalues havingpurely negative real part. Moreover, in the transient phase of the dynamics the intensity of thenonlinear energy fluxes should depend on the energy level of the system and to this end we willapply a scaling factor to the additional damping and noise (that represent the nonlinear energyfluxes) which will take into account this dependence.Note that all of the required fluxesQ−F1, Q

    +F1 are evaluated explicitly from available information

    involving the linear operator, Lv (ū1), and the covariance matrix, R1 in a statistical steady state.In addition, since the nonlinear flux model is kept separate from the unmodified linear dynamics, itexpresses an inherent property of the system, a direct link between second and third-order statisticalmoments in the same spirit that Karman-Howarth equation [10] does for isotropic turbulence.

    11

    Di Qi, and Andrew J. Majda (CIMS) Improving Prediction Skill of Imperfect Models Dec. 8, 2016 13 / 61

  • L-96 system with homogeneous assumptionThe homogeneous L-96 system

    duj

    dt= uj−1

    (uj+1 − uj−2

    )− duj + F , j = 0, 1, ..., J − 1,

    satisfies 〈ui1 ui2 · · · uin

    〉=〈ui1+l ui2+l · · · uin+l

    〉, ∀l .

    Fourier basis naturally becomes eigenfunctions for the L-96 operator, the moment equations aresimplified

    Moment equations for homogeneous L-96

    dū (t)

    dt= −d (t) ū (t) +

    1

    J

    J/2∑k=−J/2+1

    rk (t) Γk + F (t) ,

    drk (t)

    dt= 2 [−Γk ū (t)− d (t)] rk (t) + QF ,kk , k = 0, 1, ..., J/2.

    Note that Γk = cos4πk

    J− cos 2πk

    J, and the nonlinear flux QF becomes diagonal

    QF ,kk′ = 2

    (1

    J

    )1/2∑m

    Re{〈ZmZ−m−k Zk〉

    (e−2πı

    2m+kJ − e2πı

    m+2kJ

    )}δkk′ ,

    with energy conservation trQF = 0.

    Di Qi, and Andrew J. Majda (CIMS) Improving Prediction Skill of Imperfect Models Dec. 8, 2016 14 / 61

  • Models with consistent equilibrium single point statistics

    QMF ,kk = QMF−,kk + Q

    MF +,kk = −2dM,k (R) rk + σ

    2M,k (R) .

    Gaussian closure 1 (GC1-1pt): let

    dM,k (R) = dM ≡ const., σ2M,k (R) = σ2M ≡ const.,

    QGC1F = − (dM R + RdM ) + σ2M I ; (5)Gaussian closure 2 (GC2-1pt): let

    dM,k (R) = �MJ

    2

    (trR)1/2

    (trR∞)3/2≡ �M d̄ , σ2M,k (R) = �M

    (trR)3/2

    (trR∞)3/2

    QGC2F = −�M(d̄R + Rd̄

    )+ �M

    (trR)3/2

    (trR∞)3/2

    I . (6)

    Note that GC1-1pt includes parameters(dM , σ

    2M

    )and the nonlinear energy

    trQGC1F = −2dMtrRM + Jσ2M may not be conserved, while GC2-1pt has one parameter�M and nonlinear energy conservation is enforced by construction trQ

    GC2F = 0.

    Single point statistics consistency can be fulfilled through tuning the control parameters.But these models are calibrated by ignoring spatial correlations and a natural informationbarrier is present which cannot be overcome by these imperfect models.

    Di Qi, and Andrew J. Majda (CIMS) Improving Prediction Skill of Imperfect Models Dec. 8, 2016 15 / 61

  • Equation for statistical energy

    The statistical mean and variance at each individual grid point focus on the single point meanand variance of the system (and ignore the cross-correlation between different grids)

    ū1pt =1

    J

    J−1∑j=0

    uj = ū, R1pt =1

    J

    J/2∑k=−J/2+1

    rk =1

    JtrR.

    By multiplying ū on both sides of the mean equation, and summing up all the modes in thevariance equation

    dū2

    dt= −2dū2 + 2ūF + 2

    ∑k

    Γk rk ū.

    dtrR

    dt= 2

    (−∑

    k

    Γk rk ū

    )− 2dtrR + trQF .

    It is convenient to define the statistical energy as

    E (t) =1

    2ū2 +

    1

    2trR.

    Thus the corresponding dynamical equation for the energy E can be easily derived as

    dE

    dt= −2d (t) E + F · ū +

    1

    2trQF = −2E + F · ū,

    with symmetry of nonlinear energy conservation assumed.

    Di Qi, and Andrew J. Majda (CIMS) Improving Prediction Skill of Imperfect Models Dec. 8, 2016 16 / 61

  • Realizability of 1-point statistics

    In a similar way, we can calculate EM from imperfect model. With the same set ofassumptions, the responses between perfect and imperfect model satisfy

    d

    dt(δE − δEM ) = −2d (δE − δEM ) + F · (δū − δūM ) .

    Proposition

    Consider a system with homogenous statistical solution and an imperfect closuremodel with flux QMF satisfying symmetry of nonlinear energy conservation.Statistical equilibrium fidelity for one point statistics of the variancetrRM,∞ = trR∞ is satisfied if the same mean state ūM,∞ = ū∞ is achieved atequilibrium. Furthermore, with equilibrium consistency, we can successfully predictthe response for the one point statistics using the imperfect model‖trRδ,M − trRδ‖ = O

    (δ2)

    if we can predict the mean response successfully

    ‖ūδ,M − ūδ‖ = O(δ2).

    Di Qi, and Andrew J. Majda (CIMS) Improving Prediction Skill of Imperfect Models Dec. 8, 2016 17 / 61

  • Information barrier in 1-point statistics

    the information distance between the truth and imperfect model prediction has the form

    P(π, πM1pt

    )= [S (πG )− S (π)] + P

    πG , J−1∏j=0

    πG1pt,j

    + J−1∑j=0

    P(πG1pt,j , π

    M1pt,j

    ).

    The first part on the right hand side of is the inherent information barrier in Gaussianapproximation. Using spectral random fields

    uG = ū +

    J/2∑k=−J/2+1

    R1/2k e

    ikxj Ŵk , uG1pt = ū +

    J/2∑k=−J/2+1

    (∑J−1j=0 Rj

    J

    )1/2e ikxj Ŵk .

    PropositionThe information barrier between the Gaussian random field and the uncorrelated one pointstatistics random field is given by

    P

    πG , J−1∏j=0

    πG1pt

    = J log det

    (∑J−1j=0 Rj/J

    )(∏J−1

    j=0 det Rj

    )1/J ∼ R−11pt (σmax − σmin)2 .

    Di Qi, and Andrew J. Majda (CIMS) Improving Prediction Skill of Imperfect Models Dec. 8, 2016 18 / 61

  • Hierarchy of closure models with consistent equilibrium foreach mode

    QMF ,kk = QMF−,kk + Q

    MF +,kk = −2dM,k (R) rk + σ

    2M,k (R) .

    Gaussian closure 1 (GC1): let

    dM,k (R) = dM,k , σ2M,k (R) = σ

    2M , dM,k = [−Γk ū∞ − d] + σ

    2M/2rk,∞; (7)

    Gaussian closure 2 (GC2): let

    dM,k (R) = �1,k d̄ , σ2M,k (R) = �M

    (trR)3/2

    (trR∞)3/2

    ,

    �1,k =2 [−Γk ū∞ − d] rk,∞ + �M

    Jrk,∞/trR∞, d̄ =

    J

    2

    (trR)1/2

    (trR∞)3/2

    . (8)

    Above dM,k or �1,k is chosen so that the system has the same equilibrium mean ū∞ andvariance rk,∞ as the true model, therefore ensuring equilibrium consistency by finding thesteady state solutions through simple algebraic manipulations.Modified Gaussian closure (MQG): let

    dM (R) =f (R)

    f (R∞)N∞, σ

    2M (R) = −

    trQMQGF−

    trQMQGF +,∞

    [(Γk ū∞ + d) rk,∞δI+ + qs

    ],

    with

    N∞,kk = [Γk ū∞ + d] δI− −1

    2qs r−1k,∞.

    Above I− represents the unstable modes with Γk ū∞ + d > 0 while I+ is the stable oneswith Γk ū∞ + d ≤ 0.

    Di Qi, and Andrew J. Majda (CIMS) Improving Prediction Skill of Imperfect Models Dec. 8, 2016 19 / 61

  • Model calibration blending statistical response andinformation theory

    Accurate modeling about the model sensitivity to various external perturbationsrequires the imperfect reduced-order models to correctly reflect the true system’s“memory” about its previous states.

    the linear response operator can characterize the model sensitivity involvingthe nonlinear effects in the system regardless of the specific forms of theexternal perturbations.

    empirical information theory can be used as the distance between these twooperators to calculate the unbiased and invariant measure for modeldistributions.

    Di Qi, and Andrew J. Majda (CIMS) Improving Prediction Skill of Imperfect Models Dec. 8, 2016 20 / 61

  • Empirical information theoryEmpirical information theory: A natural way to measure the lack of informationin one probability density from the imperfect model, πM , compared with the trueprobability density, π, is through the relative entropy, given by

    P(π, πM

    )=

    ∫π log

    π

    πM.

    This functional on probability densities has two attractive features as a metric:

    P (p, q) ≥ 0 with equality if and only if p = q;P (p, q) is invariant under general nonlinear changes of variables.

    Model optimization: Consider a class of imperfect models M. The bestimperfect model for the coarse-grained variable u is the M∗ ∈M so that

    P(π, πM

    ∗)

    = minM∈M

    P(π, πM

    ). (9)

    Also, actual improvements in a given imperfect model with distribution πM (u)resulting in a new πMpost (u) should result in improved information for the perfect

    model, so that P(π, πMpost

    )≤ P

    (π, πM

    ).

    Di Qi, and Andrew J. Majda (CIMS) Improving Prediction Skill of Imperfect Models Dec. 8, 2016 21 / 61

  • Empirical information theory

    The actual model density πL (u) only reflects the best unbiased estimate of theperfect model given the L measurements ĒL.

    P(π, πML′

    )= P (π, πL) + P

    (πL, π

    ML′)

    = [S (πL)− S (π)] + P(πL, π

    ML′), for L′ ≤ L.

    The entropy difference, S (πL)− S (π), precisely measures an intrinsic error fromthe L measurements of the perfect system.

    Practical setup for calibration of contemporary models involves only mean andcovariance. If the density functions π, πM involve only the first two moments,

    P(π, πM

    )=

    1

    2(ū− ūM )T R−1 (ū− ūM ) +

    1

    2

    (tr(RR−1M

    )− J − log det

    (RR−1M

    )).

    First term is the signal, reflecting the model error in the mean but weighted by theinverse of the covariance R−1, while the second term, the dispersion, involves onlythe model error covariance ratio, RR−1M .

    Di Qi, and Andrew J. Majda (CIMS) Improving Prediction Skill of Imperfect Models Dec. 8, 2016 22 / 61

  • Theories for improving imperfect model prediction skill

    Linear response theory: The linear response of a system to small perturbationscan be predicted by observing appropriate statistics of the system in equilibriumwithout the need of applying any perturbations.

    Assume the perfect model with equilibrium measure πeq, and external perturbation

    ut = f (u) , fδ = f + δf ′ (t) , δf ′ = δw (u) f (t) .

    For any functional A (u)

    EδA (u) = Eeq (A) + δE ′A + O(δ2). (10)

    Eeq (A) =

    ∫A (u)πeq (u) du, δE

    ′A =

    ∫ t

    0

    RA (t − s) δf ′ (s) ds, (11)

    where RA (t) is called the linear response operator.

  • Linear response operator RA (t)The linear operator RA (t) can be calculated via the correlations in the unperturbed climate

    RA (t) = 〈A (u (t + s)) B (u (s))〉 ,

    with the special nonlinear functional perturbed by the change δw (u) f (t)

    B (u) ≡LFPπeq (u)

    πeq= −

    divu (wπeq)

    πeq.

    The problem in calculating the leading order response above is that the linear response operatorRA (t) is expensive to calculate for general systems.

    kicked response1:

    π |t=0= πeq (u− δu) = πeq − δu · ∇πeq + O(δ2).

    For δ small enough, the linear response operator RA (t) can be calculated bysolving the unperturbed system with a perturbed initial distribution

    δRA (t) ≡ δu · RA =∫

    A (u) δπ′ + O(δ2).

    1Majda, Abramov, and Grote, Information theory and stochastics for multiscale nonlinearsystems vol. 25.

    Di Qi, and Andrew J. Majda (CIMS) Improving Prediction Skill of Imperfect Models Dec. 8, 2016 24 / 61

  • Link between statistical equilibrium fidelity and forecastingskill

    Given the optimal model for the unperturbed climate πeq, how can we assess the error in theclimate change prediction P

    (πδ, π

    )based on the unperturbed climate?

    Under assumptions with diagonal covariance matrices R = diag (Rk ) and equilibrium modelfidelity P

    (πG , π

    MG

    )= 0, the relative entropy between perturbed model density πMδ and the true

    perturbed density πδ with small perturbation δ can be expanded as

    P(πδ, π

    )= S

    (πG ,δ

    )− S (πδ)

    +1

    2

    ∑k

    (δūk − δūM,k

    )R−1k

    (δūk − δūM,k

    )+

    1

    4

    ∑k

    R−2k(δRk − δRM,k

    )2+ O

    (δ3).

    Here in the first line S(πG ,δ

    )− S (πδ) is the intrinsic error from Gaussian approximation of the

    system. Rk is the equilibrium variance in k-th component, and δūk and δRk are the linearresponse operators for the mean and variance in k-th component. (Majda & Gershogorin, PNAS,2011)

    Di Qi, and Andrew J. Majda (CIMS) Improving Prediction Skill of Imperfect Models Dec. 8, 2016 25 / 61

  • Optimization in the training phase

    To summarize, consider a class of imperfect models, M. The optimal modelM∗ ∈M that ensures best information consistent responses is characterized withthe smallest additional information in the linear response operator RA, such that

    ∥∥∥P(

    pδ, pM∗

    δ

    )∥∥∥L1([0,T ])

    = minM∈M

    ∥∥P(pδ, p

    )∥∥L1([0,T ])

    ,

    pMδ can be achieved through a kicked response procedure in the trainingphase compared with the actual observed data pδ in nature;

    the information distance between perturbed responses P(pδ, p

    )can be

    calculated through the expansion formula;

    the information distance P(pδ (t) , p

    Mδ (t)

    )is measured at each time instant,

    so the entire error is averaged under the L1-norm inside a time window [0,T ].

    Di Qi, and Andrew J. Majda (CIMS) Improving Prediction Skill of Imperfect Models Dec. 8, 2016 26 / 61

  • Lorenz 96 system

    The Lorenz 96 model was introduced to mimick the large-scale behavior of themid-latitude atmosphere around a circle of constant latitude. The model is adiscrete periodic system described by the equations

    dujdt

    = uj−1 (uj+1 − uj−2)− duj + F , j = 0, 1, ..., J − 1. (12)

    with J = 40 the number of grids and Fi the deterministic forcing. The quadraticpart conserves energy, that is, B (u, u) · u = 0. The L-96 model offers morecontrollable, simplified scenarios but still mimicking the complex features of thevastly more complex true turbulent system..

    FLinear Analysis Mean Statistics

    kbegin kend Reω ū E (ū) Ep

    5 5 11 0.08 1.6344 53.4276 110.0132

    6 4 12 0.0998 2.0127 81.0260 160.5063

    8 4 12 0.1004 2.3418 109.6825 265.0064

    16 4 12 0.0876 3.0869 190.5900 797.2113

    Di Qi, and Andrew J. Majda (CIMS) Improving Prediction Skill of Imperfect Models Dec. 8, 2016 27 / 61

  • F = 5

    space

    10 20 30 40

    tim

    e

    0

    5

    10

    15

    20

    -3

    -2

    -1

    0

    1

    2

    3

    4

    5

    6

    7

    wavenumber

    0 5 10 15 20P

    erc

    enta

    ge o

    f E

    nerg

    y %

    0

    0.05

    0.1

    0.15

    0.2

    0.25

    0.3

    0.35

    0.4F = 5

    Re(uk)

    -60 -40 -20 0 20 40 60

    dis

    trib

    utio

    n function

    0

    0.005

    0.01

    0.015

    0.02

    0.025

    0.03

    0.035

    pdfs for Re(uk)

    mode 7mode 8

    time

    0 5 10 15 20 25 30

    corr

    ela

    tion function

    -0.5

    0

    0.5

    1Autocorrelation function for F = 5

    F = 8

    space

    10 20 30 40

    tim

    e

    0

    5

    10

    15

    20

    -6

    -4

    -2

    0

    2

    4

    6

    8

    10

    12

    wavenumber

    0 5 10 15 20

    Perc

    enta

    ge o

    f E

    nerg

    y %

    0.02

    0.04

    0.06

    0.08

    0.1

    0.12F = 8

    Re(uk)

    -150 -100 -50 0 50 100 150

    dis

    trib

    ution function

    0

    0.005

    0.01

    0.015

    0.02

    pdfs for Re(uk)

    mode 8mode 7

    time

    0 5 10 15 20

    corr

    ela

    tion function

    -0.2

    0

    0.2

    0.4

    0.6

    0.8

    1Autocorrelation function for F = 8

    Figure: Numerical solutions of L-96 model in space-time through MC simulations for weaklychaotic (F = 5), and strongly chaotic (F = 8) regime.

    Di Qi, and Andrew J. Majda (CIMS) Improving Prediction Skill of Imperfect Models Dec. 8, 2016 28 / 61

  • Tuning imperfect model in the training phase

    50 100 150 200

    σM

    2

    0.1

    0.2

    0.3

    0.4

    0.5

    P(δπ

    ,δπ

    M)

    tuning model, GC1

    50 100 150 200

    ǫM

    0.1

    0.2

    0.3

    0.4

    0.5

    P(δπ

    ,δπ

    M)

    tuning model, GC2

    0 0.2 0.4 0.6 0.8

    qsM

    0.1

    0.12

    0.14

    0.16

    0.18

    0.2

    P(δπ

    ,δπ

    M)

    tuning model, MQG

    (a) Information distance in response operator

    0 0.5 1 1.5 2 2.5 3

    time

    -0.5

    0

    0.5

    1

    Ru

    Linear response for the mean (optimal parameter)

    truthGC1GC2MQG

    0 0.5 1 1.5 2 2.5 3

    time

    -50

    0

    50

    100

    150

    200

    2

    Linear response for total variance (optimal parameter)

    (b) Linear response operators for the mean and total variance

    Figure: Training imperfect models in the training phase with full dimensional models by tuningmodel parameters. Time-averaged information distances in linear response operators for the

    three closure models GC1, GC2, and MQG as the model parameter changes are shown in the

    first row (the point where the information error is minimized is marked with a circle).

    Di Qi, and Andrew J. Majda (CIMS) Improving Prediction Skill of Imperfect Models Dec. 8, 2016 29 / 61

  • Prediction skill of closure methods in L-96 system

    0 5 10 15 20 25 302.3

    2.4

    2.5State of the Mean

    truthGC1GC2MQG

    0 5 10 15 20 25 30

    time

    500

    600

    700Total variance

    0 5 10 15 20 25 302.3

    2.4

    2.5State of the Mean

    truthGC1GC2MQG

    0 5 10 15 20 25 30

    time

    500

    600

    700Total variance

    Model with non-optimal parameterModel with optimal parameter

    Figure: Upward ramp-type forcing: imperfect model predictions in mean state and totalvariance for the closure methods GC1 (dotted-dashed blue), GC2 (dashed green), andMQG (solid red).

    Di Qi, and Andrew J. Majda (CIMS) Improving Prediction Skill of Imperfect Models Dec. 8, 2016 30 / 61

  • Prediction skill of closure methods in L-96 system

    0 5 10 15 20 25 302.2

    2.25

    2.3

    2.35State of the Mean

    truthGC1GC2MQG

    0 5 10 15 20 25 30

    time

    400

    450

    500

    550Total variance

    0 5 10 15 20 25 302.2

    2.25

    2.3

    2.35State of the Mean

    truthGC1GC2MQG

    0 5 10 15 20 25 30

    time

    400

    450

    500

    550Total variance

    Model with non-optimal parameterModel with optimal parameter

    Figure: Downward ramp-type forcing: imperfect model predictions in mean state andtotal variance for the closure methods GC1 (dotted-dashed blue), GC2 (dashed green),and MQG (solid red).

    Di Qi, and Andrew J. Majda (CIMS) Improving Prediction Skill of Imperfect Models Dec. 8, 2016 30 / 61

  • Prediction skill of closure methods in L-96 system

    0 5 10 15 202

    2.2

    2.4

    2.6State of the Mean

    truthGC1GC2MQG

    0 5 10 15 20

    time

    400

    500

    600

    700Total variance

    0 5 10 15 202

    2.2

    2.4

    2.6State of the Mean

    truthGC1GC2MQG

    0 5 10 15 20

    time

    400

    500

    600

    700Total variance

    Model with non-optimal parameterModel with optimal parameter

    Figure: Periodic forcing: imperfect model predictions in mean state and total variance forthe closure methods GC1 (dotted-dashed blue), GC2 (dashed green), and MQG (solidred).

    Di Qi, and Andrew J. Majda (CIMS) Improving Prediction Skill of Imperfect Models Dec. 8, 2016 30 / 61

  • Prediction skill of closure methods in L-96 system

    0 5 10 15 20

    2.2

    2.4

    2.6State of the Mean

    truthGC1GC2MQG

    0 5 10 15 20

    time

    400

    500

    600

    Total variance

    0 5 10 15 20

    2.2

    2.4

    2.6State of the Mean

    truthGC1GC2MQG

    0 5 10 15 20

    time

    400

    500

    600

    Total variance

    Model with non-optimal parameterModel with optimal parameter

    Figure: Random forcing: imperfect model predictions in mean state and total variance forthe closure methods GC1 (dotted-dashed blue), GC2 (dashed green), and MQG (solidred).

    Di Qi, and Andrew J. Majda (CIMS) Improving Prediction Skill of Imperfect Models Dec. 8, 2016 30 / 61

  • Information errors of closure methods in L-96 system

    0 10 20 308

    8.2

    8.4

    8.6

    8.8Forcing 1: upward ramp-type forcing

    0 10 20 300

    0.1

    0.2

    0.3

    0.4total information error, GC1

    optimal

    non-optimal

    0 10 20 300

    0.02

    0.04

    0.06

    0.08total information error, GC2

    optimal

    non-optimal

    0 10 20 300

    0.005

    0.01

    0.015

    0.02total information error, MQG

    optimal

    non-optimal

    (a) climate change forcing

    0 5 10 15 207

    7.5

    8

    8.5

    9Forcing 2: periodic forcing

    0 5 10 15 200

    0.1

    0.2

    0.3total information error, GC1

    optimal non-optimal

    0 5 10 15 200

    0.02

    0.04

    0.06total information error, GC2

    optimal non-optimal

    0 5 10 15 200

    0.02

    0.04

    0.06total information error, MQG

    optimal non-optimal

    (b) periodic forcing

    time

    0 5 10 15 206

    7

    8

    9Forcing 3: random forcing

    time

    0 5 10 15 200

    0.2

    0.4

    0.6total information error, GC1

    optimal

    non-optimal

    time

    0 5 10 15 200

    0.02

    0.04

    0.06

    0.08total information error, GC2

    optimal

    non-optimal

    time

    0 5 10 15 200

    0.02

    0.04

    0.06total information error, MQG

    optimal

    non-optimal

    (c) random forcing

  • Outline

    1 Turbulent systems with quadratic nonlinearitiesGeneral setups for turbulent systems with quadratic nonlinearities

    2 Statistical dynamical models and closure methodsThe Gaussian closure method for statistical predictionSingle point statistics and information barrierImperfect model calibration and empirical information theoryForecast skill of closure methods for forced responses in L-96 model

    3 Strategies for reduced order models in low dimensional subspaceLow order models in reduced subspaceForecast skill of reduced models for forced responses

    4 Predicting statistical response in two-layer baroclinic turbulenceTwo-layer baroclinic turbulence in ocean and atmosphere regimesLow-dimensional reduced-order models for the two-layer systemImperfect model prediction skill with 2× 102 resolved modes for originalsystem with 2× 2562 dimension

    Di Qi, and Andrew J. Majda (CIMS) Improving Prediction Skill of Imperfect Models Dec. 8, 2016 32 / 61

  • Reduced-order models in low dimensional subspace – Meanevolution

    Use s orthonormal eigenvectors of full covariance matrix R∞ to be {vi}si=1 (e.g.EOF modes). Form projection matrix P = [v1, · · · , vs ], and the reducedcovariance

    Rs = P∗RP.

    To compensate for the influence of quadratic terms only partially modeled due totruncation, introduce additional forcing G∞

    d ū

    dt= (L + D) ū + B (ū, ū) +

    s∑

    i,j=1

    Rs,ij B (vi , vj ) + F + G∞.

    Di Qi, and Andrew J. Majda (CIMS) Improving Prediction Skill of Imperfect Models Dec. 8, 2016 33 / 61

  • Reduced-order models in low dimensional subspace – Meanevolution

    d ū

    dt= (L + D) ū + B (ū, ū) +

    s∑

    i,j=1

    Rs,ij B (vi , vj ) + F + G∞.

    Determine G∞ through statistical steady-state information for covariance andmean. The equilibrium equation is:

    G∞ = − (L + D) ū∞ − B (ū∞, ū∞)−s∑

    i,j=1

    Rs∞,ij B (vi , vj )− F,

    where Rs∞ = P∗R∞P, thereby guaranteeing that ū∞ is the steady state of the

    truncated equation.

    Di Qi, and Andrew J. Majda (CIMS) Improving Prediction Skill of Imperfect Models Dec. 8, 2016 34 / 61

  • Reduced-order models in low dimensional subspace –Covariance evolution

    The exact equation for the reduced covariance is

    dRsdt

    = Lv ,sRs + RsL∗v ,s + QF ,s .

    QF ,s contains nonlinear dynamics between modes as well as linear dynamicsmissing from truncation. Note in general we no longer have conservationproperty, Tr [QF ,s ] 6= 0.Still we can use steady state information

    QF ,s∞ = −Lv ,sRs∞ − Rs∞L∗v ,s .

    As done in MQG, split QF ,s∞ into positive/negative definite parts as anoise/damping pair Q+F ,s∞, Q

    −F ,s∞.

    Di Qi, and Andrew J. Majda (CIMS) Improving Prediction Skill of Imperfect Models Dec. 8, 2016 35 / 61

  • Low order models in reduced subspaceSolve the statistics in a low-order subspace spanned by leading EOFs, vk , k = 0, 1, · · · , s,{

    dūdt

    = −d (t) ū (t) + 1J

    ∑|k|≤s rk (t) Γk + F (t) + G ,

    drkdt

    = 2 [−Γk ū (t)− d (t)] rk (t) + QMF ,kk (trR) , k = 0, 1, ..., s.

    First approach:

    I Get the unresolved variances in the mean dynamics from the equilibrium statistics

    G =trRs

    trRs,∞G∞, G∞ = dū∞ −

    1

    J

    ∑|k|≤s

    rk,∞Γk + F ;

    I Scale the nonlinear flux QMF ,kk (trRs ) by only resolved modes trRs =∑|k|≤s rk .

    Further improvements:

    I Get unresolved variances from linear first order correction from FDT

    rk,un ∼ rk,∞ + δr ′k = rk,∞ +∫ t

    0Rrk (t − s) δF

    ′ (s) ds;

    I Get the total variance from the scalar energy equation

    dE

    dt= −2d (t) E + F (t) · ū, trR = 2E (t)− ū2.

    Di Qi, and Andrew J. Majda (CIMS) Improving Prediction Skill of Imperfect Models Dec. 8, 2016 36 / 61

  • Tuning parameters of reduced models in a training regimeCOMPARISON OF PREDICTION SKILL OF REDUCED-ORDER GAUSSIAN CLOSURE MODELS 7

    0 50 100 150 2000

    0.1

    0.2

    0.3

    0.4GC1 F = 8, Information distance with reduced models

    corrected model

    original model

    σ2

    0 50 100 150 2000

    0.05

    0.1

    0.15GC1, corrected model

    signal errordispersion error

    0 50 100 150 2000

    0.1

    0.2

    0.3

    0.4GC1, original model

    signal errordispersion error

    (a) GC1, optimal �21 = 91,�22 = 61

    0 50 100 1500.02

    0.04

    0.06

    0.08

    0.1

    0.12GC2 F = 8, Information distance with reduced models

    corrected model

    original model

    ϵ

    0 50 100 1500.01

    0.02

    0.03

    0.04

    0.05GC2, corrected model

    signal errordispersion error

    0 50 100 1500.02

    0.04

    0.06

    0.08

    0.1

    0.12GC2, original model

    signal errordispersion error

    (b) GC2, optimal ✏1 =200, ✏2 = 20

    0 0.2 0.4 0.6 0.8 10.02

    0.04

    0.06

    0.08

    0.1

    0.12MQG F = 8, Information distance with reduced models

    corrected model

    original model

    ds0 0.2 0.4 0.6 0.8 1

    0

    0.01

    0.02

    0.03

    0.04

    0.05

    0.06ROMQG, corrected model

    signal errordispersion error

    0 0.2 0.4 0.6 0.8 10

    0.02

    0.04

    0.06

    0.08

    0.1

    0.12ROMQG, original model

    signal errordispersion error

    (c) ROMQG, optimalds1 = 0.455, ds2 = 0.145

    Figure 2.1. Tuning parameters in the training regime for GC1, GC2, and ROMQG. The originalmodel results are shown in red while the corrected model results are in blue. It can be seen thatwith the original model there is always large error in the mean (signal error) compared with therelatively small error from the variances in the resolved modes (dispersion error). The correctedmethods can effectively reduce the signal error. And the fitting process is almost to find the pointwhere the error from mean decreases to the minimum amount. The point where the optimal valueis achieved is marked by circles.

    in Figure 3.2-3.8 are compared. It can be seen that by adding the corrections from the global energy E, the errorin the variances are effectively reduced while large errors appear in the predictions for the mean. And if the linearresponse corrections for the mean dynamics are also considered, the overall improvements for both the mean andvariances are achieved with no further increasement for more computational costs. Specifically, we can summarizethe performance of these methods and the improvements through the corrections in the following several points:

    • The effects of the correction from the global energy equation for total variance can be shown by comparingthe dispersion error between GC1 and GC2. In GC1, we only use constant damping and noise at eachtime instant, while in GC2 the variance equations can be corrected by the scale factor using total variancerather than that in the 1-dimensional subspace. Comparing the dispersion errors between original modeland the corrected model, it can be seen that the energy correction can effectively improve the performanceof the prediction for variances in GC2 results (and also for the ROMQG results when energy correction isapplied), while GC1 has little improvement for the variance since no correction method can be applied;

    • The effects of the correction from linear response theory for mean dynamics can be seen by checking thesignal error part for all the three methods. When linear response corrections for the unresolved variances areadded to the corrected models, improvements can be seen for all three methods compared with the originalmodels with only steady state statistics applied. Especially for the case with also energy corrections, theaccuracy for the mean can in turn improve the accuracy in the global energy dynamics and further improvethe prediction for the variances. This can also be seen by comparing GC1 and GC2 results;

    Figure: Tuning parameters in the training regime for reduced-order GC1, GC2, and ROMQGwith s = 3. The corrected methods can effectively reduce the signal error.

    Di Qi, and Andrew J. Majda (CIMS) Improving Prediction Skill of Imperfect Models Dec. 8, 2016 37 / 61

  • Check model sensitivity to perturbations

    0 5 10 15 20 25 308

    8.1

    8.2

    8.3

    8.4

    8.5

    8.6

    8.7

    8.8

    8.9

    9upward ramp−type forcing

    time

    F

    (a) upward ramp-type forcing

    0 5 10 15 20 25 307.2

    7.3

    7.4

    7.5

    7.6

    7.7

    7.8

    7.9

    8downward ramp−type forcing

    time

    F

    (b) downward ramp-type forcing

    0 2 4 6 8 10 12 14 16 18 207.2

    7.4

    7.6

    7.8

    8

    8.2

    8.4

    8.6

    8.8

    9periodic forcing

    time

    F

    (c) periodic forcing

    0 2 4 6 8 10 12 14 16 18 206.5

    7

    7.5

    8

    8.5

    9random forcing

    time

    F

    (d) random forcing

    Di Qi, and Andrew J. Majda (CIMS) Improving Prediction Skill of Imperfect Models Dec. 8, 2016 38 / 61

  • Upward Ramp-type Forcing

    0 5 10 15 20 25 3062

    64

    66

    68

    70

    72GC2, Variance in principal direction, F = 8

    0 5 10 15 20 25 302.34

    2.36

    2.38

    2.4

    2.42

    2.44

    2.46GC2, State of the Mean, F =8

    truth

    original GC2

    corrected GC2

    0 5 10 15 20 25 3062

    64

    66

    68

    70

    72GC1, Variance in principal direction, F = 8

    0 5 10 15 20 25 302.3

    2.35

    2.4

    2.45

    2.5GC1, State of the Mean, F =8

    truth

    original GC1

    corrected GC1

    time0 5 10 15 20 25 30

    62

    64

    66

    68

    70

    72ROMQG, Variance in principal direction, F = 8

    time0 5 10 15 20 25 30

    2.3

    2.35

    2.4

    2.45

    2.5ROMQG, State of the Mean, F =8

    truth

    original ROMQG

    corrected ROMQG

    time0 5 10 15 20 25 30

    0

    0.005

    0.01

    0.015

    0.02

    0.025total information distance, original method

    GC2

    GC1

    ROMQG

    time0 5 10 15 20 25 30

    0

    0.005

    0.01

    0.015

    0.02

    0.025total information distance, corrected method

  • Information Error with Upward Ramp-type Forcing— Signal part & Dispersion part

    0 5 10 15 20 25 30

    ×10-3

    0

    0.2

    0.4

    0.6

    0.8

    1GC2, signal error

    original GC2

    corrected GC2

    0 5 10 15 20 25 30

    ×10-3

    0

    0.5

    1

    1.5

    2

    2.5

    3

    3.5GC2, dispersion error

    0 5 10 15 20 25 30

    ×10-3

    0

    1

    2

    3

    4

    5

    6

    7GC1, signal error

    original GC1

    corrected GC1

    0 5 10 15 20 25 300

    0.005

    0.01

    0.015

    0.02GC1, dispersion error

    time0 5 10 15 20 25 30

    0

    0.002

    0.004

    0.006

    0.008

    0.01

    0.012

    0.014ROMQG, dispersion error

    time0 5 10 15 20 25 30

    ×10-3

    0

    2

    4

    6

    8ROMQG, signal error

    original MQG

    corrected MQG

    Figure: Comparison of the information errors in signal and dispersion part separatelybetween the closure methods GC1, GC2, and MQG.

  • Downward Ramp-type Forcing

    0 5 10 15 20 25 3054

    56

    58

    60

    62

    64GC2, Variance in principal direction, F = 8

    0 5 10 15 20 25 302.2

    2.25

    2.3

    2.35GC2, State of the Mean, F =8

    truth

    original GC2

    corrected GC2

    time0 5 10 15 20 25 30

    54

    56

    58

    60

    62

    64GC1, Variance in principal direction, F = 8

    time0 5 10 15 20 25 30

    2.15

    2.2

    2.25

    2.3

    2.35GC1, State of the Mean, F =8

    truth

    original GC1

    corrected GC1

    0 5 10 15 20 25 3054

    56

    58

    60

    62

    64ROMQG, Variance in principal direction, F = 8

    0 5 10 15 20 25 302.15

    2.2

    2.25

    2.3

    2.35ROMQG, State of the Mean F =8

    truth

    original ROMQG

    corrected ROMQG

    time0 5 10 15 20 25 30

    0

    0.005

    0.01

    0.015

    0.02

    0.025

    0.03total information distance, original method

    GC2

    GC1

    ROMQG

    time0 5 10 15 20 25 30

    0

    0.005

    0.01

    0.015

    0.02

    0.025

    0.03total information distance, corrected method

  • Periodic Forcing

    0 2 4 6 8 10 12 14 16 18 2050

    55

    60

    65

    70

    75GC2, Variance in principal direction, F = 8

    0 2 4 6 8 10 12 14 16 18 202.1

    2.2

    2.3

    2.4

    2.5

    2.6GC2, State of the Mean, F =8

    time0 2 4 6 8 10 12 14 16 18 20

    50

    55

    60

    65

    70

    75GC1, Variance in principal direction, F = 8

    time0 2 4 6 8 10 12 14 16 18 20

    2.1

    2.2

    2.3

    2.4

    2.5

    2.6GC1, State of the Mean, F =8

    0 2 4 6 8 10 12 14 16 18 2050

    55

    60

    65

    70

    75ROMQG, Variance in principal direction, F = 8

    0 2 4 6 8 10 12 14 16 18 202.1

    2.2

    2.3

    2.4

    2.5

    2.6ROMQG, State of the Mean, F =8

    time0 2 4 6 8 10 12 14 16 18 20

    0

    0.005

    0.01

    0.015

    0.02

    0.025total information distance, original method

    time0 2 4 6 8 10 12 14 16 18 20

    0

    0.005

    0.01

    0.015

    0.02

    0.025total information distance, corrected method

  • Random Forcing

    0 2 4 6 8 10 12 14 16 18 2050

    55

    60

    65

    70

    75GC2, Variance in principal direction, F = 8

    0 2 4 6 8 10 12 14 16 18 202.1

    2.2

    2.3

    2.4

    2.5GC2, State of the Mean, F =8

    time0 2 4 6 8 10 12 14 16 18 20

    50

    55

    60

    65

    70

    75GC1, Variance in principal direction, F = 8

    time0 2 4 6 8 10 12 14 16 18 20

    2.1

    2.2

    2.3

    2.4

    2.5

    2.6GC1, State of the Mean, F =8

    0 2 4 6 8 10 12 14 16 18 2050

    55

    60

    65

    70

    75ROMQG, Variance in principal direction, F = 8

    0 2 4 6 8 10 12 14 16 18 202.1

    2.2

    2.3

    2.4

    2.5

    2.6ROMQG, State of the Mean, F =8

    time0 2 4 6 8 10 12 14 16 18 20

    0

    0.01

    0.02

    0.03

    0.04total information distance, original method

    time0 2 4 6 8 10 12 14 16 18 20

    0

    0.01

    0.02

    0.03

    0.04total information distance, corrected method

  • Check model sensitivity to perturbations in L-96 system

    time0 5 10 15 20 25 30

    0

    0.005

    0.01

    0.015

    0.02

    0.025total information distance, original method

    GC2

    GC1

    ROMQG

    time0 5 10 15 20 25 30

    0

    0.005

    0.01

    0.015

    0.02

    0.025total information distance, corrected method

    (g) climate change forcing

    time0 2 4 6 8 10 12 14 16 18 20

    0

    0.005

    0.01

    0.015

    0.02

    0.025total information distance, original method

    time0 2 4 6 8 10 12 14 16 18 20

    0

    0.005

    0.01

    0.015

    0.02

    0.025total information distance, corrected method

    (h) periodic forcing

    time0 2 4 6 8 10 12 14 16 18 20

    0

    0.01

    0.02

    0.03

    0.04total information distance, original method

    time0 2 4 6 8 10 12 14 16 18 20

    0

    0.01

    0.02

    0.03

    0.04total information distance, corrected method

    (i) random forcing

  • Outline

    1 Turbulent systems with quadratic nonlinearitiesGeneral setups for turbulent systems with quadratic nonlinearities

    2 Statistical dynamical models and closure methodsThe Gaussian closure method for statistical predictionSingle point statistics and information barrierImperfect model calibration and empirical information theoryForecast skill of closure methods for forced responses in L-96 model

    3 Strategies for reduced order models in low dimensional subspaceLow order models in reduced subspaceForecast skill of reduced models for forced responses

    4 Predicting statistical response in two-layer baroclinic turbulenceTwo-layer baroclinic turbulence in ocean and atmosphere regimesLow-dimensional reduced-order models for the two-layer systemImperfect model prediction skill with 2× 102 resolved modes for originalsystem with 2× 2562 dimension

    Di Qi, and Andrew J. Majda (CIMS) Improving Prediction Skill of Imperfect Models Dec. 8, 2016 45 / 61

  • The two-layer flow with forcing and dissipation

    The two-layer quasi-geostrophic model with baroclinic instability is one simplebut fully nonlinear fluid model capable in capturing the essential physics in oceanand atmosphere science.

    Two-layer model

    ∂qψ

    ∂t+ J

    (ψ, qψ

    )+ J (τ, qτ ) + β

    ∂ψ

    ∂x+ U

    ∂∆τ

    ∂x= −

    κ

    2∆ (ψ − τ)− ν∆s qψ + Fψ ,

    ∂qτ

    ∂t+ J (ψ, qτ ) + J

    (τ, qψ

    )+ β

    ∂τ

    ∂x+ U

    ∂x

    (∆ψ + k2dψ

    )= −

    κ

    2∆ (τ − ψ)− ν∆s qτ + Fτ .

    Barotropic and baroclinic modes:

    qψ = ∆ψ, ψ =1

    2(ψ1 + ψ2) ,

    qτ = ∆τ − k2dτ, τ =1

    2(ψ1 − ψ2) .

    Normalized energy-consistent modes:

    pψ,k =qψ,k|k| = − |k|ψk,

    pτ,k =qτ,k√|k|2 + k2d

    = −√|k|2 + k2dτk.

    Di Qi, and Andrew J. Majda (CIMS) Improving Prediction Skill of Imperfect Models Dec. 8, 2016 46 / 61

  • Flow in high-latitude homogeneous regimes

    regime N β kd U κ (kmin, kmax) σmax (kx , ky )max

    ocean, high lat. 256 10 10 1 9 (2.25, 14.61) 0.411 (4, 0)

    atmosphere, high lat. 256 1 4 0.2 0.2 (1.58, 6.78) 0.099 (2, 0)4 Reduced-order models with homogeneous mean flow 14

    linear growth rate, ocean regime

    -15 -10 -5 0 5 10 15zonal wavenumber

    -15

    -10

    -5

    0

    5

    10

    15

    me

    rid

    ion

    al w

    ave

    nu

    mb

    er

    0.05

    0.1

    0.15

    0.2

    0.25

    0.3

    0.35

    0.4nonlinear flux, sum of eigenvalues

    -10 -5 0 5 10zonal wavenumber

    -10

    -5

    0

    5

    10

    me

    rid

    ion

    al w

    ave

    nu

    mb

    er

    -1.5

    -1

    -0.5

    0

    0.5

    0 5 10 15-8

    -6

    -4

    -2

    0

    2linear growth rate

    0 5 10 15wavenumber

    -8

    -4

    0

    4

    8nonlinear flux eigenvalues

    (a) high-latitude ocean regime

    linear growth rate, atmosphere regime

    -6 -4 -2 0 2 4 6zonal wavenumber

    -6

    -4

    -2

    0

    2

    4

    6

    me

    rid

    ion

    al w

    ave

    nu

    mb

    er

    0.01

    0.02

    0.03

    0.04

    0.05

    0.06

    0.07

    0.08

    0.09nonlinear flux, sum of eigenvalues

    -4 -2 0 2 4zonal wavenumber

    -4

    -3

    -2

    -1

    0

    1

    2

    3

    4

    me

    rid

    ion

    al w

    ave

    nu

    mb

    er

    -0.1

    -0.05

    0

    0.05

    0.1

    0 1 2 3 4 5 6-0.5

    0

    0.5linear growth rate

    0 1 2 3 4 5 6wavenumber

    -0.2

    0

    0.2nonlinear flux eigenvalues

    (b) high-latitude atmosphere regime

    Fig. 4.4: Stability from linear analysis and nonlinear flux in ocean (upper) and atmosphere (lower) regime using param-eters in Table 1. The growth rate from linear analysis including Ekman damping effect, and the eigenvaluesof the nonlinear flux trQF,k in each wavenumber combining barotropic and baroclinic mode are displayed inthe two-dimensional spectral domain. The last column shows the radial averaged growth rate and nonlinearflux eigenvalues in positive and negative components.

    and validates the feasibility of using quasi-Gaussian approximation in calculating the linear response operators as a2⇥2 blocked system.

    4.2 Testing reduced-order models in homogeneous regime

    In the previous section we displayed the unperturbed statistical structures of the two-layer QG system in high-latituderegime with important nonlinear non-Gaussian features. The major task now is to test the reduced-order model skillsin predicting statistical responses to both stochastic and deterministic forcing perturbations as prescribed in (4.1) and(4.2) using only low-order closures. Only the large-scale modes |k| < 10 are calculated here, which cover the regime ofmost energetic directions. And to investigate the model sensitivity in each component, the perturbations in barotropicmode and baroclinic mode are applied individually in the tests. Three statistical quantities are of special importance

    in characterizing the two-layer system, that is, the barotropic energy,��py,k

    ��2, baroclinic energy��pt,k

    ��2, and the heatflux ikxy⇤k tk. Due to the homogeneous statistics as we have shown before, the mean states become zero and thus wecan focus on the second-order variances in this situation. Therefore we will majorly check the reduced-order method’sability in capturing the responses in these key quantities. Like the Algorithm summarized in Section 3.3, the modelingprocess are decomposed into a training phase for finding optimal model parameters and a prediction phase for gettingresponses to various perturbations.

    4.2.1 Training phase with linear response operator

    Equilibrium consistency for the reduced-order methods

    In testing the reduced-order models, we need to first guarantee climate consistency with the true unperturbed equilib-rium in statistical steady state. In the construction of low-order correction in Section 3.2, higher-order statistics fromequilibrium are combined with additional damping and noise corrections. It needs to be emphasized that neither theadditional damping and noise (3.7) nor the equilibrium high-order correction (3.6) is stable on its own even with the

    Di Qi, and Andrew J. Majda (CIMS) Improving Prediction Skill of Imperfect Models Dec. 8, 2016 47 / 61

  • Flow in low/mid-latitude regimes with zonal jets

    5 Reduced-order models with inhomogeneous jet flow 25

    0 20 40 60 80 100 120 1400

    50

    100

    150mean in relative vorticity

    barotropicbaroclinic

    100 101 102

    100

    variance in relative vorticity

    barotropicbaroclinic

    100 101 102

    100

    statistical energy

    barotropicbaroclinicpotential

    k-1.6

    k-3.6

    (a) ocean regime

    0 20 40 60 80 100 120 1400

    0.5

    1

    1.5

    2mean in relative vorticity

    barotropicbaroclinic

    100 101 102

    100

    variance in relative vorticity

    barotropicbaroclinic

    100 101 10210-10

    100

    statistical energy

    barotropicbaroclinicpotential

    k-3.5

    k-1.5

    (b) atmosphere regime

    Fig. 5.4: Time-averaged statistics (in radial average) in mean and second-order moments in low/mid-latitude regime.The first row compares the statistical mean states. The following two rows show the variances, and statisticalenergy, in barotropic and baroclinic modes, as well as the potential energy.

    0 5 10 15 20 25 30-1

    0

    1mode (6,3)

    0 5 10 15 20 25 30-1

    0

    1mode (6,2)

    0 5 10 15 20 25 30-1

    0

    1mode (6,1)

    -10 -5 0 5 100

    0.2

    0.4mode (6,3)

    -10 -5 0 5 100

    0.2

    0.4mode (6,2)

    -10 -5 0 5 100

    0.2

    0.4mode (6,1)

    (a) ocean

    0 10 20 30 40 50-1

    0

    1mode (1,0)

    0 50 100 150 200-1

    0

    1mode (0,1)

    0 50 100 150 200-1

    0

    1mode (0,2)

    -3 -2 -1 0 1 2 30

    0.5

    1mode (1,0)

    -3 -2 -1 0 1 2 30

    0.5

    1

    1.5mode (0,1)

    -3 -2 -1 0 1 2 30

    1

    2mode (0,2)

    (b) atmosphere

    Fig. 5.5: Autocorrelation functions and the probability distribution functions in low/mid-latitude ocean and atmo-sphere regime. The first three most energetic baroclinic modes are displayed. In the autocorrelations, thesolid lines show the real part while the dashed lines are the imaginary part of the functions. In the pdfs, thecorresponding Gaussian distributions with the same variance are also plotted in dashed black lines.

    Di Qi, and Andrew J. Majda (CIMS) Improving Prediction Skill of Imperfect Models Dec. 8, 2016 48 / 61

  • Exact statistical moment equations

    The rescaled set of equations of (1) can be summarized in the abstract form

    dpkdt

    = Bk (pk, pk) + (Lk −Dk) pk + Fk, pk = (pψ,k, pτ,k)T ,∑

    k

    pk · Bk (pk, pk) ≡ 0,

    where the normalized state variable pk =(pψ,k, pτ,k

    )Tis in barotropic and baroclinic mode, the

    linear operator is decomposed into non-symmetric part Lk involving β-effect and shear flow Uand dissipation part Dk, together with the forcing Fk combining deterministic component andstochastic component.

    Lk =

    ikxβ|k|2 − ikx U√1+(kd/|k|)2−ikx U 1−(kd/|k|)

    2√

    1+(kd/|k|)2ikxβ

    |k|2+k2d

    , Dk = κ2

    −1 1√1+(kd/|k|)21√

    1+(kd/|k|)2− 1

    1+(kd/|k|)2

    ,

    Fk =

    fψ,k|k| + σψ,kẆψ,k|k|fτ,k√|k|2+k2

    d

    +στ,kẆτ,k√|k|2+k2

    d

    .

    Di Qi, and Andrew J. Majda (CIMS) Improving Prediction Skill of Imperfect Models Dec. 8, 2016 49 / 61

  • Exact statistical moment equations

    The rescaled set of equations of (1) can be summarized in the abstract form

    dpkdt

    = Bk (pk, pk) + (Lk −Dk) pk + Fk, pk = (pψ,k, pτ,k)T ,∑

    k

    pk · Bk (pk, pk) ≡ 0,

    where the normalized state variable pk =(pψ,k, pτ,k

    )Tis in barotropic and baroclinic mode, the

    linear operator is decomposed into non-symmetric part Lk involving β-effect and shear flow Uand dissipation part Dk, together with the forcing Fk combining deterministic component andstochastic component.

    Bk (pk,pk) =

    [Bψ,kBτ,k

    ]

    =

    ∑m+n=k

    m⊥·n|k|

    (|n||m|pψ,mpψ,n +

    √|n|2+k2d|m|2+k2d

    pτ,mpτ,n

    )

    ∑m+n=k

    m⊥·n√|k|2+k2d

    (√|n|2+k2d|m| pψ,mpτ,n +

    |n|√|m|2+k2d

    pτ,mpψ,n

    )

    .

    Di Qi, and Andrew J. Majda (CIMS) Improving Prediction Skill of Imperfect Models Dec. 8, 2016 49 / 61

  • Exact statistical moment equations

    Statistical energy in each spectral mode

    Rk = pk∗pk =

    [|pψ,k|2 p∗ψ,kpτ,k

    pψ,kp∗τ,k |pτ,k|

    2

    ], p∗1,kp2,k = p̄

    ∗1,kp̄2,k + p

    ′∗1,kp′2,k.

    Rk combines the variability in both mean and variance. The true statisticaldynamical equations form a 2× 2 system about Rk ∈ C2×2�

    �dRkdt = (Lk −Dk) Rk + QF ,k + Qσ,k + c.c., |k| ≤ N,

    QF ,k = pk∗Bk (pk,pk) =

    [p∗ψ,kBψ,k p

    ∗ψ,kBτ,k

    p∗τ,kBψ,k p∗τ,kBτ,k

    ],∑

    k

    trQF ,k ≡ 0.

    Di Qi, and Andrew J. Majda (CIMS) Improving Prediction Skill of Imperfect Models Dec. 8, 2016 50 / 61

  • Exact statistical moment equations

    Statistical energy in each spectral mode

    Rk = pk∗pk =

    [|pψ,k|2 p∗ψ,kpτ,k

    pψ,kp∗τ,k |pτ,k|

    2

    ], p∗1,kp2,k = p̄

    ∗1,kp̄2,k + p

    ′∗1,kp′2,k.

    Rk combines the variability in both mean and variance. The true statisticaldynamical equations form a 2× 2 system about Rk ∈ C2×2�

    �dRkdt = (Lk −Dk) Rk + QF ,k + Qσ,k + c.c., |k| ≤ N,

    QF ,k = pk∗Bk (pk,pk) =

    [p∗ψ,kBψ,k p

    ∗ψ,kBτ,k

    p∗τ,kBψ,k p∗τ,kBτ,k

    ],∑

    k

    trQF ,k ≡ 0.

    Di Qi, and Andrew J. Majda (CIMS) Improving Prediction Skill of Imperfect Models Dec. 8, 2016 50 / 61

  • Statistical energy conservation principle

    The total statistical energy dynamical equation2 concerns the evolution of thetotal variability in mean and variance in response to external perturbations

    E =1

    2

    ∑1≤|k|≤N

    |k|2 |ψk|2 +(|k|2 + k2d

    )|τk|2 =

    1

    2

    ∑1≤|k|≤N

    |pψ,k|2 + |pτ,k|2.

    The exact dynamics for the statistical energy can be derived as

    dE

    dt+ Hf = −κE +

    κ

    2F − νH + Qσ.

    Hf is the meridional heat flux due to baroclinic instability, F is the additional damping effectsdue to the non-symmetry in Ekman drag only applied on the bottom layer

    Hf = k2d U

    ∫ψxτ = k

    2d U∑

    ikxψ∗k τk, F =∑

    k2d |τk|2 + 2 |k|2 Reψ∗k τk.

    2Majda, Statistical energy conservation principle for inhomogeneous turbulent dynamicalsystems, PNAS, 2016

    Di Qi, and Andrew J. Majda (CIMS) Improving Prediction Skill of Imperfect Models Dec. 8, 2016 51 / 61

  • Reduced-Order Statistical Energy ClosureThe true statistical model

    dRkdt

    = (Lk −Dk) Rk + QF ,k + Qσ,k + c.c., |k| ≤ N.

    Qψ,k =∑

    m+n=k

    m⊥ · n|k|

    |n||m|

    p∗ψ,kpψ,mpψ,n +

    √√√√ |n|2 + k2d|m|2 + k2d

    p∗ψ,kpτ,mpτ,n

    ,Qτ,k =

    ∑m+n=k

    m⊥ · n√|k|2 + k2d

    √|n|2 + k2d|m|

    p∗τ,kpψ,mpτ,n +|n|√|m|2 + k2d

    p∗τ,kpτ,mpψ,n

    .A preferred approach for the nonlinear flux QM,k combining both the detailedmodel energy mechanism and control over model sensitivity is proposed��

    QM,k = Q−M,k+Q+M,k = f1 (E ) [− (NM,k,eq + dM ) RM,k]+f2 (E )

    [Q+F ,k,eq + σ

    2M,k

    ].

    Higher-order corrections from equilibrium statistics:

    QF ,k,eq = Q−F ,k,eq + Q

    +F ,k,eq = − (Lk −Dk) Rk,eq + c.c., NM,k,eq =

    1

    2Q−F ,k,eqR

    −1k,eq.

    Additional damping and noise to model nonlinear flux:

    QaddM,k = −dM RM,k + σ2M,k, dM =[

    dM,ψ iωM−iωM dM,τ

    ].

    Statistical energy-consistent scaling to improve model sensitivity:

    f1 (E) =

    (E

    Eeq

    )1/2, f2 (E) =

    (E

    Eeq

    )3/2.

    Di Qi, and Andrew J. Majda (CIMS) Improving Prediction Skill of Imperfect Models Dec. 8, 2016 52 / 61

  • Reduced-Order Statistical Energy ClosureThe reduced-order approximation

    dRM,kdt

    = (Lk −Dk) RM,k + QM,k + Qσ,k + c.c., |k| ≤ M � N.

    A preferred approach for the nonlinear flux QM,k combining both the detailedmodel energy mechanism and control over model sensitivity is proposed��

    QM,k = Q−M,k+Q+M,k = f1 (E ) [− (NM,k,eq + dM ) RM,k]+f2 (E )

    [Q+F ,k,eq + σ

    2M,k

    ].

    Higher-order corrections from equilibrium statistics:

    QF ,k,eq = Q−F ,k,eq + Q

    +F ,k,eq = − (Lk −Dk) Rk,eq + c.c., NM,k,eq =

    1

    2Q−F ,k,eqR

    −1k,eq.

    Additional damping and noise to model nonlinear flux:

    QaddM,k = −dM RM,k + σ2M,k, dM =[

    dM,ψ iωM−iωM dM,τ

    ].

    Statistical energy-consistent scaling to improve model sensitivity:

    f1 (E) =

    (E

    Eeq

    )1/2, f2 (E) =

    (E

    Eeq

    )3/2.

    Di Qi, and Andrew J. Majda (CIMS) Improving Prediction Skill of Imperfect Models Dec. 8, 2016 52 / 61

  • Reduced-Order Statistical Energy Closure

    A preferred approach for the nonlinear flux QM,k combining both the detailedmodel energy mechanism and control over model sensitivity is proposed��

    QM,k = Q−M,k+Q+M,k = f1 (E ) [− (NM,k,eq + dM ) RM,k]+f2 (E )

    [Q+F ,k,eq + σ

    2M,k

    ].

    Higher-order corrections from equilibrium statistics:

    QF ,k,eq = Q−F ,k,eq + Q

    +F ,k,eq = − (Lk −Dk) Rk,eq + c.c., NM,k,eq =

    1

    2Q−F ,k,eqR

    −1k,eq.

    Additional damping and noise to model nonlinear flux:

    QaddM,k = −dM RM,k + σ2M,k, dM =[

    dM,ψ iωM−iωM dM,τ

    ].

    Statistical energy-consistent scaling to improve model sensitivity:

    f1 (E) =

    (E

    Eeq

    )1/2, f2 (E) =

    (E

    Eeq

    )3/2.

    Di Qi, and Andrew J. Majda (CIMS) Improving Prediction Skill of Imperfect Models Dec. 8, 2016 52 / 61

  • Climate fidelity for equilibrium

    Equilibrium fidelity refers to the convergence to the same final unperturbedstatistical equilibrium Req in the reduced-order models RM in each resolvedcomponent.

    Specifically, it requires that the model nonlinear flux correction term QMconverges to the truth, QM → QF ,eq, when no external perturbation is added

    dRM,eqdt

    = 0 = Lv (ūeq) RM,eq + RM,eqL∗v (ūeq) + Q

    MF ,eq + Qσ → RM,eq = Req.

    the first component(

    NM,eq,Q+F ,eq

    )comes from the true equilibrium

    statistics.

    climate consistency requires the second component correction makes nocontribution in the unperturbed case

    ΣM =1

    2dM Req, f1 (Eeq) = 1, f2 (Eeq) = 1.

    Di Qi, and Andrew J. Majda (CIMS) Improving Prediction Skill of Imperfect Models Dec. 8, 2016 53 / 61

  • Set-up for the numerical problem

    The true statistics is calculated by a pseudo-spectra code with 128 spectralmodes zonally and meridionally, corresponding to 256× 256× 2 grid points intotal.

    In the reduced-order methods, only the large-scale modes |k| ≤ 10 areresolved, which is about 0.15% of the full model resolution.

    External forcing in stochastic and deterministic component:

    The amplitude of the stochastic forcing σk Ẇk is introduced according to theequilibrium energy so that

    σ2ψ,k = δσ20 |qψ,k|2eq, σ2τ,k = δσ20 |qτ,k|

    2eq.

    The deterministic forcing is introduced through a perturbation in thebackground shear Uδ = U + δU

    δfψ,k = δUikx(− |k|2

    )τk, δfτ,k = δUikx

    (− |k|2 + k2d

    )ψk.

    Di Qi, and Andrew J. Majda (CIMS) Improving Prediction Skill of Imperfect Models Dec. 8, 2016 54 / 61

  • Tuning parameters in the training phase4 Reduced-order models with homogeneous mean flow 18

    tuning parameters with energy correction

    0.50.50.5

    0.5

    0 0.05 0.1 0.15 0.2

    0

    0.05

    0.1

    0.15

    0.2

    0

    0.5

    1

    1.5

    (a) tuning with energy scaling

    tuning parameters without energy correction

    2.5

    2.5

    2.5

    0 0.05 0.1 0.15 0.2

    0

    0.05

    0.1

    0.15

    0.2

    1.5

    2

    2.5

    3

    3.5

    (b) tuning without energy scaling

    0 1 2 3 4 5 6 7 8 9

    wavenumber

    0

    0.2

    0.4

    0.6

    0.8

    1information error

    w/ energy correctionw/o energy correction

    0 1 2 3 4 5 6 7 8 9

    wavenumber

    0

    2

    4

    6

    8baroclinic energy

    unperturbed spectrumperturbed responsew/ energy correctionw/o energy correction

    (c) prediction and info. errors

    Fig. 4.10: Tuning imperfect model parameters in the training phase. The information errors with varying model pa-rameters, dM =

    �dy ,dt

    �, are plotted for stochastic barotropic perturbation case. The errors using total

    energy as scalar factor from the statistical equation and method without the scaling factor are compared.The prediction skill and information error with and without using the total energy correction are comparedin the last row for a typical test case of perturbing the barotropic mode.

    corrections from low-order moments (that is, mean and variance) are used.The high-latitude ocean regime responses in barotropic and baroclinic energy and heat flux in large-scale wavenum-

    bers to stochastic perturbations are first shown in Figure 4.11. The perturbation amplitude is chosen as ds20 = 0.5 of theequilibrium energy in the stochastic forcing (4.1) so that the response is large and nonlinear. We compare the responsesin perturbing only the barotropic mode and baroclinic mode. The most energetic and most sensitive scales take place atwavenumbers |k| = 4,5,6. Both barotropic and baroclinic perturbations can lead to large changes in a wide spectrumin both barotropic and baroclinic component due to the strong coupling between the modes. In the reduced-order meth-ods, only the first large-scale modes |k| < 10 are resolved, while the responses in these dominant modes are all capturedwith accuracy in both perturbation cases though the complicated higher-order interactions with small-scale modes arenot computed explicitly. Further the time-series with the total statistical energy from the equation (3.11) are compared.The dashed black lines mark the level of energy in unperturbed and perturbed case. In this regime, the total statisticalenergy can also be recovered exactly with little error. This in turn explains the high skill of the reduced-order modelsin predicting this regime. Instead, if we only consider the energy in the resolved subspace shown by blue lines, a largegap can be observed compared with the total energy. Figure 4.12 shows the results in the high-latitude atmosphereregime. Alternating blocked and unblocked structures appear in this regime and generate quite complicated statisticalfeatures. The leading mode |k| = 1 contains most of the energy and becomes highly sensitive to perturbations. Thereduced-order method keeps the skill in capturing the responses in the most sensitive directions in this difficult regime.Also it is observed that the baroclinic perturbation case becomes a little less accurate in both spectra and total statisticalenergy. This might be due to the stronger nonlinear energy interactions from baroclinic to barotropic mode.

    A further test requires to check the model’s robustness in predicting perturbations with different amplitudes. Figure4.13 displays the prediction results with changing stochastic forcing amplitude ds20 in the barotropic modes. Thereduced-order model maintains the skill in predicting responses with various forcing strength, and the nonlinear trendsin the total resolved barotropic and baroclinic energy as well as the heat flux are captured compared with the linearprediction in the FDT shown by dashed lines.

    Figure: Tuning imperfect model parameters in the training phase. The information errorswith varying model parameters, dM = (dψ, dτ ), are plotted for stochastic barotropicperturbation case.

    Di Qi, and Andrew J. Majda (CIMS) Improving Prediction Skill of Imperfect Models Dec. 8, 2016 55 / 61

  • High-latitude: Stochastic perturbation

    The model is perturbed with stochastic forcing with variance amplitudeδσ20 = 0.5;

    Either the barotropic or the baroclinic modes in large scales |k| ≤ 10 areperturbed.

    4 Reduced-order models with homogeneous mean flow 19

    0 1 2 3 4 5 6 7 8 90

    5

    10barotropic energy

    unperturbedperturbed truthred. model

    0 1 2 3 4 5 6 7 8 90

    5

    10baroclinic energy

    0 1 2 3 4 5 6 7 8 9

    wavenumber

    -0.2

    -0.1

    0heat flux

    0 1 2 3 4 5 6 7 8 90

    5

    10barotropic energy

    unperturbedperturbed truthred. model

    0 1 2 3 4 5 6 7 8 90

    5

    10baroclinic energy

    0 1 2 3 4 5 6 7 8 9

    wavenumber

    -0.2

    -0.1

    0heat flux

    0 5 10 15 20 25 30

    time

    30

    35

    40

    45time-series of total energy, barotropic perturbation

    resolved

    total

    truth

    (a) barotropic perturbation

    0 5 10 15 20 25 30

    time

    30

    35

    40

    45time-series of total energy, baroclinic perturbation

    resolved

    total

    truth

    (b) baroclinic perturbation

    Fig. 4.11: Reduced-order model predictions to stochastic perturbations with amplitude ds20 = 0.5 in barotropic (left)and baroclinic (right) mode in high-latitude ocean regime. The spectra for the resolved modes 1 |k| < 10are compared. Black lines with circles show the perturbed model responses in the normalized variables,��py,k

    ��2 (barotropic energy),��pt,k

    ��2 (baroclinic energy), and ikxy⇤k tk (heat flux). The dashed black lines arethe unperturbed statistics. And the reduced order model predictions are in red lines. The last row shows themodel prediction of the energy equation in red lines and the energy in the resolved subspace shown in bluelines. For comparison, the unperturbed and perturbed total energy from the true system is shown in dashedblack lines.

    (a) ocean regime

    4 Reduced-order models with homogeneous mean flow 20

    0 1 2 3 4 5 6 7 8 90

    5

    10

    barotropic energy

    unperturbedperturbed truthred. model

    0 1 2 3 4 5 6 7 8 90

    2

    4

    baroclinic energy

    0 1 2 3 4 5 6 7 8 9

    wavenumber

    -0.2

    -0.1

    0heat flux

    0 1 2 3 4 5 6 7 8 90

    5

    10barotropic energy

    unperturbedperturbed truthred. model

    0 1 2 3 4 5 6 7 8 90

    2

    4baroclinic energy

    0 1 2 3 4 5 6 7 8 9

    wavenumber

    -0.2

    -0.1

    0heat flux

    0 50 100 150 200

    time

    2

    4

    6

    8

    time-series of total energy, barotropic perturbation

    resolved

    total

    truth

    (a) barotropic perturbation

    0 50 100 150 200

    time

    3

    4

    5

    6

    7time-series of total energy, baroclinic perturbation

    resolved

    total

    truth

    (b) baroclinic perturbation

    Fig. 4.12: Reduced-order model predictions to stochastic perturbations with amplitude ds20 = 0.5 in barotropic (left)and baroclinic (right) mode in high-latitude atmosphere regime. The spectra for the resolved modes 1 |k| < 10 are compared. The last row shows the model prediction of the energy equation in red lines and theenergy in the resolved subspace shown in blue lines. For comparison, the unperturbed and perturbed totalenergy from the true system is shown in dashed black lines.

    0 1 2 3 4 5 6 7 8 90

    2

    4

    6

    8barotropic energy

    0 1 2 3 4 5 6 7 8 90

    2

    4

    6

    8baroclinic energy

    0 1 2 3 4 5 6 7 8 9

    wavenumber

    -0.2

    -0.15

    -0.1

    -0.05

    0heat flux

    (a) response in spectra

    0 0.2 0.4 0.6 0.8

    25

    30

    35

    response in barotropic energy

    0 0.2 0.4 0.6 0.836

    38

    40

    42

    44

    46

    response in baroclinic energy

    0 0.2 0.4 0.6 0.8

    δσ2

    -0.9

    -0.8

    -0.7

    -0.6response in heat flux

    (b) total responses

    Fig. 4.13: Imperfect model predictions to responses with changing perturbation amplitude ds20 in the high-latitudeocean regime (with barotropic perturbation). In the first part on the left we the predicted spectra with threedifferent perturbation amplitude, ds20 = 0.1,0.5,0.8, are shown. On the right the responses in total energyand heat flux with changing amplitudes ds20 2 [0,0.8] are plotted. For clarification in display, we only plotreduced model predictions by red markers and the truth is in black lines.

    (b) atmosphere regime

    Di Qi, and Andrew J. Majda (CIMS) Improving Prediction Skill of Imperfect Models Dec. 8, 2016 56 / 61

  • High-latitude: Mean shear flow perturbation

    The model is perturbed by changing the background zonal flow strength U;

    The entire spectral is perturbed due to the mean flow advection in eachspectral mode.

    5 Reduced-order models with inhomogeneous jet flow 21

    0 1 2 3 4 5 6 7 8 90

    5

    10barotropic energy

    0 1 2 3 4 5 6 7 8 90

    5

    10baroclinic energy

    0 1 2 3 4 5 6 7 8 9

    wavenumber

    -0.2

    -0.1

    0heat flux

    (a) U = 1.05

    0 1 2 3 4 5 6 7 8 90

    2

    4

    6barotropic energy

    0 1 2 3 4 5 6 7 8 90

    5

    10baroclinic energy

    0 1 2 3 4 5 6 7 8 9

    wavenumber

    -0.2

    -0.1

    0heat flux

    (b) U = 0.95

    Fig. 4.14: Reduced-order model predictions to mean shear flow perturbation dU = ±0.05 (that is, 5% of the originalvalue U0) in the ocean regime. The spectra for the resolved modes 1 |k| < 10 are compared. Black lineswith circles show the perturbed model responses in the normalized barotropic energy, baroclinic energy, andheat flux. The dashed black lines are the unperturbed statistics. And the reduced order model predictionsare in red lines.

    Model responses to the perturbed mean shear dU

    In checking the model responses to deterministic forcing, we introduce the forcing perturbation by changing the back-ground jet strength U as in (4.2). The same perturbation is tested in [24] for a more complicated reduced-order modifiedquasi-Gaussian closure (RoMQG), and we test the same perturbation form here under our systematic reduced-ordermodeling framework. Note that the deterministic perturbation in (4.2) forms a more difficult test case compared withthe stochastic forcing (4.1) because the forcing is applied along all wavenumbers with stronger mean-fluctuation in-teractions involved. On the other hand, for the reduced order methods, only the perturbations at the limited resolvedmodes are quantified. This gives the inherent difficulty for applying the reduced order models to this kind of perturba-tions since we have no knowledge of the unresolved modes where large amount of energy is contained. Therefore thestatistical energy equation (3.11) plays a crucial role.

    The results with mean flow perturbations dU = ±0.05 in the ocean regime and perturbations dU = 0.02,�0.01 inthe atmosphere regime are shown in Figure 4.14 and 4.15 separately. The perturbation accounts for about 5%-10% ofthe original shear strength U , and the correspon