uq & sensitivity analysis - for correlated variables · 12/18 inpractice,... i...

UQ & Sensitivity analysisfor correlated variables

Anne Eggels

CASA dayApril 19, 2017

1/18

Uncorrelated data

For uncorrelated data,

Independent variables ⇒ stochastic collocation

Number of nodes k grows exponentiallywith number of variables p.

2/18

Wind and wave data

Our data is correlated!

Dependent variables⇒ clustering-based collocation (CBC)

3/18

ResultI The method was tested by integrating test

functions over different data sets.

I Higher correlation leads to smaller error.

Example with 50 nodes and 105

data points

4/18

What now?

Collocation points and weights are known.

Output can be computed.

What to do with the output?

5/18

Goals of sensitivity analysis (1/2)

Increased understanding of the relationships betweeninput and output variables in a system or model

I Which input variables influence the output and howmuch?

I How to detect/quantify correlations between inputvariables?

I Causality

6/18

Goals of sensitivity analysis (2/2)

Improved uncertainty analysis to better estimate theoutput probability distribution function

I More weight to more important variables

I What to do with groups of correlated variables?

7/18

Example - x,y ∼ U[0, 1], f(x,y) = f(x) = x2

8/18

Adaptive CBC

I Apply CBC with a fraction of the computationalbudget

I Apply SA and determine important variables

I Re-apply CBC with the rest of the budget on onlythe important variables

I Iterate?

Remark: if iterations are used, then variables can notbe added again.

9/18

Example for SA - minimum spanning trees

L = 0.0995 L = 3.3695

10/18

Reference distribution

11/18

Method for SA - minimum spanning trees

I Length of the minimum spanning tree is an inversemeasure for correlation

I Include the weights of the clusters

I Compare length to reference length

I Use only “important” variables in the collocationmethod

12/18

In practice, ...

I we would like to compute the MST on the completedataset

I output data is far from complete

Solution:

I Interpolate the output data

I Use the previous method based on CBC forinput/output-relations

I Use a multilevel method (with corrections) forinput/input-relations

13/18

Mutual information

I Measure of mutual dependence between tworandom variables

I “Amount of information” obtained about onerandom variable by knowing the other

I I(X,Y) = HS(X) +HS(Y)−HS(X,Y)

I HS(·) Shannon entropy (measure of information /unpredictability)

I Easy for distributions, hard for datasets

14/18

Minimum spanning trees and mutualdependence

I α-entropy:

Hα(f) =1

1− αlog

(∫Ωfα(ω)dω

)for α ∈ (0, 1)

I Length of MST converges (within a constant) to the1/2-entropy!

log(

LNβ√N

)→ H1/2

I Hα(X) = Hα(Y) for all X,Y =⇒ Hα(X,Y) measure fordependence

15/18

How to handle groups of important variables?

Clique idea (future work)

16/18

Testcase - wind op zee

Dataset: OWEZ (Offshore Windpark Egmond aan Zee)

Model: wake model for Horns Rev

Real input does not match restrictionsof model on input data!

17/18

Current and future work

I How to handle groups of important variables?(clique idea)

I How to determine which variables are importantenough?

I How to allocate simulation budget to first andsecond collocation?

I Real-life tests

18/18

Conclusions

I We developed a new method for sensitivity analysiswhich works for correlated variables.

I The method has a theoretical basis, but detailshave to be worked out.

I When combined with uncertainty analysis, this canlead to more insight in complex problems.

1/1

Questions?

This research is supported by the Dutch Technology Foundation STW, whichis part of the Netherlands Organisation for Scientific Research (NWO), and

which is partly funded by the Ministry of Economic Affairs.

uq & sensitivity analysis - for correlated variables · 12/18 inpractice,... i...

Documents