uq & sensitivity analysis - for correlated variables · 12/18 inpractice,... i...
TRANSCRIPT
UQ & Sensitivity analysisfor correlated variables
Anne Eggels
CASA dayApril 19, 2017
1/18
Uncorrelated data
For uncorrelated data,
Independent variables ⇒ stochastic collocation
Number of nodes k grows exponentiallywith number of variables p.
2/18
Wind and wave data
Our data is correlated!
Dependent variables⇒ clustering-based collocation (CBC)
3/18
ResultI The method was tested by integrating test
functions over different data sets.
I Higher correlation leads to smaller error.
Example with 50 nodes and 105
data points
4/18
What now?
Collocation points and weights are known.
Output can be computed.
What to do with the output?
5/18
Goals of sensitivity analysis (1/2)
Increased understanding of the relationships betweeninput and output variables in a system or model
I Which input variables influence the output and howmuch?
I How to detect/quantify correlations between inputvariables?
I Causality
6/18
Goals of sensitivity analysis (2/2)
Improved uncertainty analysis to better estimate theoutput probability distribution function
I More weight to more important variables
I What to do with groups of correlated variables?
7/18
Example - x,y ∼ U[0, 1], f(x,y) = f(x) = x2
8/18
Adaptive CBC
I Apply CBC with a fraction of the computationalbudget
I Apply SA and determine important variables
I Re-apply CBC with the rest of the budget on onlythe important variables
I Iterate?
Remark: if iterations are used, then variables can notbe added again.
9/18
Example for SA - minimum spanning trees
L = 0.0995 L = 3.3695
10/18
Reference distribution
11/18
Method for SA - minimum spanning trees
I Length of the minimum spanning tree is an inversemeasure for correlation
I Include the weights of the clusters
I Compare length to reference length
I Use only “important” variables in the collocationmethod
12/18
In practice, ...
I we would like to compute the MST on the completedataset
I output data is far from complete
Solution:
I Interpolate the output data
I Use the previous method based on CBC forinput/output-relations
I Use a multilevel method (with corrections) forinput/input-relations
13/18
Mutual information
I Measure of mutual dependence between tworandom variables
I “Amount of information” obtained about onerandom variable by knowing the other
I I(X,Y) = HS(X) +HS(Y)−HS(X,Y)
I HS(·) Shannon entropy (measure of information /unpredictability)
I Easy for distributions, hard for datasets
14/18
Minimum spanning trees and mutualdependence
I α-entropy:
Hα(f) =1
1− αlog
(∫Ωfα(ω)dω
)for α ∈ (0, 1)
I Length of MST converges (within a constant) to the1/2-entropy!
log(
LNβ√N
)→ H1/2
I Hα(X) = Hα(Y) for all X,Y =⇒ Hα(X,Y) measure fordependence
15/18
How to handle groups of important variables?
Clique idea (future work)
16/18
Testcase - wind op zee
Dataset: OWEZ (Offshore Windpark Egmond aan Zee)
Model: wake model for Horns Rev
Real input does not match restrictionsof model on input data!
17/18
Current and future work
I How to handle groups of important variables?(clique idea)
I How to determine which variables are importantenough?
I How to allocate simulation budget to first andsecond collocation?
I Real-life tests
18/18
Conclusions
I We developed a new method for sensitivity analysiswhich works for correlated variables.
I The method has a theoretical basis, but detailshave to be worked out.
I When combined with uncertainty analysis, this canlead to more insight in complex problems.
1/1
Questions?
This research is supported by the Dutch Technology Foundation STW, whichis part of the Netherlands Organisation for Scientific Research (NWO), and
which is partly funded by the Ministry of Economic Affairs.