continuous distribution functions

50
Continuous Distribution Functions Jake Blanchard Spring 2010 Uncertainty Analysis for Engineers 1

Upload: conley

Post on 23-Feb-2016

45 views

Category:

Documents


0 download

DESCRIPTION

Continuous Distribution Functions. Jake Blanchard Spring 2010. The Normal Distribution. This is probably the most famous of all distributions At one time, many felt that this was the underlying distribution of nature and that it would govern all measurements - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Continuous Distribution Functions

Uncertainty Analysis for Engineers 1

Continuous Distribution FunctionsJake BlanchardSpring 2010

Page 2: Continuous Distribution Functions

Uncertainty Analysis for Engineers 2

The Normal DistributionThis is probably the most famous of all

distributionsAt one time, many felt that this was the

underlying distribution of nature and that it would govern all measurements

It is also called the Gaussian distributionMany random variables are not well-

represented by this distribution, so its popularity is not always warranted

Since limits are +/- infinity, this distribution is problematic in some situations

Page 3: Continuous Distribution Functions

Uncertainty Analysis for Engineers 3

For ExampleSuppose we measure the height of

many people and want to represent the data with a normal distribution

Obviously, the distribution will predict a finite probability for negative heights, which makes no sense

On the other hand, a height of 0 will be several standard deviations from the mean, so the error will be negligible

In some cases, we can just truncate the predictions

Page 4: Continuous Distribution Functions

Uncertainty Analysis for Engineers 4

Normal distribution

x

xxf 2

2

2exp

21)(

-6.00 -4.00 -2.00 0.00 2.00 4.00 6.000.00

0.05

0.10

0.15

0.20

0.25

0.30

0.35

0.40

0.45sigma=1,mu=0sigma=1, mu=2sigma=1.2, mu=0

x

f(x)

3

0

24

3

stdmean

Page 5: Continuous Distribution Functions

Uncertainty Analysis for Engineers 5

Central Limit TheoremOne of the key reasons the

normal distribution is the CLTIt states that the distribution of

the mean of n independent observations from any distribution with finite mean and variance will approach a normal distribution for large n

Page 6: Continuous Distribution Functions

Uncertainty Analysis for Engineers 6

ExamplesHere are some examples of

phenomena that are believed to follow normal distributions◦Particle velocities in a gas◦Scores on intelligence tests◦Average temperatures in a particular

location◦Random electrical noise◦Instrumentation error

Page 7: Continuous Distribution Functions

Uncertainty Analysis for Engineers 7

The Half-Normal DistributionUseful when we are interested in

deviations from the mean

x

xxf

02

exp2)( 2

2

2

Page 8: Continuous Distribution Functions

Uncertainty Analysis for Engineers 8

Half-Normal Distribution

0 0.5 1 1.5 2 2.5 3 3.50

0.2

0.4

0.6

0.8

1

1.2

1.4

1.6

1.8

Var(x)=0.5Var(x)=1Var(x)=2

x

f(x)

Page 9: Continuous Distribution Functions

Uncertainty Analysis for Engineers 9

When would we use this?Suppose we build a flywheel from

two parts.It is important that they be

nearly the same weightWe measure only the difference

in weightThis will be positive and is likely

to be normally distributed, with the bulk of the results near 0

The half-normal distribution should work

Page 10: Continuous Distribution Functions

Uncertainty Analysis for Engineers 10

Bivariate Normal DistributionJoint distribution

yx

y

xx

y

x

y

y

yx

yx

x

x

yx

yxCov

yVar

xExVar

yE

dxdyyxxfxE

yyxxyxf

),(

)(

)(

)(

),()(

2121exp

141),(

2

22

22

22222

Page 11: Continuous Distribution Functions

Uncertainty Analysis for Engineers 11

What if x and y are not correlated?The correlation coefficient will be

0The joint pdf becomes the

product of two separate normal distributions and x and y can be considered independent

Be careful, lack of correlation does not always imply independence

Page 12: Continuous Distribution Functions

Uncertainty Analysis for Engineers 12

The Gamma DistributionFor random variables bounded at one

end

Peak is at x=0 for less than or equal to 1

CDF is known as incomplete gamma function

dxex

x

xxxf

x

0

1

1

0,,exp

)(

Page 13: Continuous Distribution Functions

Uncertainty Analysis for Engineers 13

Gamma distribution

2

Var

mean

Page 14: Continuous Distribution Functions

Uncertainty Analysis for Engineers 14

Facts on Gamma DistributionAppropriate for time required for

a total of exactly independent events to take place if events occur at a constant rate 1/

Has been used for storm durations, time between storms, downtime for offsite power supplies (nuclear)

Page 15: Continuous Distribution Functions

Uncertainty Analysis for Engineers 15

ExamplesIf a part is ordered in lots of size and

demand for individual parts is 1/ , then time between depletions is gamma

System time to failure is gamma if system failure occurs as soon as exactly sub-failures have taken place and sub-failures occur at the rate 1/

The time between maintenance operations of an instrument that needs recalibration after uses is gamma under appropriate conditions

Some phenomena, such as capacitor failure and family income are empirically gamma, though not theoretically

Page 16: Continuous Distribution Functions

Uncertainty Analysis for Engineers 16

PracticeA ferry boat departs for a trip

across a river as soon as exactly 9 cars are loaded. Cars arrive independently at a rate of 6 per hour. What is probability that the time between consecutive trips will be less than one hour? What is the time between departures that has a 1% probability of being exceeded?

Page 17: Continuous Distribution Functions

Uncertainty Analysis for Engineers 17

SolutionTime between departures is gamma.=9 cars, 1/ =6 per hourEvaluate F(1) numerically

Matlab◦gamcdf(1,9,1/6)◦F=0.153◦Or◦f=@(x) x.^8.*exp(-6*x)◦6^9/gamma(9)*quad(f,0,1)

1

0

89

)6exp()9(

6)1( dtttF

Page 18: Continuous Distribution Functions

Uncertainty Analysis for Engineers 18

Solution continuedSolve this for x

gaminv(0.99,9,1/6)Solution is x=2.9 hoursThat is, the chances are 1 in 100

that the time between departures will exceed 2.9 hours

xtdtet

0

689

)9(699.0

Page 19: Continuous Distribution Functions

Uncertainty Analysis for Engineers 19

Generalized gamma distributionWe can redefine the gamma

distribution to be 0 below some value ()

elsewhere

xxx

xf0

exp)(

)(

1

Page 20: Continuous Distribution Functions

Uncertainty Analysis for Engineers 20

Exponential DistributionThis is just a gamma distribution

with =1 and =1/

Page 21: Continuous Distribution Functions

Uncertainty Analysis for Engineers 21

Exponential distribution

xxF

xxxf

exp1)(

0exp)(

1

1

std

mean

Page 22: Continuous Distribution Functions

Uncertainty Analysis for Engineers 22

Facts (Exp Distribution)Useful for time interval between

successive, random, independent events that occur at constant rates◦Time between equipment failures,

accidents, storms, etc.Given our discussion of the gamma

distribution, this distribution is a good model for the time for a single outcome to take places if events occur independently at a constant rate

Page 23: Continuous Distribution Functions

Uncertainty Analysis for Engineers 23

ExampleIf particles arrive independently

at a counter at a rate of 2 per second, what is the probability that a particle will arrive in 1 second?

=2F(1)=1-exp(-2*1)=0.865

Page 24: Continuous Distribution Functions

Uncertainty Analysis for Engineers 24

Beta DistributionsThis is useful when x is bounded on

both ends

x is bounded between 0 and 1f can be u-shaped, single-peaked, J-

shaped, etc.The CDF is the incomplete beta

function

21

2121

2121

11

,

0,10,1)(

21

B

xB

xxxf

Page 25: Continuous Distribution Functions

Uncertainty Analysis for Engineers 25

Beta distribution

1212

21

21

21

1

Var

mean

Page 26: Continuous Distribution Functions

Uncertainty Analysis for Engineers 26

Facts (Beta Distribution)The many shapes this distribution can take

on make it quite versatileOften used to represent judgments about

uncertaintyCan be used to represent fraction of time

individuals spend engaging in various activities

…or fraction of time soil is available for dermal contact by humans (as opposed to being covered by soil and ice)

…or fraction of time individual spends indoors

Page 27: Continuous Distribution Functions

Uncertainty Analysis for Engineers 27

More ExamplesA measuring device allows the lengths of

only the shortest and longest units in a sample to be recorded. 15 units are selected at random from a large lot. What is the probability that at least 90% have lot lengths between the recorded values?

20 electron tubes are tested until, at time t, the first one fails. What is the probability that at least 75% of the tubes will survive beyond t? Here, 1=1 and 2=0

Page 28: Continuous Distribution Functions

Uncertainty Analysis for Engineers 28

Uniform DistributionActually a special case of the

beta distribution (1=1 and 2=1)

Page 29: Continuous Distribution Functions

Uncertainty Analysis for Engineers 29

Uniform distribution

bxaab

xf

1)(

12

22abVar

bamean

Page 30: Continuous Distribution Functions

Uncertainty Analysis for Engineers 30

Lognormal DistributionThe natural log of the random

variable follows a normal distribution

It can be modified to be 0 before some non-zero value of x

x

xx

xf

0

2)ln(exp

21)( 2

2

Page 31: Continuous Distribution Functions

Uncertainty Analysis for Engineers 31

Lognormal DistributionIt can be used as a model for a

process whose value results from the multiplication of many small errors in a manner similar to the addition of many instances we discussed with respect to the normal distribution

The product of n independent, positive variates approaches a log-normal distribution for large n

Page 32: Continuous Distribution Functions

Uncertainty Analysis for Engineers 32

Lognormal distribution

1exp21exp

22

2

x

x

Var

mean

Page 33: Continuous Distribution Functions

Uncertainty Analysis for Engineers 33

Lognormal factsGood for

◦chemical concentrations in the environment, deterioration of engineered systems, etc.

◦asymmetric uncertainties◦processes where observed value is

random proportion of previous valueIt is “tail-heavy”

Page 34: Continuous Distribution Functions

Uncertainty Analysis for Engineers 34

ExamplesDistribution of personal incomesDistribution of size of organism

whose growth is subject to many small impulses, the effect of each being proportional to the instantaneous size

Distribution of particle sizes from breakage

Page 35: Continuous Distribution Functions

Uncertainty Analysis for Engineers 35

Statistical Models in Life TestingTime-to-failure models are a common

application of probability distributionsWe can define a hazard function as

where f and F are the pdf and CDF for the time to failure, respectively

h(t)dt represents the proportion of items surviving at time t that fail at time t+dt

)(1)()(tFtfth

Page 36: Continuous Distribution Functions

Uncertainty Analysis for Engineers 36

Hazard FunctionsA typical hazard function is the

so-called bathtub curve, which is high at the beginning and end of the life cycle

Uniform distribution – U(0,1)

Exponential distribution

tth

11)(

t

t

eeth )(

Probability of failure during a specified

interval is constant

Page 37: Continuous Distribution Functions

Uncertainty Analysis for Engineers 37

Weibull DistributionThis is a generalization of the

exponential distribution, but, for time-to-failure problems, the probability of failure is not constant

LxxF

Lx

LxLxxf

exp1)(

;0,

exp)(1

Page 38: Continuous Distribution Functions

Uncertainty Analysis for Engineers 38

Weibull distribution

1121

11

22Var

mean

0.00 0.50 1.00 1.50 2.00 2.50 3.000.0

0.2

0.4

0.6

0.8

1.0

1.2

1.4

alpha=1, beta=1, L=03, 1, 01, 3, 01, 1, 1

x

f(x)

Page 39: Continuous Distribution Functions

Uncertainty Analysis for Engineers 39

Weibull FactsUseful for time to completion or

time to failureCan skew negative or positiveLess tail-heavy than lognormal

Page 40: Continuous Distribution Functions

Uncertainty Analysis for Engineers 40

Extreme Value DistributionsHere we are interested in the

distribution of the “largest” or “smallest” element in a group

For example, ◦What is the largest wind gust an

airplane can expect?

Page 41: Continuous Distribution Functions

Uncertainty Analysis for Engineers 41

Types of EV DistributionsType I (Gumbel) for maximum

valuesType1 (Gumbel) for minimum

valuesType III (Weibull) for minimum

values

Page 42: Continuous Distribution Functions

Uncertainty Analysis for Engineers 42

The Gumbel DistributionLimiting model as n approaches

infinity for the distribution of the maximum of n independent values from an initial distribution whose right tail is unbounded and is “exponential” ◦original distribution could be

exponential, normal, lognormal, gamma, etc. – all have proper characteristics

Page 43: Continuous Distribution Functions

Uncertainty Analysis for Engineers 43

Type I EVCan represent

◦Time to failure of circuit with n elements in parallel

◦Yearly maximum of daily water discharges for a particular river at a particular point

◦Yearly maximum of the Dow Jones Index

◦Deepest corrosion pit expected in a metal exposed to a corrosive liquid for a given time?

Page 44: Continuous Distribution Functions

Uncertainty Analysis for Engineers 44

Type I (Gumbel)

Lxth

LxLxxf

exp1)(

expexpexp1)(

Page 45: Continuous Distribution Functions

Uncertainty Analysis for Engineers 45

Gumbel distribution

22

6

577216.0

Var

Lmean

Page 46: Continuous Distribution Functions

Uncertainty Analysis for Engineers 46

ExampleThe maximum demand for electric

power at any time during a year in a given locality is related to extreme weather conditions

Assume it follows a Type 1 distribution with L=2000 kW and =1000 kW.

A power station needs to know the probability that demand will exceed 4000 kW at any time in a year and the demand that has only a 1/20 probability of being exceeded in a year

Page 47: Continuous Distribution Functions

Uncertainty Analysis for Engineers 47

Solutionevcdf(4000,2000,1000)=0.9994For second part, solve F(y)=0.95

for yevinv(0.95,2000,1000)Result is 3097 kW

Page 48: Continuous Distribution Functions

Uncertainty Analysis for Engineers 48

Type IIIThis is the Weibull distributionIt is the limiting model as n approaches

infinity for the distribution of the minimum of n values from various distributions bounded at the left

The gamma distribution is an exampleFor example, a circuit with components

in series with individual failure distributions that are gamma, then the Type III EV distribution is appropriate

Page 49: Continuous Distribution Functions

Uncertainty Analysis for Engineers 49

Other examplesFailure strength of materialsDrought analyses

Page 50: Continuous Distribution Functions

Uncertainty Analysis for Engineers 50

ObservationsThe log of the weibull distribution is

distributed as a minimum value Type I

These extreme value distributions are only valid in the limit of large n – convergence depends on initial distributions◦10 samples can be adequate for initial

distributions that are exponential◦It may take as many as 100 for normal

distributions