today: lab 2 due monday: quizz 4 wed: a3 due friday: lab 3 due mon oct 1: exam i this room, 12 pm

31
• Today: lab 2 due • Monday: Quizz 4 • Wed: A3 due • Friday: Lab 3 due • Mon Oct 1: Exam I this room, 12 pm

Upload: ruby-fowler

Post on 17-Dec-2015

214 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: Today: lab 2 due Monday: Quizz 4 Wed: A3 due Friday: Lab 3 due Mon Oct 1: Exam I  this room, 12 pm

• Today: lab 2 due

• Monday: Quizz 4

• Wed: A3 due

• Friday: Lab 3 due

• Mon Oct 1: Exam I this room, 12 pm

Page 2: Today: lab 2 due Monday: Quizz 4 Wed: A3 due Friday: Lab 3 due Mon Oct 1: Exam I  this room, 12 pm

Recap last lecture Ch 6.1

•Empirical frequency distributions

•Discrete

•Continuous

•Four forms

•F(Q=k), F(Q=k)/n, F(Qqk), F(Qqk)/n

•Four uses

•Summarization gives clue to process

•Summarization useful for comparisons

•Used to make statistical decisions

•Reliability evaluation

Page 3: Today: lab 2 due Monday: Quizz 4 Wed: A3 due Friday: Lab 3 due Mon Oct 1: Exam I  this room, 12 pm

Today

Read lecture notes!

Page 4: Today: lab 2 due Monday: Quizz 4 Wed: A3 due Friday: Lab 3 due Mon Oct 1: Exam I  this room, 12 pm

Age of mothers

Fre

qu

en

cy

15 20 25 30 35 40 45

02

46

81

01

2

Distribution of ages of mothers Sample: students that attended class in 1997

Population: MUN students Unknown distribution

Page 5: Today: lab 2 due Monday: Quizz 4 Wed: A3 due Friday: Lab 3 due Mon Oct 1: Exam I  this room, 12 pm

Distribution of ages of mothers Sample: students that attended class in 1997

Population: MUN students Unknown distribution

Solution: use theoretical frequency dist to characterize pop

Assumption: observations are distributed in the same way as theoretical dist

Theoretical distribution is a model of a frequency distribution

Page 6: Today: lab 2 due Monday: Quizz 4 Wed: A3 due Friday: Lab 3 due Mon Oct 1: Exam I  this room, 12 pm

Commonly used theoretical dist:

Discrete

Binomial

Multinomial

Poisson

Negative binomial

Hypergeometric

Uniform

Continuous

Normal

Chi-square (2)

t

F

Log-normal

Gamma

Cauchy

Weibull

Uniform

Page 7: Today: lab 2 due Monday: Quizz 4 Wed: A3 due Friday: Lab 3 due Mon Oct 1: Exam I  this room, 12 pm

Commonly used theoretical dist:

Discrete

Binomial

Multinomial

Poisson

Negative binomial

Hypergeometric

Uniform

Continuous

Normal

Chi-square (2)

t

F

Log-normal

Gamma

Cauchy

Weibull

Uniform

Page 8: Today: lab 2 due Monday: Quizz 4 Wed: A3 due Friday: Lab 3 due Mon Oct 1: Exam I  this room, 12 pm

Theoretical frequency distributions

4 forms

Empirical

(n=sample)

Theoretical

(N=pop discrete)

Theoretical

(N=pop continuous)

Page 9: Today: lab 2 due Monday: Quizz 4 Wed: A3 due Friday: Lab 3 due Mon Oct 1: Exam I  this room, 12 pm

Theoretical frequency distributions - 4 uses

1. Clue to underlying process

If an empirical dist fits one of the following, this suggests the kind of mechanism that generated the data

a) Uniform dist

e.g. # of people per table mechanism: all outcomes have equal prob

b) Normal dist

e.g. oxygen intake per day mechanism: several independent factors, no prevailing factor

Page 10: Today: lab 2 due Monday: Quizz 4 Wed: A3 due Friday: Lab 3 due Mon Oct 1: Exam I  this room, 12 pm

Theoretical frequency distributions - 4 uses

1. Clue to underlying process

c) Poisson dist

e.g. # of deaths by horsekick in the Prussian army, per year mechanism: rare & random event

c) Binomial dist

e.g. # of heads/tails on coin toss mechanism: yes/no outcome

Page 11: Today: lab 2 due Monday: Quizz 4 Wed: A3 due Friday: Lab 3 due Mon Oct 1: Exam I  this room, 12 pm

Theoretical frequency distributions - 4 uses

2. Summarize data dist info contained in parameters

e.g. number of events per unit space or time can be summarized as the expected value of a Poisson dist

Page 12: Today: lab 2 due Monday: Quizz 4 Wed: A3 due Friday: Lab 3 due Mon Oct 1: Exam I  this room, 12 pm

Theoretical frequency distributions - 4 uses

2. Summarize data

e.g. number of events per unit space or time can be summarized as the expected value of a Poisson dist

Can make comparisons

Page 13: Today: lab 2 due Monday: Quizz 4 Wed: A3 due Friday: Lab 3 due Mon Oct 1: Exam I  this room, 12 pm

Theoretical frequency distributions - 4 uses

3. Decision making. Use theoretical dist to calculate p-value

Page 14: Today: lab 2 due Monday: Quizz 4 Wed: A3 due Friday: Lab 3 due Mon Oct 1: Exam I  this room, 12 pm

Theoretical frequency distributions - 4 uses

3. Decision making. Use theoretical dist to calculate p-value

p(X1qx)

p(X2>x)

Page 15: Today: lab 2 due Monday: Quizz 4 Wed: A3 due Friday: Lab 3 due Mon Oct 1: Exam I  this room, 12 pm

Theoretical frequency distributions - 4 uses

3. Decision making. Use theoretical dist to calculate p-value

p(X1qx)

MiniTab: cdf

R: pnorm()

Page 16: Today: lab 2 due Monday: Quizz 4 Wed: A3 due Friday: Lab 3 due Mon Oct 1: Exam I  this room, 12 pm

Theoretical frequency distributions - 4 uses

4. Reliability. Put probability range around outcome

Page 17: Today: lab 2 due Monday: Quizz 4 Wed: A3 due Friday: Lab 3 due Mon Oct 1: Exam I  this room, 12 pm

Theoretical frequency distributions - 4 uses

4. Reliability. Put probability range around outcome

MiniTab: invcdf

R: qnorm()

Page 18: Today: lab 2 due Monday: Quizz 4 Wed: A3 due Friday: Lab 3 due Mon Oct 1: Exam I  this room, 12 pm

Computing probabilities from observed vs theoretical dist

Theoretical

Advantages Disadvantages

EasyAssumptions may not apply

wrong p-values

Familiar Checking assumptions is laborious

Recipes, known performance

Empirical

Advantages Disadvantages

No assumptions Computation

Easy to defend Not always easy to carry out

Page 19: Today: lab 2 due Monday: Quizz 4 Wed: A3 due Friday: Lab 3 due Mon Oct 1: Exam I  this room, 12 pm

Ch 6.3 Fit of Observed to Theoretical

Will present 2 examples: 1 continuous, 1 discrete

More examples in lecture notes

Page 20: Today: lab 2 due Monday: Quizz 4 Wed: A3 due Friday: Lab 3 due Mon Oct 1: Exam I  this room, 12 pm

Ch 6.3 Fit of Observed to Theoretical

Example 1 (Poisson)

Number of coal mining disasters, 1851-1866 (England)

NDisaster = [4 5 4 1 0 4 3 4 0 6 3 3 4 0 2 4]

sum(N)=47

k = [0 1 2 3 4 5 6] = outcomes(N)

n = 16 observations

k F(N=k)

0

1

2

3

4

5

6

n

Nsum )(̂

Page 21: Today: lab 2 due Monday: Quizz 4 Wed: A3 due Friday: Lab 3 due Mon Oct 1: Exam I  this room, 12 pm

Example 1 (Poisson)

Number of coal mining disasters, 1851-1866 (England)

k F(N=k) F(N=k)/n

0 3 0.1875

1 1 0.0625

2 1 0.0625

3 3 0.1875

4 6 0.3750

5 1 0.0625

6 1 0.0625

2.9375 47/16 ̂

Page 22: Today: lab 2 due Monday: Quizz 4 Wed: A3 due Friday: Lab 3 due Mon Oct 1: Exam I  this room, 12 pm

Example 1 (Poisson)

Number of coal mining disasters, 1851-1866 (England)

k F(N=k) F(N=k)/n Pr(N=k)

0 3 0.1875

1 1 0.0625

2 1 0.0625

3 3 0.1875

4 6 0.3750

5 1 0.0625

6 1 0.0625

2.9375 47/16 ̂

!)Pr(

k

ekN

k

Page 23: Today: lab 2 due Monday: Quizz 4 Wed: A3 due Friday: Lab 3 due Mon Oct 1: Exam I  this room, 12 pm

Example 1 (Poisson)

Number of coal mining disasters, 1851-1866 (England)

k F(N=k) F(N=k)/n Pr(N=k)

0 3 0.1875 0.053

1 1 0.0625 0.1557

2 1 0.0625 0.2287

3 3 0.1875 0.2239

4 6 0.3750 0.1644

5 1 0.0625 0.0966

6 1 0.0625 0.0473

2.9375 47/16 ̂

!)Pr(

k

ekN

k

Page 24: Today: lab 2 due Monday: Quizz 4 Wed: A3 due Friday: Lab 3 due Mon Oct 1: Exam I  this room, 12 pm

Example 1 (Poisson)

Number of coal mining disasters, 1851-1866 (England)

k F(N=k) F(N=k)/n Pr(N=k) Obs-Exp

0 3 0.1875 0.053

1 1 0.0625 0.1557

2 1 0.0625 0.2287

3 3 0.1875 0.2239

4 6 0.3750 0.1644

5 1 0.0625 0.0966

6 1 0.0625 0.0473

2.9375 47/16 ̂

!)Pr(

k

ekN

k

Page 25: Today: lab 2 due Monday: Quizz 4 Wed: A3 due Friday: Lab 3 due Mon Oct 1: Exam I  this room, 12 pm

Example 1 (Poisson)

Number of coal mining disasters, 1851-1866 (England)

k F(N=k) F(N=k)/n Pr(N=k) Obs-Exp

0 3 0.1875 0.053 0.1345

1 1 0.0625 0.1557 -0.0932

2 1 0.0625 0.2287 -0.1662

3 3 0.1875 0.2239 -0.0364

4 6 0.3750 0.1644 0.2106

5 1 0.0625 0.0966 -0.0341

6 1 0.0625 0.0473 0.0152

2.9375 47/16 ̂

!)Pr(

k

ekN

k

Page 26: Today: lab 2 due Monday: Quizz 4 Wed: A3 due Friday: Lab 3 due Mon Oct 1: Exam I  this room, 12 pm

Example 2 (Normal)

Age of mothers of students in quant 1997

Are the ages normally distributed?

Age of mothers

Fre

qu

en

cy

15 20 25 30 35 40 45

02

46

81

01

2

Page 27: Today: lab 2 due Monday: Quizz 4 Wed: A3 due Friday: Lab 3 due Mon Oct 1: Exam I  this room, 12 pm

Example 2 (Normal)

Age of mothers of students in quant 1997

Are the ages normally distributed?

Page 28: Today: lab 2 due Monday: Quizz 4 Wed: A3 due Friday: Lab 3 due Mon Oct 1: Exam I  this room, 12 pm

Example 2 (Normal)

Age of mothers of students in quant 1997

Are the ages normally distributed?

Strategy work with probability plots compute cdf

Page 29: Today: lab 2 due Monday: Quizz 4 Wed: A3 due Friday: Lab 3 due Mon Oct 1: Exam I  this room, 12 pm

Example 2 (Normal)

Age of mothers of students in quant 1997

Are the ages normally distributed?

Strategy work with probability plots compute cdf

2

2

1

2

1)Pr(

X

exAge

Expected distribution:

Page 30: Today: lab 2 due Monday: Quizz 4 Wed: A3 due Friday: Lab 3 due Mon Oct 1: Exam I  this room, 12 pm

Example 2 (Normal)

Age of mothers of students in quant 1997

Are the ages normally distributed?

Strategy work with probability plots compute cdf

2

2

1

2

1)Pr(

X

exAge

Expected distribution:

Page 31: Today: lab 2 due Monday: Quizz 4 Wed: A3 due Friday: Lab 3 due Mon Oct 1: Exam I  this room, 12 pm

Example 2 (Normal)

Age of mothers of students in quant 1997

Are the ages normally distributed?

Strategy work with probability plots compute cdf

-2 -1 0 1 2

20

25

30

35

40

Normal Q-Q Plot

Theoretical Quantiles

Sam

ple

Qua

ntile

s