is yawning contagious video - mr. brinkhus'...

Is Yawning Contagious video

10

34= .29

4

16= .25

𝑃 𝑦𝑎𝑤𝑛 𝑠𝑒𝑒𝑑

𝑃 𝑦𝑎𝑤𝑛 𝑛𝑜 𝑠𝑒𝑒𝑑

.29 − .25 = .04

../Mythbusters Top 10 Is Yawning Contagious Video at YourDiscovery.com.flv

No, maybe this occurred purely by chance.

50

subjects

Random

Assignment

Group 1

(34)

Group 2

(16)

Treatment 2

(no yawn seed)

Treatment 1

(yawn seed)

Compare

Yawning

What is this? 𝐻0

Get in your groups.

difference in proportions

𝑝-value =

http://www.rossmanchance.com/applets/ChiSqShuffle.html?yawning=1

𝑝-value = _________, not surprising at all.

Not statistically significant.

We don’t have enough evidence to prove that yawning is

contagious.

𝑝 1 𝑝1

𝑝1

𝑝1 1 − 𝑝1

𝑛1

N 𝑝1, 𝜎𝑝 1

𝑝1

𝜎𝑝 1 =

𝑝 2 𝑝2

𝑝2

𝑝2 1 − 𝑝2

𝑛2

N 𝑝2, 𝜎𝑝 2

𝑝2

𝜎𝑝 2 =

Check: 𝑛1𝑝1 ≥ 10

𝑛1 1 − 𝑝1 ≥ 10

Check: 𝑛2𝑝2 ≥ 10

𝑛2 1 − 𝑝2 ≥ 10

𝑝 1 − 𝑝 2 𝑝1 − 𝑝2

𝑝1 − 𝑝2

𝑝1 1 − 𝑝1

𝑛1+

𝑝2 1 − 𝑝2

𝑛2

N 𝑝1 − 𝑝2, 𝜎𝑝 1−𝑝 2

𝑝1 − 𝑝2

𝜎𝑝 1−𝑝 2 =

Check: 𝑛1𝑝1 ≥ 10 𝑛2𝑝2 ≥ 10

𝑛1 1 − 𝑝1 ≥ 10 𝑛2 1 − 𝑝2 ≥ 10 &

We want to estimate the true difference 𝑝1 − 𝑝2 at a 95%

confidence level.

𝑝 1 =114

150= .76 𝑝 2 =

62

100= .62 𝑝 1 − 𝑝 2 = .14

𝑝1 → true proportion of ECRCHS students who do math HW

𝑝2 → true proportion of Taft students who do math HW

Two-sample 𝑧 interval for 𝑝1 − 𝑝2

Random:

Normal:

Independent:

“random sample of 150 students from ECRCHS” and

“random sample of 100 Taft students”

𝑛1𝑝 1 ≥ 10

𝑛1 1 − 𝑝 1 ≥ 10

→ 150 .76 = 114 ≥ 10

→ 150 .24 = 36 ≥ 10

So the sampling distribution of 𝑝 1 − 𝑝 2 is approximately normal.

Two things to check:

1) The two samples need to be independent of each other.

2) Individual observations in each sample have to be independent.

When sampling without replacement for both samples, must

check 10% condition for both.

We clearly have two independent samples – one from each school.

𝑛2𝑝 2 ≥ 10

𝑛2 1 − 𝑝 2 ≥ 10

→ 100 .62 = 62 ≥ 10

→ 100 .38 = 38 ≥ 10

There are at least 10 150 = 1500 students at ECRCHS and

at least 10 100 = 1000 students at Taft.

𝑝 1 − 𝑝 2 ± 𝑧∗ 𝑝 1 1 − 𝑝 1

𝑛1+

𝑝 2 1 − 𝑝 2𝑛2

Estimate ± Margin of Error

. 76 − .62 ± 1.96

.14 ± 0.117

0.023, 0.257

With calculator:

114

0.95

0.02286, 0.25714

STAT TESTS 2-PropZInt (B)

𝑝 1 =114

150= .76 𝑝 2 =

62

100= .62 𝑝 1 − 𝑝 2 = .14

x1:

n1:

x2:

n2:

C-Level:

.76 .24

150+

.62 .38

100

Standard Error

150

62

100

We are 95% confident that the interval from 0.023 to 0.257

captures the true difference in proportions 𝑝1 − 𝑝2 of

ECRCHS and Taft students who do their math HW.

This suggests that 2.3% to 25.7% more students at

ECRCHS do math HW than Taft students.

We want to estimate the true difference 𝑝1 − 𝑝2 at a 90%

confidence level.

𝑝 1 = .63 𝑝 2 = .68 𝑝 1 − 𝑝 2 = −.05

𝑝1 → true proportion of teens who say they go online every day

𝑝2 → true proportion of adults who say they go online every day

Two-sample 𝑧 interval for 𝑝1 − 𝑝2

Random:

Normal:

Independent:

“random sample of 800 teens and a separate sample

of 2253 adults”

𝑛1𝑝 1 ≥ 10

𝑛1 1 − 𝑝 1 ≥ 10

→ 800 .63 = 504 ≥ 10

→ 800 .37 = 296 ≥ 10


We clearly have two independent samples – one of

teens and one of adults.

𝑛2𝑝 2 ≥ 10

𝑛2 1 − 𝑝 2 ≥ 10

→ 2253 .68 = 1532 ≥ 10

→ 2253 .32 = 721 ≥ 10

There are at least 10 800 = 8000 U.S. teens and

at least 10 2253 = 22530 U.S. adults.

𝑝 1 − 𝑝 2 ± 𝑧∗ 𝑝 1 1 − 𝑝 1

𝑛1+

𝑝 2 1 − 𝑝 2𝑛2

Estimate ± Margin of Error

. 63 − .68 ± 1.645

−0.05 ± 0.0324

−0.0824,−0.0176

With calculator:

504

0.9

−0.0824,−0.0176

STAT TESTS 2-PropZInt (B)

x1:

n1:

x2:

n2:

C-Level:

.63 .37

800+

.68 .32

2253

Standard Error

800

1532

2253

𝑝 1 = .63 𝑝 2 = .68 𝑝 1 − 𝑝 2 = −.05

We are 90% confident that the interval from −.0824 to

− .0176 captures the true difference in proportions

𝑝1 − 𝑝2 of teens and adults who go online every day.

This suggests that 1.76% to 8.24% more adults are online

every day than teens.

A LOT of writing!!

Don’t write too big.

State:

𝐻0:

𝐻𝑎:

𝑝1 − 𝑝2 = 0

𝑝1 → proportion of AP Calc seniors who are going to college

𝛼 = 0.05

𝑝 1 =98

110= 0.89

𝑝2 → proportion of Pre-Calc seniors who are going to college

𝑝1 − 𝑝2 > 0

𝑝1 = 𝑝2 OR

𝑝1 > 𝑝2 OR 𝑝 2 =

68

80= 0.85

𝑝 1 − 𝑝 2 = .89 − .85 = .04

Plan: Two sample 𝑧 test for 𝑝1 − 𝑝2

Random:

Normal:

Independent:

“SRS of 80 students currently taking Pre-Calc” and

“SRS of 110 students currently taking AP Calculus”

𝑛1𝑝 1 ≥ 10

𝑛1 1 − 𝑝 1 ≥ 10

→ 110 .89 = 98 ≥ 10

→ 110 .11 = 12 ≥ 10


Two things to check:

1) The two samples need to be independent of each other.

2) Individual observations in each sample have to be independent.

When sampling without replacement for both samples, must

check 10% condition for both. We clearly have two independent samples – one from each class level.

𝑛2𝑝 2 ≥ 10

𝑛2 1 − 𝑝 2 ≥ 10

→ 80 .85 = 68 ≥ 10

→ 80 .15 = 12 ≥ 10

There are at least 10 80 = 800 Pre-Calc students and at least

10 110 = 1100 AP Calculus students in California.

Do: Sampling Distribution of 𝑝 1 − 𝑝 2

N 0, ______

0

𝜎𝑝 1−𝑝 2 =𝑝1 1 − 𝑝1

𝑛1+

𝑝2 1 − 𝑝2

𝑛2

Normally, we would use

this formula for the

standard deviation:

However, since we’re assuming 𝐻0: 𝑝1 = 𝑝2 is true, we’ll be using a

different value in the formula.

The two parameters are the same, so we will call their common value 𝑝.

In order to estimate 𝑝, we can get a more precise estimate when we use

more data, so it makes sense to combine the data from the two samples.

This “pooled (or combined) sample proportion” is

=count of successes in both samples

count of individuals in both samples=

𝑥1 + 𝑥2

𝑛1 + 𝑛2 𝑝 𝑐

We will use this in place of both 𝑝1 and 𝑝2


N 0, ______

0

𝜎𝑝 1−𝑝 2 =𝑝 1 1 − 𝑝 1

𝑛1+

𝑝 2 1 − 𝑝 2𝑛2

=𝑝 𝑐 1 − 𝑝 𝑐

𝑛1+

𝑝 𝑐 1 − 𝑝 𝑐𝑛2

.049

0.04

𝑧 =𝑠𝑡𝑎𝑡𝑖𝑠𝑡𝑖𝑐 − 𝑝𝑎𝑟𝑎𝑚𝑒𝑡𝑒𝑟

𝑠𝑡. 𝑑𝑒𝑣. 𝑜𝑓 𝑠𝑡𝑎𝑡𝑖𝑠𝑡𝑖𝑐 =

𝑝 1 − 𝑝 2 − 𝑝1 − 𝑝2

𝜎𝑝 1−𝑝 2

=.04 − 0

0.049= .82

𝑎𝑟𝑒𝑎 = .2061

𝑝-value normalcdf .04, 99999, 0, .049 = 0.2071

𝑝 𝑐 =𝑥1 + 𝑥2

𝑛1 + 𝑛2=

98 + 68

110 + 80

=166

190

= .874

=.874 .126

110+

.874 .126

80= .049

𝑝-value

𝑝 1 − 𝑝 2 = .04

With calculator:

98

STAT TESTS 2-PropZTest (6)

x1:

n1:

x2:

n2:

p1: ≠p2 <p2 >p2

110

68

80

𝑧 = 0.838

𝑝 = 0.201

𝑝 1 = 0.89

𝑝 2 = 0.85

𝑝 = .874

𝑛1 = 110

𝑛2 = 80

𝑝-value

pooled

proportion

Conclude:

Assuming 𝐻0 is true 𝑝1 = 𝑝2 , there is a .2061 probability of obtaining

a 𝑝 1 − 𝑝 2 value of .04 or more purely by chance. This provides weak

evidence against 𝐻0 and is not statistically significant at 𝛼 = .05 level

Therefore, we fail to reject 𝐻0 and cannot conclude

that seniors taking AP Calc are more likely to attend college next

.2061 > .05 .

year than seniors taking Pre-Calc.

Observational study

no treatments

imposed

No

senior math students in California What population can we target?

State:

𝐻0:

𝐻𝑎:

𝑝1 − 𝑝2 = 0

𝑝1 → true proportion who quit smoking to get money

𝛼 = 0.05

𝑝 1 = 0.15

𝑝2 → true proportion who quit smoking traditional way

𝑝1 − 𝑝2 > 0

𝑝1 = 𝑝2 OR

𝑝1 > 𝑝2 OR

𝑝 2 = 0.05

𝑝 1 − 𝑝 2 = .15 − .05 = .10

Plan: Two sample 𝑧 test for 𝑝1 − 𝑝2

Random:

Normal:

Independent:

“subjects were randomly assigned to treatments”

𝑛1𝑝 1 ≥ 10

𝑛1 1 − 𝑝 1 ≥ 10

→ 439 .15 = 66 ≥ 10

→ 439 .85 = 373 ≥ 10


Due to random assignment, these two groups can be viewed as

independent.

𝑛2𝑝 2 ≥ 10

𝑛2 1 − 𝑝 2 ≥ 10

→ 439 .05 = 22 ≥ 10

→ 439 .95 = 417 ≥ 10

Individual observations in each group should also be independent:

knowing whether one subject quits smoking gives no information

whether another subject does.

𝑛1 = 𝑛2 =878

2

= 439

No 10% condition since there was no sampling.


N 0, ________

0

𝜎𝑝 1−𝑝 2 =𝑝 𝑐 1 − 𝑝 𝑐

𝑛1+

𝑝 𝑐 1 − 𝑝 𝑐𝑛2

=.10 .90

439+

.10 .90

439=. 0202

.0202

0.10

𝑧 =𝑝 1 − 𝑝 2 − 𝑝1 − 𝑝2

𝜎𝑝 1−𝑝 2

=.15 − .05 − 0

0.0202= 4.95

𝑎𝑟𝑒𝑎 ≈ 0

𝑝-value

normalcdf .10, 99999, 0, .0202 = 0

𝑝 𝑐 =𝑥1 + 𝑥2

𝑛1 + 𝑛2=

66 + 22

878

=88

878

= .10

𝑝-value

𝑝 1 − 𝑝 2 = .10

With calculator:

66

STAT TESTS 2-PropZTest (6)

x1:

n1:

x2:

n2:

p1: ≠p2 <p2 >p2

439

22

439

𝑧 = 4.945

𝑝 = 0

𝑝 1 = 0.15

𝑝 2 = 0.05

𝑝 = .10

𝑛1 = 439

𝑛2 = 439

𝑝-value

pooled

proportion

Conclude:

Assuming 𝐻0 is true 𝑝1 = 𝑝2 , there is a 0 probability of obtaining

a 𝑝 1 − 𝑝 2 value of .10 or more purely by chance. This provides strong

evidence against 𝐻0 and is statistically significant at 𝛼 = .05 level

Therefore, we reject 𝐻0 and can conclude that financial

incentive helps people quit smoking.

0 < .05 .

Experiment

treatments were

imposed

Yes

Not a random sample, so conclusion only

holds for the people in the experiment

is yawning contagious video - mr. brinkhus'...

Documents