thursday, september 12, 2013 effect size, power, and exam review

57
Thursday, September 12, 2013 Effect Size, Power, and Exam Review

Upload: amice-webb

Post on 18-Dec-2015

216 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: Thursday, September 12, 2013 Effect Size, Power, and Exam Review

Thursday, September 12, 2013

Effect Size, Power, and Exam Review

Page 2: Thursday, September 12, 2013 Effect Size, Power, and Exam Review

Last Time:

Page 3: Thursday, September 12, 2013 Effect Size, Power, and Exam Review

Which tail? Above or Below?

• In hypothesis testing, we are interested in extreme values of our test statistics.

• We are interested in the tails where p-values are very low. Probability is always <.50 in the tail (by definition)

• For 2-tailed tests, we reject H0 when we encounter values more extreme than +/-Z (higher than +Z or lower than –Z)

• For 1-tailed tests, we reject H0 when we encounter extreme values consistent with HA

Page 4: Thursday, September 12, 2013 Effect Size, Power, and Exam Review

One-Tailed

H0:

HA:

Page 5: Thursday, September 12, 2013 Effect Size, Power, and Exam Review

One-Tailed

H0:

HA:

If is the control condition (the original, null, blue distribution) and is the treatment condition (the alternative, new, green distribution), HA predicts that a sample mean (M) from population 2 will be smaller than the population mean (μ) from population 1. M- would be negative, so a negative Z is consistent with HA, and would lead to rejection of H0 (if the value is sufficiently extreme).

Page 6: Thursday, September 12, 2013 Effect Size, Power, and Exam Review

One-Tailed

H0:

HA:

If is the control condition (the original, null, blue distribution) and is the treatment condition (the alternative, new, green distribution), H0 predicts that a sample mean (M) from population 2 will be equal to or larger than the population mean (μ) from population 1. M- would be 0 or positive, so a positive Z is consistent with H0, and would not lead to rejection of H0, no matter how extreme.

Page 7: Thursday, September 12, 2013 Effect Size, Power, and Exam Review

One-tailed vs. 2-tailed

• One-tailed – if the results are in the direction your theory predicts, then an extreme value of Z (in the tail) will lead you to reject H0 regardless of whether Z is positive or negative.

• Two-tailed – It does not matter whether Z is positive or negative. You reject H0 if the absolute value of Z is sufficiently extreme (in either tail)

Page 8: Thursday, September 12, 2013 Effect Size, Power, and Exam Review

Finishing up chapter 8 (some loose ends)

• Error types (quick review from last time)• Effect size: Cohen’s d• Statistical Power Analysis

Page 9: Thursday, September 12, 2013 Effect Size, Power, and Exam Review

Performing your statistical test

Real world (‘truth’)

H0 is correct

H0 is wrong

Real world (‘truth’)

H0 is correct

H0 is wrong

Type I error

Type II error

So there is only one distribution

Real world (‘truth’)

H0 is correct

H0 is wrong

Type I error

Type II error

So there are two distributionsThe original (null) distribution

The new (treatment) distribution

The original (null) distribution

Page 10: Thursday, September 12, 2013 Effect Size, Power, and Exam Review

Performing your statistical test

Real world (‘truth’)

H0 is correct

H0 is wrong

So there is only one distribution

Real world (‘truth’)

H0 is correct

H0 is wrong

Type I error

Type II error

So there are two distributionsThe original (null) distribution

The new (treatment) distribution

The original (null) distribution

Page 11: Thursday, September 12, 2013 Effect Size, Power, and Exam Review

Effect Size• Hypothesis test tells

us whether the observed difference is probably due to chance or not

• It does not tell us how big the difference is

H0 is wrong

Real world (‘truth’)

H0 is correct

H0 is wrong

Type I error

Type II error

So there are two distributionsThe original (null) distribution

The new (treatment) distribution

– Effect size tells us how much the two populations don’t overlap

Page 12: Thursday, September 12, 2013 Effect Size, Power, and Exam Review

Effect Size

• Figuring effect size

The original (null) distribution

The new (treatment) distribution

– Effect size tells us how much the two populations don’t overlap

But this is tied to the particular units of measurement

Page 13: Thursday, September 12, 2013 Effect Size, Power, and Exam Review

Effect Size

The original (null) distribution

The new (treatment) distribution

• Standardized effect size

– Effect size tells us how much the two populations don’t overlap

– Puts into neutral units for comparison (same logic as z-scores)Cohen’s d

Page 14: Thursday, September 12, 2013 Effect Size, Power, and Exam Review

Effect Size

The original (null) distribution

The new (treatment) distribution

• Effect size conventions– small d = .2– medium d = .5– large d = .8

– Effect size tells us how much the two populations don’t overlap

Page 15: Thursday, September 12, 2013 Effect Size, Power, and Exam Review

Error types

Real world (‘truth’)

H0 is correct

H0 is wrong

Experimenter’s conclusions

Reject H0

Fail to Reject H0

I conclude that there is an effect

I can’t detect an effect

There really isn’t an effect

There really isan effect

Page 16: Thursday, September 12, 2013 Effect Size, Power, and Exam Review

Error types

Real world (‘truth’)

H0 is correct

H0 is wrong

Experimenter’s conclusions

Reject H0

Fail to Reject H0

Type I error

Type II error

Type I error (α): concluding that there is a difference between groups (“an effect”) when there really isn’t.

Type II error (β): concluding that

there isn’t an effect, when there really is.

Page 17: Thursday, September 12, 2013 Effect Size, Power, and Exam Review

Statistical Power

• The probability of making a Type II error is related to Statistical Power– Statistical Power: The probability that the

study will produce a statistically significant results if the research hypothesis is true (there is an effect)

• So how do we compute this?

Page 18: Thursday, September 12, 2013 Effect Size, Power, and Exam Review

Statistical Power

H0: is true (is no treatment effect)Real world (‘truth’)

Fail to reject H0Reject H0

Real world (‘truth’)

H0 is correct

H0 is wrong

Type I error

Type II error

α = 0.05

The original (null) distribution

Page 19: Thursday, September 12, 2013 Effect Size, Power, and Exam Review

Statistical Power

H0: is false (is a treatment effect)

α = 0.05

Reject H0

Real world (‘truth’)

H0 is correct

H0 is wrong

Type I error

Type II error The original (null) distribution

Real world (‘truth’)

The new (treatment) distributionThe new (treatment) distribution

Fail to reject H0

Page 20: Thursday, September 12, 2013 Effect Size, Power, and Exam Review

Statistical Power

Fail to reject H0Reject H0

Real world (‘truth’)

H0 is correct

H0 is wrong

Type I error

Type II error

β = probability of a Type II error

The new (treatment) distribution

H0: is false (is a treatment effect)

The original (null) distribution

Real world (‘truth’)

α = 0.05

Failing to Reject H0, even though there is

a treatment effect

Page 21: Thursday, September 12, 2013 Effect Size, Power, and Exam Review

Statistical Power

Fail to reject H0Reject H0

Real world (‘truth’)

H0 is correct

H0 is wrong

Type I error

Type II error

Power = 1 - b

The new (treatment) distribution

H0: is false (is a treatment effect)

The original (null) distribution

Real world (‘truth’)

β = probability of a Type II error

α = 0.05

Failing to Reject H0, even though there is

a treatment effect

Probability of (correctly)

Rejecting H0

Page 22: Thursday, September 12, 2013 Effect Size, Power, and Exam Review

Statistical Power

– α-level

– Sample size

– Population standard deviation σ

– Effect size

– 1-tail vs. 2-tailed

Factors that affect Power:

Page 23: Thursday, September 12, 2013 Effect Size, Power, and Exam Review

Statistical Power

α = 0.05

Fail to reject H0Reject H0

b

Power = 1 - β

Factors that affect Power: a-levelChange from a = 0.05 to 0.01

Page 24: Thursday, September 12, 2013 Effect Size, Power, and Exam Review

Statistical Power

α = 0.05

Fail to reject H0Reject H0

β

Power = 1 - β

Factors that affect Power: -levelChange from α = 0.05 to 0.01

α = 0.01

Page 25: Thursday, September 12, 2013 Effect Size, Power, and Exam Review

Statistical Power

α = 0.05

Fail to reject H0Reject H0

β

Power = 1 - β

Factors that affect Power: α-levelChange from α = 0.05 to 0.01

α = 0.01

Page 26: Thursday, September 12, 2013 Effect Size, Power, and Exam Review

Statistical Power

α = 0.05

Fail to reject H0Reject H0

β

Power = 1 -β

Factors that affect Power: α-levelChange from α = 0.05 to 0.01

α = 0.01

Page 27: Thursday, September 12, 2013 Effect Size, Power, and Exam Review

Statistical Power

α = 0.05

Fail to reject H0Reject H0

β

Power = 1 - β

Factors that affect Power: α-levelChange from α = 0.05 to 0.01

α = 0.01

Page 28: Thursday, September 12, 2013 Effect Size, Power, and Exam Review

Statistical Power

α = 0.05

Fail to reject H0Reject H0

β

Power = 1 - β

Factors that affect Power: α-levelChange from α = 0.05 to 0.01

α = 0.01

So as the α level gets smaller, so does the

Power of the test

Page 29: Thursday, September 12, 2013 Effect Size, Power, and Exam Review

Statistical Power

α = 0.05

Fail to reject H0Reject H0

β

Power = 1 - β

Factors that affect Power: Sample sizeChange from n = 25 to 100 Recall that sample

size is related to the spread of the distribution

Page 30: Thursday, September 12, 2013 Effect Size, Power, and Exam Review

Statistical Power

α = 0.05

Fail to reject H0Reject H0

β

Power = 1 - β

Factors that affect Power: Sample sizeChange from n = 25 to 100

Page 31: Thursday, September 12, 2013 Effect Size, Power, and Exam Review

Statistical Power

α = 0.05

Fail to reject H0Reject H0

β

Power = 1 - β

Factors that affect Power: Sample sizeChange from n = 25 to 100

Page 32: Thursday, September 12, 2013 Effect Size, Power, and Exam Review

Statistical Power

α = 0.05

Fail to reject H0Reject H0

β

Power = 1 - β

Factors that affect Power: Sample sizeChange from n = 25 to 100

Page 33: Thursday, September 12, 2013 Effect Size, Power, and Exam Review

Statistical Power

α = 0.05

Fail to reject H0Reject H0

β

Power = 1 - β

Factors that affect Power: Sample sizeChange from n = 25 to 100 As the sample gets

bigger, the standard error gets smaller and the Power gets larger

Page 34: Thursday, September 12, 2013 Effect Size, Power, and Exam Review

Statistical Power

α = 0.05

Fail to reject H0Reject H0

β

Power = 1 - β

Factors that affect Power: Population standard deviationChange from σ = 25 to 20

Recall that standard error is related tothe spread of the distribution

Page 35: Thursday, September 12, 2013 Effect Size, Power, and Exam Review

Statistical Power

α = 0.05

Fail to reject H0Reject H0

β

Power = 1 - β

Factors that affect Power: Population standard deviationChange from σ = 25 to 20

Page 36: Thursday, September 12, 2013 Effect Size, Power, and Exam Review

Statistical Power

α = 0.05

Fail to reject H0Reject H0

β

Power = 1 - β

Factors that affect Power: Population standard deviationChange from σ = 25 to 20

Page 37: Thursday, September 12, 2013 Effect Size, Power, and Exam Review

Statistical Power

α = 0.05

Fail to reject H0Reject H0

β

Power = 1 - β

Factors that affect Power: Population standard deviationChange from σ = 25 to 20

Page 38: Thursday, September 12, 2013 Effect Size, Power, and Exam Review

Statistical Power

α = 0.05

Fail to reject H0Reject H0

β

Power = 1 - β

Factors that affect Power:

As the σ gets smaller, the standard error gets smaller and the Power gets larger

Population standard deviationChange from σ = 25 to 20

Page 39: Thursday, September 12, 2013 Effect Size, Power, and Exam Review

Statistical Power

α = 0.05

Fail to reject H0Reject H0

β

Power = 1 - β

Factors that affect Power: Effect sizeCompare a small effect (difference) to a big effect

μtreatment μno treatment

Page 40: Thursday, September 12, 2013 Effect Size, Power, and Exam Review

Statistical Power

α = 0.05

Fail to reject H0Reject H0

β

Power = 1 - β

Factors that affect Power: Effect sizeCompare a small effect (difference) to a big effect

μtreatment μno treatment

Page 41: Thursday, September 12, 2013 Effect Size, Power, and Exam Review

Statistical Power

α = 0.05

Fail to reject H0Reject H0

β

Power = 1 -β

Factors that affect Power: Effect sizeCompare a small effect (difference) to a big effect

Page 42: Thursday, September 12, 2013 Effect Size, Power, and Exam Review

Statistical Power

α= 0.05

Fail to reject H0Reject H0

β

Power = 1 - β

Factors that affect Power: Effect sizeCompare a small effect (difference) to a big effect

Page 43: Thursday, September 12, 2013 Effect Size, Power, and Exam Review

Statistical Power

α = 0.05

Fail to reject H0Reject H0

β

Power = 1 - β

Factors that affect Power: Effect sizeCompare a small effect (difference) to a big effect

Page 44: Thursday, September 12, 2013 Effect Size, Power, and Exam Review

Statistical Power

α = 0.05

Fail to reject H0Reject H0

β

Power = 1 - β

Factors that affect Power: Effect sizeCompare a small effect (difference) to a big effect

Page 45: Thursday, September 12, 2013 Effect Size, Power, and Exam Review

Statistical Power

α = 0.05

Fail to reject H0Reject H0

β

Power = 1 - β

Factors that affect Power: Effect sizeCompare a small effect (difference) to a big effect

Page 46: Thursday, September 12, 2013 Effect Size, Power, and Exam Review

Statistical Power

α = 0.05

Fail to reject H0Reject H0

β

Power = 1 - β

Factors that affect Power: Effect sizeCompare a small effect (difference) to a big effect

As the effect gets bigger, the Power gets larger

Page 47: Thursday, September 12, 2013 Effect Size, Power, and Exam Review

Statistical Power

α = 0.05

Fail to reject H0Reject H0

β

Power = 1 - β

Factors that affect Power: 1-tail vs. 2-tailedChange from α = 0.05 two-tailed to a = 0.05 two-tailed

Page 48: Thursday, September 12, 2013 Effect Size, Power, and Exam Review

Statistical Power

α = 0.05

Fail to reject H0Reject H0

β

Power = 1 - β

Factors that affect Power: 1-tail vs. 2-tailedChange from a = 0.05 two-tailed to a = 0.05 two-tailed

p = 0.025

p = 0.025

Page 49: Thursday, September 12, 2013 Effect Size, Power, and Exam Review

Statistical Power

Fail to reject H0Reject H0

β

Power = 1 - β

Factors that affect Power: 1-tail vs. 2-tailedChange from a = 0.05 two-tailed to α = 0.05 two-tailed

p = 0.025

p = 0.025

α = 0.05

Page 50: Thursday, September 12, 2013 Effect Size, Power, and Exam Review

Statistical Power

Fail to reject H0Reject H0

β

Power = 1 - β

Factors that affect Power: 1-tail vs. 2-tailedChange from a = 0.05 two-tailed to α = 0.05 two-tailed

p = 0.025

p = 0.025

α = 0.05

Page 51: Thursday, September 12, 2013 Effect Size, Power, and Exam Review

Statistical Power

Fail to reject H0Reject H0

β

Power = 1 - β

Factors that affect Power: 1-tail vs. 2-tailedChange from a = 0.05 two-tailed to a = 0.05 two-tailed

p = 0.025

p = 0.025

α = 0.05

Page 52: Thursday, September 12, 2013 Effect Size, Power, and Exam Review

Statistical Power

Fail to reject H0Reject H0

β

Power = 1 - β

Factors that affect Power: 1-tail vs. 2-tailedChange from a = 0.05 two-tailed to α = 0.05 two-tailed

p = 0.025

Two tailed functionally cuts the α-level in half, which decreases the power.

p = 0.025

α = 0.05

Page 53: Thursday, September 12, 2013 Effect Size, Power, and Exam Review

Statistical Power

Factors that affect Power:– α-level: So as the a level gets smaller, so does the

Power of the test

– Sample size: As the sample gets bigger, the standard error gets smaller and the Power gets larger

– Population standard deviation: As the population standard deviation gets smaller, the standard error gets smaller and the Power gets larger

– Effect size: As the effect gets bigger, the Power gets larger

– 1-tail vs. 2-tailed: Two tailed functionally cuts the α-level in half, which decreases the power

Page 54: Thursday, September 12, 2013 Effect Size, Power, and Exam Review

Why care about Power?

• Determining your sample size– Using an estimate of effect size, and

population standard deviation, you can determine how many participants need to achieve a particular level of power

• When a result if not statistically significant– Is is because there is no effect, or not enough

power

• When a result is significant– Statistical significance versus practical

significance

Page 55: Thursday, September 12, 2013 Effect Size, Power, and Exam Review

Ways of Increasing Power

Page 56: Thursday, September 12, 2013 Effect Size, Power, and Exam Review

Exam I (Tuesday 9/17)

• Covers first 8 chapters• Closed book (materials will be provided)• Bring calculator• Emphasis on material covered in class• No need to use SPSS during the test

Page 57: Thursday, September 12, 2013 Effect Size, Power, and Exam Review

Exam I (Tuesday 9/17)• Ways to describe distributions (using tables, graphs, numeric summaries of

center and spread)• How to enter data into SPSS• Z-scores• Probability• Characteristics of the normal distribution• Normal approximation of the binomial distribution• Use of the unit normal table• Central Limit Theorem & distribution of sample means• Standard error of the mean (σM)• Hypothesis testing

– Four steps of hypothesis testing– Error Types– One-Sample Z-test– Effect sizes and Power