power and sample size
DESCRIPTION
Power and Sample Size. Shaun Purcell & Danielle Posthuma Twin Workshop March 2002. Aims of Session. Introduce concept of power and errors in inference Practical 1 : Using probability distribution functions to calculate power Power in the classical ACE twin study - PowerPoint PPT PresentationTRANSCRIPT
Power and Sample Size
Shaun Purcell & Danielle Posthuma
Twin Workshop March 2002
Aims of Session
Introduce concept of power and errors in inference
Practical 1 : Using probability distribution
functions to calculate power
Power in the classical ACE twin study
Practical 2 : using Mx to calculate power
Practical 3 : Monte-Carlo simulation
Power primer
Statistics (e.g. chi-squared, z-score) are continuous
measures of support for a certain hypothesis
Test statistic
Inevitably leads to two types of mistake : false positive (YES instead of NO) (Type I)false negative (NO instead of YES) (Type II)
YES OR NO decision-making : significance testing
YESNO
Hypothesis testing
Null hypothesis : no effect
A ‘significant’ result means that we can reject the
null hypothesis
A ‘nonsignificant’ result means that we cannot
reject the null hypothesis
Statistical significance
The ‘p-value’
The probability of a false positive error if the null
were in fact true
Typically, we are willing to incorrectly reject the null
5% or 1% of the time (Type I error)
Misunderstandings
p - VALUES
that the p value is the probability of the null
hypothesis being true
that very low p values mean large and important
effects
NULL HYPOTHESIS
that nonrejection of the null implies its truth
Limitations
IF A RESULT IS SIGNIFICANT
leads to the conclusion that the null is false
BUT, this may be trivial
IF A RESULT IS NONSIGNIFICANT
leads only to the conclusion that it cannot be
concluded that the null is false
Alternate hypothesis
Neyman & Pearson (1928)
ALTERNATE HYPOTHESIS
specifies a precise, non-null state of affairs with
associated risk of error
P(T)
T
Critical value
Sampling distribution if HA were true
Sampling distribution if H0 were true
Rejection of H0 Nonrejection of H0
H0 true
HA true
POWER =(1- )
Nonsignificant resultType I error at rate
Type II error at rate
Significant result
Power
The probability of rejection of a false null-
hypothesis
depends on - the significance crtierion ()- the sample size (N) - the effect size (NCP)
“The probability of detecting a given effect size in a population from a sample of size N, using significance criterion ”
Impact of alpha
P(T)
T
Critical value
Impact of effect size, N
P(T)
T
Critical value
Applications
POWER SURVEYS / META-ANALYSES- low power undermines the confidence that can be
placed in statistically significant results
INTERPRETING NONSIGIFICANT RESULTS- nonsignficant results only meaningful if power is high
EXPERIMENTAL DESIGN- avoiding false positives vs. dealing with false negatives
MAGNITUDE VS. SIGNIFICANCE- highly significant very important
Practical Exercise 1
Calculation of power for simple case-control study.
DATA : frequency of risk factor in 30 cases and 30
controls
TEST : 2-by-2 contingency table : chi-squared
(1 degree of freedom)
Step 1 : determine expected chi-squared
Hypothetical risk factor frequencies
Case Control
Risk present 20 10
Risk absent 10 20
Chi-squared statistic = 6.666
E
EO 22 )(
P(T)
T
Critical value
Step 2. Determine the critical value for a given type I error rate,
- inverse central chi-squared distribution
P(T)
T
Critical value
Step 3. Determine the power for a given critical valueand non-centrality parameter
- non-central chi-squared distribution
Calculating Power
1. Calculate critical value (Inverse central 2)
Alpha 0 (under the null)
2. Calculate power (Non-central 2)
Crit. value Expected NCP
http://workshop.colorado.edu/~pshaun/
df = 1 , NCP = 0
X
0.05
0.01
0.001
3.84146
6.63489
10.82754
Determining power
df = 1 , NCP = 6.666
X Power
0.05 3.84146
0.01 6.6349
0.001 10.827
0.73
0.50
0.24
Exercise 1
Calculate power (for the 3 levels of alpha) if sample
size were two times larger (assume proportions
remain constant) ?
Hint: the NCP is a linear function of sample size, and will also
be two times larger
Answers
df = 1 , NCP = 13.333
X Power
0.05 3.84146
0.01 6.6349
0.001 10.827
nb. Stata : di 1-nchi(df,NCP,invchi(df,))
0.95
0.86
0.64
Twin 1
A C E
a c e
Twin 1
A C E
a’ 0 e’
Estimating power for twin models
The power to detect, e.g., common environment
Expected covariance matrices arecalculated under the alternate model :
Fit model to data with value of interest fixed to null value, e.g. c = 0
NCP = -2LLSUB
0.51 0.28
0.41 0.20
Model A C E
1 30% 20% 50%
2 0% 20% 80%
(350 MZ pairs, 350 DZ pairs)
Model Power to detect C
Alpha 0.05 0.01
1
2
Using power.mx script
Using power.mx script
Qu. You observe MZ and DZ correlations of 0.8
and 0.5 respectively, in 100 MZ and 100 DZ twin
pairs. What is the power to detect an additive
genetic effect, with a type I error rate of 1 in
1000?
Absolute ACE effects
Power to detect :
A C E A C
0.1 0.1 0.8 0.02 0.02
0.2 0.2 0.6 0.06 0.09
0.3 0.3 0.4 0.29 0.32
0.4 0.4 0.2 0.95 0.79
150 MZ twins, 150 DZ twins, = 0.01
Relative ACE effects
Power to detect :
A C E A C
0.2 0.2 0.6 0.06 0.09
0.2 0.0 0.8 0.57
0.0 0.2 0.8 0.82
150 MZ twins, 150 DZ twins, = 0.01
Sample Size
NMZ NDZ A C
150 150 0.83 0.53
250 250 0.98 0.86
350 350 1.00 0.96
500 500 1.00 0.99
A:C:E = 2:2:1, = 0.001
NCP and power
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
0 5 10 15 20
NCP
Power
Relative MZ and DZ sample N
NMZ NDZ A C
150 150 0.83 0.53
500 500 1.00 0.99
500 150 0.99 0.56
150 500 0.95 0.99
A:C:E = 2:2:1, = 0.001
Increasing power
Increase sample size
Increase
Multivariate analysis
Adding other family members
Adding other siblings
Power compared to twins only design
(keeping total # individuals constant)
Power to detect
A C D
+ 1 sibling + ++ ++
+ 2 siblings - ++ ++
Monte-Carlo simulation
Instead of calculating expected NCP under
population parameter values, simulate multiple
randomly-sampled datasets
Perform test on each dataset
Due to random sampling variation, the effect will
not always be detectable
The proportion of significant results Power
P(T)
T
Expected NCP
Critical value
P(T)
T
Critical value
More importantly...
Meike says …
“people are going skiing Saturday and all are
welcome to join”