# conditional logistic regression for matched data hrp 261 02/25/04 reading: agresti chapter 9.2

Embed Size (px)

DESCRIPTION

Conditional Logistic Regression for Matched Data HRP 261 02/25/04 reading: Agresti chapter 9.2. Recall: Matching. Matching can control for extraneous sources of variability and increase the power of a statistical test. - PowerPoint PPT PresentationTRANSCRIPT

Conditional Logistic Regression for Conditional Logistic Regression for Matched DataMatched Data

HRP 261 02/25/04HRP 261 02/25/04

reading: Agresti chapter 9.2reading: Agresti chapter 9.2

Recall: MatchingRecall: MatchingMatching can control for extraneous

sources of variability and increase the power of a statistical test.

Match M controls to each case based on potential confounders, such as age and gender.

Recall: Recall: Agresti Agresti example, example, diabetes and MIdiabetes and MI

Match each MI case to an MI control based on age and gender.

Ask about history of diabetes to find out if diabetes increases your risk for MI.

Diabetes

No diabetes

25 119

Diabetes No Diabetes

9 37

16 82

46

98

144

MI cases

MI controls

P(“favors” case/discordant pair) =

)~/(*)/(~)~/(~*)/(

)~/(~*)/(

DEPDEPDEPDEP

DEPDEP

=the probability of observing a case-control pair with only the control exposed

=the probability of observing a case-control pair with only the case exposed

Diabetes

No diabetes

25 119

Diabetes No Diabetes

9 37

16 82

46

98

144

MI cases

MI controls

odds(“favors” case/discordant pair) =

16

37

c

bOR

Logistic Regression for Logistic Regression for Matched Pairs Matched Pairs option 1:option 1: the logistic-normal modelthe logistic-normal model

Mixed model; logit=i+xWhere i represents the “stratum effect”

– (e.g. different odds of disease for different ages and genders)

– Example of a “random effect”Allow i’s to follow a normal distribution

with unknown mean and standard deviationGives “marginal ML estimate of ”

option 2: option 2: Conditional Logistic RegressionConditional Logistic Regression

The conditional likelihood is based on….

The conditional probability (for pair-matched data):

)/(~*)~/()~/(~*)/(

)~/(~*)/(

)~/(*)/(~)~/(~*)/(

)~/(~*)/(

EDPEDPEDPEDP

EDPEDP

DEPDEPDEPDEP

DEPDEP

or, prospectively:

P(“favors” case/discordant pair) =

The Conditional Likelihood: The Conditional Likelihood: each each discordant discordant stratumstratum (rather than individual) (rather than individual) gets 1 term in the likelihoodgets 1 term in the likelihood

control favor thethat strata discordant

1

case favor thethat strata discordant

1

)/(~*)~/(*)/(~*)~/(

)/(~*)~/(

)/(~*)~/(*)/(~*)~/(

)~/(~*)/(

m

j

n

i

EDPEDPEDPEDP

EDPEDP

xEDPEDPEDPEDP

EDPEDP

Note: the marginal probability of disease may differ in each age-gender stratum, but we assume that the (multiplicative) increase in disease risk due to exposure is constant across strata.

Recall probability terms:Recall probability terms:

e

eEDP

1)/(

e

eEDP

1)~/(

eEDP

1

1)/(~

eEDP

1

1)~/(~

α)0(α))~/(1

)~/(ln(

)1(α))/(1

)/(ln(

EDP

EDP

EDP

EDP

The conditional likelihood=The conditional likelihood=

case favor thethat strata discordant

1i

control favor the that strata discordant

1

1*

1

1

1

1*

1

1

1*

1

1

1*

11*

1

11

*1

1

n

m

j

i

i

iii

i

ii

i

jj

j

j

j

j

j

j

j

e

e

eee

eee

e

x

ee

e

e

e

e

e

e

e

Each age-gender stratum has the same baseline odds of disease; but these

baseline odds may differ across strata

Conditional Logistic RegressionConditional Logistic Regression

case favor thethat

strata discordant

1

control favor thethat strata discordant

1j

n

i

m

ii

i

jj

j

ee

ex

ee

e

nmn

i

m

j e

e

ee

ex

e

e i

)1

()1

1(

1

1

1

parameter) nuisance of rid (gets !cancel! s' The***

11

Example: MI and diabetesExample: MI and diabetes

1637 )1

()1

1()L(

e

e

e

Conditional Logistic RegressionConditional Logistic Regression

16

37

1637

53)137(

01

53-37

dlog(L)

)1log(*5337)log(

e

e

ee

e

e

d

eL

Could there be an association between exposure to ultrasound in utero and an increased risk of childhood malignancies?

Previous studies have found no association, but they have had poor statistical power to detect an association.

Swedish researchers performed a nationwide population based case-control study using prospectively assembled data on prenatal exposure to ultrasound.

Example:Example:Prenatal ultrasound examinations and risk Prenatal ultrasound examinations and risk of childhood leukemia: case-control study of childhood leukemia: case-control study

BMJBMJ 2000;320:282-283 2000;320:282-283

Example:Example:Prenatal ultrasound examinations and risk Prenatal ultrasound examinations and risk of childhood leukemia: case-control study of childhood leukemia: case-control study

BMJBMJ 2000;320:282-283 2000;320:282-283

535 cases: all children born and diagnosed as having myeloid leukemia between 1973 and 1989 in Swedish registers of birth, cancer, and causes of death.

535 matched controls: 1 control was randomly selected for each case from the Swedish Birth Registry, matched by sex and year and month of birth.

Ultrasound

No ultrasound

215 320

Ultrasound No Ultrasound

200

335

535

Leukemia cases

Myeloid leukemia controls

235100

115 85

85.100

85

c

bOR

But this type of analysis is limited to single dichotomous exposure…

Used conditional logistic regression to look at dose-response with number of ultrasounds:

Results: Reference OR = 1.0; no ultrasounds OR =.91 for 1-2 ultrasounds OR=.64 for >=3 ultrasounds

Conclusion: no evidence of a positive association between prenatal ultrasound and childhood leukemia; even evidence of inverse association (which could be explained by reasons for frequent ultrasound)

Each term in the likelihood represents a stratum of 1+M individuals

More complicated likelihood expression! See: 02/02/04 lecture

Extension: 1:M matchingExtension: 1:M matching

http://www.stanford.edu/class/hrp223/2003/Lecture15/Lecture15_223_2003.ppt

Available here:-SAS tips, explanations and code-SAS macro that generates automatic logit plots

(under “Lecture 15” at: http://www.stanford.edu/class/hrp223/) to check if predictor is linear in the logit.

Conditional Logistic Regression in Conditional Logistic Regression in SAS: Please read Ray’s slides at:SAS: Please read Ray’s slides at:

M:N Matching SyntaxM:N Matching Syntax

The basic syntax is shown here.proc phreg data=BLAH;model WEIRD*IsOUTCOME(Censor_v)= PREDICTORS /ties=discrete;strata STRATA_VARS;

run; Put the values in the IsOUTCOME variable here that are the controls. Typically this is just the value 0.

This is the switch requesting a m:n CLR.

This is the m:n matching variable.

Courtesy: Ray Balise

Part II: Rater agreement: Part II: Rater agreement: Cohen’s KappaCohen’s Kappa

Agresti, Chapter 9.5 Agresti, Chapter 9.5

Cohen’s KappaCohen’s KappaActual agreement = sum of the proportions found on the diagonals.

ii

Cohen: Compare the actual agreement with the “chance agreement” (which depends on the marginals).

iiii

Normalize by its maximum possible value.

ii

iiii

1

Ex: student teacher ratingsEx: student teacher ratingsRating by supervisor 2

Rating by supervisor 1

Authoritarian

Democratic

Permissive

Totals

Authoritarian 17 4 8 29

Democratic 5 12 0 17

Permissive 10 3 13 26

Totals 32 19 21 72

agreement chance1

agreement chanceagreementˆ

K

Example: student teacher Example: student teacher ratingsratings

Null hypothesis: Kappa=0 (no agreement beyond chance)

**.542 - .182 CI %95

362.347.1

347.583.

)72*72

21*2619*1732*29(1

)72*72

21*2619*1732*29()

72131217

(ˆ

K

Example: student teacher Example: student teacher ratingsratings

Null hypothesis: Kappa=0 (no agreement beyond chance)

Interpretation: achieved 36.2% of maximum possible improvement over

that expected by chance alone