matching (in case control studies) james stuart, fernando simón epiet dublin, 2006

28
Matching (in case control studies) James Stuart, Fernando Simón EPIET Dublin, 2006

Upload: esmond-brown

Post on 14-Dec-2015

216 views

Category:

Documents


0 download

TRANSCRIPT

Matching(in case control studies)

James Stuart, Fernando SimónEPIET

Dublin, 2006

Remember confounding…

Confounding factor is variable independently associated with

• exposure of interest• outcome

that distorts measurement of association

Control of confounders

In the study design

• Restriction• Matching

In the analysis

• Stratification• Multivariate analysis

Control of confounders

In the study design

• Restriction• Matching

In the analysis

• Stratification• Multivariate analysis

Matching

Selection of controls to match specific

characteristics of cases

a) Frequency matchingSelect controls to get same distribution of

variable as cases (e.g. age group)

b) Individual matchingSelect a specific control per case by

matching variable (e.g. date of birth)

… but matching introduces bias

because controls are no longer representative of source population

to remove this selection bias

• Stratify analysis by matching criteria

matched design matched analysis

• Can not study the effect of matching variables on the outcome

a) Frequency matching

useful if distribution of cases for a confounding variable differs markedly from distribution of that variable in source population

a) Frequency matching

Age Cases

(years)

0-14 50

15-29 30

30-44 15

45+ 5

TOTAL 100

a) Frequency matching

Age Cases Controls

(years) unmatched

0-14 50 20

15-29 30 20

30-44 15 20

45+ 5 40

TOTAL 100 100

a) Frequency matching

Age Cases Controls

(years) unmatched matched

0-14 50 10 50

15-29 30 25 30

30-44 15 25 15

45+ 5 40 5

TOTAL 100 100 100

a) Frequency matching: analysis

• Mantel-Haenszel Odds Ratio (weighted)

• Conditional logistic regression for

multiple variables

][

][

i

iMH ncb

ndaOR

a) Frequency matching: analysis

• keep stratification by age group

0-14 years

Exposed Cases Controls Total

Yes 45(a) 30(b) 75

No 5(c) 20(d) 25

Total 50 50 100(ni)

5.1

9

100150

100900

i

i

ncb

nda

a) Frequency matching: analysis

15-29 years

Exposed Cases Controls Total

Yes 15(a) 4(b) 19

No 15(c) 26(d) 41

Total 30 30 60(ni)

same process for each age group

0.1

5.6

6060

60390

i

i

ncb

nda

etc

etcORMH

15.1

5.69

b) individual matching

Each pair could be considered one stratum

4 possible outcomes per pair

Exposure

+ -

Case 1 0

Control 1 0

b) individual matching

Each pair could be considered one stratum

4 possible outcomes per pair

Exposure

+ - + -

Case 1 0 1 0

Control 1 0 0 1

b) individual matching

Each pair could be considered one stratum

4 possible outcomes per pair

Exposure

+ - + - + -

Case 1 0 1 0 0 1

Control 1 0 0 1 0 1

b) individual matching

Each pair can be considered as one stratum

4 possible outcomes per pairExposure

+ - + - + - + -Case 1 0 1 0 0 1 0 1Control 1 0 0 1 0 1 1 0

ad = zero unless case exposed, control not exposed bc = zero unless control exposed, case not exposed

b) individual matching

The only pairs that contribute to OR are discordant

ORMH= sum of discordant pairs where case exposed sum of discordant pairs where control exposed

][

][

i

iMH ncb

ndaOR

b) individual matching

If change way of presenting case and control data

to show in pairs

Controls

Exposed Unexposed

Exposed e f (ad=1)

Cases

Unexposed g (bc = 1) h

ORMH = sum of discordant pairs where case exposed sum of discordant pairs where control exposed

= f/g

b) individual matching: for n controls

each set analysed in pairs case used in as many pairs as number of controls

Case Control1 Control2 Control3 Control4 C+/Ctr- C-/Ctr+ + - + - - 3 0 + + - + + 1 0 - - - - - 0 0 + - - - + 3 0 - - + - - 0 1 + - + + + 1 0 + + + + + 0 0 Total......................................................................... 8 1

pairs case exp/control not 8pairs case not/control exp 1

OR= = = 8

Matched study: example

• 20 cases of cryptosporidiosis

• Hypothesis: associated with attendance at local swimming pool

• 2 matched studies conducted

(i) controls from same general practice and nearest date of birth

(ii) case nominated (friend) controls

Analysis: GP and age matched controls

swimming pool exposure

Controls+ -

+ 1 15Cases

- 1 3

OR = f/g = 15/1 = 15.0

Analysis: friend controls

swimming pool exposure

Controls

+ -

+ 13 3

Cases

- 1 3

OR = 3/1 = 3.0

Why do matched studies?

• Random sample may not be possible

• Quick and easy way to get controls

• Improves efficiency of study (smaller sample size)

• Can control for confounding due to factors that are difficult to measure or even for unknown confounders.

Disadvantages of matching

• Cannot examine risks associated with matching variable

• If no controls identified, more likely if too many matching variables, lose case data and vice versa

• Overmatching on exposure of interest will bias OR towards 1

• May be residual confounding in frequency matching

Over-matching

• exposure to the risk factor of interest

• under-estimates true association

• may fail to find true association

Key points

• Matching controls for confounding factors in study design

• Matched design matched analysis

• Matching for variables that are not confounders complicates design

• Frequency matching simpler than individual

• Multivariable analysis reduces need to match