tree structured prognostic model for hepatocellular carcinoma

43
KNOU IMA 2003 1 Tree Tree Structured Prognostic Model Structured Prognostic Model for for Hepatocellular Carcinoma Hepatocellular Carcinoma Taerim Lee (1) , Hyungjun Cho (2) , Hyosuck Lee (3) (1) Dept of Information Statistics Korea National Open University e-mail : [email protected] (2) Dept. of Biostatistics University of Maryland e-mail : [email protected] (3) Dept. of Internal Medicine Seoul National University e-mail : [email protected]

Upload: raleigh

Post on 11-Feb-2016

35 views

Category:

Documents


0 download

DESCRIPTION

Tree Structured Prognostic Model for Hepatocellular Carcinoma. Taerim Lee (1) , Hyungjun Cho (2) , Hyosuck Lee (3) (1) Dept of Information Statistics Korea National Open University - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Tree  Structured Prognostic Model for Hepatocellular Carcinoma

KNOU IMA 2003 1

TreeTree Structured Prognostic Model for Structured Prognostic Model forHepatocellular CarcinomaHepatocellular Carcinoma

Taerim Lee (1) , Hyungjun Cho (2), Hyosuck Lee (3)

(1) Dept of Information Statistics Korea National Open University e-mail : [email protected] (2) Dept. of Biostatistics University of Maryland e-mail : [email protected] (3) Dept. of Internal Medicine Seoul National University e-mail : [email protected]

Page 2: Tree  Structured Prognostic Model for Hepatocellular Carcinoma

KNOU IMA 2003 2

OutlineOutline

1. Background 2. Motivation & Purposes 3. Analytic structure of Tree Structured Model4. Prognostic Model of Hepatocellula carcinoma5. Remarks

Page 3: Tree  Structured Prognostic Model for Hepatocellular Carcinoma

KNOU IMA 2003 3

BackgroundBackground

1.1. Tree structured regression modeling Tree structured regression modeling techniques have been developed using a techniques have been developed using a recursive partitioning algorithm.recursive partitioning algorithm.

2.2. There has been a strong need on the analyses There has been a strong need on the analyses of the data on the survival rate of patients with of the data on the survival rate of patients with HCC who were treated with either with surgical HCC who were treated with either with surgical resection or transaterial chemoembolization resection or transaterial chemoembolization method. method.

Page 4: Tree  Structured Prognostic Model for Hepatocellular Carcinoma

KNOU IMA 2003 4

ReviewReview of of LiteratureLiterature

1. 1. L. Gorden & R. Olshen & (1985)L. Gorden & R. Olshen & (1985) presented tree structured survival analysis in the Cancer Treatment Reports.

2. 2. M. Segal(1988)M. Segal(1988) Regression TreesRegression Trees for censored data for censored data

3. 3. R.B. Davis & J.R. Anderson(1989)R.B. Davis & J.R. Anderson(1989) developed developed

exponential survival treesexponential survival trees using a recursive using a recursive partitioning algorithm for incomplete survival data partitioning algorithm for incomplete survival data using CART in Statistics in Medicine.using CART in Statistics in Medicine.

4. 4. Lynn A. Sleeper & D.P. Harrington(1990)Lynn A. Sleeper & D.P. Harrington(1990) examined examined a flexible survival model with a flexible survival model with Regression SplinesRegression Splines for covariate effect in Liver Disease.for covariate effect in Liver Disease.

Page 5: Tree  Structured Prognostic Model for Hepatocellular Carcinoma

KNOU IMA 2003 5

Review of Literature5.M.LeBlanc & John Crowley(1992) developed a method for

obtaining tree-structured relative risk estimates for censored survival

data.

6. M. LeBlanc & John Crowley(1993) reported survival trees by

goodness of split in JASA.

7. H. Ahn & W.Y.Loh(1994) fitted piecewise proportional hazards

models to censored survival data with bootstrapping in Biometrics.

8. D. Collett(1994) reported modeling survival data in medical search.

Page 6: Tree  Structured Prognostic Model for Hepatocellular Carcinoma

KNOU IMA 2003 6

ReviewReview of of LiteratureLiterature

9.W.Y. Loh & Y.S. Shih(1997) derived split selection methods for classification tree in Statistica Sinica.

10. S.H. Um et al.(1997) derived the prognostic index and define the natural stage of HCC using those index.

11. S.H. Um et al.(1998) evaluated the prognosis of HCC in relation the treatment methods.

12. S.K. Han et al.(1998) estimated the survival rate and their affecting factors in patient with HCC.

Page 7: Tree  Structured Prognostic Model for Hepatocellular Carcinoma

KNOU IMA 2003 7

ReviewReview of of LiteratureLiterature

14. K.M. Kim et al.(2000) studied the survival rate of patient

with hepatocellular carcinoma (HCC) who were treated either

with surgical resection or transarterial chemoembolization

15. K.M. Kim et al.(2000) compare the survival and quality of life in advanced non small cell lung cancer patients in

Journal of Clinical Oncology.

16. H. J. Cho (2001) reported tree structured regression modeling for censored survival data.

Page 8: Tree  Structured Prognostic Model for Hepatocellular Carcinoma

KNOU IMA 2003 8

MotivationMotivation

1. A variety of powerful modeling technique have been developed for exploring the functional form of effects for HCC. 2. To find the model related to the effects which are changing over time with covariate.

3. To obtain the tree structured prognostic models of HCC patients.

Page 9: Tree  Structured Prognostic Model for Hepatocellular Carcinoma

KNOU IMA 2003 9

ObjectivesObjectives

1. To identify the effect of prognostic factors of HCC.

2. To quantify the patient characteristics that related to the survival time. 3. To explore the functional form and the relationships of the covariates.4. To reflect the changing effects over time to the prognostic model.

Page 10: Tree  Structured Prognostic Model for Hepatocellular Carcinoma

KNOU IMA 2003 10

Material and MethodMaterial and Method. Data : From 1993 through 1996, 186 patients with HCC in UICC T1-3NOMO and liver function of Child-Pugh class A were enrolled in SNUH.. Variables : Stratification Var: Surgical Resection group

Transaterial chemoembolization group

Clinical Var : sex, age, afp, viral child effect UICC. CLIP # of TACE type, tumor size, lobe, inv. Transformed variables(7) Dependent Var : prognosis(sur, sc) survivalship

Page 11: Tree  Structured Prognostic Model for Hepatocellular Carcinoma

KNOU IMA 2003 11

Table1. Pretreatment Base-Line Table1. Pretreatment Base-Line Characteristics of the Patients Characteristics of the Patients

CHRACTERISTIC SURGICAL TESECTION (N=91)

TRANSCATHETER ARTERIALCHEMOEMBOLIZATION (N=91)

P VALUE

Age-yr      

Mean SD 50 10 56 10 0.0002+

Male sex – no.(%) 76(84) 79(87) 0.532+

Serum alpha-fetoprotein – no.(%)

    0.756+

< 400 ng/ml 58(64) 60(66)  

400 ng/ml 33(36) 31(34)  

Viral marker-no.(%)     0.046

Hepatitis B virus 71(78) 61(67)  

Hepatitis C virus 4(4) 16(18)  

Hepatitis B and C virus 2(2) 1(1)  

NBNC& 13(14) 10(11)  

Unknown 1(1) 3(3)  

Page 12: Tree  Structured Prognostic Model for Hepatocellular Carcinoma

KNOU IMA 2003 12

Table1. Pretreatment Base-Line Table1. Pretreatment Base-Line Characteristics of the Patients Characteristics of the Patients

CHRACTERISTIC SURGICAL TESECTION (N=91)

TRANSCATHETER ARTERIALCHEMOEMBOLIZATION (N=91)

P VALUE

Okuda stage – no.(%)     0.155+

Okuda Ⅰ 91(100) 89(98)  

Okuda Ⅱ 0(0) 2(2)  

UICC T stage-no.(%)     0.003

UICC T1 17(19) 12(13)  

UICC T2 63(69) 49(54)  

UICC T3 11(12) 30(33)  

CLIP scoring-no.(%)     0.125

CLIP1 54(59) 39(43)  

CLIP2 27(30) 34(37)  

CLIP3 8(9) 16(18)  

CLIP4 2(2) 1(1)  

CLIP5 0(0) 1(1)  

Lipiodol retention pattern-no.(%)     0.063

Compact 56(62) 56(62)  

Non-compact 30(33) 35(38)  

Unknown 5(5) 0(0)  

Page 13: Tree  Structured Prognostic Model for Hepatocellular Carcinoma

KNOU IMA 2003 13

Table2. Changes in afp after Initial Table2. Changes in afp after Initial Treatment in afp Secreating HCC Treatment in afp Secreating HCC

CHRACTERISTIC SURGICAL TESECTION

TRANSCATHETER ARTERIALCHEMOEMBOLIZATION

P VALUE

Serum alpha-fetoprotein in total alpha-fetoprotein secretion HCC patients

N=73 N=64 0.001+

No.(%)

Decreased>50% 61(84) 33(52)  

Decreased 25-50% 7(10) 10(16)  

Stable0 3(4) 13(20)  

Increased≥25% 2(3) 8(13)  

Serum alpha-fetoprotein in alpha-fetoprotein secreting HCC patients with compact Lipiodol

retention

N=41 N=40 0.137+

No.(%)

Decreased>50% 34(83) 25(63)  

Decreased 25-50% 4(10) 6(15)  

Stable0 1(2) 6(15)  

Increased≥25% 2(5) 3(8)  

Page 14: Tree  Structured Prognostic Model for Hepatocellular Carcinoma

KNOU IMA 2003 14

AnalysisAnalysis

Survival Analysis - Cox regression

Comparison between survival groups

Tree Structured Prognostic Model

Page 15: Tree  Structured Prognostic Model for Hepatocellular Carcinoma

KNOU IMA 2003 15

Proportional Hazards Model (Cox 1972)

timeCensoring:C,timeSurvival:Z ii

)CI(Zδ ,)C,min(ZY iiiiii

Covariate ofVector:)'X,,,,,(XX x1ii

If is the hazard rate at time y for an individual with Risk factor X, the Cox proportional hazards Model is

X)|λ(y

p

1kkk0 )Xβ(y)exp(λX)|λ(y

Analytic Structure of Survival Model

Page 16: Tree  Structured Prognostic Model for Hepatocellular Carcinoma

KNOU IMA 2003 16

Fig.1 Survival Curves for HCC according to Treatment Group

Median survivalResection : 59.1 monthsTACE : 35.5 months

Variables in the Model Exp(B) CHILD 2.67 CLID 1.43 TX 0.64 EFFECT 0.54

Page 17: Tree  Structured Prognostic Model for Hepatocellular Carcinoma

KNOU IMA 2003 17

Fig.2 Survival Curves for HCC according to Tumor Stage

Page 18: Tree  Structured Prognostic Model for Hepatocellular Carcinoma

KNOU IMA 2003 18

Fig.3 Survival Curves for HCC according to CLIP Stage

Page 19: Tree  Structured Prognostic Model for Hepatocellular Carcinoma

KNOU IMA 2003 19

Fig.4 Survival Curves for HCC Fig.4 Survival Curves for HCC according to Lipidol Pattern according to Lipidol Pattern

Page 20: Tree  Structured Prognostic Model for Hepatocellular Carcinoma

KNOU IMA 2003 20

Fig.5 Survival Curves for HCC according to Child Score

Page 21: Tree  Structured Prognostic Model for Hepatocellular Carcinoma

KNOU IMA 2003 21

tree structured prognostic model with effective covariate

: CART uses a decision tree to display how data may be classified or predicted.

: automatically searches for important relationships

and uncovers hidden structure even in highly complex

data.

CART total

LDL D L

X1 >aX2 >b

X 3>cX4 >d

Analytic Structure of Prognostic Model

Page 22: Tree  Structured Prognostic Model for Hepatocellular Carcinoma

KNOU IMA 2003 22

Fig.6 Tree Structured Model based on HCC data

1

186147(0)39(1)

CHILD≤5.5

AGE≤713

TAENUM≤7.5

3911(0)28(1)

14799(0)48(1)

CART

2924(0)5(1)

9552(0)43(1)

2323(0)0(1)

12476(0)48(1)

4635(0)11(1)

3815(0)23(1)

8450(0)34(1)

112(0)9(1)

AFP ≤ 8.0

0

0

1

01

TAENUM≤1.5

1. AFP 1. AFP 100.0 100.02. TAENUM 88.8 2. TAENUM 88.8 3. CHILD 61.23. CHILD 61.24. AGE 42.24. AGE 42.25. TX 32.15. TX 32.1

Sensitivity 90.0%Sensitivity 90.0%Specificity 30.3%Specificity 30.3%Total 65.6%Total 65.6%

Page 23: Tree  Structured Prognostic Model for Hepatocellular Carcinoma

KNOU IMA 2003 23

Fig.7 Tree Structured Model for TACE group of HCC data

9446(0)48(1)

CHILD≤5.5

SIZE≤3.85

TAENUM≤1.5

CART

3522(0)13(1)

4915(0)34(1)

AFP≤10.40

0

INV≤0.5 33(0)0(1)

4612(0)34(1)

188(0)

10(1)

1714(0)3(1)

109(0)1(1)

8437(0)47(1)

1

10

010

7(0)3(1)

87(0)1(1)

1. TAENUM 100.01. TAENUM 100.02. AFP 87.7 2. AFP 87.7 3. CHILD 72.33. CHILD 72.34. SIZE 59.44. SIZE 59.45. INV 59.05. INV 59.06. CLIP 45.56. CLIP 45.5

Sensitivity 71.7%Sensitivity 71.7%Specificity 85.4%Specificity 85.4%Total 78.7%Total 78.7%

Page 24: Tree  Structured Prognostic Model for Hepatocellular Carcinoma

KNOU IMA 2003 24

1. AFP 100.01. AFP 100.02. TAENUM 72.6 2. TAENUM 72.6 3. CHILD 31.13. CHILD 31.14. AGE 7.74. AGE 7.75. TX 5.35. TX 5.3

Sensitivity 89.1%Sensitivity 89.1%Specificity 67.9%Specificity 67.9%Total 82.6%Total 82.6%

11

9273(0)19(1)

AGE≤639AGE≤639

TAENUM≤7.5TAENUM≤7.5

196(0)

13(1)

7358(0)15(1)

CART

96(0)3(1)

71(0)6(1)

167(0)9(1)

5751(0)6(1)

AFP≤8.0AFP≤8.0

00

11 00

Fig.8 Tree Structured Model based for SC group of HCC SC(Surgical Resection Group)

Page 25: Tree  Structured Prognostic Model for Hepatocellular Carcinoma

KNOU IMA 2003 25

tree structured prognostic model with effective covariate : FACT employs statistical hypothesis test to select a variable for splitting each node and then uses discriminant analysis to find the split point . The size of the tree is determined by a set of stopping rules

FACT total

LDL D L

X1 >aX2 >b

X 3>cX4 >d

Analytic Structure of Prognostic Model

Page 26: Tree  Structured Prognostic Model for Hepatocellular Carcinoma

KNOU IMA 2003 26

: QUEST is a new classification tree algorithm derived from the FACT method. It can be used with univariate splits or linear combination splits.

QUEST total

LDL D L

X1 >aX2 >b

X 3>cX4 >d

Analytic Structure of Prognostic Model

Unlike FACT, QUEST uses cross-validation pruning. It distinguishes from other decision tree classifiers is that when used with univariate splits the classifier performs approximately unbiaased variable selection.

Page 27: Tree  Structured Prognostic Model for Hepatocellular Carcinoma

KNOU IMA 2003 27

Survival Tree with Unbiased Detection of Interaction

: STUDI is a tree-structured regression modeling tool.

STUDItotal

L3L5L1 L L4

X1 >aX2 >b

X 3>cX4 >d

Analytic Structure of Prognostic Model

It is easy to interpret predict survival value for new case. Missing values can easily be handled and time dependent covariates can be incorporated.

Page 28: Tree  Structured Prognostic Model for Hepatocellular Carcinoma

KNOU IMA 2003 28

STUDI

Analytic Structure of Prognostic Model

Let the survival function for a covariate Xi be

Where is the cumulative baseline hazard rate. Then median survival time for and individual is defined as

and the cost at a node t be is defined as

)}()({)│( ii xyxxyS expexp 0

)(y0

i })(│{ 5.0inf~ ySyyi

n

iii yytR

1

~)(

Page 29: Tree  Structured Prognostic Model for Hepatocellular Carcinoma

KNOU IMA 2003 29

STUDI

Analytic Structure of Prognostic Model

Modified Cox-Snell(MCS) residuals: for I=1,…,n

Where is the estimator of the cumulative baseline hazard function.

)1(693.0}ˆexp{)(ˆ0 iii XY

Page 30: Tree  Structured Prognostic Model for Hepatocellular Carcinoma

KNOU IMA 2003 30

Covariate TypeCovariate Type

1. n-covariate: Ordered numerical predictor used for fitting and splitting.

2. f-covariate: Ordered numerical predictor used for fitting only.

3. s-covariate: Ordered predictor used for splitting only.

4. c-covariate: Categorical predictor for splitting only

Page 31: Tree  Structured Prognostic Model for Hepatocellular Carcinoma

KNOU IMA 2003 31

Split Covariate Selection 1. Fit a model to n and f covariates in the node.

2. Obtain the modified Cox-Snell residuals.

3. Perform a curvature test for each of n-s-and c-covariates. 4. Perform a interaction test for each pair of n- s-and c-covariates. 5. Select the covariate which has the smallest p-value.

Page 32: Tree  Structured Prognostic Model for Hepatocellular Carcinoma

KNOU IMA 2003 32

Bootstrap Calibration• Bootstrap estimation of a bias correction coefficientBootstrap estimation of a bias correction coefficient- Let (Y- Let (Y**, δ, δ**) be a bootstrap sample drawn from the elements in (Y, δ).) be a bootstrap sample drawn from the elements in (Y, δ).- Fit a model to (Y- Fit a model to (Y**, δ, δ**, X1, …,Xp), using only the n- and f-covariates., X1, …,Xp), using only the n- and f-covariates.- Compute the p-values from the χ- Compute the p-values from the χ22 curvature and interaction tests. curvature and interaction tests.- Convert each p-values into an absolute normal variate z*.- Convert each p-values into an absolute normal variate z*.- For r>1, if r max {z- For r>1, if r max {znn

**, z, znnnn**} } ≥ ≥ max{zmax{zss

**, z, zcc**, z, zssss

**, z, zcccc**, z, znsns

**, z, zncnc**, z, zscsc

**}, an n }, an n

covariate is selected.covariate is selected.- Repeat the above steps over a grid of r values to estimate π(r), the- Repeat the above steps over a grid of r values to estimate π(r), the probability that an n-covariate is selected.probability that an n-covariate is selected.- Find r- Find r00 such that π(r such that π(r00) equals the proportion of n-covariates of the total ) equals the proportion of n-covariates of the total

number of covariates used for spitting in the sample. Interpolate linearlynumber of covariates used for spitting in the sample. Interpolate linearly if necessary.if necessary.

Page 33: Tree  Structured Prognostic Model for Hepatocellular Carcinoma

KNOU IMA 2003 33

How to select the covariate with the How to select the covariate with the smallest p-value?smallest p-value?

1. If the smallest p-value is from a curvature test -select the covariate which has the smallest p-value

2. If the smallest p-value is from an interaction test

-for a pair of n-covariates, select the one which yields the smaller total costs of two subnodes its parent node is split by the sample median.

-for s or c covariates select the one with the smaller curvature p-value.

-between n-covariates and s or c covariate select the s or c covariate.

Page 34: Tree  Structured Prognostic Model for Hepatocellular Carcinoma

KNOU IMA 2003 34

Unbiased covariates selection

1. Fit a medel to (Y, δδ , X1…XP) using only the n and f covariates.

2. Let Z’ = max{r0Zn, r0Znn,Zs,Zc, Zss,Zcc,Zns,Znc,Zsc} select the split covariate using previous selection algorithm.

Page 35: Tree  Structured Prognostic Model for Hepatocellular Carcinoma

KNOU IMA 2003 35

Split Points or Split Sets

For an ordered covariate, use exhaustive search, it use sample median or sample quantiles.

For a categorical covariate, use a shortcut algorithm given in Breiman et al.

1. Cost complexity prunning technique of CART. 2. Choice of tree pruning by a test sample or by

cross-validation.

Pruning

Page 36: Tree  Structured Prognostic Model for Hepatocellular Carcinoma

KNOU IMA 2003 36

Fig.9 Tree-Structured Survival Model for HCC

child≤7 1

2 3

4 5

pafp≤3810

Treatment= Surgery

182

159 23

12147

27.7y~

9.21y~ taeeff=non-compact

9

18 19

36 37

tean≤6

67

24 43

11 13

4.05y~

5.43y~ 1.91y~

1.56y~ 3.28y~

8

16 17

32 33

64 65

130 131

263262

524 525

pafp≤78

lobe= right

taeeff=non-compact

age≤45.1

pafp≤7.0

afp≤16.0

80

1565

1649

34

26

917

9 8

15

8

8.28y~

0.37y~

5.53y~

2.08y~

6.50y~

Page 37: Tree  Structured Prognostic Model for Hepatocellular Carcinoma

KNOU IMA 2003 37

Fig.10 Scatterplot or Boxplots of the MCS Resideals

Page 38: Tree  Structured Prognostic Model for Hepatocellular Carcinoma

KNOU IMA 2003 38

Fig.11 Survival Curves for HCC according to Node Groups

Page 39: Tree  Structured Prognostic Model for Hepatocellular Carcinoma

KNOU IMA 2003 39

Fig.12 Tree-Structured Survival Model for HCC with Surgery

1

2 3

91

34 57

Pafp≤5.0

75.2y~ 4.83y~

Page 40: Tree  Structured Prognostic Model for Hepatocellular Carcinoma

KNOU IMA 2003 40

Fig.13 Survival Curves for HCC with Surgery according to pafp Score

Page 41: Tree  Structured Prognostic Model for Hepatocellular Carcinoma

KNOU IMA 2003 41

Fig.14 Tree-Structured Survival Model for HCC with TACE

1

2 3

91

39

26

taen≤7

1.63y~

9.57y~

pafp≤36.0

4 5

65

250.02y~

Page 42: Tree  Structured Prognostic Model for Hepatocellular Carcinoma

KNOU IMA 2003 42

Fig.15 Survival Curves for HCC with TACE according to Node Groups

Page 43: Tree  Structured Prognostic Model for Hepatocellular Carcinoma

KNOU IMA 2003 43

Remarks1. The effects of prognostic factors of HCC were identified with the tree structured survival model.2. The factors CHILD, the number of TACE

treatment, AFP, PAFP level were common to all the models, the post AFP was specifically important to the survival time of surgical resection group and TACE treatment group.

3. Tree structured survival model with minimum SAD provide the critical values of splitting covariates for each node.