motivating exampleadaptive methodsignored data · 2014-03-27 · power for interim adaptation in...
TRANSCRIPT
Motivating example Adaptive methods Ignored data
Adaptive designs for time-to-event trials
Dominic Magirr1, Thomas Jaki1, Franz Konig2 and MartinPosch2
1Department of Mathematics and Statistics, Lancaster University2Institut fur Medizinische Statistik, Medical University of Vienna
January 2014
Motivating example Adaptive methods Ignored data
Outline
Motivating example
Adaptive methods
Ignored data
Motivating example Adaptive methods Ignored data
Lung cancer trial (Schafer and Muller, 2001)
• Patients randomized to “Radiotherapy + Chemotherapy” (E)or “Chemotherapy” (C)
• Median survival on C ≈ 14 months
• Anticipated survival on E ≈ 20 months
• Sample size: 255 deaths (α = 0.025, β = 0.2)
• Exponential model ... this could be achieved with 40 monthsrecruitment and 20 months min follow-up.
Motivating example Adaptive methods Ignored data
40 months into the trial...
(a) patient recruitment was much slower than expected
– only 136 patients had been randomized
(b) the hazard rate had been over-estimated in the planning
– only 56 deaths had been observed
Recommendation of Schafer and Muller:
“abandon the trial because there [is] no chance of achieving theplanned sample size within a reasonable time”
Motivating example Adaptive methods Ignored data
Counterproposal of study group
• Look at the data to see if there is a larger treatment effectthan originally anticipated.
• If so, reduce the initially planned sample size (requirednumber of events).
• Larger the observed treatment effect → earlier the study ends.
Motivating example Adaptive methods Ignored data
Reverse scenario (Irle & Schafer, 2012)
• Look at data to see if there is a smaller than anticipatedtreatment effect.
• If so, increase the sample size (required number of events) togive a better chance of achieving a statistically significantresult.
Standard analysis will not control the type I error rate...
Motivating example Adaptive methods Ignored data
Adaptive design with immediate responses
Enrollment & follow-up
X1
Enrollment & follow-up
Y
Interim Analysis Final Analysis
Calendar time
int
E.g., under H0,
1√2
Φ−1{
1− p1(X int1 )}
+1√2
Φ−1 {1− p2(Y )} ∼ N (0, 1)
Motivating example Adaptive methods Ignored data
Adaptive design with delayed responses
Enrollment & follow-up
X1
Enrollment & follow-up
Y
Interim Analysis
Follow-up
Follow-up
Final Analysis
Calendar time
int
1√2
Φ−1{
1− p1(X int1 )}
+1√2
Φ−1 {1− p2(Y )} ?∼ N (0, 1)
Motivating example Adaptive methods Ignored data
When is this valid?
X Interim decision strategy based solely on (primary endpoint)treatment effect estimate.
× Interim decisions are based on partial information frompatients who are yet to provide full primary endpoint response
e.g. second-stage sample size is chosen on basis ofprogression-free survival when primary endpoint is overallsurvival.
Motivating example Adaptive methods Ignored data
Potential solution
Enrollment & follow-up
X1
Enrollment & follow-up
Y
Interim Analysis
Follow-up
Follow-up
Final Analysis
Calendar time
1√2
Φ−1 {1− p1(X1)}+1√2
Φ−1 {1− p2(Y )} ∼ N (0, 1)
e.g., Liu & Pledger (2005) – Gaussian responses
Schmidli, Bretz & Racine-Poon (2007) – Binary responses
Motivating example Adaptive methods Ignored data
Extra problem with time-to-event endpoint?Jenkins, Stone & Jennison (2011); Irle & Schafer (2012)
Enrollment & follow-up
X1
Enrollment & follow-up
Y
Interim Analysis
Follow-up
Follow-up
Final Analysis
Unspecified follow-up
Calendar time
T1
• Must pre-specify end of follow-up of first-stage patients, T1, indefinition of p1.
• Otherwise, p1(X1)x∼ U[0, 1] under H0, and type I error may be
inflated.
Motivating example Adaptive methods Ignored data
Data is “thrown away”
• Final test decision only depends on a subset of the recordedsurvival times; part of the observed data is ignored.
• Particularly damaging if long-term survival is of most concern(it is the survival times of earliest recruited patients that isignored).
• Therefore, we investigated the effect of naıvely incorporatingthis illegitimate data into the final test statistic...
Motivating example Adaptive methods Ignored data
Adaptive log-rank test
“Correct” adaptive test statistic
ZCORRECT = w1L1(T1) + w2Φ−1(1− p2)
“Naıve” adaptive test statistic
ZNAIVE = w1L1(T ∗) + w2Φ−1(1− p2)
• L1(t) is the log-rank statistic based on Stage 1 patients,followed up until calendar time t.
• wi are explicitly (Jenkins et al.) or implicitly (Irle & Schafer)fixed weights with w2
1 + w22 = 1.
• T1 is the (implicitly) fixed end of first-stage follow up.
• T ∗ is the time of final analysis (dependent on interimdecisions).
Motivating example Adaptive methods Ignored data
Worst-case assumption
• The null distribution of ZCORRECT is N (0, 1).
• The null distribution of ZNAIVE is completely unknown.
• However, we can look at the stochastic process
Z (t) = w1L1(t) + w2Φ−1(1− p2), t ∈ [T1,Tmax].
Worst-case: the interim data (PFS, early endpoints, etc) can beused to predict exactly when L1(t) reaches its maximum.
Motivating example Adaptive methods Ignored data
Upper bound on type I error
An upper bound can be found assuming second-stage design isengineered such that T ∗ coincides with arg max L1(t):
maxα = PH0
{maxt≥T1
w1L1(t) + w2Φ−1(1− p2) > 1.96
}= · · ·
≈∫ 1
0PH0
[1
maxu=u1
B(u) >√u
{1.96 + w2Φ−1(x)
}w1
]dx ,
with u1 = {# stage 1 deaths at T1} / {# stage 1 deaths at Tmax}
Motivating example Adaptive methods Ignored data
0.2 0.4 0.6 0.8
0.2
0.4
0.6
0.8
u1
w1
0.04
0.06
0.08
0.10
0.12
0.14
Figure: Worst case type I error for various choices of weights andinformation fractions.
Motivating example Adaptive methods Ignored data
Irle & Schafer example revisited
• Original trial design:
- 248 deaths; 40 months recruitment; 20 months min follow-up.
• Interim analysis after 23 months:
- 190 patients recruited; 60 deaths.- Treatment effect in terms of PFS less impressive than
expected.- Decision made to increase required number of deaths from 248
to 400.
• At calendar time T1 ≈ 60 months:
- 170 first-stage patient deaths; 78 second-stage patient deaths.- u1 = 170/190; w1 =
√170/248; maxα = 0.040.
Motivating example Adaptive methods Ignored data
Guaranteed level-α test
Simply increase cut-off such that PH0 {maxt≥T1 Z (t) ≥ k∗} = α.
In assessing the effect on power, four relevant probabilities are:
A. Pθ=θR
(ZCORRECT > 1.96
)B. Pθ=θR
(ZCORRECT > k∗
)C. Pθ=θR {Z (Tmax) > k∗}
D. Pθ=θR {maxt≥T1 Z (t) > k∗}
Motivating example Adaptive methods Ignored data
0.0 0.2 0.4 0.6 0.8 1.0
0.0
0.2
0.4
0.6
0.8
1.0
p2
Pow
er
(a)
0.0 0.2 0.4 0.6 0.8 1.0
0.0
0.2
0.4
0.6
0.8
1.0
p2
Pow
er
(b)
Figure: Conditional power as defined by A (thin line), B (medium line),C (thick line) and D (dashed line) given p2, under two scenarios.
(a): u1 = 170/190, w1 = (170/248)1/2 and θR = 0.36.(b): u1 = 147/288, w1 = (147/248)1/2 and θR = 0.36.
Motivating example Adaptive methods Ignored data
Conclusion: methods trade-off
Type I Informed interim All survival Powercontrol decisions times in test
“Independent increments” X × X X“Correct” adaptive X X × X“Naıve” adaptive × X X X
“Naıve” + k∗ X X X ×
Other research avenue: use joint distribution of, e.g., PFS and overall survival.
Motivating example Adaptive methods Ignored data
References
1. Schafer, H., & Muller, H. H. (2001). Modification of the sample size andthe schedule of interim analyses in survival trials based on datainspections. Statistics in medicine, 20(24), 3741-3751.
2. Irle, S., & Schafer, H. (2012). Interim Design Modifications inTime-to-Event Studies. Journal of the American Statistical Association,107(497), 341-348.
3. Liu, Q., & Pledger, G. W. (2005). Phase 2 and 3 combination designs toaccelerate drug development. Journal of the American StatisticalAssociation, 100(470).
4. Schmidli, H., Bretz, F., & Racine-Poon, A. (2007). Bayesian predictivepower for interim adaptation in seamless phase II/III trials where theendpoint is survival up to some specified timepoint. Statistics inmedicine, 26(27), 4925-4938.
5. Jenkins, M., Stone, A., & Jennison, C. (2011). An adaptive seamlessphase II/III design for oncology trials with subpopulation selection usingcorrelated survival endpoints. Pharmaceutical statistics, 10(4), 347-356.