an update from the jsm 2006 - seattle ryan woods – january 8, 2007

An update from the JSM 2006 - Seattle

Ryan Woods – January 8, 2007

Topics for this presentation…

• Adaptive Designs and Randomizations in clinical trials

• Statistical Methods for studies of Bioequivalence

• Some models for competing risks in cancer research

Part 1: Adaptive Designs in Clinical Trials…

“Adaptive” has two common meanings in the design of clinical trials literature:

1) A trial in which the design changes in some fashion after the study has commenced (e.g. # of interim looks)

2) A trial in which the randomization probabilities change throughout the study in some fashion

Case 1: Change in Trial Design…

Review of Sequential Hypothesis Testing:

• We choose a number of interim looks

• Select desired power for study, sample size, overall Type I error rate

• Select a spending function for Type I error that reflects how much “alpha” we want to spend at each look

• The spending function determines our rejection boundaries for test statistics computed at each analysis → example…

Example…• Suppose we choose two

interim looks + final analysis• Sample size of 600 with

interim looks at n=200, 400• Overall Type I error = 0.05• Want fairly conservative

boundaries early, with some alpha left for final analysis…

process looks like…

Cumulative Sample Size

Sta

nd

ard

ize

d T

est

Sta

tistic

0 200 400 600

0.0

0.5

1.0

1.5

2.0

2.5

3.0

3.5

Three-look Sequential Boundary for Rejection of Ho

Reject Ho

What if we want to make a change?

At an interim look we may want to:• Increase in sample size• Increase number of looks• Change the shape of the spending

function for remaining looks• Change inclusion criteria

BUT: can we do this without inflating Type I error? Can we estimate the Rx effect at the end?

We Want Change!• If at some look L in a K-look trial, we

want to make a design change, we need to consider ε:

•This is referred to as the conditional rejection probability. •Zj, bj are values of test statistic and

boundary at look J. •Any change to the trial at look L must preserve ε for the modified trial (Muller & Schafer)


Sta

nd

ard

ize

d T

est

Sta

tistic

0 200 400 600

0.0

0.5

1.0

1.5

2.0

2.5

3.0

3.5

Conditional Rejection Probability at N=400 = 0.255

Three-look Sequential Boundary for Rejection of Ho

Original Design BoundariesAccumulated Data

Reject Ho

Conditional Rejection Probability at N=400 → 0.255


Sta

nd

ard

ize

d T

est

Sta

tistic

0 200 400 600 800

0.0

0.5

1.0

1.5

2.0

2.5

3.0

3.5

4.0

Modified Design to Four-look Larger Study (N=800)

Original Design BoundariesAccumulated DataModified Study Boundaries

Reject Ho

Such changes are fine, provided ε=0.255 for modified trial

How to calculate these ε’s?• These conditional rejection probabilities can be calculated

conveniently under various adaptations using EaSt (Cytel Software)• At present our license expired and we are updating it!• I promise an example delivered to you when our license is upgraded

…Example to come!!!

What about Estimation?

• Interval Estimation for the treatment difference δ is explained in Mehta’s slides

• In summary, the concept is an extension of Jennison and Turnbull’s Repeated Confidence Interval Method from sequential trials (1989)

• Mehta adapts this method by applying the results of Muller & Schafer to allow for the adaptive design change

http://bigfoot.hivnet.ubc.ca:8991/other_topics/rwoods/CyrusMehta_JSM2006.pdf

Case 2: Adaptive Randomizations

• Designs in which allocation probabilities change over the course of the study

• Why? 1) Ensure more patients receive better treatment2) Ensure balance between allocations

• Examples include: Play the Winner Rule, Randomized Play the Winner Rule, Drop the Loser rule, Efron’s biased coin design, etc

…more to come on these….

Some serious examples…1) Connor (1994, NEJM) report a trial

of AZT to reduce rate of mother to child HIV transmission; coin toss randomization used with results:

AZT: 20/239 transmissions

Placebo: 60/238 transmissions

So transmission in control group was 3 times rate in treated group.

Some serious examples…2) Bartlett (1985, Pediatrics) report

study of ECMO in infants; Play The Winner rule used with results:

ECMO: 0/11 deaths

Control: 1/1 deaths → study stopped

- Follow-up study proceeded with very high death rate in control group (40% versus 3% in ECMO)

- See debate in Statistical Science over this! (Nov.1989, Vol 4, No. 4, p298)

Some allocation methods…1) Simple Play the Winner Rule- First patient is randomized by a coin

toss- If patient is Rx success, next patient

gets same Rx; if Rx failure, next patient gets other Rx (Zelen, 1969)

Pros/Cons:- Should put more patients on better Rx- Need current patient’s outcome

before next patient allocated- Not a randomized design

Some allocation methods…

2) Randomized Play the Winner Rule [RPW(Δ,μ,β)]- Start with an urn with Δ red balls, Δ blue balls

inside (one colour per Rx group)- Patient randomized by taking a ball from urn;

ball is then replaced- When patient outcome is obtained, the urn

changes in the following way:

i) If Rx success: we add μ balls of current Rx to urn and β balls of other Rx

ii) If Rx fails: we add β balls of current Rx to urn and μ balls of other Rx

where μ ≥ β ≥ 0


2) RPW(Δ,μ,β) continued…- Commonly discussed design is RPW(0,1,0)

[See Wei and Durham: JASA, 1978]

Pros/Cons:- Can lead to selection bias issues even in a

blinded scenario if investigator enrolls selectively

- Should put more patients on better Rx- Does not require instantaneous outcome- Implementation can be difficult- What about the analysis of such data?


3) Drop the Loser Rule [DL(k)]- An urn contains K+1 types of balls, 1 for each

Rx 1, .., K, plus an “immigration” ball

- Initially, there are Z0,K balls of type K in the urn

- After M draws, the urn composition is equal to ZM = (ZM,0,ZM,1,…, ZM,K)

- To allocate patient, draw a ball; if it is type K, give patient Rx K – ball is not replaced!

- Response is observed on patient; if successful, replace ball in urn (thus ZM= ZM+1). If failure, do not replace ball (thus ZM+1,K= ZM,K-1 for Rx K).

…more…


3) Drop the Loser Rule continued…- If immigration ball drawn, replace the ball,

and add to the urn one of each of the Rx balls

Pros/Cons:- Can accommodate several Rx groups- Also should put patients on the better Rx- Can also be extended to delayed response- Again, what about analysis of data?- Implementation is also not easy

For all of these methods…• Extensive discussion in the literature about how

these various methods for allocation behave in simulation

• Some issues include:

1) how these methods perform when multiple Rx’s exist and variation in efficacy of Rx’s is high

2) how to balance discrimination of efficacy of Rx’s with minimizing number of patients allocated to poorer Rx’s

• Many, many, many generalizations of these methods to improve the “undesirables” of previous incarnations

• Interim analyses versus adaptive randomization

At the JSM some talks included…

• Re-sampling methods for Adaptive Designs• Issues associated with non-inferiority and

superiority trials and adaptive designs• Dynamic Rx Allocation and regulatory issues

(EMEA’s request for analysis)• General commentary on risks and benefits of

adaptive methods in clinical trials

In general, a very active area right now!

- Also Feifang Hu gave a recent talk at UBC!

http://bigfoot.hivnet.ubc.ca:8991/other_topics/rwoods/Chris_Khedouri_JSM06_806b.ppt

http://bigfoot.hivnet.ubc.ca:8991/other_topics/rwoods/Feifanghu.pdf



Bioequivalence Studies

I will attempt to cover (briefly):

• Purpose of a bioequivalence study

• Type of data typically collected

• Common methods of data analysis and hypothesis testing/estimation

• Additional comments

What is bioequivalence?• Bioequivalence (BE) studies are performed to

demonstrate that different formulations or regimens of drug product are similar in terms of efficacy and safety

• BE Studies are done even when formulations are identical between new and old drugs, but type of delivery differ (capsule vs. tablet); could also be generic versus previously patented drug

• Even small changes to formulation can affect bioavailability/absorption/etc so BE studies can reassure regulators new formulation is good WITHOUT repeating entire drug development program (e.g. several phase III trials with clinical endpoints)

Typical Study Design• Typically, BE studies are done as

cross-over trials in healthy volunteer subjects

• Each individual will be administered two formulations (Reference and Test) in one of two sequences (e.g. RT and TR)Sequence Rx

1Wash-out

Rx 2

# subjects

1 (RT) R --- T n/2

2 (TR) T --- R n/2

* Wash-out is period of time where patient takes neither of the formulations

Typical Outcomes• In Clinical Pharmacology (CP) and BE

studies, the central outcomes are pharmacokinetic (PK) summaries

• These PK measures have more to do with what the body does with the drug, than what the drug does to the body

• Many of the outcomes of interest are taken from the drug concentration time curve and include: AUC(0-t), AUC(0-∞), Tmax, Cmax, T1/2

….more to come on these…

Time (hours)

Dru

g C

on

cen

tra

tion

- m

g/L

0 12 24 36 44

0

20

40

60

80

100

Plasma Concentration (mg/L) versus Time (hours)

CMAX

TMAX

AUC

More on these outcomes…• FDA defines BE as: “the absence of a

significant difference in the rate and extent to which the active ingredient becomes available at site of drug action”

• AUC is taken as the measure of extent of exposure; Cmax as the rate of exposure

• In general, these two outcomes are assumed to be log-normally distributed

• A small increase/decrease of Cmax can result in a safety issue → T and R cannot differ “too much”

Testing what is ”too much”…• The chosen hypothesis testing procedure

by regulatory agencies has been called TOST (two one-sided tests)

• For each PK parameter, we apply a set of two one-sided hypothesis tests to determine if the formulations are bioequivalent

• One of the hypotheses is that the data in the new formulation are “too low” (H01) relative to the reference; another hypothesis is that they are “too high” (H02)

…mathematically we have…

Testing what is ”too much”…

The two tests can be written:

H01: μT - μR ≤ -Δ

versus

H11: μT - μR ≥ -Δ

And then…

H02: μT - μR ≥ Δ

versus

H12: μT - μR ≤ Δ

Testing what is ”too much”…• The testing parameter Δ was chosen by

the FDA to be Δ=log(1.25)

• Both of the two tests are carried out with a 5% level of significance

• Thus, there is a maximum 5% chance of declaring two products bioequivalent when in fact they are not

• TOST has some drawbacks: drugs which small changes in dose → BIG change in clin. response, test limit too narrow for high variability products, doesn’t address individual BE (“Can I safely switch my patient’s formulation?”)

Models for the outcomes…

• They suggest modeling data from the two period, two Rx cross-over via a linear mixed model:

• Let Yijk be the (log-transformed) response obtained from Subject k, in period j, in sequence i, taking formulation l

• If we assume no carry-over effects the model resembles:

Yijk= μi + λj + πl + βk + εijkl

where μi, λj, and πl are fixed; βk, εijkl are random

Models for the outcomes…

So to estimate πT – πR we are supposed to take:

½ [(Y21-Y22)-(Y11-Y12)] which in expectation is equal to the treatment difference

Yij is the sample mean from the i,j’th cell above

Group Period 1 Period 2

1 (RT) μ1 + λ1 + πR

μ1 + λ2 + πT

2 (TR) μ2 + λ1 + πT

μ2 + λ2 + πR

* μ parameters could likely be dropped

Some general comments…• Guidelines from the FDA on methodology

are very specific in this field (e.g. numerical method for AUC, “goal posts” for determining BE, distributional assumptions, etc)

• Interesting history of how these regulations came about/evolved:

- 75/75 rule (70’s): 75% of subjects’ individual ratios of T to R must be ≥ 0.75 to prove BE

- 80/20 rule (80’s): set up H0 such that the two formulations are equal. If the test is NOT rejected, and a difference of 20% not shown, then the formulations are BE

SAS tricks!!!Some mixing of UNIX commands and SAS:

%SYSEXEC %str(mkdir MYNEWDATA; cd MYNEWDATA; mkdir Output; cd ..;);

%let value=%sysget(PWD);%put &VALUE;

libname data "MYNEWDATA";

data data.junk; variable="ONE VARIABLE";run;

ods rtf file="MYNEWDATA/Output/file.rtf";proc print; title "&VALUE.";

run;

Competing risks models in the monogenic cancer

susceptibility syndromes

7 Aug 2006, JSM 2006 Session #99

Philip S. Rosenberg, Ph.D.Bingshu E. Chen, Ph.D.

Biostatistics BranchDivision of Cancer Epidemiology and Genetics

National Cancer Institute

AcknowledgementsStatistics

Bingshu E Chen

SCNDavid C Dale

Blanche P AlterSevere Chronic Neutropenia International Registry

FABlanche P Alter

Wolfram Ebell

Eliane Gluckman

Gerard Socié

HBOCMark H Greene

Joan L Kramer

OutlineI. Monogenic cancer susceptibility syndromes

a) Fanconi Anemia (FA)b) Severe Congenital Neutropenia (SCN)c) Hereditary Breast and Ovary Cancer (HBOC)

II. Cause-specific hazard functions for competing risks

• B-Spline

III. Individualized Risks• Covariates

IV. Cumulative Incidence versus Actuarial Risk

V. Conclusions

Cancer Syndromes

•Single gene defects predisposes to more than one event type (pleitropy).

•Occurrence of one event type censors or alters the natural history of other event types.

•Heterogeneity.

Competing risks theory provides a unifying framework.

FA

Death

BMT

AML

Solid

Tumor

SCN

MDS/AML

SepsisDeath

HBOC

Breast Cancer

FanconiAnemia (FA)

SevereCongenital

Neutropenia (SCN)

HereditaryBreast and OvaryCancer (HBOC)

GENES: FANCA, FANCB FANCC, FANCD1/BRCA2, FANCD2, FANCE, FANCF, FANCG, (FANCI), FANCJ, FANCL, FANCM

ELA2, other genes BRCA1, BRCA2, other genes

Competing Risks

Death

•Cause-specific hazards:

•Cumulative Incidence (In the presence of other causes):

•Actuarial Risk (“Removes” other causes):

Modeling the Natural History

0

1( ) lim [ , ) & | , 1, ,kh t P T t t K k T t k K

, 0( ) 1 exp ( ) , 1, ,

t

A k kF t h s ds k K

0 01

( ) ( )exp ( ) , 1, ,Kt a

k k kk

F t h a h s ds da k K

B-Spline Models of Cause-Specific Hazards:

, , ,1

( ) ( ) , 0km

k j k j k j kj ORDER

h t B t

0 10 20 30 40 50

0

0.25

0.5

0.75

1

t

B-S

plin

e B

asis

Fun

ctio

n

Linear combination

min ,1, ,

ik ik ik

ik ik ik

X Y Ci n

I Y C

0

1

( ) log ( ) ( )ikxn

k ik ik ki T

h x h s ds

•For each cause k separately:

•Knot selection by Akaike Information Criterion (AIC).

•Variance calculations via Bootstrap (because of constraint).

Rosenberg P.S. Biometrics 1995;51:874-887

Fanconi Anemia (n=145) Natural History

Same modeling approach identifies distinct hazard curves.

Rosenberg P.S. et al. Blood 2003;101:2136

Rosenberg, P. S. et al. Blood 2006;107:4628-4635

Severe Congenital Neutropenia (n=374)Natural History

HBOC – BRCA1 (n=98)Natural History

Individualized Risks •Covariates:

•A covariate may affect one endpoint or multiple endpoints.

•(Different endpoints may be affected by different covariates.)

•Analysis:

•Cox regression models for each endpoint.

•Define a summary categorical risk variable.

•Estimate hazards and Cumulative Incidence for each level.

FA: Impact of Congenital Abnormalities

Covariate: No Congenital

Abnormalities

Covariate: Specific Congenital

Abnormalities


•Abnormalities are associated only with hazard of BMF.

•BMF curve goes up, other curves go down.

Years on RxYears on Rx

Covariate: Low Response

Covariate: Good Response

SCN: Impact of Hypo-Responsiveness to Rx

•Low Response is associated with hazard of both endpoints.

•Both curves go up or down together.


Actuarial Risk vs. Cumulative Incidence

•Actuarial Risk:•“Removes” other causes.

•Estimate using 1 – KM curve.

•Cumulative Incidence:

•Impact of each cause in real-world setting.

•Estimate using non-parametric MLE orspline-smoothed hazards.

, 0( ) 1 exp ( ) , 1, ,

t

A k kF t h s ds k K

0 01

( ) ( )exp ( ) , 1, ,Kt a

k k kk

F t h a h s ds da k K


Example: Fanconi Anemia1-KM vs. Cumulative Incidence

•If you removed other causes, risk of Solid Tumor by age 50 would increase from ~25% to ~75%.

Extrapolating 1 – KM: A Cautionary Tale

Disease Intervention 1-KM Prediction

Result of Intervention

Observed Cumulative Incidence

HBOC BRCA1**

Oophorectomy to reduce risk

of Ovary Cancer

Moderate increase in Cumulative Incidence of

Breast Cancer

Risk of Breast Cancer declines

by 2.6-fold

Much lower than

expected.

**Kramer, J. L. et al. JCO 2005; 23: 8629-8635

Conclusions•B-spline models of cause-specific hazards elucidate the natural history.

•Physicians understand Cumulative Incidence (our experience).

•Stand-alone software will be available from us. [email protected]

•Much room for methodological refinements.

an update from the jsm 2006 - seattle ryan woods – january 8, 2007

Documents