lara mangravite sage bionetworks on behalf of the ra challenge organizing team the dream rheumatoid...
TRANSCRIPT
LARA MANGRAVITESAGE BIONETWORKS
ON BEHALF OF THE RA CHALLENGE ORGANIZING TEAM
The DREAM Rheumatoid Arthritis Responder Challenge:
Motivation, Data, Scoring and Results
Challenge Organizers
Eli Stahl, Mt SinaiGaurav Pandey, Mt SinaiJing Cui, Brigham and Women’sAndre Falcao, U Lisbon Robert Plenge, MerckPeter Gregersen, Feinstein InstituteJeff Greenberg, CorronaDimitrios Pappas, CorronaKaleb Michaud, Arthritis Internet RegistryGenerators of Training Dataset
Solly SiebertsAbhi PratapChristine SuverBruce HoffThea NormanVenkat BalagurusamyStephen FriendGustavo Stolovitzky
Funders
~30% of RA patients fail to respond to anti-TNF therapy
Rheumatoid Arthritis Treatment
-- Predicting nonresponse would assist in precision medicine, clinical trial design, and development of new therapies
Robert Plenge
n=2,706
Pharmacogenetics of antiTNF response
Drug NSNP-
heritability (se)
P-value
All patients 2617 0.18 (0.10) 0.02
etanercept 716 0 (0.34) 0.5
infliximab 857 0.62 (0.29) 0.02
adalimumab 1027 0.36 (0.25) 0.08
infliximab + adalimumab
1899 0.36 (0.13) 0.003
Ciu and Stahl et al PLoS Genetis 2013 Eli Stahl
Rationale
Given sizable estimated heritability, is it possible to use genetic features to predict treatment response?
Polygenic approach: Combined influence of weak effects
Population subtypes: Not all individuals react similarly Does genetic heritability foretell genetic prediction?
RA Responder Challenge Design
Discovery (phase I)
GWAS of treatment
response in RA(n≈2,700 patients)
Genomic data(e.g.,
expression profiling)
Polygenic SNP
predictor of response
Refine model
Plenge et. al. Nature Genetics 2013
Discovery (phase I)
Validation (phase II)GWAS of treatment
response in RA(n≈2,700 patients)
Genomic data(e.g.,
expression profiling)
Polygenic SNP
predictor of response
Refine model
Submit models
GWAS of treatment
response in RA(n≈1,100 patients)
Score models
RA Responder Challenge Design
Plenge et. al. Nature Genetics 2013
Discovery (phase I)
Validation (phase II)GWAS of treatment
response in RA(n≈2,700 patients)
Genomic data(e.g.,
expression profiling)
Polygenic SNP
predictor of response
Refine modelGWAS of treatment
response in RA(n≈1,100 patients)
Submit models
Score models
RA Responder Challenge Design
Plenge et. al. Nature Genetics 2013
RA Challenge Data
Genotypes~ 2.3 million SNPs
Clinical ~ 6 traits
N=2076
Response
Discovery Dataset
Combine set from 4 studies
Test Data
Genotypes~ 2.3 million SNPs
Clinical ~ 6 traits
N=723
Generated for this challenge
RA Challenge: Build the best possible predictors of anti-TNFa response in RA
TEAM PHASE February - June 2014
Self-aggregate into teams and build the best possible predictor of response.
COMMUNITY PHASE July - October 2014
Work together across teams to assess the contribution of genetics to prediction.
Team PhaseCommunity
Phase
RA Responders Challenge
Predict treatment response as measured by change in disease activity score (DAS28) in response to ant-TNFa therapy.
Scoring: Average rank of pearson correlation and spearman correlation.
Identify poor responders to anti-TNFa therapy as defined by EULAR criteria.
Scoring: Average rank of AUC and PR.
Team Phase Results
Subchallenge 1:Predicting deltaDAS
Subchallenge 2: Predicting nonresponders
Best models: Team Guan Lab Best models: Team Guan Lab&
Team SBI_Lab
Solly Sieberts 32 teams
The Community Phase (July – October)
Work in collaboration to determine:
-- Whether genetic information contributes in a meaningful way to predictions?
-- Best possible predictors of response. -- What components of the modeling
approaches are most beneficial for this question.
Community Phase Logistics
First part: teams split into groups and shared knowledge to help inform one another’s efforts
Second part: all teams came together to devise an analytical plan to explicitly address these questions.
Teams share ideas and then work individually to provide:
Do models using genetic features improve on prediction relative to clinical models?
What is the contribution of feature selection vs. modeling algorithm on performance?
Does the use of biological priors in feature selection improve relative to random selection?
Can supervised ensemble approach improve upon individual predictions?
Conclusions
Gaussian Process Regression appears to work best with this type of problem.
SNP selection more important than algorithmic selection in most cases.
Genetic information improves prediction of nonresponders over use of clinical information.
Ability to predict response based on clinical features may be valuable to clinicians in and of themselves.
Today’s Speakers: Best Performers from Independent Team Phase
Fan Zhu on behalf of Team Guan Lab A generic method for predicting clinical outcomes and
drug response
Javier Garcia-Garcia on behalf of Team SBI_Lab Predicting response to arthritis treatments:
regression-based gaussian processes on small sets of SNPs