creating scalable, secured, clinical and genomics platforms...

54
Paul Avillach, MD, PhD Assistant Professor of Biomedical Informatics and Pediatrics Harvard Medical School - Boston Children’s Hospital Assistant Professor of Epidemiology Harvard T.H. Chan School of Public Health Creating scalable, secured, clinical and genomics platforms across clouds integrating i2b2 and tranSMART Platforms

Upload: others

Post on 25-May-2020

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Creating scalable, secured, clinical and genomics platforms …blogs.aphp.fr/wp-content/blogs.dir/221/files/2017/10/T3K... · 2017-10-18 · Variant calling Variant annotation I2b2/tranSMART

Paul Avillach, MD, PhD Assistant Professor of Biomedical Informatics and Pediatrics

Harvard Medical School - Boston Children’s Hospital

Assistant Professor of Epidemiology

Harvard T.H. Chan School of Public Health

Creating scalable, secured, clinical

and genomics platforms across clouds

integrating i2b2 and tranSMART Platforms

Page 2: Creating scalable, secured, clinical and genomics platforms …blogs.aphp.fr/wp-content/blogs.dir/221/files/2017/10/T3K... · 2017-10-18 · Variant calling Variant annotation I2b2/tranSMART

Phenotypes Genotypes

Biosamples

Geno/Pheno integration

Insights & many iterations

Registries EHR

Baseline Genotypes

Derived Genotypes

Baseline Phenotypes

Derived Phenotypes

Burden, LOF, etc Custom algos, etc

Page 3: Creating scalable, secured, clinical and genomics platforms …blogs.aphp.fr/wp-content/blogs.dir/221/files/2017/10/T3K... · 2017-10-18 · Variant calling Variant annotation I2b2/tranSMART
Page 4: Creating scalable, secured, clinical and genomics platforms …blogs.aphp.fr/wp-content/blogs.dir/221/files/2017/10/T3K... · 2017-10-18 · Variant calling Variant annotation I2b2/tranSMART

https://www.i2b2.org/disease/index.html

Page 5: Creating scalable, secured, clinical and genomics platforms …blogs.aphp.fr/wp-content/blogs.dir/221/files/2017/10/T3K... · 2017-10-18 · Variant calling Variant annotation I2b2/tranSMART

v1.1

I2b2 database still there But NOT i2b2 jboss app

Page 6: Creating scalable, secured, clinical and genomics platforms …blogs.aphp.fr/wp-content/blogs.dir/221/files/2017/10/T3K... · 2017-10-18 · Variant calling Variant annotation I2b2/tranSMART
Page 7: Creating scalable, secured, clinical and genomics platforms …blogs.aphp.fr/wp-content/blogs.dir/221/files/2017/10/T3K... · 2017-10-18 · Variant calling Variant annotation I2b2/tranSMART
Page 8: Creating scalable, secured, clinical and genomics platforms …blogs.aphp.fr/wp-content/blogs.dir/221/files/2017/10/T3K... · 2017-10-18 · Variant calling Variant annotation I2b2/tranSMART

www.avillach-lab.hms.harvard.edu

Alumni

Staff / Software developers Students / Postdocs

Cartik Li Romain Qiu-Yue

Libby

Ranjay

Anoush

Ombeline Maxime Antoine Laurie Gabor Thomas

Jeremy

Alex

Antoine Claire

Andrew Andre Sean

Jaspreet

Emmanuelle

Joany

Yuri Samuel Michael Sushma Pei Ephi

Mahdi Niloofar

Alba

Jason

Laura Haishuai

Carlos Romina Cassandra Sophia Alyssa

Page 9: Creating scalable, secured, clinical and genomics platforms …blogs.aphp.fr/wp-content/blogs.dir/221/files/2017/10/T3K... · 2017-10-18 · Variant calling Variant annotation I2b2/tranSMART

Open Source

Research Infrastructure

Page 10: Creating scalable, secured, clinical and genomics platforms …blogs.aphp.fr/wp-content/blogs.dir/221/files/2017/10/T3K... · 2017-10-18 · Variant calling Variant annotation I2b2/tranSMART

Harvard IRB security levels: Level 5 - Extremely sensitive information Level 4 – Very sensitive information Level 3 – Sensitive, or Confidential information Level 2 - Benign information to be held confidentially Level 1 - Non-confidential research information

Page 11: Creating scalable, secured, clinical and genomics platforms …blogs.aphp.fr/wp-content/blogs.dir/221/files/2017/10/T3K... · 2017-10-18 · Variant calling Variant annotation I2b2/tranSMART

1. Authentication

Page 12: Creating scalable, secured, clinical and genomics platforms …blogs.aphp.fr/wp-content/blogs.dir/221/files/2017/10/T3K... · 2017-10-18 · Variant calling Variant annotation I2b2/tranSMART

Secured access control

1. Authentication

Who are you?

2. Authorization

What are you allowed to do?

Level 0 : Authenticated BUT no access to data

Level 1 : Aggregated data

Level 2 : patient level data

Page 13: Creating scalable, secured, clinical and genomics platforms …blogs.aphp.fr/wp-content/blogs.dir/221/files/2017/10/T3K... · 2017-10-18 · Variant calling Variant annotation I2b2/tranSMART

Play with ExAC API: http://exac.hms.harvard.edu

Play with PIC-SURE API: https://bd2k-picsure.hms.harvard.edu

2 Datathons

Page 14: Creating scalable, secured, clinical and genomics platforms …blogs.aphp.fr/wp-content/blogs.dir/221/files/2017/10/T3K... · 2017-10-18 · Variant calling Variant annotation I2b2/tranSMART
Page 15: Creating scalable, secured, clinical and genomics platforms …blogs.aphp.fr/wp-content/blogs.dir/221/files/2017/10/T3K... · 2017-10-18 · Variant calling Variant annotation I2b2/tranSMART

HIPPA Compliance on AWS

Page 16: Creating scalable, secured, clinical and genomics platforms …blogs.aphp.fr/wp-content/blogs.dir/221/files/2017/10/T3K... · 2017-10-18 · Variant calling Variant annotation I2b2/tranSMART
Page 17: Creating scalable, secured, clinical and genomics platforms …blogs.aphp.fr/wp-content/blogs.dir/221/files/2017/10/T3K... · 2017-10-18 · Variant calling Variant annotation I2b2/tranSMART
Page 18: Creating scalable, secured, clinical and genomics platforms …blogs.aphp.fr/wp-content/blogs.dir/221/files/2017/10/T3K... · 2017-10-18 · Variant calling Variant annotation I2b2/tranSMART
Page 19: Creating scalable, secured, clinical and genomics platforms …blogs.aphp.fr/wp-content/blogs.dir/221/files/2017/10/T3K... · 2017-10-18 · Variant calling Variant annotation I2b2/tranSMART

RedOctober

Page 20: Creating scalable, secured, clinical and genomics platforms …blogs.aphp.fr/wp-content/blogs.dir/221/files/2017/10/T3K... · 2017-10-18 · Variant calling Variant annotation I2b2/tranSMART

UDN Patient centric information commons

• Demographics

• Online registration Form

• PhenoTips

• HPO clinical terms

• Ethnicity

• Diagnosis OMIM

• Knowledge extracted from

referral letters (via NLP)

• Type of sequencing

• Candidate genes

• Candidate variants

• Full Annotated VCF

• 524 Whole genomes

• 616 Whole exomes

• 343 probands

• 259 trios • + ….

i2b2/tranSMART/gNOME ALL UDN data available

Page 21: Creating scalable, secured, clinical and genomics platforms …blogs.aphp.fr/wp-content/blogs.dir/221/files/2017/10/T3K... · 2017-10-18 · Variant calling Variant annotation I2b2/tranSMART

Modular Open Source

Research platform

1 2

Page 22: Creating scalable, secured, clinical and genomics platforms …blogs.aphp.fr/wp-content/blogs.dir/221/files/2017/10/T3K... · 2017-10-18 · Variant calling Variant annotation I2b2/tranSMART

UDN338723

• 17 year old male with skeletal features

suggestive of a hereditary connective tissue

disease, also with proximal renal tubular

acidosis, medullary nephrocalcinosis,

hypophosphatemic rickets, polycythemia and

chronic kidney disease, stage 3

• Marfan syndrome Confirmed

• FBN1 c.871G>T/p.E291X

Page 23: Creating scalable, secured, clinical and genomics platforms …blogs.aphp.fr/wp-content/blogs.dir/221/files/2017/10/T3K... · 2017-10-18 · Variant calling Variant annotation I2b2/tranSMART
Page 24: Creating scalable, secured, clinical and genomics platforms …blogs.aphp.fr/wp-content/blogs.dir/221/files/2017/10/T3K... · 2017-10-18 · Variant calling Variant annotation I2b2/tranSMART
Page 25: Creating scalable, secured, clinical and genomics platforms …blogs.aphp.fr/wp-content/blogs.dir/221/files/2017/10/T3K... · 2017-10-18 · Variant calling Variant annotation I2b2/tranSMART

[….]

Page 26: Creating scalable, secured, clinical and genomics platforms …blogs.aphp.fr/wp-content/blogs.dir/221/files/2017/10/T3K... · 2017-10-18 · Variant calling Variant annotation I2b2/tranSMART

OPEN DATA

Page 27: Creating scalable, secured, clinical and genomics platforms …blogs.aphp.fr/wp-content/blogs.dir/221/files/2017/10/T3K... · 2017-10-18 · Variant calling Variant annotation I2b2/tranSMART

https://nhanes.hms.harvard.edu

CDC National Health and Nutrition Examination Survey

Page 28: Creating scalable, secured, clinical and genomics platforms …blogs.aphp.fr/wp-content/blogs.dir/221/files/2017/10/T3K... · 2017-10-18 · Variant calling Variant annotation I2b2/tranSMART

Gold standard resource for phenotypes and exposure

information: National Health and Nutrition

Examination Survey since the 1960s

now biannual: 1999 onwards

5000 participants per year

1 http://www.cdc.gov/nchs/nhanes.htm

1999-2006:

N=41,474

1,181 total measurements!

>250 exposures (serum + urine)

>85 quantitative clinical traits

(e.g., serum glucose, lipids,

body mass index)

nutritional and health status interviews

Page 29: Creating scalable, secured, clinical and genomics platforms …blogs.aphp.fr/wp-content/blogs.dir/221/files/2017/10/T3K... · 2017-10-18 · Variant calling Variant annotation I2b2/tranSMART

I2b2/tranSMART user Interface: https://nhanes.hms.harvard.edu

41k patients 2k patient level environmental variables

CDC National Health and Nutrition Examination Survey

10 min video https://vimeo.com/182576739

Page 30: Creating scalable, secured, clinical and genomics platforms …blogs.aphp.fr/wp-content/blogs.dir/221/files/2017/10/T3K... · 2017-10-18 · Variant calling Variant annotation I2b2/tranSMART

• 56 full exomes with Phenotypic data from

1000 Genome project:

• no registration at all

https://demo-ngs.hms.harvard.edu

Page 31: Creating scalable, secured, clinical and genomics platforms …blogs.aphp.fr/wp-content/blogs.dir/221/files/2017/10/T3K... · 2017-10-18 · Variant calling Variant annotation I2b2/tranSMART

Physical location e.g. Chr:start-end Cytoband … Gene e.g. Gene name Variant function … Gene set e.g. Pathway Molecular process … Predicted variant impact e.g. SIFT PolyPhen … Conservation

e.g. GERP PhyloP … Population frequency

e.g. 1000 Genomes ESP 6500 … Clinical significance

e.g. ClinVar OMIM … Expression patterns

e.g. GTEx BrainSpan … Transcriptional regulation

e.g. ENCODE TFBS Histone modifications …

Exome sequence

data processing Variant calling Variant annotation I2b2/tranSMART

input

Comprehensively

annotated variants

SNPs,

indels

Individual genotypes

> > >

ANNOVAR

https://demo-ngs.hms.harvard.edu

Page 32: Creating scalable, secured, clinical and genomics platforms …blogs.aphp.fr/wp-content/blogs.dir/221/files/2017/10/T3K... · 2017-10-18 · Variant calling Variant annotation I2b2/tranSMART

Exome variant annotations

Phenotypic variables

https://demo-ngs.hms.harvard.edu

Page 34: Creating scalable, secured, clinical and genomics platforms …blogs.aphp.fr/wp-content/blogs.dir/221/files/2017/10/T3K... · 2017-10-18 · Variant calling Variant annotation I2b2/tranSMART

Rare disease registry Patient count Variables Version

Clinical Registry Investigating Bardet-Biedl Syndrome 180 708 V4

International Pachyonychia Congenita Research Registry 569 496 V3

International Plastic Bronchitis registry 66 63 V2

Intracranial Hypertension Registry 963 77 V4

North American Malignant Hyperthermia Registry 2,107 162 V4

Wolfram Syndrome International Registry 124 580 V2

Coordination of Rare Diseases at Sanford Registry 1,218 46 V2

including: including: Additional

data:

National Ataxia Foundation 487 16 V1

International WAGR syndrome association 50 380 V1

Cornelia De Lange Syndrome Registry 61 486 V1

Total 5,227 3,014

Rare Cancer Registry (soon) 800 1,800

Centronuclear and myotubular myopathy (soon)

https://grdr.hms.harvard.edu

Page 36: Creating scalable, secured, clinical and genomics platforms …blogs.aphp.fr/wp-content/blogs.dir/221/files/2017/10/T3K... · 2017-10-18 · Variant calling Variant annotation I2b2/tranSMART

Phelan McDermid Syndrome

• Extremely rare genetic disease: ~1100 diagnosed patients

worldwide today

• Autistic traits, intellectual deficiency, slight dysmorphic

features

• Also called deletion 22q13 syndrome

• Caused by deletions of the terminus of chromosome 22

Page 37: Creating scalable, secured, clinical and genomics platforms …blogs.aphp.fr/wp-content/blogs.dir/221/files/2017/10/T3K... · 2017-10-18 · Variant calling Variant annotation I2b2/tranSMART

Heterogeneity of

the genetic material

alterations

1 to 140 affected genes

Page 38: Creating scalable, secured, clinical and genomics platforms …blogs.aphp.fr/wp-content/blogs.dir/221/files/2017/10/T3K... · 2017-10-18 · Variant calling Variant annotation I2b2/tranSMART

Heterogeneity of the phenotypes

All organs can be affected:

• Neuro-developmental

• Facial dysmorphic features

• GERD (Gastro-Esophagal Reflux)

• Renal problems

• Lax joints

• Dysplastic toenails

• Congenital cardiac diseases

• …

Sources : globalgenes.org autismspeaks.org sfari.org

Page 39: Creating scalable, secured, clinical and genomics platforms …blogs.aphp.fr/wp-content/blogs.dir/221/files/2017/10/T3K... · 2017-10-18 · Variant calling Variant annotation I2b2/tranSMART

Deep phenotyping

Knowledge from Clinical Notes

Page 40: Creating scalable, secured, clinical and genomics platforms …blogs.aphp.fr/wp-content/blogs.dir/221/files/2017/10/T3K... · 2017-10-18 · Variant calling Variant annotation I2b2/tranSMART

Peter B. Jensen, Lars J. Jensen and Søren Brunak, Nat Rev Genet. 2012

Page 41: Creating scalable, secured, clinical and genomics platforms …blogs.aphp.fr/wp-content/blogs.dir/221/files/2017/10/T3K... · 2017-10-18 · Variant calling Variant annotation I2b2/tranSMART
Page 42: Creating scalable, secured, clinical and genomics platforms …blogs.aphp.fr/wp-content/blogs.dir/221/files/2017/10/T3K... · 2017-10-18 · Variant calling Variant annotation I2b2/tranSMART
Page 43: Creating scalable, secured, clinical and genomics platforms …blogs.aphp.fr/wp-content/blogs.dir/221/files/2017/10/T3K... · 2017-10-18 · Variant calling Variant annotation I2b2/tranSMART
Page 44: Creating scalable, secured, clinical and genomics platforms …blogs.aphp.fr/wp-content/blogs.dir/221/files/2017/10/T3K... · 2017-10-18 · Variant calling Variant annotation I2b2/tranSMART

BEFORE validation AFTER validation

Pop-up validation window

Page 45: Creating scalable, secured, clinical and genomics platforms …blogs.aphp.fr/wp-content/blogs.dir/221/files/2017/10/T3K... · 2017-10-18 · Variant calling Variant annotation I2b2/tranSMART

Objective:

Identify phenotypes linked

with the deletion of other genes than SHANK3

Chr deletion size

Phenotypes

Page 46: Creating scalable, secured, clinical and genomics platforms …blogs.aphp.fr/wp-content/blogs.dir/221/files/2017/10/T3K... · 2017-10-18 · Variant calling Variant annotation I2b2/tranSMART

Results – Kidney malformations associated with other genes that SHANK3

Page 47: Creating scalable, secured, clinical and genomics platforms …blogs.aphp.fr/wp-content/blogs.dir/221/files/2017/10/T3K... · 2017-10-18 · Variant calling Variant annotation I2b2/tranSMART
Page 48: Creating scalable, secured, clinical and genomics platforms …blogs.aphp.fr/wp-content/blogs.dir/221/files/2017/10/T3K... · 2017-10-18 · Variant calling Variant annotation I2b2/tranSMART

Results – Gradient of gross motor delays showing a cumulative effect

Page 49: Creating scalable, secured, clinical and genomics platforms …blogs.aphp.fr/wp-content/blogs.dir/221/files/2017/10/T3K... · 2017-10-18 · Variant calling Variant annotation I2b2/tranSMART

1.0_HOTFIX

1.7.x

Harvard – DBMI AvillachLab improvements

Page 50: Creating scalable, secured, clinical and genomics platforms …blogs.aphp.fr/wp-content/blogs.dir/221/files/2017/10/T3K... · 2017-10-18 · Variant calling Variant annotation I2b2/tranSMART

1.7.x

Harvard – DBMI AvillachLab improvements

16.2 (17.1?)

Page 51: Creating scalable, secured, clinical and genomics platforms …blogs.aphp.fr/wp-content/blogs.dir/221/files/2017/10/T3K... · 2017-10-18 · Variant calling Variant annotation I2b2/tranSMART

An i2b2/tranSMART

project management committee

If interested to join, email us:

Diane:

[email protected]

Paul:

[email protected]

Page 53: Creating scalable, secured, clinical and genomics platforms …blogs.aphp.fr/wp-content/blogs.dir/221/files/2017/10/T3K... · 2017-10-18 · Variant calling Variant annotation I2b2/tranSMART

www.avillach-lab.hms.harvard.edu

Alumni

Staff / Software developers Students / Postdocs

Cartik Li Romain Qiu-Yue

Libby

Ranjay

Anoush

Ombeline Maxime Antoine Laurie Gabor Thomas

Jeremy

Alex

Antoine Claire

Andrew Andre Sean

Jaspreet

Emmanuelle

Joany

Yuri Samuel Michael Sushma Pei Ephi

Mahdi Niloofar

Alba

Jason

Laura Haishuai

Carlos Romina Cassandra Sophia Alyssa

Page 54: Creating scalable, secured, clinical and genomics platforms …blogs.aphp.fr/wp-content/blogs.dir/221/files/2017/10/T3K... · 2017-10-18 · Variant calling Variant annotation I2b2/tranSMART

Postdoc? / Mobilité?

www.avillach-lab.hms.harvard.edu

Paul_Avillach.hms.harvard.edu