genomics dual factors for physical lifecontents.kocw.or.kr/document/wcu/2012/bio_data... · 2007:...

Post on 04-Aug-2020

0 Views

Category:

Documents

0 Downloads

Preview:

Click to see full reader

TRANSCRIPT

1

Genomics Dual Factors for Physical Life◦ Genetic factors for systems healthcare◦ Acquired factors for systems healthcare

Opportunities and Challenges

Anatomy Microscope/Cell Biology Molecular Biology

Bioinformatics and Systems Biology

5

Personal Physical Life = f (Nature, Nurture)Nature: Genes – Personal Genome Project

Nurture: Environment, Food, Exercise, Medication, …

Data Collection Data Mining Understanding/Prediction

G1 G2 … … Gp E1 E2 … … Eq L1 L2 … … Lr

P1

……Pm

Feature SelectionModel-Based Data MiningNew Approaches ?

6

7

1 SNP in every 2kb of genomic sequences Synonymous vs. non-synonymous SNP

1….ATCCTGTACCTACGTGTACAATAGTA…..CTGATCATCTCTATGGG….2….ATCCTGTTCCTACGTGTACAATAGTA….. CTGATCATCTCTATGGG….3….ATCCTGTACCTACGTGTACAATAGTA…..CTGATCAGCTCTATGGG….

1 2 3

SNP1 SNP2

8

9

10

1G: Sanger

2G: Parallel

3G: Single Molecule

4G: Non-optical

11

12

Human Genome Project Consortium◦ 1990 ~ 2005 (16 years), US$ 3 billion (3조원)◦ Haploid from many anonymous donors (cf. RP11, a male from Buffalo, NY)

Celera Genomics◦ 1998 ~ 2005 (8 yrs), US$ 300 million (3천억원)◦ Consensus from five anonymous donors (including Craig Ventor)

2007: Market, US$ 20 million (2백억원) 2007: Knome, US$ 350,000 (3억5천만원) for diploid sequencing 2009: US$ 100,000 (1억원) – NIH RFP Objective 2011: US$ 20,000 (2천만원) – George Church’s prediction 2014: US$ 1,000 (1백만원) – NIH RFP Objective

13

6,099 GWAS studies as of Sep. 6, 201114

Pharmacogenomics

DNA(SNP) chip

Cf. HER2 Overexpress, Herceptin, Genentech, 199815

Disease risk assessment for 119 diseases◦ Clinical Reports (33) BRCA Cancer Mutations, Celiac Disease (소아지방변증) Diabetes, Parkinson’s Disease, Prostate Cancer, Rheumatoid Arthritis,

Resistance to HIV/AIDS, and so on◦ Research Reports (86) Asthma, Baldness, Bipolar Disorder, Breast Cancer, Food Preference,

Height, Longevity, Memory, Obesity … Ancestry tracking => New International Social networks?◦ Maternal line with mitochondrial DNA◦ Paternal line with Y chromosome

16

17

Interleukin Genetics, Inc. & Amway Global Cost: US$ 100~200 / per kit1. Kit-based sampling from oral cavity2. SBE(Single Base Extension)-based detection of

SNP markers3. Bioinformatics analysis for SNP-to-trait mapping4. Recommendation for nutrition, exercise, and

medication

18

19

Genomic sequences from whole genome parallel sequencing

Image from the sequencing machines (usually discarded after processing)

Raw sequence reads: ~300GB Genome-mapped sequences: ~300GB Binary compressed sequences: ~150GB Intermediate results: ~300GB Over 1TB/sample 1000 Genome => 1PB

20

21

(3 x 109) bp x 30 rd x 3 = ~ 3 x 1011 bytes = ~ 300GB

@HWI-ST621:206:B0202ACXX:1:1101:1216:2021 1:Y:0:ATCACG TitleNTTTANNNNTGAATNNTGTCAAAATTACAGAAGAACTGCAAGAATATCACATGGTACACTCATACAATCTCCACCCANANNNNNNNNNNNNNNNNNTTTGC Base+ Comment##################################################################################################### Base quality@HWI-ST621:206:B0202ACXX:1:1101:1116:2024 1:Y:0:ATCACGNCTTNNNNNCACAGNNTTTAACCTTTCTTTTCTTAGAGCACTTTAGAAACACTCTGCTTGTTATGTCTGCAAGTGGANANNNNNNNNNNNNNNNNNCCTTC+#####################################################################################################

22

HWI-ST621:206:B0202ACXX:1:1101:1128:2173 147 chr1 81092578 60 101M = 81092212 -467 AGGGCAGAATACCGTATCCTTGGAAAATTAAATAGTAAGAGGAGAGAGGCTTCAGTGGCAGACCATTCGGAAAGTGTGGGGAAATCCAGGAAGGAAAGTAN ##################################################################################################### XT:A:U NM:i:1 SM:i:37 AM:i:37 X0:i:1 X1:i:0 XM:i:1 XO:i:0 XG:i:0 MD:Z:100G0HWI-ST621:206:B0202ACXX:1:1101:1022:2177 73 chr3 110819717 37 101M = 110819717 0 NTCCNTTTTCATGCTGCTGATAAAGACATAGCTGAGACTGGGTAATTAAAAAAAAAAGCGGTTTAATGAACTCACAGTTTCACATGGCTGGGGGGGGCTCA ##################################################################################################### XT:A:U NM:i:4 SM:i:37 AM:i:0 X0:i:1 X1:i:0 XM:i:4 XO:i:0 XG:i:0 MD:Z:0G3A88A2C4

23

24

Clockwork Business Solutions ©

25

EDI (Electronic Data Interchange) OCS (Order Communication System) LIS (Laboratory Information System) PACS (Picture Archiving and Communication System) PIS (Pharmacy Information System) CIS (Clinical Information System) EMR (Electronic Medical Records) PHR (Personal Health Records) Etc…

26

27

28

29

30

31

Molecular snapshots◦ Transcriptomics◦ Proteomics◦ Metabolomics

Electronics Medical Records PACS images Life log Etc…

32

Gene1 Gene2 Gene3 Gene4Genome

Transcriptome mRNA1mRNA1mRNA1mRNA1mRNA1mRNA1mRNA1mRNA1mRNA1

mRNA1mRNA1mRNA2mRNA1mRNA1mRNA1mRNA1mRNA1mRNA1mRNA4

Proteome mRNA1Protein1 mRNA1mRNA1mRNA1mRNA1mRNA1mRNA1Protein4

mRNA1mRNA1mRNA1mRNA1mRNA1Protein2’

Transcriptional Regulation

Translational Regulation, Post-translational modification

mRNA1mRNA1mRNA1mRNA1mRNA1mRNA1mRNA1mRNA1Protein2

Metabolome

Metabolic Regulation

Metabolite-A Metabolite-B Metabolite-C

33

34

35

A large number of structured tables with text-based fields Different schema for different organizations, cf. HL7 Security and privacy is extremely critical

36

High resolution images with structured meta data

37

Body composition analyzerTask: Body balance inspection Application: Wellness & fitness programCost: $ 2K

SNP genotypingTask: Individual genetic variation detection in single nucleotide polymorphismApplication: Disease prognosisCost: $ 5K

Expression profiling chipTask: Individual genomic response inspection Application:Disease prognosis (e.g. caner)Cost: $ 10K

Diabetes phoneTask:Measurement of glucose levelApplication: Diabetes, dietary managementCost: $ 400

Genomic profile

Physiologicalsignal

CNV genotypingTask: Individual genetic variation detection at copy number variationApplication: Disease prognosisCost: $ 1M

Healthcare bidetTask: Examination of user’s secretionApplication: Patient monitoring systemCost: $ 400

Diabetes watchTask: Measurement of glucose levelApplication: Diabetes, dietary managementCost: $ 100

PCR-based genetic diagnosisTask: Detection of genetic disease and predisposition to a diseaseApplication: Disease prognosisCost: $ 10K

Life shirtTask: Monitor vital signals (respiration flow, heart rate, sweat) Application:Patient monitoring systemCost: $ 2K

38

Yahoo

Google

Overture => Yahoo

Amazon

Auction

Nexon

Blizzard

YouTube

Facebook

Much more …

39

Medical History Health Information

Comprehensive at-home DNA test

NavigenicsNavigenicsRevealing genetic predisposition

Managing health information

Healthcare software solutions

Making personal genetics

23 and me23 and me

Helix HealthHelix Health

deCODEmedeCODEme

Scanning Traits & Disease Tracing Ancestry Features

Microsoft Health VaultMicrosoft Health Vault

Patient ManagementPersonalized Prevention Family History

Complete Scan Cardio Scan Cancer Scan

Nursing Home Application

YOU40

Data Acquisition Data Mining Information DeliveryPersonal Genomes

- Cheap Sequencing- Accurate Annotation

Personal Life Logging- EMR- Food, Exercise, ..

ULDB- Cloud computing

ULDM- Extreme bias on SV ratio- Dynamic and noisy- Incremental

Biomedical Information Models- Ultra-scale- Multi-level- Multi-precision- Multi-modality

Mobile interactionRecommendationPoint-on-treatments…

Scientific Aspects

Industrial AspectsCreative Business Models

(1) Utilizing existing resources(2) Timely join to new emerging markets(3) Accumulating intellectual properties

41

42

top related