inferring subnetworks from perturbed expression profiles d. pe’er a. regev g. elidan n. friedman

23
. Inferring Subnetworks from Perturbed Expression Profiles D. Pe’er A. Regev G. Elidan N. Friedman

Upload: eric-hubbard

Post on 17-Dec-2015

218 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: Inferring Subnetworks from Perturbed Expression Profiles D. Pe’er A. Regev G. Elidan N. Friedman

.

Inferring Subnetworks from Perturbed Expression

Profiles

D. Pe’er

A. Regev G. Elidan N. Friedman

Page 2: Inferring Subnetworks from Perturbed Expression Profiles D. Pe’er A. Regev G. Elidan N. Friedman

Expression Profiling An Expression Profile is

a simultaneous measurement of the level of all mRNAs in a cell population

Experimental design: Measure profiles of mutated or treated cultures

Goal: infer regulatory and molecular interactions

Wild-Type Mutant

Profile

Compare

Page 3: Inferring Subnetworks from Perturbed Expression Profiles D. Pe’er A. Regev G. Elidan N. Friedman

Common Approaches

Comparative Analysis (Holstage et al. 1998) Clustering (Hughes et al. 2000) Limitations:

Cannot distinguish between direct and indirect interactions

Limited to pair-wise relations Can not infer a finer context

Page 4: Inferring Subnetworks from Perturbed Expression Profiles D. Pe’er A. Regev G. Elidan N. Friedman

Bayesian Network FrameworkFriedman, Linial, Nachman ,Pe’er (JCB 2000)

Probabilistic: Characterize statistical relationships between expression patterns of different genes

Multi-variable interactions (beyond pair-wise): Identify intermediate interactions Handle combinatorial regulation by several

gene-products Statistical confidence: Asses the statistical

significance of interactions found

Page 5: Inferring Subnetworks from Perturbed Expression Profiles D. Pe’er A. Regev G. Elidan N. Friedman

Our Contributions

Modeling of mutations and treatments into the Bayesian network framework

Novel data discretization based on guided k-means clustering

New features: Mediator and Regulator

Automatic reconstruction of statistically significant sub-networks.

- 0 +

+

Page 6: Inferring Subnetworks from Perturbed Expression Profiles D. Pe’er A. Regev G. Elidan N. Friedman

Modeling Gene Expression

Gene 1

Expression level of each gene = Random variable

Gene 3Gene 4

Gene 5

Gene 2

Gene interaction =Probabilistic dependency

Directed Acyclic graphModels dependency structure of distribution

0.9 0.1

1

2

1

0.2 0.8

0.6 0.4

0.9 0.1

21

2

2

1

21 P(3 | 1,2)

Each node has a probabilistic functionConditioned on its parents in the graph

Activator Inhibitor

Graph structure + local probabilityDefine a unique multivariate distribution

Page 7: Inferring Subnetworks from Perturbed Expression Profiles D. Pe’er A. Regev G. Elidan N. Friedman

Mutational AssayWild-Type

Measurements

0.9 0.1

pgk10.1 0.9

pgk1

P(rap1|pgk1)

Equivalence: Two models explain

correlation between RAP1 & PGK1

RAP1 PGK1

RAP1 PGK1

Mutated pgk1Measurements

0.5 0.5

pgk10.5 0.5

pgk1

P(rap1|pgk1)

Note causalityinto mutated variable

Page 8: Inferring Subnetworks from Perturbed Expression Profiles D. Pe’er A. Regev G. Elidan N. Friedman

Compendium Dataset (Hughes et al., 2000)

300 samples of yeast deletion mutants and other treatments Deleted genes are from various functional families

A rich variety of profiles, but… There is only one sample from each mutation

cell growth, division, DNA

synthesis

cell rescue, cell defense, aging

cellular biogenesis

cellular organization

unclear classification

energy

intracellular transport

ionic homeostasis

metabolism

protein destination

protein synthesis

signal transduction

transcription

transport facilitation

unclassified proteins

Page 9: Inferring Subnetworks from Perturbed Expression Profiles D. Pe’er A. Regev G. Elidan N. Friedman

Guided K-meansDiscretization

Guided K-meansDiscretization

Expressiondata

MarkovMarkov SeparatorSeparatorEdgeEdge RegulatorRegulator

Bayesian Network Learning Algorithm

+ Bootstrap

Bayesian Network Learning Algorithm

+ Bootstrap

Reconstruct SubNetworksReconstruct SubNetworks

Visualize UsingPathway ExplorerVisualize Using

Pathway Explorer

Preprocess

Learn model

Featureextraction

Featureassembly

Visualization

E

R

B

A

CS

Page 10: Inferring Subnetworks from Perturbed Expression Profiles D. Pe’er A. Regev G. Elidan N. Friedman

Resulting PDAG

Page 11: Inferring Subnetworks from Perturbed Expression Profiles D. Pe’er A. Regev G. Elidan N. Friedman

Confidence Estimates: Bootstrap

D resample

resample

resample

D1

D2

Dm

...

Learn

Learn

Learn

E

R

B

A

C

E

R

B

A

C

E

R

B

A

C

m

iiGf

mfC

1

11

)(Estimate:

Bootstrap approach[FGW, UAI99]

Page 12: Inferring Subnetworks from Perturbed Expression Profiles D. Pe’er A. Regev G. Elidan N. Friedman

Estimating Confidence

Common Practice: Pick a single top scoring modelProblem: Insufficient information!!In gene expression data: only few hundred experiments => many high scoring models

Answer based on one model uselessSolution: Search for features common to

many likely models!Sample models from posterior distribution

P(Model|Data)Confidence of feature:

G

DGPGfDfP )|()()|(Feature of G,e.g., XY

Page 13: Inferring Subnetworks from Perturbed Expression Profiles D. Pe’er A. Regev G. Elidan N. Friedman

Guided K-meansDiscretization

Guided K-meansDiscretization

Expressiondata

MarkovMarkov SeparatorSeparatorEdgeEdge RegulatorRegulator

Bayesian Network Learning Algorithm

+ Bootstrap

Bayesian Network Learning Algorithm

+ Bootstrap

Reconstruct SubNetworksReconstruct SubNetworks

Visualize UsingPathway ExplorerVisualize Using

Pathway Explorer

Preprocess

Learn model

Featureextraction

Featureassembly

Visualization

Page 14: Inferring Subnetworks from Perturbed Expression Profiles D. Pe’er A. Regev G. Elidan N. Friedman

Markov Relations

Question: Do X and Y directly interact? Parent-child (one gene regulating the other)

Hidden Parent (two genes co-regulated by a hidden factor)

(0.91,0.67) SST2 STE6 SST2 STE6

Mating pathway regulator

Exporter of mating factor

ARG5 ARG3(0.84,0.79)

ARG3 ARG5

GCN4

Arginine Biosynthesis

Transcription factor

Page 15: Inferring Subnetworks from Perturbed Expression Profiles D. Pe’er A. Regev G. Elidan N. Friedman

Low Correlation Relations

Previously unknown link strongly supported by evidence in the literature

High confidence, Low correlation Processes occur under specific conditions Captured by our context specific score

ESC4 KU70(0.91, 0.16)

DNA ds break repair

Chromatin silencing

Page 16: Inferring Subnetworks from Perturbed Expression Profiles D. Pe’er A. Regev G. Elidan N. Friedman

Separators

Question: Given that X andY are indirectly dependant, who mediates this dependence?

Separator relation: X affects Z who in turn affects Z Z regulates both X and Y

AGA1 FUS1

KAR4

Mating transcriptional

regulator of nuclear fusion

Cell fusion Cell fusion

Page 17: Inferring Subnetworks from Perturbed Expression Profiles D. Pe’er A. Regev G. Elidan N. Friedman

Separators: Intra-cluster Context

CRH1 YPS3

SLT2

Cell wall protein

MAPK of cell wall

integrity pathway

Cell wall protein

YPS1Cell wall protein

SLR3Protein of unknown function

++

All gene pairs have high correlation, clustering groups them together

assigned putative function to SLR3 - cell wall protein We can assign regulatory role to SLT2 Many other signaling and regulatory proteins were identified

as direct and indirect separators

Page 18: Inferring Subnetworks from Perturbed Expression Profiles D. Pe’er A. Regev G. Elidan N. Friedman

Guided K-meansDiscretization

Guided K-meansDiscretization

Expressiondata

MarkovMarkov SeparatorSeparatorEdgeEdge RegulatorRegulator

Bayesian Network Learning Algorithm

+ Bootstrap

Bayesian Network Learning Algorithm

+ Bootstrap

Reconstruct SubNetworksReconstruct SubNetworks

Visualize UsingPathway ExplorerVisualize Using

Pathway Explorer

Preprocess

Learn model

Featureextraction

Featureassembly

Visualization

Page 19: Inferring Subnetworks from Perturbed Expression Profiles D. Pe’er A. Regev G. Elidan N. Friedman

Sub-Networks

Reconstruct a Conserved sub-network Provides a more global picture Allows to include features with lower-confidence Preserved in most networks with high posterior Probably reflects a real biological process

Automatic algorithm Score: high concentration of pairwise features Greedy search for high scoring subgraphs

Page 20: Inferring Subnetworks from Perturbed Expression Profiles D. Pe’er A. Regev G. Elidan N. Friedman

Increased Confidence(simulated data)

Percent of

False positives

0

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

1

0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1

Confidence

Entire network

Subnetwork

Page 21: Inferring Subnetworks from Perturbed Expression Profiles D. Pe’er A. Regev G. Elidan N. Friedman

Guided K-meansDiscretization

Guided K-meansDiscretization

Expressiondata

MarkovMarkov SeparatorSeparatorEdgeEdge RegulatorRegulator

Bayesian Network Learning Algorithm

+ Bootstrap

Bayesian Network Learning Algorithm

+ Bootstrap

Reconstruct SubNetworksReconstruct SubNetworks

Visualize UsingPathway ExplorerVisualize Using

Pathway Explorer

Preprocess

Learn model

Featureextraction

Featureassembly

Visualization

Page 22: Inferring Subnetworks from Perturbed Expression Profiles D. Pe’er A. Regev G. Elidan N. Friedman

Rosetta networks in Pathway Explorer

http://www.cs.huji.ac.il/labs/compbio/ismb01

Page 23: Inferring Subnetworks from Perturbed Expression Profiles D. Pe’er A. Regev G. Elidan N. Friedman

Summary

Primary contribution: automated methodology for finding patterns of interactions among genes

Clear semantics Principled handing of mutations and interventions

Built in handling of statistical significance Feature confidence Extracts significant sub-networks

Differs from clustering Inter-cluster relations Finer intra-cluster structure

Provides biologist with promising hypothesis