le song lesong@cs.cmu.edu joint work with mladen kolar and eric xing keller: estimating time...

Post on 14-Jan-2016

216 Views

Category:

Documents

0 Downloads

Preview:

Click to see full reader

TRANSCRIPT

Le Songlesong@cs.cmu.edu

Joint work with Mladen Kolar and Eric Xing

KELLER: Estimating Time Evolving KELLER: Estimating Time Evolving Interactions Between GenesInteractions Between Genes

2

Transient Biological Processes

33

PPI Network

4

Time-Varying Interactions

5

The Big-Picture Questions

What are the interactions?

What pathways are activeactive at a particular time point and location?

How will biological networks respond to stimuli (eg. heat shot)?

6

Regulation of cell response to stimuli is paramount, but we can usually only measure (or compute) steady-state interactions

Transcriptionalinteractions

Protein—proteininteractions

Biochemicalreactions

▲ Chromatin IP ▲Microarrays

▲ Protein coIP▲ Yeast two-hybrid

▲ Metabolic flux measurements

Transcriptionalinteractions

Protein—proteininteractions

Biochemicalreactions

▲ Chromatin IP ▲Microarrays

▲ Protein coIP▲ Yeast two-hybrid

▲ Metabolic flux measurements

7

t=1 2 3 T

Current PracticeStatic Networks

Microarray Time Series

Dynamic Bayesian Networks

8

Our GoalReverse engineer temporal/spatial-specific

“rewiring” gene networks

Time

t*

n=1

--- what are the difficulties?--- what are the difficulties?

9

Two Scenarios

Smoothly evolving networksSmoothly evolving networks Abruptly changing networksAbruptly changing networks

10

Scenario I (This paper)Kernel reweighted L1-regularized logistic regression

(KELLER)

Key Idea I: reweighting observations

Key Idea II: regularized neighborhood estimation

11

Key IdeaWeight temporally adjacent observations more than

distal observations

12

Key IdeaEstimate the neighborhood of each gene separately

via L1-regularized logistic regression

Kernel Reweighting

Log-likelihood

L1-regularization

13

Consistency

Theorem 1: Under certain verifiable conditions (omitted here for simplicity), KELLER recovers the true topology of the networks:

14

Synthetic data

DBN and static networks do not benefit from more observations

Number of Samples

15

Key idea: Temporally Smoothing

Tesla (Amr and Xing, PNAS 2009)

TESLA:

Senario II

16

Drosophila Life Cycle

Larva

Embryo

Pupa

Adult66 microarrays across

full life cycle

588 genes related to development

17

molecular function

biological process

cellular component

18

19

20

21

22

23

24

25

26

27

28

29

30

31

32

33

34

35

36

37

38

39

40

Network Size vs. Clustering Coefficient

mid-embryonic

mid-pupal

41

Network Size vs. Clustering Coefficient

mid-embryonic stagetight local clusters

mid-pupal stageloose local clusters

42

Interactivity of Gene Sets

27 genes based on ontology

43

Interactivity of Gene Sets

25 genes based on ontology

44

Transient Gene InteractionsTime

Gene Pairs

Active

Inactive

msn dock

sno Dl

45

Transcriptional Factor Cascade

Summary networks 36 transcription factorsNode size its total activity

46

TF Cascade – mid-embryonic stage

47

TF Cascade – mid-larva stage

48

TF Cascade – mid-pupal stage

49

TF Cascade – mid-adult stage

50

Transient Group Interactions

51

ConclusionKELLER for reverse engineering “rewiring” networks

Key advantages:Computationally efficient (scalable to 10Computationally efficient (scalable to 1044 genes) genes)

Global optimal solution is attainableGlobal optimal solution is attainable

Theoretical guaranteeTheoretical guarantee

Glimpse to temporal evolution of gene networks

Many interactions are rewiring and transient

Availability: http://www.sailing.cs.cmu.edu/

52

The EndThanks

Travel fellowship:Office of Science (BER), U.S. Department of Energy, Grant No. DE-FG02-06ED64270

Funding: Lane Fellowship,

Questions?

53

Interactivity of Gene Sets

30 genes based on ontology

54

Timing of Regulatory Program

Galactose

55

ChallengesVery small sample size

Experimental data are scarce and costly

Noisy measurement

More genes than microarraysComplexity regularization needed to avoid over-

fitting

Observations no longer iid since the networks are changing!

top related