1 cis 4930/6930 – recent advances in bioinformatics spring 2014 network construction from rnai...

Post on 16-Dec-2015

221 Views

Category:

Documents

4 Downloads

Preview:

Click to see full reader

TRANSCRIPT

1

CIS 4930/6930 – Recent Advances in Bioinformatics

Spring 2014

Network construction from RNAi data

Tamer Kahveci

Signaling Networks

2

MAPK network

Signal reachability

3

Receptor Reporter

Luciferase

Signaling and RNA Interference

4

Receptor Reporter

Luciferase

X Not critical

X Critical

Signaling Network Reconstruction from RNAi data

5

Receptor Reporter

Not criticalCritical

RNAi data and Reference Network

6

Receptor Reporter

Not criticalCritical

Reference network

Inse

rt

Delete

Not consistent !Consistent

!

Overview

7

GR = (VR, ER)

Reference network

Constraints

1 10 0 0

GT = (VT, ET)

Target network

1 0

SiNeC (Signal Network Constructor)

S-SiNeC (Scalable Signal Network Constructor)

Giv

en Find

Goal: Minimize the number of edit

operations to make the reference

consistent.

NP-Complete !

SiNeC algorithm

Three steps

1. Order the critical genes left to right based on the topology of GR. [Sloan, 1986]

– v1, v2, …, vc

2. Edge deletion phase

3. Edge insertion phase

8

Step 1: Order critical genes

9

Receptor Reporter3

1

2

Prioritize based on distance to the reporter + degree

Step 2: Edge deletion

10

Purpose: Eliminate detours around critical genes

Receptor Reportervi vkvj

• Find all (undesirable) paths between non-consecutive critical genes.

• i.e., Paths which go through only noncritical genes• Edges are weighted with the number of such paths they

belong to.• Remove greedily starting from the largest weight until al

paths are disrupted.

Bypassed !!!

Step 3: Edge insertion

11

Purpose: Make sure that critical are connected + noncritical genes are consistent

Receptor Reportervi-1 vi+1vi

Insert an edge from vi-1 to vi if 1. There is no path from vi-1 to vi.2. There is a noncritical gene on all paths from vi-1

to vi.

Overview

12

GR = (VR, ER)

Reference network

Constraints

1 10 0 0

GT = (VT, ET)

Target network

1 0

SiNeC (Signal Network Constructor)

S-SiNeC (Scalable Signal Network Constructor)

Giv

en Find

Finding all the paths can be to

o time

consuming for la

rge networks

S-SiNeC algorithm

13

Edge insertion0 0 0 None

0 0 1 None

0 1 0 None

0 1 1 A1

1 0 0 A2 + A3 + A4

1 0 1 A2 + A4

1 1 0 A3 + A4

1 1 1 A4

Critic

alLe

ft rea

chab

le

Rig

ht rea

chab

le

Edge deletion

Reference network

vs vtvi

S-SiNeC: Edge insertion (A1)

14

Reference network

vs vtvi

L R

Purpose: Make sure that noncritical genes are consistent

S-SiNeC: Edge insertion (A2)

15

Reference network

vs vtvi

L R

Purpose: Make sure that critical genes are left reachable

S-SiNeC: Edge insertion (A3)

16

Reference network

vs vtvi

L R

Purpose: Make sure that critical genes are right reachable

S-SiNeC: Edge insertion

17

L/R e1 e2 e3

1 X

2 X X

3 X X

4 X X

5 X X

6 X X

7 X

8 X

S-SiNeC: Edge deletion (A4)

18

Reference network

vs vtvi

L R

Purpose: Make sure that no detours exist around critical genes

Solve minimum cut between L &

R

Dataset

19

• Reference networks are obtained by random edge shuffling at 5% to 40% mutation rates.

• 200 references per target network & per mutation rate.

Average distance to the true network

20

Accuracy based on edge class

21

vs vt

Hot

Cold

Running time results

22SiNeC > 1 hour per reference network.

Success rate on constraints

23

24

Accuracy

25

Functional Enrichment of the Pathway

Last Remarks

• Constructing very large signaling networks from RNAi data is possible in practical running time.

• Both SiNeC and S-SiNeC are robust to errors in reference network.

• We recommend

– S-SiNeC for very large OR dense networks.

– SiNeC otherwise.26

Acknowledgements

27

CCF - 0829867 IIS - 0845439260429

top related