network clustering experimental network mapping graph theory and terminology scale-free architecture...

35
Network Clustering Experimental network mapping Graph theory and terminology Scale-free architecture Integrating with gene essentiality Robustness Lecturer: Trey Ideker

Upload: jared-young

Post on 29-Jan-2016

216 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Network Clustering Experimental network mapping Graph theory and terminology Scale-free architecture Integrating with gene essentiality Robustness Lecturer:

Network Clustering

Experimental network mappingGraph theory and terminology

Scale-free architectureIntegrating with gene essentiality

Robustness

Lecturer: Trey Ideker

Page 2: Network Clustering Experimental network mapping Graph theory and terminology Scale-free architecture Integrating with gene essentiality Robustness Lecturer:

2

Measurements of molecular interactions

Protein-protein interactions• Yeast-two-hybrid• Kinase-substrate assays• Co-immunoprecipitation w/ mass spec

Protein-DNA interactions• ChIP-on-chip and ChIP-seq

Genetic interactions• Systematic Genetic Analysis

Page 3: Network Clustering Experimental network mapping Graph theory and terminology Scale-free architecture Integrating with gene essentiality Robustness Lecturer:

3

Yeast two-hybrid method

Fields and Song

Page 4: Network Clustering Experimental network mapping Graph theory and terminology Scale-free architecture Integrating with gene essentiality Robustness Lecturer:

4

Kinase-target interactions

Mike Snyder and colleagues

Page 5: Network Clustering Experimental network mapping Graph theory and terminology Scale-free architecture Integrating with gene essentiality Robustness Lecturer:

5

Protein interactions by protein immunoprecipitation followed by mass spectrometry

Gavin / Cellzome

TEV = Tobacco Etch Virus proteolytic site

CBP = Calmodulin binding peptide

Protein A = IgG binding from Staphylococcus

Page 6: Network Clustering Experimental network mapping Graph theory and terminology Scale-free architecture Integrating with gene essentiality Robustness Lecturer:

ChIP measurement of protein→DNA interactions

From Figure 1 of Simon et al. Cell 2001

Page 7: Network Clustering Experimental network mapping Graph theory and terminology Scale-free architecture Integrating with gene essentiality Robustness Lecturer:

Genetic interactions: synthetic lethals and suppressors

• Genetic Interactions:

• Widespread method used by geneticists to discover pathways in yeast, fly, and worm

• Implications for drug targeting and drug development for human disease

• Thousands are now reported in literature and systematic studies

• As with other types, the number of known genetic interactions is exponentially increasing…

Adapted from Tong et al., Science 2001

Page 8: Network Clustering Experimental network mapping Graph theory and terminology Scale-free architecture Integrating with gene essentiality Robustness Lecturer:

8

Most recorded genetic interactions are synthetic lethal relationships

Adapted from Hartman, Garvik, and Hartwell, Science 2001

A B A ΔB ΔA B ΔA ΔB

Page 9: Network Clustering Experimental network mapping Graph theory and terminology Scale-free architecture Integrating with gene essentiality Robustness Lecturer:

α

ω

A

B

Parallel Effects (Redundant or Additive)

Sequential Effects (Additive)

Single A or B mutations typically abolish their biochemical activities

Single A or B mutations typically reduce their biochemical activities

Interpretation of genetic interactions (Guarente T.I.G. 1990)

α

ω

A B

GOAL: Identify downstream physical

pathways

Page 10: Network Clustering Experimental network mapping Graph theory and terminology Scale-free architecture Integrating with gene essentiality Robustness Lecturer:

Yeast protein-protein interaction network

What are its network properties?

Page 11: Network Clustering Experimental network mapping Graph theory and terminology Scale-free architecture Integrating with gene essentiality Robustness Lecturer:

Graphs

• Graph G=(V,E) is a set of vertices V and edges E

• A subgraph G’ of G is induced by some V’ V and E’ E

• Graph properties:– Node degree– Directed vs. undirected– Loops– Paths– Cyclic vs. acyclic– Simple vs. multigraph– Complete– Connected– Bipartite

Page 12: Network Clustering Experimental network mapping Graph theory and terminology Scale-free architecture Integrating with gene essentiality Robustness Lecturer:

Paths

A path is a sequence {x1, x2,…, xn} such that (x1,x2), (x2,x3), …, (xn-1,xn) are edges of the graph.

A closed path xn=x1 on a graph is called a graph cycle or circuit.

Page 13: Network Clustering Experimental network mapping Graph theory and terminology Scale-free architecture Integrating with gene essentiality Robustness Lecturer:

Network measures

• Degree ki

The number of edges involving node i

• Degree distribution P(k)The probability (frequency) of nodes of degree k

• Mean path lengthThe avg. shortest path between all node pairs

• Network Diameter“The longest shortest path”

How do the above definitions differ between undirected and directed networks?

Page 14: Network Clustering Experimental network mapping Graph theory and terminology Scale-free architecture Integrating with gene essentiality Robustness Lecturer:

WHAT DOES SCALE FREE

REALLY MEAN, ANYWAY?

P(k) is probability of each degree k

For scale free: P(k) ~ k

What happens for

small vs. large ?

Page 15: Network Clustering Experimental network mapping Graph theory and terminology Scale-free architecture Integrating with gene essentiality Robustness Lecturer:

Random vs Preferential Attachment

• Erdos-RenyiStart with N nodes and connect each pair with equal probability p

• Scale-freeAdd nodes incrementally. New nodes connect to each existing node I with probability proportional to its degree:

J

J

I

k

k

Scale-free networks have small avg. path lengths ~ log (log N)– this is called the ‘small world’ effect

Page 16: Network Clustering Experimental network mapping Graph theory and terminology Scale-free architecture Integrating with gene essentiality Robustness Lecturer:

Clustering coefficient

12

2

kk

nkn

C III

The combination “k choose 2”

# edges between node I’s neighbors

# of neighbors of I

The density of the network surrounding node I, characterized as the number of triangles through I.Related to network modularity

C(k) = avg. clustering coefficient for nodes of degree k

Page 17: Network Clustering Experimental network mapping Graph theory and terminology Scale-free architecture Integrating with gene essentiality Robustness Lecturer:

Directionality and Degree

What is the clustering coefficient of A in either case?

Page 18: Network Clustering Experimental network mapping Graph theory and terminology Scale-free architecture Integrating with gene essentiality Robustness Lecturer:

Integrating networks with functional gene information:Gene replacement for yeast & other model species

Using HR-based gene replacement, genes can be replaced with drug resistance cassette, tagged with GFP, epitope tagged, etc.

Page 19: Network Clustering Experimental network mapping Graph theory and terminology Scale-free architecture Integrating with gene essentiality Robustness Lecturer:

Systematic phenotyping

yfg1Δ yfg2Δ yfg3Δ

CTAACTC TCGCGCA TCATAATBarcode

(UPTAG):

DeletionStrain:

Growth 6hrsin minimal media

(how many doublings?)

Rich media

Harvest and label genomic DNA

Page 20: Network Clustering Experimental network mapping Graph theory and terminology Scale-free architecture Integrating with gene essentiality Robustness Lecturer:

Systematic phenotyping with a barcode array

Ron Davis and friends…

• These oligo barcodes are also spotted on a DNA microarray

• Growth time in minimal media:

– Red: 0 hours– Green: 6 hours

Page 21: Network Clustering Experimental network mapping Graph theory and terminology Scale-free architecture Integrating with gene essentiality Robustness Lecturer:
Page 22: Network Clustering Experimental network mapping Graph theory and terminology Scale-free architecture Integrating with gene essentiality Robustness Lecturer:

The amazing result from that paper

% E

ssen

tial

P(k

)

k k

Page 23: Network Clustering Experimental network mapping Graph theory and terminology Scale-free architecture Integrating with gene essentiality Robustness Lecturer:

Robustness

• Complex systems, from the cell to the Internet, can be amazingly resilient to component failure

• Network topology plays an important role in this robustness

• Even if ~80% of nodes fail, the remaining ~20% still maintain network connectivity

• This also leads to attack vulnerability if hubs are selectively targeted

• In yeast, only ~20% of proteins are lethal when deleted, and are 5 times more likely to have degree k>15 than k<5.

Page 24: Network Clustering Experimental network mapping Graph theory and terminology Scale-free architecture Integrating with gene essentiality Robustness Lecturer:

Network Motifs (Milo, Alon et al.)

• Motifs are “patterns of interconnections occurring in complex networks.”

• That is, connected subgraphs of a particular isomorphic topology

• The approach queries the network for small motifs (e.g., of < 5 nodes) that occur much more frequently than would be expected in random networks

• Significant motifs have been found in a variety of biological networks and, for instance, correspond to feed-forward and feed-back loops that are well known in circuit design and other engineering fields.

• Pioneered by Uri Alon and colleagues

Page 25: Network Clustering Experimental network mapping Graph theory and terminology Scale-free architecture Integrating with gene essentiality Robustness Lecturer:

Motif searches in 3 different contexts

Page 26: Network Clustering Experimental network mapping Graph theory and terminology Scale-free architecture Integrating with gene essentiality Robustness Lecturer:

All 3-node directed subgraphs

What is the frequency of each in the network?

Page 27: Network Clustering Experimental network mapping Graph theory and terminology Scale-free architecture Integrating with gene essentiality Robustness Lecturer:

Outline of the Approach

• Search network to identify all possible n-node connected subgraphs (here n=3 or 4)

• Get # occurrences of each subgraph type

• The significance for each type is determined using permutation testing, in which the above process is repeated for many randomized networks (preserving node degrees– why?)

• Use random distributions to compute a p-value for each subgraph type. The “network motifs” are subgraphs with p < 0.001

Page 28: Network Clustering Experimental network mapping Graph theory and terminology Scale-free architecture Integrating with gene essentiality Robustness Lecturer:

Schematic view of network motif detection

Networks are randomized preserving node degree

Page 29: Network Clustering Experimental network mapping Graph theory and terminology Scale-free architecture Integrating with gene essentiality Robustness Lecturer:

Concentration of feedforward motif:

Mean+/-SD of 400 subnetworks

(Num. appearances of motif divided byall 3 node connected subgraphs)

Page 30: Network Clustering Experimental network mapping Graph theory and terminology Scale-free architecture Integrating with gene essentiality Robustness Lecturer:

Transcriptional network results

Page 31: Network Clustering Experimental network mapping Graph theory and terminology Scale-free architecture Integrating with gene essentiality Robustness Lecturer:

Neural networks

Page 32: Network Clustering Experimental network mapping Graph theory and terminology Scale-free architecture Integrating with gene essentiality Robustness Lecturer:

Food webs

Page 33: Network Clustering Experimental network mapping Graph theory and terminology Scale-free architecture Integrating with gene essentiality Robustness Lecturer:

World Wide Web

Page 34: Network Clustering Experimental network mapping Graph theory and terminology Scale-free architecture Integrating with gene essentiality Robustness Lecturer:

Electronic circuits

Page 35: Network Clustering Experimental network mapping Graph theory and terminology Scale-free architecture Integrating with gene essentiality Robustness Lecturer:

Interesting questions

• Which networks have motifs in common?• Which networks have completely distinct motifs versus

the others?• Does this tell us anything about the design constraints

on each network?• E.g., the feedforward loop may function to activate

output only if the input signal is persistent (i.e., reject noisy or transient signals) and to allow rapid deactivation when the input turns off

• E.g., food webs evolve to allow flow of energy from top to bottom (?!**!???), whereas transcriptional networks evolve to process information