inferring gene regulatory networks from transcriptomic profiles dirk husmeier biomathematics &...

112
Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

Upload: abraham-baker

Post on 13-Jan-2016

216 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

Inferring gene regulatory networks from transcriptomic profiles

Dirk Husmeier

Biomathematics & Statistics Scotland

Page 2: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

Overview

• Introduction

• Application to synthetic biology

• Lessons from DREAM

Page 3: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

Network reconstruction from postgenomic data

Page 4: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

Accuracy

Computational complexity

Methods based on correlation and

mutual information

Conditional independence graphs

Mechanistic models

Bayesian networks

Page 5: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

Accuracy

Computational complexity

Methods based on correlation and

mutual information

Conditional independence graphs

Mechanistic models

Bayesian networks

Page 6: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

direct

interaction

common

regulator

indirect

interaction

co-regulation

Pairwise associations do not take the context of the systeminto consideration

Shortcomings

Page 7: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

Accuracy

Computational complexity

Methods based on correlation and mutual

information

Conditional independence graphs

Mechanistic models

Bayesian networks

Page 8: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

Conditional Independence Graphs (CIGs)

jjii

ijij

)()(

)(111

1

2

2

1

1

Direct interaction

Partial correlation, i.e. correlation

conditional on all other domain variables

Corr(X1,X2|X3,…,Xn)

Problem: #observations < #variables

Covariance matrix is singular

strong partial

correlation π12

Inverse of the covariance

matrix

Page 9: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

Accuracy

Computational complexity

Methods based on correlation and mutual

information

Conditional independence graphs

Mechanistic models

Bayesian networks

Page 10: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

Model Parameters q

Probability theory Likelihood

Page 11: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

1) Practical problem: numerical optimization

q

2) Conceptual problem: overfitting

ML estimate increases on increasing the network complexity

Page 12: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

Overfitting problem

True pathway

Poorer fit to the data

Poorer fit to the data

Equal or better fit to the data

Page 13: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

Regularization

E.g.: Bayesian information criterion

Maximum likelihood parameters

Number of parameters

Number of data points

Data misfit term Regularization term

Page 14: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

Complexity Complexity

Likelihood BIC

Page 15: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

Model selection: find the best pathway

Select the model with the highest posterior probability:

This requires an integration over the whole parameter space:

Page 16: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

Problem: huge computational costs

q

Page 17: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

Accuracy

Computational complexity

Methods based on correlation and mutual

information

Conditional independence graphs

Mechanistic models

Bayesian networks

Page 18: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

Friedman et al. (2000), J. Comp. Biol. 7, 601-620

Marriage between

graph theory

and

probability theory

Page 19: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

Bayes net

ODE model

Page 20: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

Model Parameters q

Bayesian networks: integral analytically tractable!

Page 21: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

UAI 1994

Page 22: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

[A]= w1[P1] + w2[P2] + w3[P3] +

w4[P4] + noise

Linearity assumption

A

P1

P2

P4

P3

w1

w4

w2

w3

Page 23: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

t1 t2 t3 t4 t5 t6 t7 t8 t9 t10

X(1) X1,1 X1,2 X1,3 X1,4 X1,5 X1,6 X1,7 X1,8 X1,9 X1,10

X(2) X2,1 X2,2 X2,3 X2,4 X2,5 X2,6 X2,7 X2,8 X2,9 X2,10

X(3) X3,1 X3,2 X3,3 X3,4 X3,5 X3,6 X3,7 X3,8 X3,9 X3,10

X(4) X4,1 X4,2 X4,3 X4,4 X4,5 X4,6 X4,7 X4,8 X4,9 X4,10

Homogeneity assumption

Page 24: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

Accuracy

Computational complexity

Methods based on correlation and mutual

information

Conditional independence graphs

Mechanistic models

Bayesian networks

Page 25: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

Example: 4 genes, 10 time points

t1 t2 t3 t4 t5 t6 t7 t8 t9 t10

X(1) X1,1 X1,2 X1,3 X1,4 X1,5 X1,6 X1,7 X1,8 X1,9 X1,10

X(2) X2,1 X2,2 X2,3 X2,4 X2,5 X2,6 X2,7 X2,8 X2,9 X2,10

X(3) X3,1 X3,2 X3,3 X3,4 X3,5 X3,6 X3,7 X3,8 X3,9 X3,10

X(4) X4,1 X4,2 X4,3 X4,4 X4,5 X4,6 X4,7 X4,8 X4,9 X4,10

Page 26: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

t1 t2 t3 t4 t5 t6 t7 t8 t9 t10

X(1) X1,1 X1,2 X1,3 X1,4 X1,5 X1,6 X1,7 X1,8 X1,9 X1,10

X(2) X2,1 X2,2 X2,3 X2,4 X2,5 X2,6 X2,7 X2,8 X2,9 X2,10

X(3) X3,1 X3,2 X3,3 X3,4 X3,5 X3,6 X3,7 X3,8 X3,9 X3,10

X(4) X4,1 X4,2 X4,3 X4,4 X4,5 X4,6 X4,7 X4,8 X4,9 X4,10

Standard dynamic Bayesian network: homogeneous model

Page 27: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

Limitations of the homogeneity assumption

Page 28: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

Our new model: heterogeneous dynamic Bayesian network. Here: 2 components

t1 t2 t3 t4 t5 t6 t7 t8 t9 t10

X(1) X1,1 X1,2 X1,3 X1,4 X1,5 X1,6 X1,7 X1,8 X1,9 X1,10

X(2) X2,1 X2,2 X2,3 X2,4 X2,5 X2,6 X2,7 X2,8 X2,9 X2,10

X(3) X3,1 X3,2 X3,3 X3,4 X3,5 X3,6 X3,7 X3,8 X3,9 X3,10

X(4) X4,1 X4,2 X4,3 X4,4 X4,5 X4,6 X4,7 X4,8 X4,9 X4,10

Page 29: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

t1 t2 t3 t4 t5 t6 t7 t8 t9 t10

X(1) X1,1 X1,2 X1,3 X1,4 X1,5 X1,6 X1,7 X1,8 X1,9 X1,10

X(2) X2,1 X2,2 X2,3 X2,4 X2,5 X2,6 X2,7 X2,8 X2,9 X2,10

X(3) X3,1 X3,2 X3,3 X3,4 X3,5 X3,6 X3,7 X3,8 X3,9 X3,10

X(4) X4,1 X4,2 X4,3 X4,4 X4,5 X4,6 X4,7 X4,8 X4,9 X4,10

Our new model: heterogeneous dynamic Bayesian network. Here: 3 components

Page 30: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

Learning with MCMC

q

k

h

Number of components (here: 3)

Allocation vector

Page 31: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

Learning with MCMC

q

k

h

Number of components (here: 3)

Allocation vector

Page 32: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

Non-homogeneous model

Non-linear model

Page 33: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

[A]= w1[P1] + w2[P2] + w3[P3] +

w4[P4] + noise

BGe: Linear model

A

P1

P2

P4

P3

w1

w4

w2

w3

Page 34: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

Can we get an approximate nonlinear model without data discretization?

y

x

Page 35: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

Can we get an approximate nonlinear model without data discretization?

Idea: piecewise linear model

y

x

Page 36: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

t1 t2 t3 t4 t5 t6 t7 t8 t9 t10

X(1) X1,1 X1,2 X1,3 X1,4 X1,5 X1,6 X1,7 X1,8 X1,9 X1,10

X(2) X2,1 X2,2 X2,3 X2,4 X2,5 X2,6 X2,7 X2,8 X2,9 X2,10

X(3) X3,1 X3,2 X3,3 X3,4 X3,5 X3,6 X3,7 X3,8 X3,9 X3,10

X(4) X4,1 X4,2 X4,3 X4,4 X4,5 X4,6 X4,7 X4,8 X4,9 X4,10

Inhomogeneous dynamic Bayesian network with common changepoints

Page 37: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

Inhomogenous dynamic Bayesian network with node-specific changepoints

t1 t2 t3 t4 t5 t6 t7 t8 t9 t10

X(1) X1,1 X1,2 X1,3 X1,4 X1,5 X1,6 X1,7 X1,8 X1,9 X1,10

X(2) X2,1 X2,2 X2,3 X2,4 X2,5 X2,6 X2,7 X2,8 X2,9 X2,10

X(3) X3,1 X3,2 X3,3 X3,4 X3,5 X3,6 X3,7 X3,8 X3,9 X3,10

X(4) X4,1 X4,2 X4,3 X4,4 X4,5 X4,6 X4,7 X4,8 X4,9 X4,10

Page 38: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

NIPS 2009

Page 39: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

Non-stationarity in the regulatory process

Page 40: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

Non-stationarity in the network structure

Page 41: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

Flexible network structure .

Page 42: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

Flexible network structure with regularization

Page 43: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

Flexible network structure with regularization

Page 44: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

Flexible network structure with regularization

Page 45: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

ICML 2010

Page 46: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

Morphogenesis in Drosophila melanogaster

• Gene expression measurements over 66 time steps of 4028 genes (Arbeitman et al., Science, 2002).

• Selection of 11 genes involved in muscle development.

Zhao et al. (2006),

Bioinformatics 22

Page 47: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

Transition probabilities: flexible structure with regularization

Morphogenetic transitions: Embryo larva larva pupa pupa adult

Page 48: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland
Page 49: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland
Page 50: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

Overview

• Introduction

• Application to synthetic biology

• Lessons from DREAM

Page 51: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland
Page 52: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland
Page 53: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland
Page 54: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

Can we learn the switch Galactose Glucose?

Can we learn the network structure?

Page 55: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

NIPS 2010

Page 56: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

Node 1

Node i

Node p

Hierarchical Bayesian model

Page 57: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland
Page 58: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland
Page 59: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland
Page 60: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

Node 1

Node i

Node p

Hierarchical Bayesian model

Page 61: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

Exponential versus binomial prior distribution

Exploration of various information sharing options

Page 62: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

Task 1:Changepoint detection

Switch of the carbon source:Galactose Glucose

Page 63: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

Galactose Glucose

Page 64: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

Task 2:Network reconstruction

PrecisionProportion of identified interactions

that are correct

Recall Proportion of true interactions that

we successfully recovered

Page 65: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

BANJO: Conventional homogeneous DBN TSNI: Method based on differential equations

Inference: optimization, “best” network

Page 66: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland
Page 67: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

Sample of high-scoring networks

Page 68: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

Sample of high-scoring networks

Feature extraction, e.g. marginal posterior probabilities of the edges

Page 69: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

Galactose

Page 70: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

Glucose

Page 71: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland
Page 72: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

Prior Coupling Average AUC

None None 0.70

Exponential Hard 0.77

Binomial Hard 0.75

Binomial Soft 0.75

Average performance over both phases:Galactose and glucose

Page 73: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

How are we getting from here …

Page 74: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

… to there ?!

Page 75: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

Overview

• Introduction

• Application to synthetic biology

• Lessons from DREAM

Page 76: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

DREAM:Dialogue for Reverse Engineering

Assessments and Methods

International network reconstruction competition: June-Sept 2010

Network # Transcription Factors

# Genes # Chips

Network 1 (in silico)

195 1643 805

Network 2 99 2810 160

Network 3 334 4511 805

Network 4 333 5950 536

Page 77: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

Marco GrzegorczykUniversity of Dortmund

Germany

Frank Dondelinger BioSS / University of Edinburgh

United Kingdom

Sophie LèbreUniversité de Strasbourg

France

Our team

Andrej AderholdBioSS / University of St Andrews

United Kingdom

Page 78: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

Our model:Developed for time series

Data:Different experimental conditions, perturbations (e.g. ligand injection), interventions (e.g. gene knock-out,

overexpression), time points

How do we get an ordering of the genes?

Page 79: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

PCA

Page 80: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

SOM

Page 81: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

No time series Use 1-dim SOM to get a chip order

Page 82: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

Ordering of chips changepoint model

Page 83: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

Problems with MCMC convergence

Network # Transcription Factors

# Genes # Chips

Network 1 (in silico)

195 1643 805

Network 2 99 2810 160

Network 3 334 4511 805

Network 4 333 5950 536

Page 84: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

Problems with MCMC convergence

Network # Transcription Factors

# Genes # Chips

Network 1 (in silico)

195 1643 805

Network 2 99 2810 160

Network 3 334 4511 805

Network 4 333 5950 536

PNAS 2009

Page 85: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

[A]= w1[P1] + w2[P2] + w3[P3] +

w4[P4] + noise

Linear model

A

P1

P2

P4

P3

w1

w4

w2

w3

Page 86: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

L1 regularized linear regression

Page 87: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

Problems with MCMC convergence

Network # Transcription Factors

# Genes # Chips

Network 1 (in silico)

195 1643 805

Network 2 99 2810 160

Network 3 334 4511 805

Network 4 333 5950 536

Page 88: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

Problems with MCMC convergence

Network # Transcription Factors

# Genes # Chips

Network 1 (in silico)

195 1643 805

Network 2 99 2810 160

Network 3 334 4511 805

Network 4 333 5950 536

Page 89: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

Assessment

Participants Had to submit rankings of all interactions

OrganisersComputed areas under 1)Precision-recall curves

2)ROC curves (plotting sensitivity=recall against specificity)

Page 90: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

Uncertainty about the best network structure

Limited number of experimental replications, high noise

Page 91: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

Sample of high-scoring networks

Page 92: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

Sample of high-scoring networks

Feature extraction, e.g. marginal posterior probabilities of the edges

Page 93: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

Sample of high-scoring networks

Feature extraction, e.g. marginal posterior probabilities of the edges

High-confident edge

High-confident non-edge

Uncertainty about edges

Page 94: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

ROC curves

True positive rate

Sensitivity

False positive rate

Complementary specificity

Page 95: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

Definition of metrics

Total number of true edges

Total number of predicted edges

Total number of non-edges

Total number of true edges

Page 96: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

The relation between Precision-Recall (PR) and ROC curves

Page 97: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

The relation between Precision-Recall (PR) and ROC curves

Better performance Better

performance

Page 98: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

Assessment

Participants Had to submit rankings of all interactions

OrganisersComputed areas under 1)Precision-recall curves

2)ROC curves (plotting sensitivity=recall against specificity)

Page 99: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland
Page 100: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland
Page 101: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

Proportion of recovered true

edges

Proportion of avoided non-edges

AUROC = 0.5

Page 102: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland
Page 103: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

Joint work with Wolfgang Lehrach on ab initio prediction of protein interactions

AUROC= 0.61,0.67,0.67

Page 104: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland
Page 105: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

ICML 2006

Page 106: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

The relation between Precision-Recall (PR) and ROC curves

Better performance Better

performance

Page 107: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

Potential advantage of Precision-Recall (PR) over ROC curves

Large number of negative examples (TN+FP)

Large change in FP may have a small effect on the false positive rate

Large change in FP has a strong effect on the precision

Small difference

Large difference

Page 108: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland
Page 109: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland
Page 110: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland
Page 111: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland
Page 112: Inferring gene regulatory networks from transcriptomic profiles Dirk Husmeier Biomathematics & Statistics Scotland

Room for improvement:Higher-dimensional changepoint process

Perturbations

Experimental conditions