create and assess protein networks through molecular characteristics of individual proteins

15
Create and assess Create and assess protein networks protein networks through molecular through molecular characteristics of characteristics of individual proteins individual proteins Yanay Ofran et al. ISMB ’06 Yanay Ofran et al. ISMB ’06 Presenter: Danhua Guo Presenter: Danhua Guo 12/07/2006 12/07/2006

Upload: abie

Post on 11-Feb-2016

39 views

Category:

Documents


0 download

DESCRIPTION

Create and assess protein networks through molecular characteristics of individual proteins. Yanay Ofran et al. ISMB ’06 Presenter: Danhua Guo 12/07/2006. Roadmap. Motivation Introduction Methods Results and Discussion Conclusion. Motivation. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Create and assess protein networks through molecular characteristics of individual proteins

Create and assess Create and assess protein networks protein networks through molecular through molecular characteristics of characteristics of individual proteinsindividual proteinsYanay Ofran et al. ISMB ’06Yanay Ofran et al. ISMB ’06

Presenter: Danhua GuoPresenter: Danhua Guo12/07/200612/07/2006

Page 2: Create and assess protein networks through molecular characteristics of individual proteins

RoadmapRoadmap MotivationMotivation IntroductionIntroduction MethodsMethods Results and DiscussionResults and Discussion ConclusionConclusion

Page 3: Create and assess protein networks through molecular characteristics of individual proteins

MotivationMotivation Study of biological systems relies Study of biological systems relies

on network topology.on network topology. Integrating protein information Integrating protein information

into the network enhance the into the network enhance the analysis of biological systems.analysis of biological systems.

Page 4: Create and assess protein networks through molecular characteristics of individual proteins

IntroductionIntroduction Protein-Protein Interaction (PPI) Protein-Protein Interaction (PPI)

NetworkNetwork– Help identify process or functionsHelp identify process or functions– Major problemMajor problem

Generation problemGeneration problem– Experimental errors: should not be in the networkExperimental errors: should not be in the network– ““In vitroIn vitro”: should be include in the network”: should be include in the network

Data representation problemData representation problem– Essential connection between PPI and proteinEssential connection between PPI and protein

Page 5: Create and assess protein networks through molecular characteristics of individual proteins

IntroductionIntroduction An ideal frameworkAn ideal framework

– Macro level: network topologyMacro level: network topology– Micro level: characteristics of each Micro level: characteristics of each

proteinprotein LocalizationLocalization Functional annotationFunctional annotation

Page 6: Create and assess protein networks through molecular characteristics of individual proteins

IntroductionIntroduction Protein interaction Network Protein interaction Network

Assessment Tool (PiNAT)Assessment Tool (PiNAT)

Page 7: Create and assess protein networks through molecular characteristics of individual proteins

MethodsMethods Large-scale Assessment of PPIsLarge-scale Assessment of PPIs

– Based on localizationBased on localization– Based on GO annotation (if applicable)Based on GO annotation (if applicable)

Automatic generation of networksAutomatic generation of networks– Get submitted list of proteins from userGet submitted list of proteins from user– Search DIP and IntActSearch DIP and IntAct

Display of networks in the cellular Display of networks in the cellular contextcontext

Alzheimer’s disease related pathwayAlzheimer’s disease related pathway

Page 8: Create and assess protein networks through molecular characteristics of individual proteins

MethodsMethods Localization criteriaLocalization criteria

– LOCtree: classify eukaryotic proteins LOCtree: classify eukaryotic proteins (60%)(60%) Threshold: confidence score >=4Threshold: confidence score >=4

– PHDhtm: predict transmembrane helices PHDhtm: predict transmembrane helices (7%)(7%) Threshold: average score among 20 reliable Threshold: average score among 20 reliable

predictions >8.5predictions >8.5– Experiment on 4800 interactions (2191 Experiment on 4800 interactions (2191

proteins)proteins) High-confidence prediction: 2312 (1482 proteins)High-confidence prediction: 2312 (1482 proteins) Total protein pairs: 1,097,421Total protein pairs: 1,097,421 Binomial approximation to the cumulative Binomial approximation to the cumulative

hypergeometric probability distribution to get a p-hypergeometric probability distribution to get a p-value for over and under representationvalue for over and under representation

Page 9: Create and assess protein networks through molecular characteristics of individual proteins

MethodsMethods GO criteriaGO criteria

– The functionality annotation of a proteinThe functionality annotation of a protein– Distance between 2 GO terms measure the Distance between 2 GO terms measure the

similaritysimilarity

m,n: respective numbers of annotations in i and jm,n: respective numbers of annotations in i and j simGo: GO similarity defined by Lord et al.simGo: GO similarity defined by Lord et al. Ck, Cp: respective individual annotation in protein Ck, Cp: respective individual annotation in protein

i and ji and j Cjmax: Ck’s most similar term in jCjmax: Ck’s most similar term in j Cimax: Cp’s most similar term in iCimax: Cp’s most similar term in i

Page 10: Create and assess protein networks through molecular characteristics of individual proteins

MethodsMethods Display of networks in the cellular Display of networks in the cellular

contextcontext– Based on LOCtree and PHDhtm Based on LOCtree and PHDhtm

predictionspredictions– Generate Graph Markup Language Generate Graph Markup Language

(GML)(GML)– Localization overide rule:Localization overide rule:

High PHDhtm > High LOCtree > Low PHDhtm > Low High PHDhtm > High LOCtree > Low PHDhtm > Low LOCtreeLOCtree

Page 11: Create and assess protein networks through molecular characteristics of individual proteins

ResultsResults Interactions across subcellular Interactions across subcellular

compartmentscompartments

– Intra-compartment interactions: high scoreIntra-compartment interactions: high score– Distant compartment: low scoreDistant compartment: low score– Nearby compartment: likelyNearby compartment: likely

Page 12: Create and assess protein networks through molecular characteristics of individual proteins

ResultsResults Likely and unlikely interactions Likely and unlikely interactions

across GOacross GO– Likely: >Likely: >3.253.25– Unlikely: Unlikely: <1.3<1.3– Neutral: elseNeutral: else

Page 13: Create and assess protein networks through molecular characteristics of individual proteins

ResultResult Alzheimer in the Alzheimer in the

perspective of PiNATperspective of PiNAT– Reflects the unclarity Reflects the unclarity

regarding Amyloid regarding Amyloid beta A4 protein (APP) beta A4 protein (APP) ’s localization’s localization

– APP interacts APP interacts extensively with extensively with almost every almost every compartment of the compartment of the cellcell

Page 14: Create and assess protein networks through molecular characteristics of individual proteins

ResultResult APP’s role in APP’s role in

AlzheimerAlzheimer– APP-related PPI APP-related PPI

deemed deemed “unlikely”“unlikely”

– Conflicts Conflicts between 2 between 2 scoring systemsscoring systems

Page 15: Create and assess protein networks through molecular characteristics of individual proteins

ConclusionConclusion Molecular knowledge and network Molecular knowledge and network

structure can enhance our structure can enhance our understanding of biological understanding of biological processes.processes.

PiNAT is efficient and meaningful.PiNAT is efficient and meaningful.