Multiple alignment using hidden Markov models, November 21, 2001, Kim Hye Jin, Intelligent Multimedia Lab


Page 1: Multiple alignment using hidden Markov models November 21, 2001 Kim Hye Jin Intelligent Multimedia Lab marisan@postech.ac.kr

Multiple alignment using hidden Markov models

November 21, 2001

Kim Hye Jin

Intelligent Multimedia Lab

marisan@postech.ac.kr

Page 2:

Outline

• Introduction

• Methods and algorithm

• Results

• Discussion

IM lab

Page 3:

Introduction

• Why HMM?

– Mathematically consistent description of insertions and deletions

– Theoretical insight into the difficulties of combining disparate forms of information, e.g. sequences and 3D structures

– Possible to train models from initially unaligned sequences


Page 4:

Methods and algorithms

• State transition

– The state sequence is a first-order Markov chain

– Each state is hidden

– Match / insert / delete states

• Symbol emission
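The two ingredients above, state transitions and symbol emissions, can be illustrated with a minimal generative sketch. The two-state model, its probabilities, and the names `TRANSITIONS`, `EMISSIONS`, `draw`, and `generate` are hypothetical choices made for illustration; they are not taken from the slides. Delete states, which emit no symbol, are left out to keep the sketch short.

```python
import random

# Hypothetical toy model: two hidden states over a four-letter alphabet.
STATES = ["match", "insert"]
TRANSITIONS = {
    "match": {"match": 0.9, "insert": 0.1},
    "insert": {"match": 0.6, "insert": 0.4},
}
EMISSIONS = {
    "match": {"A": 0.7, "C": 0.1, "G": 0.1, "T": 0.1},
    "insert": {"A": 0.25, "C": 0.25, "G": 0.25, "T": 0.25},
}


def draw(distribution, rng):
    """Draw one key from a {key: probability} mapping."""
    keys = list(distribution)
    weights = [distribution[k] for k in keys]
    return rng.choices(keys, weights=weights, k=1)[0]


def generate(length, start="match", seed=0):
    """Return (hidden state path, emitted sequence) of the given length."""
    rng = random.Random(seed)
    state, path, symbols = start, [], []
    for _ in range(length):
        path.append(state)
        symbols.append(draw(EMISSIONS[state], rng))  # symbol emission
        state = draw(TRANSITIONS[state], rng)        # first-order state transition
    return path, "".join(symbols)


path, sequence = generate(10)
print(sequence)  # the observed symbols
print(path)      # the state path, which an observer would not see
```

The next state depends only on the current state, which is what "first-order" means here, and only `sequence` would be visible in real data.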

[Figure: state transitions and symbol emission]

Page 5:

[Figure: HMM architecture with deletion, match, and insertion states]

Page 6:

Methods and algorithms

• Arbitrary scores are replaced with probabilities relative to a consensus

• A model M consists of N states S1 … SN

• An observed sequence O consists of T symbols O1 … OT from an alphabet X

• aij : probability of a transition from Si to Sj

• bj(x) : probability that state Sj emits symbol x
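With this notation, the probability of one particular state path together with the observed sequence is a product of transition and emission terms. The sketch below works that product through for a toy model. All numeric values are hypothetical, and the start distribution `pi` is an added assumption, since the slide does not define how a path begins.

```python
# a[i][j]: probability of a transition from state S_i to S_j (hypothetical values)
a = [[0.9, 0.1],
     [0.6, 0.4]]
# b[j][x]: probability that state S_j emits symbol x (hypothetical values)
b = [{"A": 0.7, "C": 0.3},
     {"A": 0.2, "C": 0.8}]
# pi[i]: probability of starting in state S_i (assumption, not on the slide)
pi = [1.0, 0.0]


def joint_probability(path, observed):
    """P(O, path | M): start term times one transition and one emission per step."""
    if len(path) != len(observed):
        raise ValueError("path and observed sequence must have equal length")
    p = pi[path[0]] * b[path[0]][observed[0]]
    for t in range(1, len(observed)):
        p *= a[path[t - 1]][path[t]] * b[path[t]][observed[t]]
    return p


# Path S1 -> S1 -> S2 emitting "AAC":
# 1.0 * 0.7  *  0.9 * 0.7  *  0.1 * 0.8
print(joint_probability([0, 0, 1], "AAC"))
```

Each row of `a` and each emission table in `b` sums to 1, which is what makes these quantities probabilities rather than arbitrary scores.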

Page 7:

Methods and algorithms

• Example of an HMM: a model for the sequence ACCY

Page 8:

Methods and algorithms

• Forward algorithm

– Scores a sequence by summing over all paths through the model (a sum rather than a maximum)
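A minimal sketch of the forward recursion for a small discrete HMM follows. The two-state model, its numbers, and the start distribution `pi` are hypothetical, chosen only so the example runs; the slides do not give a concrete model.

```python
# Hypothetical toy model: 2 states, alphabet {A, C}.
a = [[0.9, 0.1],
     [0.6, 0.4]]                      # a[i][j]: transition S_i -> S_j
b = [{"A": 0.7, "C": 0.3},
     {"A": 0.2, "C": 0.8}]            # b[j][x]: emission of x from S_j
pi = [0.5, 0.5]                       # start distribution (assumption)


def forward(observed, a, b, pi):
    """Return P(O | M), the sum over all state paths of P(O, path | M)."""
    n = len(a)
    # f[j]: probability of emitting the prefix read so far and ending in state j
    f = [pi[j] * b[j][observed[0]] for j in range(n)]
    for x in observed[1:]:
        # Sum over every predecessor state i (this is the "sum" in the slide).
        f = [sum(f[i] * a[i][j] for i in range(n)) * b[j][x] for j in range(n)]
    return sum(f)


print(forward("AAC", a, b, pi))
```

For long sequences the products underflow in floating point, so practical implementations usually work in log space or rescale `f` at each step.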

Page 9:

Methods and algorithms

• Viterbi algorithm

– Finds the most likely path through the model

– The path is recovered by following the back pointers
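The same recursion with a maximum in place of the sum, plus stored back pointers, gives a minimal Viterbi sketch. The toy model and the start distribution `pi` are hypothetical values used only for illustration.

```python
# Hypothetical toy model: 2 states, alphabet {A, C}.
a = [[0.9, 0.1],
     [0.6, 0.4]]                      # a[i][j]: transition S_i -> S_j
b = [{"A": 0.7, "C": 0.3},
     {"A": 0.2, "C": 0.8}]            # b[j][x]: emission of x from S_j
pi = [0.5, 0.5]                       # start distribution (assumption)


def viterbi(observed, a, b, pi):
    """Return (most likely state path, its joint probability with the sequence)."""
    n = len(a)
    # v[j]: probability of the best path for the prefix that ends in state j
    v = [pi[j] * b[j][observed[0]] for j in range(n)]
    back = []  # back[t][j]: best predecessor of state j at position t + 1
    for x in observed[1:]:
        pointers, new_v = [], []
        for j in range(n):
            best_i = max(range(n), key=lambda i: v[i] * a[i][j])
            pointers.append(best_i)
            new_v.append(v[best_i] * a[best_i][j] * b[j][x])
        back.append(pointers)
        v = new_v
    # Start from the best final state and follow the back pointers.
    state = max(range(n), key=lambda j: v[j])
    best_probability = v[state]
    path = [state]
    for pointers in reversed(back):
        state = pointers[state]
        path.append(state)
    path.reverse()
    return path, best_probability


best_path, best_probability = viterbi("AAC", a, b, pi)
print(best_path, best_probability)
```

In a model with match, insert, and delete states, the recovered path determines which residues are assigned to which match states, and that assignment is the alignment of the sequence to the model.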

Page 10:

Methods and algorithms

• Baum-Welch algorithm

– A variation of the forward algorithm

– Starts from a reasonable guess for the initial model, then calculates a score for each sequence in the training set and re-estimates the model using the EM algorithm

• Local optima problem:

– Forward algorithm / Viterbi algorithm

– Baum-Welch algorithm

Page 11:

Methods and algorithms

• Simulated annealing

– Helps training avoid poor local optima in the search for a global or near-global optimum

– kT = 0 : standard Viterbi training procedure

– kT is lowered gradually during training
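The role of kT can be sketched with a temperature-controlled choice among candidates. The Boltzmann form used here (probability proportional to exp(score / kT)), the candidate log-scores, and the halving schedule are assumptions made for illustration; the slides do not specify how the temperature enters the sampling.

```python
import math
import random


def boltzmann_weights(log_scores, kT):
    """Probabilities proportional to exp(log_score / kT), for kT > 0."""
    top = max(log_scores)  # subtract the maximum for numerical stability
    weights = [math.exp((s - top) / kT) for s in log_scores]
    total = sum(weights)
    return [w / total for w in weights]


def choose(log_scores, kT, rng):
    """Pick an index; at kT == 0 this is the deterministic best choice."""
    if kT == 0:
        return max(range(len(log_scores)), key=lambda i: log_scores[i])
    weights = boltzmann_weights(log_scores, kT)
    return rng.choices(range(len(log_scores)), weights=weights, k=1)[0]


candidates = [-12.0, -10.0, -15.0]  # hypothetical log-scores of three alternatives
rng = random.Random(0)

kT = 5.0
while kT > 0.01:
    # As kT drops, the probability mass concentrates on the best candidate.
    print(round(kT, 3), [round(p, 3) for p in boltzmann_weights(candidates, kT)])
    kT *= 0.5                       # hypothetical cooling schedule

print(choose(candidates, 0, rng))   # kT = 0: always the highest-scoring candidate
```

At high kT the choice is close to uniform, so worse alternatives are explored; at kT = 0 only the best-scoring alternative is ever taken, which matches the slide's remark that kT = 0 reduces to the standard Viterbi procedure.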

Page 12:

Methods and algorithms

ClustalW

Page 13:

Methods and algorithms

ClustalX

Page 14:

Results

• len : consensus length of the alignment

• ali : the number of structurally aligned sequences

• %id : the percentage sequence identity

• Homo : the number of homologues identified in and extracted from SwissProt 30

• %id : the average percentage sequence identity in the set of homologues
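The two %id columns can be illustrated with a minimal sketch of percentage sequence identity. The convention used here (identical residues divided by the number of columns where neither sequence has a gap) is one common choice and is an assumption; the slides do not say which definition was used. The function names and example sequences are hypothetical.

```python
def percent_identity(seq_a, seq_b, gap="-"):
    """Percentage of gap-free aligned columns in which both residues are identical."""
    if len(seq_a) != len(seq_b):
        raise ValueError("aligned sequences must have equal length")
    pairs = [(x, y) for x, y in zip(seq_a, seq_b) if x != gap and y != gap]
    if not pairs:
        return 0.0
    matches = sum(1 for x, y in pairs if x == y)
    return 100.0 * matches / len(pairs)


def average_percent_identity(alignment):
    """Mean pairwise identity over all pairs of rows in an alignment."""
    n = len(alignment)
    values = [percent_identity(alignment[i], alignment[j])
              for i in range(n) for j in range(i + 1, n)]
    return sum(values) / len(values)


print(percent_identity("ACCY", "AGCY"))                     # 3 of 4 columns match
print(average_percent_identity(["ACCY", "AGCY", "ACCY"]))   # mean over 3 pairs
```

Other conventions divide by the full alignment length or by the length of the shorter sequence, so values from different sources are comparable only when the definition is the same.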

Page 15:

Results

Page 16:

Discussion

• HMM

– A consistent theory for insertion and deletion penalties

– EGF : fairly difficult alignments are done well

• ClustalW

– Progressive alignment

– Disparities between the sequence identity of the structures and the sequence identity of the homologues

– Poor correlation between score and quality

Page 17:

Discussion

• The ability of HMMs to perform sensitive fold recognition is apparent