chapter 5 phylogeny inference based on distance methods
DESCRIPTION
Chapter 5 Phylogeny inference based on distance methods. - PowerPoint PPT PresentationTRANSCRIPT
![Page 1: Chapter 5 Phylogeny inference based on distance methods](https://reader035.vdocument.in/reader035/viewer/2022081420/56815932550346895dc66d1c/html5/thumbnails/1.jpg)
In addition to maximum parsimony (MP) and likelihood methods, pairwise distance methods form the third large group of methods to infer evolutionary trees from sequence data.
Evolutionary model: When all the pairwise distances have been computed for a set of sequences, a tree topology can then be inferred by a variety of methods.
Chapter 5Phylogeny inference based on distance
methods
![Page 2: Chapter 5 Phylogeny inference based on distance methods](https://reader035.vdocument.in/reader035/viewer/2022081420/56815932550346895dc66d1c/html5/thumbnails/2.jpg)
![Page 3: Chapter 5 Phylogeny inference based on distance methods](https://reader035.vdocument.in/reader035/viewer/2022081420/56815932550346895dc66d1c/html5/thumbnails/3.jpg)
![Page 4: Chapter 5 Phylogeny inference based on distance methods](https://reader035.vdocument.in/reader035/viewer/2022081420/56815932550346895dc66d1c/html5/thumbnails/4.jpg)
The main distance-based tree-building methods are cluster analysis and minimum evolution.
Ultrametricity is satisfied dAC max(≦ dAB, dBC)
unweighted-pair group method with arithmetic means (UPGMA)
that are most similar to each other (that is for which the genetic distance is the smallest). When two OTUs are group, they are treated as a new single OTU. From the new group of OTUs, the pair for which the similarity is highest is again identified, and so on, until only two OTUs are left.
5.2
![Page 5: Chapter 5 Phylogeny inference based on distance methods](https://reader035.vdocument.in/reader035/viewer/2022081420/56815932550346895dc66d1c/html5/thumbnails/5.jpg)
![Page 6: Chapter 5 Phylogeny inference based on distance methods](https://reader035.vdocument.in/reader035/viewer/2022081420/56815932550346895dc66d1c/html5/thumbnails/6.jpg)
![Page 7: Chapter 5 Phylogeny inference based on distance methods](https://reader035.vdocument.in/reader035/viewer/2022081420/56815932550346895dc66d1c/html5/thumbnails/7.jpg)
![Page 8: Chapter 5 Phylogeny inference based on distance methods](https://reader035.vdocument.in/reader035/viewer/2022081420/56815932550346895dc66d1c/html5/thumbnails/8.jpg)
![Page 9: Chapter 5 Phylogeny inference based on distance methods](https://reader035.vdocument.in/reader035/viewer/2022081420/56815932550346895dc66d1c/html5/thumbnails/9.jpg)
![Page 10: Chapter 5 Phylogeny inference based on distance methods](https://reader035.vdocument.in/reader035/viewer/2022081420/56815932550346895dc66d1c/html5/thumbnails/10.jpg)
UPGMA are extremely sensitive to unequal rates in different lineages.
![Page 11: Chapter 5 Phylogeny inference based on distance methods](https://reader035.vdocument.in/reader035/viewer/2022081420/56815932550346895dc66d1c/html5/thumbnails/11.jpg)
Additive distances satisfy the following condition, known as the four-point metric condition
(dAB+dCD) max (≦ dAC + dBD, dAD+dBC)
Only additive distances can be fitted precisely into an unrooted tree such that the genetic distance between a pair of OTUs equals the sum of the lengths of the branches connecting them, rather than an average,
5.2.2 Minimum evolution and neighbor-joining
![Page 12: Chapter 5 Phylogeny inference based on distance methods](https://reader035.vdocument.in/reader035/viewer/2022081420/56815932550346895dc66d1c/html5/thumbnails/12.jpg)
![Page 13: Chapter 5 Phylogeny inference based on distance methods](https://reader035.vdocument.in/reader035/viewer/2022081420/56815932550346895dc66d1c/html5/thumbnails/13.jpg)
Minimum evolution (ME) was first described by Kidd and Sgaramella-Zonta (1971); Rzhetsky and Nei(1992) described a method with only a minor difference. In ME, the tree that minimizes the lengths of the tree, which is the sum of the lengths of the branches, is regarded as the best estimate of the phylogeny:
n is the number of taxa in the tree and vi is the ith branch (remember that there are 2n-3 branches in an unrooted tree of n taxa).
![Page 14: Chapter 5 Phylogeny inference based on distance methods](https://reader035.vdocument.in/reader035/viewer/2022081420/56815932550346895dc66d1c/html5/thumbnails/14.jpg)
• Distances are rarely, exactly tree metrics, and hence one class of ‘goodness of fit’ methods seeks the metric tree that best accounts for the ‘observed’ distances.
• The goodness of fit F between observed distance dij and tree distances pij for each pair of sequences i and j is given by.
• In the example just given we were fitting an additive tree with (2n-3) branches to
( ) = n (n-1)/2 pairwise distances.n2
Distance methods
![Page 15: Chapter 5 Phylogeny inference based on distance methods](https://reader035.vdocument.in/reader035/viewer/2022081420/56815932550346895dc66d1c/html5/thumbnails/15.jpg)
![Page 16: Chapter 5 Phylogeny inference based on distance methods](https://reader035.vdocument.in/reader035/viewer/2022081420/56815932550346895dc66d1c/html5/thumbnails/16.jpg)
Minimum evolution• Given an unrooted metric tree for n sequences there
are (2n-3) branches, each with length ei. The sum of these branch lengths is the length L of the tree:
The minimum evolution tree (ME) is the tree which minimizes L.
• More commonly, the branch lengths of the minimum evolution tree are estimated using least-squares methods. The branch lengths are estimated in the same way as for goodness of fit measures; however, rather than compare the fit of the observed distances the least squares branch lengths are added together to give the length of the tree.
![Page 17: Chapter 5 Phylogeny inference based on distance methods](https://reader035.vdocument.in/reader035/viewer/2022081420/56815932550346895dc66d1c/html5/thumbnails/17.jpg)
![Page 18: Chapter 5 Phylogeny inference based on distance methods](https://reader035.vdocument.in/reader035/viewer/2022081420/56815932550346895dc66d1c/html5/thumbnails/18.jpg)
![Page 19: Chapter 5 Phylogeny inference based on distance methods](https://reader035.vdocument.in/reader035/viewer/2022081420/56815932550346895dc66d1c/html5/thumbnails/19.jpg)
![Page 20: Chapter 5 Phylogeny inference based on distance methods](https://reader035.vdocument.in/reader035/viewer/2022081420/56815932550346895dc66d1c/html5/thumbnails/20.jpg)
![Page 21: Chapter 5 Phylogeny inference based on distance methods](https://reader035.vdocument.in/reader035/viewer/2022081420/56815932550346895dc66d1c/html5/thumbnails/21.jpg)
A drawback of the ME method is that, in principle, all different tree topologies have to be investigated to find the minimum tree. However, this is impossible in practice because of the explosive increase in the number of tree topologies as the number of OTUs increases; an exhaustive search can no longer be applied when more than ten sequences are being used. Page 19
![Page 22: Chapter 5 Phylogeny inference based on distance methods](https://reader035.vdocument.in/reader035/viewer/2022081420/56815932550346895dc66d1c/html5/thumbnails/22.jpg)
A good heuristic method for estimating the ME tree is the neighbor-joining (NJ) method, developed by Saitou and Nei (1987) and modified by Studier and Keppler (1988). Because NJ is conceptually related to clustering, but without assuming a clock-like behavior, it combines computational speed with uniqueness of results.
![Page 23: Chapter 5 Phylogeny inference based on distance methods](https://reader035.vdocument.in/reader035/viewer/2022081420/56815932550346895dc66d1c/html5/thumbnails/23.jpg)
However, NJ trees have proven to be the same or similar to the ME tree. Several methods have been proposed to find ME trees, staring from an NJ tree but evaluating alternative topologies close to the NJ tree by conducting local rearrangements.
![Page 24: Chapter 5 Phylogeny inference based on distance methods](https://reader035.vdocument.in/reader035/viewer/2022081420/56815932550346895dc66d1c/html5/thumbnails/24.jpg)
![Page 25: Chapter 5 Phylogeny inference based on distance methods](https://reader035.vdocument.in/reader035/viewer/2022081420/56815932550346895dc66d1c/html5/thumbnails/25.jpg)
![Page 26: Chapter 5 Phylogeny inference based on distance methods](https://reader035.vdocument.in/reader035/viewer/2022081420/56815932550346895dc66d1c/html5/thumbnails/26.jpg)
![Page 27: Chapter 5 Phylogeny inference based on distance methods](https://reader035.vdocument.in/reader035/viewer/2022081420/56815932550346895dc66d1c/html5/thumbnails/27.jpg)
![Page 28: Chapter 5 Phylogeny inference based on distance methods](https://reader035.vdocument.in/reader035/viewer/2022081420/56815932550346895dc66d1c/html5/thumbnails/28.jpg)
![Page 29: Chapter 5 Phylogeny inference based on distance methods](https://reader035.vdocument.in/reader035/viewer/2022081420/56815932550346895dc66d1c/html5/thumbnails/29.jpg)
3 - (21+24) / 3
![Page 30: Chapter 5 Phylogeny inference based on distance methods](https://reader035.vdocument.in/reader035/viewer/2022081420/56815932550346895dc66d1c/html5/thumbnails/30.jpg)
U
3/2 (21-24) /2 * 3
![Page 31: Chapter 5 Phylogeny inference based on distance methods](https://reader035.vdocument.in/reader035/viewer/2022081420/56815932550346895dc66d1c/html5/thumbnails/31.jpg)
DC
EC
![Page 32: Chapter 5 Phylogeny inference based on distance methods](https://reader035.vdocument.in/reader035/viewer/2022081420/56815932550346895dc66d1c/html5/thumbnails/32.jpg)
5 19+17
![Page 33: Chapter 5 Phylogeny inference based on distance methods](https://reader035.vdocument.in/reader035/viewer/2022081420/56815932550346895dc66d1c/html5/thumbnails/33.jpg)
SDW=dDE/2+(rD-rE)/2(N-2)SEW=dDE-SDW
![Page 34: Chapter 5 Phylogeny inference based on distance methods](https://reader035.vdocument.in/reader035/viewer/2022081420/56815932550346895dc66d1c/html5/thumbnails/34.jpg)
![Page 35: Chapter 5 Phylogeny inference based on distance methods](https://reader035.vdocument.in/reader035/viewer/2022081420/56815932550346895dc66d1c/html5/thumbnails/35.jpg)
![Page 36: Chapter 5 Phylogeny inference based on distance methods](https://reader035.vdocument.in/reader035/viewer/2022081420/56815932550346895dc66d1c/html5/thumbnails/36.jpg)
![Page 37: Chapter 5 Phylogeny inference based on distance methods](https://reader035.vdocument.in/reader035/viewer/2022081420/56815932550346895dc66d1c/html5/thumbnails/37.jpg)
However, NJ trees have proven to be the same or similar to the ME tree. Several methods have been proposed to find ME trees, staring from an NJ tree but evaluating alternative topologies close to the NJ tree by conducting local rearrangements.
![Page 38: Chapter 5 Phylogeny inference based on distance methods](https://reader035.vdocument.in/reader035/viewer/2022081420/56815932550346895dc66d1c/html5/thumbnails/38.jpg)
Alternative versions of the NJ algorithm have been proposed, including BIONJ (Gascuel, 1997), weighted neighbor-joining (weighbor), and generalized neighbor-joining.
BIONJ and weighbor both consider that long genetic distances present a higher variance than short ones when distances from a newly defined node to all other nodes are estimated.
![Page 39: Chapter 5 Phylogeny inference based on distance methods](https://reader035.vdocument.in/reader035/viewer/2022081420/56815932550346895dc66d1c/html5/thumbnails/39.jpg)
The weighted neighbor-joining method of Bruno et al. (2000) uses a likelihood-based criterion rather than the ME criterion of Saitou and Nei (1987) to decide which pair of OTUs should be joined.
The generalized neighbor-joining method of Pearson et al.(1999) keeps track of multiple, partial, and potentially good solutions during its execution, thus exploring a greater part of the tree space. As a result, the program is able to discover topologically distinct solutions that are close to the ME tree.
![Page 40: Chapter 5 Phylogeny inference based on distance methods](https://reader035.vdocument.in/reader035/viewer/2022081420/56815932550346895dc66d1c/html5/thumbnails/40.jpg)
![Page 41: Chapter 5 Phylogeny inference based on distance methods](https://reader035.vdocument.in/reader035/viewer/2022081420/56815932550346895dc66d1c/html5/thumbnails/41.jpg)
![Page 42: Chapter 5 Phylogeny inference based on distance methods](https://reader035.vdocument.in/reader035/viewer/2022081420/56815932550346895dc66d1c/html5/thumbnails/42.jpg)
![Page 43: Chapter 5 Phylogeny inference based on distance methods](https://reader035.vdocument.in/reader035/viewer/2022081420/56815932550346895dc66d1c/html5/thumbnails/43.jpg)
![Page 44: Chapter 5 Phylogeny inference based on distance methods](https://reader035.vdocument.in/reader035/viewer/2022081420/56815932550346895dc66d1c/html5/thumbnails/44.jpg)
![Page 45: Chapter 5 Phylogeny inference based on distance methods](https://reader035.vdocument.in/reader035/viewer/2022081420/56815932550346895dc66d1c/html5/thumbnails/45.jpg)
Consensus tree
![Page 46: Chapter 5 Phylogeny inference based on distance methods](https://reader035.vdocument.in/reader035/viewer/2022081420/56815932550346895dc66d1c/html5/thumbnails/46.jpg)
Compare trees derived from different sequences, or from the same sequence
using different methods.
![Page 47: Chapter 5 Phylogeny inference based on distance methods](https://reader035.vdocument.in/reader035/viewer/2022081420/56815932550346895dc66d1c/html5/thumbnails/47.jpg)
![Page 48: Chapter 5 Phylogeny inference based on distance methods](https://reader035.vdocument.in/reader035/viewer/2022081420/56815932550346895dc66d1c/html5/thumbnails/48.jpg)
Jackknifing