Plant Phylogenomics: Lessons from the 1KP Project

Plant Phylogenomics: Lessons from the 1KP Project Jim Leebens-Mack Department of Plant Biology University of Georgia New Methods for Phylogenomics and Metagenomics Symposium Feb, 2013


Page 1: Plant Phylogenomics: Lessons from the 1KP Project

Plant Phylogenomics: Lessons from the 1KP Project

Jim Leebens-Mack

Department of Plant Biology

University of Georgia

New Methods for Phylogenomics and Metagenomics Symposium, Feb 2013

Page 2

Collaborators

Funding: NSF, iPlant, University of Georgia, OneKP

Ancestral Angiosperm/Amborella Genome: Vic Albert, Raj Ayyampalayam, Brad Barbazuk, John Bowers, Jim Burnette, Srikar Chamala, Andre Chanderbali, Josh Der, Claude dePamphilis, Jamie Estill, Hong Ma, Doug & Pam Soltis, Stephan Schuster, Sue Wessler, Rod Wing, Kerr Wall, Norm Wickett, Eric Wafula

MonAToL: Claude dePamphilis, Tom Givnish, Cecile Ané, Raj Ayyampalayam, Sean Graham, Dennis Stevenson, Jerry Davis, Alejandra Gandolfo, Chris Pires, Norm Wickett, Wendy Zomlefer, Michael McKain, Jill Duarte

OneKP/MSA AToL: Norman Wickett, Nam Nguyen, Siavash Mirarab, Naim Matasci, Gane Ka-Shu Wong, BGI, Eric Carpenter, Brad Ruhfel, Herve Philippe, Gordon Burleigh, Matt Barker, Claude dePamphilis, Tandy Warnow, Jamie Estill, Raj Ayyampalayam, Doug & Pam Soltis, Sean Graham, Dennis Stevenson, Michael Melkonian, … OneKP Consortium

Page 3

“Nothing in biology makes sense except in the light of evolution” (Theodosius Dobzhansky, 1973)

“Nothing in evolution makes sense except in the light of phylogeny”

Darwin (1837), First Notebook on Transmutation of Species

Page 4

“Phylogenomics” - Jonathan Eisen (1998; Genome Research 8:163-167)

Page 5

Current Usages

1.  Using genome-scale data to resolve phylogenetic relationships

2.  Genome-scale comparisons placed within a phylogenetic context

Page 6

Phylogenomics 1: Plastid Genome Phylogeny Resolving Many Previously Intractable Questions in Plant Systematics

Jansen et al. 2007, PNAS

Liriodendron tulipifera, 159,885 bp

Page 7

What About Nuclear Gene Histories?

monatol.uga.edu

iPlant Tree of Life

www.onekp.com

ancangio.uga.edu

Gane Ka-Shu Wong (Alberta)

Page 8

Phylo-Transcriptomics…

[Figure: tree with tip labels Amborella, Liriodendron2, Liriodendron1, Persea, Saruma, Hordeum, Triticum, Oryza, Zea mays, Sorghum, Saccharum, Asparagus1, Asparagus2, Allium, Yucca, Medicago, Lotus, Glycine, Populus trichocarpa, Populus tremuloides, Rosa, Arabidopsis, Descurainia, Gossypium, Vitis, Suaeda, Solanum, Solanum lycopersicon, Solanum hirsutum]

Page 9

Gene Family Circumscription, Sequence Alignment, Gene Tree Estimation and Species Tree Estimation

[Flowchart; box labels listed in approximate pipeline order]

•  Reference Proteomes + New Transcriptome Data
•  Full-Length Coding Sequence Database (Sequenced Genomes, Refseq…)
•  MCL-based Gene Family Circumscription
•  Core Gene Family Scaffolds
•  HMM Estimation for each Gene Family
•  Gene Family HMMs
•  Coding Sequence Databases for each Species (Transcriptome Assemblies)
•  HMM Sorting of Coding Sequences into Gene Families
•  Gene Family Fasta Files
•  Multiple-sequence Alignment
•  Gene Tree Estimation
•  SATé
•  Species Tree Estimation
•  SuperTree Analysis (Burleigh & Eulenstein)
•  SuperMatrix Analysis (deSalle et al. [AMNH])
•  Low Copy Gene SuperMatrix Analysis (dePamphilis et al.)
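The SuperMatrix boxes in this flowchart depend on concatenating per-gene alignments into a single matrix. A minimal Python sketch of that one step follows. It assumes each gene alignment is a dict mapping taxon name to an aligned sequence; the function name `concatenate`, the toy taxa, and the toy sequences are illustrative assumptions, not part of the 1KP pipeline.

```python
from typing import Dict, List


def concatenate(gene_alignments: List[Dict[str, str]], missing: str = "-") -> Dict[str, str]:
    """Concatenate per-gene alignments into one supermatrix.

    A taxon that is absent from a gene is padded with `missing`
    characters so that every row keeps the same total length.
    """
    taxa = sorted({taxon for aln in gene_alignments for taxon in aln})
    rows = {taxon: [] for taxon in taxa}
    for aln in gene_alignments:
        lengths = {len(seq) for seq in aln.values()}
        if len(lengths) != 1:
            raise ValueError("sequences within one gene alignment must have equal length")
        width = lengths.pop()
        for taxon in taxa:
            rows[taxon].append(aln.get(taxon, missing * width))
    return {taxon: "".join(parts) for taxon, parts in rows.items()}


# Hypothetical toy data: two genes, three taxa, each gene missing one taxon.
genes = [
    {"Amborella": "ATGC", "Zea": "ATGA"},
    {"Amborella": "GG", "Vitis": "GA"},
]
supermatrix = concatenate(genes)
for taxon, row in supermatrix.items():
    print(taxon, row)
```

Padding absent taxa with gaps is what produces the "lots of missing data" complication raised later in the talk, so the padded fraction per taxon is worth tracking.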

Page 10

Gene Family Classification for Full-Length Genes from Sequenced Genomes – OrthoMCL clustering

Page 11

Gain and Loss of “Gene Families”

Large increases on internal branches

[Figure labels: 1000 GF clusters; Angiosperms; monocots; rosids; asterids; eudicots]

Page 12

[Figure comparing 1KP and NCBI across green plant lineages; scale bar: width of 1,000,000 genes. Lineage labels: green algae, chlorophytes, charophytes, liverworts, mosses, hornworts, ferns, gymnosperms, conifers, angiosperms, magnoliids, monocots, commelinids, eudicots, basal eudicots, rosids, asterids]

Page 13

1KP: Questions to be Addressed through Estimation of Gene Trees and Species Relationships

•  What are the relationships among lineages across the green plant tree of life?

•  What was the nature and gene composition of the likely ancestor of the Viridiplantae?

•  What was the impact of lateral gene transfer (from bacteria) on the early evolution of the Viridiplantae and the Chlorophyta/Streptophyta?

•  Were gene and/or genome duplication events associated with innovations in green plant evolution, including the origin of the flower, the seed, the vascular cambium (wood), alternation of generations, the shift from a life history dominated by the haploid phase to a dominant diploid life stage, colonization of land, and shifts from single cells to multicellularity?

•  Has polyploidy played a role in the diversification of angiosperms?

Page 14

Check out OneKP data

Page 15

Surprises from Phylogenomics: Monophyly of Conifers?

Plastid Genome Analyses | Large Nuclear Gene Analyses

Cibrián et al. 2010, Lee et al. 2011, OneKP unpublished

[Figure labels: double fertilization; porose vessels]

Page 16

Origin of Land Plants: Retention of egg, phragmoplast and plasmodesmata preadaptations for colonization of land

[Figure: Green Plants (Viridophytes), age ~1.5 billion years, with fossil groups in brackets. Labels: sporangium; gametangia; retention of egg, phragmoplast and plasmodesmata; filamentous growth; Charales; Coleochaete; cell plate; cell wall]

Page 17

Plastid phylogenomic analyses yield surprising results for the origin of land plants: filamentous rather than more complex algal lineages are inferred sister to Land Plants!

Chara vulgaris cp genome, 184,933 bp

[Figure panels: ML; MP; ML-distance; LogDet-distance. Label: Land Plants]

Monique Turmel et al. 2006, MBE

Page 18

Analysis of 604 single-copy nuclear genes agrees with the plastome analysis, suggesting that retention of egg, plasmodesmata, and phragmoplast was lost in diverse Zygnemophyceae

[Figure labels: Land Plants; retention of egg, phragmoplast and plasmodesmata; Loss]

Page 19

Why should you believe these results?

•  You shouldn’t… Inferred trees are hypotheses!

Page 20

Consider Possible Complications

•  Lots of missing data!
   •  Remove genes/taxa with lots of missing data

•  Ortholog identification
   •  Focus on genes/genomes that tend not to be duplicated

•  Contamination
   •  BLAST, sequence placement on reference tree (SEPP?), long branch trimming

•  Model misspecification
   •  Model validation (simulations)

•  Test robustness of hypothesis among inferences derived from alternative models
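The first remedy on this slide, removing taxa with lots of missing data, can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the filtering used for the 1KP analyses: the alignment is a dict of taxon name to aligned sequence, `-` and `?` count as missing, and the 0.5 cutoff, the function names, and the toy taxa are all hypothetical.

```python
from typing import Dict


def missing_fraction(seq: str, missing_chars: str = "-?") -> float:
    """Fraction of positions in `seq` that are gap or unknown characters."""
    if not seq:
        return 1.0
    return sum(ch in missing_chars for ch in seq) / len(seq)


def drop_sparse_taxa(alignment: Dict[str, str], max_missing: float = 0.5) -> Dict[str, str]:
    """Keep only taxa whose missing-data fraction is at most `max_missing`."""
    return {
        taxon: seq
        for taxon, seq in alignment.items()
        if missing_fraction(seq) <= max_missing
    }


# Hypothetical toy alignment: taxon_b is 75% missing and is dropped.
alignment = {"taxon_a": "ATGC", "taxon_b": "A---", "taxon_c": "AT--"}
filtered = drop_sparse_taxa(alignment, max_missing=0.5)
print(sorted(filtered))
```

The same function applied per gene rather than per taxon gives the "remove genes" variant; the threshold is a judgment call that should itself be varied when testing robustness.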

Page 21

Contamination is an issue!

Page 22

Data Matrix/Analysis Perturbation

•  DNA vs Amino Acids
•  All codon positions vs remove third position
•  Remove gappy genes or sites
•  Remove genes on long branches in gene trees
•  Remove genes where contamination seems to persist
•  Supermatrix and Supertree estimation
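Two of the perturbations listed above, removing third codon positions and removing gappy sites, are simple column operations on an alignment. The Python sketch below shows one way to express them. It assumes an in-frame codon alignment that starts at codon position 1 and uses a hypothetical gap cutoff of 0.5; the function names and toy sequences are illustrative only.

```python
from typing import Dict


def drop_third_positions(seq: str) -> str:
    """Keep 1st and 2nd codon positions; assumes `seq` is in frame from position 1."""
    return "".join(ch for i, ch in enumerate(seq) if i % 3 != 2)


def drop_gappy_columns(alignment: Dict[str, str], max_gap_fraction: float = 0.5) -> Dict[str, str]:
    """Remove alignment columns whose gap fraction exceeds `max_gap_fraction`."""
    taxa = list(alignment)
    length = len(alignment[taxa[0]])
    keep = [
        i
        for i in range(length)
        if sum(alignment[t][i] == "-" for t in taxa) / len(taxa) <= max_gap_fraction
    ]
    return {t: "".join(alignment[t][i] for i in keep) for t in taxa}


# Hypothetical toy data.
first_second = drop_third_positions("ATGCCA")  # codons ATG, CCA
trimmed = drop_gappy_columns({"a": "A-T", "b": "A-G", "c": "ACG"})
print(first_second, trimmed)
```

Running the same tree inference on each perturbed matrix and comparing the results is the robustness test the slide proposes.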

Page 23

Good News: Some results seem to be very robust!

Page 24

Issues with inclusion of 3rd codon position?

[Figure panels: Amino Acids; 1st & 2nd Codon Pos; Supertree AA/1st2nd; All Pos; All Codon Pos]

Page 25

Problems due to heterogeneous substitution process?

Page 26

Filamentous Algae + Land Plant Hypothesis Seems to be Robust

[Figure panels: Amino Acids; 1st & 2nd Codon Pos; Supertree AA/1st2nd]

Page 27

[Figure labels: Land Plants; retention of egg, phragmoplast and plasmodesmata; Loss]

Page 28

Supertree and Supermatrix Analyses Provide Different Inferences Concerning the Monophyly of Conifers

[Figure panels: Amino Acids; 1st & 2nd Codon Pos; Supertree AA/1st2nd]
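Whether a group such as the conifers is monophyletic on a given tree can be checked mechanically: the group is monophyletic if some node of the rooted tree has exactly those taxa as its leaves. The Python sketch below assumes rooted trees written as nested tuples of leaf names. The two toy topologies and the choice of genera are hypothetical illustrations, not the trees shown on this slide.

```python
def leaves(tree):
    """Return the set of leaf names under a node (a leaf is a plain string)."""
    if isinstance(tree, str):
        return frozenset([tree])
    return frozenset().union(*(leaves(child) for child in tree))


def clades(tree):
    """Return the leaf set defined by every node of a rooted tree."""
    if isinstance(tree, str):
        return {frozenset([tree])}
    result = {leaves(tree)}
    for child in tree:
        result |= clades(child)
    return result


def is_monophyletic(tree, group):
    """True if some node of `tree` has exactly `group` as its leaves."""
    return frozenset(group) in clades(tree)


# Hypothetical toy topologies for illustration only.
conifers = {"Pinus", "Cupressus"}
tree_a = ((("Pinus", "Cupressus"), "Gnetum"), "Ginkgo")
tree_b = ((("Pinus", "Gnetum"), "Cupressus"), "Ginkgo")
print(is_monophyletic(tree_a, conifers), is_monophyletic(tree_b, conifers))
```

Applying one such check to the tree from each analysis (supermatrix, supertree, different data partitions) gives a compact table of which analyses support the hypothesis.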

Page 29

Mosses + Liverworts Clade Sister to Vascular Plants?

[Figure panels: Amino Acids; 1st & 2nd Codon Pos; Supertree AA/1st2nd]

Page 30

Conclusions

•  Genome-scale phylogenetic inference is complex!

•  Model mis-specification can result in statistical inconsistency: more data yields stronger support for the wrong answer.

•  Proposal: Apply multiple analysis strategies and explore/understand the basis of conflicts among the resulting trees
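One way to start exploring conflicts among trees from multiple analysis strategies is to list the clades on which each pair of trees disagrees. The Python sketch below does this for rooted trees written as nested tuples; the size of each reported set is a rooted Robinson-Foulds-style distance. The analysis names, taxa A to D, and topologies are hypothetical, and this is only one possible summary of conflict.

```python
from itertools import combinations


def clades(tree):
    """Return (leaf set of this node, leaf sets of all nodes at or below it)."""
    if isinstance(tree, str):
        leaf = frozenset([tree])
        return leaf, {leaf}
    here = frozenset()
    found = set()
    for child in tree:
        child_leaves, child_clades = clades(child)
        here |= child_leaves
        found |= child_clades
    found.add(here)
    return here, found


def conflicts(analyses):
    """For each pair of analyses, return the clades found in only one of the two trees."""
    clade_sets = {name: clades(tree)[1] for name, tree in analyses.items()}
    return {
        (a, b): clade_sets[a] ^ clade_sets[b]
        for a, b in combinations(sorted(clade_sets), 2)
    }


# Hypothetical toy trees from three analysis strategies.
analyses = {
    "amino_acids": ((("A", "B"), "C"), "D"),
    "all_codon_positions": ((("A", "C"), "B"), "D"),
    "supertree": ((("A", "B"), "C"), "D"),
}
report = conflicts(analyses)
for pair, differing in report.items():
    print(pair, len(differing))
```

An empty set means two analyses agree on every clade; a non-empty set points to the specific groupings whose support should be investigated further.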

Page 31

Phylogenomics???

1.  Using genome-scale data to resolve phylogenetic relationships

2.  Genome-scale comparisons placed in a phylogenetic context

Page 32

Phylogenetic Analysis and Molecular Dating of Hundreds of Gene Families Implicates Genome Duplications Associated with Origin of Angiosperms and Seed Plants

Jiao et al. 2011

Page 33

Thank You!