the eukaryotic tree of life: endosymbiosis takes its tol · cells, a process known as...

8
Opinion Cel The eukaryotic tree of life: endosymbiosis takes its TOL Christopher E. Lane and John M. Archibald The Canadian Institute for Advanced Research, Integrated Microbial Biodiversity Program, Department of Biochemistry and Molecular Biology, Dalhousie University, 5850 College Street, Halifax, NS B3H 1X5, Canada Resolving the structure of the eukaryotic tree of life remains one of the most important and challenging tasks facing biologists. The notion of six eukaryotic 'supergroups' has recently gained some acceptance, and several papers in 2007 suggest that resolution of higher taxonomic levels is possible. However, in organ isms that acquired photosynthesis via secondary (i.e. eukaryote-eukaryote) endosymbiosis, the host nuclear genome is a mosaic of genes derived from two (or more) nuclei, a fact that is often overlooked in studies attempt ing to reconstruct the deep evolutionary history of eukaryotes. Accurate identification of gene transfers and replacements involving eukaryotic donor and reci pient genomes represents a potentially formidable chal lenge for the phylogenomics community as more protist genomes are sequenced and concatenated data sets grow. Evolution of the eukaryotic tree of life A well-resolved and accurate phylogenetic reconstruction of life on Earth is the ultimate goal of systematics. Although the possibility of ever realizing this goal is a point of considerable debate in prokaryotes (see the review in this issue by Mclnerney et ál. [1]), it might still be achievable for eukaryotes. A robustly supported eukaryotic tree of life (TOL) would provide an important foundation for research targeted at understanding the evolution of a myriad of traits in this domain, including adaptation to anaerobic environments and the origins of multicellularity and photosynthesis. Over the last 50 years, significant advances in taxon sampling and methodology have increased our understanding of eukaryotic evolutionary relationships and dramatically altered our view of high-level diversity. This is especially true of unicellular eukaryotes (protists), which are now recognized as being scattered throughout the tree rather than mostly belonging to a single clade (e.g. Refs [2,3]). Whereas early phylogenetic analyses relied primarily on the information contained within a single gene (i.e. that encoding 18S rRNA), and were thus limited in terms of their resolution (and in some cases positively misleading [4] ), the genomics revolution has brought an ever-increasing amount of data to bear on the question of protist phylogeny and, more generally, the ancient divergences among eukaryotic lineages. Research in the last decade has led to the hypothesis of six eukaryotic ‘supergroups’ (Figure 1), erected on the basis Corresponding author: Lane, C.E. ([email protected]). of an eclectic mix of morphological and molecular sequence data [5,6]. The strength of the evidence supporting these superassemblages (summarized in the accompanying Glossary) has been the subject of much debate [7] and the relationships between the supergroups are largely unknown. Indeed, whether molecular data can accurately resolve relationships between taxa that diverged ca. one billion years ago is unclear. In photosynthetic eukaryotes, the problem of ancient divergence times is confounded by Glossary Alveolata: composed of the apicomplexans, dinoflagellate algae and ciliates (e.g. Tetrahymena), a trio whose shared common ancestry is very well supported in nuclear gene phylogenies. Amoebozoa: composed of lineages of unicellular organisms, generally well supported in molecular phylogenies; without clear-cut morphological apo- morphies, although most lineages produce pseudopodia that are broad (lobose). Archaeplastida: includes all primary plastid-containing lineages, and is well supported in both plastid and phylogenomic-scale nuclear gene phylogenies. Molecular evidence for the single origin of red algal, green algal and glaucophyte plastids is supported by the structure of plastid genomes and the light-harvesting complex. Chroma I veo lata: predominantly unicellular collection of photosynthetic and non photosynthetic organisms united by the 'chromalveolate hypothesis' (Box 2), which states that the plastids of chromists and alveolates are the product of a single secondary endosymbiosis in the common ancestor of the two groups. Support for this group is based largely on plastid-related characters between subsets of its component lineages, with no single character or phylogeny that has been shown to unite all of its hypothesized members. Chromists: include cryptophytes and haptophytes, two predominantly photo synthetic algal groups, and the stramenopiles, a group of unicellular (e.g. diatoms) and multicellular (e.g. kelp) algal species together with a diverse array of nonphotosynthetic free-living and parasitic lineages. Endosymbiotic gene replacement (EGR): a specific case of EGT, where an endosymbiont gene is transferred to the host nucleus and replaces the function of a nu cl ear-encoded gene. Endosymbiotic gene transfer (EGT): genes transferred from the genome of an endosymbiont to the nucleus of the host, where they can be degraded, assume novel functions, replace host genes (EGR) or acquire a targeting signal(s) so that their protein products are directed back to the endosymbiont compart ment. Excavata: encompass unicellular eukaryotes that share a distinctive ventral feeding groove and an array of cytoskeletal features (and their relatives as determined by molecular means). The common ancestry of the group as a whole is, at best, weakly supported by published molecular data. Expressed sequence tag (EST): a single sequencing read of a cloned mRNA- derived cDNA, isolated from an organism. Multiple reads can be assembled to produce the entire sequence of a mature transcript, which can then be used in place of the genomic copy of the sequence in phylogenetic analyses. Opisthokonta: includes animals, fungi and their unicellular relatives (such as Capsaspora and choanoflagellatea), which share (or are derived from organisms with) a single posterior flagellum and are strongly supported by sequence data, including a unique shared insertion in the genes encoding Elongation Factor 1-a and enolase. Rhizaria: a group united only by molecular phylogenies. Members include ecologically important and abundant organisms, such as foraminiferans and cercozoans, which are largely understudied. 268 0169-5347/$ - see front matter © 2008 Elsevier Ltd. All rights reserved. doi:10.1016/j.tree.2008.02.004 Available online 2 April 2008

Upload: others

Post on 17-Aug-2020

4 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: The eukaryotic tree of life: endosymbiosis takes its TOL · cells, a process known as ‘secondary’ endosymbiosis [9,10]. In the case of secondary endosymbiosis, the plastid acts

Opinion Cel

The eukaryotic tree of life: endosymbiosis takes its TOLChristopher E. Lane and John M. ArchibaldThe Canadian Institu te fo r Advanced Research, Integrated M icrobia l B iod ivers ity Program , D epartm ent o f B iochem istry and M olecu lar B io logy, Dalhousie U n iversity, 5850 College Street, Halifax, NS B3H 1X5, Canada

Resolving the structure of the eukaryotic tree of life remains one of the most important and challenging tasks facing biologists. The notion of six eukaryotic 'supergroups' has recently gained some acceptance, and several papers in 2007 suggest that resolution of higher taxonomic levels is possible. However, in organ­isms that acquired photosynthesis via secondary (i.e. eukaryote-eukaryote) endosymbiosis, the host nuclear genome is a mosaic of genes derived from two (or more) nuclei, a fact that is often overlooked in studies attempt­ing to reconstruct the deep evolutionary history of eukaryotes. Accurate identification of gene transfers and replacements involving eukaryotic donor and reci­pient genomes represents a potentially formidable chal­lenge for the phylogenomics community as more protist genomes are sequenced and concatenated data sets grow.

Evolution of the eukaryotic tree of lifeA well-resolved and accurate phylogenetic reconstruction of life on E arth is the ultim ate goal of systematics. Although the possibility of ever realizing this goal is a point of considerable debate in prokaryotes (see the review in this issue by Mclnerney et ál. [1]), it m ight still be achievable for eukaryotes. A robustly supported eukaryotic tree of life (TOL) would provide an im portant foundation for research targeted a t understanding the evolution of a myriad of traits in this domain, including adaptation to anaerobic environments and the origins of multicellularity and photosynthesis. Over the last 50 years, significant advances in taxon sampling and methodology have increased our understanding of eukaryotic evolutionary relationships and dramatically altered our view of high-level diversity. This is especially true of unicellular eukaryotes (protists), which are now recognized as being scattered throughout the tree ra ther than mostly belonging to a single clade (e.g. Refs [2,3]). Whereas early phylogenetic analyses relied primarily on the information contained within a single gene (i.e. th a t encoding 18S rRNA), and were thus limited in term s of their resolution (and in some cases positively misleading [4] ), the genomics revolution has brought an ever-increasing amount of data to bear on the question of protist phylogeny and, more generally, the ancient divergences among eukaryotic lineages.

Research in the last decade has led to the hypothesis of six eukaryotic ‘supergroups’ (Figure 1), erected on the basis

Corresponding author: Lane, C.E. ([email protected]).

of an eclectic mix of morphological and molecular sequence data [5,6]. The strength of the evidence supporting these superassemblages (summarized in the accompanying Glossary) has been the subject of much debate [7] and the relationships between the supergroups are largely unknown. Indeed, whether molecular data can accurately resolve relationships between taxa th a t diverged ca. one billion years ago is unclear. In photosynthetic eukaryotes, the problem of ancient divergence times is confounded by

GlossaryAlveolata: com posed of the apicom plexans, dinoflagellate algae and ciliates (e.g. Tetrahymena), a trio w hose shared com m on ancestry is very well supported in nuclear gene phylogenies.Amoebozoa: com posed of lineages of unicellular organism s, generally well supported in m olecular phylogenies; w ithout clear-cut morphological apo- m orphies, although m ost lineages produce pseudopodia that a re broad (lobose).Archaeplastida: includes all prim ary plastid-containing lineages, and is well supported in both plastid and phylogenom ic-scale nuclear gene phylogenies. M olecular evidence for th e single origin of red algal, green algal and glaucophyte plastids is supported by the structure of plastid genom es and th e light-harvesting complex.Chroma I veo lata: predom inantly unicellular collection of photosynthetic and non photosynthetic organism s united by th e 'chrom alveolate hypothesis' (Box 2), which states that th e plastids o f chrom ists and alveolates are th e product of a single secondary endosym biosis in the com m on ancestor of the tw o groups. Support for this group is based largely on plastid-related characters between subse ts of its com ponent lineages, with no single character o r phylogeny that has been show n to unite all of its hypothesized m em bers.Chromists: include cryptophytes and haptophytes, tw o predom inantly photo­synthetic algal groups, and the stram enopiles, a group of unicellular (e.g. d iatom s) and multicellular (e.g. kelp) algal species toge ther with a diverse array of nonphotosynthetic free-living and parasitic lineages.Endosym biotic gene replacem ent (EGR): a specific case of EGT, w here an endosym biont gene is transferred to th e host nucleus and replaces th e function of a nu cl ear-encoded gene.Endosym biotic gene transfer (EGT): genes transferred from th e genom e of an endosym biont to the nucleus of th e host, w here they can be degraded, assum e novel functions, replace host genes (EGR) o r acquire a targeting signal(s) so th a t their protein products are directed back to the endosym biont com part­ment.Excavata: encom pass unicellular eukaryotes th a t share a distinctive ventral feeding groove and an array of cytoskeletal features (and their relatives as determ ined by m olecular means). The com m on ancestry of the group as a w hole is, a t best, weakly supported by published m olecular data.Expressed sequence tag (EST): a single sequencing read of a cloned mRNA- derived cDNA, isolated from an organism . Multiple reads can be assem bled to produce th e entire sequence of a m ature transcript, which can then be used in place of th e genom ic copy of th e sequence in phylogenetic analyses. Opisthokonta: includes anim als, fungi and their unicellular relatives (such as Capsaspora and choanoflagellatea), w hich sh a re (or a re derived from organism s with) a single posterior flagellum and are strongly supported by sequence data, including a unique shared insertion in th e genes encoding Elongation Factor 1-a and enolase.Rhizaria: a group united only by m olecular phylogenies. M em bers include ecologically im portant and abundant organism s, such as foram iniferans and cercozoans, which a re largely understudied.

268 0169-5347/$ - see front matter © 2008 Elsevier Ltd. All rights reserved. doi:10.1016/j.tree.2008.02.004 Available online 2 April 2008

Page 2: The eukaryotic tree of life: endosymbiosis takes its TOL · cells, a process known as ‘secondary’ endosymbiosis [9,10]. In the case of secondary endosymbiosis, the plastid acts

Opinion

the fact th a t the nuclear genome is a mosaic of genes with different evolutionary histories. The cyanobacterial pro­genitor of the plastid (chloroplast) in modern-day plants and algae donated thousands of prokaryotic genes to its eukaryotic host during the initial transition from endo­symbiont to organelle [8], and plastids subsequently spread laterally by endosymbioses between two eukaryotic cells, a process known as ‘secondary’ endosymbiosis [9,10]. In the case of secondary endosymbiosis, the plastid acts as a genetic Trojan horse, bringing with it the nucleus of an unrelated eukaryotic endosymbiont whose genes meld with - and can replace - their counterparts in the host nuclear genome. The mixing and matching of eukaryotic genes th a t occurs in the context of secondary endosymbio­sis seriously challenges our ability to accurately infer the evolutionary history of these organisms. At the same time, a comprehensive understanding of the impact of endosym­biosis on the structure and content of nuclear genomes has the potential to assist our efforts to resolve the structure of the eukaryotic tree of life.

The imprint of endosymbiosisThe process of endosymbiosis has been responsible for some of the most significant events in eukaryotic evolution. The most celebrated examples are the endosymbioses th a t gave rise to plastids and mitochondria. In the case of plastids, a wealth of biochemical and molecular data indicate th a t a prokaryotic relative of modern-day cyano­bacteria was engulfed and retained by a heterotrophic eukaryote [11] and transform ed into the photosynthetic organelle seen in the plants and algae of today (Box 1). Integration of a prokaryotic endosymbiont into the cellular machinery of a eukaryote is a complex process and involves substantial modifications to the genetic makeup of both cells [10]. All known organelles of endosymbiotic origin encode only a fraction of the genes present in their prokar­yotic antecedents, meaning th a t scores of genes th a t were once essential to the free-living prokaryote, bu t obsolete in the context of intracellular life (e.g. genes for locomotion or certain metabolic pathways), are lost, either by transfer to the host nucleus or by genomic degradation [12]. Even the most gene-rich plastid genomes contain at most ~250 genes - an order of magnitude less than most free-living cyanobacteria [13]. As the genetic capacity of the prokar­yotic endosymbiont is reduced during the transition from free-living cell to fully integrated organelle, the host cell becomes a repository of genetic information by way of endosymbiotic gene transfer (EGT) [8]. Many of the genes transferred to the nucleus acquire targeting signals, which allow their products to be shuttled back to the plastid to perform vital functions [10]. However, transferred genes can also assume novel functions in the eukaryotic cell, sometimes even replacing eukaryotic versions of the proteins they encode.

Comparative genomic studies of each of the three lineages of ‘prim ary’ plastid-containing eukaryotes (Box 1) have revealed ju s t how significant endosymbiont-to- nucleus gene transfer is. A 2002 study of the nuclear genome of the land plant Arabidopsis thaliana suggested th a t an unexpectedly large fraction of its genes (1700/ 9368 = ~-18%) are derived from the cyanobacterial

Trends in Ecology and Evolution Vol.23 No.5

Box 1. The birth and spread of eukaryotic photosynthesis

Plastids (ch loroplasts) are th e light-harvesting organel les of p h o to ­synthe tic eukaryo tes an d are derived f rom once free-living c y an o ­bacteria by th e p rocess of e n dosym bios is . It Is generally accepted that plastids evolved f rom cyanobac te ria only once during the history of life, In a c o m m o n a n c e s to r sh a re d by green a lgae (and plants), red a lgae and g laucophy te a lgae (reviewed by Ref. [36]). There Is, never theless , d e b a te as to w h e n this occurred (e.g. Refs [46,47]), an d s o m e au th o rs (e.g. Refs [48-50]) believe th a t th e notion of mult iple pr imary en d o s y m b io t ic even ts sh o u ld not be d is­counted .

S u b s e q u e n t to th e diversif ication of th e th re e 'p r im ary ' plastid- conta in ing lineages, plastids sp re a d laterally ac ross th e t ree of eukaryo tes via th e p rocess of ' s eco n d a ry ' e n dosym bios is , tha t Is, th e engu l fm en t and retention of a eukaryotic pho to troph by an unre la ted nonph o to sy n th e t lc eukaryotic host. The n u m b e r of t im es this has occurred Is a topic of cons iderab le deba te , but m ost r e co g n ized h y p o t h e s e s p r o p o s e t h r e e o r m o re e v e n ts . Two unre la ted l ineages, th e eug len lds (Excavata) and chlorarachnlo- phy tes (Rhizaria), contain plastids of g reen algal ances try [21], w h e re a s plastids In th e s u p e rg ro u p C hrom alveola ta are, with the exception of s o m e dlnoflagella tes (below), d e s c e n d e d f rom a red alga [51], The morpholog ica l ev idence for th e eukaryotic origin of plastids In th e s e th re e g ro u p s c o m e s f rom th e p re sen c e of th re e or fo u r m e m b r a n e s su r ro u n d in g s e c o n d a ry p las t ids (versus tw o m e m b ra n e s a ro u n d primary plastids) and th e re m n a n t plastld- assoc la ted nuclei (nuc leom orphs) tha t exist In ch lo ra rachn lophy tes and c ryp tophy tes ( reviewed In Ref. [52]). O rg an ism s harboring s e co n d a ry plastids a re th u s th e biological equ ivalen t of nes ted Russian dolls - a cyanobac te r ium en c a s e d within a eukaryote, e n v e lo p e d w ith in a s e c o n d e u k a ry o te (Figure 2). To fu r th e r com plica te m atters , s o m e dlnoflagella te a lgae have replaced their plastid with a 'tertiary ' plastid, s to len f rom o the r ch rom alveo la tes Including c ryp tophytes , d ia to m s and hap tophy tes , and even a green algal plastid In a ca se of serial se c o n d a ry en d o s y m b io s is (see Refs [37,53] an d refe rences therein). Therefore, a l though all plastids probab ly t race back to a s ingle pr imary e n dosym bio t ic event, they have been p ro p ag a ted th ro u g h o u t eukaryotic diversi ty multiple t im es by a s imilar process.

progenitor of the plastid [8]. More recent studies using different approaches have produced lower estim ates of EGT in A. thaliana [14,15] and proportionately fewer genes of cyanobacterial origin appear to reside in the nuclear genome of the green alga Chlamydomonas reinhardtii (6%)[16], the red alga Cyanidioschyzon merolae (12.7%) [14] and the glaucophyte alga Cyanophora paradoxa (10.8%)[17]. Regardless, it is clear th a t endosymbiont-derived genes have contributed substantially to the host cell nucleus. In the case of A. thaliana, more than half of the genes of putative cyanobacterial origin are predicted to perform functions unrelated to the plastid [8], indicating th a t these genes have either taken on new functions or replaced nuclear genes (termed endosymbiotic gene repla­cement; EGR). A specific example of EGR is phosphogly- cerate kinase (PGK) [18], which occurs in two copies within land plants, with a plastid-encoded homolog functioning in the Calvin cycle and a nucleus-encoded copy involved in glycolysis in the cytosol. Phylogenetic analysis shows th a t both are derived from cyanobacteria, indicating th a t the plastid copy was transferred to the nucleus and duplicated before replacing the resident eukaryotic gene encoding PGK in the nucleus [18]. Genes transferred from the plastid genome to the nucleus are, for the most part, easily identified because of their phylogenetic affinity to

269

Page 3: The eukaryotic tree of life: endosymbiosis takes its TOL · cells, a process known as ‘secondary’ endosymbiosis [9,10]. In the case of secondary endosymbiosis, the plastid acts

Opinion Trends in Ecology and Evolution Vol.23 No.5

Ciliates

I Dinoflagellates

Stramenopiles

Land plants

Red a

Glaucophytes Haptophytes Cryptophytes ApicomplexansGreen algae

Chroma veolataArchaeplastida

Animals Foramimferans Chlorarachn lophytes

Radiolarians

f:H IDiplomonads

) *Pelobionts

Amoeba proteus Andalucía Carpediemonas Trimastix incarcerata

Euglenids

TRENDS in Ecology & Evolution

Figure 1. The six hypothesized supergroups of eukaryotes. There is currently no consensus regarding th e root o f eukaryotes or th e branching order between th ese groups, except tha t recent studies indicate a sister relationship between Am oebozoa and Opisthokonta. Dashed lines a t the base of Chrom alveolata and Excavata indicate uncertainty regarding th e m onophyly of th ese assem blages. Im ages show representatives of som e of the m ajor lineages within each supergroup. Images are credited to the following: Chromalveolata: haptophyte and cryptophytes by th e Archibald laboratory; stram enopile by G.W. Saunders; apicom plexan by B.S. Leander; ciliate and dinoflagellate from Visuals Unlimited (VU). Archaeplastida: red alga by C. Bates (http://www.coastalim ageworks.com); green algae by the Archibald laboratory and from VU; land plant by J. Palmer; glaucophyte by D. Patterson, provided with perm ission by http://m icroscope.m bl.edu. Rhizaria: radiolarian and foraminiferan from VU; chlorarachniophyte by th e Archibald laboratory. Excavata: eu g I en i d by D. Patterson, provided with perm ission by http://m icroscope.m bl.edu; rem aining excavates by A.G.B. Sim pson. Amoebozoa: Am oeba proteus from VU; pelobionts from A.G.B. Sim pson. Opisthokonta: Animals from W. F. Doolittle, C. Bates (http:// w w w .coastalim agew orks.com ) and VU; fungus by G. Burger.

cyanobacteria. However, in eukaryotes th a t have acquired photosynthesis secondarily through the engulfment of a prim ary plastid-containing alga, the impact of EGT is much more subtle bu t no less significant.

The impact of secondary endosymbiosis on genome evolutionThree of the six eukaryotic supergroups (Figure 1), Rhi­zaria, Excavata and Chromalveolata (see Glossary), in­clude organisms with secondary plastids (Box 1). W ithin the Rhizaria and Excavata, chlorarachniophytes and euglenids are the sole photosynthetic lineages, respect­ively, and both contain green algal secondary plastids [19,20]. As described in Box 1, secondary plastids have evolved on multiple occasions from both red and green

algal endosymbionts. W hereas chlorarachniophytes and euglenids acquired plastids independent of one another and are members of predominantly nonphotosynthetic supergroups [21,22], most plastid-bearing members of the contentious supergroup Chromalveolata possess red algal-derived secondary plastids, which has been taken as a sign of monophyly of the group (the ‘chromalveolate hypothesis’).

Unlike prim ary endosymbiosis, secondary endosymbio­sis involves the genetic integration of two eukaryotes. In this case, the host nucleus is bombarded with genes not only from the plastid genome but also from the nucleus of the endosymbiont, which in tu rn already contains cyano- bacterial-derived genes th a t were previously transferred from the prim ary plastid (Figure 2). Indeed, clear cases

270

Page 4: The eukaryotic tree of life: endosymbiosis takes its TOL · cells, a process known as ‘secondary’ endosymbiosis [9,10]. In the case of secondary endosymbiosis, the plastid acts

Opinion Trends in Ecology and Evolution Vol.23 No.5

Plastid

TRENDS in Ecology & Evolution

Figure 2. Endosym biosis and the flow of genetic information in photosynthetic eukaryotes. A generic secondary plastid-containing alga with arrow s indicating the m ovem ent of genes from the plastid genom e to th e endosym biont nuclear genom e, from th e plastid genom e to th e nuclear genom e of the host cell (H) and from th e plastid and endosym biont nuclear (E) genom es to the host nuclear genom e. The result of this gene transfer is th a t th e nuclear genom e of the host cell is a m osaic of cyanobacterial (red), nucleom orph (blue) and host (gray) DNA. The protein products of genes derived from th ese th ree genom es (colored balls; PL = plastid; EN = endosym biont; HN = host nucleus) have th e potential to take on roles in various cellular com partm ents, including th e cytosol (CY), mitochondrion (MT) and secretory pathw ay (SP) of th e host cell, as well as th e plastid and endosym biont cytosol (eCY). M ovement of genes associated with the m itochondria of th e endosym biont and host cell has been om itted for clarity, as have genes derived by lateral gene transfer (see main text).

of EGR involving nuclear genes for plastid-targeted proteins have provided some of the most compelling evi­dence in support of the chromalveolate hypothesis [23,24]. Genome-scale analyses of secondary plastid-containing organisms have also begun to reveal the extent of EGT. For example, expressed sequence tag (EST) data sets from the haptophyte Emiliana huxleyi and the dinoflagellate Karenia brevis have revealed 19 and 17 putative cases of EGT, respectively [25,26]. The nuclear genome sequence of the diatom Thalassiosira pseudonana [27] encodes 1057 genes (of 11 242 = 9.4%) th a t show sequence homology to only plants and algae and 271 genes with only cyanobac­terial homologs. Additionally, an analysis of ESTs from Euglena gracilis (a photosynthetic excavate; Figure 1) found th a t 22% of 259 globally distributed genes showed phylogenetic affinity to prim ary plastid-containing lineages [28], suggesting th a t these genes represent cases of eukaryote-eukaryote EGT and, possibly, EGR. Instances of gene transfer can also point to a photosyn­thetic ancestry for nonphotosynthetic organisms, such as the parasitic stramenopile Phytophthora ramorum (the causative agent of the Irish potato blight), which does not contain a plastid. Strikingly, the identification of 855 genes of putative red algal or cyanobacterial ancestry in the P. ramorum nuclear genome [29] dem onstrates th a t transferred genes can take on many functions in their new genomic context, unrelated to photosynthetic processes.

Therefore, with respect to broad-scale eukaryotic phylogenetics, the question is, how do we detect eukar- yote-to-eukaryote EGT and EGR events and exclude them from our analyses so th a t only the genes th a t accurately reflect the history of the host cell lineage are included? Genes of prokaryotic origin are readily identified, assum ­ing the time since their transfer has not erased the phy­logenetic signal linking them to other prokaryotes. However, in cases of endosymbiont-to-host nuclear tran s­fer or replacement, the eukaryotic origin of the foreign DNA makes its detection significantly more difficult. Unless the transfer is recent and the evolutionary position of the host nucleus in known, cases of eukaryote-to-eukar- yote EGT or EGR are unlikely to be obvious. Additionally, a eukaryotic gene is presumably more likely to be expressed and to function if transferred to the host nucleus than is a prokaryotic gene, suggesting th a t the rate of successful eukaryotic EGTs and EGRs could be higher than those involving mitochondrion- or plastid-derived genes. If the num ber of ‘foreign’ eukaryotic genes in the genomes of secondary plastid-containing (or formerly containing) organisms is significant, but difficult to detect, then the evolutionary history of the host organism would be very difficult to resolve, because of the presence of genes with two (or more) evolutionary histories.

To make m atters worse, detection of eukaryotic EGTs and EGRs should be even more difficult, owing to lineage- specific evolutionary ra tes or gene loss, when taxon sam pling for a group of in terest is low and the hypotheses being tested involve deep divergences. Eukaryote-wide phylogenies often include only a small num ber of repre­sentative taxa for lineages such as Rhizaria or some of the chromalveolate phyla. In addition, the taxa used to represen t m ajor groups in phylogenomic analyses are often highly reduced or simplified organisms, whose gen­ome m ight not be an appropriate proxy for the m ajority of the lineage. This is both an artifact of targeting organisms th a t im pact hum ans (e.g. pathogens) and the need to choose small genomes to make projects manageable. Although this situation is improving, and EST data can often be used in lieu of complete genomes, there is cur­rently an extrem ely lim ited set of organisms available for comparison. In instances where the secondary plastid is of red algal origin, whole-genome comparisons m ust be made to the only red algal nuclear genome th a t has been com­pletely sequenced, th a t of Cyanidioschyzon merolae [30]. Although a genome sequence from a second red alga is near completion (see the Galdieria sulphuraria Genome Project, http://genom ics.m su.edu/galdieria/about.htm l), both of these genomes are from highly reduced unicellular organisms, adapted to life a t high tem peratures and belonging to a single order (Cyanidiales) among the poorly understood early-diverging subphylum Cyanidophytina [31]. Of the roughly 6000 recognized species of red algae, only four are classified in this subphylum. Therefore, through the combined effects of lim ited taxon sampling, lineage-specific variation in evolutionary rates and com­positional biases in DNA and protein sequences [32], red algal genomic data currently available make the reliable detection of EGT in chromalveolate genomes a significant challenge.

271

Page 5: The eukaryotic tree of life: endosymbiosis takes its TOL · cells, a process known as ‘secondary’ endosymbiosis [9,10]. In the case of secondary endosymbiosis, the plastid acts

Opinion

Despite an overwhelming am ount of evidence demon­strating th a t endosymbiont-derived genes readily estab­lish themselves in the nuclear genome of their host, to our knowledge only a handful of studies have thoroughly investigated bona fide transfers of a eukaryotic gene from a secondary endosymbiont nucleus to a host nucleus. For example, two studies have convincingly shown th a t the cryptophyte nuclear genome possesses a gene encoding actin th a t is derived from the red algal nucleus th a t came in with the chromalveolate plastid [33,34]. In this case both the host and endosymbiont copies persist, but instances of complete EGR can often pass phylogenetic screening methods commonly used to detect anomalous genes in large data sets unless the issue of secondary EGR is specifically addressed [35].

The supergroup Chromalveolata - a playground for the evolution of photosynthesisThe supergroup Chromalveolata is composed of the ‘chro- m ists’ and Alveolata whose evolutionary origins are a subject of active debate. Chromalveolates are a diverse assemblage of mostly unicellular lineages, and the complex distribution of plastids among chromalveolate taxa has fueled the debate regarding the monophyly of these organ­isms (Box 2). Of the chromists, cryptophytes and hapto- phytes are two predominantly photosynthetic algal groups, and the stramenopiles are a group of unicellular (e.g. diatoms) and multicellular (e.g. kelp) algal species together with a diverse array of nonphotosynthetic free-living and parasitic lineages (Figure 1). Of the Alveolata, dinoflagel- lates are microalgae th a t are notorious for producing red tides, but roughly half of dinoflagellates lack plastids. The Apicomplexa (a mostly parasitic assemblage, including the causative agent of m alaria) have lost their photosynthetic capabilities, although species in such medically im portant genera as Plasmodium and Toxoplasma re ta in plastids in a highly reduced form (the apicoplast) that, on balance, appear to be of red algal origin [36]. By contrast, the ciliates are an exclusively plastid-free lineage. Although the com­mon origin of chromalveolate plastids from a single endo­symbiotic event is contentious, it is generally accepted th a t all but a few of the photosynthetic taxa th a t fall under this label contain plastids derived from a red algal endosym­biont. A handful of dinoflagellates have clearly taken on plastids from other chromalveolate lineages in tertiary endosymbiotic events (e.g. Dinophysis, Karenia, Kryptoper­idinium), and the green algal plastid oí Lepidodinium is an instance of serial secondary endosymbiosis (see Ref. [37] and references therein).

Shuffling supergroupsIn 2007, a series of papers from independent research groups began to pull apart the chromists (cryptophyte- s + haptophytes + stramenopiles) and the chromalveolate concept as a whole or, a t the very least, significantly complicate the hypothesis. Several studies [35,38,39] resolved topologies th a t break apart chromists, and instead unite stramenopiles with Alveolata with signifi­cant support. The finding is independently supported by an rpL36 gene replacement in the plastid of both cryptophytes and haptophytes, suggesting these two lineages share a

Trends in Ecology and Evolution Vol.23 No.5

Box 2. The chromalveolate controversy

Save for p las tid-associa ted fea tu res , cellular charac te rs th a t su p p o r t t h e c h ro m a lv e o la te h y p o th e s is (I.e. a m o n o p h y le t lc or igin of Chromalveolata) are lacking, as Is phy logenetic ev idence based on nuclear DNA se q u e n c e s . Molecular phy logenetic ana lyses of nuclear g e n e s tha t are c ons is ten t with Chrom alveo la ta have only Included alveola tes (ciliates, dinoflagella tes an d ap lcom plexans) and s t r a m e ­nopiles [54] an d ana lyses of plastid g enes , or nuc lear-encoded g e n e s for p las tld- targeted p rote ins have provided varying deg rees of su p p o r t for th e m ono p h y ly of ch rom is ts + dinoflagella tes [47,53], Critics of t h e c h ro m a lv e o la te h y p o th e s is point ou t th a t such a n a ly s e s on ly s h o w a c o m m o n a n c e s t ry of t h e p las tid , not necessari ly th e host l ineages In which th e y reside [55,56], Thus, a l ternative m o d e ls of plas tid t ransfe r a m o n g ch rom alveo la te taxa have been s u g g e s te d , specifically Invoking a m odel of tertia ry t ransfe r of th e hap to p h y te plas tid [55,56], This a l ternative h y po th ­es is s u g g e s t s th a t th e red algal-derived p lastids of 'ch rom alveo la tes ' are united by horizontal t ransfer , not vertical Inheri tance f rom a c o m m o n ancestor . Never theless , e ndosym bio t ic g e n e rep lacem en ts of nuc lear -encoded g e n e s for p las tld- targeted o r plastld-derlved prote ins have been described that unite pho tosyn the t ic ch ro m al­v e o la te ta x a [23,24], A l th o u g h a l te rn a te e x p la n a t io n s for th e o b se rv ed pattern of g e n e t ransfe rs have been p ro p o sed [55], th e n u m b e r of such c ases co n t inues to g ro w with m ore data.

Central to th e d e b a te of c h rom alveo la te m ono p h y ly Is th e w eigh t placed on plas tid gain v e r su s plas tid loss. The main rationale for p roposing ch rom alveo la te m ono p h y ly Is tha t It greatly reduces th e n u m b e r of se c o n d a ry e n d o s y m b io s e s requ ired to a c co u n t for presen t-day pho tosyn the t ic diversity, a position tha t has previously been a rgued based on th e p r e su m e d difficulty a s soc ia ted with th e genet ic Integration of host an d e n d o s y m b io n t [51]. If plas tid gain Is m uch m o re difficult than plas tid loss, It a p p ea r s m os t plausib le to hypothes ize a single plas tid acquisition In th e c o m m o n an ces to r of all chrom alveo la tes , and s u b s e q u e n t loss In th e ciliates, s o m e ap lcom plexans , and early-diverging m e m b e r s of m a n y c h ro m alv eo ­late l ineages (Figure 3a). An al ternative v iew Is tha t o rganel les are Inherently m ore difficult to lose th an gain [56,57], based on th e fact tha t m a n y nonph o to sy n th e t lc ap lcom plexan paras i tes retain an apicop las t (a rem n an t plas tid u se d for ce llular functions such as fat ty acid, h e m e and am ino acid b iosynthesis) an d th e fact that derivatives of th e m itochondrion have been found In every case w h e re a putatlvely am ltochondr la te o rgan ism has been tho ro u g h ly Investigated [4], Flowever, direct ev idence for plastid loss f rom th e ap lcom plexan C ryp tospo rid ium an d th e s t r am en o p l le P hytophtho ra (e.g. Refs [29,41,58]) d e m a n d s a réévaluation of th is a rgum en t . Additionally, th e description of a reduced plas tid In th e dlnoflagel- late Perkinsus [59], p r e su m e d to be h o m o lo g o u s to th e apicoplast, and th e d iscovery of C hrom era velia, a pho tosyn the t ic alveolate closely related to ap lco m p lex an s [42], s u g g e s t tha t th e c o m m o n an ces to r of dinoflagella tes and ap lco m p lex an s w a s a pho to troph , fu rther push ing back th e plastld-bearlng roots of chrom alveo la te l ineages an d Indicating s u b s e q u e n t plas tid loss.

common ancestor [40], Unexpectedly, however, in the analyses th a t included members of Rhizaria [35,39], this supergroup was resolved within chromalveolates, either as sister to stramenopiles + alveolates, to the exclusion of cryptophytes + haptophytes, or specifically sister to s tra ­menopiles (Figure 3b). However, in one instance [35], a topology including a monophyletic origin of chromalveo­lates (excluding Rhizaria) could not be rejected by the data.

If chromalveolate taxa do indeed share a common ances­tor (Box 2), the resolution of the supergroup Rhizaria within Chromalveolata not only complicates hypotheses regarding the evolution of secondary plastids (Figure 3a), it also demands th a t the ancestor of Rhizaria harbored a red algal-derived plastid. Specifically, the red algal plastid

272

Page 6: The eukaryotic tree of life: endosymbiosis takes its TOL · cells, a process known as ‘secondary’ endosymbiosis [9,10]. In the case of secondary endosymbiosis, the plastid acts

Opinion Trends in Ecology and Evolution Vol.23 No.5

Dinoflagellates

(a)Kinetoplastids /

Diplonemids

Heterolobosea

Excavata

Amoebozoa

u O °Other Excavates

Opisthokonta

Rhizaria

(b)

Kinetoplastids/diplonemids

Heterolobosea

Other Cercozoa

Other excavates

Glaucophytes

Green algae

Archaeplastida

Chlorarachniophytes

Foraminifera

Polycystines

Acantharea

Haptophytes Cryptophytes

Goniomonas

K atablepharisEarly branching stramenopiles

Derived stramenopiles Ciliates

Chromera velia Apicomplexans

Red algaeGlaucophytes

Green algae

Haptophytes

Cryptophytes

Goniomonas

Oxyrrhis Chromalveolata

Red algae

Perkinsus

Other Cercozoa

Chlorarachniophytes

Foraminifera

Polycystines

Acantharea

Early branching stramenopiles

Derived stramenopiles Ciliates

Chromera velia Apicomplexans

Dinoflagellates

Oxyrrhis

• Red algal plastid o Green algal plastid • Glaucophyte plastid o Plastid loss? ® Reduced plastid or partial lossin lineage

TRENDS in Ecology & Evolution

Figure 3. Two hypotheses to explain th e distribution of secondary plastids, based on com peting scenarios of eukaryotic evolution. A green algal-derived secondary plastid has been acquired by tw o separate lineages, in independent endosym biotic events (thin dashed lines), (a) A single red algal endosym biosis occurred in the com m on ancestor of Chromalveolata, necessitating multiple plastid losses a t th e b ase of th e various nonphotosynthetic lineages, (b) If Rhizaria evolved from within chrom alveolates, it is m ost parsim onious to a ssum e that th e red algal secondary plastid w as lost before th e diversification of this lineage. A green algal secondary plastid has been acquired by chlorarachniophytes m ore recently.

would have to have been acquired before the split between the haptophyte + cryptophyte clade from alveolates + - stramenopiles + Rhizaria (Figure 3b). P lastids would then presumably have been lost, independently, in Rhizaria, some stramenopiles, ciliates, early diverging dinoflagel­lates (e.g. Oxyrrhis) and m any or most apicomplexans [39]. Finally, the subsequent uptake of a green algal endo­symbiont in the ancestor of chlorarachniophytes would produce the distribution of plastids observed today (Figure 3b). Like the original chromalveolate hypothesis (Box 2), this scenario would require th a t plastid loss be far more common than gain. Although the prevalence of plas­tid loss (as opposed to loss of photosynthesis) among eukaryotes is unknown, the nuclear genomes of two Phy­tophthora species [29] (stramenopiles) and the apicom- plexan Cryptosporidium [41] encode plastid-derived genes, despite these organisms lacking plastids, an indica­tion of a t least two instances of plastid loss in the ancestors of these different organisms. Additionally, the recently discovered photosynthetic eukaryote Chômera velia [42], which is closely related to apicomplexans, strongly indicates a shared photosynthetic ancestor of both Apicom­plexa and dinoflagellates and subsequent loss in the plas- tid-lacking members of these groups.

If the new position of Rhizaria as a part of Chromalveo­lata reflects the true evolutionary history of this lineage, one would predict th a t genes of red algal ancestry might persist in the nuclear genomes of this group as rem nants of the red algal genomes th a t were present in the rhizarian

common ancestor. Interestingly, red algal-derived plastid genes were discovered in the nuclear genome of the green algal plastid-containing rhizarian Bigelowiella natans [43], and were interpreted as having been acquired by lateral gene transfer ra ther than vertically inherited from a red algal plastid-containing ancestor. A complete genome sequence for B. natans will soon be available (http:// www.jgi.doe.gov) and will make it possible to test w hether or not this red algal ‘footprint’ is (at least in part) the result of ancient endosymbiotic gene transfer. However, most Rhizaria are recalcitrant to laboratory experimentation, and significant amounts of sequence data from diverse members of this lineage will be slow in coming. At any rate, if analyses eventually show th a t two (or more) distinct plastids were harbored by the ancestors of extant organ­isms, as has been previously shown in some dinoflagellates (see Ref. [37]), then determ ining the organismal history of such eukaryotes might be even more difficult than cur­rently appreciated.

Phylogenetic hope in light of EGT?Although we have focused on chromalveolates and ignored the potentially significant role of lateral gene transfer in eukaryotic evolution (e.g. Ref. [44]), the reality of EGT and its phylogenetic implications can be extended to m any of the eukaryotic supergroups. The relationships within and between chromalveolate and rhizarian taxa are not only im portant for understanding a major component of the tree of life but also for understanding organelle evolution and

273

Page 7: The eukaryotic tree of life: endosymbiosis takes its TOL · cells, a process known as ‘secondary’ endosymbiosis [9,10]. In the case of secondary endosymbiosis, the plastid acts

Opinion Trends in Ecology and Evolution Vol.23 No.5

the potential complications of EGT and EGR. On the surface, it would appear th a t taxonomic resolution at the deepest levels of eukaryotic divergence is possible if enough data are used. However, as biologists attem pt to resolve deeper relationships within and between eukaryo­tic supergroups, phylogenetic artifacts are confounded by the significant divergence times between the lineages under study. Cases of gene transfers and replacements additionally complicate m atters, and although phyloge- nomic data sets are routinely screened for genes with prokaryotic affinities, the enormous potential for eukar­yotic EGRs in lineages th a t harbor (or have harbored) secondary endosymbionts is usually overlooked.

Recent methodological advances have made it possible to system atically and efficiently examine the phylogenetic signal of separate genes in a genome or in multigene data sets (Box 3). For example, the program Concaterpillar [45] separates genes in a multilocus da ta set based on pairwise comparisons of congruence (Box 3 ), potentially identifying instances of EGT and EGR. In a te s t case, when applied to a eukaryote-wide 60-gene data set of conserved tran s la ­tional proteins, Concaterpillar identified three d ata sets of 35,15 and 10 genes, w ith the 15-gene da ta set recovering a sister relationship between stram enopiles and the red alga Porphyra, as would be predicted in the case of EGR [45]. If this pa tte rn holds true on a larger scale, then the

Box 3. Emerging EGT and EGR detection methods

Until recently, th e only w a y to detec t e ndosym bio t ic g e n e transfers (EGTs) and rep lacem en ts (EGRs) w a s to make Individual g e n e t rees for every g e n e of Interest and m anual ly c o m p a re the ir topo logies . With m odern data se ts often exceeding 100 protein s e q u e n c e s per taxon , a r ra n g ed en d to en d (concatenated) as o n e large se q u en ce , th is p rocess can be ex trem ely t im e c o n s u m in g an d laborious. For th is reason , r esea rchers have often had to rely on th e a s su m p t io n th a t th e ' t rue ' signal will overw helm an y no ise In th e da ta cau sed by th e occasional protein with a d isco rdan t phy logene t ic history. Fortunate ly, n e w m e th o d s have b egun to e m e rg e In r e sp o n se to Increasing recognition tha t EGT an d EGR are Im portan t and that th e re Is a critical need to detect and u n d e rs ta n d conflict ing s igna ls In data sets.

PhyloSort [16] Is an a u to m a te d m e th o d for Investigating g e n o m e - scale data se ts on a g ene-by -gene basis for a n o m a lo u s phy logenetic pa t te rns . The p ro g ram can be u se d to sc ree n th e o u tp u t of p hy logenom lcs 'p ipe lines ' such as PhyloGenle [60] for g e n e s tha t s h o w a phy logenetic pattern co n g ru en t with a use r-def ined m ono- phylet lc group. By search ing a collection of s ing le -gene Input t rees for a relat ionship of Interest , u se rs can rapidly focus on only th o se g e n e s th a t su p p o r t a particular phy logenetic hypothesis .

The p rog ram Concaterpillar Is a likelihood rat io-based m e thod des ig n ed to tes t for c o n g ru en ce a m o n g loci In c o n ca ten a te d data s e ts using hierarchical c lus tering [45], Concaterpil lar perfo rm s pairwise likelihood ratio c o m p a r iso n s for all loci In an Iterative fash ion , com bin ing th e p rote ins with th e lowest likelihood ratio at each s tep and cons idering th e m as a single locus for s u b s e q u e n t s teps . In this way, th e p ro g ram builds da ta se ts of pro te ins based on th e similar ity of the ir phy logenetic signal until all loci have been con s id e red or a com par ison exceeds th e user-def ined likelihood ratio a level. T he resu lt ing da ta s e t s can th e n be ana lyzed Independen t ly , giving th e u se r th e potential to t e a s e apart th e var ious com pe t ing phy logenetic s igna ls In th e data , ra ther than settl ing for a t ree based on a mix of conflict ing Information. In sum , phy logenetic too ls such as Concaterpillar and PhyloSort a re paving th e w a y for r esea rchers to sys tem atica lly a d d re ss q u es t io n s of EGT/ EGR using different a p p ro a c h e s and ty p es of data.

im pact of EGT and EGR in lineages w ith photosynthetic ancestry m ight be far g reater th an currently appreciated, and empirical studies will be essential to determ ine the extent to which foreign genes have invaded the nuclear genomes of these organisms. Awareness of the extent of this phenomenon, however, will stim ulate researchers to account for it.

Additionally, the constraints of taxon sampling, which have ham pered in-depth exploration of EGT, are becoming less of a problem as new data come online. More red algal genome sequences will soon provide the data to better explore chromalveolate genomes for transferred genes, and a sharp increase in the num ber of chromalveolate nuclear sequences is making pairwise genomic compari­sons realistic. Ironically, although their potential to intro­duce noise in phylogenetic studies is enormous, instances of EGR and EGT can be used as derived characters in support of common ancestry between two lineages (e.g. Refs [23,24,40]). I t is imperative th a t the phylogenetics community continues to improve methods for the detection of foreign genes residing in nuclear genomes if resolution of ancient eukaryotic evolutionary divergences is to be achieved.

AcknowledgementsWe thank A. Simpson, E. Kim and G. Sanchez Perez for helpful comments on this m anuscript and the useful input of four anonymous reviewers. We apologize to those whose im portant work could not be cited owing to space constraints. Research in the Archibald laboratory is supported by the Canadian Institu tes of H ealth Research, the N atural Sciences and E ngineering Research Council of C anada, the Dalhousie Medical R esearch Foundation and th e C anad ian In s titu te for Advanced Research (CIFAR). J.M.A. is a Scholar of the CIFAR Program in Integrated Microbial Biodiversity.

References1 Mclnemey, J.O. et al. (2008) The prokaryotic tree of life: past,

p resen t.. .and future? Trends Ecol. Evol. 23, 276-2812 Margulis, L. and Schwartz, K. (1988) Five Kingdoms: A n Illustrated

Guide to the Phyla o f Life on Earth, W.H. Freem an3 W hittaker, R.H. (1969) New concepts of kingdoms of organisms.

Science 163, 150—1604 Embley, T.M. and M artin, W. (2006) Eukaryotic evolution, changes and

challenges. Nature 440, 623-6305 Simpson, A.G. and Roger, A. J . (2004) The real ‘kingdoms’ of eukaryotes.

Curr. Biol. 14, R693—R6966 Keeling, P. J . et al. (2005) The tree of eukaryotes. Trends Ecol. Evol. 20,

670-6767 W egener Parfrey, L. et al. (2006) Evaluating support for the current

classification of eukaryotic diversity. PLoS Genet. 2, e2208 M artin, W. et al. (2002) Evolutionary analysis of Arabidopsis,

cyanobacterial, and chloroplast genomes reveals plastid phylogeny and thousands of cyanobacterial genes in the nucleus. Proc. Natl. Acad. Sei. V. S. A. 99, 12246-12251

9 Archibald, A.M. and Keeling, P .J. (2002) Recycled plastids: a ‘green movement’ in eukaryotic evolution. Trends Genet. 18, 577-584

10 Bhattacharya, D. et al. (2007) How do endosymbionts become organelles? U nderstanding early events in plastid evolution. BioEssays 29, 1239—1246

11 Deschamps, P. et al. (2008) Metabolic symbiosis and the b irth of the p lan t kingdom. Mol. Biol. Evol. 25, 536-548

12 M artin, W. et al. (1998) Gene transfer to the nucleus and the evolution of chloroplasts. Nature 393, 162-165

13 Stoebe, B. and Kowallik, K.V. (1999) Gene-cluster analysis in chloroplast genomics. Trends Genet. 15, 344-347

14 Sato, N. et al. (2005) Mass identification of chloroplast proteins of endosymbiotic origin by phylogenetic profiling based on organism- optimized homologous protein groups. Genome Informât. 16, 56-68

274

Page 8: The eukaryotic tree of life: endosymbiosis takes its TOL · cells, a process known as ‘secondary’ endosymbiosis [9,10]. In the case of secondary endosymbiosis, the plastid acts

Opinion Trends in Ecology and E volution Vo 1.23 N o. 5

15 Richly, E. and Leister, D. (2004) An improved prediction of chloroplast proteins reveals diversities and commonalities in the chloroplast proteomes of Arabidopsis and rice. Gene 329, 11-16

16 M oustafa, A. and B hattacharya, D. (2008) PhyloSort: a user- friendly phylogenetic sorting tool and its application to estim ating the cyanobacterial contribution to algal nuclear genomes. BM C Evol. Biol., DOI: 10.1186/1471-2148-1188-1186 (http://www.biomedcentral.com/bmcevolbiol)

17 Reyes-Prieto, A. et al. (2006) Cyanobacterial contribution to algal nuclear genomes is prim arily lim ited to plastid functions. Curr. Biol. 16, 2320-2325

18 Brinkmann, H. and M artin, W. (1996) Higher-plant chloroplast and cystolic 3-phosphoglycerate kinases: a case of endosymbiotic gene replacement. Plant Mol. Biol. 30, 65-75

19 Cavalier-Smith, T. (2003) Genomic reduction and evolution of novel genetic mem branes and protein-targeting m achinery in eukaryote- eukaryote chim aeras (meta-algae). Philos. Trans. R. Soc. Lond. B Biol. Sei. 358, 109-134

20 Takahashi, F. et al. (2007) Origins of the secondary plastids of euglenophyta and chlorarachniophyta as revealed by an analysis of the plastid-targeting, nuclear-encoded gene pshO. J. Phycol. 43 ,1302- 1309

21 Cavalier-Smith, T. (2002) The phago trophic origin of eukaryotes and phylogenetic classification of protozoa. Int. J. Syst. Evol. Micro. 52, 297-354

22 Rogers, M.B. et al. (2007) The complete chloroplast genome of the chlorarachniophyte Bigelowiella natans : evidence for independent origins of chlorarachniophyte and euglenid secondary endosymbionts. Mol. Biol. Evol. 24, 54-62

23 H arper, J.T. and Keeling, P.J. (2003) Nucleus-encoded, plastid- targeted glyceraldehyde-3-phosphate dehydrogenase (GAPDH) indicates a single origin for chromalveolate plastids. Mol. Biol. Evol. 20, 1730-1735

24 Patron, N.J. et al. (2004) Gene replacement of fructose-1,6-bisphosphate aldolase supports the hypothesis of a single photosynthetic ancestor of chromalveolates. Eukaryot. Cell 3, 1169-1175

25 Li, S.L. et al. (2006) Phylogenomic analysis identifies red algal genes of endosymbiotic origin in the chromalveolates. Mol. Biol. Evol. 23, 663- 674

26 Nosenko, T. and Bhattacharya, D. (2007) Horizontal gene transfer in chromalveolates. BM C Evol. Biol. 7, 173

27 Arm brust, E.V. et al. (2004) The genome of the diatom Thalassiosira pseudonana: ecology, evolution, and metabolism. Science 306, 79-86

28 Ahmadinejad, N. et al. (2007) Genome history in the symbiotic hybrid Euglena gracilis. Gene 402, 35—39

29 Tyler, B.M. et al. (2006) Phytophthora genome sequences uncover evolutionary origins and mechanisms of pathogenesis. Science 313, 1261-1266

30 M atsuzaki, M. et al. (2004) Genome sequence of the ultrasm all unicellular red alga Cyanidioschyzon merolae 10D. Nature 428, 653

31 Yoon, H.S. et al. (2006) Defining the major lineages of red algae (Rhodophyta). J. Phycol. 42, 482-492

32 Rodriguez-Ezpeleta, N. et al. (2007) Detecting and overcoming systematic errors in genome-scale phylogenies. Syst. Biol. 56, 389-399

33 Stibitz, T.B. et al. (2000) Symbiotic origin of a novel actin gene in the cryptophyte Pyrenomonas helgolandii. Mol. Biol. Evol. 17, 1731-1738

34 Tanifuji, G. et al. (2006) Diversity of secondary endosymbiont-derived actin-coding genes in cryptomonads and their evolutionary implications. J. Plant Res. 119, 205-215

35 Hackett, J.D. et al. (2007) Phylogenomic analysis supports the monophyly of cryptophytes and haptophytes and the association of Rhizaria w ith chromalveolates. Mol. Biol. Evol. 24, 1702-1713

36 Palmer, J.D. (2003) The symbiotic b irth and spread of plastids: how m any times and whodunit? J. Phycol. 39, 4-11

37 Archibald, J.M. (2005) Jum ping genes and shrinking genomes - probing the evolution of eukaryotic photosynthesis using genomics. ITJBMB Life 57, 539-547

38 Patron, N.J. et al. (2007) Multiple gene phylogenies support the monophyly of crypto monad and haptophyte host lineages. Curr. Biol. 17, 887-891

39 Burki, F. et al. (2007) Phylogenomics reshuffles the eukaryotic supergroups. PLoS ONE 2, e790

40 Rice, D.W. and Palmer, J.D. (2006) An exceptional horizontal gene transfer in plastids: gene replacem ent by a d istan t bacterial paralog and evidence th a t haptophyte and cryptophyte plastids are sisters. BM C Biol. 4, 31

41 Huang, J . et al. (2004) Phylogenomic evidence supports past endosymbiosis, in tracellular and horizontal gene transfer in Cryptosporidium parvum . Genome Biol. 5, R88

42 Moore, R.B. et al. (2008) A photosynthetic alveolate closely related to apicomplexans parasites. Nature 451, 959-963

43 Archibald, J.M. et al. (2003) Lateral gene transfer and the evolution of plastid-targeted proteins in the secondary plastid-containing alga Bigelowiella natans. Proc. Natl. Acad. Sei. XJ. S. A. 100, 7678—7683

44 Rogers, M.B. et al. (2007) A complex and punctate distribution of three eukaryotic genes derived by lateral gene transfer. BM C Evol. Biol. 7,89

45 Leigh, J.W. et al. (2008) Testing congruence in phylogenomic analysis. Syst. Biol. 57, 4-15

46 Nozaki, H. and Iseki, M. (2007) Phylogeny of prim ary photosynthetic eukaryotes as deduced from slowly evolving nuclear genes. Mol. Biol. Evol. 24, 1592-1595

47 Yoon, H.S. et al. (2004) A molecular timeline for the origin of photosynthetic eukaryotes. Mol. Biol. Evol. 21, 809-818

48 Larkum , A.W.D. et al. (2007) Shopping for plastids. Trends P lant Sei. 12,189-195

49 Stiller, J.W. (2007) Plastid endosymbiosis, genome evolution and the origin of green plants. Trends Plant Sei. 12, 391-396

50 Stiller, J.W. et al. (2003) A single origin of plastids revisited: convergent evolution in organellar genome content. J. Phycol. 39, 95-105

51 Cavalier-Smith, T. (1999) Principles of protein and lipid targeting in secondary symbiogenesis: euglenoid, dinoflagellate, and sporozoan plastid origins and the eukaryote family tree. J. Eukaryot. Microbiol. 46, 347-366

52 Archibald, J.M. (2007) Nucleomorph genomes: structure, function, origin and evolution. BioEssays 29, 1239-1246

53 Yoon, H.S. et al. (2005) Tertiary endosymbiosis driven genome evolution in dinoflagellate algae. Mol. Biol. Evol. 22, 1299-1308

54 Rodriguez-Ezpeleta, N. et al. (2007) Toward resolving the eukaryotic tree: the phylogenetic positions of jakobids and cercozoans. Curr. Biol. 17, 1420-1425

55 Bodyl, A. (2005) Do plastid-related characters support the chromalveolate hypothesis? J. Phycol. 41, 712-719

56 Bodyl, A. and Moszczynski, K. (2006) Did the peridinin plastid evolve through tertiary endosymbiosis? A hypothesis. Eur. J. Phycol. 41 ,435- 448

57 Bodyl, A. et al. (2007) The intracellular cyanobacteria of Paulinella chromatophora: endosymbionts or organelles? Trends Microbiol. 15, 295-296

58 Huang, J . et al. (2004) A first glimpse into the patte rn and scale of gene transfer in Apicomplexa. Int. J. Parasitol. 34, 265-274

59 Teles-Grilo, M.L. et al. (2007) Is there a plastid in Perkinsus atlanticus (Phylum Perkinsozoa)? Eur. J. Protistol. 43, 163-167

60 Frickey, T. and Lupas, A.N. (2004) PhyloGenie: autom ated phylome generation and analysis. Nucleic Acids Res. 32, 5231-5238

Reproduction of material from Elsevier articlesInterested in reproducing part or all o f an artic le published by Elsevier, o r one o f ou r artic le figures?

If so, please contact ou r G lobal R ights D epartm ent w ith details o f how and w here the requested m aterial w ill be used. To subm it a perm ission request on line, please vis it:

www.elsevier.com/locate/permissions

275