identification of genes encoding hypothetical proteins in ... · these genes, we generated 1161...
TRANSCRIPT
©FUNPEC-RP www.funpecrp.com.brGenetics and Molecular Research 10 (3): 1589-1630 (2011)
Identification of genes encoding hypothetical proteins in open-reading frame expressed sequence tags from mammalian stages of Trypanosoma cruzi
C. Martins1, J.L. Reis-Cunha2, M.N. Silva1, E.G. Pereira1, G.J. Pappas Jr.3,4, D.C. Bartholomeu2 and B. Zingales1
1Departamento de Bioquímica, Instituto de Química, Universidade de São Paulo, São Paulo, SP, Brasil2Departamento de Parasitologia, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte, MG, Brasil3Embrapa Recursos Genéticos e Biotecnologia, Brasília, DF, Brasil4Universidade Católica de Brasília, Brasília, DF, Brasil
Corresponding author: B. ZingalesE-mail: [email protected]
Genet. Mol. Res. 10 (3): 1589-1630 (2011)Received November 30, 2010Accepted January 21, 2011Published August 5, 2011DOI http://dx.doi.org/10.4238/vol10-3gmr1140
ABSTRACT. Approximately 50% of the predicted protein-coding genes of the Trypanosoma cruzi CL Brener strain are annotated as hypothetical or conserved hypothetical proteins. To further characterize these genes, we generated 1161 open-reading frame expressed sequence tags (ORESTES) from the mammalian stages of the VL10 human strain. Sequence clustering resulted in 435 clusters, consisting of 339 singletons and 96 contigs. Significant matches to the T. cruzi predicted gene database were found for ~94% contigs and ~69% singletons. These included genes encoding surface proteins, known to be intensely expressed in the parasite mammalian stages and implicated in host cell invasion and/or immune evasion mechanisms. Among 151 contigs and singletons with similarity to predicted hypothetical protein-
1590
©FUNPEC-RP www.funpecrp.com.brGenetics and Molecular Research 10 (3): 1589-1630 (2011)
C. Martins et al.
coding genes and conserved hypothetical protein-coding genes, 83% showed no match with T. cruzi EST and/or proteome databases. These ORESTES are the first experimental evidence that the corresponding genes are in fact transcribed. Sequences with no significant match were searched against several T. cruzi and National Center for Biotechnology Information non-redundant sequence databases. The ORESTES analysis indicated that 124 predicted conserved hypothetical protein-coding genes and 27 predicted hypothetical protein-coding genes annotated in the CL Brener genome are transcribed in the VL10 mammalian stages. Six ORESTES annotated as hypothetical protein-coding genes showing no match to EST and/or proteome databases were confirmed by Northern blot in VL10. The generation of this set of ORESTES complements the T. cruzi genome annotation and suggests new stage-regulated genes encoding hypothetical proteins.
Key words: Trypanosoma cruzi; Mammalian stages; ORESTES; Hypothetical protein-coding genes; Transcription
INTRODUCTION
Trypanosoma cruzi is the etiological agent of Chagas disease, which afflicts 8 to 10 million people in Latin America. As a consequence of the intense migration of Latin-Americans from endemic regions, Chagas disease has now become a public health concern in several countries, such as the United States, Canada, Spain, and Japan, among others.
T. cruzi is a digenetic protozoan of the order Kinetoplastida, able to infect a great variety of mammalian species and is transmitted by insects of the family Triatominae. Chagas disease has different clinical presentations. Approximately 70% of seropositive individuals are asymptomatic (indeterminate form), whereas 30% develop serious cardiac, digestive and/or neurological diseases. Each year, 2-3% of the asymptomatic individuals will progress to the above-mentioned symptomatic manifestations (Pinto Dias, 2006). The outcome of infection in a particular individual is the result of a set of complex interactions among the genetic make-up of the parasite, the host immunogenetic background and environmental factors (reviewed by Campbell et al., 2004; Macedo et al., 2004).
The biological, biochemical and genetic diversity of T. cruzi strains has long been recog-nized along with their eco-epidemiological complexity, which have been reviewed extensively elsewhere (Macedo and Pena, 1998; Campbell et al., 2004; Miles et al., 2009). Although it has been initially proposed that the major genetic variability of T. cruzi re sults from clonal evolu-tion, increasing evidence indicates that hybridization events must have contributed to the present population structure (Tomazi et al., 2009). Numerous approaches have been used to define the number of relevant T. cruzi groups, which have received different designations (reviewed by Zingales et al., 2009). In an effort to unify the nomenclature, there was agreement on an updated system in 2009 and a consensus was reached that T. cruzi strains should be referred to by six dis-crete typing units (DTUs) named T. cruzi I-VI (TcI-VI) (Zingales et al., 2009). A DTU is defined as ‘‘sets of stocks that are genetically more related to each other than to any other stock and that are identifiable by common genetic, molecular, or immunological markers’’ (Tibayrenc, 1998).
1591
©FUNPEC-RP www.funpecrp.com.brGenetics and Molecular Research 10 (3): 1589-1630 (2011)
Transcriptome of Trypanosoma cruzi mammalian stages
The T. cruzi genome sequencing project was completed in 2005, focusing on the hybrid CL Brener (TcVI) strain (El-Sayed et al., 2005). Assembly parameters were adjusted to contend with the high allelic variation, and partial sequencing of the Esmeraldo cl3 strain from the progenitor TcII DTU allowed the definition of two haplotypes of CL Brener, which were named Esmeraldo and non-Esmeraldo (El-Sayed et al., 2005). An updated version of the CL Brener nuclear genome architecture using the genome project data and synteny information with the Trypanosoma brucei genome has been presented (Weatherly et al., 2009). Over 50% of the CL Brener genome is represented by repetitive sequences, consist-ing mostly of large gene families of surface proteins, retrotransposons, subtelomeric repeats (El-Sayed et al., 2005), and the 195-bp satellite DNA (Martins et al., 2008). In the origi-nal report, it was concluded that the CL Brener haploid genome contains ~12,000 protein-coding genes (El-Sayed et al., 2005). However, the number of protein-coding sequences and pseudogenes may be twice the previous estimate (Arner et al., 2007). Putative function could be assigned to 50.8% of the predicted protein-coding genes on the basis of significant similarity to previously characterized proteins or known functional domains (El-Sayed et al., 2005). This means that for around 6000 proteins the function is unknown and further characterization is needed.
The assessment of T. cruzi gene expression was initially obtained by large-scale single pass sequencing of cDNA clones to generate expressed sequence tags (ESTs), mainly of the epi-mastigote stage encountered in the gut of the triatomid insect vector (reviewed by Aguero et al., 2006). More recently, whole genome oligonucleotide microarrays were employed to determine the extent to which T. cruzi regulates mRNA relative abundances over the course of its com-plete life-cycle (Minning et al., 2009). In parallel, several proteomic studies have been carried out to validate or discover novel gene products (Atwood III et al., 2005; Ferella et al., 2008).
Most of the genome, transcriptome and proteomic data have been obtained for the CL Brener strain (TcVI), a clone of the CL strain isolated from the vector Triatoma infestans in south Brazil (Zingales et al., 1997). On the other hand, TcII DTU is the primary cause of severe acute and chronic Chagas disease in southeast and central Brazil (Carranza et al., 2009; Miles et al., 2009). Due to the high genetic diversity of T. cruzi strains, transcriptome and proteomic information from human-derived isolates will contribute to the understanding of pathogenic features of this parasite.
Most ESTs (200-500 nt) correspond to the 5' or 3' end of cDNAs, and may there-fore be derived from untranslated regions of the transcripts. In contrast, open-reading frame (ORF) EST methodology, called ORESTES (Dias et al., 1997), is an alternative approach that allows the generation of sequences preferentially from the transcript central portion, presumably the most informative region of the gene. The basis of the ORESTES approach is to generate short cDNA templates by reverse transcription (RT) and polymerase chain reac-tion (PCR) using arbitrarily selected, non-degenerate primers under low-stringency condi-tions (Dias et al., 1997). Because the position of the amplified fragments within transcripts is highly ordered, a high percentage of fragments encompassing the central portion of the cDNAs is obtained (Dias et al., 1997, 2000). Besides preferential targeting of ORF regions, ORESTES strategy results in a partial equalization of cDNA species, similar to normaliza-tion, which in turn facilitates the discovery of low-abundance transcripts (Dias et al., 2000). Such methodology has been successfully used in the definition of the human transcriptome (Dias et al., 2000; Camargo et al., 2001).
1592
©FUNPEC-RP www.funpecrp.com.brGenetics and Molecular Research 10 (3): 1589-1630 (2011)
C. Martins et al.
In this study, we undertook a novel T. cruzi transcriptome survey employing the ORESTES approach in order to analyze gene expression in the mammalian stages of the TcII VL10 strain, isolated from a patient in the chronic phase of Chagas disease.
MATERIAL AND METHODS
T. cruzi VL10 strain and cultivation
The VL10 strain was isolated from a 15-year-old female patient in Minas Gerais, Brazil. The patient presented a normal electrocardiogram and normal chest X-rays and was considered to be asymptomatic (indeterminate form). The VL10 strain was genotyped as belonging to TcII DTU (Baptista et al., 2006). Trypomastigote and amastigote forms were obtained from the su-pernatant of monolayers of Simian LLC-MK2 cells on the 7th day post-infection, as described previously (Zingales et al., 1997). Epimastigote forms of CL Brener (TcVI DTU) and VL10 strains were cultured in liver infusion-tryptose (LIT) medium with 10% FBS, at 28°C.
cDNA template preparation and DNA sequencing
PolyA+ RNA was prepared from 109 parasites using the µMACS kit (Miltenyi Bio-tec, Auburn, CA, USA) according to manufacturer instructions. RNA samples were treated with DNase I (10 U/50 µg RNA), and the absence of contaminating genomic DNA was confirmed by PCR using primers for the 195-bp satellite DNA and the LSU RNA gene, as previously described (Martins et al., 2008). RNA integrity was confirmed by means of RT-PCR using primers for the T. cruzi actin 2 (GenBank accession #AF494294) and elongation factor 1 (GenBank accession #L76077) genes. To produce cDNA templates, the ORESTES methodology (Dias et al., 2000) was employed with some modifications. Samples of 10-30 ng purified DNAse-treated mRNA were heated at 65°C for 5 min and then subjected to RT at 37°C for 60 min in the presence of 200 U Superscript II Reverse Transcriptase (Invitrogen) and 15 pmol randomly selected primers in a final volume of 20 µL. The five primers used for cDNA synthesis and amplification were 18-mers with GC content of 50 to 77% (PS0229, 5'-GAGTAGACTCTGTCCTGG-3'; PS0236, 5'-TGGGAGTCCATGC GTGCC-3'; PS0225, 5'-GTGACCACCAGCTGCTGC-3'; PS0230, 5'-ACCGCAGCG GACAGCGCC-3'; PS0231, 5'-GACAGCAGGACCAAGCCA-3'). After cDNA synthesis, 2 µL of each single-stranded cDNA was PCR amplified using the same primer employed in the RT reaction with the Ready-to-go PCR beads system (GE Healthcare) according to manufacturer instructions. Touchdown PCR with 45 cycles was used after cDNA denatur-ation at 75°C. Annealing temperatures varied from 52° to 45°C (with progressive reduc-tions of 1° to 2°C per cycle). The samples were run on 1% low-melting temperature aga-rose gels and stained with ethidium bromide. After electrophoresis, profiles composed of a DNA smear were size selected from the gels and cloned in pGEM-T Easy vector (Promega) according to standard procedures. Plasmid templates were prepared using 96-Plasmid Puri-fication System kits for the Biomek 2000 Automation Workstation (Beckman). Sequencing was carried out using an ABI 377 automated DNA sequencer (Applied Biosystems Inc.). Nucleotide sequences determined here have been deposited in the GenBank database with the accession numbers GW395924-GW397084.
1593
©FUNPEC-RP www.funpecrp.com.brGenetics and Molecular Research 10 (3): 1589-1630 (2011)
Transcriptome of Trypanosoma cruzi mammalian stages
EST processing pipeline
Sequence analysis began with base calling using the phred program (Ewing and Green, 1998). Sequence quality trimming was performed with the lucy program (Chou and Holmes, 2001), and vector masking with the cross match program (Ewing B, unpublished results). The processed sequences were clustered with the CAP3 program (Huang and Madan, 1999), generating the sequence consensi that were used in sequence annotation.
Sequence annotation
Contigs and singletons were searched as a query against the T. cruzi predicted gene database (http://tritrypdb.org/tritrypdb/) using the BLASTN algorithm with low complexity filtering turned off. Matches with a bit score ≥100 were considered to be significant. Sequences with no significant match were searched against the T. cruzi contig database using the same parameters, and those that still did not have a significant match were searched against the T. cruzi reads database. Sequences with no signifi-cant similarity with any of the three databases were searched against the non-redundant nucleotide and protein sequences at NCBI using BLASTN and BLASTX, respectively. All 435 consensi were also searched against EST/cDNA sequences and proteome T. cru-zi databases (http://www.tritrypdb.org) and pfam domains (http://pfam.sanger.ac.uk/) (Supplementary Material S1 - Annotation - ORESTES).
Southern blotting
Total parasite DNA was obtained as described (Macedo et al., 1992). DNA preparations (10 µg) were digested with PstI restriction enzymes (New England Biolabs), run on 0.8% aga-rose gels, and blotted onto nylon membranes (Hybond-N, Amersham Pharmacia) using standard protocols. DNA probes were labeled with [a-32P]-dATP with the Random Primer DNA Label-ing kit (Invitrogen). Probes were hybridized at 60°C in 0.1% Ficoll, 0.05% PVP, 1 mM EDTA, 3X SSC, 0.1% SDS and 100 µg/mL salmon sperm DNA. Blots were washed to a final stringency of 1X SSC, 0.1% SDS at 60°C. The radioactive images were recorded on X-ray films.
RNA blotting
Total parasite was RNA extracted with the TRIzol reagent according to manufacturer instructions, DNase-treated and separated on formaldehyde-containing 1% agarose gels in MOPS buffer according to standard procedures and blotted onto nylon membranes (Hybond-N, Amersham Pharmacia). DNA probes were labeled as above. Hybridization was carried out in 5X SSPE, 50% formamide, 5X Denhardt, 0.5% SDS at 42°C, overnight. Blots were washed to final stringency of 1X SSPE, 0.1% SDS at 42°C, and exposed to X-ray films.
RESULTS AND DISCUSSION
ORESTES were obtained from DNase-treated PolyA+ RNA samples of trypomasti-gote and amastigote forms (3:1 proportion) of the human VL10 strain, recovered in the su-
1594
©FUNPEC-RP www.funpecrp.com.brGenetics and Molecular Research 10 (3): 1589-1630 (2011)
C. Martins et al.
pernatant of LLC-MK2 monolayers. Amastigotes were obtained mostly by the premature lysis of infected cells or from the extracellular differentiation of trypomastigotes. Like trypomasti-gotes, extracellular amastigotes are capable of invading both professional and non-professional phagocytes and sustain the parasite’s life cycle (Fernandes et al., 2006 and cited references).
No amplification of T. cruzi abundant genomic sequences, the 195-bp satellite DNA (Martins et al., 2008) and the LSU RNA genes, was observed in the RNA samples, ruling out the presence of contaminating genomic DNA (data not shown). The integrity of the mRNA samples was confirmed by RT-PCR using primers for T. cruzi actin 2 and elongation factor 1 genes (data not shown). Synthesis and amplification of the cDNA templates were obtained with five 18-mer primers. The criterion for primer selection was GC content ≥50%, and no specific sequence constraints were imposed (Dias et al., 2000). After cDNA electrophoresis, profiles composed of a DNA smear between 400 and 900 bp were excised from the gels, cloned in pGEM-T Easy vector and sequenced.
In total 1522 ORESTES were generated, of which 68% were obtained with PS0225 and PS0230 primers. Post-processing to exclude primer and vector sequences, as well as low-quality bases, resulted in 1161 ORESTES with an average phred-generated base-quality value of 20.3 (i.e., 99% base call accuracy) and average size of 500 nt. This size is adequate for transcript identification, since the average size of a T. cruzi coding DNA sequence (CDS) is 1513 bp (El-Sayed et al., 2005). We determined that the ORESTES G+C content was 52.7%, in agreement with the reported CDS G+C content of 53.4%, which is higher than the average G+C content of inter-coding regions (47%) (El-Sayed et al., 2005). Sequence clustering was performed by the CAP3 program, resulting in 479 clusters, which represented 32.4% of the accepted sequences and indicated a high redundancy level of the library.
Of the 479 clusters, 44 represented contaminant sequences, mainly derived from Ma-cacca mulatta (from the LLC-MK2 cell monolayers), and were excluded from the analysis. Of the remaining 435 consensi, 96 (22.1%) contained multiple reads (contigs) and 339 (77.9%) contained only one ORESTES (singletons) (Table 1). The 435 clusters were searched as query against the following T. cruzi databases (http://tritrypdb.org/tritrypdb/): predicted gene con-tigs, individual reads, EST/cDNA sequences and proteome as described in Material and Meth-ods. Sequences with no significant similarity with T. cruzi databases were searched against the non-redundant nucleotide and protein sequences at NCBI using BLASTN and BLASTX, respectively. All 435 consensi were also searched for pfam domains (Supplementary Material S2 - Pfam - ORESTES).
Description Number Percentage
ORESTES contigs 96 100.0 Similarity to Trypanosoma cruzi genes 90 93.7 No similarity to coding sequences, but similarity to T. cruzi contigs 2 2.1 No match with any database 4 4.2ORESTES singletons 339 100.0 Similarity to T. cruzi genes 234 69.0 No similarity to coding sequences, but similarity to T. cruzi contig database 5 1.5 No match with any database 100 29.5
Table 1. ORESTES contig and singleton BLAST matches to nucleotide and protein databases.
1595
©FUNPEC-RP www.funpecrp.com.brGenetics and Molecular Research 10 (3): 1589-1630 (2011)
Transcriptome of Trypanosoma cruzi mammalian stages
Table 1 summarizes the similarity search results. Around 94% contigs and 69% sin-gletons displayed significant matches with T. cruzi predicted genes. Considering the reference CL Brener genome haplotypic distribution and classification into Esmeraldo- or non-Esmeral-do-like alleles (El-Sayed et al., 2005), it was possible to identify that 87.5% contigs and 95.6% singletons had higher identity with Esmeraldo-like alleles (Supplementary Material S1 - An-notation - ORESTES). This confirms that VL10 and Esmeraldo strains are closely related, both belonging to the TcII DTU (Zingales et al., 2009). A total of 2 contigs and 5 singletons corresponded to non-annotated regions of T. cruzi contigs. One of these sequences matched to a maxicircle sequence (mitochondrial genome was not part of the annotated genome da-taset) (Westenberger et al., 2006); two sequences were part of missing portions of annotated partial genes, and four sequences were likely untranslated sequences since they were derived from inter-coding regions (Supplementary Material S1 - Annotation - ORESTES). Around 4.2% contigs and 29.5% singletons resulted in no matches against either the T. cruzi databases (coding-sequences, contigs or reads) or the non-redundant Genbank protein database. Part of these ORESTES may have been derived from cloning artifacts.
Significant information obtained from highly populated clusters is that the relative levels of gene expression can provide a snapshot of the physiological state of the cells. In this direction, the list of the 15 most populated ORESTES clusters and their best blastn hit against T. cruzi genes and contigs is reported in Table 2. Of these clusters, 40% represented hypothetical conserved proteins. The putative dispersed gene family 1 (DGF-1) protein was found in two clusters. DGF-1 proteins are encoded by ~500 genes (El-Sayed et al., 2005) and are localized in the parasite cell membrane. One of the DGF-1 members (DGF1.2) is more abundant in the amastigote stage than in trypomastigotes and epimastigotes, and its expression is developmentally regulated (Lander et al., 2010). Interestingly, transcripts from DGF-1 have not been previously reported in T. cruzi EST databases (Table 2). This was also the case with the sequences of 13 of the 15 most populated ORESTES clusters (Table 2), indicating that the ORESTES strategy is an alternative methodology to identify novel gene transcripts.
Contig ORESTES Best BLASTN hit against Annotation Best BLASTN hit (No.) (count) T. cruzi genes and contigs against T. cruzi ESTs
16 6 Tc00.1047053508179.70 GPR1/FUN34/yaaH family, putative Null11 8 Tc00.1047053507653.40 Dispersed gene family protein 1 (DGF-1), putative Null90 8 Tc00.1047053509865.9 Retrotransposon hot spot (RHS) protein, putative Null89 8 Tc00.1047053507479.29 NUP-1 protein (pseudogene), putative Null20 9 Tc00.1047053510431.190 Hypothetical protein, conserved Null64 10 Tc00.1047053507083.109 Hypothetical protein, conserved Null62 12 Tc00.1047053509157.170 Trans-sialidase, putative Null54 14 Tc00.1047053507099.80 ABC transporter, putative Null95 14 Tc00.1047053506481.30 Hypothetical protein, conserved Null35 15 Tc00.1047053399389.10 Hypothetical protein, conserved Null 6 22 Tc00.1047053511727.290 RNA binding protein, putative TENU4719 epimastigote normalized cDNA library (5' mRNA sequence)50 25 Tc00.1047053508367.50 Hypothetical protein, conserved (pseudogene) TENU1365 epimastigote normalized cDNA library (3' mRNA sequence)23 30 Tc00.1047053504929.5 Hypothetical protein, conserved Null83 32 Tc00.1047053510275.130 Dispersed gene family protein 1 (DGF-1), putative Null42 89 Null Not identified Null
Table 2. Annotation of the most populated ORESTES clusters obtained after selecting the best BLASTN hit against Trypanosoma cruzi databases.
1596
©FUNPEC-RP www.funpecrp.com.brGenetics and Molecular Research 10 (3): 1589-1630 (2011)
C. Martins et al.
Analysis of the best BLASTX hits against T. cruzi proteome databases indicated that of these 15 ORESTES clusters, only the proteins related to contig89 (putative NUP-1 protein, pseudogene) and contig90 (putative retrotransposon hot spot (RHS) protein) have been depos-ited in this database (Supplementary Material S1 - Annotation - ORESTES). Coincidentally, RHS protein was characterized in amastigote, trypomastigote and metacyclic infective stages, and the nuclear envelope protein NUP-1, in the metacyclic insect stage. For the highly popu-lated contig42, no match was found with any database (Table 2).
Next we analyzed the ORESTES matching T. cruzi predicted genes (Figure 1). The ma-jority of the contigs (55%) and large proportion of singletons (43%) were derived from known coding sequences previously reported in T. cruzi and/or other organisms, and annotated here as predicted protein-coding genes. These included genes encoding surface proteins such as the trans-sialidase/gp85 superfamily, mucin-associated surface protein (MASP), mucin TcMUCII, DGF-1, and surface protease GP63 (Supplementary Material S1 - Annotation - ORESTES), all known to be intensely expressed in the mammalian stages of the parasite (Frasch, 2000; Grandgenett et al., 2000; Buscaglia et al., 2004; Bartholomeu et al., 2009; Lander et al., 2010).
Figure 1. Distribution of the ORESTES contigs (N = 90; Panel A) and singletons (N = 234; Panel B) among classes of Trypanosoma cruzi predicted genes.
1597
©FUNPEC-RP www.funpecrp.com.brGenetics and Molecular Research 10 (3): 1589-1630 (2011)
Transcriptome of Trypanosoma cruzi mammalian stages
In comparison with Leishmania major and T. brucei genomes, T. cruzi shows a remarkable expansion of these gene families, corresponding to approximately 20% of the total protein-coding genes. Some of these families are T. cruzi-specific such as MASP, mu-cin TcMUC and DGF-1, while others are much more expanded in T. cruzi compared with T. brucei and/or L. major, such as gp85/trans-sialidase and GP63 (El-Sayed et al., 2005 and cited references). Each of the three parasite species evolved a specific surface coat adapted for survival and proliferation in the different niches they occupy in the mammalian host. Unlike T. brucei, which is exclusively extracellular, and L. major, which infects macro-phages, T. cruzi has the ability to infect and proliferate within a large variety of host cell types. It has been demonstrated that several of these T. cruzi surface proteins are glycosyl-ated and mediate host cell attachment and invasion and/or intracellular replication (Acosta-Serrano et al., 2001; Kulkarni et al., 2009; Albertti et al., 2010). Additionally, some of these molecules are implicated in immune evasion mechanisms (Pereira-Chioccola et al., 2000; Pitcovsky et al., 2002; Gao et al., 2002).
We have also found that a small proportion of the contigs and singletons match retroelements, related to the modeling of T. cruzi genome (El-Sayed et al., 2005), the transcribed repetitive 195-bp satellite DNA sequence (Martins et al., 2008) and ribosomal RNA genes.
In addition to T. cruzi genes experimentally characterized, we also identified a con-siderable number of contigs (28%) and a large proportion of singletons (42%) corresponding to genes matching hypothetical proteins also identified in other organisms, but with unknown function (annotated as predicted conserved hypothetical protein-coding genes) (Figure 1). Of particular interest are a number of contigs (7%) and singletons (9%) matching hypothetical proteins, which are putatively T. cruzi-specific genes.
Aiming to investigate the genomic organization of some ORESTES sequences, Southern blots of total DNA of epimastigote forms of CL Brener and VL10 strains were digested with PstI and hybridized with five probes: three ORESTES that showed no match with any database (one member of contig42, contig59 and the singleton TCA236-IQ001_E12_.B_10.ab1); one member of contig83 (DGF-1; GenBank accession #GW396523) and contig23 (hypothetical protein, conserved; GenBank accession #GW396011) (Figure 2). No hybridization signal was obtained for the highly populated contig42 probe (Table 2), confirm-ing that these sequences are artifacts. On the other hand, the probes of the two other no-match ORESTES (contig59 and singleton TCA236-IQ001_E12_.B_10.ab1) identified DNA bands in the two strains (Figure 2B and C). Some differences in the organization of DGF-1 in the genome of CL Brener and VL10 could be detected (Figure 2D). In contrast, the same hybrid-ization pattern in the two strains was observed for the conserved hypothetical protein-coding gene of contig23 (Figure 2E).
Among the ORESTES contigs and singletons with similarity to predicted conserved hypothetical protein-coding genes and predicted hypothetical protein-coding genes (total 151), 82.8% (125/151) showed no matches against T. cruzi EST and proteome databases (Sup-plementary Material S1 - Annotation - ORESTES). These ORESTES sequences are the first experimental evidence that the corresponding genes are in fact transcribed. Those matching hypothetical protein-coding genes, which showed no similarity to EST and proteome data (12 in total), deserve further investigation since they are likely to be T. cruzi-specific and ex-pressed in the mammalian infective stages.
1598
©FUNPEC-RP www.funpecrp.com.brGenetics and Molecular Research 10 (3): 1589-1630 (2011)
C. Martins et al.
To further characterize the 151 contigs and singletons matching hypothetical and con-served hypothetical proteins, the corresponding proteins were searched for conserved domains against the Pfam database (http://pfam.sanger.ac.uk/), with a cutoff e-value of 10-5 (Supple-mentary Material S2 - Pfam - ORESTES). The data are summarized in Table 3. Considering the 31 ORESTES contigs, Pfam domains were identified in 7 of 25 conserved hypothetical proteins and in 1 of 6 hypothetical proteins. Regarding the 120 singletons, Pfam domains were identified in 19 of 99 conserved hypothetical proteins and in 2 of 21 hypothetical proteins (Table 3). Additionally, conserved protein domains could be recognized in 1 of the 6 contigs and 2 of the 14 singletons annotated as hypothetical protein-coding genes and which showed no match against EST and proteome databases (Table 3).
Figure 2. Genomic organization of ORESTES. Southern blot of total DNA of CL Brener (CL) and VL10 (VL) strains digested with PstI. A. Ethidium bromide-stained gel and hybridizations with 32P-labeled probes; B. contig59 (no match); C. singleton (TCA236-IQ001_E12_.B_10.ab1, no match); D. contig83 (GenBank accession #GW396523); E. contig23 (GenBank accession #GW396011). Molecular mass markers are indicated in kb.
Description Total No match with EST and proteome databases
Number Pfam domains Number Pfam domains
Contigs Conserved hypothetical proteins 25 7 22 7 Hypothetical proteins 6 1 6 1Singletons Conserved hypothetical proteins 99 19 83 18 Hypothetical proteins 21 2 14 2
Table 3. Pfam domains of conserved hypothetical proteins and hypothetical proteins of ORESTES contigs and singletons.
1599
©FUNPEC-RP www.funpecrp.com.brGenetics and Molecular Research 10 (3): 1589-1630 (2011)
Transcriptome of Trypanosoma cruzi mammalian stages
In our analysis, we found several ORESTES annotated as hypothetical protein-coding genes that showed no match against EST and proteome databases. Since, in prin-ciple, these genes could be T. cruzi-specific, we decided to confirm the transcript abun-dance of some of these genes by means of Northern blot assays with DNase-treated total RNA of trypomastigote + amastigote forms of the VL10 strain. Because most of T. cruzi ESTs were obtained from CL Brener epimastigotes (reviewed by Aguero et al., 2006), RNA blots of this parasite stage were also included in the analysis (Figure 3). The blots were hybridized with six labeled ORESTES probes. Two probes corresponded to one member of contig03 (GenBank accession #GW395938) and one member of contig25 (GenBank accession #GW396121), and the other four probes to singletons (GenBank accession #GW396611, #GW396648, #GW396850, and #GW396862) (Figure 3). Tran-scripts could be detected in the mammalian forms of VL10 for all six probes, corrobo-rating the ORESTES data indicating that these genes are transcribed. Only the probe derived from contig25 also identified transcripts in CL Brener epimastigotes (Figure 3, panel H). At least three strong hybridization bands were detected by one singleton probe (GenBank accession #GW396862) (Figure 3, panel G). We verified that the hypothetical protein-coding gene, which was the best hit for this singleton matched six other hypo-thetical protein-coding genes (e-value ranging from 10-32 to 10-146) with coding regions varying in length from 585 to 1140 bp. Some of these genes were located at contig ends, and therefore, the size of their corresponding transcripts could not be estimated based on the genome data. Therefore, the various hybridization bands could be originated by cross-hybridization of the probe with these related sequences. For all the other probes, the transcript sizes were compatible with the length of the CDS plus flanking sequences, most probably untranslated regions.
Figure 3. Transcription of hypothetical protein-coding genes. Blots of DNase-treated total RNA of CL Brener epimastigotes (CL) and VL10 trypomastigote + amastigote forms (VL). A. Ethidium bromide-stained agarose gel. The blots were hybridized with 32P-labeled probes; B. Actin 2 housekeeping gene; C. singleton (GenBank accession #GW396611); D. singleton (GenBank accession #GW396648); E. singleton (GenBank accession #GW396850); F. contig03 (GenBank accession #GW395938); G. singleton (GenBank accession #GW396862); H. contig25 (GenBank accession #GW396121). The estimated sizes (in kb) of the transcripts are indicated on the right side of the panels.
1600
©FUNPEC-RP www.funpecrp.com.brGenetics and Molecular Research 10 (3): 1589-1630 (2011)
C. Martins et al.
The data presented in Figure 3 indicating that five ORESTES probes of hypothetical protein-coding genes recognized RNA transcripts only in the VL10 mammalian stages strongly suggest that the level of expression of these genes is developmentally regulated. In fact, dif-ferential mRNA levels conferring stage-regulated gene expression have been detected on T. cruzi microarrays (Minning et al., 2009) and in several other studies (reviewed by Teixeira and Da Rocha, 2003). On the other hand, the differential hybridization between the CL Brener (TcVI) and VL10 (TcII) strains could be related to genetic differences between the two T. cruzi DTUs (Miles et al., 2009; Zingales et al., 2009).
In conclusion, the ORESTES analysis led to the verification that 124 predicted con-served hypothetical protein-coding genes and 27 predicted hypothetical protein-coding genes annotated in the CL Brener genome are transcribed in the mammalian stages of the VL10 strain. The generation of this small set of ORESTES helps to complement the T. cruzi genome annotation, since the detection of missing portions of genes belonging to the contigs derived from the non-annotated dataset has been possible. The verification that some genes annotated as coding for hypothetical proteins are in fact transcribed will stimulate the characterization of the corresponding proteins that could not be identified by the current proteomic methods due to their low levels or physical properties. This can contribute to a better understanding of the parasite biology and pathogenicity since these are a priori T. cruzi-specific genes. At the same time, this approach provides information on the putative developmental expression of the genes, helping in the use of more assertive strategies for functional studies.
ACKNOWLEDGMENTS
We are greatly indebted to Dr. Emmanuel Dias Neto and Dr. Elida Ojopi for insightful suggestions. Research supported by Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP) and Ministério de Ciência e Tecnologia/Conselho Nacional de Desenvolvimento Científico e Tecnológico/Ministério da Saúde (MCT/CNPq/MS-SCTIE-DECIT - Edital de Doenças Negligenciadas) to B. Zingales, UNICEF/UNDP/World Bank WHO for Research and Training in Tropical Diseases, Fundação de Amparo à Pesquisa do Estado de Minas Gerais (FAPEMIG), and Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq) to D.C. Bartholomeu. All authors, except M.N. Silva are recipients of CNPq fellowships.
REFERENCES
Acosta-Serrano A, Almeida IC, Freitas-Junior LH, Yoshida N, et al. (2001). The mucin-like glycoprotein super-family of Trypanosoma cruzi: structure and biological roles. Mol. Biochem. Parasitol. 114: 143-150.
Aguero F, Zheng W, Weatherly DB, Mendes P, et al. (2006). TcruziDB: an integrated, post-genomics community resource for Trypanosoma cruzi. Nucleic Acids Res. 34: D428-D431.
Albertti LA, Macedo AM, Chiari E, Andrews NW, et al. (2010). Role of host lysosomal associated membrane protein (LAMP) in Trypanosoma cruzi invasion and intracellular development. Microb. Infect. 12: 784-789.
Arner E, Kindlund E, Nilsson D, Farzana F, et al. (2007). Database of Trypanosoma cruzi repeated genes: 20,000 additional gene variants. BMC Genom. 8: 391.
Atwood JA III, Weatherly DB, Minning TA, Bundy B, et al. (2005). The Trypanosoma cruzi proteome. Science 309: 473-476.
Baptista CS, Vencio RZ, Abdala S, Carranza JC, et al. (2006). Differential transcription profiles in Trypanosoma cruzi associated with clinical forms of Chagas disease: Maxicircle NADH dehydrogenase subunit 7 gene truncation in asymptomatic patient isolates. Mol. Biochem. Parasitol. 150: 236-248.
Bartholomeu DC, Cerqueira GC, Leao AC, daRocha WD, et al. (2009). Genomic organization and expression profile of
1601
©FUNPEC-RP www.funpecrp.com.brGenetics and Molecular Research 10 (3): 1589-1630 (2011)
Transcriptome of Trypanosoma cruzi mammalian stages
the mucin-associated surface protein (masp) family of the human pathogen Trypanosoma cruzi. Nucleic Acids Res. 37: 3407-3417.
Buscaglia CA, Campo VA, Di Noia JM, Torrecilhas AC, et al. (2004). The surface coat of the mammal-dwelling infective trypomastigote stage of Trypanosoma cruzi is formed by highly diverse immunogenic mucins. J. Biol. Chem. 279: 15860-15869.
Camargo AA, Samaia HP, Dias-Neto E, Simao DF, et al. (2001). The contribution of 700,000 ORF sequence tags to the definition of the human transcriptome. Proc. Natl. Acad. Sci. U. S. A. 98: 12103-12108.
Campbell DA, Westenberger SJ and Sturm NR (2004). The determinants of Chagas disease: connecting parasite and host genetics. Curr. Mol. Med. 4: 549-562.
Carranza JC, Valadares HM, D’Avila DA, Baptista RP, et al. (2009). Trypanosoma cruzi maxicircle heterogeneity in Chagas disease patients from Brazil. Int. J. Parasitol. 39: 963-973.
Chou HH and Holmes MH (2001). DNA sequence quality trimming and vector removal. Bioinformatics. 17: 1093-1104.Dias NE, Caballero OL, Vidigal TH, Pena SD, et al. (1997). Partially degraded DNA of parasitological interest serves as an
adequate template for the production of random amplified polymorphic DNAs (RAPDs). J. Parasitol. 83: 753-755.Dias NE, Correa RG, Verjovski-Almeida S, Briones MR, et al. (2000). Shotgun sequencing of the human transcriptome
with ORF expressed sequence tags. Proc. Natl. Acad. Sci. U. S. A. 97: 3491-3496.El-Sayed NM, Myler PJ, Bartholomeu DC, Nilsson D, et al. (2005). The genome sequence of Trypanosoma cruzi, etiologic
agent of Chagas disease. Science 309: 409-415.Ewing B and Green P (1998). Base-calling of automated sequencer traces using phred. II. Error probabilities. Genome
Res. 8: 186-194.Ferella M, Nilsson D, Darban H, Rodrigues C, et al. (2008). Proteomics in Trypanosoma cruzi - localization of novel
proteins to various organelles. Proteomics 8: 2735-2749.Fernandes AB, Neira I, Ferreira AT and Mortara RA (2006). Cell invasion by Trypanosoma cruzi amastigotes of distinct
infectivities: studies on signaling pathways. Parasitol. Res. 100: 59-68.Frasch AC (2000). Functional diversity in the trans-sialidase and mucin families in Trypanosoma cruzi. Parasitol. Today
16: 282-286.Gao W, Wortis HH and Pereira MA (2002). The Trypanosoma cruzi trans-sialidase is a T cell-independent B cell mitogen
and an inducer of non-specific Ig secretion. Int. Immunol. 14: 299-308.Grandgenett PM, Coughlin BC, Kirchhoff LV and Donelson JE (2000). Differential expression of GP63 genes in
Trypanosoma cruzi. Mol. Biochem. Parasitol. 110: 409-415.Huang X and Madan A (1999). CAP3: A DNA sequence assembly program. Genome Res. 9: 868-877.Kulkarni MM, Olson CL, Engman DM and McGwire BS (2009). Trypanosoma cruzi GP63 proteins undergo stage-specific
differential posttranslational modification and are important for host cell infection. Infect. Immun. 77: 2193-2200.Lander N, Bernal C, Diez N, Anez N, et al. (2010). Localization and developmental regulation of a dispersed gene family
1 protein in Trypanosoma cruzi. Infect. Immun. 78: 231-240.Macedo AM and Pena SD (1998). Genetic variability of Trypanosoma cruzi: Implications for the pathogenesis of Chagas
disease. Parasitol. Today 14: 119-124.Macedo AM, Martins MS, Chiari E and Pena SD (1992). DNA fingerprinting of Trypanosoma cruzi: a new tool for
characterization of strains and clones. Mol. Biochem. Parasitol. 55: 147-153.Macedo AM, Machado CR, Oliveira RP and Pena SD (2004). Trypanosoma cruzi: genetic structure of populations and
relevance of genetic variability to the pathogenesis of Chagas disease. Mem. Inst. Oswaldo Cruz 99: 1-12.Martins C, Baptista CS, Ienne S, Cerqueira GC, et al. (2008). Genomic organization and transcription analysis of the 195-
bp satellite DNA in Trypanosoma cruzi. Mol. Biochem. Parasitol. 160: 60-64.Miles MA, Llewellyn MS, Lewis MD, Yeo M, et al. (2009). The molecular epidemiology and phylogeography of
Trypanosoma cruzi and parallel research on Leishmania: looking back and to the future. Parasitology 136: 1509-1528.
Minning TA, Weatherly DB, Atwood J, III, Orlando R, et al. (2009). The steady-state transcriptome of the four major life-cycle stages of Trypanosoma cruzi. BMC Genom. 10: 370.
Pereira-Chioccola VL, Acosta-Serrano A, Correia dA, I, Ferguson MA, et al. (2000). Mucin-like molecules form a negatively charged coat that protects Trypanosoma cruzi trypomastigotes from killing by human anti-alpha-galactosyl antibodies. J. Cell Sci. 113 (Pt 7): 1299-1307.
Pinto Dias JC (2006). The treatment of Chagas disease (South American trypanosomiasis). Ann. Intern. Med. 144: 772-774.Pitcovsky TA, Buscaglia CA, Mucci J and Campetella O (2002). A functional network of intramolecular cross-reacting
epitopes delays the elicitation of neutralizing antibodies to Trypanosoma cruzi trans-sialidase. J. Infect. Dis. 186: 397-404.
1602
©FUNPEC-RP www.funpecrp.com.brGenetics and Molecular Research 10 (3): 1589-1630 (2011)
C. Martins et al.
Teixeira SMR and Da Rocha WD (2003). Control of gene expression and genetic manipulation in Trypanosomatidae. Genet. Mol. Res. 2: 148-158.
Tibayrenc M (1998). Genetic epidemiology of parasitic protozoa and other infectious agents: the need for an integrated approach. Int. J. Parasitol. 28: 85-104.
Tomazi L, Kawashita SY, Pereira PM, Zingales B, et al. (2009). Haplotype distribution of five nuclear genes based on network genealogies and Bayesian inference indicates that Trypanosoma cruzi hybrid strains are polyphyletic. Genet. Mol. Res. 8: 458-476.
Weatherly DB, Boehlke C and Tarleton RL (2009). Chromosome level assembly of the hybrid Trypanosoma cruzi genome. BMC Genom. 10: 255.
Westenberger SJ, Cerqueira GC, El-Sayed NM, Zingales B, et al. (2006). Trypanosoma cruzi mitochondrial maxicircles display species- and strain-specific variation and a conserved element in the non-coding region. BMC Genom. 7: 60.
Zingales B, Pereira ME, Almeida KA, Umezawa ES, et al. (1997). Biological parameters and molecular markers of clone CL Brener - the reference organism of the Trypanosoma cruzi genome project. Mem. Inst. Oswaldo Cruz 92: 811-814.
Zingales B, Andrade SG, Briones MR, Campbell DA, et al. (2009). A new consensus for Trypanosoma cruzi intraspecific nomenclature: second revision meeting recommends TcI to TcVI. Mem. Inst. Oswaldo Cruz 104: 1051-1054.
1603
©FUNPEC-RP www.funpecrp.com.brGenetics and Molecular Research 10 (3): 1589-1630 (2011)
Transcriptome of Trypanosoma cruzi mammalian stages
Que
ry
Bes
t bla
stn
hit a
gain
st
Ann
otat
ion
Bes
t bla
stn
hit a
gain
st
EST-
stag
e B
est b
last
x hi
t aga
inst
Pr
oteo
me-
stag
e C
ontig
s R
eads
T.
cru
zi g
enes
and
con
tigs
T.
cru
zi E
STs
T.
cru
zi p
rote
ome
Con
tig1
2350
6.rr
na00
001
RN
A ri
boso
mal
TE
NU
0441
T. c
ruzi
epi
mas
tigot
e Ep
imas
tigot
e N
ull
Nul
l
norm
aliz
ed c
DN
A li
brar
y T.
cru
zi
cDN
A c
lone
4m
9 3'
,
mR
NA
sequ
ence
C
ontig
2 Tc
00.1
0470
5350
6559
.559
A
ntig
enic
pro
tein
, put
ativ
e N
ull
Nul
l N
ull
Nul
lC
ontig
3 Tc
00.1
0470
5350
4039
.110
H
ypot
hetic
al p
rote
in
Nul
l N
ull
Nul
l N
ull
Con
tig4
Tc00
.104
7053
5074
83.4
Po
lyub
iqui
tin (p
seud
ogen
e), p
utat
ive
TcTR
-583
TcT
R T
. cru
zi c
DN
A
Tryp
omas
tigot
e 16
0_Ep
i Ep
imas
tigot
e
clon
e 03
i13
5', m
RN
A se
quen
ceC
ontig
5 Tc
00.1
0470
5350
6265
.100
A
TP-d
epen
dent
RN
A h
elic
ase,
put
ativ
e N
ull
Nul
l N
ull
Nul
lC
ontig
6 Tc
00.1
0470
5351
1727
.290
R
NA
-bin
ding
pro
tein
, put
ativ
e TE
NU
4719
T. c
ruzi
epi
mas
tigot
e Ep
imas
tigot
e N
ull
Nul
l
norm
aliz
ed c
DN
A li
brar
y T.
cru
zi
cDN
A c
lone
40o
22 5
',
mRN
A se
quen
ceC
ontig
7 Tc
00.1
0470
5350
9695
.150
K
ines
in, p
utat
ive
Nul
l N
ull
Nul
l N
ull
Con
tig8
Tc00
.104
7053
5101
01.4
30
40S
ribos
omal
pro
tein
S21
, put
ativ
e am
astc
-406
TcA
M T
. cru
zi cD
NA
A
mas
tigot
e N
ull
Nul
l
clon
e 14
F3 5
', m
RN
A se
quen
ceC
ontig
9 Tc
00.1
0470
5350
6591
.50
Hyp
othe
tical
pro
tein
, con
serv
ed (p
seud
ogen
e)
Nul
l N
ull
Nul
l N
ull
Con
tig10
Tc
00.1
0470
5351
0769
.80
Ank
yrin
repe
at p
rote
in, p
utat
ive
Nul
l N
ull
Nul
l N
ull
Con
tig11
Tc
00.1
0470
5350
7653
.40
Dis
pers
ed g
ene
fam
ily p
rote
in
Nul
l N
ull
Nul
l N
ull
1 (D
GF-
1), p
utat
ive
Con
tig12
Tc
00.1
0470
5350
6401
.210
H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l N
ull
Nul
lC
ontig
13
Tc00
.104
7053
5085
51.3
9 H
exos
e tra
nspo
rter,
puta
tive
amas
tc-4
11 T
cAM
T. c
ruzi
cDN
A
Am
astig
ote
Nul
l N
ull
cl
one
14F8
5',
mR
NA
sequ
ence
Con
tig14
Tc
00.1
0470
5350
4137
.150
H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l N
ull
Nul
lC
ontig
15
Tc00
.104
7053
5036
27.2
0 Pr
otei
n ki
nase
, put
ativ
e N
ull
Nul
l N
ull
Nul
lC
ontig
16
Tc00
.104
7053
5081
79.7
0 G
PR1/
FUN
34/y
aaH
fam
ily, p
utat
ive
Nul
l N
ull
Nul
l N
ull
Con
tig17
Tc
00.1
0470
5350
8865
.4
Prot
ein
phos
phat
ase
2C, p
utat
ive
Nul
l N
ull
Nul
l N
ull
Con
tig18
Tc
00.1
0470
5350
9985
.60
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
Nul
l N
ull
Con
tig19
Tc
00.1
0470
5351
0431
.190
H
ypot
hetic
al p
rote
in
Nul
l N
ull
Nul
l N
ull
Con
tig20
Tc
00.1
0470
5350
6825
.60
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
Nul
l N
ull
Con
tig21
24
358.
rrna
0000
1 R
NA
ribo
som
al
TEN
Q08
45 T
. cru
zi e
pim
astig
ote
Epim
astig
ote
Nul
l N
ull
no
rmal
ized
cD
NA
libr
ary
T. c
ruzi
cD
NA
, mR
NA
sequ
ence
Con
tig22
Tc
00.1
0470
5351
0759
.180
H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l N
ull
Nul
lC
ontig
23
Tc00
.104
7053
5049
29.5
H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l N
ull
Nul
lC
ontig
24
Tc00
.104
7053
5116
11.1
1 H
ypot
hetic
al p
rote
in
Nul
l N
ull
Nul
l N
ull
Con
tig25
Tc
00.1
0470
5350
3975
.20
Hyp
othe
tical
pro
tein
N
ull
Nul
l N
ull
Nul
lC
ontig
26
Tc00
.104
7053
5103
07.6
0 R
etro
trans
poso
n ho
t spo
t pro
tein
N
ull
Nul
l N
ull
Nul
l
(R
HS,
pse
udog
ene)
, put
ativ
e
Supp
lem
enta
ry M
ater
ial S
1. A
nnot
atio
n - O
RES
TES
cont
igs.
Con
tinue
d on
nex
t pag
e
1604
©FUNPEC-RP www.funpecrp.com.brGenetics and Molecular Research 10 (3): 1589-1630 (2011)
C. Martins et al.
Que
ry
Bes
t bla
stn
hit a
gain
st
Ann
otat
ion
Bes
t bla
stn
hit a
gain
st
EST-
stag
e B
est b
last
x hi
t aga
inst
Pr
oteo
me-
stag
e C
ontig
s R
eads
T.
cru
zi g
enes
and
con
tigs
T.
cru
zi E
STs
T.
cru
zi p
rote
ome
Con
tig27
Tc
00.1
0470
5350
7477
.10
Dis
pers
ed g
ene
fam
ily p
rote
in
Nul
l N
ull
Nul
l
1
(DG
F-1)
, put
ativ
eC
ontig
28
Tc00
.104
7053
5041
05.1
50
Impo
rtin
beta
-1 su
buni
t, pu
tativ
e N
ull
Nul
l N
ull
Nul
lC
ontig
29
Tc00
.104
7053
5116
77.1
0 D
ispe
rsed
gen
e fa
mily
pro
tein
N
ull
Nul
l N
ull
1 (D
GF-
1), p
utat
ive
Con
tig30
Tc
00.1
0470
5350
3547
.20
Dis
pers
ed g
ene
fam
ily p
rote
in 1
TE
NS1
238
T. c
ruzi
epi
mas
tigot
e Ep
imas
tigot
e N
ull
Nul
l
(D
GF-
1, p
seud
ogen
e), p
utat
ive
norm
aliz
ed c
DN
A li
brar
y T.
cru
zi
cDN
A c
lone
123
8 5'
,
mR
NA
sequ
ence
Con
tig31
Tc
00.1
0470
5350
9473
.10
Hyp
othe
tical
pro
tein
(pse
udog
ene)
N
ull
Nul
l N
ull
Nul
lC
ontig
32
Tc00
.104
7053
5104
03.5
0 Re
trotra
nspo
son
hot s
pot p
rote
in
Nul
l N
ull
Nul
l N
ull
(RH
S, p
seud
ogen
e), p
utat
ive
Con
tig33
Tc
00.1
0470
5350
9429
.4
Tran
s-si
alid
ase
(pse
udog
ene)
, N
ull
Nul
l N
ull
Nul
l
pu
tativ
e, 8
096.
t000
37C
ontig
34
Tc00
.104
7053
5101
01.3
40
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
Nul
l N
ull
Con
tig35
Tc
00.1
0470
5339
9389
.10
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
Nul
l N
ull
Con
tig36
Tc
00.1
0470
5350
8647
.220
H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l N
ull
Nul
lC
ontig
37
Tc00
.104
7053
5070
99.4
0 H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l N
ull
Nul
lC
ontig
38
Tc00
.104
7053
5112
83.2
20
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
Nul
l N
ull
Con
tig39
C
ontig
1047
0534
3535
5 N
on-a
nnot
ated
con
tig. L
ikel
y th
e
amas
tc-2
13 T
cAM
T. c
ruzi
cD
NA
A
mas
tigot
e N
ull
Nul
l 10
4705
3435
351
5' m
issi
ng p
art o
f the
cod
ing
cl
one
12D
7 5'
, mR
NA
sequ
ence
regi
on o
f the
5' p
artia
l gen
e
Tc
00.1
0470
5350
6489
.10
1047
0534
3535
5C
ontig
40
Tc00
.104
7053
5082
57.1
70
Dyn
ein
heav
y ch
ain,
put
ativ
e N
ull
Nul
l N
ull
Nul
lC
ontig
41
Tc00
.104
7053
5076
63.5
0 R
NA
hel
icas
e, p
utat
ive
Nul
l N
ull
Nul
l N
ull
Con
tig42
N
ull
Not
iden
tified
N
ull
Nul
l N
ull
Nul
l N
ull
Nul
lC
ontig
43
Tc00
.104
7053
5068
21.2
0 A
TP-d
epen
dent
Clp
pro
teas
e su
buni
t, he
at
Nul
l N
ull
421_
Am
a.m
eta.
epi
Am
astig
ote/
shoc
k pr
otei
n 10
0 (p
seud
ogen
e), p
utat
ive
M
etac
yclic
/
se
rine
pept
idas
e (p
seud
ogen
e), p
utat
ive
Ep
imas
tigot
eC
ontig
44
Tc00
.104
7053
5099
31.2
0 Tr
ans-
sial
idas
e, p
utat
ive
TEN
U48
70 T
. cru
zi e
pim
astig
ote
Ep
imas
tigot
e N
ull
Nul
l
norm
aliz
ed c
DN
A li
brar
y T.
cru
zi
cDN
A c
lone
43g
22 5
',
mR
NA
sequ
ence
Con
tig45
Tc
00.1
0470
5350
5163
.80
Olig
osac
char
yl tr
ansf
eras
e su
buni
t, pu
tativ
e N
ull
Nul
l N
ull
Nul
lC
ontig
46
Tc00
.104
7053
5100
99.1
20
D-is
omer
spec
ific
2-hy
drox
yaci
d
TcA
maP
l06R
un01
_D09
T. c
ruzi
A
mas
tigot
e N
ull
Nul
l
de
hydr
ogen
ase-
prot
ein,
put
ativ
e am
astig
ote
cDN
A li
brar
y T.
cru
zi
cD
NA
5' s
imila
r to
hypo
thet
ical
pr
otei
n, m
RN
A se
quen
ceC
ontig
47
Tc00
.104
7053
5091
71.6
0 H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l N
ull
Nul
l
Supp
lem
enta
ry M
ater
ial S
1. C
ontin
ued.
Con
tinue
d on
nex
t pag
e
1605
©FUNPEC-RP www.funpecrp.com.brGenetics and Molecular Research 10 (3): 1589-1630 (2011)
Transcriptome of Trypanosoma cruzi mammalian stages
Que
ry
Bes
t bla
stn
hit a
gain
st
Ann
otat
ion
Bes
t bla
stn
hit a
gain
st
EST-
stag
e B
est b
last
x hi
t aga
inst
Pr
oteo
me-
stag
e C
ontig
s R
eads
T.
cru
zi g
enes
and
con
tigs
T.
cru
zi E
STs
T.
cru
zi p
rote
ome
Con
tig48
Tc
00.1
0470
5350
6649
.90
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
Nul
l N
ull
Con
tig49
Tc
00.1
0470
5350
6795
.80
Prol
ine
race
mas
e TE
NS2
013
T. c
ruzi
epi
mas
tigot
e
Epim
astig
ote
Nul
l N
ull
no
rmal
ized
cD
NA
libr
ary
T. c
ruzi
cDN
A c
lone
201
3 5'
, mR
NA
sequ
ence
Con
tig50
Tc
00.1
0470
5350
8367
.50
Hyp
othe
tical
pro
tein
, con
serv
ed (p
seud
ogen
e)
TEN
U13
65 T
. cru
zi e
pim
astig
ote
Ep
imas
tigot
e N
ull
Nul
l
norm
aliz
ed c
DN
A li
brar
y T.
cru
zi
cD
NA
clo
ne 2
1e3
3', m
RN
A se
quen
ceC
ontig
51
Tc00
.104
7053
5064
05.1
40
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
Nul
l N
ull
Con
tig52
Tc
00.1
0470
5350
6661
.50
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
Nul
l N
ull
Con
tig53
Tc
00.1
0470
5351
0689
.10
Coa
tom
er a
lpha
subu
nit (
pseu
doge
ne),
puta
tive
Nul
l N
ull
Nul
l N
ull
Con
tig54
Tc
00.1
0470
5350
7099
.80
AB
C tr
ansp
orte
r, pu
tativ
e N
ull
Nul
l N
ull
Nul
lC
ontig
55
Tc00
.104
7053
5037
01.9
D
isper
sed
gene
fam
ily p
rote
in
TcA
maP
l09R
un01
_B10
T. c
ruzi
A
mas
tigot
e 74
3_M
eta
Met
acyc
lic
1
(DG
F-1,
pse
udog
ene)
, put
ativ
e am
astig
ote
cDN
A li
brar
y T.
cru
zi
cD
NA
3' s
imila
r to
cyst
eine
-ric
h
repe
at m
otif,
mR
NA
sequ
ence
Con
tig56
Tc
00.1
0470
5351
0565
.20
Fatty
aci
d de
satu
rase
, put
ativ
e am
astc
-438
TcA
M T
. cru
zi c
DN
A
Am
astig
ote
Nul
l N
ull
cl
one
15A
4 5'
, mR
NA
sequ
ence
Con
tig57
Tc
00.1
0470
5350
4071
.30
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
Nul
l N
ull
Con
tig58
Tc
00.1
0470
5351
1105
.29
Dis
pers
ed g
ene
fam
ily p
rote
in
Nul
l N
ull
Nul
l N
ull
1 (D
GF-
1), p
utat
ive
Con
tig59
N
ull
Not
iden
tified
N
ull
Nul
l N
ull
Nul
l N
ull
Nul
lC
ontig
60
Tc00
.104
7053
5113
93.6
0 Pr
otei
n ki
nase
, put
ativ
e N
ull
Nul
l N
ull
Nul
lC
ontig
61
Tc00
.104
7053
5112
11.1
70
Hea
t sho
ck p
rote
in 7
0 (H
SP70
), pu
tativ
e am
astc
-216
TcA
M T
. cru
zi c
DN
A
Am
astig
ote
257_
Met
a M
etac
yclic
cl
one
12E1
5',
mR
NA
sequ
ence
Con
tig62
Tc
00.1
0470
5350
9157
.170
Tr
ans-
sial
idas
e, p
utat
ive
Nul
l N
ull
Nul
l N
ull
Con
tig63
Tc
00.1
0470
5350
9455
.30
Hyp
othe
tical
pro
tein
, con
serv
ed
TEN
S056
8 T.
cru
zi e
pim
astig
ote
Ep
imas
tigot
e N
ull
Nul
l
norm
aliz
ed c
DN
A li
brar
y T.
cru
zi
cD
NA
clo
ne 5
68 5
', m
RN
A se
quen
ceC
ontig
64
Tc00
.104
7053
5070
83.1
09
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
Nul
l N
ull
Con
tig65
Tc
00.1
0470
5350
9931
.20
Tran
s-si
alid
ase,
put
ativ
e N
ull
Nul
l N
ull
Nul
lC
ontig
66
Tc00
.104
7053
5103
29.1
40
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
Nul
l N
ull
Con
tig67
Tc
00.1
0470
5350
9965
.110
Im
porti
n al
pha,
put
ativ
e Tc
Am
aPl0
9Run
01_G
06 T
. cru
zi
Am
astig
ote
Nul
l N
ull
am
astig
ote
cDN
A li
brar
y T.
cru
zi
cD
NA
3' s
imila
r to
impo
rtin
alph
a-lik
e
prot
ein,
mR
NA
sequ
ence
C
ontig
68
Tc00
.104
7053
5087
99.2
40
Am
inop
eptid
ase,
put
ativ
e N
ull
Nul
l 10
6_Tr
ypo.
met
a.ep
i Tr
ypom
astig
ote/
Met
acyc
lic/
Ep
imas
tigot
e
Supp
lem
enta
ry M
ater
ial S
1. C
ontin
ued.
Con
tinue
d on
nex
t pag
e
1606
©FUNPEC-RP www.funpecrp.com.brGenetics and Molecular Research 10 (3): 1589-1630 (2011)
C. Martins et al.
Que
ry
Bes
t bla
stn
hit a
gain
st
Ann
otat
ion
Bes
t bla
stn
hit a
gain
st
EST-
stag
e B
est b
last
x hi
t aga
inst
Pr
oteo
me-
stag
e C
ontig
s R
eads
T.
cru
zi g
enes
and
con
tigs
T.
cru
zi E
STs
T.
cru
zi p
rote
ome
Con
tig69
Tc
00.1
0470
5351
1617
.9
NU
P-1
prot
ein
(pse
udog
ene)
, put
ativ
e N
ull
Nul
l N
ull
Nul
lC
ontig
70
Tc00
.104
7053
5112
17.1
20
Cyc
loph
ilin,
put
ativ
e N
ull
Nul
l N
ull
Nul
lC
ontig
71
Tc00
.104
7053
5105
99.7
0 Pr
otam
ine
P1, p
utat
ive
Nul
l N
ull
Nul
l N
ull
Con
tig72
80
94.te
0001
5 D
IRE
TcA
maP
l09R
un01
_A09
T. c
ruzi
A
mas
tigot
e N
ull
Nul
l
amas
tigot
e cD
NA
libr
ary
T. c
ruzi
cD
NA
3' s
imila
r to
hypo
thet
ical
pr
otei
n, m
RN
A se
quen
ceC
ontig
73
1941
7.rr
na00
001
RN
A ri
boso
mal
N
ull
Nul
l N
ull
Nul
lC
ontig
74
Nul
l N
ot id
entifi
ed
Nul
l N
ull
Nul
l N
ull
Nul
l N
ull
Con
tig75
N
ull
195-
bp sa
telli
te D
NA
TE
NU
4403
T. c
ruzi
epi
mas
tigot
e
Epim
astig
ote
Nul
l N
ull
no
rmal
ized
cD
NA
libr
ary
T. c
ruzi
cD
NA
clo
ne 5
h21
3',
m
RN
A se
quen
ceC
ontig
76
Tc00
.104
7053
5077
15.2
0 H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l N
ull
Nul
lC
ontig
77
Nul
l 19
5-bp
sate
llite
DN
A
TEN
U44
03 T
. cru
zi e
pim
astig
ote
Ep
imas
tigot
e N
ull
Nul
l
norm
aliz
ed c
DN
A li
brar
y T.
cru
zi
cDN
A c
lone
5h2
1 3'
,
mR
NA
sequ
ence
C
ontig
78
Nul
l N
ot id
entifi
ed
Nul
l N
ull
Nul
l N
ull
Nul
lC
ontig
79
Tc00
.104
7053
5069
45.2
30
40S
ribos
omal
pro
tein
S14
, put
ativ
e
TEN
U49
84 T
. cru
zi e
pim
astig
ote
Ep
imas
tigot
e N
ull
Nul
l
norm
aliz
ed c
DN
A li
brar
y T.
cru
zi
cDN
A c
lone
43p
12 5
',
mR
NA
sequ
ence
C
ontig
80
Con
tig10
4705
3455
497
Non
-ann
otat
ed c
ontig
- M
axic
ircle
sequ
ence
TE
UF0
146
T. c
ruzi
epi
mas
tigot
e
Epim
astig
ote
Nul
l N
ull
1047
0534
5549
7
non-
norm
aliz
ed c
DN
A li
brar
y T.
cru
zi
cD
NA
clo
ne 1
46 5
' sim
ilar t
o T.
cru
zi
ge
ne fo
r his
tone
H2b
(X60
982)
,
mR
NA
sequ
ence
C
ontig
81
1184
3.rr
na00
002
RN
A ri
boso
mal
N
ull
Nul
l N
ull
Nul
lC
ontig
82
Tc00
.104
7053
5064
71.5
0 Tr
ans-
sial
idas
e (p
seud
ogen
e), p
utat
ive
TE
NU
4870
T. c
ruzi
epi
mas
tigot
e
Epim
astig
ote
Nul
l N
ull
no
rmal
ized
cD
NA
libr
ary
T. c
ruzi
cD
NA
clo
ne 4
3g22
5',
m
RN
A se
quen
ce
Con
tig83
Tc
00.1
0470
5351
0275
.130
D
ispe
rsed
gen
e fa
mily
pro
tein
N
ull
Nul
l N
ull
Nul
l
1
(DG
F-1)
, put
ativ
eC
ontig
84
1941
7.rr
na00
001
RN
A ri
boso
mal
TE
NQ
0845
T. c
ruzi
epi
mas
tigot
e
Epim
astig
ote
Nul
l N
ull
no
rmal
ized
cD
NA
libr
ary
T. c
ruzi
cDN
A, m
RN
A se
quen
ce
Con
tinue
d on
nex
t pag
e
Supp
lem
enta
ry M
ater
ial S
1. C
ontin
ued.
1607
©FUNPEC-RP www.funpecrp.com.brGenetics and Molecular Research 10 (3): 1589-1630 (2011)
Transcriptome of Trypanosoma cruzi mammalian stages
Que
ry
Bes
t bla
stn
hit a
gain
st
Ann
otat
ion
Bes
t bla
stn
hit a
gain
st
EST-
stag
e B
est b
last
x hi
t aga
inst
Pr
oteo
me-
stag
e C
ontig
s R
eads
T.
cru
zi g
enes
and
con
tigs
T.
cru
zi E
STs
T.
cru
zi p
rote
ome
Con
tig85
Tc
00.1
0470
5350
8879
.130
Te
lom
eras
e co
mpo
nent
(pse
udog
ene)
, put
ativ
e N
ull
Nul
l N
ull
Nul
lC
ontig
86
Tc00
.104
7053
5101
87.2
10
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
Nul
l N
ull
Con
tig87
Tc
00.1
0470
5351
1283
.240
H
ypot
hetic
al p
rote
in, c
onse
rved
TE
NU
0619
T. c
ruzi
epi
mas
tigot
e Ep
imas
tigot
e N
ull
Nul
l
norm
aliz
ed c
DN
A li
brar
y T.
cru
zi
cD
NA
clo
ne 3
c1 3
',
mR
NA
sequ
ence
Con
tig88
74
91.te
0001
1 L1
Tc p
oten
tially
func
tiona
l TE
NG
0433
T. c
ruzi
epi
mas
tigot
e
Epim
astig
ote
Nul
l N
ull
no
rmal
ised
cD
NA
libr
ary
T. c
ruzi
cDN
A c
lone
n78
2.r 5
',
mR
NA
sequ
ence
C
ontig
89
Tc00
.104
7053
5074
79.2
9 R
etro
trans
poso
n ho
t spo
t pro
tein
N
ull
Nul
l 48
8_M
eta
Met
acyc
lic
(R
HS,
pse
udog
ene)
, put
ativ
eC
ontig
90
Tc00
.104
7053
5098
65.9
R
etro
trans
poso
n ho
t spo
t (R
HS)
N
ull
Nul
l 23
8_A
ma.
trypo
.met
a A
mas
tigot
e/
pr
otei
n, p
utat
ive
Tr
ypom
astig
ote/
Met
acyc
licC
ontig
91
Tc00
.104
7053
5088
37.3
4 D
ispe
rsed
gen
e fa
mily
pro
tein
Tc
Am
aPl0
9Run
01_B
10 T
. cru
zi
Am
astig
ote
743_
Met
a M
etac
yclic
1 (D
GF-
1), p
utat
ive
amas
tigot
e cD
NA
libr
ary
T. c
ruzi
cDN
A 3
' sim
ilar t
o cy
stei
ne-r
ich
re
peat
mot
if, m
RN
A se
quen
ce
Con
tig92
Tc
00.1
0470
5350
7543
.30
Ret
rotra
nspo
son
hot s
pot p
rote
in
Nul
l N
ull
Nul
l N
ull
(RH
S, p
seud
ogen
e), p
utat
ive
Con
tig93
Tc
00.1
0470
5350
6625
.60
Inos
itol 5
-pho
spha
tase
1, p
utat
ive
Nul
l N
ull
Nul
l N
ull
Con
tig94
Tc
00.1
0470
5351
1675
.3
Hyp
othe
tical
pro
tein
N
ull
Nul
l N
ull
Nul
lC
ontig
95
Tc00
.104
7053
5064
81.3
0 H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l N
ull
Nul
lC
ontig
96
Tc00
.104
7053
5114
15.3
0 R
etro
trans
poso
n ho
t spo
t Pro
tein
N
ull
Nul
l N
ull
Nul
l
(R
HS,
pse
udog
ene)
, put
ativ
e
Supp
lem
enta
ry M
ater
ial S
1. C
ontin
ued.
1608
©FUNPEC-RP www.funpecrp.com.brGenetics and Molecular Research 10 (3): 1589-1630 (2011)
C. Martins et al.
Que
ry
Bes
t bla
stn
hit a
gain
st
Ann
otat
ion
Bes
t bla
stn
hit a
gain
st
EST-
stag
e B
est b
last
x hi
t aga
inst
Pr
oteo
me-
stag
e C
ontig
s R
eads
T.
cru
zi g
enes
and
con
tigs
T.
cru
zi E
STs
T.
cru
zi p
rote
ome
TCA
006-
IQ22
5C_A
05_.
B_0
1.ab
1 Tc
00.1
0470
5351
1369
.30
Elon
gatio
n fa
ctor
1-a
lpha
TE
NG
0420
T. c
ruzi
epi
mas
tigot
e
Epim
astig
ote
Nul
l N
ull
(E
F-1-
alph
a), p
utat
ive
norm
alis
ed c
DN
A li
brar
y T.
cru
zi
cDN
A c
lone
n76
8.r 5
',
mR
NA
sequ
ence
TC
A00
6-IQ
225C
_A06
_.B
_02.
ab1
Tc00
.104
7053
5081
43.9
0 Tr
ans-
sial
idas
e (p
seud
ogen
e), p
utat
ive
Nul
l N
ull
Nul
l N
ull
TC
A00
6-IQ
225C
_A07
_.B
_01.
ab1
Tc00
.104
7053
5065
31.4
0 H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l N
ull
Nul
l
TCA
006-
IQ22
5C_A
10_.
B_0
2.ab
1 Tc
00.1
0470
5351
1249
.110
K
ines
in K
39 (p
seud
ogen
e), p
utat
ive
Nul
l N
ull
Nul
l N
ull
TC
A00
6-IQ
225C
_A12
_.B
_02.
ab1
Tc00
.104
7053
5101
23.1
0 H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l N
ull
Nul
l
TCA
006-
IQ22
5C_B
03_.
B_0
3.ab
1 Tc
00.1
0470
5350
4277
.30
Hyp
othe
tical
pro
tein
N
ull
Nul
l N
ull
Nul
l
TCA
006-
IQ22
5C_B
06_.
B_0
4.ab
1 Tc
00.1
0470
5351
1173
.64
Muc
in-a
ssoc
iate
d su
rfac
e pr
otei
n
TEN
S242
2 T.
cru
zi e
pim
astig
ote
Epim
astig
ote
Nul
l N
ull
(M
ASP
), pu
tativ
e no
rmal
ized
cDN
A lib
rary
T. c
ruzi
cD
NA
clon
e 24
22 5
',
mR
NA
sequ
ence
TCA
006-
IQ22
5C_B
08_.
B_0
4.ab
1 Tc
00.1
0470
5350
7583
.60
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
Nul
l N
ull
TC
A00
6-IQ
225C
_B09
_.B
_03.
ab1
Tc00
.104
7053
5079
41.1
00
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
Nul
l N
ull
TC
A00
6-IQ
225C
_B10
_.B
_04.
ab1
Tc00
.104
7053
5119
03.1
40
Tubu
lin fo
ldin
g co
fact
or
Nul
l N
ull
Nul
l N
ull
D
(pse
udog
ene)
, put
ativ
eTC
A00
6-IQ
225C
_C01
_.B
_05.
ab1
Tc00
.104
7053
5101
01.3
40
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
Nul
l N
ull
TC
A00
6-IQ
225C
_C03
_.B
_05.
ab1
Tc00
.104
7053
5043
93.1
0 H
ypot
hetic
al p
rote
in
TEN
G05
14 T
. cru
zi e
pim
astig
ote
Ep
imas
tigot
e N
ull
Nul
l
no
rmal
ised
cD
NA
libr
ary
T. c
ruzi
cD
NA
clo
ne n
894.
r 5',
m
RNA
sequ
ence
TC
A00
6-IQ
225C
_C04
_.B
_06.
ab1
Tc00
.104
7053
5067
97.2
0 H
ypot
hetic
al p
rote
in
Nul
l N
ull
Nul
l N
ull
TC
A00
6-IQ
225C
_C05
_.B
_05.
ab1
Tc00
.104
7053
4040
01.2
0 H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l N
ull
Nul
l
TCA
006-
IQ22
5C_C
06_.
B_0
6.ab
1 Tc
00.1
0470
5350
6203
.40
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
Nul
l N
ull
TC
A00
6-IQ
225C
_C12
_.B
_06.
ab1
Tc00
.104
7053
5100
13.2
6 Su
rfac
e pr
otea
se G
P63
N
ull
Nul
l N
ull
Nul
l
(pse
udog
ene)
, put
ativ
eTC
A00
6-IQ
225C
_D01
_.B
_07.
ab1
Tc00
.104
7053
5108
77.1
90
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
Nul
l N
ull
TC
A00
6-IQ
225C
_D03
_.B
_07.
ab1
Tc00
.104
7053
5111
93.9
D
ispe
rsed
gen
e fa
mily
pro
tein
N
ull
Nul
l N
ull
Nul
l
1 (D
GF-
1, p
seud
ogen
e), p
utat
ive
TCA
006-
IQ22
5C_D
04_.
B_0
8.ab
1 Tc
00.1
0470
5350
8811
.55
Hyp
othe
tical
pro
tein
am
astc
-122
5 Tc
AM
T. c
ruzi
A
mas
tigot
e N
ull
Nul
l
cD
NA
clo
ne 9
G8
5',
m
RN
A se
quen
ce
TCA
006-
IQ22
5C_D
07_.
B_0
7.ab
1 Tc
00.1
0470
5350
9011
.90
Lipa
se, p
utat
ive
Nul
l N
ull
Nul
l N
ull
TC
A00
6-IQ
225C
_D08
_.B
_08.
ab1
Tc00
.104
7053
5076
41.2
50
Hyp
othe
tical
pro
tein
N
ull
Nul
l N
ull
Nul
l
TCA
006-
IQ22
5C_D
09_.
B_0
7.ab
1 Tc
00.1
0470
5346
6531
.9
Dis
pers
ed g
ene
fam
ily p
rote
in
Nul
l N
ull
Nul
l N
ull
1
(DG
F-1)
, put
ativ
e
Supp
lem
enta
ry M
ater
ial S
1. A
nnot
atio
n - O
RES
TES
sing
leto
ns.
Con
tinue
d on
nex
t pag
e
1609
©FUNPEC-RP www.funpecrp.com.brGenetics and Molecular Research 10 (3): 1589-1630 (2011)
Transcriptome of Trypanosoma cruzi mammalian stages
Que
ry
Bes
t bla
stn
hit a
gain
st
Ann
otat
ion
Bes
t bla
stn
hit a
gain
st
EST-
stag
e B
est b
last
x hi
t aga
inst
Pr
oteo
me-
stag
e C
ontig
s R
eads
T.
cru
zi g
enes
and
con
tigs
T.
cru
zi E
STs
T.
cru
zi p
rote
ome
TCA
006-
IQ22
5C_D
10_.
B_0
8.ab
1 Tc
00.1
0470
5350
6401
.240
H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l N
ull
Nul
l
TCA
006-
IQ22
5C_E
01_.
B_0
9.ab
1 Tc
00.1
0470
5351
0565
.179
H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l N
ull
Nul
l
TCA
006-
IQ22
5C_E
04_.
B_1
0.ab
1 Tc
00.1
0470
5350
4277
.30
Hyp
othe
tical
pro
tein
SG
-4-6
T. c
ruzi
diff
eren
tial
Not
spec
ified
N
ull
Nul
l
di
spla
y cD
NA
libr
ary
T. c
ruzi
cDN
A, m
RN
A se
quen
ce
TCA
006-
IQ22
5C_E
06_.
B_1
0.ab
1 Tc
00.1
0470
5351
0533
.210
H
ypot
hetic
al p
rote
in,
Nul
l N
ull
Nul
l N
ull
co
nser
ved
(pse
udog
ene)
TCA
006-
IQ22
5C_E
08_.
B_1
0.ab
1 Tc
00.1
0470
5350
8781
.70
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
Nul
l N
ull
TC
A00
6-IQ
225C
_E11
_.B
_09.
ab1
NU
LL
Not
iden
tified
N
ull
Nul
l N
ull
Nul
l
Nul
l N
ull
TCA
006-
IQ22
5C_F
03_.
B_1
1.ab
1 Tc
00.1
0470
5351
0791
.60
Muc
in-a
ssoc
iate
d su
rfac
e pr
otei
n
TcA
maP
l09R
un01
_F04
A
mas
tigot
e N
ull
Nul
l
(M
ASP
, pse
udog
ene)
, put
ativ
e T.
cru
zi a
mas
tigot
e cD
NA
libra
ry T
. cru
zi c
DN
A
3'
sim
ilar t
o ga
g-re
late
d
prot
ein,
mR
NA
sequ
ence
TCA
006-
IQ22
5C_F
04_.
B_1
2.ab
1 Tc
00.1
0470
5351
1819
.59
Dyn
ein
heav
y ch
ain,
cyt
osol
ic, p
utat
ive
Nul
l N
ull
Nul
l N
ull
TC
A00
6-IQ
225C
_F08
_.B
_12.
ab1
Tc00
.104
7053
5098
71.8
0 D
ispe
rsed
gen
e fa
mily
pro
tein
TE
NS1
221
T. c
ruzi
epi
mas
tigot
e
Epim
astig
ote
Nul
l N
ull
1
(DG
F-1)
, put
ativ
e no
rmal
ized
cD
NA
libr
ary
T.
cru
zi c
DN
A c
lone
122
1 5'
,
mR
NA
sequ
ence
TCA
006-
IQ22
5C_G
02_.
B_1
4.ab
1 Tc
00.1
0470
5350
4929
.5
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
Nul
l N
ull
TC
A00
6-IQ
225C
_G05
_.B
_13.
ab1
Tc00
.104
7053
5069
49.2
0 H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l 43
5_M
eta.
epi
Met
acyc
lic,
epim
astig
ote
TC
A00
6-IQ
225C
_G06
_.B
_14.
ab1
Tc00
.104
7053
5111
33.3
0 H
ypot
hetic
al p
rote
in, c
onse
rved
TE
NU
0543
T. c
ruzi
epi
mas
tigot
e
Epim
astig
ote
Nul
l N
ull
norm
aliz
ed c
DN
A li
brar
y T.
cru
zi
cDN
A c
lone
8h4
3',
m
RN
A se
quen
ce
TCA
006-
IQ22
5C_G
07_.
B_1
3.ab
1 Tc
00.1
0470
5350
8165
.430
Tr
ans-
sial
idas
e, p
utat
ive
Nul
l N
ull
Nul
l N
ull
TC
A00
6-IQ
225C
_G11
_.B
_13.
ab1
Tc00
.104
7053
5067
99.1
50
Surf
ace
prot
ease
GP6
3
Nul
l N
ull
Nul
l N
ull
(p
seud
ogen
e), p
utat
ive
TCA
006-
IQ22
5C_H
02_.
B_1
6.ab
1 Tc
00.1
0470
5350
9775
.30
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
Nul
l N
ull
TC
A00
6-IQ
225C
_H04
_.B
_16.
ab1
Tc00
.104
7053
5064
01.7
0 Va
cuol
ar p
rote
in so
rting
N
ull
Nul
l N
ull
Nul
l
prot
ein
18, p
utat
ive
TCA
006-
IQ22
5C_H
05_.
B_1
5.ab
1 Tc
00.1
0470
5350
8277
.380
H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l N
ull
Nul
l
TCA
006-
IQ22
5C_H
08_.
B_1
6.ab
1 Tc
00.1
0470
5350
3881
.20
GD
P-m
anno
se 4
,6 d
ehyd
rata
se, p
utat
ive
Nul
l N
ull
Nul
l N
ull
TC
A00
6-IQ
225C
_H10
_.B
_16.
ab1
Tc00
.104
7053
5074
27.3
0 R
etro
trans
poso
n ho
t spo
t pro
tein
N
ull
Nul
l 76
1_M
eta.
epi
Met
acyc
lic,
(RH
S, p
seud
ogen
e), p
utat
ive
ep
imas
tigot
e TC
A00
6-IQ
225C
_H11
_.B
_15.
ab1
Nul
l N
ot id
entifi
ed
Nul
l N
ull
Nul
l N
ull
N
ull
Nul
l
Supp
lem
enta
ry M
ater
ial S
1. C
ontin
ued.
Con
tinue
d on
nex
t pag
e
1610
©FUNPEC-RP www.funpecrp.com.brGenetics and Molecular Research 10 (3): 1589-1630 (2011)
C. Martins et al.
Que
ry
Bes
t bla
stn
hit a
gain
st
Ann
otat
ion
Bes
t bla
stn
hit a
gain
st
EST-
stag
e B
est b
last
x hi
t aga
inst
Pr
oteo
me-
stag
e C
ontig
s R
eads
T.
cru
zi g
enes
and
con
tigs
T.
cru
zi E
STs
T.
cru
zi p
rote
ome
TCA
006-
IQ22
5C_H
12_.
B_1
6.ab
1 N
ull
Not
iden
tified
N
ull
Nul
l N
ull
Nul
l N
ull
Nul
lTC
A00
6-IQ
225E
_A03
_.B
_01.
ab1
Tc00
.104
7053
5079
53.1
30
Hyp
othe
tical
pro
tein
TE
NU
4024
T. c
ruzi
epi
mas
tigot
e
Epim
astig
ote
Nul
l N
ull
norm
aliz
ed c
DN
A li
brar
y T.
cru
zi
cDN
A c
lone
1l2
3',
m
RN
A se
quen
ce
TCA
006-
IQ22
5E_A
04_.
B_0
2.ab
1 Tc
00.1
0470
5350
3697
.60
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
Nul
l N
ull
TC
A00
6-IQ
225E
_A06
_.B
_02.
ab1
Tc00
.104
7053
5082
13.3
0 H
ypot
hetic
al p
rote
in
Nul
l N
ull
Nul
l N
ull
TC
A00
6-IQ
225E
_A08
_.B
_02.
ab1
Tc00
.104
7053
5101
71.1
0 Tr
ans-
sial
idas
e, p
utat
ive
Nul
l N
ull
Nul
l N
ull
TC
A00
6-IQ
225E
_A09
_.B
_01.
ab1
Tc00
.104
7053
5089
49.1
0 H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l N
ull
Nul
l
TCA
006-
IQ22
5E_A
10_.
B_0
2.ab
1 Tc
00.1
0470
5350
9433
.30
Hyp
othe
tical
pro
tein
N
ull
Nul
l N
ull
Nul
l
TCA
006-
IQ22
5E_B
02_.
B_0
4.ab
1 Tc
00.1
0470
5351
0101
.340
H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l N
ull
Nul
lTC
A00
6-IQ
225E
_B03
_.B
_03.
ab1
Tc00
.104
7053
5069
61.1
30
Ret
rotra
nspo
son
hot s
pot p
rote
in
Nul
l N
ull
761_
Met
a.ep
i M
etac
yclic
,
(R
HS,
pse
udog
ene)
, put
ativ
e
epim
astig
ote
TCA
006-
IQ22
5E_C
01_.
B_0
5.ab
1 Tc
00.1
0470
5351
0985
.10
Tran
s-si
alid
ase,
put
ativ
e N
ull
Nul
l N
ull
Nul
l
TCA
006-
IQ22
5E_C
03_.
B_0
5.ab
1 Tc
00.1
0470
5350
8277
.350
C
atio
n tra
nspo
rter,
puta
tive
Nul
l N
ull
Nul
l N
ull
TC
A00
6-IQ
225E
_C07
_.B
_05.
ab1
Nul
l N
ot id
entifi
ed
Nul
l N
ull
Nul
l N
ull
Nul
l N
ull
TCA
006-
IQ22
5E_C
09_.
B_0
5.ab
1 Tc
00.1
0470
5350
6321
.270
H
ypot
hetic
al p
rote
in, c
onse
rved
TE
NU
2275
T. c
ruzi
epi
mas
tigot
e
Epim
astig
ote
Nul
l N
ull
no
rmal
ized
cD
NA
libr
ary
T. c
ruzi
cD
NA
clo
ne 1
0k16
3’,
m
RN
A se
quen
ceTC
A00
6-IQ
225E
_C12
_.B
_06.
ab1
Nul
l N
ot id
entifi
ed
Nul
l N
ull
Nul
l N
ull
Nul
l N
ull
TCA
006-
IQ22
5E_D
01_.
B_0
7.ab
1 N
ull
Not
iden
tified
N
ull
Nul
l N
ull
Nul
l N
ull
Nul
lTC
A00
6-IQ
225E
_D08
_.B
_08.
ab1
Tc00
.104
7053
5072
11.2
0 Pr
otei
n ki
nase
, put
ativ
e N
ull
Nul
l N
ull
Nul
l
TCA
006-
IQ22
5E_D
09_.
B_0
7.ab
1 Tc
00.1
0470
5351
0325
.69
Am
inop
eptid
ase,
put
ativ
e N
ull
Nul
l N
ull
Nul
l
TCA
006-
IQ22
5E_D
10_.
B_0
8.ab
1 Tc
00.1
0470
5350
6799
.10
Fatty
aci
d tra
nspo
rter
Nul
l N
ull
Nul
l N
ull
pr
otei
n-lik
e, p
utat
ive
TCA
006-
IQ22
5E_D
11_.
B_0
7.ab
1 C
ontig
1047
0535
0404
5 M
atch
es th
e 3’
end
of t
he
Nul
l N
ull
Nul
l N
ull
1047
0535
0404
5
co
ding
regi
on o
f the
gen
e
Tc
00.1
0470
5350
4045
.50
and
dow
nstre
am fl
anki
ng re
gion
TCA
006-
IQ22
5E_D
12_.
B_0
8.ab
1 Tc
00.1
0470
5350
8325
.290
Tr
ans-
sial
idas
e (p
seud
ogen
e),
Nul
l N
ull
Nul
l N
ull
pu
tativ
eTC
A00
6-IQ
225E
_E04
_.B
_10.
ab1
Tc00
.104
7053
5101
67.2
9 Ex
osom
e co
mpl
ex
Nul
l N
ull
Nul
l N
ull
ex
onuc
leas
e, p
utat
ive
TCA
006-
IQ22
5E_E
08_.
B_1
0.ab
1 Tc
00.1
0470
5350
8173
.264
H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l N
ull
Nul
l
TCA
006-
IQ22
5E_E
11_.
B_0
9.ab
1 Tc
00.1
0470
5350
3803
.10
rRN
A b
ioge
nesi
s pro
tein
, put
ativ
e N
ull
Nul
l N
ull
Nul
l
TCA
006-
IQ22
5E_E
12_.
B_1
0.ab
1 N
ull
Not
iden
tified
N
ull
Nul
l N
ull
Nul
l N
ull
Nul
l
Supp
lem
enta
ry M
ater
ial S
1. C
ontin
ued.
Con
tinue
d on
nex
t pag
e
1611
©FUNPEC-RP www.funpecrp.com.brGenetics and Molecular Research 10 (3): 1589-1630 (2011)
Transcriptome of Trypanosoma cruzi mammalian stages
Con
tinue
d on
nex
t pag
e
Que
ry
Bes
t bla
stn
hit a
gain
st
Ann
otat
ion
Bes
t bla
stn
hit a
gain
st
EST-
stag
e B
est b
last
x hi
t aga
inst
Pr
oteo
me-
stag
e C
ontig
s R
eads
T.
cru
zi g
enes
and
con
tigs
T.
cru
zi E
STs
T.
cru
zi p
rote
ome
TCA
006-
IQ22
5E_F
03_.
B_1
1.ab
1 Tc
00.1
0470
5350
6243
.94
Ret
rotra
nspo
son
hot s
pot p
rote
in
Nul
l N
ull
Nul
l N
ull
(R
HS,
pse
udog
ene)
, put
ativ
eTC
A00
6-IQ
225E
_F07
_.B
_11.
ab1
Tc00
.104
7053
5087
19.9
H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l N
ull
Nul
l
TCA
006-
IQ22
5E_F
10_.
B_1
2.ab
1 Tc
00.1
0470
5350
7747
.10
Surf
ace
prot
ease
GP6
3, p
utat
ive
Nul
l N
ull
571_
Met
a M
etac
yclic
TCA
006-
IQ22
5E_F
11_.
B_1
1.ab
1 Tc
00.1
0470
5350
7809
.60
Hyp
othe
tical
pro
tein
, con
serv
ed
TEN
S228
9 T.
cru
zi e
pim
astig
ote
Ep
imas
tigot
e N
ull
Nul
l
no
rmal
ized
cD
NA
libr
ary
T. c
ruzi
cD
NA
clo
ne 2
289
5',
m
RN
A se
quen
ce
TCA
006-
IQ22
5E_F
12_.
B_1
2.ab
1 N
ull
Not
iden
tified
N
ull
Nul
l N
ull
Nul
l N
ull
Nul
lTC
A00
6-IQ
225E
_G03
_.B
_13.
ab1
Tc00
.104
7053
5108
57.1
0 H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l N
ull
Nul
l
TCA
006-
IQ22
5E_G
04_.
B_1
4.ab
1 Tc
00.1
0470
5350
9171
.70
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
Nul
l N
ull
TCA
006-
IQ22
5E_G
05_.
B_1
3.ab
1 Tc
00.1
0470
5351
0053
.79
FG-G
AP
repe
at p
rote
in, p
utat
ive
Nul
l N
ull
Nul
l N
ull
TC
A00
6-IQ
225E
_H01
_.B
_15.
ab1
Tc00
.104
7053
4539
17.9
H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l 33
_Try
po.m
eta
Tryp
omas
tigot
e,
met
acyc
licTC
A00
6-IQ
225E
_H02
_.B
_16.
ab1
Tc00
.104
7053
5067
95.9
0 H
ypot
hetic
al p
rote
in, c
onse
rved
TE
NU
0078
T. c
ruzi
epi
mas
tigot
e
Epim
astig
ote
Nul
l N
ull
norm
aliz
ed c
DN
A li
brar
y T.
cru
zi
cDN
A c
lone
5e1
1 3’
,
mR
NA
sequ
ence
TC
A00
6-IQ
225E
_H05
_.B
_15.
ab1
Tc00
.104
7053
5068
11.1
60
R27
-2 p
rote
in, p
utat
ive
Nul
l N
ull
Nul
l N
ull
TC
A00
6-IQ
225E
_H06
_.B
_16.
ab1
Tc00
.104
7053
5038
93.1
70
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
Nul
l N
ull
TC
A00
6-IQ
225E
_H07
_.B
_15.
ab1
Tc00
.104
7053
5094
51.7
0 H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l N
ull
Nul
l
TCA
006-
IQ22
5E_H
09_.
B_1
5.ab
1 Tc
00.1
0470
5350
6925
.70
Epsi
n, p
utat
ive
Nul
l N
ull
Nul
l N
ull
TC
A00
6-IQ
225E
_H11
_.B
_15.
ab1
Nul
l N
ot id
entifi
ed
Nul
l N
ull
Nul
l N
ull
Nul
l N
ull
TCA
006-
IQ22
5F_A
02_.
b_02
.ab1
N
ull
Not
iden
tified
N
ull
Nul
l N
ull
Nul
l N
ull
Nul
lTC
A00
6-IQ
225F
_A04
_.b_
02.a
b1
Tc00
.104
7053
5101
01.3
40
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
Nul
l N
ull
TC
A00
6-IQ
225F
_A10
_.b_
02.a
b1
Tc00
.104
7053
5084
45.1
0 ra
b11B
GTP
ase,
put
ativ
e N
ull
Nul
l N
ull
Nul
l
TCA
006-
IQ22
5F_B
01_.
b_03
.ab1
Tc
00.1
0470
5350
9979
.249
Sy
ntax
in b
indi
ng p
rote
in
Nul
l N
ull
Nul
l N
ull
1
(pse
udog
ene)
, put
ativ
eTC
A00
6-IQ
225F
_C01
_.b_
05.a
b1
Tc00
.104
7053
5041
05.1
30
Cal
cium
cha
nnel
pro
tein
, put
ativ
e N
ull
Nul
l N
ull
Nul
l
TCA
006-
IQ22
5F_C
03_.
b_05
.ab1
N
ull
Not
iden
tified
N
ull
Nul
l N
ull
Nul
l N
ull
Nul
lTC
A00
6-IQ
225F
_C07
_.b_
05.a
b1
Tc00
.104
7053
5101
01.3
40
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
Nul
l N
ull
TC
A00
6-IQ
225F
_C11
_.b_
05.a
b1
Tc00
.104
7053
5092
33.7
0 A
AA
ATP
ase,
put
ativ
e TE
NU
3420
T. c
ruzi
epi
mas
tigot
e
Epim
astig
ote
Nul
l N
ull
norm
aliz
ed c
DN
A li
brar
y T.
cru
zi
cDN
A c
lone
29c
14 5
’,
mR
NA
sequ
ence
TC
A00
6-IQ
225F
_D03
_.b_
07.a
b1
Tc00
.104
7053
5096
41.6
0 H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l N
ull
Nul
l
Supp
lem
enta
ry M
ater
ial S
1. C
ontin
ued.
1612
©FUNPEC-RP www.funpecrp.com.brGenetics and Molecular Research 10 (3): 1589-1630 (2011)
C. Martins et al.
Que
ry
Bes
t bla
stn
hit a
gain
st
Ann
otat
ion
Bes
t bla
stn
hit a
gain
st
EST-
stag
e B
est b
last
x hi
t aga
inst
Pr
oteo
me-
stag
e C
ontig
s R
eads
T.
cru
zi g
enes
and
con
tigs
T.
cru
zi E
STs
T.
cru
zi p
rote
ome
TCA
006-
IQ22
5F_D
04_.
b_08
.ab1
Tc
00.1
0470
5350
3715
.40
Prot
ein
kina
se, p
utat
ive
Nul
l N
ull
62_a
ma.
epi
Am
astig
ote,
ep
imas
tigot
e
TCA
006-
IQ22
5F_E
01_.
b_09
.ab1
N
ull
Not
iden
tified
N
ull
Nul
l N
ull
Nul
l N
ull
Nul
lTC
A00
6-IQ
225F
_E07
_.b_
09.a
b1
Tc00
.104
7053
4136
05.1
0 K
ines
in, p
utat
ive
Nul
l N
ull
Nul
l N
ull
TC
A00
6-IQ
225F
_E08
_.b_
10.a
b1
Tc00
.104
7053
5062
65.1
00
ATP-
depe
nden
t RN
A h
elic
ase,
put
ativ
e N
ull
Nul
l N
ull
Nul
l
TCA
006-
IQ22
5F_F
05_.
b_11
.ab1
Tc
00.1
0470
5351
1745
.10
Hea
t sho
ck 7
0 kD
a pr
otei
n,
TEN
F015
2 T.
cru
zi
Epim
astig
ote
Nul
l N
ull
mito
chon
dria
l pre
curs
or, p
utat
ive
epim
astig
ote
norm
aliz
ed
cD
NA
libr
ary
T. c
ruzi
cDN
A c
lone
152
5' s
imila
r
to m
itoch
ondr
ial H
sp70
,
mR
NA
sequ
ence
TC
A00
6-IQ
225F
_F12
_.b_
12.a
b1
Tc00
.104
7053
5081
73.6
0 C
is-p
reny
ltran
sfer
ase,
put
ativ
e N
ull
Nul
l N
ull
Nul
l
TCA
006-
IQ22
5F_G
01_.
b_13
.ab1
Tc
00.1
0470
5350
6797
.10
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
Nul
l N
ull
TC
A00
6-IQ
225F
_G08
_.b_
14.a
b1
Tc00
.104
7053
5047
97.1
44
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
Nul
l N
ull
TC
A00
6-IQ
225F
_H01
_.b_
15.a
b1
Nul
l N
ot id
entifi
ed
Nul
l N
ull
Nul
l N
ull
Nul
l N
ull
TCA
006-
IQ22
5F_H
02_.
b_16
.ab1
Tc
00.1
0470
5351
0281
.20
Surf
ace
prot
ease
GP6
3, p
utat
ive
Nul
l N
ull
Nul
l N
ull
TC
A00
6-IQ
225F
_H04
_.b_
16.a
b1
Con
tig10
4705
3503
717
Mat
ches
the
inte
r-cod
ing
N
ull
Nul
l N
ull
Nul
l 10
4705
3503
717
regi
on o
f the
gen
es
Tc00
.104
7053
5037
17.9
0
an
d Tc
00.1
0470
5350
3717
.120
TCA
006-
IQ22
5F_H
07_.
b_15
.ab1
Tc
00.1
0470
5350
4427
.30
Hyp
othe
tical
pro
tein
, con
serv
ed
TEN
U11
96 T
. cru
zi
Epim
astig
ote
Nul
l N
ull
epim
astig
ote
norm
aliz
ed
cD
NA
libr
ary
T. c
ruzi
cDN
A c
lone
20i
24 3
’,
mR
NA
sequ
ence
TC
A00
6-IQ
225F
_H08
_.b_
16.a
b1
Tc00
.104
7053
5096
95.1
50
Kin
esin
, put
ativ
e N
ull
Nul
l N
ull
Nul
l
TCA
006-
IQ22
5G_A
01_.
b_01
.ab1
Tc
00.1
0470
5351
0423
.40
Hyp
othe
tical
pro
tein
, con
serv
ed
amas
tc-1
111
TcA
M T
. cru
zi
Am
astig
ote
Nul
l N
ull
cDN
A c
lone
8E1
1 5’
,
mR
NA
sequ
ence
TC
A00
6-IQ
225G
_A05
_.b_
01.a
b1
Tc00
.104
7053
5062
13.6
0 40
S rib
osom
al p
rote
in S
2, p
utat
ive
amas
tc-1
15 T
cAM
T. c
ruzi
A
mas
tigot
e 37
_am
a.ep
i A
mas
tigot
e,
cDN
A c
lone
11D
2 5’
,
ep
imas
tigot
e
mR
NA
sequ
ence
TC
A00
6-IQ
225G
_A07
_.b_
01.a
b1
Tc00
.104
7053
5064
39.1
8 H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l N
ull
Nul
l
TCA
006-
IQ22
5G_A
08_.
b_02
.ab1
Tc
00.1
0470
5350
6401
.290
H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l N
ull
Nul
l
TCA
006-
IQ22
5G_A
12_.
b_02
.ab1
N
ull
Not
iden
tified
N
ull
Nul
l N
ull
Nul
l N
ull
Nul
lTC
A00
6-IQ
225G
_B02
_.b_
04.a
b1
Tc00
.104
7053
5118
27.1
10
Tran
s-si
alid
ase,
put
ativ
e N
ull
Nul
l N
ull
Nul
l
Supp
lem
enta
ry M
ater
ial S
1. C
ontin
ued.
Con
tinue
d on
nex
t pag
e
1613
©FUNPEC-RP www.funpecrp.com.brGenetics and Molecular Research 10 (3): 1589-1630 (2011)
Transcriptome of Trypanosoma cruzi mammalian stages
Que
ry
Bes
t bla
stn
hit a
gain
st
Ann
otat
ion
Bes
t bla
stn
hit a
gain
st
EST-
stag
e B
est b
last
x hi
t aga
inst
Pr
oteo
me-
stag
e C
ontig
s R
eads
T.
cru
zi g
enes
and
con
tigs
T.
cru
zi E
STs
T.
cru
zi p
rote
ome
TCA
006-
IQ22
5G_B
04_.
b_04
.ab1
Tc
00.1
0470
5350
9887
.19
Dis
pers
ed g
ene
fam
ily p
rote
in
Nul
l N
ull
Nul
l N
ull
1
(DG
F-1)
, put
ativ
eTC
A00
6-IQ
225G
_B09
_.b_
03.a
b1
Tc00
.104
7053
5117
79.6
0 Tr
ans-
sial
idas
e, p
utat
ive
Nul
l N
ull
Nul
l N
ull
TC
A00
6-IQ
225G
_B11
_.b_
03.a
b1
Tc00
.104
7053
5075
05.1
0 Tr
ans-
sial
idas
e, p
utat
ive
TEN
S077
9 T.
cru
zi
Epim
astig
ote
Nul
l N
ull
epim
astig
ote
norm
aliz
ed
cD
NA
libr
ary
T. c
ruzi
cDN
A c
lone
779
5',
m
RN
A se
quen
ce
TCA
006-
IQ22
5G_B
12_.
b_04
.ab1
N
ull
Not
iden
tified
N
ull
Nul
l N
ull
Nul
l N
ull
Nul
lTC
A00
6-IQ
225G
_C01
_.b_
05.a
b1
Tc00
.104
7053
5070
19.1
0 Pr
otei
n ki
nase
, put
ativ
e N
ull
Nul
l N
ull
Nul
lTC
A00
6-IQ
225G
_C06
_.b_
06.a
b1
Tc00
.104
7053
5107
73.9
H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l N
ull
Nul
l
TCA
006-
IQ22
5G_C
07_.
b_05
.ab1
Tc
00.1
0470
5341
4243
.20
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
Nul
l N
ull
TC
A00
6-IQ
225G
_C10
_.b_
06.a
b1
Tc00
.104
7053
5118
25.1
10
Hyp
othe
tical
pro
tein
TE
NF0
697
T. c
ruzi
Ep
imas
tigot
e N
ull
Nul
l
ep
imas
tigot
e no
rmal
ized
cDN
A li
brar
y T.
cru
zi
cD
NA
clo
ne 6
97 5
',
mR
NA
sequ
ence
TC
A00
6-IQ
225G
_C11
_.b_
05.a
b1
Tc00
.104
7053
5091
69.1
0 H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l N
ull
Nul
l
TCA
006-
IQ22
5G_C
12_.
b_06
.ab1
N
ull
Not
iden
tified
N
ull
Nul
l 66
_am
a.ep
i A
mas
tigot
e,
Nul
l N
ull
epim
astig
ote
TCA
006-
IQ22
5G_D
06_.
b_08
.ab1
Tc
00.1
0470
5350
6305
.5
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
Nul
l N
ull
TC
A00
6-IQ
225G
_D10
_.b_
08.a
b1
Tc00
.104
7053
5099
21.6
0 D
ispe
rsed
gen
e fa
mily
pro
tein
TE
NU
4582
T. c
ruzi
Ep
imas
tigot
e 21
1_Ep
i Ep
imas
tigot
e
1 (D
GF-
1), p
utat
ive
epim
astig
ote
norm
aliz
ed
cD
NA
libr
ary
T. c
ruzi
cDN
A c
lone
9o9
3',
m
RN
A se
quen
ce
TCA
006-
IQ22
5G_E
02_.
b_10
.ab1
Tc
00.1
0470
5350
9589
.50
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
Nul
l N
ull
TC
A00
6-IQ
225G
_E05
_.b_
09.a
b1
Tc00
.104
7053
5079
63.3
0 H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l N
ull
Nul
l
TCA
006-
IQ22
5G_E
11_.
b_09
.ab1
C
ontig
1047
0534
4825
1 N
on-a
nnot
ated
con
tig -
likel
y th
e
TcA
maP
l03R
un01
_F04
A
mas
tigot
e N
ull
Nul
l 10
4705
3448
251
3’ e
nd m
issi
ng p
art o
f the
cod
ing
T.
cru
zi a
mas
tigot
e cD
NA
regi
on o
f the
par
tial g
ene
lib
rary
T. c
ruzi
cDN
A 5
',
Tc
00.1
0470
5351
0771
.9
mR
NA
sequ
ence
TCA
006-
IQ22
5G_F
08_.
b_12
.ab1
Tc
00.1
0470
5350
8731
.40
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
Nul
l N
ull
TC
A00
6-IQ
225G
_F09
_.b_
11.a
b1
Tc00
.104
7053
5110
01.8
0 H
ypot
hetic
al p
rote
in, c
onse
rved
am
astc
-156
TcA
M T
. cru
zi
Am
astig
ote
Nul
l N
ull
cDN
A c
lone
11H
1 5'
,
mR
NA
sequ
ence
Supp
lem
enta
ry M
ater
ial S
1. C
ontin
ued.
Con
tinue
d on
nex
t pag
e
1614
©FUNPEC-RP www.funpecrp.com.brGenetics and Molecular Research 10 (3): 1589-1630 (2011)
C. Martins et al.
Que
ry
Bes
t bla
stn
hit a
gain
st
Ann
otat
ion
Bes
t bla
stn
hit a
gain
st
EST-
stag
e B
est b
last
x hi
t aga
inst
Pr
oteo
me-
stag
e C
ontig
s R
eads
T.
cru
zi g
enes
and
con
tigs
T.
cru
zi E
STs
T.
cru
zi p
rote
ome
TCA
006-
IQ22
5G_F
10_.
b_12
.ab1
25
661.
rrna
0000
1 R
NA
ribo
som
al
TEN
U05
43 T
. cru
zi
Epim
astig
ote
Nul
l N
ull
epim
astig
ote
norm
aliz
ed
cD
NA
libr
ary
T. c
ruzi
cDN
A c
lone
8h4
3',
m
RN
A se
quen
ceTC
A00
6-IQ
225G
_F12
_.b_
12.a
b1
Tc00
.104
7053
5038
33.5
0 H
ypot
hetic
al p
rote
in, c
onse
rved
TE
NG
0126
T. c
ruzi
Ep
imas
tigot
e N
ull
Nul
l
ep
imas
tigot
e no
rmal
ised
cDN
A li
brar
y T.
cru
zi
cDN
A c
lone
n12
6.r 5
’,
mR
NA
sequ
ence
TC
A00
6-IQ
225G
_G02
_.b_
14.a
b1
9441
.rrna
0000
1 R
NA
ribo
som
al
Nul
l N
ull
Nul
l N
ull
TC
A00
6-IQ
225G
_G03
_.b_
13.a
b1
Tc00
.104
7053
5097
33.1
00
Tran
slat
ion
elon
gatio
n fa
ctor
Tc
Am
aPl0
7Run
01_H
04
Am
astig
ote
Nul
l N
ull
1-
beta
, put
ativ
e T.
cru
zi a
mas
tigot
e cD
NA
libra
ry T
. cru
zi c
DN
A 3
’
sim
ilar t
o el
onga
tion
fact
or
1-
beta
, mR
NA
sequ
ence
TC
A00
6-IQ
225G
_G07
_.b_
13.a
b1
Tc00
.104
7053
5041
31.1
0 H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l N
ull
Nul
l
(pse
udog
ene)
TCA
006-
IQ22
5G_G
09_.
b_13
.ab1
Tc
00.1
0470
5350
9695
.150
K
ines
in, p
utat
ive
Nul
l N
ull
Nul
l N
ull
TC
A00
6-IQ
225G
_G10
_.b_
14.a
b1
Tc00
.104
7053
5068
43.2
0 R
etro
trans
poso
n ho
t spo
t pro
tein
N
ull
Nul
l N
ull
Nul
l
(RH
S, p
seud
ogen
e), p
utat
ive
TCA
006-
IQ22
5G_H
01_.
b_15
.ab1
N
ull
Not
iden
tified
N
ull
Nul
l N
ull
Nul
l N
ull
Nul
lTC
A00
6-IQ
225G
_H04
_.b_
16.a
b1
Tc00
.104
7053
5061
55.8
0 H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l N
ull
Nul
l TC
A00
6-IQ
225G
_H12
_.b_
16.a
b1
Nul
l N
ot id
entifi
ed
Nul
l N
ull
Nul
l N
ull
Nul
l N
ull
TCA
007-
IQ23
1A_A
08_.
B_0
2.ab
1 N
ull
Not
iden
tified
N
ull
Nul
l N
ull
Nul
l N
ull
Nul
lTC
A00
7-IQ
231A
_A09
_.B
_01.
ab1
Nul
l N
ot id
entifi
ed
Nul
l N
ull
Nul
l N
ull
Nul
l N
ull
TCA
007-
IQ23
1A_A
12_.
B_0
2.ab
1 Tc
00.1
0470
5351
0877
.160
H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l N
ull
Nul
l TC
A00
7-IQ
231A
_B04
_.B
_04.
ab1
Nul
l N
ot id
entifi
ed
Nul
l N
ull
Nul
l N
ull
Nul
l N
ull
TCA
007-
IQ23
1A_B
08_.
B_0
4.ab
1 Tc
00.1
0470
5351
1115
.10
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
Nul
l N
ull
TC
A00
7-IQ
231A
_D09
_.B
_07.
ab1
Con
tig10
4705
3510
755
Mat
ches
the
3’ e
nd a
nd th
e
Nul
l N
ull
Nul
l N
ull
1047
0535
1075
5
do
wns
tream
flan
king
re
gion
s of t
he g
ene
Tc00
.104
7053
5107
55.1
20TC
A00
7-IQ
231A
_E03
_.B
_09.
ab1
Tc00
.104
7053
5083
69.4
0 Se
rine/
thre
onin
e-pr
otei
n ki
nase
N
ull
Nul
l N
ull
Nul
l
A (p
seud
ogen
e), p
utat
ive
TCA
007-
IQ23
1A_E
09_.
B_0
9.ab
1 N
ull
Not
iden
tified
N
ull
Nul
l N
ull
Nul
l N
ull
Nul
l
Supp
lem
enta
ry M
ater
ial S
1. C
ontin
ued.
Con
tinue
d on
nex
t pag
e
1615
©FUNPEC-RP www.funpecrp.com.brGenetics and Molecular Research 10 (3): 1589-1630 (2011)
Transcriptome of Trypanosoma cruzi mammalian stagesSu
pple
men
tary
Mat
eria
l S1.
Con
tinue
d.
Con
tinue
d on
nex
t pag
e
Que
ry
Bes
t bla
stn
hit a
gain
st
Ann
otat
ion
Bes
t bla
stn
hit a
gain
st
EST-
stag
e B
est b
last
x hi
t aga
inst
Pr
oteo
me-
stag
e C
ontig
s R
eads
T.
cru
zi g
enes
and
con
tigs
T.
cru
zi E
STs
T.
cru
zi p
rote
ome
TCA
007-
IQ23
1A_F
04_.
B_1
2.ab
1 N
ull
Not
iden
tified
N
ull
Nul
l N
ull
Nul
l N
ull
Nul
lTC
A00
7-IQ
231A
_F06
_.B
_12.
ab1
Tc00
.104
7053
5110
37.4
0 H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l N
ull
Nul
l
TCA
007-
IQ23
1A_F
07_.
B_1
1.ab
1 Tc
00.1
0470
5350
3955
.100
Pr
otei
n ki
nase
, put
ativ
e N
ull
Nul
l N
ull
Nul
l
TCA
007-
IQ23
1A_G
06_.
B_1
4.ab
1 Tc
00.1
0470
5351
0149
.50
DN
A re
pair
helic
ase
and
N
ull
Nul
l N
ull
Nul
l
trans
crip
tion
fact
or
prot
ein,
put
ativ
eTC
A00
7-IQ
231A
_H03
_.B
_15.
ab1
Nul
l N
ot id
entifi
ed
Nul
l N
ull
Nul
l N
ull
Nul
l N
ull
TCA
007-
IQ23
1A_H
07_.
B_1
5.ab
1 Tc
00.1
0470
5350
5789
.20
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
Nul
l N
ull
TC
A00
7-IQ
231A
_H10
_.B
_16.
ab1
Tc00
.104
7053
5103
11.4
0 H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l N
ull
Nul
l
TCA
007-
IQ23
1B_A
05_.
B_0
1.ab
1 N
ull
Not
iden
tified
N
ull
Nul
l N
ull
Nul
l N
ull
Nul
lTC
A00
7-IQ
231B
_A06
_.B
_02.
ab1
Nul
l N
ot id
entifi
ed
Nul
l N
ull
Nul
l N
ull
Nul
l N
ull
TCA
007-
IQ23
1B_A
07_.
B_0
1.ab
1 Tc
00.1
0470
5350
8811
.10
Hyp
othe
tical
pro
tein
N
ull
Nul
l N
ull
Nul
l
TCA
007-
IQ23
1B_B
04_.
B_0
4.ab
1 Tc
00.1
0470
5351
0077
.20
Tran
s-si
alid
ase
(pse
udog
ene)
, N
ull
Nul
l N
ull
Nul
l
puta
tive
TCA
007-
IQ23
1B_B
05_.
B_0
3.ab
1 N
ull
Not
iden
tified
N
ull
Nul
l N
ull
Nul
l N
ull
Nul
lTC
A00
7-IQ
231B
_B06
_.B
_04.
ab1
Nul
l N
ot id
entifi
ed
Nul
l N
ull
Nul
l N
ull
Nul
l N
ull
TCA
007-
IQ23
1B_C
02_.
B_0
6.ab
1 Tc
00.1
0470
5351
0297
.110
H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l N
ull
Nul
l
TCA
007-
IQ23
1B_C
05_.
B_0
5.ab
1 N
ull
Not
iden
tified
N
ull
Nul
l N
ull
Nul
l N
ull
Nul
lTC
A00
7-IQ
231B
_C06
_.B
_06.
ab1
Nul
l N
ot id
entifi
ed
Nul
l N
ull
Nul
l N
ull
Nul
l N
ull
TCA
007-
IQ23
1B_D
04_.
B_0
8.ab
1 Tc
00.1
0470
5350
7093
.260
A
BC
tran
spor
ter,
puta
tive
Nul
l N
ull
Nul
l N
ull
TC
A00
7-IQ
231B
_D05
_.B
_07.
ab1
Nul
l N
ot id
entifi
ed
Nul
l N
ull
Nul
l N
ull
Nul
l N
ull
TCA
007-
IQ23
1B_D
06_.
B_0
8.ab
1 N
ull
Not
iden
tified
N
ull
Nul
l N
ull
Nul
l N
ull
Nul
lTC
A00
7-IQ
231B
_E05
_.B
_09.
ab1
Nul
l N
ot id
entifi
ed
Nul
l N
ull
Nul
l N
ull
Nul
l N
ull
TCA
007-
IQ23
1B_F
05_.
B_1
1.ab
1 N
ull
Not
iden
tified
N
ull
Nul
l N
ull
Nul
l N
ull
Nul
lTC
A00
7-IQ
231B
_F06
_.B
_12.
ab1
Nul
l N
ot id
entifi
ed
Nul
l N
ull
Nul
l N
ull
Nul
l N
ull
TCA
007-
IQ23
1B_F
11_.
B_1
1.ab
1 Tc
00.1
0470
5351
1365
.90
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
Nul
l N
ull
TC
A00
7-IQ
231B
_F12
_.B
_12.
ab1
Nul
l N
ot id
entifi
ed
Nul
l N
ull
Nul
l N
ull
Nul
l N
ull
TCA
007-
IQ23
1B_G
05_.
B_1
3.ab
1 N
ull
Not
iden
tified
N
ull
Nul
l N
ull
Nul
l N
ull
Nul
lTC
A00
7-IQ
231B
_G06
_.B
_14.
ab1
Nul
l N
ot id
entifi
ed
Nul
l N
ull
Nul
l N
ull
Nul
l N
ull
TCA
007-
IQ23
1B_G
07_.
B_1
3.ab
1 N
ull
Not
iden
tified
N
ull
Nul
l N
ull
Nul
l N
ull
Nul
lTC
A00
7-IQ
231B
_G09
_.B
_13.
ab1
Nul
l N
ot id
entifi
ed
Nul
l N
ull
Nul
l N
ull
Nul
l N
ull
TCA
007-
IQ23
1B_H
05_.
B_1
5.ab
1 N
ull
Not
iden
tified
N
ull
Nul
l N
ull
Nul
l N
ull
Nul
lTC
A00
7-IQ
231B
_H06
_.B
_16.
ab1
Nul
l N
ot id
entifi
ed
Nul
l N
ull
Nul
l N
ull
Nul
l N
ull
TCA
007-
IQ23
1B_H
10_.
B_1
6.ab
1 Tc
00.1
0470
5350
8111
.40
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
Nul
l N
ull
TC
A00
8-IQ
229A
_A06
_.B
_02.
ab1
Tc00
.104
7053
5088
11.3
0 H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l N
ull
Nul
l
TCA
008-
IQ22
9A_B
05_.
B_0
3.ab
1 Tc
00.1
0470
5351
0285
.60
Ubi
quin
one
bios
ynth
esis
N
ull
Nul
l N
ull
Nul
l
met
hyltr
ansf
eras
e, p
utat
ive
1616
©FUNPEC-RP www.funpecrp.com.brGenetics and Molecular Research 10 (3): 1589-1630 (2011)
C. Martins et al.
Que
ry
Bes
t bla
stn
hit a
gain
st
Ann
otat
ion
Bes
t bla
stn
hit a
gain
st
EST-
stag
e B
est b
last
x hi
t aga
inst
Pr
oteo
me-
stag
e C
ontig
s R
eads
T.
cru
zi g
enes
and
con
tigs
T.
cru
zi E
STs
T.
cru
zi p
rote
ome
TCA
008-
IQ22
9A_B
06_.
B_0
4.ab
1 Tc
00.1
0470
5350
6247
.160
H
ypot
hetic
al p
rote
in, c
onse
rved
TE
NU
3645
T. c
ruzi
epi
mas
tigot
e
Epim
astig
ote
Nul
l N
ull
norm
aliz
ed c
DN
A li
brar
y T.
cru
zi
cDN
A c
lone
29l
21 5
',
mR
NA
sequ
ence
TCA
008-
IQ22
9A_B
12_.
B_0
4.ab
1 Tc
00.1
0470
5351
1533
.10
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
Nul
l N
ull
TC
A00
8-IQ
229A
_C11
_.B
_05.
ab1
Tc00
.104
7053
5063
21.2
80
Hyp
othe
tical
pro
tein
, con
serv
ed
TEN
U38
97 T
. cru
zi e
pim
astig
ote
Ep
imas
tigot
e N
ull
Nul
l
no
rmal
ized
cD
NA
libr
ary
T. c
ruzi
cD
NA
clo
ne 4
2g4
5',
m
RN
A se
quen
ce
TCA
008-
IQ22
9A_D
12_.
B_0
8.ab
1 Tc
00.1
0470
5350
6559
.559
A
ntig
enic
pro
tein
, put
ativ
e N
ull
Nul
l N
ull
Nul
l
TCA
008-
IQ22
9A_F
07_.
B_1
1.ab
1 N
ull
Not
iden
tified
N
ull
Nul
l N
ull
Nul
l N
ull
Nul
lTC
A00
8-IQ
229A
_F08
_.B
_12.
ab1
Tc00
.104
7053
5110
71.2
02
Hyp
othe
tical
pro
tein
, con
serv
ed
TcTR
-113
3 Tc
TR T
. cru
zi
Tryp
omas
tigot
e N
ull
Nul
l N
ull
Nul
l
cDN
A c
lone
09k
20 5
',
mR
NA
sequ
ence
TCA
008-
IQ22
9A_F
09_.
B_1
1.ab
1 N
ull
Not
iden
tified
N
ull
Nul
l N
ull
Nul
l N
ull
Nul
lTC
A00
8-IQ
229A
_F10
_.B
_12.
ab1
Nul
l N
ot id
entifi
ed
Nul
l N
ull
Nul
l N
ull
Nul
l N
ull
TCA
008-
IQ22
9A_F
12_.
B_1
2.ab
1 Tc
00.1
0470
5350
6885
.270
Tu
bulin
tyro
sine
liga
se, p
utat
ive
Nul
l N
ull
Nul
l N
ull
TC
A00
8-IQ
229A
_G02
_.B
_14.
ab1
Nul
l N
ot id
entifi
ed
Nul
l N
ull
Nul
l N
ull
Nul
l N
ull
TCA
008-
IQ22
9A_G
03_.
B_1
3.ab
1 N
ull
Not
iden
tified
N
ull
Nul
l N
ull
Nul
l N
ull
Nul
lTC
A00
8-IQ
229A
_G04
_.B
_14.
ab1
Nul
l N
ot id
entifi
ed
Nul
l N
ull
Nul
l N
ull
Nul
l N
ull
TCA
008-
IQ22
9A_G
05_.
B_1
3.ab
1 N
ull
Not
iden
tified
N
ull
Nul
l N
ull
Nul
l N
ull
Nul
lTC
A00
8-IQ
229A
_G06
_.B
_14.
ab1
Nul
l N
ot id
entifi
ed
Nul
l N
ull
Nul
l N
ull
Nul
l N
ull
TCA
008-
IQ22
9A_G
07_.
B_1
3.ab
1 N
ull
Not
iden
tified
N
ull
Nul
l N
ull
Nul
l N
ull
Nul
lTC
A00
8-IQ
229A
_G10
_.B
_14.
ab1
Tc00
.104
7053
5112
83.2
80
Prot
ein
kina
se, p
utat
ive
Nul
l N
ull
Nul
l N
ull
TC
A00
8-IQ
229A
_H03
_.B
_15.
ab1
Nul
l N
ot id
entifi
ed
Nul
l N
ull
Nul
l N
ull
Nul
l N
ull
TCA
008-
IQ22
9B_A
06_.
B_0
2.ab
1 Tc
00.1
0470
5350
6941
.90
ATP-
depe
nden
t DEA
D/H
DN
A
Nul
l N
ull
Nul
l N
ull
he
licas
e re
cQ fa
mily
, put
ativ
eTC
A00
8-IQ
229B
_A09
_.B
_01.
ab1
Tc00
.104
7053
5065
59.3
60
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
Nul
l N
ull
TC
A00
8-IQ
229B
_A12
_.B
_02.
ab1
Nul
l N
ot id
entifi
ed
Nul
l N
ull
Nul
l N
ull
Nul
l N
ull
TCA
008-
IQ22
9B_B
05_.
B_0
3.ab
1 86
20.te
0000
5 D
IRE
Nul
l N
ull
Nul
l N
ull
TCA
008-
IQ22
9B_C
02_.
B_0
6.ab
1 56
13.te
0000
5 SI
RE
TEN
U24
24 T
. cru
zi e
pim
astig
ote
Ep
imas
tigot
e N
ull
Nul
l
no
rmal
ized
cD
NA
libr
ary
T. c
ruzi
cDN
A c
lone
26b
10 5
’ sim
ilar t
o
T. c
ruzi
H2A
gen
e fo
r his
tone
H2A
em
b|X
6728
7|TC
HIS
H2A
,
mR
NA
sequ
ence
TC
A00
8-IQ
229B
_C09
_.B
_05.
ab1
Tc00
.104
7053
5114
67.3
0 H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l N
ull
Nul
l
Supp
lem
enta
ry M
ater
ial S
1. C
ontin
ued.
Con
tinue
d on
nex
t pag
e
1617
©FUNPEC-RP www.funpecrp.com.brGenetics and Molecular Research 10 (3): 1589-1630 (2011)
Transcriptome of Trypanosoma cruzi mammalian stages
Que
ry
Bes
t bla
stn
hit a
gain
st
Ann
otat
ion
Bes
t bla
stn
hit a
gain
st
EST-
stag
e B
est b
last
x hi
t aga
inst
Pr
oteo
me-
stag
e C
ontig
s R
eads
T.
cru
zi g
enes
and
con
tigs
T.
cru
zi E
STs
T.
cru
zi p
rote
ome
TCA
008-
IQ22
9B_C
10_.
B_0
6.ab
1 Tc
00.1
0470
5350
6825
.219
H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l N
ull
Nul
l
TCA
008-
IQ22
9B_C
12_.
B_0
6.ab
1 Tc
00.1
0470
5350
6773
.130
Pu
mili
o-re
peat
, RN
A-b
indi
ng
Nul
l N
ull
Nul
l N
ull
pr
otei
n, p
utat
ive
TCA
008-
IQ22
9B_D
07_.
B_0
7.ab
1 N
ull
Not
iden
tified
N
ull
Nul
l N
ull
Nul
l N
ull
Nul
lTC
A00
8-IQ
229B
_F01
_.B
_11.
ab1
Tc00
.104
7053
5114
27.1
9 H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l N
ull
Nul
l
TCA
008-
IQ22
9B_F
03_.
B_1
1.ab
1 Tc
00.1
0470
5350
4153
.370
H
ypot
hetic
al p
rote
in
Nul
l N
ull
Nul
l N
ull
TC
A00
8-IQ
229B
_F11
_.B
_11.
ab1
Tc00
.104
7053
5067
29.9
0 H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l N
ull
Nul
l
TCA
008-
IQ22
9B_G
01_.
B_1
3.ab
1 Tc
00.1
0470
5350
9891
.70
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
Nul
l N
ull
(p
seud
ogen
e)TC
A00
8-IQ
229B
_G06
_.B
_14.
ab1
Tc00
.104
7053
5089
53.1
0 H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l N
ull
Nul
l
TCA
008-
IQ22
9B_H
03_.
B_1
5.ab
1 Tc
00.1
0470
5350
7211
.40
Inos
ine-
5'-m
onop
hosp
hate
N
ull
Nul
l N
ull
Nul
l
dehy
drog
enas
e, p
utat
ive
TCA
008-
IQ22
9B_H
12_.
B_1
6.ab
1 N
ull
Not
iden
tified
N
ull
Nul
l N
ull
Nul
l N
ull
Nul
lTC
A00
9-IQ
230A
_A05
_.B
_01.
ab1
Nul
l N
ot id
entifi
ed
Nul
l N
ull
Nul
l N
ull
Nul
l N
ull
TCA
009-
IQ23
0A_A
07_.
B_0
1.ab
1 N
ull
Not
iden
tified
N
ull
Nul
l N
ull
Nul
l N
ull
Nul
lTC
A00
9-IQ
230A
_A08
_.B
_02.
ab1
Nul
l N
ot id
entifi
ed
Nul
l N
ull
Nul
l N
ull
Nul
l N
ull
TCA
009-
IQ23
0A_A
09_.
B_0
1.ab
1 Tc
00.1
0470
5350
8739
.40
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
Nul
l N
ull
TC
A00
9-IQ
230A
_B07
_.B
_03.
ab1
Nul
l N
ot id
entifi
ed
Nul
l N
ull
Nul
l N
ull
Nul
l N
ull
TCA
009-
IQ23
0A_B
08_.
B_0
4.ab
1 N
ull
Not
iden
tified
N
ull
Nul
l N
ull
Nul
l N
ull
Nul
lTC
A00
9-IQ
230A
_B12
_.B
_04.
ab1
Tc00
.104
7053
5076
81.2
00
Serin
e ca
rbox
ypep
tidas
e
Nul
l N
ull
Nul
l N
ull
S2
8, p
utat
ive
TCA
009-
IQ23
0A_C
04_.
B_0
6.ab
1 Tc
00.1
0470
5350
4097
.10
RN
A e
ditin
g co
mpl
ex
Nul
l N
ull
Nul
l N
ull
pr
otei
n M
P61,
put
ativ
eTC
A00
9-IQ
230A
_C07
_.B
_05.
ab1
Nul
l N
ot id
entifi
ed
Nul
l N
ull
Nul
l N
ull
Nul
l N
ull
TCA
009-
IQ23
0A_D
05_.
B_0
7.ab
1 Tc
00.1
0470
5350
6961
.150
Tr
ans-
sial
idas
e, p
utat
ive
Nul
l N
ull
Nul
l N
ull
TC
A00
9-IQ
230A
_D06
_.B
_08.
ab1
Tc00
.104
7053
5040
47.5
0 H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l N
ull
Nul
l
TCA
009-
IQ23
0A_D
07_.
B_0
7.ab
1 N
ull
Not
iden
tified
N
ull
Nul
l N
ull
Nul
l N
ull
Nul
lTC
A00
9-IQ
230A
_D08
_.B
_08.
ab1
Tc00
.104
7053
5078
17.6
0 H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l N
ull
Nul
l
TCA
009-
IQ23
0A_E
07_.
B_0
9.ab
1 N
ull
Not
iden
tified
N
ull
Nul
l N
ull
Nul
l N
ull
Nul
lTC
A00
9-IQ
230A
_F03
_.B
_11.
ab1
Tc00
.104
7053
5070
35.8
0 Tr
ans-
sial
idas
e
Nul
l N
ull
62_T
rypo
Tr
ypom
astig
ote
(p
seud
ogen
e), p
utat
ive
TCA
009-
IQ23
0A_F
04_.
B_1
2.ab
1 N
ull
Not
iden
tified
N
ull
Nul
l N
ull
Nul
l N
ull
Nul
lTC
A00
9-IQ
230A
_F07
_.B
_11.
ab1
Nul
l N
ot id
entifi
ed
Nul
l N
ull
Nul
l N
ull
Nul
l N
ull
TCA
009-
IQ23
0A_G
07_.
B_1
3.ab
1 N
ull
Not
iden
tified
N
ull
Nul
l N
ull
Nul
l N
ull
Nul
lTC
A00
9-IQ
230A
_G08
_.B
_14.
ab1
Nul
l N
ot id
entifi
ed
Nul
l N
ull
Nul
l N
ull
Nul
l N
ull
TCA
009-
IQ23
0A_H
07_.
B_1
5.ab
1 N
ull
Not
iden
tified
N
ull
Nul
l N
ull
Nul
l N
ull
Nul
lTC
A00
9-IQ
230B
_A01
_.B
_01.
ab1
Nul
l N
ot id
entifi
ed
Nul
l N
ull
Nul
l N
ull
Nul
l N
ull
Supp
lem
enta
ry M
ater
ial S
1. C
ontin
ued.
Con
tinue
d on
nex
t pag
e
1618
©FUNPEC-RP www.funpecrp.com.brGenetics and Molecular Research 10 (3): 1589-1630 (2011)
C. Martins et al.
Supp
lem
enta
ry M
ater
ial S
1. C
ontin
ued.
Con
tinue
d on
nex
t pag
e
Que
ry
Bes
t bla
stn
hit a
gain
st
Ann
otat
ion
Bes
t bla
stn
hit a
gain
st
EST-
stag
e B
est b
last
x hi
t aga
inst
Pr
oteo
me-
stag
e C
ontig
s R
eads
T.
cru
zi g
enes
and
con
tigs
T.
cru
zi E
STs
T.
cru
zi p
rote
ome
TCA
009-
IQ23
0B_A
06_.
B_0
2.ab
1 N
ull
Not
iden
tified
N
ull
Nul
l N
ull
Nul
l N
ull
Nul
lTC
A00
9-IQ
230B
_A08
_.B
_02.
ab1
Tc00
.104
7053
5109
57.9
C
alpa
in-li
ke c
yste
ine
Tc
TR-4
65 T
cTR
T. c
ruzi
Tr
ypom
astig
ote
Nul
l N
ull
pe
ptid
ase,
put
ativ
e cD
NA
clo
ne 0
2p18
5',
m
RN
A se
quen
ce
TCA
009-
IQ23
0B_A
12_.
B_0
2.ab
1 Tc
00.1
0470
5351
1633
.79
Mic
rotu
bule
-ass
ocia
ted
EA-2
-1 T
. cru
zi d
iffer
entia
l N
ot sp
ecifi
ed
Nul
l N
ull
pr
otei
n, p
utat
ive
disp
lay
cDN
A li
brar
y T.
cru
zi
cDN
A, m
RN
A se
quen
ce
TCA
009-
IQ23
0B_B
01_.
B_0
3.ab
1 N
ull
Not
iden
tified
N
ull
Nul
l N
ull
Nul
l N
ull
Nul
lTC
A00
9-IQ
230B
_B05
_.B
_03.
ab1
Tc00
.104
7053
5095
81.1
0 Tr
ans-
sial
idas
e, p
utat
ive
Nul
l N
ull
Nul
l N
ull
TC
A00
9-IQ
230B
_C08
_.B
_06.
ab1
Tc00
.104
7053
5085
69.8
0 Ph
osph
oace
tylg
luco
sam
ine
N
ull
Nul
l N
ull
Nul
l
mut
ase,
put
ativ
eTC
A00
9-IQ
230B
_C09
_.B
_05.
ab1
Tc00
.104
7053
5074
83.8
0 H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l N
ull
Nul
l
TCA
009-
IQ23
0B_D
02_.
B_0
8.ab
1 N
ull
Not
iden
tified
N
ull
Nul
l N
ull
Nul
l N
ull
Nul
lTC
A00
9-IQ
230B
_D04
_.B
_08.
ab1
Nul
l N
ot id
entifi
ed
Nul
l N
ull
Nul
l N
ull
Nul
l N
ull
TCA
009-
IQ23
0B_D
05_.
B_0
7.ab
1 Tc
00.1
0470
5350
3697
.70
Hyp
othe
tical
pro
tein
, con
serv
ed
TEN
Q08
90 T
. cru
zi
Epim
astig
ote
184_
Am
a.m
eta
Am
astig
ote,
ep
imas
tigot
e no
rmal
ized
m
etac
yclic
cD
NA
libr
ary
T. c
ruzi
cDN
A, m
RN
A se
quen
ce
TCA
009-
IQ23
0B_D
06_.
B_0
8.ab
1 N
ull
Not
iden
tified
N
ull
Nul
l N
ull
Nul
l N
ull
Nul
lTC
A00
9-IQ
230B
_D11
_.B
_07.
ab1
Tc00
.104
7053
5111
17.1
0 R
etro
trans
poso
n ho
t spo
t pro
tein
am
astc
-117
4 Tc
AM
T. c
ruzi
A
mas
tigot
e 34
_Met
a M
etac
yclic
(RH
S, p
seud
ogen
e), p
utat
ive
cDN
A c
lone
9C
12 5
',
mR
NA
sequ
ence
TC
A00
9-IQ
230B
_E09
_.B
_09.
ab1
Tc00
.104
7053
5118
73.5
H
ypot
hetic
al p
rote
in
Nul
l N
ull
Nul
l N
ull
TC
A00
9-IQ
230B
_E10
_.B
_10.
ab1
Tc00
.104
7053
5106
45.4
0 H
ypot
hetic
al p
rote
in,
Nul
l N
ull
Nul
l N
ull
co
nser
ved
(pse
udog
ene)
TCA
009-
IQ23
0B_F
04_.
B_1
2.ab
1 Tc
00.1
0470
5350
6155
.60
Ald
ehyd
e de
hydr
ogen
ase,
put
ativ
e N
ull
Nul
l N
ull
Nul
l
TCA
009-
IQ23
0B_G
02_.
B_1
4.ab
1 Tc
00.1
0470
5350
8321
.40
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
Nul
l N
ull
TC
A00
9-IQ
230B
_G09
_.B
_13.
ab1
Con
tig10
4705
3508
827
Mat
ches
the
inte
r-cod
ing
TE
NU
1696
T. c
ruzi
epi
mas
tigot
e Ep
imas
tigot
e N
ull
Nul
l 10
4705
3508
827
regi
on o
f the
gen
es
norm
aliz
ed c
DN
A li
brar
y
Tc
00.1
0470
5350
8827
.10
and
T.
cru
zi c
DN
A c
lone
20j
13 3
’,
Tc
00.1
0470
5350
8827
.20
mR
NA
sequ
ence
TCA
009-
IQ23
0C_A
07_.
B_0
1.ab
1 Tc
00.1
0470
5351
0955
.4
Lipa
se-li
ke p
rote
in, p
utat
ive
Nul
l N
ull
Nul
l N
ull
TC
A00
9-IQ
230C
_B04
_.B
_04.
ab1
Tc00
.104
7053
5104
07.7
0 H
ypot
hetic
al p
rote
in
Nul
l N
ull
Nul
l N
ull
1619
©FUNPEC-RP www.funpecrp.com.brGenetics and Molecular Research 10 (3): 1589-1630 (2011)
Transcriptome of Trypanosoma cruzi mammalian stages
Que
ry
Bes
t bla
stn
hit a
gain
st
Ann
otat
ion
Bes
t bla
stn
hit a
gain
st
EST-
stag
e B
est b
last
x hi
t aga
inst
Pr
oteo
me-
stag
e C
ontig
s R
eads
T.
cru
zi g
enes
and
con
tigs
T.
cru
zi E
STs
T.
cru
zi p
rote
ome
TCA
009-
IQ23
0C_C
04_.
B_0
6.ab
1 11
843.
rrna
0000
3 R
NA
ribo
som
al
TEN
U32
27 T
. cru
zi e
pim
astig
ote
Ep
imas
tigot
e N
ull
Nul
l
no
rmal
ized
cD
NA
libr
ary
T.
cru
zi c
DN
A c
lone
29o
3
5' si
mila
r to
T. c
ruzi
5.8
S
ribos
omal
RN
A, i
nter
nal
tra
nscr
ibed
spac
ers
1-
7 (I
TS1-
ITS7
), an
d
28S
ribos
omal
RN
A
gb
|L22
334|
TRB
S3R
RB
N,
m
RN
A se
quen
ceTC
A00
9-IQ
230C
_D08
_.B
_08.
ab1
Tc00
.104
7053
5092
05.1
20
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
Nul
l N
ull
TC
A00
9-IQ
230C
_E04
_.B
_10.
ab1
Tc00
.104
7053
5115
85.1
0 H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l N
ull
Nul
l
TCA
009-
IQ23
0C_F
10_.
B_1
2.ab
1 22
080.
rrna
0000
1 R
NA
ribo
som
al
TEN
F068
2 T.
cru
zi e
pim
astig
ote
Ep
imas
tigot
e N
ull
Nul
l
no
rmal
ized
cD
NA
libr
ary
T.
cru
zi c
DN
A c
lone
682
5',
m
RN
A se
quen
ceTC
A00
9-IQ
230D
_A02
_.B
_02.
ab1
Tc00
.104
7053
5104
41.2
0 D
ispe
rsed
gen
e fa
mily
pro
tein
N
ull
Nul
l N
ull
Nul
l
1 (D
GF-
1), p
utat
ive
TCA
009-
IQ23
0D_A
12_.
B_0
2.ab
1 N
ull
195-
bp sa
telli
te D
NA
N
ull
Nul
l N
ull
Nul
l
TCA
009-
IQ23
0D_B
01_.
B_0
3.ab
1 N
ull
Not
iden
tified
N
ull
Nul
l N
ull
Nul
l N
ull
Nul
lTC
A00
9-IQ
230D
_B05
_.B
_03.
ab1
Tc00
.104
7053
5076
11.1
99
Dis
pers
ed g
ene
fam
ily p
rote
in
Nul
l N
ull
Nul
l N
ull
1
(DG
F-1)
, put
ativ
eTC
A00
9-IQ
230D
_C12
_.B
_06.
ab1
Nul
l N
ot id
entifi
ed
Nul
l N
ull
Nul
l N
ull
Nul
l N
ull
TCA
009-
IQ23
0D_D
10_.
B_0
8.ab
1 N
ull
195-
bp sa
telli
te D
NA
TE
NU
4403
T. c
ruzi
epi
mas
tigot
e
Epim
astig
ote
Nul
l N
ull
norm
aliz
ed c
DN
A li
brar
y
T. c
ruzi
cD
NA
clo
ne 5
h21
3',
m
RN
A se
quen
ceTC
A00
9-IQ
230D
_F11
_.B
_11.
ab1
Tc00
.104
7053
5101
21.9
0 H
ypot
hetic
al p
rote
in, c
onse
rved
TE
NU
3409
T. c
ruzi
epi
mas
tigot
e
Epim
astig
ote
Nul
l N
ull
norm
aliz
ed c
DN
A li
brar
y
T. c
ruzi
cD
NA
clo
ne 2
9b3
5',
m
RN
A se
quen
ceTC
A00
9-IQ
230D
_G03
_.B
_13.
ab1
Tc00
.104
7053
5041
53.9
0 D
ynei
n he
avy
chai
n
Nul
l N
ull
Nul
l N
ull
(p
seud
ogen
e), p
utat
ive
TCA
009-
IQ23
0E_A
01_.
B_0
1.ab
1 N
ull
Not
iden
tified
N
ull
Nul
l N
ull
Nul
l N
ull
Nul
l
Con
tinue
d on
nex
t pag
e
Supp
lem
enta
ry M
ater
ial S
1. C
ontin
ued.
1620
©FUNPEC-RP www.funpecrp.com.brGenetics and Molecular Research 10 (3): 1589-1630 (2011)
C. Martins et al.
Que
ry
Bes
t bla
stn
hit a
gain
st
Ann
otat
ion
Bes
t bla
stn
hit a
gain
st
EST-
stag
e B
est b
last
x hi
t aga
inst
Pr
oteo
me-
stag
e C
ontig
s R
eads
T.
cru
zi g
enes
and
con
tigs
T.
cru
zi E
STs
T.
cru
zi p
rote
ome
TCA
009-
IQ23
0E_A
04_.
B_0
2.ab
1 Tc
00.1
0470
5350
8869
.10
Hyp
othe
tical
pro
tein
TE
NU
3457
T. c
ruzi
epi
mas
tigot
e
Epim
astig
ote
Nul
l N
ull
norm
aliz
ed c
DN
A li
brar
y
T. c
ruzi
cD
NA
clo
ne 2
9g24
5' si
mila
r to
T. c
ruzi
85
kD
su
rfac
e an
tigen
(gp8
5)
gb
|M64
836|
TRB
85K
D,
m
RN
A se
quen
ceTC
A00
9-IQ
230E
_A06
_.B
_02.
ab1
Nul
l N
ot id
entifi
ed
Nul
l N
ull
Nul
l N
ull
Nul
l N
ull
TCA
009-
IQ23
0E_A
07_.
B_0
1.ab
1 Tc
00.1
0470
5350
7641
.14
Neu
robe
achi
n/be
ige
N
ull
Nul
l N
ull
Nul
l
pr
otei
n, p
utat
ive
TCA
009-
IQ23
0E_A
08_.
B_0
2.ab
1 N
ull
Not
iden
tified
N
ull
Nul
l N
ull
Nul
l N
ull
Nul
lTC
A00
9-IQ
230E
_A09
_.B
_01.
ab1
Nul
l N
ot id
entifi
ed
Nul
l N
ull
Nul
l N
ull
Nul
l N
ull
TCA
009-
IQ23
0E_A
12_.
B_0
2.ab
1 N
ull
Not
iden
tified
N
ull
Nul
l N
ull
Nul
l N
ull
Nul
lTC
A00
9-IQ
230E
_B01
_.B
_03.
ab1
Nul
l N
ot id
entifi
ed
Nul
l N
ull
Nul
l N
ull
Nul
l N
ull
TCA
009-
IQ23
0E_B
05_.
B_0
3.ab
1 N
ull
Not
iden
tified
N
ull
Nul
l N
ull
Nul
l N
ull
Nul
lTC
A00
9-IQ
230E
_B10
_.B
_04.
ab1
264.
rrna
0000
2 R
NA
ribo
som
al
Nul
l N
ull
106_
Tryp
o Tr
ypom
astig
ote
TC
A00
9-IQ
230E
_B12
_.B
_04.
ab1
Tc00
.104
7053
5116
57.3
0 H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l N
ull
Nul
l
TCA
009-
IQ23
0E_C
04_.
B_0
6.ab
1 Tc
00.1
0470
5350
6367
.170
H
ypot
hetic
al p
rote
in
Nul
l N
ull
Nul
l N
ull
TC
A00
9-IQ
230E
_C11
_.B
_05.
ab1
Nul
l N
ot id
entifi
ed
Nul
l N
ull
Nul
l N
ull
Nul
l N
ull
TCA
009-
IQ23
0E_C
12_.
B_0
6.ab
1 Tc
00.1
0470
5351
1071
.202
H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l N
ull
Nul
l
TCA
009-
IQ23
0E_D
12_.
B_0
8.ab
1 N
ull
Not
iden
tified
N
ull
Nul
l N
ull
Nul
l N
ull
Nul
lTC
A00
9-IQ
230E
_E03
_.B
_09.
ab1
Tc00
.104
7053
5091
05.1
40
Hea
t sho
ck p
rote
in 8
5, p
utat
ive
amas
tc-3
62 T
cAM
T. c
ruzi
A
mas
tigot
e
497_
Am
a.m
eta.
epi
Am
astig
ote,
cDN
A c
lone
14B
10 5
',
met
acyc
lic,
mR
NA
sequ
ence
ep
imas
tigot
e
TCA
009-
IQ23
0E_E
12_.
B_1
0.ab
1 N
ull
Not
iden
tified
N
ull
Nul
l N
ull
Nul
l N
ull
Nul
lTC
A00
9-IQ
230E
_F02
_.B
_12.
ab1
Tc00
.104
7053
5061
77.5
0 H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l N
ull
Nul
l
TCA
009-
IQ23
0E_F
09_.
B_1
1.ab
1 Tc
00.1
0470
5351
0533
.10
Dyn
ein
heav
y ch
ain,
N
ull
Nul
l N
ull
Nul
l
cyto
solic
, put
ativ
eTC
A00
9-IQ
230E
_G03
_.B
_13.
ab1
Tc00
.104
7053
5105
65.1
79
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
Nul
l N
ull
TC
A00
9-IQ
230E
_G09
_.B
_13.
ab1
Nul
l N
ot id
entifi
ed
Nul
l N
ull
Nul
l N
ull
Nul
l N
ull
TCA
009-
IQ23
0E_G
10_.
B_1
4.ab
1 Tc
00.1
0470
5351
1277
.400
U
DP-
gala
ctos
e
Nul
l N
ull
Nul
l N
ull
tra
nspo
rter,
puta
tive
TCA
009-
IQ23
0E_G
12_.
B_1
4.ab
1 Tc
00.1
0470
5350
6579
.40
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
Nul
l N
ull
TC
A00
9-IQ
230E
_H01
_.B
_15.
ab1
Tc00
.104
7053
5074
29.6
0 H
ypot
hetic
al p
rote
in
Nul
l N
ull
Nul
l N
ull
TC
A00
9-IQ
230E
_H02
_.B
_16.
ab1
Tc00
.104
7053
5094
73.1
0 H
ypot
hetic
al p
rote
in (p
seud
ogen
e)
Nul
l N
ull
Nul
l N
ull
TC
A23
6-IQ
001_
A01
_.B
_01.
ab1
Nul
l N
ot id
entifi
ed
Nul
l N
ull
Nul
l N
ull
Nul
l N
ull
Supp
lem
enta
ry M
ater
ial S
1. C
ontin
ued.
Con
tinue
d on
nex
t pag
e
1621
©FUNPEC-RP www.funpecrp.com.brGenetics and Molecular Research 10 (3): 1589-1630 (2011)
Transcriptome of Trypanosoma cruzi mammalian stages
Que
ry
Bes
t bla
stn
hit a
gain
st
Ann
otat
ion
Bes
t bla
stn
hit a
gain
st
EST-
stag
e B
est b
last
x hi
t aga
inst
Pr
oteo
me-
stag
e C
ontig
s R
eads
T.
cru
zi g
enes
and
con
tigs
T.
cru
zi E
STs
T.
cru
zi p
rote
ome
TCA
236-
IQ00
1_A
02_.
B_0
2.ab
1 Tc
00.1
0470
5350
6279
.220
M
ucin
-ass
ocia
ted
surf
ace
prot
ein
N
ull
Nul
l N
ull
Nul
l
(MA
SP, p
seud
ogen
e), p
utat
ive
TCA
236-
IQ00
1_A
07_.
B_0
1.ab
1 Tc
00.1
0470
5351
0187
.551
C
hape
roni
n H
SP60
mito
chon
dria
l N
ull
Nul
l N
ull
Nul
l
prec
urso
r, pu
tativ
eTC
A23
6-IQ
001_
A09
_.B
_01.
ab1
Tc00
.104
7053
5038
01.9
R
AD
50 D
NA
repa
ir
Nul
l N
ull
Nul
l N
ull
pr
otei
n, p
utat
ive
TCA
236-
IQ00
1_A
12_.
B_0
2.ab
1 N
ull
Not
iden
tified
N
ull
Nul
l N
ull
Nul
l N
ull
Nul
lTC
A23
6-IQ
001_
B04
_.B
_04.
ab1
8054
.te00
005
VIP
ER
Nul
l N
ull
Nul
l N
ull
TC
A23
6-IQ
001_
B06
_.B
_04.
ab1
Nul
l N
ot id
entifi
ed
Nul
l N
ull
Nul
l N
ull
Nul
l N
ull
TCA
236-
IQ00
1_B
11_.
B_0
3.ab
1 Tc
00.1
0470
5350
4153
.270
H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l N
ull
Nul
l
TCA
236-
IQ00
1_C
03_.
B_0
5.ab
1 24
358.
rrna
0000
1 R
NA
ribo
som
al
TEU
F015
4 T.
cru
zi e
pim
astig
ote
Ep
imas
tigot
e N
ull
Nul
l
no
n-no
rmal
ized
cD
NA
libr
ary
T.
cru
zi c
DN
A c
lone
154
5'
si
mila
r to
T. c
ruzi
gen
e fo
r his
tone
H2b
(X60
982)
, mR
NA
sequ
ence
TC
A23
6-IQ
001_
C05
_.B
_05.
ab1
Tc00
.104
7053
5086
71.2
0 Se
rine
carb
oxyp
eptid
ase
N
ull
Nul
l N
ull
Nul
l
(CB
P1),
puta
tive
TCA
236-
IQ00
1_C
07_.
B_0
5.ab
1 Tc
00.1
0470
5350
6503
.53
Hyp
othe
tical
pro
tein
N
ull
Nul
l N
ull
Nul
l
TCA
236-
IQ00
1_D
04_.
B_0
8.ab
1 Tc
00.1
0470
5350
6849
.20
Dyn
ein
heav
y ch
ain
N
ull
Nul
l N
ull
Nul
l
(pse
udog
ene)
, put
ativ
eTC
A23
6-IQ
001_
D05
_.B
_07.
ab1
Tc00
.104
7053
5076
41.1
60
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
Nul
l N
ull
TC
A23
6-IQ
001_
D08
_.B
_08.
ab1
Tc00
.104
7053
5104
99.2
0 H
ypot
hetic
al p
rote
in
Nul
l N
ull
Nul
l N
ull
TC
A23
6-IQ
001_
D09
_.B
_07.
ab1
Tc00
.104
7053
5042
41.8
0 Tr
ans-
sial
idas
e TE
NF0
714
T. c
ruzi
epi
mas
tigot
e
Epim
astig
ote
Nul
l N
ull
(p
seud
ogen
e), p
utat
ive
norm
aliz
ed c
DN
A li
brar
y
T. c
ruzi
cD
NA
clo
ne 7
14 5
',
mR
NA
sequ
ence
TC
A23
6-IQ
001_
D12
_.B
_08.
ab1
Tc00
.104
7053
5115
11.1
0 En
doso
mal
traf
ficki
ng
Nul
l N
ull
Nul
l N
ull
pr
otei
n R
ME-
8, p
utat
ive
TCA
236-
IQ00
1_E1
0_.B
_10.
ab1
Tc00
.104
7053
5114
15.2
1 Tr
ans-
sial
idas
e
Nul
l N
ull
Nul
l N
ull
(p
seud
ogen
e), p
utat
ive
TCA
236-
IQ00
1_E1
2_.B
_10.
ab1
Nul
l N
ot id
entifi
ed
Nul
l N
ull
Nul
l N
ull
Nul
l N
ull
TCA
236-
IQ00
1_F0
3_.B
_11.
ab1
7271
.te00
003
VIP
ER
Nul
l N
ull
Nul
l N
ull
TC
A23
6-IQ
001_
F05_
.B_1
1.ab
1 Tc
00.1
0470
5350
6721
.30
Cal
pain
-like
cys
tein
e
Nul
l N
ull
317_
Met
a.ep
i M
etac
yclic
,
pept
idas
e, p
utat
ive
ep
imas
tigot
eTC
A23
6-IQ
001_
F11_
.B_1
1.ab
1 Tc
00.1
0470
5350
4643
.20
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
Nul
l N
ull
TC
A23
6-IQ
001_
F12_
.B_1
2.ab
1 N
ull
Not
iden
tified
N
ull
Nul
l N
ull
Nul
l N
ull
Nul
l
Supp
lem
enta
ry M
ater
ial S
1. C
ontin
ued.
Con
tinue
d on
nex
t pag
e
1622
©FUNPEC-RP www.funpecrp.com.brGenetics and Molecular Research 10 (3): 1589-1630 (2011)
C. Martins et al.
Que
ry
Bes
t bla
stn
hit a
gain
st
Ann
otat
ion
Bes
t bla
stn
hit a
gain
st
EST-
stag
e B
est b
last
x hi
t aga
inst
Pr
oteo
me-
stag
e C
ontig
s R
eads
T.
cru
zi g
enes
and
con
tigs
T.
cru
zi E
STs
TCA
236-
IQ00
1_G
03_.
B_1
3.ab
1 Tc
00.1
0470
5350
6847
.50
Tran
s-si
alid
ase
N
ull
Nul
l N
ull
Nul
l
(pse
udog
ene)
, put
ativ
eTC
A23
6-IQ
001_
G04
_.B
_14.
ab1
Tc00
.104
7053
5115
29.8
0 K
inet
opla
st
Nul
l N
ull
187_
Am
a.try
po.m
eta
Am
astig
ote,
D
NA
-ass
ocia
ted
try
pom
astig
ote,
pr
otei
n, p
utat
ive
ep
imas
tigot
e
TCA
236-
IQ00
1_G
06_.
B_1
4.ab
1 Tc
00.1
0470
5350
6505
.30
Hyp
othe
tical
pro
tein
, con
serv
ed
TEN
G03
88 T
. cru
zi
Epim
astig
ote
Nul
l N
ull
epim
astig
ote
norm
alis
ed
cD
NA
libr
ary
T. c
ruzi
cDN
A c
lone
n73
1.r 5
',
mR
NA
sequ
ence
TCA
236-
IQ00
1_G
11_.
B_1
3.ab
1 Tc
00.1
0470
5350
7979
.50
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
Nul
l N
ull
TC
A23
6-IQ
001_
G12
_.B
_14.
ab1
Nul
l N
ot id
entifi
ed
Nul
l N
ull
Nul
l N
ull
Nul
l N
ull
TCA
236-
IQ00
1_H
01_.
B_1
5.ab
1 Tc
00.1
0470
5350
5555
.40
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
Nul
l N
ull
TC
A23
6-IQ
001_
H03
_.B
_15.
ab1
Nul
l N
ot id
entifi
ed
Nul
l N
ull
Nul
l N
ull
Nul
l N
ull
TCA
236-
IQ00
1_H
05_.
B_1
5.ab
1 Tc
00.1
0470
5351
0055
.150
H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l N
ull
Nul
l
TCA
236-
IQ00
1_H
07_.
B_1
5.ab
1 Tc
00.1
0470
5350
6265
.60
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
Nul
l N
ull
TC
A23
6-IQ
001_
H12
_.B
_16.
ab1
Tc00
.104
7053
5115
87.3
0 R
etro
trans
poso
n
Nul
l N
ull
Nul
l N
ull
ho
t spo
t pro
tein
(R
HS,
pse
udog
ene)
, put
ativ
eTC
E006
-IQ
225A
_B01
_.B
_03.
ab1
Tc00
.104
7053
5114
15.1
1 R
etro
trans
poso
n ho
t spo
t N
ull
Nul
l 48
8_M
eta
Met
acyc
lic
(R
HS)
pro
tein
, put
ativ
eTC
E006
-IQ
225A
_B07
_.B
_03.
ab1
Tc00
.104
7053
5083
23.1
99
ATP-
depe
nden
t N
ull
Nul
l N
ull
Nul
l
DEA
D/H
RN
A
helic
ase,
put
ativ
eTC
E006
-IQ
225A
_C07
_.B
_05.
ab1
Tc00
.104
7053
5068
25.1
90
Hyp
othe
tical
pro
tein
TE
NU
2835
T. c
ruzi
Ep
imas
tigot
e N
ull
Nul
l
ep
imas
tigot
e no
rmal
ized
cDN
A li
brar
y T.
cru
zi
cD
NA
clo
ne 2
7d2
5',
m
RN
A se
quen
ce
TCE0
06-I
Q22
5A_C
10_.
B_0
6.ab
1 Tc
00.1
0470
5350
4137
.40
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
Nul
l N
ull
TC
E006
-IQ
225A
_E01
_.B
_09.
ab1
Nul
l N
ot id
entifi
ed
Nul
l N
ull
Nul
l N
ull
Nul
l N
ull
TCE0
06-I
Q22
5A_E
05_.
B_0
9.ab
1 Tc
00.1
0470
5350
6741
.50
Muc
in T
cMU
CII
, put
ativ
e N
ull
Nul
l N
ull
Nul
l
TCE0
06-I
Q22
5A_E
06_.
B_1
0.ab
1 Tc
00.1
0470
5350
8347
.30
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
Nul
l N
ull
TC
E006
-IQ
225A
_F03
_.B
_11.
ab1
Tc00
.104
7053
5098
05.1
00
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
Nul
l N
ull
TC
E006
-IQ
225A
_G03
_.B
_13.
ab1
Tc00
.104
7053
5114
35.1
0 H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l N
ull
Nul
l
TCE0
06-I
Q22
5A_G
07_.
B_1
3.ab
1 N
ull
Not
iden
tified
N
ull
Nul
l N
ull
Nul
l N
ull
Nul
lTC
E006
-IQ
225A
_G12
_.B
_14.
ab1
Nul
l N
ot id
entifi
ed
Nul
l N
ull
Nul
l N
ull
Nul
l N
ull
Supp
lem
enta
ry M
ater
ial S
1. C
ontin
ued.
Con
tinue
d on
nex
t pag
e
1623
©FUNPEC-RP www.funpecrp.com.brGenetics and Molecular Research 10 (3): 1589-1630 (2011)
Transcriptome of Trypanosoma cruzi mammalian stages
Que
ry
Bes
t bla
stn
hit a
gain
st
Ann
otat
ion
Bes
t bla
stn
hit a
gain
st
EST-
stag
e B
est b
last
x hi
t aga
inst
Pr
oteo
me-
stag
e C
ontig
s R
eads
T.
cru
zi g
enes
and
con
tigs
T.
cru
zi E
STs
TCE0
06-I
Q22
5B_A
08_.
B_0
2.ab
1 25
661.
rrna
0000
1 R
NA
ribo
som
al
TEN
Q08
45 T
. cru
zi
Epim
astig
ote
210_
Am
a.m
eta.
epi
Am
astig
ote,
ep
imas
tigot
e no
rmal
ized
m
etac
yclic
,
cD
NA
libr
ary
T. c
ruzi
ep
imas
tigot
e
cDN
A, m
RN
A se
quen
ce
TCE0
06-I
Q22
5B_C
12_.
B_0
6.ab
1 N
ull
Not
iden
tified
N
ull
Nul
l N
ull
Nul
l N
ull
Nul
lTC
E006
-IQ
225B
_E01
_.B
_09.
ab1
Tc00
.104
7053
5100
55.1
00
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
Nul
l N
ull
TC
E006
-IQ
225B
_E05
_.B
_09.
ab1
Tc00
.104
7053
5075
21.1
50
Dis
pers
ed g
ene
fam
ily p
rote
in
Nul
l N
ull
Nul
l N
ull
1
(DG
F-1)
, put
ativ
eTC
E006
-IQ
225B
_F07
_.B
_11.
ab1
Tc00
.104
7053
5092
47.3
0 H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l N
ull
Nul
l
TCE0
06-I
Q22
5B_G
10_.
B_1
4.ab
1 Tc
00.1
0470
5350
3909
.84
Cal
pain
-like
cys
tein
e
Nul
l N
ull
Nul
l N
ull
pe
ptid
ase,
put
ativ
eTC
E006
-IQ
225B
_H04
_.B
_16.
ab1
Tc00
.104
7053
5101
63.6
0 Tr
ans-
sial
idas
e, p
utat
ive
Nul
l N
ull
Nul
l N
ull
TC
E006
-IQ
225B
_H07
_.B
_15.
ab1
Nul
l N
ot id
entifi
ed
Nul
l N
ull
Nul
l N
ull
Nul
l N
ull
TCA
007-
IQ23
1B_E
06_.
B_1
0.ab
1 N
ull
Not
iden
tified
N
ull
Nul
l N
ull
Nul
l N
ull
Nul
l
Supp
lem
enta
ry M
ater
ial S
1. C
ontin
ued.
1624
©FUNPEC-RP www.funpecrp.com.brGenetics and Molecular Research 10 (3): 1589-1630 (2011)
C. Martins et al.Q
uery
Su
bjec
t A
nnot
atio
n ES
T m
atch
Pr
oteo
me
mat
ch
pfam
_acc
pf
am_i
d pf
am e
-val
ue
Con
tig3
Tc00
.104
7053
5040
39.1
10
Hyp
othe
tical
pro
tein
N
ull
Nul
l
C
ontig
19
Tc00
.104
7053
5104
31.1
90
Hyp
othe
tical
pro
tein
N
ull
Nul
l
C
ontig
24
Tc00
.104
7053
5116
11.1
1 H
ypot
hetic
al p
rote
in
Nul
l N
ull
Con
tig25
Tc
00.1
0470
5350
3975
.20
Hyp
othe
tical
pro
tein
N
ull
Nul
l
C
ontig
115
Tc00
.104
7053
5116
75.3
H
ypot
hetic
al p
rote
in
Nul
l N
ull
PF01
456
Muc
in-li
ke g
lyco
prot
ein
4.8e
-07
Con
tig32
Tc
00.1
0470
5350
9473
.10
Hyp
othe
tical
pro
tein
(pse
udog
ene)
N
ull
Nul
l
C
ontig
12
Tc00
.104
7053
5064
01.2
10
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
Con
tig14
Tc
00.1
0470
5350
4137
.150
H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l PF
0300
6 H
emol
ysin
-III
rela
ted
4.2e
-45
C
ontig
18
Tc00
.104
7053
5099
85.6
0 H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l
C
ontig
20
Tc00
.104
7053
5068
25.6
0 H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l
C
ontig
22
Tc00
.104
7053
5107
59.1
80
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
Con
tig23
Tc
00.1
0470
5350
4929
.5
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
PF00
642
Zinc
fing
er
5.6e
-08
C-x
8-C
-x5-
C-x
3-H
type
C
ontig
35
Tc00
.104
7053
5101
01.3
40
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
C
ontig
38
Tc00
.104
7053
5086
47.2
20
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
Con
tig41
Tc
00.1
0470
5350
7099
.40
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
PF00
622/
PF00
622/
PF00
632
SPRY
dom
ain/
SPRY
9.
3e-1
7/1.
9e-1
7/1.
2e-4
7
do
mai
n/H
ECT-
dom
ain
(ubi
quiti
n-tra
nsfe
rase
)C
ontig
42
Tc00
.104
7053
5112
83.2
20
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
PF08
512
His
tone
cha
pero
ne
1.7e
-33
Rttp
106-
like
Con
tig54
Tc
00.1
0470
5350
9171
.60
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
C
ontig
55
Tc00
.104
7053
5066
49.9
0 H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l C
ontig
58
Tc00
.104
7053
5064
05.1
40
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
PF02
990
Endo
mem
bran
e pr
otei
n 70
2.
4e-1
08C
ontig
59
Tc00
.104
7053
5066
61.5
0 H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l
C
ontig
64
Tc00
.104
7053
5040
71.3
0 H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l PF
0050
1 A
MP-
bind
ing
enzy
me
4.00
E-07
Con
tig74
Tc
00.1
0470
5350
9455
.30
Hyp
othe
tical
pro
tein
, con
serv
ed
TEN
S056
8 T.
cru
zi e
pim
astig
ote
N
ull
no
rmal
ized
cD
NA
libr
ary
T. c
ruzi
cDN
A c
lone
568
5',
mR
NA
sequ
ence
Con
tig81
Tc
00.1
0470
5351
0329
.140
H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l
C
ontig
96
Tc00
.104
7053
5077
15.2
0 H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l
C
ontig
107
Tc00
.104
7053
5101
87.2
10
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
PF04
515
Prot
ein
of u
nkno
wn
1.3e
-50
func
tion,
DU
F580
Con
tig10
8 Tc
00.1
0470
5351
1283
.240
H
ypot
hetic
al p
rote
in, c
onse
rved
TE
NU
0619
T. c
ruzi
epi
mas
tigot
e
Nul
l
norm
aliz
ed c
DN
A li
brar
y T.
cru
zi
cD
NA
clo
ne 3
c1 3
', m
RN
A se
quen
ceC
ontig
116
Tc00
.104
7053
5064
81.3
0 H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l
C
ontig
9 Tc
00.1
0470
5350
6591
.50
Hyp
othe
tical
pro
tein
, N
ull
Nul
l
co
nser
ved
(pse
udog
ene)
Con
tig57
Tc
00.1
0470
5350
8367
.50
Hyp
othe
tical
pro
tein
, TE
NU
1365
T. c
ruzi
epi
mas
tigot
e
Nul
l
co
nser
ved
(pse
udog
ene)
no
rmal
ized
cD
NA
libr
ary
T. c
ruzi
cDN
A c
lone
21e
3 3'
, mR
NA
sequ
ence
Con
tig37
Tc
00.1
0470
5339
9389
.10
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
Con
tig77
Tc
00.1
0470
5350
7083
.109
H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l
Supp
lem
enta
ry M
ater
ial S
2. P
fam
- O
RES
TES
cont
igs.
1625
©FUNPEC-RP www.funpecrp.com.brGenetics and Molecular Research 10 (3): 1589-1630 (2011)
Transcriptome of Trypanosoma cruzi mammalian stages
Que
ry
Subj
ect
Ann
otat
ion
EST
mat
ch
Prot
eom
e m
atch
pf
am_a
cc
pfam
_id
pfam
e-v
alue
TCA
006-
IQ22
5C_B
03_.
B_0
3.ab
1 Tc
00.1
0470
5350
4277
.30
Hyp
othe
tical
pro
tein
N
ull
Nul
l
TC
A00
6-IQ
225C
_C03
_.B
_05.
ab1
Tc00
.104
7053
5043
93.1
0 H
ypot
hetic
al p
rote
in
TEN
G05
14 T
. cru
zi e
pim
astig
ote
N
ull
no
rmal
ised
cD
NA
libr
ary
T. c
ruzi
cDN
A c
lone
n89
4.r 5
',
mR
NA
sequ
ence
TC
A00
6-IQ
225C
_C04
_.B
_06.
ab1
Tc00
.104
7053
5067
97.2
0 H
ypot
hetic
al p
rote
in
Nul
l N
ull
TCA
006-
IQ22
5C_D
04_.
B_0
8.ab
1 Tc
00.1
0470
5350
8811
.55
Hyp
othe
tical
pro
tein
am
astc
-122
5 Tc
AM
T. c
ruzi
N
ull
cD
NA
clo
ne 9
G8
5',
m
RN
A se
quen
ce
TCA
006-
IQ22
5C_D
08_.
B_0
8.ab
1 Tc
00.1
0470
5350
7641
.250
H
ypot
hetic
al p
rote
in
Nul
l N
ull
PF04
749
PLA
C8
fam
ily
6.30
E-06
TCA
006-
IQ22
5C_E
04_.
B_1
0.ab
1 Tc
00.1
0470
5350
4277
.30
Hyp
othe
tical
pro
tein
SG
-4-6
T. c
ruzi
diff
eren
tial d
ispl
ay
Nul
l
cDN
A li
brar
y T.
cru
zi c
DN
A,
m
RN
A se
quen
ce
TCA
006-
IQ22
5E_A
03_.
B_0
1.ab
1 Tc
00.1
0470
5350
7953
.130
H
ypot
hetic
al p
rote
in
TEN
U40
24 T
. cru
zi e
pim
astig
ote
N
ull
no
rmal
ized
cD
NA
libr
ary
T. c
ruzi
cDN
A c
lone
1l2
3',
m
RN
A se
quen
ceTC
A00
6-IQ
225E
_A06
_.B
_02.
ab1
Tc00
.104
7053
5082
13.3
0 H
ypot
hetic
al p
rote
in
Nul
l N
ull
TCA
006-
IQ22
5E_A
10_.
B_0
2.ab
1 Tc
00.1
0470
5350
9433
.30
Hyp
othe
tical
pro
tein
N
ull
Nul
l
TC
A00
6-IQ
225G
_C10
_.b_
06.a
b1
Tc00
.104
7053
5118
25.1
10
Hyp
othe
tical
pro
tein
TE
NF0
697
T. c
ruzi
epi
mas
tigot
e
Nul
l
norm
aliz
ed c
DN
A li
brar
y T.
cru
zi
cD
NA
clo
ne 6
97 5
',
mR
NA
sequ
ence
TC
A00
7-IQ
231B
_A07
_.B
_01.
ab1
Tc00
.104
7053
5088
11.1
0 H
ypot
hetic
al p
rote
in
Nul
l N
ull
TCA
008-
IQ22
9B_F
03_.
B_1
1.ab
1 Tc
00.1
0470
5350
4153
.370
H
ypot
hetic
al p
rote
in
Nul
l N
ull
TCA
009-
IQ23
0B_E
09_.
B_0
9.ab
1 Tc
00.1
0470
5351
1873
.5
Hyp
othe
tical
pro
tein
N
ull
Nul
l
TC
A00
9-IQ
230C
_B04
_.B
_04.
ab1
Tc00
.104
7053
5104
07.7
0 H
ypot
hetic
al p
rote
in
Nul
l N
ull
PF00
787
PX d
omai
n 2.
9e-0
6TC
A00
9-IQ
230E
_A04
_.B
_02.
ab1
Tc00
.104
7053
5088
69.1
0 H
ypot
hetic
al p
rote
in
TEN
U34
57 T
. cru
zi e
pim
astig
ote
N
ull
no
rmal
ized
cD
NA
libr
ary
T. c
ruzi
cDN
A c
lone
29g
24 5
’ sim
ilar t
o
T. c
ruzi
85
kD su
rfac
e an
tigen
(gp8
5) g
b|M
6483
6|TR
B85
KD
,
mR
NA
sequ
ence
TC
A00
9-IQ
230E
_C04
_.B
_06.
ab1
Tc00
.104
7053
5063
67.1
70
Hyp
othe
tical
pro
tein
N
ull
Nul
l
TC
A00
9-IQ
230E
_H01
_.B
_15.
ab1
Tc00
.104
7053
5074
29.6
0 H
ypot
hetic
al p
rote
in
Nul
l N
ull
TCA
236-
IQ00
1_C
07_.
B_0
5.ab
1 Tc
00.1
0470
5350
6503
.53
Hyp
othe
tical
pro
tein
N
ull
Nul
l
TC
A23
6-IQ
001_
D08
_.B
_08.
ab1
Tc00
.104
7053
5104
99.2
0 H
ypot
hetic
al p
rote
in
Nul
l N
ull
Supp
lem
enta
ry M
ater
ial S
2. P
fam
- O
RES
TES
sing
leto
ns.
Con
tinue
d on
nex
t pag
e
1626
©FUNPEC-RP www.funpecrp.com.brGenetics and Molecular Research 10 (3): 1589-1630 (2011)
C. Martins et al.
Que
ry
Subj
ect
Ann
otat
ion
EST
mat
ch
Prot
eom
e m
atch
pf
am_a
cc
pfam
_id
pfam
e-v
alue
TCE0
06-I
Q22
5A_C
07_.
B_0
5.ab
1 Tc
00.1
0470
5350
6825
.190
H
ypot
hetic
al p
rote
in
TEN
U28
35 T
. cru
zi e
pim
astig
ote
N
ull
no
rmal
ized
cD
NA
libr
ary
T. c
ruzi
cDN
A c
lone
27d
2 5'
,
mR
NA
sequ
ence
TCA
009-
IQ23
0E_H
02_.
B_1
6.ab
1 Tc
00.1
0470
5350
9473
.10
Hyp
othe
tical
pro
tein
(pse
udog
ene)
N
ull
Nul
l
TC
A00
6-IQ
225C
_A07
_.B
_01.
ab1
Tc00
.104
7053
5065
31.4
0 H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l PF
0027
1/PF
0440
8 H
elic
ase
cons
erve
d 2.
3e-0
8/2.
2e-0
8
C
-term
inal
dom
ain/
Hel
icas
e as
soci
ated
do
mai
n (H
A2)
TC
A00
6-IQ
225C
_A12
_.B
_02.
ab1
Tc00
.104
7053
5101
23.1
0 H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l
TC
A00
6-IQ
225C
_B08
_.B
_04.
ab1
Tc00
.104
7053
5075
83.6
0 H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l
TC
A00
6-IQ
225C
_B09
_.B
_03.
ab1
Tc00
.104
7053
5079
41.1
00
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
TCA
006-
IQ22
5C_C
01_.
B_0
5.ab
1 Tc
00.1
0470
5351
0101
.340
H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l
TC
A00
6-IQ
225C
_C05
_.B
_05.
ab1
Tc00
.104
7053
4040
01.2
0 H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l
TC
A00
6-IQ
225C
_C06
_.B
_06.
ab1
Tc00
.104
7053
5062
03.4
0 H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l
TC
A00
6-IQ
225C
_D01
_.B
_07.
ab1
Tc00
.104
7053
5108
77.1
90
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
TCA
006-
IQ22
5C_D
10_.
B_0
8.ab
1 Tc
00.1
0470
5350
6401
.240
H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l
TC
A00
6-IQ
225C
_E01
_.B
_09.
ab1
Tc00
.104
7053
5105
65.1
79
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
TCA
006-
IQ22
5C_E
08_.
B_1
0.ab
1 Tc
00.1
0470
5350
8781
.70
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
PF01
529
DH
HC
zin
c
4.40
E-29
finge
r dom
ain
TCA
006-
IQ22
5C_G
02_.
B_1
4.ab
1 Tc
00.1
0470
5350
4929
.5
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
PF00
642
Zinc
fing
er
5.60
E-08
C-x
8-C
-x5-
C-x
3-H
ty
pe (a
nd si
mila
r)TC
A00
6-IQ
225C
_G05
_.B
_13.
ab1
Tc00
.104
7053
5069
49.2
0 H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
435_
Met
a.ep
i
TC
A00
6-IQ
225C
_G06
_.B
_14.
ab1
Tc00
.104
7053
5111
33.3
0 H
ypot
hetic
al p
rote
in, c
onse
rved
TE
NU
0543
T. c
ruzi
epi
mas
tigot
e
Nul
l
norm
aliz
ed c
DN
A li
brar
y T.
cru
zi
cD
NA
clo
ne 8
h4 3
’,
mR
NA
sequ
ence
TC
A00
6-IQ
225C
_H02
_.B
_16.
ab1
Tc00
.104
7053
5097
75.3
0 H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l
TC
A00
6-IQ
225C
_H05
_.B
_15.
ab1
Tc00
.104
7053
5082
77.3
80
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
TCA
006-
IQ22
5E_A
04_.
B_0
2.ab
1 Tc
00.1
0470
5350
3697
.60
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
TCA
006-
IQ22
5E_A
09_.
B_0
1.ab
1 Tc
00.1
0470
5350
8949
.10
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
TCA
006-
IQ22
5E_B
02_.
B_0
4.ab
1 Tc
00.1
0470
5351
0101
.340
H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l
TC
A00
6-IQ
225E
_C09
_.B
_05.
ab1
Tc00
.104
7053
5063
21.2
70
Hyp
othe
tical
pro
tein
, con
serv
ed
TEN
U22
75 T
. cru
zi e
pim
astig
ote
N
ull
PF00
097
Zinc
fing
er,
3.40
E-06
no
rmal
ized
cD
NA
libr
ary
T. c
ruzi
C
3HC
4 ty
pe
cD
NA
clo
ne 1
0k16
3’,
(RIN
G fi
nger
)
mR
NA
sequ
ence
Supp
lem
enta
ry M
ater
ial S
2. C
ontin
ued.
Con
tinue
d on
nex
t pag
e
1627
©FUNPEC-RP www.funpecrp.com.brGenetics and Molecular Research 10 (3): 1589-1630 (2011)
Transcriptome of Trypanosoma cruzi mammalian stages
Que
ry
Subj
ect
Ann
otat
ion
EST
mat
ch
Prot
eom
e m
atch
pf
am_a
cc
pfam
_id
pfam
e-v
alue
TCA
006-
IQ22
5E_E
08_.
B_1
0.ab
1 Tc
00.1
0470
5350
8173
.264
H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l
TC
A00
6-IQ
225E
_F07
_.B
_11.
ab1
Tc00
.104
7053
5087
19.9
H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l
TC
A00
6-IQ
225E
_F11
_.B
_11.
ab1
Tc00
.104
7053
5078
09.6
0 H
ypot
hetic
al p
rote
in, c
onse
rved
TE
NS2
289
T. c
ruzi
epi
mas
tigot
e
norm
aliz
ed c
DN
A li
brar
y T.
cru
zi
cD
NA
clo
ne 2
289
5',
m
RN
A se
quen
ce
Nul
l
TC
A00
6-IQ
225E
_G03
_.B
_13.
ab1
Tc00
.104
7053
5108
57.1
0 H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l PF
0075
5 C
holin
e/C
arni
tine
2.
80E-
10
o-
acyl
trans
fera
seTC
A00
6-IQ
225E
_G04
_.B
_14.
ab1
Tc00
.104
7053
5091
71.7
0 H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l
TC
A00
6-IQ
225E
_H01
_.B
_15.
ab1
Tc00
.104
7053
4539
17.9
H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
33_T
rypo
.met
a
TC
A00
6-IQ
225E
_H02
_.B
_16.
ab1
Tc00
.104
7053
5067
95.9
0 H
ypot
hetic
al p
rote
in, c
onse
rved
TE
NU
0078
T. c
ruzi
epi
mas
tigot
e
Nul
l
norm
aliz
ed c
DN
A li
brar
y T.
cru
zi
cD
NA
clo
ne 5
e11
3',
m
RN
A se
quen
ceTC
A00
6-IQ
225E
_H06
_.B
_16.
ab1
Tc00
.104
7053
5038
93.1
70
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
TCA
006-
IQ22
5E_H
07_.
B_1
5.ab
1 Tc
00.1
0470
5350
9451
.70
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
TCA
006-
IQ22
5F_A
04_.
b_02
.ab1
Tc
00.1
0470
5351
0101
.340
H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l
TC
A00
6-IQ
225F
_C07
_.b_
05.a
b1
Tc00
.104
7053
5101
01.3
40
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
TCA
006-
IQ22
5F_D
03_.
b_07
.ab1
Tc
00.1
0470
5350
9641
.60
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
TCA
006-
IQ22
5F_G
01_.
b_13
.ab1
Tc
00.1
0470
5350
6797
.10
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
PF00
622
SPRY
dom
ain
2.20
E-09
TCA
006-
IQ22
5F_G
08_.
b_14
.ab1
Tc
00.1
0470
5350
4797
.144
H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l
TC
A00
6-IQ
225F
_H07
_.b_
15.a
b1
Tc00
.104
7053
5044
27.3
0 H
ypot
hetic
al p
rote
in, c
onse
rved
TE
NU
1196
T. c
ruzi
epi
mas
tigot
e
Nul
l
norm
aliz
ed c
DN
A li
brar
y T.
cru
zi
cD
NA
clo
ne 2
0i24
3’,
m
RN
A se
quen
ce
TCA
006-
IQ22
5G_A
01_.
b_01
.ab1
Tc
00.1
0470
5351
0423
.40
Hyp
othe
tical
pro
tein
, con
serv
ed
amas
tc-1
111
TcA
M T
. cru
zi
Nul
l
cDN
A c
lone
8E1
1 5’
,
mR
NA
sequ
ence
TC
A00
6-IQ
225G
_A07
_.b_
01.a
b1
Tc00
.104
7053
5064
39.1
8 H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l
TC
A00
6-IQ
225G
_A08
_.b_
02.a
b1
Tc00
.104
7053
5064
01.2
90
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
TCA
006-
IQ22
5G_C
06_.
b_06
.ab1
Tc
00.1
0470
5351
0773
.9
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
TCA
006-
IQ22
5G_C
07_.
b_05
.ab1
Tc
00.1
0470
5341
4243
.20
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
TCA
006-
IQ22
5G_C
11_.
b_05
.ab1
Tc
00.1
0470
5350
9169
.10
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
TCA
006-
IQ22
5G_D
06_.
b_08
.ab1
Tc
00.1
0470
5350
6305
.5
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
TCA
006-
IQ22
5G_E
02_.
b_10
.ab1
Tc
00.1
0470
5350
9589
.50
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
PF07
202
T-co
mpl
ex p
rote
in
3.90
E-31
10 C
-term
inus
TCA
006-
IQ22
5G_E
05_.
b_09
.ab1
Tc
00.1
0470
5350
7963
.30
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
Supp
lem
enta
ry M
ater
ial S
2. C
ontin
ued.
Con
tinue
d on
nex
t pag
e
1628
©FUNPEC-RP www.funpecrp.com.brGenetics and Molecular Research 10 (3): 1589-1630 (2011)
C. Martins et al.
Que
ry
Subj
ect
Ann
otat
ion
EST
mat
ch
Prot
eom
e m
atch
pf
am_a
cc
pfam
_id
pfam
e-v
alue
TCA
006-
IQ22
5G_F
08_.
b_12
.ab1
Tc
00.1
0470
5350
8731
.40
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
TCA
006-
IQ22
5G_F
09_.
b_11
.ab1
Tc
00.1
0470
5351
1001
.80
Hyp
othe
tical
pro
tein
, con
serv
ed
amas
tc-1
56 T
cAM
T. c
ruzi
N
ull
cD
NA
clo
ne 1
1H1
5',
m
RN
A se
quen
ce
TCA
006-
IQ22
5G_F
12_.
b_12
.ab1
Tc
00.1
0470
5350
3833
.50
Hyp
othe
tical
pro
tein
, con
serv
ed
TEN
G01
26 T
. cru
zi e
pim
astig
ote
N
ull
no
rmal
ised
cD
NA
libr
ary
T. c
ruzi
cDN
A c
lone
n12
6.r 5
',
mR
NA
sequ
ence
TC
A00
6-IQ
225G
_H04
_.b_
16.a
b1
Tc00
.104
7053
5061
55.8
0 H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l
TC
A00
7-IQ
231A
_A12
_.B
_02.
ab1
Tc00
.104
7053
5108
77.1
60
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
TCA
007-
IQ23
1A_B
08_.
B_0
4.ab
1 Tc
00.1
0470
5351
1115
.10
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
PF02
138
Bei
ge/B
EAC
H
3.10
E-10
7
do
mai
n TC
A00
7-IQ
231A
_F06
_.B
_12.
ab1
Tc00
.104
7053
5110
37.4
0 H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l
TCA
007-
IQ23
1A_H
07_.
B_1
5.ab
1 Tc
00.1
0470
5350
5789
.20
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
TCA
007-
IQ23
1A_H
10_.
B_1
6.ab
1 Tc
00.1
0470
5351
0311
.40
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
TCA
007-
IQ23
1B_C
02_.
B_0
6.ab
1 Tc
00.1
0470
5351
0297
.110
H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l PF
0002
7/
cycl
ic n
ucle
otid
e-
9.1e
-17
/ 2.6
e-06
PF
0002
7 bi
ndin
g do
mai
n/
C
2 do
mai
n TC
A00
7-IQ
231B
_F11
_.B
_11.
ab1
Tc00
.104
7053
5113
65.9
0 H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l
TC
A00
7-IQ
231B
_H10
_.B
_16.
ab1
Tc00
.104
7053
5081
11.4
0 H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l PF
0063
2 H
ECT-
dom
ain
5.
10E-
20
(u
biqu
itin-
trans
fera
se)
TCA
008-
IQ22
9A_A
06_.
B_0
2.ab
1 Tc
00.1
0470
5350
8811
.30
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
TCA
008-
IQ22
9A_B
06_.
B_0
4.ab
1 Tc
00.1
0470
5350
6247
.160
H
ypot
hetic
al p
rote
in, c
onse
rved
TE
NU
3645
T. c
ruzi
epi
mas
tigot
e
Nul
l
norm
aliz
ed c
DN
A li
brar
y T.
cru
zi
cD
NA
clo
ne 2
9l21
5',
m
RN
A se
quen
ceTC
A00
8-IQ
229A
_B12
_.B
_04.
ab1
Tc00
.104
7053
5115
33.1
0 H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l
TC
A00
8-IQ
229A
_C11
_.B
_05.
ab1
Tc00
.104
7053
5063
21.2
80
Hyp
othe
tical
pro
tein
, con
serv
ed
TEN
U38
97 T
. cru
zi e
pim
astig
ote
N
ull
no
rmal
ized
cD
NA
libr
ary
T. c
ruzi
cDN
A c
lone
42g
4 5’
,
mR
NA
sequ
ence
TC
A00
8-IQ
229A
_F08
_.B
_12.
ab1
Tc00
.104
7053
5110
71.2
02
Hyp
othe
tical
pro
tein
, con
serv
ed
TcTR
-113
3 Tc
TR T
. cru
zi
Nul
l
cDN
A c
lone
09k
20 5
’,
mR
NA
sequ
ence
TC
A00
8-IQ
229B
_A09
_.B
_01.
ab1
Tc00
.104
7053
5065
59.3
60
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
Con
tinue
d on
nex
t pag
e
Supp
lem
enta
ry M
ater
ial S
2. C
ontin
ued.
1629
©FUNPEC-RP www.funpecrp.com.brGenetics and Molecular Research 10 (3): 1589-1630 (2011)
Transcriptome of Trypanosoma cruzi mammalian stages
Que
ry
Subj
ect
Ann
otat
ion
EST
mat
ch
Prot
eom
e m
atch
pf
am_a
cc
pfam
_id
pfam
e-v
alue
TCA
008-
IQ22
9B_C
09_.
B_0
5.ab
1 Tc
00.1
0470
5351
1467
.30
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
PF03
031
NLI
inte
ract
ing
2.
30E-
10
fa
ctor
-like
phos
phat
ase
TCA
008-
IQ22
9B_C
10_.
B_0
6.ab
1 Tc
00.1
0470
5350
6825
.219
H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l
TC
A00
8-IQ
229B
_F01
_.B
_11.
ab1
Tc00
.104
7053
5114
27.1
9 H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l
TC
A00
8-IQ
229B
_F11
_.B
_11.
ab1
Tc00
.104
7053
5067
29.9
0 H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l PF
0498
1 N
MD
3 fa
mily
3.
10E-
89TC
A00
8-IQ
229B
_G06
_.B
_14.
ab1
Tc00
.104
7053
5089
53.1
0 H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l PF
0769
0 tra
nspo
rter,
maj
or
9.70
E-17
faci
litat
or fa
mily
TCA
009-
IQ23
0A_A
09_.
B_0
1.ab
1 Tc
00.1
0470
5350
8739
.40
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
TCA
009-
IQ23
0A_D
06_.
B_0
8.ab
1 Tc
00.1
0470
5350
4047
.50
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
TCA
009-
IQ23
0A_D
08_.
B_0
8.ab
1 Tc
00.1
0470
5350
7817
.60
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
TCA
009-
IQ23
0B_C
09_.
B_0
5.ab
1 Tc
00.1
0470
5350
7483
.80
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
PF05
889
Solu
ble
liver
ant
igen
/ 3.
50E-
96
liv
er p
ancr
eas a
ntig
en
(SLA
/LP
auto
antig
en)
TCA
009-
IQ23
0B_D
05_.
B_0
7.ab
1 Tc
00.1
0470
5350
3697
.70
Hyp
othe
tical
pro
tein
, con
serv
ed
TEN
Q08
90 T
. cru
zi e
pim
astig
ote
18
4_A
ma.
met
a
norm
aliz
ed c
DN
A li
brar
y T.
cru
zi
cD
NA
, mR
NA
sequ
ence
TC
A00
9-IQ
230B
_G02
_.B
_14.
ab1
Tc00
.104
7053
5083
21.4
0 H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l
TC
A00
9-IQ
230C
_D08
_.B
_08.
ab1
Tc00
.104
7053
5092
05.1
20
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
PF02
129
/ X
-Pro
dip
eptid
yl-
4E-8
6 / 1
.5e-
40
PF08
530
pept
idas
e
(S
15 fa
mily
) /
X-P
ro d
ipep
tidyl
-
pe
ptid
ase
C
-term
inal
non
-
ca
taly
tic d
omai
nTC
A00
9-IQ
230C
_E04
_.B
_10.
ab1
Tc00
.104
7053
5115
85.1
0 H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l
TC
A00
9-IQ
230D
_F11
_.B
_11.
ab1
Tc00
.104
7053
5101
21.9
0 H
ypot
hetic
al p
rote
in, c
onse
rved
TE
NU
3409
T. c
ruzi
epi
mas
tigot
e
Nul
l
norm
aliz
ed c
DN
A li
brar
y T.
cru
zi
cD
NA
clo
ne 2
9b3
5',
m
RN
A se
quen
ce
TCA
009-
IQ23
0E_B
12_.
B_0
4.ab
1 Tc
00.1
0470
5351
1657
.30
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
TCA
009-
IQ23
0E_C
12_.
B_0
6.ab
1 Tc
00.1
0470
5351
1071
.202
H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l
TC
A00
9-IQ
230E
_F02
_.B
_12.
ab1
Tc00
.104
7053
5061
77.5
0 H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l
TC
A00
9-IQ
230E
_G03
_.B
_13.
ab1
Tc00
.104
7053
5105
65.1
79
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
TCA
009-
IQ23
0E_G
12_.
B_1
4.ab
1 Tc
00.1
0470
5350
6579
.40
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
PF07
690
Tran
spor
ter,
maj
or
2.60
E-07
faci
litat
or fa
mily
TCA
236-
IQ00
1_B
11_.
B_0
3.ab
1 Tc
00.1
0470
5350
4153
.270
H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l
Supp
lem
enta
ry M
ater
ial S
2. C
ontin
ued.
Con
tinue
d on
nex
t pag
e
1630
©FUNPEC-RP www.funpecrp.com.brGenetics and Molecular Research 10 (3): 1589-1630 (2011)
C. Martins et al.
Que
ry
Subj
ect
Ann
otat
ion
EST
mat
ch
Prot
eom
e m
atch
pf
am_a
cc
pfam
_id
pfam
e-v
alue
TCA
236-
IQ00
1_D
05_.
B_0
7.ab
1 Tc
00.1
0470
5350
7641
.160
H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l PF
0150
5 M
ajor
Vau
lt
9.60
E-06
Prot
ein
repe
atTC
A23
6-IQ
001_
F11_
.B_1
1.ab
1 Tc
00.1
0470
5350
4643
.20
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
PF01
479
S4 d
omai
n 5.
60E-
06TC
A23
6-IQ
001_
G06
_.B
_14.
ab1
Tc00
.104
7053
5065
05.3
0 H
ypot
hetic
al p
rote
in, c
onse
rved
TE
NG
0388
T. c
ruzi
epi
mas
tigot
e
Nul
l
norm
alis
ed c
DN
A li
brar
y T.
cru
zi
cD
NA
clo
ne n
731.
r 5',
m
RN
A se
quen
ce
TCA
236-
IQ00
1_G
11_.
B_1
3.ab
1 Tc
00.1
0470
5350
7979
.50
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
TCA
236-
IQ00
1_H
01_.
B_1
5.ab
1 Tc
00.1
0470
5350
5555
.40
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
TCA
236-
IQ00
1_H
05_.
B_1
5.ab
1 Tc
00.1
0470
5351
0055
.150
H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l
TC
A23
6-IQ
001_
H07
_.B
_15.
ab1
Tc00
.104
7053
5062
65.6
0 H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l
TC
E006
-IQ
225A
_C10
_.B
_06.
ab1
Tc00
.104
7053
5041
37.4
0 H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l
TC
E006
-IQ
225A
_E06
_.B
_10.
ab1
Tc00
.104
7053
5083
47.3
0 H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l
TC
E006
-IQ
225A
_F03
_.B
_11.
ab1
Tc00
.104
7053
5098
05.1
00
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
TCE0
06-I
Q22
5A_G
03_.
B_1
3.ab
1 Tc
00.1
0470
5351
1435
.10
Hyp
othe
tical
pro
tein
, con
serv
ed
Nul
l N
ull
TCE0
06-I
Q22
5B_E
01_.
B_0
9.ab
1 Tc
00.1
0470
5351
0055
.100
H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l
TC
E006
-IQ
225B
_F07
_.B
_11.
ab1
Tc00
.104
7053
5092
47.3
0 H
ypot
hetic
al p
rote
in, c
onse
rved
N
ull
Nul
l PF
0405
4 CC
R4-N
ot c
ompl
ex
8.70
E-48
com
pone
nt, N
ot1
TCA
006-
IQ22
5C_E
06_.
B_1
0.ab
1 Tc
00.1
0470
5351
0533
.210
H
ypot
hetic
al p
rote
in, c
onse
rved
(pse
udog
ene)
N
ull
Nul
l
TC
A00
6-IQ
225G
_G07
_.b_
13.a
b1
Tc00
.104
7053
5041
31.1
0 H
ypot
hetic
al p
rote
in, c
onse
rved
(pse
udog
ene)
N
ull
Nul
l
TC
A00
8-IQ
229B
_G01
_.B
_13.
ab1
Tc00
.104
7053
5098
91.7
0 H
ypot
hetic
al p
rote
in, c
onse
rved
(pse
udog
ene)
N
ull
Nul
l
TC
A00
9-IQ
230B
_E10
_.B
_10.
ab1
Tc00
.104
7053
5106
45.4
0 H
ypot
hetic
al p
rote
in, c
onse
rved
(pse
udog
ene)
N
ull
Nul
l
Supp
lem
enta
ry M
ater
ial S
2. C
ontin
ued.