RESUMO
BACKGROUND: The apicomplexan parasite Theileria parva causes a livestock disease called East coast fever (ECF), with millions of animals at risk in sub-Saharan East and Southern Africa, the geographic distribution of T. parva. Over a million bovines die each year of ECF, with a tremendous economic burden to pastoralists in endemic countries. Comprehensive, accurate parasite genome annotation can facilitate the discovery of novel chemotherapeutic targets for disease treatment, as well as elucidate the biology of the parasite. However, genome annotation remains a significant challenge because of limitations in the quality and quantity of the data being used to inform the location and function of protein-coding genes and, when RNA data are used, the underlying biological complexity of the processes involved in gene expression. Here, we apply our recently published RNAseq dataset derived from the schizont life-cycle stage of T. parva to update structural and functional gene annotations across the entire nuclear genome. RESULTS: The re-annotation effort lead to evidence-supported updates in over half of all protein-coding sequence (CDS) predictions, including exon changes, gene merges and gene splitting, an increase in average CDS length of approximately 50 base pairs, and the identification of 128 new genes. Among the new genes identified were those involved in N-glycosylation, a process previously thought not to exist in this organism and a potentially new chemotherapeutic target pathway for treating ECF. Alternatively-spliced genes were identified, and antisense and multi-gene family transcription were extensively characterized. CONCLUSIONS: The process of re-annotation led to novel insights into the organization and expression profiles of protein-coding sequences in this parasite, and uncovered a minimal N-glycosylation pathway that changes our current understanding of the evolution of this post-translational modification in apicomplexan parasites.
Assuntos
Anotação de Sequência Molecular/métodos , Proteínas de Protozoários/genética , Proteínas de Protozoários/metabolismo , Theileria parva/genética , Processamento Alternativo , Animais , Redes Reguladoras de Genes , Genoma de Protozoário , Glicosilação , Gado/parasitologia , Análise de Sequência de RNA , Theileria parva/metabolismoRESUMO
We present the draft genome for the Rickettsia endosymbiont of Ixodes scapularis (REIS), a symbiont of the deer tick vector of Lyme disease in North America. Among Rickettsia species (Alphaproteobacteria: Rickettsiales), REIS has the largest genome sequenced to date (>2 Mb) and contains 2,309 genes across the chromosome and four plasmids (pREIS1 to pREIS4). The most remarkable finding within the REIS genome is the extraordinary proliferation of mobile genetic elements (MGEs), which contributes to a limited synteny with other Rickettsia genomes. In particular, an integrative conjugative element named RAGE (for Rickettsiales amplified genetic element), previously identified in scrub typhus rickettsiae (Orientia tsutsugamushi) genomes, is present on both the REIS chromosome and plasmids. Unlike the pseudogene-laden RAGEs of O. tsutsugamushi, REIS encodes nine conserved RAGEs that include F-like type IV secretion systems similar to that of the tra genes encoded in the Rickettsia bellii and R. massiliae genomes. An unparalleled abundance of encoded transposases (>650) relative to genome size, together with the RAGEs and other MGEs, comprise ~35% of the total genome, making REIS one of the most plastic and repetitive bacterial genomes sequenced to date. We present evidence that conserved rickettsial genes associated with an intracellular lifestyle were acquired via MGEs, especially the RAGE, through a continuum of genomic invasions. Robust phylogeny estimation suggests REIS is ancestral to the virulent spotted fever group of rickettsiae. As REIS is not known to invade vertebrate cells and has no known pathogenic effects on I. scapularis, its genome sequence provides insight on the origin of mechanisms of rickettsial pathogenicity.
Assuntos
Regulação Bacteriana da Expressão Gênica/fisiologia , Genoma Bacteriano , Sequências Repetitivas Dispersas , Ixodes/microbiologia , Rickettsia/genética , Animais , Vetores Aracnídeos/microbiologia , Evolução Biológica , Mapeamento Cromossômico , Cromossomos Bacterianos , Dados de Sequência Molecular , Plasmídeos , SimbioseRESUMO
BACKGROUND: Transmission of arthropod-borne apicomplexan parasites that cause disease and result in death or persistent infection represents a major challenge to global human and animal health. First described in 1901 as Piroplasma equi, this re-emergent apicomplexan parasite was renamed Babesia equi and subsequently Theileria equi, reflecting an uncertain taxonomy. Understanding mechanisms by which apicomplexan parasites evade immune or chemotherapeutic elimination is required for development of effective vaccines or chemotherapeutics. The continued risk of transmission of T. equi from clinically silent, persistently infected equids impedes the goal of returning the U. S. to non-endemic status. Therefore comparative genomic analysis of T. equi was undertaken to: 1) identify genes contributing to immune evasion and persistence in equid hosts, 2) identify genes involved in PBMC infection biology and 3) define the phylogenetic position of T. equi relative to sequenced apicomplexan parasites. RESULTS: The known immunodominant proteins, EMA1, 2 and 3 were discovered to belong to a ten member gene family with a mean amino acid identity, in pairwise comparisons, of 39%. Importantly, the amino acid diversity of EMAs is distributed throughout the length of the proteins. Eight of the EMA genes were simultaneously transcribed. As the agents that cause bovine theileriosis infect and transform host cell PBMCs, we confirmed that T. equi infects equine PBMCs, however, there is no evidence of host cell transformation. Indeed, a number of genes identified as potential manipulators of the host cell phenotype are absent from the T. equi genome. Comparative genomic analysis of T. equi revealed the phylogenetic positioning relative to seven apicomplexan parasites using deduced amino acid sequences from 150 genes placed it as a sister taxon to Theileria spp. CONCLUSIONS: The EMA family does not fit the paradigm for classical antigenic variation, and we propose a novel model describing the role of the EMA family in persistence. T. equi has lost the putative genes for host cell transformation, or the genes were acquired by T. parva and T. annulata after divergence from T. equi. Our analysis identified 50 genes that will be useful for definitive phylogenetic classification of T. equi and closely related organisms.
Assuntos
Genoma de Protozoário , Theileria/genética , Animais , Bovinos , Mapeamento Cromossômico , Cromossomos/genética , Cromossomos/metabolismo , Hibridização Genômica Comparativa , Metabolismo Energético/genética , Leucócitos Mononucleares/imunologia , Leucócitos Mononucleares/metabolismo , Fosfolipídeos/metabolismo , Filogenia , Proteínas de Protozoários/genética , Theileria/classificação , Theileriose/genética , Theileriose/metabolismo , Theileriose/parasitologiaRESUMO
BACKGROUND: The size and repetitive nature of the Rhipicephalus microplus genome makes obtaining a full genome sequence fiscally and technically problematic. To selectively obtain gene-enriched regions of this tick's genome, Cot filtration was performed, and Cot-filtered DNA was sequenced via 454 FLX pyrosequencing. RESULTS: The sequenced Cot-filtered genomic DNA was assembled with an EST-based gene index of 14,586 unique entries where each EST served as a potential "seed" for scaffold formation. The new sequence assembly extended the lengths of 3,913 of the 14,586 gene index entries. Over half of the extensions corresponded to extensions of over 30 amino acids. To survey the repetitive elements in the tick genome, the complete sequences of five BAC clones were determined. Both Class I and II transposable elements were found. Comparison of the BAC and Cot filtration data indicates that Cot filtration was highly successful in filtering repetitive DNA out of the genomic DNA used in 454 sequencing. CONCLUSION: Cot filtration is a very useful strategy to incorporate into genome sequencing projects on organisms with large genome sizes and which contain high percentages of repetitive, difficult to assemble, genomic DNA. Combining the Cot selection approach with 454 sequencing and assembly with a pre-existing EST database as seeds resulted in extensions of 27% of the members of the EST database.
Assuntos
Genoma/genética , Rhipicephalus/genética , Análise de Sequência de DNA/métodos , Animais , Bovinos , Cromossomos Artificiais Bacterianos/genética , Clonagem Molecular , DNA Complementar/genética , Genômica , Cinética , Hibridização de Ácido NucleicoRESUMO
Intracellular pathogens have evolved intricate mechanisms to subvert host cell signaling pathways and ensure their own propagation. A lineage of the protozoan parasite genus Theileria infects bovine leukocytes and induces their uncontrolled proliferation causing a leukemia-like disease. Given the importance of E2F transcription factors in mammalian cell cycle regulation, we investigated the role of E2F signaling in Theileria-induced host cell proliferation. Using comparative genomics and surface plasmon resonance, we identified parasite-derived peptides that have the sequence-specific ability to increase E2F signaling by binding E2F negative regulator Retinoblastoma-1 (RB). Using these peptides as a tool to probe host E2F signaling, we show that the disruption of RB complexes ex vivo leads to activation of E2F-driven transcription and increased leukocyte proliferation in an infection-dependent manner. This result is consistent with existing models and, together, they support a critical role of E2F signaling for Theileria-induced host cell proliferation, and its potential direct manipulation by one or more parasite proteins.
Assuntos
Fatores de Transcrição E2F/metabolismo , Leucócitos/citologia , Leucócitos/parasitologia , Transdução de Sinais , Theileria/fisiologia , Linhagem Celular , Proliferação de Células , Fator de Transcrição E2F1/metabolismoRESUMO
Babesia bovis is an apicomplexan tick-transmitted pathogen of cattle imposing a global risk and severe constraints to livestock health and economic development. The complete genome sequence was undertaken to facilitate vaccine antigen discovery, and to allow for comparative analysis with the related apicomplexan hemoprotozoa Theileria parva and Plasmodium falciparum. At 8.2 Mbp, the B. bovis genome is similar in size to that of Theileria spp. Structural features of the B. bovis and T. parva genomes are remarkably similar, and extensive synteny is present despite several chromosomal rearrangements. In contrast, B. bovis and P. falciparum, which have similar clinical and pathological features, have major differences in genome size, chromosome number, and gene complement. Chromosomal synteny with P. falciparum is limited to microregions. The B. bovis genome sequence has allowed wide scale analyses of the polymorphic variant erythrocyte surface antigen protein (ves1 gene) family that, similar to the P. falciparum var genes, is postulated to play a role in cytoadhesion, sequestration, and immune evasion. The approximately 150 ves1 genes are found in clusters that are distributed throughout each chromosome, with an increased concentration adjacent to a physical gap on chromosome 1 that contains multiple ves1-like sequences. ves1 clusters are frequently linked to a novel family of variant genes termed smorfs that may themselves contribute to immune evasion, may play a role in variant erythrocyte surface antigen protein biology, or both. Initial expression analysis of ves1 and smorf genes indicates coincident transcription of multiple variants. B. bovis displays a limited metabolic potential, with numerous missing pathways, including two pathways previously described for the P. falciparum apicoplast. This reduced metabolic potential is reflected in the B. bovis apicoplast, which appears to have fewer nuclear genes targeted to it than other apicoplast containing organisms. Finally, comparative analyses have identified several novel vaccine candidates including a positional homolog of p67 and SPAG-1, Theileria sporozoite antigens targeted for vaccine development. The genome sequence provides a greater understanding of B. bovis metabolism and potential avenues for drug therapies and vaccine development.
Assuntos
Babesia bovis/genética , DNA de Protozoário/análise , Genes de Protozoários , Plasmodium falciparum/genética , Theileria parva/genética , Animais , Antígenos de Protozoários/imunologia , Babesia bovis/imunologia , Babesia bovis/metabolismo , Babesiose/parasitologia , Sequência de Bases , Proteínas de Transporte/genética , Proteínas de Transporte/imunologia , Proteínas de Transporte/metabolismo , Cromossomos , DNA Complementar/análise , Evolução Molecular , Biblioteca Genômica , Dados de Sequência Molecular , Plasmodium falciparum/imunologia , Plasmodium falciparum/metabolismo , Proteínas de Protozoários/genética , Proteínas de Protozoários/imunologia , Proteínas de Protozoários/metabolismo , Análise de Sequência de DNA , Especificidade da Espécie , Sintenia , Theileria parva/imunologia , Theileria parva/metabolismoRESUMO
A population of Rhipicephalus (Boophilus) microplus (Canestrini) (Acari: Ixodidae), designated Coatzacoalcos, sampled from a ranch near Veracruz, Mexico, was found to possess a high level of resistance to pyrethroid-based acaricides. Bioassay and biochemical and molecular analysis had previously shown that resistance in this population could primarily be attributed to expression of a highly active metabolic esterase designated CzEST9. We cloned and sequenced the entire CzEST9 coding region, including introns and > 1.0 kb upstream from the transcription start site, and we compared the upstream region sequence between individual resistant and susceptible ticks from several populations with different pyrethroid resistance characteristics. In the 1.0-kb upstream region sequence, four variant nucleotides were found, and a TGA trinucleotide occurred as either four, five, or nine tandem repeats. However, none of these promoter region sequence differences could be clearly associated with a pyrethroid-resistant phenotype; thus, we concluded that differences in gene promoter sequence were not responsible for the pyrethroid resistance mechanism in the Cz strain. CzEST9 was expressed in recombinant Escherichia coli and Pichia pastoris systems and esterase activity was obtained in recombinant CzEST9 from the P. pastoris system.
Assuntos
Esterases/genética , Inseticidas/farmacologia , Ixodidae/enzimologia , Ixodidae/genética , Piretrinas/metabolismo , Piretrinas/farmacologia , Animais , Sequência de Bases , Sítios de Ligação , Primers do DNA , Esterases/metabolismo , Ixodidae/efeitos dos fármacos , Dados de Sequência Molecular , Proteínas Recombinantes/metabolismoRESUMO
BACKGROUND: The southern cattle tick, Rhipicephalus (Boophilus) microplus, is an economically important parasite of cattle and can transmit several pathogenic microorganisms to its cattle host during the feeding process. Understanding the biology and genomics of R. microplus is critical to developing novel methods for controlling these ticks. RESULTS: We present a global comparative genomic analysis of a gene index of R. microplus comprised of 13,643 unique transcripts assembled from 42,512 expressed sequence tags (ESTs), a significant fraction of the complement of R. microplus genes. The source material for these ESTs consisted of polyA RNA from various tissues, lifestages, and strains of R. microplus, including larvae exposed to heat, cold, host odor, and acaricide. Functional annotation using RPS-Blast analysis identified conserved protein domains in the conceptually translated gene index and assigned GO terms to those database transcripts which had informative BlastX hits. Blast Score Ratio and SimiTri analysis compared the conceptual transcriptome of the R. microplus database to other eukaryotic proteomes and EST databases, including those from 3 ticks. The most abundant protein domains in BmiGI were also analyzed by SimiTri methodology. CONCLUSION: These results indicate that a large fraction of BmiGI entries have no homologs in other sequenced genomes. Analysis with the PartiGene annotation pipeline showed 64% of the members of BmiGI could not be assigned GO annotation, thus minimal information is available about a significant fraction of the tick genome. This highlights the important insights in tick biology which are likely to result from a tick genome sequencing project. Global comparative analysis identified some tick genes with unexpected phylogenetic relationships which detailed analysis attributed to gene losses in some members of the animal kingdom. Some tick genes were identified which had close orthologues to mammalian genes. Members of this group would likely be poor choices as targets for development of novel tick control technology.
Assuntos
Etiquetas de Sequências Expressas , Genômica , Rhipicephalus/genética , Animais , Bovinos , Biblioteca Gênica , FilogeniaRESUMO
The southern cattle tick, Boophilus microplus (Canestrini), causes annual economic losses in the hundreds of millions of dollars to cattle producers throughout the world, and ranks as the most economically important tick from a global perspective. Control failures attributable to the development of pesticide resistance have become commonplace, and novel control technologies are needed. The availability of the genome sequence will facilitate the development of these new technologies, and we are proposing sequencing to a 4-6X draft coverage. Many existing biological resources are available to facilitate a genome sequencing project, including several inbred laboratory tick strains, a database of approximately 45,000 expressed sequence tags compiled into a B. microplus Gene Index, a bacterial artificial chromosome (BAC) library, an established B. microplus cell line, and genomic DNA suitable for library synthesis. Collaborative projects are underway to map BACs and cDNAs to specific chromosomes and to sequence selected BAC clones. When completed, the genome sequences from the cow, B. microplus, and the B. microplus-borne pathogens Babesia bovis and Anaplasma marginale will enhance studies of host-vector-pathogen systems. Genes involved in the regeneration of amputated tick limbs and transitions through developmental stages are largely unknown. Studies of these and other interesting biological questions will be advanced by tick genome sequence data. Comparative genomics offers the prospect of new insight into many, perhaps all, aspects of the biology of ticks and the pathogens they transmit to farm animals and people. The B. microplus genome sequence will fill a major gap in comparative genomics: a sequence from the Metastriata lineage of ticks. The purpose of the article is to synergize interest in and provide rationales for sequencing the genome of B. microplus and for publicizing currently available genomic resources for this tick.
Assuntos
Vetores Aracnídeos/genética , Genoma de Inseto/genética , Ixodidae/genética , Análise de Sequência de DNA/veterinária , Animais , Genômica , Humanos , Projetos de Pesquisa , Análise de Sequência de DNA/métodosRESUMO
Completion of the Babesia bovis (T2Bo strain) genome provides detailed data concerning the predicted proteome of this parasite, and allows for a bioinformatics approach to gene discovery. Comparative genomics of the hemoprotozoan parasites B. bovis and Theileria parva revealed a highly conserved syntenic block of genes flanking the p67 gene of T. parva, a sporozoite stage-specific vaccine candidate against East Coast fever (ECF). The syntenic gene in B. bovis, designated bov57, encodes a protein of limited amino acid sequence identity (11.8%) to p67. Monoclonal antibodies were produced against recombinant BOV57 and were used to demonstrate expression of BOV57 in merozoite and kinete stages of the T2Bo strain of B. bovis. Transcript levels of bov57 in kinetes were increased 100-fold in comparison to msa-1, a previously identified gene encoding an erythrocyte stage surface protein. Amino acid sequence comparisons between the T2Bo strain and two attenuated and virulent strains from Argentina and Australia revealed a high degree of sequence conservation in BOV57 among these geographically and pathogenically divergent isolates (97% amino acid sequence identity). Additional genomic comparisons show that the bov57 gene locus is also conserved in Babesia bigemina and Babesia equi. While not identifiable through amino acid or nucleotide sequence similarity, the conserved gene order within this locus in multiple piroplasms may suggest a critical function adapted for each species' unique host and life-cycle.