Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 16 de 16
Filter
Add more filters










Publication year range
1.
Science ; 381(6664): eadg7492, 2023 09 22.
Article in English | MEDLINE | ID: mdl-37733863

ABSTRACT

The vast majority of missense variants observed in the human genome are of unknown clinical significance. We present AlphaMissense, an adaptation of AlphaFold fine-tuned on human and primate variant population frequency databases to predict missense variant pathogenicity. By combining structural context and evolutionary conservation, our model achieves state-of-the-art results across a wide range of genetic and experimental benchmarks, all without explicitly training on such data. The average pathogenicity score of genes is also predictive for their cell essentiality, capable of identifying short essential genes that existing statistical approaches are underpowered to detect. As a resource to the community, we provide a database of predictions for all possible human single amino acid substitutions and classify 89% of missense variants as either likely benign or likely pathogenic.


Subject(s)
Amino Acid Substitution , Disease , Mutation, Missense , Proteome , Sequence Alignment , Humans , Amino Acid Substitution/genetics , Benchmarking , Conserved Sequence , Databases, Genetic , Disease/genetics , Genome, Human , Protein Conformation , Proteome/genetics , Sequence Alignment/methods , Machine Learning
2.
PLoS Comput Biol ; 16(2): e1007603, 2020 02.
Article in English | MEDLINE | ID: mdl-32053599

ABSTRACT

Analysing multiple cancer samples from an individual patient can provide insight into the way the disease evolves. Monitoring the expansion and contraction of distinct clones helps to reveal the mutations that initiate the disease and those that drive progression. Existing approaches for clonal tracking from sequencing data typically require the user to combine multiple tools that are not purpose-built for this task. Furthermore, most methods require a matched normal (non-tumour) sample, which limits the scope of application. We developed SuperFreq, a cancer exome sequencing analysis pipeline that integrates identification of somatic single nucleotide variants (SNVs) and copy number alterations (CNAs) and clonal tracking for both. SuperFreq does not require a matched normal and instead relies on unrelated controls. When analysing multiple samples from a single patient, SuperFreq cross checks variant calls to improve clonal tracking, which helps to separate somatic from germline variants, and to resolve overlapping CNA calls. To demonstrate our software we analysed 304 cancer-normal exome samples across 33 cancer types in The Cancer Genome Atlas (TCGA) and evaluated the quality of the SNV and CNA calls. We simulated clonal evolution through in silico mixing of cancer and normal samples in known proportion. We found that SuperFreq identified 93% of clones with a cellular fraction of at least 50% and mutations were assigned to the correct clone with high recall and precision. In addition, SuperFreq maintained a similar level of performance for most aspects of the analysis when run without a matched normal. SuperFreq is highly versatile and can be applied in many different experimental settings for the analysis of exomes and other capture libraries. We demonstrate an application of SuperFreq to leukaemia patients with diagnosis and relapse samples.


Subject(s)
Clonal Evolution , Mutation , Neoplasms/genetics , DNA Copy Number Variations , Humans , Polymorphism, Single Nucleotide
3.
Nucleic Acids Res ; 45(13): e122, 2017 Jul 27.
Article in English | MEDLINE | ID: mdl-28472340

ABSTRACT

Genome-wide transcriptome profiling has enabled non-supervised classification of tumours, revealing different sub-groups characterized by specific gene expression features. However, the biological significance of these subtypes remains for the most part unclear. We describe herein an interactive platform, Minimum Spanning Trees Inferred Clustering (MiSTIC), that integrates the direct visualization and comparison of the gene correlation structure between datasets, the analysis of the molecular causes underlying co-variations in gene expression in cancer samples, and the clinical annotation of tumour sets defined by the combined expression of selected biomarkers. We have used MiSTIC to highlight the roles of specific transcription factors in breast cancer subtype specification, to compare the aspects of tumour heterogeneity targeted by different prognostic signatures, and to highlight biomarker interactions in AML. A version of MiSTIC preloaded with datasets described herein can be accessed through a public web server (http://mistic.iric.ca); in addition, the MiSTIC software package can be obtained (github.com/iric-soft/MiSTIC) for local use with personalized datasets.


Subject(s)
Biomarkers, Tumor/genetics , Databases, Genetic/statistics & numerical data , Gene Expression Profiling/statistics & numerical data , Transcriptome/genetics , Biomarkers, Tumor/classification , Breast Neoplasms/classification , Breast Neoplasms/genetics , Cluster Analysis , Computational Biology , Female , Genome-Wide Association Study/statistics & numerical data , Humans , Leukemia, Myeloid, Acute/classification , Leukemia, Myeloid, Acute/genetics , Multigene Family , Prognosis , Software
4.
Stem Cell Reports ; 7(3): 571-582, 2016 09 13.
Article in English | MEDLINE | ID: mdl-27499199

ABSTRACT

Hematopoiesis is a multistage process involving the differentiation of stem and progenitor cells into distinct mature cell lineages. Here we present Haemopedia, an atlas of murine gene-expression data containing 54 hematopoietic cell types, covering all the mature lineages in hematopoiesis. We include rare cell populations such as eosinophils, mast cells, basophils, and megakaryocytes, and a broad collection of progenitor and stem cells. We show that lineage branching and maturation during hematopoiesis can be reconstructed using the expression patterns of small sets of genes. We also have identified genes with enriched expression in each of the mature blood cell lineages, many of which show conserved lineage-enriched expression in human hematopoiesis. We have created an online web portal called Haemosphere to make analyses of Haemopedia and other blood cell transcriptional datasets easier. This resource provides simple tools to interrogate gene-expression-based relationships between hematopoietic cell types and genes of interest.


Subject(s)
Blood Cells/cytology , Blood Cells/metabolism , Computational Biology , Gene Expression Regulation, Developmental , Hematopoiesis/genetics , Animals , Cell Differentiation/genetics , Cell Lineage/genetics , Cluster Analysis , Computational Biology/methods , Gene Expression Profiling , Humans , Mice , Web Browser
5.
Blood ; 126(6): 807-16, 2015 Aug 06.
Article in English | MEDLINE | ID: mdl-25995324

ABSTRACT

The thrombopoietic environment of the neonate is established during prenatal life; therefore, a comprehensive understanding of platelet-forming cell development during embryogenesis is critical to understanding the etiology of early-onset thrombocytopenia. The recent discovery that the first platelet-forming cells of the conceptus are not megakaryocytes (MKs) but diploid platelet-forming cells (DPFCs) revealed a previously unappreciated complexity in thrombopoiesis. This raises important questions, including the following. When do conventional MKs appear? Do pathogenic genetic lesions of adult MKs affect DPFCs? What role does myeloproliferative leukemia virus (MPL), a key regulator of adult megakaryopoiesis, play in prenatal platelet-forming lineages? We performed a comprehensive study to determine the spatial and temporal appearance of prenatal platelet-forming lineages. We demonstrate that DPFCs originate in the yolk sac and then rapidly migrate to other extra- and intraembryonic tissues. Using gene disruption models of Gata1 and Nfe2, we demonstrate that perturbing essential adult MK genes causes an analogous phenotype in the early embryo before the onset of hematopoietic stem/progenitor cell-driven (definitive) hematopoiesis. Finally, we present the surprising finding that DPFC and MK commitment from their respective precursors is MPL independent in vivo but that completion of MK differentiation and establishment of the prenatal platelet mass is dependent on MPL expression.


Subject(s)
Blood Platelets/metabolism , Gene Expression Regulation, Developmental , Megakaryocytes/metabolism , Receptors, Thrombopoietin/genetics , Thrombopoiesis/genetics , Yolk Sac/metabolism , Animals , Blood Platelets/cytology , Cell Differentiation , Cell Lineage/genetics , Embryonic Development/genetics , GATA1 Transcription Factor/deficiency , GATA1 Transcription Factor/genetics , Gene Deletion , Hematopoietic Stem Cells/cytology , Hematopoietic Stem Cells/metabolism , Megakaryocytes/cytology , Mice , Mice, Knockout , NF-E2 Transcription Factor, p45 Subunit/deficiency , NF-E2 Transcription Factor, p45 Subunit/genetics , Receptors, Thrombopoietin/metabolism , Transcription, Genetic , Yolk Sac/cytology , Yolk Sac/growth & development
6.
Proc Natl Acad Sci U S A ; 112(7): 2127-32, 2015 Feb 17.
Article in English | MEDLINE | ID: mdl-25646475

ABSTRACT

In a functional genomics screen of mouse embryonic stem cells (ESCs) with nested hemizygous chromosomal deletions, we reveal that ribosomal protein (RP) genes are the most significant haploinsufficient determinants for embryoid body (EB) formation. Hemizygocity for three RP genes (Rps5, Rps14, or Rps28), distinguished by the proximity of their corresponding protein to the ribosome's mRNA exit site, is associated with the most profound phenotype. This EB phenotype was fully rescued by BAC or cDNA complementation but not by the reduction of p53 levels, although such reduction was effective with most other RP-deleted clones corresponding to non-mRNA exit-site proteins. RNA-sequencing studies further revealed that undifferentiated ESCs hemizygous for Rps5 showed reduced expression levels of several mesoderm-specific genes as compared with wild-type counterparts. Together, these results reveal that RP gene dosage limits the differentiation, not the self-renewal, of mouse ESCs. They also highlight two separate mechanisms underlying this process, one of which is p53 independent.


Subject(s)
Cell Lineage , Embryonic Stem Cells/cytology , Haploinsufficiency , Ribosomal Proteins/genetics , Animals , Cell Line , Humans , Male , Mice , Tumor Suppressor Protein p53/genetics
7.
Blood ; 124(17): 2725-9, 2014 Oct 23.
Article in English | MEDLINE | ID: mdl-25079356

ABSTRACT

In this study, we test the assumption that the hematopoietic progenitor/colony-forming cells of the embryonic yolk sac (YS), which are endowed with megakaryocytic potential, differentiate into the first platelet-forming cells in vivo. We demonstrate that from embryonic day (E) 8.5 all megakaryocyte (MK) colony-forming cells belong to the conventional hematopoietic progenitor cell (HPC) compartment. Although these cells are indeed capable of generating polyploid MKs, they are not the source of the first platelet-forming cells. We show that proplatelet formation first occurs in a unique and previously unrecognized lineage of diploid platelet-forming cells, which develop within the YS in parallel to HPCs but can be specified in the E8.5 Runx1-null embryo despite the absence of the progenitor cell lineage.


Subject(s)
Cell Lineage/genetics , Diploidy , Embryo, Mammalian/metabolism , Megakaryocyte Progenitor Cells/metabolism , Megakaryocytes/metabolism , Polyploidy , Animals , Core Binding Factor Alpha 2 Subunit/genetics , Core Binding Factor Alpha 2 Subunit/metabolism , Embryo, Mammalian/cytology , Embryo, Mammalian/embryology , Gene Expression Regulation, Developmental , Megakaryocyte Progenitor Cells/cytology , Megakaryocytes/cytology , Mice, Inbred C57BL , Mice, Knockout , Mice, Transgenic , Microscopy, Confocal , Oligonucleotide Array Sequence Analysis , Time Factors , Transcriptome , Yolk Sac/cytology , Yolk Sac/embryology , Yolk Sac/metabolism
8.
PLoS One ; 8(9): e72884, 2013.
Article in English | MEDLINE | ID: mdl-24069164

ABSTRACT

Accurate quantification of gene expression by qRT-PCR relies on normalization against a consistently expressed control gene. However, control genes in common use often vary greatly between samples, especially in cancer. The advent of Next Generation Sequencing technology offers the possibility to better select control genes with the least cell to cell variability in steady state transcript levels. Here we analyze the transcriptomes of 55 leukemia samples to identify the most consistent genes. This list is enriched for components of the proteasome (ex. PSMA1) and spliceosome (ex. SF3B2), and also includes the translation initiation factor EIF4H, and many heterogeneous nuclear ribonucleoprotein genes (ex. HNRNPL). We have validated the consistency of our new control genes in 1933 cancer and normal tissues using publically available RNA-seq data, and their usefulness in qRT-PCR analysis is clearly demonstrated.


Subject(s)
Proteasome Endopeptidase Complex/metabolism , Spliceosomes/metabolism , Cell Line, Tumor , Cell Survival/genetics , Cell Survival/physiology , Eukaryotic Initiation Factors/genetics , Eukaryotic Initiation Factors/metabolism , Gene Expression Regulation, Neoplastic/genetics , Gene Expression Regulation, Neoplastic/physiology , High-Throughput Nucleotide Sequencing , Humans , Leukemia/genetics , Leukemia/metabolism , Proteasome Endopeptidase Complex/genetics , Spliceosomes/genetics
9.
Traffic ; 14(5): 532-50, 2013 May.
Article in English | MEDLINE | ID: mdl-23387285

ABSTRACT

Plasmodium falciparum exports several hundred effector proteins that remodel the host erythrocyte and enable parasites to acquire nutrients, sequester in the circulation and evade immune responses. The majority of exported proteins contain the Plasmodium export element (PEXEL; RxLxE/Q/D) in their N-terminus, which is proteolytically cleaved in the parasite endoplasmic reticulum by Plasmepsin V, and is necessary for export. Several exported proteins lack a PEXEL or contain noncanonical motifs. Here, we assessed whether Plasmepsin V could process the N-termini of diverse protein families in P. falciparum. We show that Plasmepsin V cleaves N-terminal sequences from RIFIN, STEVOR and RESA multigene families, the latter of which contain a relaxed PEXEL (RxLxxE). However, Plasmepsin V does not cleave the N-terminal sequence of the major exported virulence factor erythrocyte membrane protein 1 (PfEMP1) or the PEXEL-negative exported proteins SBP-1 or REX-2. We probed the substrate specificity of Plasmepsin V and determined that lysine at the PEXEL P3 position, which is present in PfEMP1 and other putatively exported proteins, blocks Plasmepsin V activity. Furthermore, isoleucine at position P1 also blocked Plasmepsin V activity. The specificity of Plasmepsin V is therefore exquisitely confined and we have used this novel information to redefine the predicted P. falciparum PEXEL exportome.


Subject(s)
Aspartic Acid Endopeptidases/metabolism , Erythrocytes/parasitology , Membrane Proteins/metabolism , Plasmodium falciparum/metabolism , Protozoan Proteins/metabolism , Amino Acid Motifs , Antigens, Protozoan/metabolism , Carrier Proteins/metabolism , Chromatography, High Pressure Liquid , Computational Biology , Endoplasmic Reticulum/metabolism , Erythrocytes/cytology , Humans , Protein Structure, Tertiary , Software , Subcellular Fractions , Virulence Factors/metabolism
10.
Nature ; 455(7214): 757-63, 2008 Oct 09.
Article in English | MEDLINE | ID: mdl-18843361

ABSTRACT

The human malaria parasite Plasmodium vivax is responsible for 25-40% of the approximately 515 million annual cases of malaria worldwide. Although seldom fatal, the parasite elicits severe and incapacitating clinical symptoms and often causes relapses months after a primary infection has cleared. Despite its importance as a major human pathogen, P. vivax is little studied because it cannot be propagated continuously in the laboratory except in non-human primates. We sequenced the genome of P. vivax to shed light on its distinctive biological features, and as a means to drive development of new drugs and vaccines. Here we describe the synteny and isochore structure of P. vivax chromosomes, and show that the parasite resembles other malaria parasites in gene content and metabolic potential, but possesses novel gene families and potential alternative invasion pathways not recognized previously. Completion of the P. vivax genome provides the scientific community with a valuable resource that can be used to advance investigation into this neglected species.


Subject(s)
Genome, Protozoan/genetics , Genomics , Malaria, Vivax/parasitology , Plasmodium vivax/genetics , Amino Acid Motifs , Animals , Artemisinins/metabolism , Artemisinins/pharmacology , Atovaquone/metabolism , Atovaquone/pharmacology , Cell Nucleus/genetics , Chromosomes/genetics , Conserved Sequence/genetics , Erythrocytes/parasitology , Evolution, Molecular , Haplorhini/parasitology , Humans , Isochores/genetics , Ligands , Malaria, Vivax/metabolism , Multigene Family , Plasmodium vivax/drug effects , Plasmodium vivax/pathogenicity , Plasmodium vivax/physiology , Sequence Analysis, DNA , Species Specificity , Synteny/genetics
11.
Infect Immun ; 75(12): 5565-74, 2007 Dec.
Article in English | MEDLINE | ID: mdl-17893128

ABSTRACT

Serine repeat antigens (SERAs) are a family of secreted "cysteine-like" proteases of Plasmodium parasites. Several SERAs possess an atypical active-site serine residue in place of the canonical cysteine. The human malaria parasite Plasmodium falciparum possesses six "serine-type" (SERA1 to SERA5 and SERA9) and three "cysteine-type" (SERA6 to SERA8) SERAs. Here, we investigate the importance of the serine-type SERAs to blood-stage parasite development and examine the extent of functional redundancy among this group. We attempted to knock out the four P. falciparum serine-type SERA genes that have not been disrupted previously. SERA1, SERA4, and SERA9 knockout lines were generated, while only SERA5, the most strongly expressed member of the SERA family, remained refractory to genetic deletion. Interestingly, we discovered that while SERA4-null parasites completed the blood-stage cycle normally, they exhibited a twofold increase in the level of SERA5 mRNA. The inability to disrupt SERA5 and the apparent compensatory increase in SERA5 expression in response to the deletion of SERA4 provides evidence for an important blood-stage function for the serine-type SERAs and supports the notion of functional redundancy among this group. Such redundancy is consistent with our phylogenetic analysis, which reveals a monophyletic grouping of the serine-type SERAs across the genus Plasmodium and a predominance of postspeciation expansion. While SERA5 is to some extent further validated as a target for vaccine and drug development, our data suggest that the expression level of other serine-type SERAs is the only barrier to escape from anti-SERA5-specific interventions.


Subject(s)
Antigens, Protozoan/immunology , Malaria Vaccines/immunology , Plasmodium falciparum/immunology , Serine Endopeptidases/immunology , Animals , Antigens, Protozoan/genetics , Antimalarials/chemistry , Antimalarials/pharmacology , Apicomplexa/genetics , Crossing Over, Genetic , DNA, Recombinant/genetics , Drug Design , Erythrocytes/parasitology , Gene Deletion , Humans , Malaria Vaccines/genetics , Malaria, Falciparum/drug therapy , Malaria, Falciparum/prevention & control , Phylogeny , Plasmodium falciparum/enzymology , Plasmodium falciparum/genetics , Reverse Transcriptase Polymerase Chain Reaction/methods , Serine Endopeptidases/metabolism , Serine Proteinase Inhibitors/metabolism , Transfection , Up-Regulation
12.
Mol Cell Proteomics ; 5(7): 1286-99, 2006 Jul.
Article in English | MEDLINE | ID: mdl-16603573

ABSTRACT

Most proteins that coat the surface of the extracellular forms of the human malaria parasite Plasmodium falciparum are attached to the plasma membrane via glycosylphosphatidylinositol (GPI) anchors. These proteins are exposed to neutralizing antibodies, and several are advanced vaccine candidates. To identify the GPI-anchored proteome of P. falciparum we used a combination of proteomic and computational approaches. Focusing on the clinically relevant blood stage of the life cycle, proteomic analysis of proteins labeled with radioactive glucosamine identified GPI anchoring on 11 proteins (merozoite surface protein (MSP)-1, -2, -4, -5, -10, rhoptry-associated membrane antigen, apical sushi protein, Pf92, Pf38, Pf12, and Pf34). These proteins represent approximately 94% of the GPI-anchored schizont/merozoite proteome and constitute by far the largest validated set of GPI-anchored proteins in this organism. Moreover MSP-1 and MSP-2 were present in similar copy number, and we estimated that together these proteins comprise approximately two-thirds of the total membrane-associated surface coat. This is the first time the stoichiometry of MSPs has been examined. We observed that available software performed poorly in predicting GPI anchoring on P. falciparum proteins where such modification had been validated by proteomics. Therefore, we developed a hidden Markov model (GPI-HMM) trained on P. falciparum sequences and used this to rank all proteins encoded in the completed P. falciparum genome according to their likelihood of being GPI-anchored. GPI-HMM predicted GPI modification on all validated proteins, on several known membrane proteins, and on a number of novel, presumably surface, proteins expressed in the blood, insect, and/or pre-erythrocytic stages of the life cycle. Together this work identified 11 and predicted a further 19 GPI-anchored proteins in P. falciparum.


Subject(s)
Glycosylphosphatidylinositols/metabolism , Membrane Proteins/chemistry , Membrane Proteins/metabolism , Plasmodium falciparum/chemistry , Plasmodium falciparum/metabolism , Animals , Antigens, Surface/analysis , Blood-Borne Pathogens , Cluster Analysis , Databases, Protein , Gene Expression Profiling , Humans , Life Cycle Stages , Markov Chains , Membrane Proteins/isolation & purification , Protein Array Analysis , Protein Structure, Secondary , Protein Structure, Tertiary , Proteome/analysis , Reproducibility of Results
13.
Genome Biol ; 7(2): R12, 2006.
Article in English | MEDLINE | ID: mdl-16507167

ABSTRACT

BACKGROUND: The apicomplexan parasite Plasmodium falciparum causes the most severe form of malaria in humans. After invasion into erythrocytes, asexual parasite stages drastically alter their host cell and export remodeling and virulence proteins. Previously, we have reported identification and functional analysis of a short motif necessary for export of proteins out of the parasite and into the red blood cell. RESULTS: We have developed software for the prediction of exported proteins in the genus Plasmodium, and identified exported proteins conserved between malaria parasites infecting rodents and the two major causes of human malaria, P. falciparum and P. vivax. This conserved 'exportome' is confined to a few subtelomeric chromosomal regions in P. falciparum and the synteny of these and surrounding regions is conserved in P. vivax. We have identified a novel gene family PHIST (for Plasmodium helical interspersed subtelomeric family) that shares a unique domain with 72 paralogs in P. falciparum and 39 in P. vivax; however, there is only one member in each of the three species studied from the P. berghei lineage. CONCLUSION: These data suggest radiation of genes encoding remodeling and virulence factors from a small number of loci in a common Plasmodium ancestor, and imply a closer phylogenetic relationship between the P. vivax and P. falciparum lineages than previously believed. The presence of a conserved 'exportome' in the genus Plasmodium has important implications for our understanding of both common mechanisms and species-specific differences in host-parasite interactions, and may be crucial in developing novel antimalarial drugs to this infectious disease.


Subject(s)
Erythrocytes/parasitology , Plasmodium falciparum/genetics , Protozoan Proteins/blood , Protozoan Proteins/genetics , Amino Acid Sequence , Animals , Conserved Sequence , Humans , Malaria, Falciparum/blood , Multigene Family , Open Reading Frames , Software
14.
PLoS Biol ; 4(3): e46, 2006 Mar.
Article in English | MEDLINE | ID: mdl-16435885

ABSTRACT

The first sequenced marsupial genome promises to reveal unparalleled insights into mammalian evolution. We have used the Monodelphis domestica (gray short-tailed opossum) sequence to construct the first map of a marsupial major histocompatibility complex (MHC). The MHC is the most gene-dense region of the mammalian genome and is critical to immunity and reproductive success. The marsupial MHC bridges the phylogenetic gap between the complex MHC of eutherian mammals and the minimal essential MHC of birds. Here we show that the opossum MHC is gene dense and complex, as in humans, but shares more organizational features with non-mammals. The Class I genes have amplified within the Class II region, resulting in a unique Class I/II region. We present a model of the organization of the MHC in ancestral mammals and its elaboration during mammalian evolution. The opossum genome, together with other extant genomes, reveals the existence of an ancestral "immune supercomplex" that contained genes of both types of natural killer receptors together with antigen processing genes and MHC genes.


Subject(s)
Evolution, Molecular , Major Histocompatibility Complex/genetics , Major Histocompatibility Complex/immunology , Marsupialia/genetics , Marsupialia/immunology , Animals , Chromosomes/genetics , Gene Expression/genetics , Humans , In Situ Hybridization , Metaphase , Models, Genetic , Molecular Sequence Data , Phylogeny , RNA, Messenger/genetics
15.
J Bacteriol ; 186(20): 6792-9, 2004 Oct.
Article in English | MEDLINE | ID: mdl-15466031

ABSTRACT

Several species of mycobacteria express abundant glycopeptidolipids (GPLs) on the surfaces of their cells. The GPLs are glycolipids that contain modified sugars including acetylated 6-deoxy-talose and methylated rhamnose. Four methyltransferases have been implicated in the synthesis of the GPLs of Mycobacterium smegmatis and Mycobacterium avium. A rhamnosyl 3-O-methytransferase and a fatty acid methyltransferase of M. smegmatis have been previously characterized. In this paper, we characterize the methyltransferases that are responsible for modifying the hydroxyl groups at positions 2 and 4 of rhamnose and propose the biosynthetic sequence of GPL trimethylrhamnose formation. The analysis of M. avium genes through the creation of specific mutants is technically difficult; therefore, an alternative approach to determine the function of putative methyltransferases of M. avium was undertaken. Complementation of M. smegmatis methyltransferase mutants with M. avium genes revealed that MtfC and MtfB of the latter species have 4-O-methyltransferase activity and that MtfD is a 3-O-methyltransferase which can modify rhamnose of GPLs in M. smegmatis.


Subject(s)
Glycolipids/metabolism , Glycopeptides/metabolism , Methyltransferases/metabolism , Mycobacterium avium/enzymology , Mycobacterium smegmatis/enzymology , Rhamnose/metabolism , Amino Acid Sequence , Genetic Complementation Test , Methylation , Methyltransferases/chemistry , Methyltransferases/genetics , Molecular Sequence Data , Mutation , Mycobacterium avium/genetics , Mycobacterium avium/metabolism , Mycobacterium smegmatis/genetics , Mycobacterium smegmatis/metabolism , Sequence Analysis, DNA
16.
Mol Biol Evol ; 21(11): 2161-71, 2004 Nov.
Article in English | MEDLINE | ID: mdl-15306658

ABSTRACT

Plasmodium falciparum is the parasite responsible for the most acute form of malaria in humans. Recently, the serine repeat antigen (SERA) in P. falciparum has attracted attention as a potential vaccine and drug target, and it has been shown to be a member of a large gene family. To clarify the relationships among the numerous P. falciparum SERAs and to identify orthologs to SERA5 and SERA6 in Plasmodium species affecting rodents, gene trees were inferred from nucleotide and amino acid sequence data for 33 putative SERA homologs in seven different species. (A distance method for nucleotide sequences that is specifically designed to accommodate differing GC content yielded results that were largely compatible with the amino acid tree. Standard-distance and maximum-likelihood methods for nucleotide sequences, on the other hand, yielded gene trees that differed in important respects.) To infer the pattern of duplication, speciation, and gene loss events in the SERA gene family history, the resulting gene trees were then "reconciled" with two competing Plasmodium species tree topologies that have been identified by previous phylogenetic studies. Parsimony of reconciliation was used as a criterion for selecting a gene tree/species tree pair and provided (1) support for one of the two species trees and for the core topology of the amino acid-derived gene tree, (2) a basis for critiquing fine detail in a poorly resolved region of the gene tree, (3) a set of predicted "missing genes" in some species, (4) clarification of the relationship among the P. falciparum SERA, and (5) some information about SERA5 and SERA6 orthologs in the rodent malaria parasites. Parsimony of reconciliation and a second criterion--implied mutational pattern at two key active sites in the SERA proteins-were also seen to be useful supplements to standard "bootstrap" analysis for inferred topologies.


Subject(s)
Plasmodium falciparum/genetics , Plasmodium/genetics , Serine/chemistry , Algorithms , Animals , Base Composition , Base Sequence , Cytidine/genetics , Databases, Genetic , Guanosine/genetics , Likelihood Functions , Models, Genetic , Models, Theoretical , Phylogeny , Species Specificity
SELECTION OF CITATIONS
SEARCH DETAIL
...