RESUMO
Medulloblastoma (MB) comprises a group of heterogeneous paediatric embryonal neoplasms of the hindbrain with strong links to early development of the hindbrain1-4. Mutations that activate Sonic hedgehog signalling lead to Sonic hedgehog MB in the upper rhombic lip (RL) granule cell lineage5-8. By contrast, mutations that activate WNT signalling lead to WNT MB in the lower RL9,10. However, little is known about the more commonly occurring group 4 (G4) MB, which is thought to arise in the unipolar brush cell lineage3,4. Here we demonstrate that somatic mutations that cause G4 MB converge on the core binding factor alpha (CBFA) complex and mutually exclusive alterations that affect CBFA2T2, CBFA2T3, PRDM6, UTX and OTX2. CBFA2T2 is expressed early in the progenitor cells of the cerebellar RL subventricular zone in Homo sapiens, and G4 MB transcriptionally resembles these progenitors but are stalled in developmental time. Knockdown of OTX2 in model systems relieves this differentiation blockade, which allows MB cells to spontaneously proceed along normal developmental differentiation trajectories. The specific nature of the split human RL, which is destined to generate most of the neurons in the human brain, and its high level of susceptible EOMES+KI67+ unipolar brush cell progenitor cells probably predisposes our species to the development of G4 MB.
Assuntos
Diferenciação Celular , Neoplasias Cerebelares , Meduloblastoma , Metencéfalo , Diferenciação Celular/genética , Linhagem da Célula , Neoplasias Cerebelares/classificação , Neoplasias Cerebelares/genética , Neoplasias Cerebelares/patologia , Cerebelo/embriologia , Cerebelo/patologia , Subunidades alfa de Fatores de Ligação ao Core/genética , Proteínas Hedgehog/metabolismo , Histona Desmetilases , Humanos , Antígeno Ki-67/metabolismo , Meduloblastoma/classificação , Meduloblastoma/genética , Meduloblastoma/patologia , Metencéfalo/embriologia , Metencéfalo/patologia , Proteínas Musculares , Mutação , Fatores de Transcrição Otx/deficiência , Fatores de Transcrição Otx/genética , Proteínas Repressoras , Proteínas com Domínio T/metabolismo , Fatores de TranscriçãoRESUMO
CReSCENT: CanceR Single Cell ExpressioN Toolkit (https://crescent.cloud), is an intuitive and scalable web portal incorporating a containerized pipeline execution engine for standardized analysis of single-cell RNA sequencing (scRNA-seq) data. While scRNA-seq data for tumour specimens are readily generated, subsequent analysis requires high-performance computing infrastructure and user expertise to build analysis pipelines and tailor interpretation for cancer biology. CReSCENT uses public data sets and preconfigured pipelines that are accessible to computational biology non-experts and are user-editable to allow optimization, comparison, and reanalysis for specific experiments. Users can also upload their own scRNA-seq data for analysis and results can be kept private or shared with other users.
Assuntos
Neoplasias/genética , RNA-Seq/métodos , Análise de Célula Única/métodos , Software , Humanos , Neoplasias/imunologia , Linfócitos T/metabolismoRESUMO
Condition-dependent genetic interactions can reveal functional relationships between genes that are not evident under standard culture conditions. State-of-the-art yeast genetic interaction mapping, which relies on robotic manipulation of arrays of double-mutant strains, does not scale readily to multi-condition studies. Here, we describe barcode fusion genetics to map genetic interactions (BFG-GI), by which double-mutant strains generated via en masse "party" mating can also be monitored en masse for growth to detect genetic interactions. By using site-specific recombination to fuse two DNA barcodes, each representing a specific gene deletion, BFG-GI enables multiplexed quantitative tracking of double mutants via next-generation sequencing. We applied BFG-GI to a matrix of DNA repair genes under nine different conditions, including methyl methanesulfonate (MMS), 4-nitroquinoline 1-oxide (4NQO), bleomycin, zeocin, and three other DNA-damaging environments. BFG-GI recapitulated known genetic interactions and yielded new condition-dependent genetic interactions. We validated and further explored a subnetwork of condition-dependent genetic interactions involving MAG1, SLX4, and genes encoding the Shu complex, and inferred that loss of the Shu complex leads to an increase in the activation of the checkpoint protein kinase Rad53.
Assuntos
Mapeamento Cromossômico , Código de Barras de DNA Taxonômico , Dano ao DNA , Proteínas de Saccharomyces cerevisiae/genética , Saccharomyces cerevisiae/genética , Reparo do DNA , Epistasia Genética , Deleção de Genes , Loci Gênicos , Sequenciamento de Nucleotídeos em Larga Escala , Metanossulfonato de Metila , Modelos Teóricos , Regiões Promotoras Genéticas , Reprodutibilidade dos TestesRESUMO
While phosphotyrosine modification is an established regulatory mechanism in eukaryotes, it is less well characterized in bacteria due to low prevalence. To gain insight into the extent and biological importance of tyrosine phosphorylation in Escherichia coli, we used immunoaffinity-based phosphotyrosine peptide enrichment combined with high resolution mass spectrometry analysis to comprehensively identify tyrosine phosphorylated proteins and accurately map phosphotyrosine sites. We identified a total of 512 unique phosphotyrosine sites on 342 proteins in E. coli K12 and the human pathogen enterohemorrhagic E. coli (EHEC) O157:H7, representing the largest phosphotyrosine proteome reported to date in bacteria. This large number of tyrosine phosphorylation sites allowed us to define five phosphotyrosine site motifs. Tyrosine phosphorylated proteins belong to various functional classes such as metabolism, gene expression and virulence. We demonstrate for the first time that proteins of a type III secretion system (T3SS), required for the attaching and effacing (A/E) lesion phenotype characteristic for intestinal colonization by certain EHEC strains, are tyrosine phosphorylated by bacterial kinases. Yet, A/E lesion and metabolic phenotypes were unaffected by the mutation of the two currently known tyrosine kinases, Etk and Wzc. Substantial residual tyrosine phosphorylation present in an etk wzc double mutant strongly indicated the presence of hitherto unknown tyrosine kinases in E. coli. We assess the functional importance of tyrosine phosphorylation and demonstrate that the phosphorylated tyrosine residue of the regulator SspA positively affects expression and secretion of T3SS proteins and formation of A/E lesions. Altogether, our study reveals that tyrosine phosphorylation in bacteria is more prevalent than previously recognized, and suggests the involvement of phosphotyrosine-mediated signaling in a broad range of cellular functions and virulence.
Assuntos
Escherichia coli Enteropatogênica/metabolismo , Escherichia coli K12/metabolismo , Proteínas de Escherichia coli/metabolismo , Proteínas de Membrana/metabolismo , Fosfotirosina/metabolismo , Proteínas Tirosina Quinases/metabolismo , Proteoma/metabolismo , Escherichia coli Enteropatogênica/genética , Escherichia coli K12/genética , Proteínas de Escherichia coli/genética , Proteínas de Membrana/genética , Fosfotirosina/genética , Proteínas Tirosina Quinases/genética , Proteoma/genética , Transdução de Sinais/fisiologiaRESUMO
As the interface between a microbe and its environment, the bacterial cell envelope has broad biological and clinical significance. While numerous biosynthesis genes and pathways have been identified and studied in isolation, how these intersect functionally to ensure envelope integrity during adaptive responses to environmental challenge remains unclear. To this end, we performed high-density synthetic genetic screens to generate quantitative functional association maps encompassing virtually the entire cell envelope biosynthetic machinery of Escherichia coli under both auxotrophic (rich medium) and prototrophic (minimal medium) culture conditions. The differential patterns of genetic interactions detected among > 235,000 digenic mutant combinations tested reveal unexpected condition-specific functional crosstalk and genetic backup mechanisms that ensure stress-resistant envelope assembly and maintenance. These networks also provide insights into the global systems connectivity and dynamic functional reorganization of a universal bacterial structure that is both broadly conserved among eubacteria (including pathogens) and an important target.
Assuntos
Membrana Celular/genética , Epistasia Genética/genética , Escherichia coli/genética , Escherichia coli/metabolismo , Proteínas de Membrana/genética , Proteínas Associadas aos Microtúbulos/genética , Meios de Cultura , Resistência a Medicamentos/genética , Escherichia coli/crescimento & desenvolvimento , Regulação Bacteriana da Expressão Gênica , Interação Gene-Ambiente , Proteínas de Membrana/metabolismo , Redes e Vias Metabólicas/genética , Microscopia Eletrônica , Proteínas Associadas aos Microtúbulos/metabolismo , Anotação de Sequência Molecular , Análise de Sequência com Séries de OligonucleotídeosRESUMO
The role of the immune microenvironment in maintaining disease remission in patients with multiple myeloma (MM) is not well understood. In this study, we comprehensively profile the immune system in patients with newly diagnosed MM receiving continuous lenalidomide maintenance therapy with the aim of discovering correlates of long-term treatment response. Leveraging single-cell RNA sequencing and T cell receptor ß sequencing of the peripheral blood and CyTOF mass cytometry of the bone marrow, we longitudinally characterize the immune landscape in 23 patients before and one year after lenalidomide exposure. We compare patients achieving sustained minimal residual disease (MRD) negativity to patients who never achieved or were unable to maintain MRD negativity. We observe that the composition of the immune microenvironment in both the blood and the marrow varied substantially according to both MRD negative status and history of autologous stem cell transplant, supporting the hypothesis that the immune microenvironment influences the depth and duration of treatment response.
Assuntos
Mieloma Múltiplo , Humanos , Mieloma Múltiplo/tratamento farmacológico , Lenalidomida , Imunofenotipagem , Pacientes , Receptores de Antígenos de Linfócitos T alfa-beta , Microambiente TumoralRESUMO
One-third of the 4,225 protein-coding genes of Escherichia coli K-12 remain functionally unannotated (orphans). Many map to distant clades such as Archaea, suggesting involvement in basic prokaryotic traits, whereas others appear restricted to E. coli, including pathogenic strains. To elucidate the orphans' biological roles, we performed an extensive proteomic survey using affinity-tagged E. coli strains and generated comprehensive genomic context inferences to derive a high-confidence compendium for virtually the entire proteome consisting of 5,993 putative physical interactions and 74,776 putative functional associations, most of which are novel. Clustering of the respective probabilistic networks revealed putative orphan membership in discrete multiprotein complexes and functional modules together with annotated gene products, whereas a machine-learning strategy based on network integration implicated the orphans in specific biological processes. We provide additional experimental evidence supporting orphan participation in protein synthesis, amino acid metabolism, biofilm formation, motility, and assembly of the bacterial cell envelope. This resource provides a "systems-wide" functional blueprint of a model microbe, with insights into the biological and evolutionary significance of previously uncharacterized proteins.
Assuntos
Proteínas de Escherichia coli/genética , Escherichia coli/genética , Genoma Bacteriano , Proteoma/genética , Escherichia coli/metabolismo , Proteínas de Escherichia coli/metabolismo , Complexos Multiproteicos/genética , Mapeamento de Interação de Proteínas/métodosRESUMO
As sequencing technologies increase in power, determining the functions of unknown proteins encoded by the DNA sequences so produced becomes a major challenge. Functional annotation is commonly done on the basis of amino-acid sequence similarity alone. Long after sequence similarity becomes undetectable by pair-wise comparison, profile-based identification of homologs can often succeed due to the conservation of position-specific patterns, important for a protein's three dimensional folding and function. Nevertheless, prediction of protein function from homology-driven approaches is not without problems. Homologous proteins might evolve different functions and the power of homology detection has already started to reach its maximum. Computational methods for inferring protein function, which exploit the context of a protein in cellular networks, have come to be built on top of homology-based approaches. These network-based functional inference techniques provide both a first hand hint into a proteins' functional role and offer complementary insights to traditional methods for understanding the function of uncharacterized proteins. Most recent network-based approaches aim to integrate diverse kinds of functional interactions to boost both coverage and confidence level. These techniques not only promise to solve the moonlighting aspect of proteins by annotating proteins with multiple functions, but also increase our understanding on the interplay between different functional classes in a cell. In this article we review the state of the art in network-based function prediction and describe some of the underlying difficulties and successes. Given the volume of high-throughput data that is being reported the time is ripe to employ these network-based approaches, which can be used to unravel the functions of the uncharacterized proteins accumulating in the genomic databases.
Assuntos
Mapeamento Cromossômico/métodos , Enzimas/fisiologia , Metaboloma/fisiologia , Modelos Biológicos , Proteoma/fisiologia , Transdução de Sinais/fisiologia , Animais , Simulação por Computador , Humanos , Mapeamento de Interação de Proteínas/métodosRESUMO
Single-cell RNA-sequencing (scRNA-seq) has revolutionized molecular biology and medicine by enabling high-throughput studies of cellular heterogeneity in diverse tissues. Applying network biology approaches to scRNA-seq data can provide useful insights into genes driving heterogeneous cell-type compositions of tissues. Here, we present scNetViz- a Cytoscape app to aid biological interpretation of cell clusters in scRNA-seq data using network analysis. scNetViz calculates the differential expression of each gene across clusters and then creates a cluster-specific gene functional interaction network between the significantly differentially expressed genes for further analysis, such as pathway enrichment analysis. To automate a complete data analysis workflow, scNetViz integrates parts of the Scanpy software, which is a popular Python package for scRNA-seq data analysis, with Cytoscape apps such as stringApp, cyPlot, and enhancedGraphics. We describe our implementation of methods for accessing data from public single cell atlas projects, differential expression analysis, visualization, and automation. scNetViz enables users to analyze data from public atlases or their own experiments, which we illustrate with two use cases. Analysis can be performed via the Cytoscape GUI or CyREST programming interface using R (RCy3) or Python (py4cytoscape).
Assuntos
Redes Reguladoras de Genes , Software , Automação , Análise de Dados , Fluxo de TrabalhoRESUMO
BACKGROUND: Sitravatinib, a tyrosine kinase inhibitor that targets TYRO3, AXL, MERTK and the VEGF receptor family, is predicted to increase the M1 to M2-polarized tumor-associated macrophages ratio in the tumor microenvironment and have synergistic antitumor activity in combination with anti-programmed death-1/ligand-1 agents. SNOW is a window-of-opportunity study designed to evaluate the immune and molecular effects of preoperative sitravatinib and nivolumab in patients with oral cavity squamous cell carcinoma. METHODS: Patients with newly-diagnosed untreated T2-4a, N0-2 or T1 >1 cm-N2 oral cavity carcinomas were eligible. All patients received sitravatinib 120 mg daily from day 1 up to 48 hours pre-surgery and one dose of nivolumab 240 mg on day 15. Surgery was planned between day 23 and 30. Standard of care adjuvant radiotherapy was given based on clinical stage. Tumor photographs, fresh tumor biopsies and blood samples were collected at baseline, at day 15 after sitravatinib alone, and at surgery after sitravatinib-nivolumab combination. Tumor flow cytometry, multiplex immunofluorescence staining and single-cell RNA sequencing (scRNAseq) were performed on tumor biopsies to study changes in immune-cell populations. Tumor whole-exome sequencing and circulating tumor DNA and cell-free DNA were evaluated at each time point. RESULTS: Ten patients were included. Grade 3 toxicity occurred in one patient (hypertension); one patient required sitravatinib dose reduction, and one patient required discontinuation and surgery delay due to G2 thrombocytopenia. Nine patients had clinical-to-pathological downstaging, with one complete response. Independent pathological treatment response (PTR) assessment confirmed a complete PTR and two major PTRs. With a median follow-up of 21 months, all patients are alive with no recurrence. Circulating tumor DNA and cell-free DNA dynamics correlated with clinical and pathological response and distinguished two patient groups with different tumor biological behavior after sitravatinib alone (1A) versus sitravatinib-nivolumab (1B). Tumor immunophenotyping and scRNAseq analyses revealed differential changes in the expression of immune cell populations and sitravatinib-targeted and hypoxia-related genes in group 1A vs 1B patients. CONCLUSIONS: The SNOW study shows sitravatinib plus nivolumab is safe and leads to deep clinical and pathological responses in oral cavity carcinomas. Multi-omic biomarker analyses dissect the differential molecular effects of sitravatinib versus the sitravatinib-nivolumab and revealed patients with distinct tumor biology behavior. TRIAL REGISTRATION NUMBER: NCT03575598.
Assuntos
Anilidas/uso terapêutico , Protocolos de Quimioterapia Combinada Antineoplásica/uso terapêutico , Neoplasias Bucais/tratamento farmacológico , Nivolumabe/uso terapêutico , Piridinas/uso terapêutico , Idoso , Anilidas/farmacologia , Protocolos de Quimioterapia Combinada Antineoplásica/farmacologia , Feminino , Humanos , Masculino , Pessoa de Meia-Idade , Nivolumabe/farmacologia , Período Pré-Operatório , Piridinas/farmacologiaRESUMO
The NifA-RpoN complex is a master regulator of the nitrogen fixation genes in alphaproteobacteria. Based on the complete Rhizobium etli genome sequence, we constructed an R. etli CFN42 oligonucleotide (70-mer) microarray and utilized this tool, reverse transcription (RT)-PCR analysis (transcriptomics), proteomics, and bioinformatics to decipher the NifA-RpoN regulon under microaerobic conditions (free life) and in symbiosis with bean plants. The R. etli NifA-RpoN regulon was determined to contain 78 genes, including the genes involved in nitrogen fixation, and the analyses revealed 42 new NifA-RpoN-dependent genes. More importantly, this study demonstrated that the NifA-RpoN regulon is composed of genes and proteins that have very diverse functions, that play fundamental and previously less appreciated roles in regulating the normal physiology of the cell, and that have important functions in providing adequate conditions for efficient nitrogen fixation in symbiosis. The R. etli NifA-RpoN regulon defined here has some components in common with other NifA-RpoN regulons described previously, but the vast majority of the components have been found only in the R. etli regulon, suggesting that they have a specific role in this bacterium and particular requirements during nitrogen fixation compared with other symbiotic bacterial models.
Assuntos
Proteínas de Bactérias , Regulação Bacteriana da Expressão Gênica , Phaseolus/microbiologia , RNA Polimerase Sigma 54 , Regulon , Rhizobium etli , Simbiose , Fatores de Transcrição , Proteínas de Bactérias/genética , Proteínas de Bactérias/metabolismo , Biologia Computacional , Perfilação da Expressão Gênica , Dados de Sequência Molecular , Mutação , Fixação de Nitrogênio/genética , Análise de Sequência com Séries de Oligonucleotídeos , Proteômica , RNA Polimerase Sigma 54/genética , RNA Polimerase Sigma 54/metabolismo , Rhizobium etli/genética , Rhizobium etli/crescimento & desenvolvimento , Rhizobium etli/metabolismo , Rhizobium etli/fisiologia , Fatores de Transcrição/genética , Fatores de Transcrição/metabolismoRESUMO
Background: Identification of cell type subpopulations from complex cell mixtures using single-cell RNA-sequencing (scRNA-seq) data includes automated steps from normalization to cell clustering. However, assigning cell type labels to cell clusters is often conducted manually, resulting in limited documentation, low reproducibility and uncontrolled vocabularies. This is partially due to the scarcity of reference cell type signatures and because some methods support limited cell type signatures. Methods: In this study, we benchmarked five methods representing first-generation enrichment analysis (ORA), second-generation approaches (GSEA and GSVA), machine learning tools (CIBERSORT) and network-based neighbor voting (METANEIGHBOR), for the task of assigning cell type labels to cell clusters from scRNA-seq data. We used five scRNA-seq datasets: human liver, 11 Tabula Muris mouse tissues, two human peripheral blood mononuclear cell datasets, and mouse retinal neurons, for which reference cell type signatures were available. The datasets span Drop-seq, 10X Chromium and Seq-Well technologies and range in size from ~3,700 to ~68,000 cells. Results: Our results show that, in general, all five methods perform well in the task as evaluated by receiver operating characteristic curve analysis (average area under the curve (AUC) = 0.91, sd = 0.06), whereas precision-recall analyses show a wide variation depending on the method and dataset (average AUC = 0.53, sd = 0.24). We observed an influence of the number of genes in cell type signatures on performance, with smaller signatures leading more frequently to incorrect results. Conclusions: GSVA was the overall top performer and was more robust in cell type signature subsampling simulations, although different methods performed well using different datasets. METANEIGHBOR and GSVA were the fastest methods. CIBERSORT and METANEIGHBOR were more influenced than the other methods by analyses including only expected cell types. We provide an extensible framework that can be used to evaluate other methods and datasets at https://github.com/jdime/scRNAseq_cell_cluster_labeling.
Assuntos
Leucócitos Mononucleares , Algoritmos , Animais , Perfilação da Expressão Gênica , Humanos , Camundongos , RNA , Reprodutibilidade dos Testes , Análise de Célula ÚnicaRESUMO
Antibiotic resistance genes might be maintained by selection pressures different from those which are responsible for initially selecting resistant bacteria. This possibility was suggested from a comparison of oral commensal streptococci isolated from healthy people not taking antibiotics. Resistance frequencies were similar for organisms from Mexico and Cuba despite significant differences in antibiotic usage in these two countries. Resistance to > or = 4 drugs was far more common in Mexico, the only detectable trend that can be related to the higher use of antibiotics in Mexico. If resistance is not uniquely maintained by antibiotics, then other environmental factors must also be at work. These need to be identified if a strategy to control antibiotic resistance is to be successful.
Assuntos
Antibacterianos/administração & dosagem , Farmacorresistência Bacteriana , Metiltransferases , Boca/microbiologia , Streptococcus/efeitos dos fármacos , Proteínas de Bactérias/metabolismo , Cuba , Eritromicina/farmacologia , México , Boca/anatomia & histologia , Reação em Cadeia da Polimerase/métodos , Streptococcus/classificação , Streptococcus/isolamento & purificação , Sulfadiazina/farmacologia , Trimetoprima/farmacologiaRESUMO
Molecular interactions define the functional organization of the cell. Epistatic (genetic, or gene-gene) interactions, one of the most informative and commonly encountered forms of functional relationships, are increasingly being used to map process architecture in model eukaryotic organisms. In particular, 'systems-level' screens in yeast and worm aimed at elucidating genetic interaction networks have led to the generation of models describing the global modular organization of gene products and protein complexes within a cell. However, comparable data for prokaryotic organisms have not been available. Given its ease of growth and genetic manipulation, the Gram-negative bacterium Escherichia coli appears to be an ideal model system for performing comprehensive genome-scale examinations of genetic redundancy in bacteria. In this review, we highlight emerging experimental and computational techniques that have been developed recently to examine functional relationships and redundancy in E. coli at a systems-level, and their potential application to prokaryotes in general. Additionally, we have scanned PubMed abstracts and full-text published articles to manually curate a list of approximately 200 previously reported synthetic sick or lethal genetic interactions in E. coli derived from small-scale experimental studies.
Assuntos
Epistasia Genética/genética , Escherichia coli/genética , Redes Reguladoras de Genes/genética , Modelos Genéticos , Biologia de Sistemas/métodos , Análise de Sequência com Séries de Oligonucleotídeos , Filogenia , Reprodutibilidade dos TestesRESUMO
Integrons are genetic elements that allow the mobilization and expression of smaller elements called gene cassettes, and are considered to be key elements in the evolution of antibiotic resistance among enteric bacteria. Although in nature integrons appear to be abundant, the presence of class 1 integrons in Escherichia coli has been reported to be much less frequent among isolates of non-human origin than among clinical ones. Searching for integrons in a wide variety of E. coli isolates we found a steep decline in class 1 integron prevalence when going from clinical strains to environmental ones, from outdoor urban dust to the microbiota of wild animals. Attempting to assess the causes of this decline, we addressed the evolution of integron integrases, comparing the amino acid sequence of various of these enzymes, the key proteins in gene-cassette mobilization. We found that all integrases are homologues, but different classes have been recruited by enteric bacteria, supporting the notion that integrons can frequently be gained and lost. Additionally, we found that phylogenetically distant organisms that bear intI1, such as E. coli and other enteric bacteria, but also the Gram-positive corynebacteria, have a similar preferential genomic codon usage (CU), suggesting that CU might play an important role in the acquisition and/or maintenance of integrons. In fact, the CU of intI1 is more similar to the preferential genomic CU of non-enteric bacteria than it is to that of E. coli. CU has been proposed to be involved in the retention of horizontally transferred genes; integrons in E. coli are often plasmid-borne. This might explain the reduced prevalence of integrons in enteric bacteria when not under the selective pressure of antibiotics. Collectively, our results provide evidence that class 1 integrons are important gene mobilizers within E. coli, but are not acquired and/or stably maintained without selective pressure. Thus, although not effective to reduce the prevalence of resistance itself, decreasing the use of antibiotics could be useful to diminish the presence of gene-mobilization machineries.
Assuntos
Microbiologia Ambiental , Infecções por Escherichia coli/microbiologia , Escherichia coli/enzimologia , Escherichia coli/genética , Evolução Molecular , Integrases/genética , Integrons , Proteínas de Bactérias/genética , Proteínas de Escherichia coli/genética , Bactérias Gram-Negativas/enzimologia , Bactérias Gram-Negativas/genética , Bactérias Gram-Positivas/enzimologia , Bactérias Gram-Positivas/genética , Filogenia , Plasmídeos , Seleção Genética , Homologia de Sequência de AminoácidosRESUMO
BACKGROUND: Twenty amino acids comprise the universal building blocks of proteins. However, their biosynthetic routes do not appear to be universal from an Escherichia coli-centric perspective. Nevertheless, it is necessary to understand their origin and evolution in a global context, that is, to include more 'model' species and alternative routes in order to do so. We use a comparative genomics approach to assess the origins and evolution of alternative amino acid biosynthetic network branches. RESULTS: By tracking the taxonomic distribution of amino acid biosynthetic enzymes, we predicted a core of widely distributed network branches biosynthesizing at least 16 out of the 20 standard amino acids, suggesting that this core occurred in ancient cells, before the separation of the three cellular domains of life. Additionally, we detail the distribution of two types of alternative branches to this core: analogs, enzymes that catalyze the same reaction (using the same metabolites) and belong to different superfamilies; and 'alternologs', herein defined as branches that, proceeding via different metabolites, converge to the same end product. We suggest that the origin of alternative branches is closely related to different environmental metabolite sources and life-styles among species. CONCLUSION: The multi-organismal seed strategy employed in this work improves the precision of dating and determining evolutionary relationships among amino acid biosynthetic branches. This strategy could be extended to diverse metabolic routes and even other biological processes. Additionally, we introduce the concept of 'alternolog', which not only plays an important role in the relationships between structure and function in biological networks, but also, as shown here, has strong implications for their evolution, almost equal to paralogy and analogy.
Assuntos
Aminoácidos/biossíntese , Evolução Biológica , Vias Biossintéticas , Enzimas/genética , Animais , Enzimas/metabolismo , HumanosRESUMO
Physical and functional interactions define the molecular organization of the cell. Genetic interactions, or epistasis, tend to occur between gene products involved in parallel pathways or interlinked biological processes. High-throughput experimental systems to examine genetic interactions on a genome-wide scale have been devised for Saccharomyces cerevisiae, Schizosaccharomyces pombe, Caenorhabditis elegans and Drosophila melanogaster, but have not been reported previously for prokaryotes. Here we describe the development of a quantitative screening procedure for monitoring bacterial genetic interactions based on conjugation of Escherichia coli deletion or hypomorphic strains to create double mutants on a genome-wide scale. The patterns of synthetic sickness and synthetic lethality (aggravating genetic interactions) we observed for certain double mutant combinations provided information about functional relationships and redundancy between pathways and enabled us to group bacterial gene products into functional modules.