RESUMO
Traditional gene set enrichment analyses are typically limited to a few ontologies and do not account for the interdependence of gene sets or terms, resulting in overcorrected p-values. To address these challenges, we introduce mulea, an R package offering comprehensive overrepresentation and functional enrichment analysis. mulea employs a progressive empirical false discovery rate (eFDR) method, specifically designed for interconnected biological data, to accurately identify significant terms within diverse ontologies. mulea expands beyond traditional tools by incorporating a wide range of ontologies, encompassing Gene Ontology, pathways, regulatory elements, genomic locations, and protein domains. This flexibility enables researchers to tailor enrichment analysis to their specific questions, such as identifying enriched transcriptional regulators in gene expression data or overrepresented protein domains in protein sets. To facilitate seamless analysis, mulea provides gene sets (in standardised GMT format) for 27 model organisms, covering 22 ontology types from 16 databases and various identifiers resulting in almost 900 files. Additionally, the muleaData ExperimentData Bioconductor package simplifies access to these pre-defined ontologies. Finally, mulea's architecture allows for easy integration of user-defined ontologies, or GMT files from external sources (e.g., MSigDB or Enrichr), expanding its applicability across diverse research areas. mulea is distributed as a CRAN R package downloadable from https://cran.r-project.org/web/packages/mulea/ and https://github.com/ELTEbioinformatics/mulea . It offers researchers a powerful and flexible toolkit for functional enrichment analysis, addressing limitations of traditional tools with its progressive eFDR and by supporting a variety of ontologies. Overall, mulea fosters the exploration of diverse biological questions across various model organisms.
Assuntos
Ontologia Genética , Software , Bases de Dados Genéticas , Biologia Computacional/métodosRESUMO
Phage therapy is gaining increasing interest in the fight against critically antibiotic-resistant nosocomial pathogens. However, the narrow host range of bacteriophages hampers the development of broadly effective phage therapeutics and demands precision approaches. Here, we combine large-scale phylogeographic analysis with high-throughput phage typing to guide the development of precision phage cocktails targeting carbapenem-resistant Acinetobacter baumannii, a top-priority pathogen. Our analysis reveals that a few strain types dominate infections in each world region, with their geographical distribution remaining stable within 6 years. As we demonstrate in Eastern Europe, this spatiotemporal distribution enables preemptive preparation of region-specific phage collections that target most local infections. Finally, we showcase the efficacy of phage cocktails against prevalent strain types using in vitro and animal infection models. Ultimately, genomic surveillance identifies patients benefiting from the same phages across geographical scales, thus providing a scalable framework for precision phage therapy.
Assuntos
Acinetobacter baumannii , Bacteriófagos , Terapia por Fagos , Terapia por Fagos/métodos , Acinetobacter baumannii/virologia , Acinetobacter baumannii/efeitos dos fármacos , Acinetobacter baumannii/genética , Animais , Humanos , Bacteriófagos/genética , Antibacterianos/farmacologia , Antibacterianos/uso terapêutico , Infecções por Acinetobacter/terapia , Infecções por Acinetobacter/microbiologia , Genômica/métodos , Farmacorresistência Bacteriana/genética , Camundongos , Filogeografia , Carbapenêmicos/farmacologia , Carbapenêmicos/uso terapêuticoRESUMO
Ascorbate (Asc) is a major plant metabolite that plays crucial roles in various processes, from reactive oxygen scavenging to epigenetic regulation. However, to what extent and how Asc modulates metabolism is largely unknown. We investigated the consequences of chloroplastic and total cellular Asc deficiencies by studying chloroplastic Asc transporter mutant lines lacking PHOSPHATE TRANSPORTER 4; 4 and the Asc-deficient vtc2-4 mutant of Arabidopsis (Arabidopsis thaliana). Under regular growth conditions, both Asc deficiencies caused minor alterations in photosynthesis, with no apparent signs of oxidative damage. In contrast, metabolomics analysis revealed global and largely overlapping alterations in the metabolome profiles of both Asc-deficient mutants, suggesting that chloroplastic Asc modulates plant metabolism. We observed significant alterations in amino acid metabolism, particularly in arginine metabolism, activation of nucleotide salvage pathways, and changes in secondary metabolism. In addition, proteome-wide analysis of thermostability revealed that Asc may interact with enzymes involved in arginine metabolism, the Calvin-Benson cycle, and several photosynthetic electron transport components. Overall, our results suggest that, independent of oxidative stress, chloroplastic Asc modulates the activity of diverse metabolic pathways in vascular plants and may act as an internal metabolite signal.
Assuntos
Arabidopsis , Ácido Ascórbico , Cloroplastos , Estresse Oxidativo , Fotossíntese , Arabidopsis/metabolismo , Arabidopsis/genética , Ácido Ascórbico/metabolismo , Cloroplastos/metabolismo , Metaboloma , Proteínas de Arabidopsis/metabolismo , Proteínas de Arabidopsis/genética , Metabolômica/métodos , Mutação/genéticaRESUMO
Natural selection acts ubiquitously on complex human traits, predominantly constraining the occurrence of extreme phenotypes (stabilizing selection). These constraints propagate to DNA sequence variants associated with traits under selection. The genetic signatures of such evolutionary events can thus be detected via combining effect size estimates from genetic association studies and the corresponding allele frequencies. Although this approach has been successfully applied to high-level traits, the prevalence and mode of selection acting on molecular traits remain poorly understood. Here, we estimate the action of natural selection on genetic variants associated with metabolite levels, an important layer of molecular traits. By leveraging summary statistics of published genome-wide association studies with large sample sizes, we find strong evidence of stabilizing selection for 15 out of 97 plasma metabolites, with nonessential amino acids displaying especially strong selection signatures. Mendelian randomization analysis reveals that metabolites under stronger stabilizing selection display larger effects on a range of clinically relevant complex traits, suggesting that maintaining a disease-free profile may be an important source of selective constraints on the metabolome. Metabolites under strong stabilizing selection in humans are also more conserved in their concentrations among diverse mammalian species, suggesting shared selective forces across micro- and macroevolutionary timescales. Overall, this study demonstrates that variation in metabolite levels among humans is frequently shaped by natural selection and this may act through their causal impact on disease susceptibility.
Assuntos
Estudo de Associação Genômica Ampla , Seleção Genética , Humanos , Estudo de Associação Genômica Ampla/métodos , Metaboloma , Fenótipo , Evolução Molecular , Frequência do Gene , Animais , Análise da Randomização Mendeliana , Polimorfismo de Nucleotídeo ÚnicoRESUMO
Cellular metabolism evolves through changes in the structure and quantitative states of metabolic networks. Here, we explore the evolutionary dynamics of metabolic states by focusing on the collection of metabolite levels, the metabolome, which captures key aspects of cellular physiology. Using a phylogenetic framework, we profiled metabolites in 27 populations of nine budding yeast species, providing a graduated view of metabolic variation across multiple evolutionary time scales. Metabolite levels evolve more rapidly and independently of changes in the metabolic network's structure, providing complementary information to enzyme repertoire. Although metabolome variation accumulates mainly gradually over time, it is profoundly affected by domestication. We found pervasive signatures of convergent evolution in the metabolomes of independently domesticated clades of Saccharomyces cerevisiae. Such recurring metabolite differences between wild and domesticated populations affect a substantial part of the metabolome, including rewiring of the TCA cycle and several amino acids that influence aroma production, likely reflecting adaptation to human niches. Overall, our work reveals previously unrecognized diversity in central metabolism and the pervasive influence of human-driven selection on metabolite levels in yeasts.
Assuntos
Domesticação , Saccharomycetales , Humanos , Filogenia , Saccharomycetales/genética , Metaboloma , Saccharomyces cerevisiae/genéticaRESUMO
Heterochromatin plays a critical role in regulating gene expression and maintaining genome integrity. While structural and enzymatic components have been linked to heterochromatin establishment, a comprehensive view of the underlying pathways at diverse heterochromatin domains remains elusive. Here, we developed a systematic approach to identify factors involved in heterochromatin silencing at pericentromeres, subtelomeres, and the silent mating type locus in Schizosaccharomyces pombe. Using quantitative measures, iterative genetic screening, and domain-specific heterochromatin reporters, we identified 369 mutants with different degrees of reduced or enhanced silencing. As expected, mutations in the core heterochromatin machinery globally decreased silencing. However, most other mutants exhibited distinct qualitative and quantitative profiles that indicate domain-specific functions. For example, decreased mating type silencing was linked to mutations in heterochromatin maintenance genes, while compromised subtelomere silencing was associated with metabolic pathways. Furthermore, similar phenotypic profiles revealed shared functions for subunits within complexes. We also discovered that the uncharacterized protein Dhm2 plays a crucial role in maintaining constitutive and facultative heterochromatin, while its absence caused phenotypes akin to DNA replication-deficient mutants. Collectively, our systematic approach unveiled a landscape of domain-specific heterochromatin regulators controlling distinct states and identified Dhm2 as a previously unknown factor linked to heterochromatin inheritance and replication fidelity.
RESUMO
Metabolite levels shape cellular physiology and disease susceptibility, yet the general principles governing metabolome evolution are largely unknown. Here, we introduce a measure of conservation of individual metabolite levels among related species. By analyzing multispecies tissue metabolome datasets in phylogenetically diverse mammals and fruit flies, we show that conservation varies extensively across metabolites. Three major functional properties, metabolite abundance, essentiality, and association with human diseases predict conservation, highlighting a striking parallel between the evolutionary forces driving metabolome and protein sequence conservation. Metabolic network simulations recapitulated these general patterns and revealed that abundant metabolites are highly conserved due to their strong coupling to key metabolic fluxes in the network. Finally, we show that biomarkers of metabolic diseases can be distinguished from other metabolites simply based on evolutionary conservation, without requiring any prior clinical knowledge. Overall, this study uncovers simple rules that govern metabolic evolution in animals and implies that most tissue metabolome differences between species are permitted, rather than favored by natural selection. More broadly, our work paves the way toward using evolutionary information to identify biomarkers, as well as to detect pathogenic metabolome alterations in individual patients.
Assuntos
Drosophila , Metaboloma , Animais , Humanos , Sequência de Aminoácidos , Conhecimento , MamíferosRESUMO
Assemblysomes are EDTA- and RNase-resistant ribonucleoprotein (RNP) complexes of paused ribosomes with protruding nascent polypeptide chains. They have been described in yeast and human cells for the proteasome subunit Rpt1, and the disordered amino-terminal part of the nascent chain was found to be indispensable for the accumulation of the Rpt1-RNP into assemblysomes. Motivated by this, to find other assemblysome-associated RNPs we used bioinformatics to rank subunits of Saccharomyces cerevisiae protein complexes according to their amino-terminal disorder propensity. The results revealed that gene products involved in DNA repair are enriched among the top candidates. The Sgs1 DNA helicase was chosen for experimental validation. We found that indeed nascent chains of Sgs1 form EDTA-resistant RNP condensates, assemblysomes by definition. Moreover, upon exposure to UV, SGS1 mRNA shifted from assemblysomes to polysomes, suggesting that external stimuli are regulators of assemblysome dynamics. We extended our studies to human cell lines. The BLM helicase, ortholog of yeast Sgs1, was identified upon sequencing assemblysome-associated RNAs from the MCF7 human breast cancer cell line, and mRNAs encoding DNA repair proteins were overall enriched. Using the radiation-resistant A549 cell line, we observed by transmission electron microscopy that 1,6-hexanediol, an agent known to disrupt phase-separated condensates, depletes ring ribosome structures compatible with assemblysomes from the cytoplasm of cells and makes the cells more sensitive to X-ray treatment. Taken together, these findings suggest that assemblysomes may be a component of the DNA damage response from yeast to human.
Assuntos
Proteínas de Saccharomyces cerevisiae , Saccharomyces cerevisiae , Humanos , Saccharomyces cerevisiae/genética , Saccharomyces cerevisiae/metabolismo , Proteínas de Saccharomyces cerevisiae/genética , Proteínas de Saccharomyces cerevisiae/metabolismo , RecQ Helicases/genética , Ácido Edético/metabolismo , Dano ao DNA , RNA/metabolismo , Ribonucleoproteínas/genética , Ribossomos/genética , Ribossomos/metabolismoRESUMO
Functional metagenomics is a powerful experimental tool to identify antibiotic resistance genes (ARGs) in the environment, but the range of suitable host bacterial species is limited. This limitation affects both the scope of the identified ARGs and the interpretation of their clinical relevance. Here we present a functional metagenomics pipeline called Reprogrammed Bacteriophage Particle Assisted Multi-species Functional Metagenomics (DEEPMINE). This approach combines and improves the use of T7 bacteriophage with exchanged tail fibres and targeted mutagenesis to expand phage host-specificity and efficiency for functional metagenomics. These modified phage particles were used to introduce large metagenomic plasmid libraries into clinically relevant bacterial pathogens. By screening for ARGs in soil and gut microbiomes and clinical genomes against 13 antibiotics, we demonstrate that this approach substantially expands the list of identified ARGs. Many ARGs have species-specific effects on resistance; they provide a high level of resistance in one bacterial species but yield very limited resistance in a related species. Finally, we identified mobile ARGs against antibiotics that are currently under clinical development or have recently been approved. Overall, DEEPMINE expands the functional metagenomics toolbox for studying microbial communities.
Assuntos
Bacteriófagos , Genes Bacterianos , Antibacterianos/farmacologia , Metagenômica , Bacteriófagos/genética , Bactérias/genéticaRESUMO
Bacterial evolution of antibiotic resistance frequently has deleterious side effects on microbial growth, virulence, and susceptibility to other antimicrobial agents. However, it is unclear how these trade-offs could be utilized for manipulating antibiotic resistance in the clinic, not least because the underlying molecular mechanisms are poorly understood. Using laboratory evolution, we demonstrate that clinically relevant resistance mutations in Escherichia coli constitutively rewire a large fraction of the transcriptome in a repeatable and stereotypic manner. Strikingly, lineages adapted to functionally distinct antibiotics and having no resistance mutations in common show a wide range of parallel gene expression changes that alter oxidative stress response, iron homeostasis, and the composition of the bacterial outer membrane and cell surface. These common physiological alterations are associated with changes in cell morphology and enhanced sensitivity to antimicrobial peptides. Finally, the constitutive transcriptomic changes induced by resistance mutations are largely distinct from those induced by antibiotic stresses in the wild type. This indicates a limited role for genetic assimilation of the induced antibiotic stress response during resistance evolution. Our work suggests that diverse resistance mutations converge on similar global transcriptomic states that shape genetic susceptibility to antimicrobial compounds.
Assuntos
Antibacterianos , Transcriptoma , Testes de Sensibilidade Microbiana , Antibacterianos/farmacologia , Resistência Microbiana a Medicamentos/genética , Escherichia coli/genética , Escherichia coli/metabolismo , Bactérias/genética , Farmacorresistência Bacteriana/genéticaRESUMO
Metabolism is deeply intertwined with aging. Effects of metabolic interventions on aging have been explained with intracellular metabolism, growth control, and signaling. Studying chronological aging in yeast, we reveal a so far overlooked metabolic property that influences aging via the exchange of metabolites. We observed that metabolites exported by young cells are re-imported by chronologically aging cells, resulting in cross-generational metabolic interactions. Then, we used self-establishing metabolically cooperating communities (SeMeCo) as a tool to increase metabolite exchange and observed significant lifespan extensions. The longevity of the SeMeCo was attributable to metabolic reconfigurations in methionine consumer cells. These obtained a more glycolytic metabolism and increased the export of protective metabolites that in turn extended the lifespan of cells that supplied them with methionine. Our results establish metabolite exchange interactions as a determinant of cellular aging and show that metabolically cooperating cells can shape the metabolic environment to extend their lifespan.
Assuntos
Longevidade , Saccharomyces cerevisiae , Saccharomyces cerevisiae/metabolismo , Metionina/metabolismo , Transdução de SinaisRESUMO
Analysis of transcriptional regulatory interactions and their comparisons across multiple species are crucial for progress in various fields in biology, from functional genomics to the evolution of signal transduction pathways. However, despite the rapidly growing body of data on regulatory interactions in several eukaryotes, no databases exist to provide curated high-quality information on transcription factor-target gene interactions for multiple species. Here, we address this gap by introducing the TFLink gateway, which uniquely provides experimentally explored and highly accurate information on transcription factor-target gene interactions (â¼12 million), nucleotide sequences and genomic locations of transcription factor binding sites (â¼9 million) for human and six model organisms: mouse, rat, zebrafish, fruit fly, worm and yeast by integrating 10 resources. TFLink provides user-friendly access to data on transcription factor-target gene interactions, interactive network visualizations and transcription factor binding sites, with cross-links to several other databases. Besides containing accurate information on transcription factors, with a clear labelling of the type/volume of the experiments (small-scale or high-throughput), the source database and the original publications, TFLink also provides a wealth of standardized regulatory data available for download in multiple formats. The database offers easy access to high-quality data for wet-lab researchers, supplies data for gene set enrichment analyses and facilitates systems biology and comparative gene regulation studies. Database URL https://tflink.net/.
Assuntos
Fatores de Transcrição , Peixe-Zebra , Animais , Regulação da Expressão Gênica , Genômica , Humanos , Camundongos , Ratos , Fatores de Transcrição/genética , Fatores de Transcrição/metabolismoRESUMO
Retrospective evaluation of past waves of the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) epidemic is key for designing optimal interventions against future waves and novel pandemics. Here, we report on analysing genome sequences of SARS-CoV-2 from the first two waves of the epidemic in 2020 in Hungary, mirroring a suppression and a mitigation strategy, respectively. Our analysis reveals that the two waves markedly differed in viral diversity and transmission patterns. Specifically, unlike in several European areas or in the USA, we have found no evidence for early introduction and cryptic transmission of the virus in the first wave of the pandemic in Hungary. Despite the introduction of multiple viral lineages, extensive community spread was prevented by a timely national lockdown in March 2020. In sharp contrast, the majority of the cases in the much larger second wave can be linked to a single transmission lineage of the pan-European B.1.160 variant. This lineage was introduced unexpectedly early, followed by a 2-month-long cryptic transmission before a soar of detected cases in September 2020. Epidemic analysis has revealed that the dominance of this lineage in the second wave was not associated with an intrinsic transmission advantage. This finding is further supported by the rapid replacement of B.1.160 by the alpha variant (B.1.1.7) that launched the third wave of the epidemic in February 2021. Overall, these results illustrate how the founder effect in combination with the cryptic transmission, instead of repeated international introductions or higher transmissibility, can govern viral diversity.
RESUMO
Proteins are prone to aggregate when expressed above their solubility limits. Aggregation may occur rapidly, potentially as early as proteins emerge from the ribosome, or slowly, following synthesis. However, in vivo data on aggregation rates are scarce. Here, we classified the Escherichia coli proteome into rapidly and slowly aggregating proteins using an in vivo image-based screen coupled with machine learning. We find that the majority (70%) of cytosolic proteins that become insoluble upon overexpression have relatively low rates of aggregation and are unlikely to aggregate co-translationally. Remarkably, such proteins exhibit higher folding rates compared to rapidly aggregating proteins, potentially implying that they aggregate after reaching their folded states. Furthermore, we find that a substantial fraction (~ 35%) of the proteome remain soluble at concentrations much higher than those found naturally, indicating a large margin of safety to tolerate gene expression changes. We show that high disorder content and low surface stickiness are major determinants of high solubility and are favored in abundant bacterial proteins. Overall, our study provides a global view of aggregation rates and hence solubility limits of proteins in a bacterial cell.
Assuntos
Dobramento de Proteína , Proteoma , Escherichia coli/genética , Escherichia coli/metabolismo , Proteoma/metabolismo , Ribossomos/metabolismo , SolubilidadeRESUMO
MOTIVATION: Bioproduction of value-added compounds is frequently achieved by utilizing enzymes from other species. However, expression of such heterologous enzymes can be detrimental due to unexpected interactions within the host cell. Recently, an alternative strategy emerged, which relies on recruiting side activities of host enzymes to establish new biosynthetic pathways. Although such low-level 'underground' enzyme activities are prevalent, it remains poorly explored whether they may serve as an important reservoir for pathway engineering. RESULTS: Here, we use genome-scale modeling to estimate the theoretical potential of underground reactions for engineering novel biosynthetic pathways in Escherichia coli. We found that biochemical reactions contributed by underground enzyme activities often enhance the in silico production of compounds with industrial importance, including several cases where underground activities are indispensable for production. Most of these new capabilities can be achieved by the addition of one or two underground reactions to the native network, suggesting that only a few side activities need to be enhanced during implementation. Remarkably, we find that the contribution of underground reactions to the production of value-added compounds is comparable to that of heterologous reactions, underscoring their biotechnological potential. Taken together, our genome-wide study demonstrates that exploiting underground enzyme activities could be a promising addition to the toolbox of industrial strain development. AVAILABILITY AND IMPLEMENTATION: The data and scripts underlying this article are available on GitHub at https://github.com/pappb/Kovacs-et-al-Underground-metabolism. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Assuntos
Estudo de Associação Genômica Ampla , Redes e Vias Metabólicas , Escherichia coli/genética , Escherichia coli/metabolismo , Vias Biossintéticas , Engenharia MetabólicaRESUMO
Deleterious mutations are generally considered to be irrelevant for morphological evolution. However, they could be compensated by conditionally beneficial mutations, thereby providing access to new adaptive paths. Here we use high-dimensional phenotyping of laboratory-evolved budding yeast lineages to demonstrate that new cellular morphologies emerge exceptionally rapidly as a by-product of gene loss and subsequent compensatory evolution. Unexpectedly, the capacities for invasive growth, multicellular aggregation and biofilm formation also spontaneously evolve in response to gene loss. These multicellular phenotypes can be achieved by diverse mutational routes and without reactivating the canonical regulatory pathways. These ecologically and clinically relevant traits originate as pleiotropic side effects of compensatory evolution and have no obvious utility in the laboratory environment. The extent of morphological diversity in the evolved lineages is comparable to that of natural yeast isolates with diverse genetic backgrounds and lifestyles. Finally, we show that both the initial gene loss and subsequent compensatory mutations contribute to new morphologies, with their synergistic effects underlying specific morphological changes. We conclude that compensatory evolution is a previously unrecognized source of morphological diversity and phenotypic novelties.
Assuntos
Saccharomycetales , Mutação , Fenótipo , Saccharomyces cerevisiae/genética , Saccharomycetales/genéticaRESUMO
Human leukocyte antigen class I (HLA-I) genes shape our immune response against pathogens and cancer. Certain HLA-I variants can bind a wider range of peptides than others, a feature that could be favorable against a range of viral diseases. However, the implications of this phenomenon on cancer immune response are unknown. Here we quantified peptide repertoire breadth (or promiscuity) of a representative set of HLA-I alleles and found that patients with cancer who were carrying HLA-I alleles with high peptide-binding promiscuity have significantly worse prognosis after immune checkpoint inhibition. This can be explained by a reduced capacity of the immune system to discriminate tumor neopeptides from self-peptides when patients carry highly promiscuous HLA-I variants, shifting the regulation of tumor-infiltrating T cells from activation to tolerance. In summary, HLA-I peptide-binding specificity shapes neopeptide immunogenicity and the self-immunopeptidome repertoire in an antagonistic manner, and could underlie a negative trade-off between antitumor immunity and genetic susceptibility to viral infections.
Assuntos
Antígenos de Histocompatibilidade Classe I , Neoplasias , Alelos , Antígenos de Histocompatibilidade Classe I/genética , Humanos , Neoplasias/genética , Peptídeos/genética , Linfócitos TRESUMO
The fitness impact of loss-of-function mutations is generally assumed to reflect the loss of specific molecular functions associated with the perturbed gene. Here, we propose that rewiring of the transcriptome upon deleterious gene inactivation is frequently nonspecific and mimics stereotypic responses to external environmental change. Consequently, transcriptional response to gene deletion could be suboptimal and incur an extra fitness cost. Analysis of the transcriptomes of â¼1,500 single-gene deletion Saccharomyces cerevisiae strains supported this scenario. First, most transcriptomic changes are not specific to the deleted gene but are rather triggered by perturbations in functionally diverse genes. Second, gene deletions that alter the expression of dosage-sensitive genes are especially harmful. Third, by elevating the expression level of downregulated genes, we could experimentally mitigate the fitness defect of gene deletions. Our work shows that rewiring of genomic expression upon gene inactivation shapes the harmful effects of mutations.