RESUMEN
Extrachromosomal mobile genetic elements (eMGEs), including phages and plasmids, that can move across different microbes, play important roles in genome evolution and shaping the structure of microbial communities. However, we still know very little about eMGEs, especially their abundances, distributions and putative functions in microbiomes. Thus, a comprehensive description of eMGEs is of great utility. Here we present mMGE, a comprehensive catalog of 517 251 non-redundant eMGEs, including 92 492 plasmids and 424 759 phages, derived from diverse body sites of 66 425 human metagenomic samples. About half the eMGEs could be further grouped into 70 074 clusters using relaxed criteria (referred as to eMGE clusters below). We provide extensive annotations of the identified eMGEs including sequence characteristics, taxonomy affiliation, gene contents and their prokaryotic hosts. We also calculate the prevalence, both within and across samples for each eMGE and eMGE cluster, enabling users to see putative associations of eMGEs with human phenotypes or their distribution preferences. All eMGE records can be browsed or queried in multiple ways, such as eMGE clusters, metagenomic samples and associated hosts. The mMGE is equipped with a user-friendly interface and a BLAST server, facilitating easy access/queries to all its contents easily. mMGE is freely available for academic use at: https://mgedb.comp-sysbio.org.
Asunto(s)
Cromosomas Humanos/genética , Bases de Datos Genéticas , Secuencias Repetitivas Esparcidas/genética , Metagenómica , Análisis por Conglomerados , Secuencia Conservada , Mapeo Contig , Evolución Molecular , Cuerpo Humano , Humanos , Anotación de Secuencia Molecular , Interfaz Usuario-ComputadorRESUMEN
GMrepo (data repository for Gut Microbiota) is a database of curated and consistently annotated human gut metagenomes. Its main purpose is to facilitate the reusability and accessibility of the rapidly growing human metagenomic data. This is achieved by consistently annotating the microbial contents of collected samples using state-of-art toolsets and by manual curation of the meta-data of the corresponding human hosts. GMrepo organizes the collected samples according to their associated phenotypes and includes all possible related meta-data such as age, sex, country, body-mass-index (BMI) and recent antibiotics usage. To make relevant information easier to access, GMrepo is equipped with a graphical query builder, enabling users to make customized, complex and biologically relevant queries. For example, to find (1) samples from healthy individuals of 18 to 25 years old with BMIs between 18.5 and 24.9, or (2) projects that are related to colorectal neoplasms, with each containing >100 samples and both patients and healthy controls. Precomputed species/genus relative abundances, prevalence within and across phenotypes, and pairwise co-occurrence information are all available at the website and accessible through programmable interfaces. So far, GMrepo contains 58 903 human gut samples/runs (including 17 618 metagenomes and 41 285 amplicons) from 253 projects concerning 92 phenotypes. GMrepo is freely available at: https://gmrepo.humangut.info.
Asunto(s)
Bases de Datos Genéticas , Microbioma Gastrointestinal , Metagenoma , Metagenómica/métodos , Programas Informáticos , Genes Bacterianos , Genoma Humano , Humanos , Anotación de Secuencia MolecularRESUMEN
BACKGROUND: Compromised intestinal barrier (CIB) has been associated with many enteropathies, including colorectal cancer (CRC) and inflammatory bowel disease (IBD). We hypothesized that CIB could lead to increased host-derived contents including epithelial cells into the gut, change its physio-metabolic properties, and globally alter microbial community and metabolic capacities. RESULTS: Consistently, we found host DNA contents (HDCs), calculated as the percentage of metagenomic sequencing reads mapped to the host genome, were significantly elevated in patients of CRC and Crohn's disease (CD). Consistent with our hypothesis, we found that HDC correlated with microbial- and metabolic-biomarkers of these diseases, contributed significantly to machine-learning models for patient stratification and was consequently ranked as a top contributor. CD patients with treatment could partially reverse the changes of many CD-signature species over time, with reduced HDC and fecal calprotectin (FCP) levels. Strikingly, HDC showed stronger correlations with the reversing changes of the CD-related species than FCP, and contributed greatly in classifying treatment responses, suggesting that it was also a biomarker for effective treatment. CONCLUSIONS: Together, we revealed that association between HDCs and gut dysbiosis, and identified HDC as a novel biomarker from fecal metagenomics for diagnosis and effective treatment of intestinal diseases; our results also suggested that host-derived contents may have greater impact on gut microbiota than previously anticipated.
Asunto(s)
Biomarcadores/metabolismo , Neoplasias Colorrectales/diagnóstico , Enfermedad de Crohn/diagnóstico , ADN/metabolismo , Heces/química , Anticuerpos Monoclonales/uso terapéutico , Neoplasias Colorrectales/genética , Neoplasias Colorrectales/patología , Enfermedad de Crohn/tratamiento farmacológico , Enfermedad de Crohn/genética , Enfermedad de Crohn/patología , Disbiosis , Microbioma Gastrointestinal , Humanos , Mucosa Intestinal/metabolismo , Complejo de Antígeno L1 de Leucocito/metabolismo , Aprendizaje Automático , Metagenómica/métodos , Resultado del TratamientoRESUMEN
Drought is an abiotic stress that affects plant growth, and lipids are the main economic factor in the agricultural production of oil crops. However, the molecular mechanisms of drought response function in lipid metabolism remain little known. In this study, overexpression (OE) of different copies of the drought response genes LEA3 and VOC enhanced both drought tolerance and oil content in Brassica napus and Arabidopsis. Meanwhile, seed size, membrane stability and seed weight were also improved in OE lines. In contrast, oil content and drought tolerance were decreased in the AtLEA3 mutant (atlea3) and AtVOC-RNAi of Arabidopsis and in both BnLEA-RNAi and BnVOC-RNAi B. napus RNAi lines. Hybrids between two lines with increased or reduced expression (LEA3-OE with VOC-OE, atlea3 with AtVOC-RNAi) showed corresponding stronger trends in drought tolerance and lipid metabolism. Comparative transcriptomic analysis revealed the mechanisms of drought response gene function in lipid accumulation and drought tolerance. Gene networks involved in fatty acid (FA) synthesis and FA degradation were up- and down-regulated in OE lines, respectively. Key genes in the photosynthetic system and reactive oxygen species (ROS) metabolism were up-regulated in OE lines and down-regulated in atlea3 and AtVOC-RNAi lines, including LACS9, LIPASE1, PSAN, LOX2 and SOD1. Further analysis of photosynthetic and ROS enzymatic activities confirmed that the drought response genes LEA3 and VOC altered lipid accumulation mainly via enhancing photosynthetic efficiency and reducing ROS. The present study provides a novel way to improve lipid accumulation in plants, especially in oil production crops.
Asunto(s)
Arabidopsis/genética , Brassica napus/genética , Sequías , Metabolismo de los Lípidos , Proteínas de Plantas/genética , Arabidopsis/fisiología , Brassica napus/fisiología , Regulación de la Expresión Génica de las Plantas , Fotosíntesis , Plantas Modificadas Genéticamente , Especies Reactivas de Oxígeno/metabolismoRESUMEN
Genetic variations are instrumental for unraveling phage evolution and deciphering their functional implications. Here, we explore the underlying fine-scale genetic variations in the gut phageome, especially structural variations (SVs). By using virome-enriched long-read metagenomic sequencing across 91 individuals, we identified a total of 14,438 nonredundant phage SVs and revealed their prevalence within the human gut phageome. These SVs are mainly enriched in genes involved in recombination, DNA methylation, and antibiotic resistance. Notably, a substantial fraction of phage SV sequences share close homology with bacterial fragments, with most SVs enriched for horizontal gene transfer (HGT) mechanism. Further investigations showed that these SV sequences were genetic exchanged between specific phage-bacteria pairs, particularly between phages and their respective bacterial hosts. Temperate phages exhibit a higher frequency of genetic exchange with bacterial chromosomes and then virulent phages. Collectively, our findings provide insights into the genetic landscape of the human gut phageome.
Asunto(s)
Bacterias , Bacteriófagos , Microbioma Gastrointestinal , Transferencia de Gen Horizontal , Bacteriófagos/genética , Humanos , Microbioma Gastrointestinal/genética , Bacterias/virología , Bacterias/genética , Metagenómica/métodos , Variación Genética , Viroma/genética , Genoma Viral , Secuenciación de Nucleótidos de Alto RendimientoRESUMEN
Current metagenome assembled human gut phage catalogs contained mostly fragmented genomes. Here, comprehensive gut virome detection procedure is developed involving virus-like particle (VLP) enrichment from ≈500 g feces and combined sequencing of short- and long-read. Applied to 135 samples, a Chinese Gut Virome Catalog (CHGV) is assembled consisting of 21,499 non-redundant viral operational taxonomic units (vOTUs) that are significantly longer than those obtained by short-read sequencing and contained ≈35% (7675) complete genomes, which is ≈nine times more than those in the Gut Virome Database (GVD, ≈4%, 1,443). Interestingly, the majority (≈60%, 13,356) of the CHGV vOTUs are obtained by either long-read or hybrid assemblies, with little overlap with those assembled from only the short-read data. With this dataset, vast diversity of the gut virome is elucidated, including the identification of 32% (6,962) novel vOTUs compare to public gut virome databases, dozens of phages that are more prevalent than the crAssphages and/or Gubaphages, and several viral clades that are more diverse than the two. Finally, the functional capacities are also characterized of the CHGV encoded proteins and constructed a viral-host interaction network to facilitate future research and applications.
Asunto(s)
Bacteriófagos , Humanos , Bacteriófagos/genética , Análisis de Secuencia , Genoma Viral/genética , Metagenoma/genética , HecesRESUMEN
BACKGROUND: The fungal component of the human gut microbiome, also known as the mycobiome, plays a vital role in intestinal ecology and human health. However, the overall structure of the gut mycobiome as well as the inter-individual variations in fungal composition remains largely unknown. In this study, we collected a total of 3363 fungal sequencing samples from 16 cohorts across three continents, including 572 newly profiled samples from China. RESULTS: We identify and characterize four mycobiome enterotypes using ITS profiling of 3363 samples from 16 cohorts. These enterotypes exhibit stability across populations and geographical locations and significant correlation with bacterial enterotypes. Particularly, we notice that fungal enterotypes have a strong age preference, where the enterotype dominated by Candida (i.e., Can_type enterotype) is enriched in the elderly population and confers an increased risk of multiple diseases associated with a compromised intestinal barrier. In addition, bidirectional mediation analysis reveals that the fungi-contributed aerobic respiration pathway associated with the Can_type enterotype might mediate the association between the compromised intestinal barrier and aging. CONCLUSIONS: We show that the human gut mycobiome has stable compositional patterns across individuals and significantly correlates with multiple host factors, such as diseases and host age. Video Abstract.
Asunto(s)
Microbioma Gastrointestinal , Microbiota , Micobioma , Humanos , Anciano , Micobioma/genética , Microbioma Gastrointestinal/genética , Candida , EnvejecimientoRESUMEN
Evaluating the quality of metagenomic assemblies is important for constructing reliable metagenome-assembled genomes and downstream analyses. Here, we present metaMIC ( https://github.com/ZhaoXM-Lab/metaMIC ), a machine learning-based tool for identifying and correcting misassemblies in metagenomic assemblies. Benchmarking results on both simulated and real datasets demonstrate that metaMIC outperforms existing tools when identifying misassembled contigs. Furthermore, metaMIC is able to localize the misassembly breakpoints, and the correction of misassemblies by splitting at misassembly breakpoints can improve downstream scaffolding and binning results.