RESUMO
Retrophylogenomics makes use of genome-wide retrotransposon presence/absence insertion patterns to resolve questions in phylogeny and population genetics. In the genomics era, evaluating high-throughput data requires the associated development of appropriately powerful statistical tools. The currently used KKSC 3-lineage statistical test for estimating the significance of retrophylogenomic data is limited by the number of possible tree topologies it can assess in one step. To improve on this, we have extended the analysis to simultaneously compare four lineages, enabling us to evaluate ten distinct presence/absence insertion patterns for 26 possible tree topologies plus 129 trees with different incidences of hybridization or introgression. The new tool provides statistics for cases involving multiple ancestral hybridizations/introgressions, ancestral incomplete lineage sorting, bifurcation, and polytomy. The test is embedded in a user-friendly web R application (http://retrogenomics.uni-muenster.de:3838/hammlet/) and is available for use by the scientific community. [ancestral hybridization/introgression; ancestral incomplete lineage sorting (ILS); empirical distribution; KKSC-statistics; 4-lineage (4-LIN) insertion polymorphism; polytomy; retrophylogenomics.].
Assuntos
Evolução Biológica , Retroelementos , Retroelementos/genética , Filogenia , Software , GenômicaRESUMO
Motivation: Antibiotic resistance is an important global public health problem. Human gut microbiota is an accumulator of resistance genes potentially providing them to pathogens. It is important to develop tools for identifying the mechanisms of how resistance is transmitted between gut microbial species and pathogens. Results: We developed MetaCherchant-an algorithm for extracting the genomic environment of antibiotic resistance genes from metagenomic data in the form of a graph. The algorithm was validated on a number of simulated and published datasets, as well as applied to new 'shotgun' metagenomes of gut microbiota from patients with Helicobacter pylori who underwent antibiotic therapy. Genomic context was reconstructed for several major resistance genes. Taxonomic annotation of the context suggests that within a single metagenome, the resistance genes can be contained in genomes of multiple species. MetaCherchant allows reconstruction of mobile elements with resistance genes within the genomes of bacteria using metagenomic data. Application of MetaCherchant in differential mode produced specific graph structures suggesting the evidence of possible resistance gene transmission within a mobile element that occurred as a result of the antibiotic therapy. MetaCherchant is a promising tool giving researchers an opportunity to get an insight into dynamics of resistance transmission in vivo basing on metagenomic data. Availability and implementation: Source code and binaries are freely available for download at https://github.com/ctlab/metacherchant. The code is written in Java and is platform-independent. Cotanct: ulyantsev@rain.ifmo.ru. Supplementary information: Supplementary data are available at Bioinformatics online.
Assuntos
Bactérias/genética , Farmacorresistência Bacteriana/genética , Microbioma Gastrointestinal/genética , Metagenômica/métodos , Software , Algoritmos , HumanosRESUMO
MOTIVATION: High-throughput metagenomic sequencing has revolutionized our view on the structure and metabolic potential of microbial communities. However, analysis of metagenomic composition is often complicated by the high complexity of the community and the lack of related reference genomic sequences. As a start point for comparative metagenomic analysis, the researchers require efficient means for assessing pairwise similarity of the metagenomes (beta-diversity). A number of approaches were used to address this task, however, most of them have inherent disadvantages that limit their scope of applicability. For instance, the reference-based methods poorly perform on metagenomes from previously unstudied niches, while composition-based methods appear to be too abstract for straightforward interpretation and do not allow to identify the differentially abundant features. RESULTS: We developed MetaFast, an approach that allows to represent a shotgun metagenome from an arbitrary environment as a modified de Bruijn graph consisting of simplified components. For multiple metagenomes, the resulting representation is used to obtain a pairwise similarity matrix. The dimensional structure of the metagenomic components preserved in our algorithm reflects the inherent subspecies-level diversity of microbiota. The method is computationally efficient and especially promising for an analysis of metagenomes from novel environmental niches. AVAILABILITY AND IMPLEMENTATION: Source code and binaries are freely available for download at https://github.com/ctlab/metafast The code is written in Java and is platform independent (tested on Linux and Windows x86_64). CONTACT: ulyantsev@rain.ifmo.ru SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Assuntos
Algoritmos , Metagenômica , Biologia Computacional/métodos , Bases de Dados Genéticas , Metagenoma , MicrobiotaRESUMO
BACKGROUND: A rapidly increasing flow of genomic data requires the development of efficient methods for obtaining its compact representation. Feature extraction facilitates classification, clustering and model analysis for testing and refining biological hypotheses. "Shotgun" metagenome is an analytically challenging type of genomic data - containing sequences of all genes from the totality of a complex microbial community. Recently, researchers started to analyze metagenomes using reference-free methods based on the analysis of oligonucleotides (k-mers) frequency spectrum previously applied to isolated genomes. However, little is known about their correlation with the existing approaches for metagenomic feature extraction, as well as the limits of applicability. Here we evaluated a metagenomic pairwise dissimilarity measure based on short k-mer spectrum using the example of human gut microbiota, a biomedically significant object of study. RESULTS: We developed a method for calculating pairwise dissimilarity (beta-diversity) of "shotgun" metagenomes based on short k-mer spectra (5 ≤ k ≤ 11). The method was validated on simulated metagenomes and further applied to a large collection of human gut metagenomes from the populations of the world (n=281). The k-mer spectrum-based measure was found to behave similarly to one based on mapping to a reference gene catalog, but different from one using a genome catalog. This difference turned out to be associated with a significant presence of viral reads in a number of metagenomes. Simulations showed limited impact of bacterial genetic variability as well as sequencing errors on k-mer spectra. Specific differences between the datasets from individual populations were identified. CONCLUSIONS: Our approach allows rapid estimation of pairwise dissimilarity between metagenomes. Though we applied this technique to gut microbiota, it should be useful for arbitrary metagenomes, even metagenomes with novel microbiota. Dissimilarity measure based on k-mer spectrum provides a wider perspective in comparison with the ones based on the alignment against reference sequence sets. It helps not to miss possible outstanding features of metagenomic composition, particularly related to the presence of an unknown bacteria, virus or eukaryote, as well as to technical artifacts (sample contamination, reads of non-biological origin, etc.) at the early stages of bioinformatic analysis. Our method is complementary to reference-based approaches and can be easily integrated into metagenomic analysis pipelines.
Assuntos
Metagenoma , Metagenômica/métodos , Mapeamento Cromossômico , Análise por Conglomerados , Biologia Computacional , Simulação por Computador , Bases de Dados Genéticas , Microbioma Gastrointestinal/genética , Trato Gastrointestinal/microbiologia , Humanos , Modelos Moleculares , Polimorfismo de Nucleotídeo Único , Análise de Sequência de DNARESUMO
Immunotherapy has proven to be a boon for patients battling metastatic melanoma, significantly improving their clinical condition and overall quality of life. A compelling link between the composition of the gut microbiome and the efficacy of immunotherapy has been established in both animal models and human patients. However, the precise biological mechanisms by which gut microbes influence treatment outcomes remain poorly understood. Using a robust dataset of 680 fecal metagenomes from melanoma patients, a detailed catalog of metagenome-assembled genomes (MAGs) was constructed to explore the compositional and functional properties of the gut microbiome. Our study uncovered significant findings that deepen the understanding of the intricate relationship between gut microbes and the efficacy of melanoma immunotherapy. In particular, we discovered the specific metagenomic profile of patients with favorable treatment outcomes, characterized by a prevalence of MAGs with increased overall metabolic potential and proficiency in polysaccharide utilization, along with those responsible for cobalamin and amino acid production. Furthermore, our investigation of the biosynthetic pathways of short-chain fatty acids, known for their immunomodulatory role, revealed a differential abundance of these pathways among the specific MAGs. Among others, the cobalamin-dependent Wood-Ljungdahl pathway of acetate synthesis was directly associated with responsiveness to melanoma immunotherapy.
Assuntos
Microbioma Gastrointestinal , Melanoma , Animais , Humanos , Microbioma Gastrointestinal/genética , Melanoma/terapia , Qualidade de Vida , Imunoterapia , Vitamina B 12RESUMO
The human gut microbiome plays an important role in both health and disease. Recent studies have demonstrated a strong influence of the gut microbiome composition on the efficacy of cancer immunotherapy. However, available studies have not yet succeeded in finding reliable and consistent metagenomic markers that are associated with the response to immunotherapy. Therefore, the reanalysis of the published data may improve our understanding of the association between the composition of the gut microbiome and the treatment response. In this study, we focused on melanoma-related metagenomic data, which are more abundant than are data from other tumor types. We analyzed the metagenomes of 680 stool samples from 7 studies that were published earlier. The taxonomic and functional biomarkers were selected after comparing the metagenomes of patients showing different treatment responses. The list of selected biomarkers was also validated on additional metagenomic data sets that were dedicated to the influence of fecal microbiota transplantation on the response to melanoma immunotherapy. According to our analysis, the resulting cross-study taxonomic biomarkers included three bacterial species: Faecalibacterium prausnitzii, Bifidobacterium adolescentis, and Eubacterium rectale. 101 groups of genes were identified to be functional biomarkers, including those potentially involved in the production of immune-stimulating molecules and metabolites. Moreover, we ranked the microbial species by the number of genes encoding functionally relevant biomarkers that they contained. Thus, we put together a list of potentially the most beneficial bacteria for immunotherapy success. F. prausnitzii, E. rectale, and three species of bifidobacteria stood out as the most beneficial species, even though some useful functions were also present in other bacterial species. IMPORTANCE In this study, we put together a list of potentially the most beneficial bacteria that were associated with a responsiveness to melanoma immunotherapy. Another important result of this study is the list of functional biomarkers of responsiveness to immunotherapy, which are dispersed among different bacterial species. This result possibly explains the existing irregularities between studies regarding the bacterial species that are beneficial to melanoma immunotherapy. Overall, these findings can be utilized to issue recommendations for gut microbiome correction in cancer immunotherapy, and the resulting list of biomarkers might serve as a good stepping stone for the development of a diagnostic test that is aimed at predicting patients' responses to melanoma immunotherapy.
Assuntos
Melanoma , Microbiota , Humanos , Metagenoma , Melanoma/genética , Microbiota/genética , Bactérias/genética , Biomarcadores , Imunoterapia/métodosRESUMO
A growing body of evidence links gut microbiota changes with inflammatory bowel disease (IBD), raising the potential benefit of exploiting metagenomics data for non-invasive IBD diagnostics. The sbv IMPROVER metagenomics diagnosis for inflammatory bowel disease challenge investigated computational metagenomics methods for discriminating IBD and nonIBD subjects. Participants in this challenge were given independent training and test metagenomics data from IBD and nonIBD subjects, which could be wither either raw read data (sub-challenge 1, SC1) or processed Taxonomy- and Function-based profiles (sub-challenge 2, SC2). A total of 81 anonymized submissions were received between September 2019 and March 2020. Most participants' predictions performed better than random predictions in classifying IBD versus nonIBD, Ulcerative Colitis (UC) versus nonIBD, and Crohn's Disease (CD) versus nonIBD. However, discrimination between UC and CD remains challenging, with the classification quality similar to the set of random predictions. We analyzed the class prediction accuracy, the metagenomics features by the teams, and computational methods used. These results will be openly shared with the scientific community to help advance IBD research and illustrate the application of a range of computational methodologies for effective metagenomic classification.
Assuntos
Colite Ulcerativa , Doença de Crohn , Microbioma Gastrointestinal , Doenças Inflamatórias Intestinais , Humanos , Doenças Inflamatórias Intestinais/diagnóstico , Doenças Inflamatórias Intestinais/genética , Colite Ulcerativa/diagnóstico , Doença de Crohn/diagnóstico , Doença de Crohn/genética , Metagenômica , Microbioma Gastrointestinal/genéticaRESUMO
BACKGROUND: Inference of complex demographic histories is a source of information about events that happened in the past of studied populations. Existing methods for demographic inference typically require input from the researcher in the form of a parameterized model. With an increased variety of methods and tools, each with its own interface, the model specification becomes tedious and error-prone. Moreover, optimization algorithms used to find model parameters sometimes turn out to be inefficient, for instance, by being not properly tuned or highly dependent on a user-provided initialization. The open-source software GADMA addresses these problems, providing automatic demographic inference. It proposes a common interface for several likelihood engines and provides global parameters optimization based on a genetic algorithm. RESULTS: Here, we introduce the new GADMA2 software and provide a detailed description of the added and expanded features. It has a renovated core code base, new likelihood engines, an updated optimization algorithm, and a flexible setup for automatic model construction. We provide a full overview of GADMA2 enhancements, compare the performance of supported likelihood engines on simulated data, and demonstrate an example of GADMA2 usage on 2 empirical datasets. CONCLUSIONS: We demonstrate the better performance of a genetic algorithm in GADMA2 by comparing it to the initial version and other existing optimization approaches. Our experiments on simulated data indicate that GADMA2's likelihood engines are able to provide accurate estimations of demographic parameters even for misspecified models. We improve model parameters for 2 empirical datasets of inbred species.
Assuntos
Algoritmos , Pesquisadores , Humanos , Demografia , Probabilidade , SoftwareRESUMO
BACKGROUND: Necrotizing enterocolitis (NEC) is a multifactorial disease that predominantly affects premature neonates. Intestinal dysbiosis plays a critical role in NEC pathogenesis in premature neonates. The main risk factor for NEC in term infants is mesenteric hypoperfusion associated with ductal-dependent congenital heart disease (CHD) that eventually leads to intestinal ischemia. The incidence of NEC in neonates with critical CHD is 6.8%-13%. However, the role of the intestinal microbiome in NEC pathogenesis in infants with ductal-dependent CHD remains unclear. CASE SUMMARY: A male term neonate with right atrial isomerism underwent modified Blalock-Taussig shunt placement on the 14th day of life and had persistent mesenteric hypoperfusion after surgery. The patient had episodes of NEC stage IIA on the 1st and 28th days after cardiac surgery. Fecal microbial composition was analyzed before and after cardiac surgery by sequencing region V4 of the 16S rRNA gene. Before surgery, species belonging to genera Veillonella and Clostridia and class Gammaproteobacteria were detected, Bifidobacteriaceae showed a low abundance. The first NEC episode was associated with postoperative hemodynamic instability, intestinal ischemia-reperfusion injury during cardiopulmonary bypass, and a high abundance of Clostridium paraputrificum (Clostridium sensu stricto I) (56.1%). Antibacterial therapy after the first NEC episode resulted in increased abundance of Gammaproteobacteria, decreased abundance of Firmicutes, and low alpha diversity. These changes in the microbial composition promoted the growth of Clostridium sensu stricto I (72.0%) before the second NEC episode. CONCLUSION: A high abundance of Clostridium sensu stricto I and mesenteric hypoperfusion may have contributed to NEC in the present case.
RESUMO
Fecal microbiota transplantation (FMT) is currently used in medicine to treat recurrent clostridial colitis and other intestinal diseases. However, neither the therapeutic mechanism of FMT nor the mechanism that allows the donor bacteria to colonize the intestine of the recipient has yet been clearly described. From a biological point of view, FMT can be considered a useful model for studying the ecology of host-associated microbial communities. FMT experiments can shed light on the relationship features between the host and its gut microbiota. This creates the need for experimentation with approaches to metagenomic data analysis which may be useful for the interpretation of observed biological phenomena. Here, the recipient intestine colonization analysis tool (RECAST) novel computational approach is presented, which is based on the metagenomic read sorting process per their origin in the recipient's post-FMT stool metagenome. Using the RECAST algorithm, taxonomic/functional annotation, and machine learning approaches, the metagenomes from three FMT studies, including healthy volunteers, patients with clostridial colitis, and patients with metabolic syndrome, were analyzed. Using our computational pipeline, the donor-derived and recipient-derived microbes which formed the recipient post-FMT stool metagenomes (successful microbes) were identified. Their presence is well explained by a higher relative abundance in donor/pre-FMT recipient metagenomes or other metagenomes from the human population. In addition, successful microbes are enriched with gene groups potentially related to antibiotic resistance, including antimicrobial peptides. Interestingly, the observed reorganization features are universal and independent of the disease. IMPORTANCE We assumed that the enrichment of successful gut microbes by lantibiotic/antibiotic resistance genes can be related to gut microbiota colonization resistance by third-party microbe phenomena and resistance to bacterium-derived or host-derived antimicrobial substances. According to this assumption, competition between the donor-derived and recipient-derived microbes as well as host immunity may play a key role in the FMT-related colonization and redistribution of recipient gut microbiota structure.
RESUMO
Gut microbiome in critically ill patients shows profound dysbiosis. The most vulnerable is the subgroup of chronically critically ill (CCI) patients - those suffering from long-term dependence on support systems in intensive care units. It is important to investigate their microbiome as a potential reservoir of opportunistic taxa causing co-infections and a morbidity factor. We explored dynamics of microbiome composition in the CCI patients by combining "shotgun" metagenomics with chromosome conformation capture (Hi-C). Stool samples were collected at 2 time points from 2 patients with severe brain injury with different outcomes within a 1-2-week interval. The metagenome-assembled genomes (MAGs) were reconstructed based on the Hi-C data using a novel hicSPAdes method (along with the bin3c method for comparison), as well as independently of the Hi-C using MetaBAT2. The resistomes of the samples were derived using a novel assembly graph-based approach. Links of bacteria to antibiotic resistance genes, plasmids and viruses were analyzed using Hi-C-based networks. The gut community structure was enriched in opportunistic microorganisms. The binning using hicSPAdes was superior to the conventional WGS-based binning as well as to the bin3c in terms of the number, completeness and contamination of the reconstructed MAGs. Using Klebsiella pneumoniae as an example, we showed how chromosome conformation capture can aid comparative genomic analysis of clinically important pathogens. Diverse associations of resistome with antimicrobial therapy from the level of assembly graphs to gene content were discovered. Analysis of Hi-C networks suggested multiple "host-plasmid" and "host-phage" links. Hi-C metagenomics is a promising technique for investigating clinical microbiome samples. It provides a community composition profile with increased details on bacterial gene content and mobile genetic elements compared to conventional metagenomics. The ability of Hi-C binning to encompass the MAG's plasmid content facilitates metagenomic evaluation of virulence and drug resistance dynamics in clinically relevant opportunistic pathogens. These findings will help to identify the targets for developing cost-effective and rapid tests for assessing microbiome-related health risks.
RESUMO
BACKGROUND: The demographic history of any population is imprinted in the genomes of the individuals that make up the population. One of the most popular and convenient representations of genetic information is the allele frequency spectrum (AFS), the distribution of allele frequencies in populations. The joint AFS is commonly used to reconstruct the demographic history of multiple populations, and several methods based on diffusion approximation (e.g., ∂a∂i) and ordinary differential equations (e.g., moments) have been developed and applied for demographic inference. These methods provide an opportunity to simulate AFS under a variety of researcher-specified demographic models and to estimate the best model and associated parameters using likelihood-based local optimizations. However, there are no known algorithms to perform global searches of demographic models with a given AFS. RESULTS: Here, we introduce a new method that implements a global search using a genetic algorithm for the automatic and unsupervised inference of demographic history from joint AFS data. Our method is implemented in the software GADMA (Genetic Algorithm for Demographic Model Analysis, https://github.com/ctlab/GADMA). CONCLUSIONS: We demonstrate the performance of GADMA by applying it to sequence data from humans and non-model organisms and show that it is able to automatically infer a demographic model close to or even better than the one that was previously obtained manually. Moreover, GADMA is able to infer multiple demographic models at different local optima close to the global one, providing a larger set of possible scenarios to further explore demographic history.