Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 30
Filter
1.
BMC Bioinformatics ; 22(1): 509, 2021 Oct 19.
Article in English | MEDLINE | ID: mdl-34666677

ABSTRACT

BACKGROUND: Sequencing partial 16S rRNA genes is a cost effective method for quantifying the microbial composition of an environment, such as the human gut. However, downstream analysis relies on binning reads into microbial groups by either considering each unique sequence as a different microbe, querying a database to get taxonomic labels from sequences, or clustering similar sequences together. However, these approaches do not fully capture evolutionary relationships between microbes, limiting the ability to identify differentially abundant groups of microbes between a diseased and control cohort. We present sequence-based biomarkers (SBBs), an aggregation method that groups and aggregates microbes using single variants and combinations of variants within their 16S sequences. We compare SBBs against other existing aggregation methods (OTU clustering and Microphenoor DiTaxa features) in several benchmarking tasks: biomarker discovery via permutation test, biomarker discovery via linear discriminant analysis, and phenotype prediction power. We demonstrate the SBBs perform on-par or better than the state-of-the-art methods in biomarker discovery and phenotype prediction. RESULTS: On two independent datasets, SBBs identify differentially abundant groups of microbes with similar or higher statistical significance than existing methods in both a permutation-test-based analysis and using linear discriminant analysis effect size. . By grouping microbes by SBB, we can identify several differentially abundant microbial groups (FDR <.1) between children with autism and neurotypical controls in a set of 115 discordant siblings. Porphyromonadaceae, Ruminococcaceae, and an unnamed species of Blastocystis were significantly enriched in autism, while Veillonellaceae was significantly depleted. Likewise, aggregating microbes by SBB on a dataset of obese and lean twins, we find several significantly differentially abundant microbial groups (FDR<.1). We observed Megasphaera andSutterellaceae highly enriched in obesity, and Phocaeicola significantly depleted. SBBs also perform on bar with or better than existing aggregation methods as features in a phenotype prediction model, predicting the autism phenotype with an ROC-AUC score of .64 and the obesity phenotype with an ROC-AUC score of .84. CONCLUSIONS: SBBs provide a powerful method for aggregating microbes to perform differential abundance analysis as well as phenotype prediction. Our source code can be freely downloaded from http://github.com/briannachrisman/16s_biomarkers .


Subject(s)
Gastrointestinal Microbiome , Biomarkers , Cluster Analysis , Gastrointestinal Microbiome/genetics , Humans , RNA, Ribosomal, 16S/genetics , Software
2.
PLoS Comput Biol ; 16(5): e1007859, 2020 05.
Article in English | MEDLINE | ID: mdl-32365061

ABSTRACT

Microbiomes are complex ecological systems that play crucial roles in understanding natural phenomena from human disease to climate change. Especially in human gut microbiome studies, where collecting clinical samples can be arduous, the number of taxa considered in any one study often exceeds the number of samples ten to one hundred-fold. This discrepancy decreases the power of studies to identify meaningful differences between samples, increases the likelihood of false positive results, and subsequently limits reproducibility. Despite the vast collections of microbiome data already available, biome-specific patterns of microbial structure are not currently leveraged to inform studies. Here, we derive microbiome-level properties by applying an embedding algorithm to quantify taxon co-occurrence patterns in over 18,000 samples from the American Gut Project (AGP) microbiome crowdsourcing effort. We then compare the predictive power of models trained using properties, normalized taxonomic count data, and another commonly used dimensionality reduction method, Principal Component Analysis in categorizing samples from individuals with inflammatory bowel disease (IBD) and healthy controls. We show that predictive models trained using property data are the most accurate, robust, and generalizable, and that property-based models can be trained on one dataset and deployed on another with positive results. Furthermore, we find that properties correlate significantly with known metabolic pathways. Using these properties, we are able to extract known and new bacterial metabolic pathways associated with inflammatory bowel disease across two completely independent studies. By providing a set of pre-trained embeddings, we allow any V4 16S amplicon study to apply the publicly informed properties to increase the statistical power, reproducibility, and generalizability of analysis.


Subject(s)
Gastrointestinal Microbiome , Inflammatory Bowel Diseases/microbiology , Terminology as Topic , Algorithms , Bacteria/classification , Bacteria/genetics , Humans , Metabolic Networks and Pathways , Models, Biological , Phylogeny , Reproducibility of Results
3.
PLoS Comput Biol ; 16(11): e1008423, 2020 Nov.
Article in English | MEDLINE | ID: mdl-33137111

ABSTRACT

[This corrects the article DOI: 10.1371/journal.pcbi.1007859.].

4.
Nature ; 521(7551): 208-12, 2015 May 14.
Article in English | MEDLINE | ID: mdl-25739499

ABSTRACT

Over 20% of Earth's terrestrial surface is underlain by permafrost with vast stores of carbon that, once thawed, may represent the largest future transfer of carbon from the biosphere to the atmosphere. This process is largely dependent on microbial responses, but we know little about microbial activity in intact, let alone in thawing, permafrost. Molecular approaches have recently revealed the identities and functional gene composition of microorganisms in some permafrost soils and a rapid shift in functional gene composition during short-term thaw experiments. However, the fate of permafrost carbon depends on climatic, hydrological and microbial responses to thaw at decadal scales. Here we use the combination of several molecular 'omics' approaches to determine the phylogenetic composition of the microbial communities, including several draft genomes of novel species, their functional potential and activity in soils representing different states of thaw: intact permafrost, seasonally thawed active layer and thermokarst bog. The multi-omics strategy reveals a good correlation of process rates to omics data for dominant processes, such as methanogenesis in the bog, as well as novel survival strategies for potentially active microbes in permafrost.


Subject(s)
Genome, Bacterial/genetics , Metagenome/genetics , Microbiota/physiology , Permafrost/microbiology , Soil Microbiology , Wetlands , Alaska , Atmosphere/chemistry , Carbon Cycle , Climate , Denitrification , Freezing , Iron/metabolism , Methane/metabolism , Microbiota/genetics , Nitrates/metabolism , Nitrogen/metabolism , Oxidation-Reduction , Phylogeny , Seasons , Sulfur/metabolism , Time Factors
5.
BMC Genomics ; 18(1): 315, 2017 04 20.
Article in English | MEDLINE | ID: mdl-28427329

ABSTRACT

BACKGROUND: Numerous studies have highlighted the elevated degree of comorbidity associated with autism spectrum disorder (ASD). These comorbid conditions may add further impairments to individuals with autism and are substantially more prevalent compared to neurotypical populations. These high rates of comorbidity are not surprising taking into account the overlap of symptoms that ASD shares with other pathologies. From a research perspective, this suggests common molecular mechanisms involved in these conditions. Therefore, identifying crucial genes in the overlap between ASD and these comorbid disorders may help unravel the common biological processes involved and, ultimately, shed some light in the understanding of autism etiology. RESULTS: In this work, we used a two-fold systems biology approach specially focused on biological processes and gene networks to conduct a comparative analysis of autism with 31 frequently comorbid disorders in order to define a multi-disorder subcomponent of ASD and predict new genes of potential relevance to ASD etiology. We validated our predictions by determining the significance of our candidate genes in high throughput transcriptome expression profiling studies. Using prior knowledge of disease-related biological processes and the interaction networks of the disorders related to autism, we identified a set of 19 genes not previously linked to ASD that were significantly differentially regulated in individuals with autism. In addition, these genes were of potential etiologic relevance to autism, given their enriched roles in neurological processes crucial for optimal brain development and function, learning and memory, cognition and social behavior. CONCLUSIONS: Taken together, our approach represents a novel perspective of autism from the point of view of related comorbid disorders and proposes a model by which prior knowledge of interaction networks may enlighten and focus the genome-wide search for autism candidate genes to better define the genetic heterogeneity of ASD.


Subject(s)
Autism Spectrum Disorder/epidemiology , Autism Spectrum Disorder/genetics , Comorbidity , Systems Biology , Autism Spectrum Disorder/etiology , Gene Expression Profiling , Humans
6.
Nature ; 480(7377): 368-71, 2011 Nov 06.
Article in English | MEDLINE | ID: mdl-22056985

ABSTRACT

Permafrost contains an estimated 1672 Pg carbon (C), an amount roughly equivalent to the total currently contained within land plants and the atmosphere. This reservoir of C is vulnerable to decomposition as rising global temperatures cause the permafrost to thaw. During thaw, trapped organic matter may become more accessible for microbial degradation and result in greenhouse gas emissions. Despite recent advances in the use of molecular tools to study permafrost microbial communities, their response to thaw remains unclear. Here we use deep metagenomic sequencing to determine the impact of thaw on microbial phylogenetic and functional genes, and relate these data to measurements of methane emissions. Metagenomics, the direct sequencing of DNA from the environment, allows the examination of whole biochemical pathways and associated processes, as opposed to individual pieces of the metabolic puzzle. Our metagenome analyses reveal that during transition from a frozen to a thawed state there are rapid shifts in many microbial, phylogenetic and functional gene abundances and pathways. After one week of incubation at 5 °C, permafrost metagenomes converge to be more similar to each other than while they are frozen. We find that multiple genes involved in cycling of C and nitrogen shift rapidly during thaw. We also construct the first draft genome from a complex soil metagenome, which corresponds to a novel methanogen. Methane previously accumulated in permafrost is released during thaw and subsequently consumed by methanotrophic bacteria. Together these data point towards the importance of rapid cycling of methane and nitrogen in thawing permafrost.


Subject(s)
Bacteria/genetics , Bacteria/metabolism , Freezing , Metagenome/genetics , Metagenomics , Soil Microbiology , Temperature , Alaska , Arctic Regions , Bacteria/isolation & purification , Carbon/metabolism , Carbon Cycle/genetics , DNA/analysis , DNA/genetics , Genes, rRNA/genetics , Methane/metabolism , Nitrogen/metabolism , Nitrogen Cycle/genetics , Oxidation-Reduction , Phylogeny , RNA, Ribosomal, 16S/genetics , Soil/chemistry , Time Factors
7.
Nucleic Acids Res ; 42(19): e145, 2014 Oct 29.
Article in English | MEDLINE | ID: mdl-25260589

ABSTRACT

A new functional gene database, FOAM (Functional Ontology Assignments for Metagenomes), was developed to screen environmental metagenomic sequence datasets. FOAM provides a new functional ontology dedicated to classify gene functions relevant to environmental microorganisms based on Hidden Markov Models (HMMs). Sets of aligned protein sequences (i.e. 'profiles') were tailored to a large group of target KEGG Orthologs (KOs) from which HMMs were trained. The alignments were checked and curated to make them specific to the targeted KO. Within this process, sequence profiles were enriched with the most abundant sequences available to maximize the yield of accurate classifier models. An associated functional ontology was built to describe the functional groups and hierarchy. FOAM allows the user to select the target search space before HMM-based comparison steps and to easily organize the results into different functional categories and subcategories. FOAM is publicly available at http://portal.nersc.gov/project/m1317/FOAM/.


Subject(s)
Biological Ontologies , Databases, Nucleic Acid , Metagenomics , Soil Microbiology , Markov Chains , Metagenome , Sequence Alignment , Sequence Analysis, Protein
8.
Environ Microbiol ; 17(12): 4835-50, 2015 Dec.
Article in English | MEDLINE | ID: mdl-24517489

ABSTRACT

This study focused on the microbial ecology of tetrachloroethene (PCE) degradation to trichloroethene, cis-1,2-dichloroethene and vinyl chloride to evaluate the relationship between the microbial community and the potential accumulation or degradation of these toxic metabolites. Multiple soil microcosms supplied with different organic substrates were artificially contaminated with PCE. A thymidine analogue, bromodeoxyuridine (BrdU), was added to the microcosms and incorporated into the DNA of actively replicating cells. We compared the total and active bacterial communities during the 50-day incubations by using phylogenic microarrays and 454 pyrosequencing to identify microorganisms and functional genes associated with PCE degradation to ethene. By use of this integrative approach, both the key community members and the ecological functions concomitant with complete PCE degradation could be determined, including the presence and activity of microbial community members responsible for producing hydrogen and acetate, which are critical for Dehalococcoides-mediated PCE degradation. In addition, by correlation of chemical data and phylogenic microarray data, we identified several bacteria that could potentially oxidize hydrogen. These results demonstrate that PCE degradation is dependent on some microbial community members for production of appropriate metabolites, while other members of the community compete for hydrogen in soil at low redox potentials.


Subject(s)
Biodegradation, Environmental , Chloroflexi/metabolism , Solvents/metabolism , Tetrachloroethylene/metabolism , Water Pollutants, Chemical/metabolism , Bromodeoxyuridine/metabolism , Chloroflexi/genetics , DNA, Bacterial/genetics , Dichloroethylenes/metabolism , Ethylenes/biosynthesis , Halogenation , Microbiota/physiology , Phylogeny , RNA, Ribosomal, 16S/genetics , Trichloroethylene/metabolism , Vinyl Chloride/metabolism
9.
Trends Microbiol ; 32(2): 151-161, 2024 02.
Article in English | MEDLINE | ID: mdl-37813734

ABSTRACT

Research into the microbiota-gut-brain axis (MGBA) has entered a golden age, raising the hope that therapeutics acting on it may offer breakthroughs in the treatment of many illnesses. However, most of this work overlooks a fundamental, yet understudied, biological variable: sex. Sex differences exist at every level of the MGBA. Sex steroids shape the structure of the gut microbiota, and these microbes in turn regulate levels of bioactive sex steroids. These hormones and microbes act on gut sensory enteroendocrine cells, which modulate downstream activity in the enteric nervous system, vagus nerve, and brain. We examine recent advances in this field, and discuss the scientific and moral imperative to include females in biomedical research, using autism spectrum disorder (ASD) as an example.


Subject(s)
Autism Spectrum Disorder , Brain , Cell Communication , Gastrointestinal Microbiome , Female , Humans , Male , Autism Spectrum Disorder/physiopathology , Brain/physiology , Gastrointestinal Microbiome/physiology , Steroids , Cell Communication/physiology , Sex Factors
10.
Proteomics ; 13(18-19): 2776-85, 2013 Oct.
Article in English | MEDLINE | ID: mdl-23776032

ABSTRACT

Characterization of microbial protein expression provides information necessary to better understand the unique biological pathways that occur within soil microbial communities that contribute to atmospheric CO2 levels and the earth's changing climate. A significant challenge in studying the soil microbial community proteome is the initial dissociation of bacterial proteins from the complex mixture of particles found in natural soil. The differential extraction of intact bacterial cells limits the characterization of the complete representation of a microbial community. However, in situ lysis of bacterial cells in soil can lead to potentially high levels of protein adsorption to soil particles. Here, we investigated various amino acids for their ability to block soil protein adsorption sites prior to in situ lysis of bacterial cells, as well as their compatibility with both tryptic digestion and mass spectrometric analysis. The treatments were tested by adding proteins from lysed Escherichia coli cells to representative treated and untreated soil samples. The results show that it is possible to significantly increase protein identifications through blockage of binding sites on a variety of soil and sediment textures; use of an optimized desorption buffer further increases the number of identifications.


Subject(s)
Amino Acids/pharmacology , Bacterial Proteins/isolation & purification , Geologic Sediments/chemistry , Proteomics/methods , Soil Microbiology , Buffers , Chromatography, Liquid , Escherichia coli/drug effects , Escherichia coli/metabolism , Mass Spectrometry , Peptides/metabolism , Reproducibility of Results
11.
PLoS One ; 18(12): e0289314, 2023.
Article in English | MEDLINE | ID: mdl-38091316

ABSTRACT

Over 2 million people in North America suffer from inflammatory bowel disease (IBD), a chronic and idiopathic inflammatory condition. While previous research has primarily focused on studying immune cells as a cause and therapeutic target for IBD, recent findings suggest that non-immune cells may also play a crucial role in mediating cytokine and chemokine signaling, and therefore IBD symptoms. In this study, we developed an organ-on-a-chip co-culture model of Caco2 epithelial and HUVEC endothelial cells and induced inflammation using pro-inflammatory cytokines TNF-α and IFN-γ. We tested different concentration ranges and delivery orientations (apical vs. basal) to develop a consistently inducible inflammatory response model. We then measured pro-inflammatory cytokines and chemokines IL-6, IL-8, and CXCL-10, as well as epithelial barrier integrity. Our results indicate that this model 1. induces IBD-like cytokine secretion in non-immune cells and 2. decreases barrier integrity, making it a feasible and reliable model to test the direct actions of potential anti-inflammatory therapeutics on epithelial and endothelial cells.


Subject(s)
Cytokines , Inflammatory Bowel Diseases , Humans , Caco-2 Cells , Endothelial Cells , Microphysiological Systems , Intestinal Mucosa
12.
Sci Rep ; 13(1): 11353, 2023 07 13.
Article in English | MEDLINE | ID: mdl-37443184

ABSTRACT

While healthy gut microbiomes are critical to human health, pertinent microbial processes remain largely undefined, partially due to differential bias among profiling techniques. By simultaneously integrating multiple profiling methods, multi-omic analysis can define generalizable microbial processes, and is especially useful in understanding complex conditions such as Autism. Challenges with integrating heterogeneous data produced by multiple profiling methods can be overcome using Latent Dirichlet Allocation (LDA), a promising natural language processing technique that identifies topics in heterogeneous documents. In this study, we apply LDA to multi-omic microbial data (16S rRNA amplicon, shotgun metagenomic, shotgun metatranscriptomic, and untargeted metabolomic profiling) from the stool of 81 children with and without Autism. We identify topics, or microbial processes, that summarize complex phenomena occurring within gut microbial communities. We then subset stool samples by topic distribution, and identify metabolites, specifically neurotransmitter precursors and fatty acid derivatives, that differ significantly between children with and without Autism. We identify clusters of topics, deemed "cross-omic topics", which we hypothesize are representative of generalizable microbial processes observable regardless of profiling method. Interpreting topics, we find each represents a particular diet, and we heuristically label each cross-omic topic as: healthy/general function, age-associated function, transcriptional regulation, and opportunistic pathogenesis.


Subject(s)
Autistic Disorder , Gastrointestinal Microbiome , Microbiota , Child , Humans , Gastrointestinal Microbiome/genetics , Multiomics , RNA, Ribosomal, 16S/genetics , Microbiota/genetics
13.
Nat Neurosci ; 26(7): 1208-1217, 2023 07.
Article in English | MEDLINE | ID: mdl-37365313

ABSTRACT

Autism spectrum disorder (ASD) is a neurodevelopmental disorder characterized by heterogeneous cognitive, behavioral and communication impairments. Disruption of the gut-brain axis (GBA) has been implicated in ASD although with limited reproducibility across studies. In this study, we developed a Bayesian differential ranking algorithm to identify ASD-associated molecular and taxa profiles across 10 cross-sectional microbiome datasets and 15 other datasets, including dietary patterns, metabolomics, cytokine profiles and human brain gene expression profiles. We found a functional architecture along the GBA that correlates with heterogeneity of ASD phenotypes, and it is characterized by ASD-associated amino acid, carbohydrate and lipid profiles predominantly encoded by microbial species in the genera Prevotella, Bifidobacterium, Desulfovibrio and Bacteroides and correlates with brain gene expression changes, restrictive dietary patterns and pro-inflammatory cytokine profiles. The functional architecture revealed in age-matched and sex-matched cohorts is not present in sibling-matched cohorts. We also show a strong association between temporal changes in microbiome composition and ASD phenotypes. In summary, we propose a framework to leverage multi-omic datasets from well-defined cohorts and investigate how the GBA influences ASD.


Subject(s)
Autism Spectrum Disorder , Gastrointestinal Microbiome , Humans , Gastrointestinal Microbiome/genetics , Brain-Gut Axis , Autism Spectrum Disorder/genetics , Autism Spectrum Disorder/metabolism , Cross-Sectional Studies , Bayes Theorem , Reproducibility of Results , Cytokines
14.
Appl Environ Microbiol ; 78(15): 5305-12, 2012 Aug.
Article in English | MEDLINE | ID: mdl-22635998

ABSTRACT

Mineralization potentials, rates, and kinetics of the three phenoxy acid (PA) herbicides, 2,4-dichlorophenoxyacetic acid (2,4-D), 4-chloro-2-methylphenoxyacetic acid (MCPA), and 2-(4-chloro-2-methylphenoxy)propanoic acid (MCPP), were investigated and compared in 15 soils collected from five continents. The mineralization patterns were fitted by zero/linear or exponential growth forms of the three-half-order models and by logarithmic (log), first-order, or zero-order kinetic models. Prior and subsequent to the mineralization event, tfdA genes were quantified using real-time PCR to estimate the genetic potential for degrading PA in the soils. In 25 of the 45 mineralization scenarios, ∼60% mineralization was observed within 118 days. Elevated concentrations of tfdA in the range 1 × 10(5) to 5 × 10(7) gene copies g(-1) of soil were observed in soils where mineralization could be described by using growth-linked kinetic models. A clear trend was observed that the mineralization rates of the three PAs occurred in the order 2,4-D > MCPA > MCPP, and a correlation was observed between rapid mineralization and soils exposed to PA previously. Finally, for 2,4-D mineralization, all seven mineralization patterns which were best fitted by the exponential model yielded a higher tfdA gene potential after mineralization had occurred than the three mineralization patterns best fitted by the Lin model.


Subject(s)
2,4-Dichlorophenoxyacetic Acid/metabolism , 2-Methyl-4-chlorophenoxyacetic Acid/analogs & derivatives , 2-Methyl-4-chlorophenoxyacetic Acid/metabolism , Dioxygenases/genetics , Genes, Bacterial/genetics , Herbicides/metabolism , Models, Biological , Soil Microbiology , DNA Primers/genetics , Kinetics , Minerals/metabolism , Real-Time Polymerase Chain Reaction
15.
Front Bioinform ; 2: 828703, 2022.
Article in English | MEDLINE | ID: mdl-36304322

ABSTRACT

Large-scale microbiome studies investigating disease-inducing microbial roles base their findings on differences between microbial count data in contrasting environments (e.g., stool samples between cases and controls). These microbiome survey studies are often impeded by small sample sizes and database bias. Combining data from multiple survey studies often results in obvious batch effects, even when DNA preparation and sequencing methods are identical. Relatedly, predictive models trained on one microbial DNA dataset often do not generalize to outside datasets. In this study, we address these limitations by applying word embedding algorithms (GloVe) and PCA transformation to ASV data from the American Gut Project and generating translation matrices that can be applied to any 16S rRNA V4 region gut microbiome sequencing study. Because these approaches contextualize microbial occurrences in a larger dataset while reducing dimensionality of the feature space, they can improve generalization of predictive models that predict host phenotype from stool associated gut microbiota. The GMEmbeddings R package contains GloVe and PCA embedding transformation matrices at 50, 100 and 250 dimensions, each learned using ∼15,000 samples from the American Gut Project. It currently supports the alignment, matching, and matrix multiplication to allow users to transform their V4 16S rRNA data into these embedding spaces. We show how to correlate the properties in the new embedding space to KEGG functional pathways for biological interpretation of results. Lastly, we provide benchmarking on six gut microbiome datasets describing three phenotypes to demonstrate the ability of embedding-based microbiome classifiers to generalize to independent datasets. Future iterations of GMEmbeddings will include embedding transformation matrices for other biological systems. Available at: https://github.com/MaudeDavidLab/GMEmbeddings.

16.
PLoS One ; 17(9): e0273865, 2022.
Article in English | MEDLINE | ID: mdl-36084055

ABSTRACT

In vivo rodent behavioral and physiological studies often benefit from measurement of general activity. However, many existing instruments necessary to track such activity are high in cost and invasive within home cages, some even requiring extensive separate cage systems, limiting their widespread use to collect data. We present here a low-cost open-source alternative that measures voluntary wheel running activity and allows for modulation and customization, along with a reproducible and easy to set-up code pipeline for setup and analysis in Arduino IDE and R. Our robust, non-invasive scalable voluntary running activity tracker utilizes readily accessible magnets, Hall effect sensors, and an Arduino microcontroller. Importantly, it can interface with existing rodent home cages and wheel equipment, thus eliminating the need to transfer the mice to an unfamiliar environment. The system was validated both for accuracy by a rotating motor used to simulate mouse behavior, and in vivo. Our recorded data is consistent with results found in the literature showing that the mice run between 3 to 16 kilometers per night, and accurately captures speed and distance traveled continuously on the wheel. Such data are critical for analysis of highly variable behavior in mouse models and allow for characterization of behavioral metrics such as general activity. This system provides a flexible, low-cost methodology, and minimizes the cost, infrastructure, and personnel required for tracking voluntary wheel activity.


Subject(s)
Motor Activity , Rodentia , Animals , Disease Models, Animal , Mice , Motor Activity/physiology
17.
mSystems ; 7(1): e0105821, 2022 02 22.
Article in English | MEDLINE | ID: mdl-35040699

ABSTRACT

A growing body of research has established that the microbiome can mediate the dynamics and functional capacities of diverse biological systems. Yet, we understand little about what governs the response of these microbial communities to host or environmental changes. Most efforts to model microbiomes focus on defining the relationships between the microbiome, host, and environmental features within a specified study system and therefore fail to capture those that may be evident across multiple systems. In parallel with these developments in microbiome research, computer scientists have developed a variety of machine learning tools that can identify subtle, but informative, patterns from complex data. Here, we recommend using deep transfer learning to resolve microbiome patterns that transcend study systems. By leveraging diverse public data sets in an unsupervised way, such models can learn contextual relationships between features and build on those patterns to perform subsequent tasks (e.g., classification) within specific biological contexts.


Subject(s)
Microbiota , Microbiota/physiology , Machine Learning
18.
Sci Rep ; 12(1): 17034, 2022 10 11.
Article in English | MEDLINE | ID: mdl-36220843

ABSTRACT

Observational studies have shown that the composition of the human gut microbiome in children diagnosed with Autism Spectrum Disorder (ASD) differs significantly from that of their neurotypical (NT) counterparts. Thus far, reported ASD-specific microbiome signatures have been inconsistent. To uncover reproducible signatures, we compiled 10 publicly available raw amplicon and metagenomic sequencing datasets alongside new data generated from an internal cohort (the largest ASD cohort to date), unified them with standardized pre-processing methods, and conducted a comprehensive meta-analysis of all taxa and variables detected across multiple studies. By screening metadata to test associations between the microbiome and 52 variables in multiple patient subsets and across multiple datasets, we determined that differentially abundant taxa in ASD versus NT children were dependent upon age, sex, and bowel function, thus marking these variables as potential confounders in case-control ASD studies. Several taxa, including the strains Bacteroides stercoris t__190463 and Clostridium M bolteae t__180407, and the species Granulicatella elegans and Massilioclostridium coli, exhibited differential abundance in ASD compared to NT children only after subjects with bowel dysfunction were removed. Adjusting for age, sex and bowel function resulted in adding or removing significantly differentially abundant taxa in ASD-diagnosed individuals, emphasizing the importance of collecting and controlling for these metadata. We have performed the largest (n = 690) and most comprehensive systematic analysis of ASD gut microbiome data to date. Our study demonstrated the importance of accounting for confounding variables when designing statistical comparative analyses of ASD- and NT-associated gut bacterial profiles. Mitigating these confounders identified robust microbial signatures across cohorts, signifying the importance of accounting for these factors in comparative analyses of ASD and NT-associated gut profiles. Such studies will advance the understanding of different patient groups to deliver appropriate therapeutics by identifying microbiome traits germane to the specific ASD phenotype.


Subject(s)
Autism Spectrum Disorder , Gastrointestinal Microbiome , Microbiota , Autism Spectrum Disorder/genetics , Bacteria/genetics , Child , Gastrointestinal Microbiome/genetics , Humans , Metagenome
19.
mSystems ; 6(2)2021 Apr 06.
Article in English | MEDLINE | ID: mdl-33824195

ABSTRACT

Autism spectrum disorder (ASD) is a remarkably complex disorder influenced by both genetic and environmental factors. Numerous microbial diversity surveys conducted over the past decade have attempted to link specific ASD biomarkers to gastrointestinal tract disturbances, but results generated across cohorts and studies remain inconsistent. This commentary discusses multidirectional interactions between the host, the microbiome, and external factors germane to autism. Recent studies posit the heritability of the gut microbiome itself, confounding attempts to discern heritable from nonheritable effectors in neurodevelopmental disorders. Elucidating the ever-evolving gut microbiome's role in modulating the ASD phenotype will most certainly require new experimental methodologies and designs. In a recent paper published in mSystems (J. Fouquier, N. Moreno Huizar, J. Donnelly, C. Glickman, et al., mSystems e00848-20, 2021, https://doi.org/10.1128/mSystems.00848-20), the authors describe a web of interactions by collecting samples longitudinally, analyzing cross-sectional cohorts, and recording nonbinary phenotypic measurements.

20.
mSystems ; 6(2)2021 Apr 06.
Article in English | MEDLINE | ID: mdl-33824194

ABSTRACT

The existence of a link between the gut microbiome and autism spectrum disorder (ASD) is well established in mice, but in human populations, efforts to identify microbial biomarkers have been limited due to a lack of appropriately matched controls, stratification of participants within the autism spectrum, and sample size. To overcome these limitations, we crowdsourced the recruitment of families with age-matched sibling pairs between 2 and 7 years old (within 2 years of each other), where one child had a diagnosis of ASD and the other did not. Parents collected stool samples, provided a home video of their ASD child's natural social behavior, and responded online to diet and behavioral questionnaires. 16S rRNA V4 amplicon sequencing of 117 samples (60 ASD and 57 controls) identified 21 amplicon sequence variants (ASVs) that differed significantly between the two cohorts: 11 were found to be enriched in neurotypical children (six ASVs belonging to the Lachnospiraceae family), while 10 were enriched in children with ASD (including Ruminococcaceae and Bacteroidaceae families). Summarizing the expected KEGG orthologs of each predicted genome, the taxonomic biomarkers associated with children with ASD can use amino acids as precursors for butyragenic pathways, potentially altering the availability of neurotransmitters like glutamate and gamma aminobutyric acid (GABA).IMPORTANCE Autism spectrum disorder (ASD), which now affects 1 in 54 children in the United States, is known to have comorbidity with gut disorders of a variety of types; however, the link to the microbiome remains poorly characterized. Recent work has provided compelling evidence to link the gut microbiome to the autism phenotype in mouse models, but identification of specific taxa associated with autism has suffered replicability issues in humans. This has been due in part to sample size that sufficiently covers the spectrum of phenotypes known to autism (which range from subtle to severe) and a lack of appropriately matched controls. Our original study proposes to overcome these limitations by collecting stool-associated microbiome on 60 sibling pairs of children, one with autism and one neurotypically developing, both 2 to 7 years old and no more than 2 years apart in age. We use exact sequence variant analysis and both permutation and differential abundance procedures to identify 21 taxa with significant enrichment or depletion in the autism cohort compared to their matched sibling controls. Several of these 21 biomarkers have been identified in previous smaller studies; however, some are new to autism and known to be important in gut-brain interactions and/or are associated with specific fatty acid biosynthesis pathways.

SELECTION OF CITATIONS
SEARCH DETAIL