Results 1 - 20 of 56
1.
Cell ; 184(19): 4939-4952.e15, 2021 09 16.
Article in English | MEDLINE | ID: mdl-34508652

ABSTRACT

The emergence of the COVID-19 epidemic in the United States (U.S.) went largely undetected due to inadequate testing. New Orleans experienced one of the earliest and fastest accelerating outbreaks, coinciding with Mardi Gras. To gain insight into the emergence of SARS-CoV-2 in the U.S. and how large-scale events accelerate transmission, we sequenced SARS-CoV-2 genomes during the first wave of the COVID-19 epidemic in Louisiana. We show that SARS-CoV-2 in Louisiana had limited diversity compared to other U.S. states and that one introduction of SARS-CoV-2 led to almost all of the early transmission in Louisiana. By analyzing mobility and genomic data, we show that SARS-CoV-2 was already present in New Orleans before Mardi Gras, and the festival dramatically accelerated transmission. Our study provides an understanding of how superspreading during large-scale events played a key role during the early outbreak in the U.S. and can greatly accelerate epidemics.


Subject(s)
COVID-19/epidemiology , Epidemics , SARS-CoV-2/physiology , COVID-19/transmission , Databases as Topic , Disease Outbreaks , Humans , Louisiana/epidemiology , Phylogeny , Risk Factors , SARS-CoV-2/classification , Texas , Travel , United States/epidemiology
2.
Nature ; 626(7998): 419-426, 2024 Feb.
Article in English | MEDLINE | ID: mdl-38052229

ABSTRACT

Determining the structure and phenotypic context of molecules detected in untargeted metabolomics experiments remains challenging. Here we present reverse metabolomics as a discovery strategy, whereby tandem mass spectrometry spectra acquired from newly synthesized compounds are searched for in public metabolomics datasets to uncover phenotypic associations. To demonstrate the concept, we broadly synthesized and explored multiple classes of metabolites in humans, including N-acyl amides, fatty acid esters of hydroxy fatty acids, bile acid esters and conjugated bile acids. Using repository-scale analysis [1,2], we discovered that some conjugated bile acids are associated with inflammatory bowel disease (IBD). Validation using four distinct human IBD cohorts showed that cholic acids conjugated to Glu, Ile/Leu, Phe, Thr, Trp or Tyr are increased in Crohn's disease. Several of these compounds and related structures affected pathways associated with IBD, such as interferon-γ production in CD4+ T cells [3] and agonism of the pregnane X receptor [4]. Culture of bacteria belonging to the Bifidobacterium, Clostridium and Enterococcus genera produced these bile amidates. Because searching repositories with tandem mass spectrometry spectra has only recently become possible, this reverse metabolomics approach can now be used as a general strategy to discover other molecules from human and animal ecosystems.


Subject(s)
Amides , Bile Acids and Salts , Esters , Fatty Acids , Metabolomics , Animals , Humans , Bifidobacterium/metabolism , Bile Acids and Salts/chemistry , Bile Acids and Salts/metabolism , CD4-Positive T-Lymphocytes/immunology , CD4-Positive T-Lymphocytes/metabolism , Clostridium/metabolism , Cohort Studies , Crohn Disease/metabolism , Enterococcus/metabolism , Esters/chemistry , Esters/metabolism , Fatty Acids/chemistry , Fatty Acids/metabolism , Inflammatory Bowel Diseases/metabolism , Metabolomics/methods , Phenotype , Pregnane X Receptor/metabolism , Reproducibility of Results , Tandem Mass Spectrometry , Amides/chemistry , Amides/metabolism
3.
Nature ; 609(7925): 101-108, 2022 09.
Article in English | MEDLINE | ID: mdl-35798029

ABSTRACT

As SARS-CoV-2 continues to spread and evolve, detecting emerging variants early is critical for public health interventions. Inferring lineage prevalence by clinical testing is infeasible at scale, especially in areas with limited resources, participation, or testing and/or sequencing capacity, which can also introduce biases [1-3]. SARS-CoV-2 RNA concentration in wastewater successfully tracks regional infection dynamics and provides less biased abundance estimates than clinical testing [4,5]. Tracking virus genomic sequences in wastewater would improve community prevalence estimates and detect emerging variants. However, two factors limit wastewater-based genomic surveillance: low-quality sequence data and inability to estimate relative lineage abundance in mixed samples. Here we resolve these critical issues to perform a high-resolution, 295-day wastewater and clinical sequencing effort, in the controlled environment of a large university campus and the broader context of the surrounding county. We developed and deployed improved virus concentration protocols and deconvolution software that fully resolve multiple virus strains from wastewater. We detected emerging variants of concern up to 14 days earlier in wastewater samples, and identified multiple instances of virus spread not captured by clinical genomic surveillance. Our study provides a scalable solution for wastewater genomic surveillance that allows early detection of SARS-CoV-2 variants and identification of cryptic transmission.


Subject(s)
COVID-19 , SARS-CoV-2 , Wastewater-Based Epidemiological Monitoring , Wastewater , COVID-19/epidemiology , COVID-19/transmission , COVID-19/virology , Humans , RNA, Viral/analysis , RNA, Viral/genetics , SARS-CoV-2/classification , SARS-CoV-2/genetics , SARS-CoV-2/isolation & purification , Sequence Analysis, RNA , Wastewater/virology
4.
Nature ; 579(7797): 123-129, 2020 03.
Article in English | MEDLINE | ID: mdl-32103176

ABSTRACT

A mosaic of cross-phylum chemical interactions occurs between all metazoans and their microbiomes. A number of molecular families that are known to be produced by the microbiome have a marked effect on the balance between health and disease [1-9]. Considering the diversity of the human microbiome (which numbers over 40,000 operational taxonomic units [10]), the effect of the microbiome on the chemistry of an entire animal remains underexplored. Here we use mass spectrometry informatics and data visualization approaches [11-13] to provide an assessment of the effects of the microbiome on the chemistry of an entire mammal by comparing metabolomics data from germ-free and specific-pathogen-free mice. We found that the microbiota affects the chemistry of all organs. This included the amino acid conjugations of host bile acids that were used to produce phenylalanocholic acid, tyrosocholic acid and leucocholic acid, which have not previously been characterized despite extensive research on bile-acid chemistry [14]. These bile-acid conjugates were also found in humans, and were enriched in patients with inflammatory bowel disease or cystic fibrosis. These compounds agonized the farnesoid X receptor in vitro, and mice gavaged with the compounds showed reduced expression of bile-acid synthesis genes in vivo. Further studies are required to confirm whether these compounds have a physiological role in the host, and whether they contribute to gut diseases that are associated with microbiome dysbiosis.


Subject(s)
Bile Acids and Salts/biosynthesis , Bile Acids and Salts/chemistry , Metabolomics , Microbiota/physiology , Animals , Bile Acids and Salts/metabolism , Cholic Acid/biosynthesis , Cholic Acid/chemistry , Cholic Acid/metabolism , Cystic Fibrosis/genetics , Cystic Fibrosis/metabolism , Cystic Fibrosis/microbiology , Germ-Free Life , Humans , Inflammatory Bowel Diseases/genetics , Inflammatory Bowel Diseases/metabolism , Inflammatory Bowel Diseases/microbiology , Mice , Receptors, Cytoplasmic and Nuclear/genetics , Receptors, Cytoplasmic and Nuclear/metabolism
5.
Genome Res ; 30(6): 898-909, 2020 06.
Article in English | MEDLINE | ID: mdl-32540955

ABSTRACT

Long-range sequencing information is required for haplotype phasing, de novo assembly, and structural variation detection. Current long-read sequencing technologies can provide valuable long-range information but at a high cost with low accuracy and high DNA input requirements. We have developed a single-tube Transposase Enzyme Linked Long-read Sequencing (TELL-seq) technology, which enables a low-cost, high-accuracy, and high-throughput short-read second-generation sequencer to generate over 100 kb of long-range sequencing information with as little as 0.1 ng input material. In a PCR tube, millions of clonally barcoded beads are used to uniquely barcode long DNA molecules in an open bulk reaction without dilution and compartmentation. The barcoded linked-reads are used to successfully assemble genomes ranging from microbes to human. These linked-reads also generate megabase-long phased blocks and provide a cost-effective tool for detecting structural variants in a genome, which are important to identify compound heterozygosity in recessive Mendelian diseases and discover genetic drivers and diagnostic biomarkers in cancers.


Subject(s)
Gene Library , High-Throughput Nucleotide Sequencing , Sequence Analysis, DNA , Computational Biology/methods , DNA Barcoding, Taxonomic/methods , Genetic Variation , Genome, Human , Genomics/methods , HLA Antigens/genetics , Haplotypes , High-Throughput Nucleotide Sequencing/methods , High-Throughput Nucleotide Sequencing/standards , Humans , Sequence Analysis, DNA/methods , Sequence Analysis, DNA/standards , Workflow
6.
Environ Sci Technol ; 57(10): 4071-4081, 2023 03 14.
Article in English | MEDLINE | ID: mdl-36862087

ABSTRACT

Roughly half of the human population lives near the coast, and coastal water pollution (CWP) is widespread. Coastal waters along Tijuana, Mexico, and Imperial Beach (IB), USA, are frequently polluted by millions of gallons of untreated sewage and stormwater runoff. Entering coastal waters causes over 100 million global annual illnesses, but CWP has the potential to reach many more people on land via transfer in sea spray aerosol (SSA). Using 16S rRNA gene amplicon sequencing, we found sewage-associated bacteria in the polluted Tijuana River flowing into coastal waters and returning to land in marine aerosol. Tentative chemical identification from non-targeted tandem mass spectrometry identified anthropogenic compounds as chemical indicators of aerosolized CWP, but they were ubiquitous and present at highest concentrations in continental aerosol. Bacteria were better tracers of airborne CWP, and 40 tracer bacteria comprised up to 76% of the bacteria community in IB air. These findings confirm that CWP transfers in SSA and exposes many people along the coast. Climate change may exacerbate CWP with more extreme storms, and our findings call for minimizing CWP and investigating the health effects of airborne exposure.


Subject(s)
Aerosolized Particles and Droplets , Seawater , Humans , Seawater/microbiology , Rivers , Sewage/analysis , RNA, Ribosomal, 16S , Water Pollution , Bacteria , Aerosols/analysis , Environmental Monitoring/methods
7.
Anal Chem ; 93(38): 12833-12839, 2021 09 28.
Article in English | MEDLINE | ID: mdl-34533933

ABSTRACT

Molecular networking of non-targeted tandem mass spectrometry data connects structurally related molecules based on similar fragmentation spectra. Here, we report the Chemical Proportionality (ChemProp) contextualization of molecular networks. ChemProp scores the changes of abundance between two connected nodes over sequential data series (e.g., temporal or spatial relationships), which can be displayed as a direction within the network to prioritize potential biological and chemical transformations or proportional changes of (biosynthetically) related compounds. We tested the ChemProp workflow on a ground truth data set of a defined mixture and highlighted the utility of the tool to prioritize specific molecules within biological samples, including bacterial transformations of bile acids, human drug metabolism, and bacterial natural products biosynthesis. The ChemProp workflow is freely available through the Global Natural Products Social Molecular Networking (GNPS) environment.


Subject(s)
Biological Products , Tandem Mass Spectrometry , Humans , Workflow
8.
Mar Drugs ; 19(1), 2021 Jan 06.
Article in English | MEDLINE | ID: mdl-33418911

ABSTRACT

Microbial natural products are important for the understanding of microbial interactions, chemical defense and communication, and have also served as an inspirational source for numerous pharmaceutical drugs. Tropical marine cyanobacteria have been highlighted as a great source of new natural products; however, few reports have appeared wherein a multi-omics approach has been used to study their natural products potential (i.e., reports are often focused on an individual natural product and its biosynthesis). This study focuses on describing the natural product genetic potential as well as the expressed natural product molecules in benthic tropical cyanobacteria. We collected from several sites around the world and sequenced the genomes of 24 tropical filamentous marine cyanobacteria. The informatics program antiSMASH was used to annotate the major classes of gene clusters. BiG-SCAPE phylum-wide analysis revealed the most promising strains for natural product discovery among these cyanobacteria. LC-MS/MS-based metabolomics highlighted the most abundant molecules and molecular classes among 10 of these marine cyanobacterial samples. We observed that despite many genes encoding peptidic natural products, peptides were not as abundant as lipids and lipopeptides in the chemical extracts. Our results highlight a number of highly interesting biosynthetic gene clusters for genome mining among these cyanobacterial samples.


Subject(s)
Biological Products/pharmacology , Cyanobacteria/chemistry , Chromatography, High Pressure Liquid , Cyanobacteria/genetics , Genome, Bacterial , Genomics , Marine Biology , Mass Spectrometry , Metabolomics , Multigene Family , Phylogeny , Tropical Climate
10.
Handb Exp Pharmacol ; 260: 301-326, 2019.
Article in English | MEDLINE | ID: mdl-31820171

ABSTRACT

The human microbiota (the microscopic organisms that inhabit us) and microbiome (their genes) hold considerable potential for improving pharmacological practice. Recent advances in multi-"omics" techniques have dramatically improved our understanding of the constituents of the microbiome and their functions. The implications of this research for human health, including microbiome links to obesity, drug metabolism, neurological diseases, cancer, and many other health conditions, have sparked considerable interest in exploiting the microbiome for targeted therapeutics. Links between microbial pathways and disease states further highlight a rich potential for companion diagnostics and precision medicine approaches. For example, the success of fecal microbiota transplantation to treat Clostridium difficile infection has already started to redefine standard of care with a microbiome-directed therapy. In this review we briefly discuss the nature of human microbial ecosystems and the pathologies and biological processes linked to the microbiome. We then review emerging computational metagenomic, metabolomic, and wet lab techniques researchers are using today to learn about the roles host-microbial interactions have with respect to pharmacological purposes and vice versa. Finally, we describe how drugs affect the microbiome, how the microbiome can impact drug response in different people, and the potential of the microbiome itself as a source of new therapeutics.


Subject(s)
Microbiota , Precision Medicine , Humans , Neoplasms , Nervous System Diseases , Obesity , Pharmaceutical Preparations/metabolism
11.
Proteomics ; 15(20): 3497-507, 2015 Oct.
Article in English | MEDLINE | ID: mdl-26272225

ABSTRACT

Tooth decay is considered the most prevalent human disease worldwide. We present the first metaproteomic study of the oral biofilm, using different mass spectrometry approaches that have allowed us to quantify individual peptides in healthy and caries-bearing individuals. A total of 7771 bacterial and 853 human proteins were identified in 17 individuals, which provide the first available protein repertoire of human dental plaque. Actinomyces and Corynebacterium represent a large proportion of the protein activity followed by Rothia and Streptococcus. Those four genera account for 60-90% of total diversity. Healthy individuals appeared to have significantly higher amounts of L-lactate dehydrogenase and the arginine deiminase system, both implicated in pH buffering. Other proteins found to be at significantly higher levels in healthy individuals were involved in exopolysaccharide synthesis, iron metabolism and immune response. We applied multivariate analysis in order to find the minimum set of proteins that best allows discrimination of healthy and caries-affected dental plaque samples, detecting seven bacterial and five human protein functions that allow determining the health status of the studied individuals with an estimated specificity and sensitivity over 96%. We propose that future validation of these potential biomarkers in larger sample size studies may serve to develop diagnostic tests of caries risk that could be used in tooth decay prevention.


Subject(s)
Biomarkers , Dental Caries/genetics , Mouth/microbiology , Proteome/genetics , Biofilms/growth & development , Dental Caries/microbiology , Dental Plaque/genetics , Dental Plaque/microbiology , Humans , Hydrolases/genetics , Hydrolases/isolation & purification , L-Lactate Dehydrogenase/genetics , L-Lactate Dehydrogenase/isolation & purification , Streptococcus mutans/genetics
12.
BMC Genomics ; 15: 311, 2014 Apr 27.
Article in English | MEDLINE | ID: mdl-24767457

ABSTRACT

BACKGROUND: Micro-organisms inhabiting teeth surfaces grow on biofilms where a specific and complex succession of bacteria has been described by co-aggregation tests and DNA-based studies. Although the composition of oral biofilms is well established, the active portion of the bacterial community and the patterns of gene expression in vivo have not been studied. RESULTS: Using RNA-sequencing technologies, we present the first metatranscriptomic study of human dental plaque, performed by two different approaches: (1) A short-reads, high-coverage approach by Illumina sequencing to characterize the gene activity repertoire of the microbial community during biofilm development; (2) A long-reads, lower-coverage approach by pyrosequencing to determine the taxonomic identity of the active microbiome before and after a meal ingestion. The high-coverage approach allowed us to analyze over 398 million reads, revealing that microbial communities are individual-specific and no bacterial species was detected as key player at any time during biofilm formation. We could identify some gene expression patterns characteristic for early and mature oral biofilms. The transcriptomic profile of several adhesion genes was confirmed through qPCR by measuring expression of fimbriae-associated genes. In addition to the specific set of gene functions overexpressed in early and mature oral biofilms, as detected through the short-reads dataset, the long-reads approach detected specific changes when comparing the metatranscriptome of the same individual before and after a meal, which can narrow down the list of organisms responsible for acid production and therefore potentially involved in dental caries. CONCLUSIONS: The bacteria changing activity during biofilm formation and after meal ingestion were person-specific. 
Interestingly, some individuals showed extreme homeostasis with virtually no changes in the active bacterial population after food ingestion, suggesting the presence of a microbial community which could be associated to dental health.


Subject(s)
Biofilms , Gene Expression , Microbiota/genetics , Mouth/microbiology , Humans , Metagenome
13.
Int J Syst Evol Microbiol ; 64(Pt 1): 60-65, 2014 Jan.
Article in English | MEDLINE | ID: mdl-24006481

ABSTRACT

Genomic, taxonomic and biochemical studies were performed on two strains of α-haemolytic streptococci that showed them to be clustered with major members of the Streptococcus mitis group. These Gram-stain-positive strains were isolated from tooth surfaces of caries-free humans and showed the classical spherical shape of streptococcal species growing in chains. Sequence analysis from concatenated 16S and 23S rRNA gene and sodA genes showed that these strains belonged to the mitis group, but both of them clustered into a new phylogenetic branch. The genomes of these two isolates were sequenced, and whole-genome average nucleotide identity (ANI) demonstrated that these strains significantly differed from any streptococcal species, showing ANI values under 91% even when compared with the phylogenetically closest species such as Streptococcus oralis and S. mitis. Biochemically, the two isolates also showed distinct metabolic features relative to closely related species, like α-galactosidase activity. From the results of the present study, the name Streptococcus dentisani sp. nov. is proposed to accommodate these novel strains, which have been deposited in open collections at the Spanish Type Culture Collection (CECT) and Leibniz Institute DSMZ-German Collection of Microorganisms and Cell Cultures (DSMZ), being respectively identified as Streptococcus dentisani Str. 7746 (= CECT 8313 = DSM 27089) and Streptococcus dentisani Str. 7747(T) (= CECT 8312(T) = DSM 27088(T)).


Subject(s)
Dental Plaque/microbiology , Phylogeny , Streptococcus/classification , Bacterial Proteins/genetics , Bacterial Typing Techniques , Biofilms , DNA, Bacterial/genetics , Genome, Bacterial , Humans , Molecular Sequence Data , RNA, Ribosomal, 16S/genetics , RNA, Ribosomal, 23S/genetics , Sequence Analysis, DNA , Streptococcus/genetics , Streptococcus/isolation & purification , Streptococcus/metabolism , Superoxide Dismutase/genetics , Tooth/microbiology , alpha-Galactosidase/metabolism
14.
mSystems ; : e0051624, 2024 Jun 27.
Article in English | MEDLINE | ID: mdl-38934546

ABSTRACT

Bacteroides fragilis is a Gram-negative commensal bacterium commonly found in the human colon, which differentiates into two genomospecies termed divisions I and II. Through a comprehensive collection of 694 B. fragilis whole genome sequences, we identify novel features distinguishing these divisions. Our study reveals a distinct geographic distribution with division I strains predominantly found in North America and division II strains in Asia. Additionally, division II strains are more frequently associated with bloodstream infections, suggesting a distinct pathogenic potential. We report differences between the two divisions in gene abundance related to metabolism, virulence, stress response, and colonization strategies. Notably, division II strains harbor more antimicrobial resistance (AMR) genes than division I strains. These findings offer new insights into the functional roles of division I and II strains, indicating specialized niches within the intestine and potential pathogenic roles in extraintestinal sites. IMPORTANCE: Understanding the distinct functions of microbial species in the gut microbiome is crucial for deciphering their impact on human health. Classifying division II strains as Bacteroides fragilis can lead to erroneous associations, as researchers may mistakenly attribute characteristics observed in division II strains to the more extensively studied division I B. fragilis. Our findings underscore the necessity of recognizing these divisions as separate species with distinct functions. We unveil new findings of differential gene prevalence between division I and II strains in genes associated with intestinal colonization and survival strategies, potentially influencing their role as gut commensals and their pathogenicity in extraintestinal sites. 
Despite the significant niche overlap and colonization patterns between these groups, our study highlights the complex dynamics that govern strain distribution and behavior, emphasizing the need for a nuanced understanding of these microorganisms.

15.
bioRxiv ; 2024 Jun 19.
Article in English | MEDLINE | ID: mdl-38948766

ABSTRACT

Bacteroides fragilis is a prominent member of the human gut microbiota, playing crucial roles in maintaining gut homeostasis and host health. Although it primarily functions as a beneficial commensal, B. fragilis can become pathogenic. To determine the genetic basis of its duality, we conducted a comparative genomic analysis of 813 B. fragilis strains, representing both commensal and pathogenic origins. Our findings reveal that pathogenic strains emerge across diverse phylogenetic lineages, due in part to rapid gene exchange and the adaptability of the accessory genome. We identified 16 phylogenetic groups, differentiated by genes associated with capsule composition, interspecies competition, and host interactions. A microbial genome-wide association study identified 44 genes linked to extra-intestinal survival and pathogenicity. These findings reveal how genomic diversity within commensal species can lead to the emergence of pathogenic traits, broadening our understanding of microbial evolution in the gut.

16.
mSystems ; 8(4): e0000623, 2023 08 31.
Article in English | MEDLINE | ID: mdl-37350611

ABSTRACT

Next-generation sequencing technologies have enabled many advances across diverse areas of biology, with many benefiting from increased sample size. Although the cost of running next-generation sequencing instruments has dropped substantially over time, the cost of sample preparation methods has lagged behind. To counter this, researchers have adapted library miniaturization protocols and large sample pools to maximize the number of samples that can be prepared by a certain amount of reagents and sequenced in a single run. However, due to high variability of sample quality, over and underrepresentation of samples in a sequencing run has become a major issue in high-throughput sequencing. This leads to misinterpretation of results due to increased noise, and additional time and cost rerunning underrepresented samples. To overcome this problem, we present a normalization method that uses shallow iSeq sequencing to accurately inform pooling volumes based on read distribution. This method is superior to the widely used fluorometry methods, which cannot specifically target adapter-ligated molecules that contribute to sequencing output. Our normalization method not only quantifies adapter-ligated molecules but also allows normalization of feature space; for example, we can normalize to reads of interest such as non-ribosomal reads. As a result, this normalization method improves the efficiency of high-throughput next-generation sequencing by reducing noise and producing higher average reads per sample with more even sequencing depth. IMPORTANCE High-throughput next generation sequencing (NGS) has significantly contributed to the field of genomics; however, further improvements can maximize the potential of this important tool. Uneven sequencing of samples in a multiplexed run is a common issue that leads to unexpected extra costs or low-quality data. To mitigate this problem, we introduce a normalization method based on read counts rather than library concentration. 
This method allows for an even distribution of features of interest across samples, improving the statistical power of data sets and preventing the financial loss associated with resequencing libraries. This method optimizes NGS, which already has huge importance across many areas of biology.
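
The read-based normalization described in this abstract can be illustrated with a minimal sketch. The abstract does not give the authors' formula, so the rule below (pooling volume inversely proportional to the reads of interest observed in the shallow run, capped at a maximum pipetting volume) is an assumption made for illustration only. The function name `pooling_volumes`, the sample names, and the volume limits are all hypothetical.

```python
def pooling_volumes(shallow_reads, max_ul=10.0, min_ul=0.1):
    """Suggest per-sample pooling volumes from a shallow sequencing run.

    Assumption (not taken from the abstract): the sample with the fewest
    reads of interest is pooled at max_ul, and every other sample is scaled
    down in inverse proportion to its shallow-run read count.
    """
    positive = [reads for reads in shallow_reads.values() if reads > 0]
    if not positive:
        raise ValueError("no sample produced any reads in the shallow run")
    lowest = min(positive)

    volumes = {}
    for sample, reads in shallow_reads.items():
        if reads <= 0:
            # Nothing detected: pool the maximum volume (or flag for rework).
            volumes[sample] = max_ul
        else:
            # Inverse-proportional scaling, floored at the smallest volume
            # that can be pipetted reliably.
            volumes[sample] = max(min_ul, max_ul * lowest / reads)
    return volumes


# Hypothetical shallow-run counts of reads of interest (e.g. non-ribosomal).
shallow = {"A": 2000, "B": 500, "C": 8000}
vols = pooling_volumes(shallow)

# Expected contribution is proportional to reads x volume, so it should be
# the same for every sample after normalization.
expected = {s: shallow[s] * vols[s] for s in shallow}
print(vols)
print(expected)
```

Because the counts can be restricted to any feature of interest before the volumes are computed, the same sketch covers the abstract's point about normalizing feature space rather than total library concentration.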


Subject(s)
Genomics , Software , Genomics/methods , Sequence Analysis, DNA , Gene Library , High-Throughput Nucleotide Sequencing
17.
Infect Control Hosp Epidemiol ; 43(5): 657-660, 2022 05.
Article in English | MEDLINE | ID: mdl-33706827

ABSTRACT

Transmission of severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2) is possible among symptom-free individuals. Patients are avoiding medically necessary healthcare visits for fear of becoming infected in the healthcare setting. We screened 489 symptom-free healthcare workers for SARS-CoV-2 and found no positive results, strongly suggesting that the prevalence of SARS-CoV-2 was <1%.
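
The step from "0 positives among 489" to "prevalence <1%" can be checked with a short calculation. This is a sketch under stated assumptions: the screened workers are treated as a simple random sample and the test is treated as perfectly sensitive, neither of which is stated in the abstract. The function name is hypothetical.

```python
def upper_bound_zero_positives(n: int, confidence: float = 0.95) -> float:
    """One-sided upper confidence bound on prevalence when 0 of n test positive.

    With true prevalence p, the chance of seeing no positives in n independent
    samples is (1 - p)**n. The bound is the p at which that chance drops to
    1 - confidence.
    """
    alpha = 1.0 - confidence
    return 1.0 - alpha ** (1.0 / n)


n_screened = 489  # number of symptom-free workers reported in the abstract
exact = upper_bound_zero_positives(n_screened)
rule_of_three = 3.0 / n_screened  # common shortcut for the 95% bound

print(f"exact 95% upper bound: {exact:.4%}")
print(f"rule-of-three approximation: {rule_of_three:.4%}")
```

Both values come out at roughly 0.6%, which is consistent with the abstract's statement that prevalence was below 1%.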


Subject(s)
COVID-19 , SARS-CoV-2 , COVID-19/diagnosis , Delivery of Health Care , Health Personnel , Humans , Mass Screening
18.
mSystems ; 7(2): e0016722, 2022 04 26.
Article in English | MEDLINE | ID: mdl-35369727

ABSTRACT

We introduce the operational genomic unit (OGU) method, a metagenome analysis strategy that directly exploits sequence alignment hits to individual reference genomes as the minimum unit for assessing the diversity of microbial communities and their relevance to environmental factors. This approach is independent of taxonomic classification, granting the possibility of maximal resolution of community composition, and organizes features into an accurate hierarchy using a phylogenomic tree. The outputs are suitable for contemporary analytical protocols for community ecology, differential abundance, and supervised learning while supporting phylogenetic methods, such as UniFrac and phylofactorization, that are seldom applied to shotgun metagenomics despite being prevalent in 16S rRNA gene amplicon studies. As demonstrated in two real-world case studies, the OGU method produces biologically meaningful patterns from microbiome data sets. Such patterns further remain detectable at very low metagenomic sequencing depths. Compared with taxonomic unit-based analyses implemented in currently adopted metagenomics tools, and the analysis of 16S rRNA gene amplicon sequence variants, this method shows superiority in informing biologically relevant insights, including stronger correlation with body environment and host sex on the Human Microbiome Project data set and more accurate prediction of human age by the gut microbiomes of Finnish individuals included in the FINRISK 2002 cohort. We provide Woltka, a bioinformatics tool to implement this method, with full integration with the QIIME 2 package and the Qiita web platform, to facilitate adoption of the OGU method in future metagenomics studies. IMPORTANCE Shotgun metagenomics is a powerful, yet computationally challenging, technique compared to 16S rRNA gene amplicon sequencing for decoding the composition and structure of microbial communities. 
Current analyses of metagenomic data are primarily based on taxonomic classification, which is limited in feature resolution. To solve these challenges, we introduce operational genomic units (OGUs), which are the individual reference genomes derived from sequence alignment results, without further assigning them taxonomy. The OGU method advances current read-based metagenomics in two dimensions: (i) providing maximal resolution of community composition and (ii) permitting use of phylogeny-aware tools. Our analysis of real-world data sets shows that it is advantageous over currently adopted metagenomic analysis methods and the finest-grained 16S rRNA analysis methods in predicting biological traits. We thus propose the adoption of OGUs as an effective practice in metagenomic studies.
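
The core idea of the OGU method, counting alignment hits per reference genome without assigning taxonomy, can be sketched as follows. This is not Woltka's code or API. The input format, the genome identifiers, and the choice to split a read that hits k genomes into k equal shares are assumptions made for illustration.

```python
from collections import defaultdict


def ogu_table(alignments):
    """Build a sample-by-genome count table from read alignment hits.

    alignments: iterable of (sample_id, read_id, genome_id) tuples, one per
    alignment hit. Each genome is kept as its own feature (an OGU).
    Assumption for this sketch: a read hitting k distinct genomes contributes
    1/k to each of them.
    """
    # Collect the distinct genomes hit by each read within each sample.
    genomes_per_read = defaultdict(set)
    for sample, read, genome in alignments:
        genomes_per_read[(sample, read)].add(genome)

    # Distribute each read evenly over the genomes it aligned to.
    table = defaultdict(lambda: defaultdict(float))
    for (sample, _read), genomes in genomes_per_read.items():
        share = 1.0 / len(genomes)
        for genome in genomes:
            table[sample][genome] += share

    return {sample: dict(counts) for sample, counts in table.items()}


# Hypothetical hits: read r2 in sample s1 aligns to two reference genomes.
hits = [
    ("s1", "r1", "G1"),
    ("s1", "r2", "G1"),
    ("s1", "r2", "G2"),
    ("s2", "r1", "G2"),
]
table = ogu_table(hits)
print(table)
```

Because the features are genomes rather than taxa, the resulting table can be paired with a phylogenomic tree of the same genomes, which is what allows the phylogeny-aware analyses mentioned in the abstract.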


Subject(s)
Metagenome , Microbiota , Humans , Phylogeny , RNA, Ribosomal, 16S/genetics , Ecology
19.
mSystems ; 7(4): e0010922, 2022 08 30.
Article in English | MEDLINE | ID: mdl-35703436

ABSTRACT

A promising approach to help students safely return to in-person learning is through the application of sentinel cards for accurate high-resolution environmental monitoring of SARS-CoV-2 traces indoors. Because SARS-CoV-2 RNA can persist for up to a week on several indoor surface materials, there is a need for increased temporal resolution to determine whether consecutive surface positives arise from new infection events or continue to report past events. Cleaning sentinel cards after sampling would provide the needed resolution but might interfere with assay performance. We tested the effect of three cleaning solutions (BZK wipes, Wet Wipes, RNase Away) at three different viral loads: "high" (4 × 10^4 GE/mL), "medium" (1 × 10^4 GE/mL), and "low" (2.5 × 10^3 GE/mL). RNase Away, chosen as a positive control, was the most effective cleaning solution on all three viral loads. Wet Wipes were found to be more effective than BZK wipes in the medium viral load condition. The low viral load condition was easily reset with all three cleaning solutions. These findings will enable temporal SARS-CoV-2 monitoring in indoor environments where transmission risk of the virus is high and the need to avoid individual-level sampling for privacy or compliance reasons exists. IMPORTANCE Because SARS-CoV-2, the virus that causes COVID-19, persists on surfaces, testing swabs taken from surfaces is useful as a monitoring tool. This approach is especially valuable in school settings, where there are cost and privacy concerns that are eliminated by taking a single sample from a classroom. However, the virus persists for days to weeks on surface samples, so it is impossible to tell whether positive detection events on consecutive days are a persistent signal or new infectious cases and therefore whether the positive individuals have been successfully removed from the classroom. 
We compare several methods for cleaning "sentinel cards" to show that this approach can be used to identify new SARS-CoV-2 signals day to day. The results are important for determining how to monitor classrooms and other indoor environments for SARS-CoV-2 virus.


Subject(s)
COVID-19 , SARS-CoV-2 , Humans , RNA, Viral , Endoribonucleases , Ribonuclease, Pancreatic , Ribonucleases
20.
mSystems ; 7(4): e0010322, 2022 08 30.
Article in English | MEDLINE | ID: mdl-35703437

ABSTRACT

Surface sampling for SARS-CoV-2 RNA detection has shown considerable promise to detect exposure of built environments to infected individuals shedding virus who would not otherwise be detected. Here, we compare two popular sampling media (VTM and SDS) and two popular workflows (Thermo and PerkinElmer) for implementation of a surface sampling program suitable for environmental monitoring in public schools. We find that the SDS/Thermo pipeline shows superior sensitivity and specificity, but that the VTM/PerkinElmer pipeline is still sufficient to support surface surveillance in any indoor setting with stable cohorts of occupants (e.g., schools, prisons, group homes) and may be used to leverage existing investments in infrastructure. IMPORTANCE The ongoing COVID-19 pandemic has claimed the lives of over 5 million people worldwide. Due to high-density occupancy of indoor spaces for prolonged periods of time, schools are often of concern for transmission, leading to widespread school closings to combat pandemic spread when cases rise. Since pediatric clinical testing is expensive and difficult from a consent perspective, we have deployed surface sampling in SASEA (Safer at School Early Alert), which allows for detection of SARS-CoV-2 from surfaces within a classroom. In this previous work, we developed a high-throughput method which requires robotic automation and specific reagents that are often not available for public health laboratories such as the San Diego County Public Health Laboratory (SDPHL). Therefore, we benchmarked our method (Thermo pipeline) against SDPHL's (PerkinElmer) more widely used method for the detection and prediction of SARS-CoV-2 exposure. While our method shows superior sensitivity (false-negative rate of 9% versus 27% for SDPHL), the SDPHL pipeline is sufficient to support surface surveillance in indoor settings. These findings are important since they show that existing investments in infrastructure can be leveraged to slow the spread of SARS-CoV-2 not just in the classroom but also in prisons, nursing homes, and other high-risk, indoor settings.


Subject(s)
COVID-19 , SARS-CoV-2 , Humans , Child , COVID-19/diagnosis , Pandemics/prevention & control , RNA, Viral , Automation