Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 21
Filter
1.
Cell ; 183(5): 1185-1201.e20, 2020 11 25.
Article in English | MEDLINE | ID: mdl-33242417

ABSTRACT

Spaceflight is known to impose changes on human physiology with unknown molecular etiologies. To reveal these causes, we used a multi-omics, systems biology analytical approach using biomedical profiles from fifty-nine astronauts and data from NASA's GeneLab derived from hundreds of samples flown in space to determine transcriptomic, proteomic, metabolomic, and epigenetic responses to spaceflight. Overall pathway analyses on the multi-omics datasets showed significant enrichment for mitochondrial processes, as well as innate immunity, chronic inflammation, cell cycle, circadian rhythm, and olfactory functions. Importantly, NASA's Twin Study provided a platform to confirm several of our principal findings. Evidence of altered mitochondrial function and DNA damage was also found in the urine and blood metabolic data compiled from the astronaut cohort and NASA Twin Study data, indicating mitochondrial stress as a consistent phenotype of spaceflight.


Subject(s)
Genomics , Mitochondria/pathology , Space Flight , Stress, Physiological , Animals , Circadian Rhythm , Extracellular Matrix/metabolism , Humans , Immunity, Innate , Lipid Metabolism , Metabolic Flux Analysis , Mice, Inbred BALB C , Mice, Inbred C57BL , Muscles/immunology , Organ Specificity , Smell/physiology
2.
Genome Res ; 31(4): 635-644, 2021 04.
Article in English | MEDLINE | ID: mdl-33602693

ABSTRACT

The COVID-19 pandemic has sparked an urgent need to uncover the underlying biology of this devastating disease. Though RNA viruses mutate more rapidly than DNA viruses, there are a relatively small number of single nucleotide polymorphisms (SNPs) that differentiate the main SARS-CoV-2 lineages that have spread throughout the world. In this study, we investigated 129 RNA-seq data sets and 6928 consensus genomes to contrast the intra-host and inter-host diversity of SARS-CoV-2. Our analyses yielded three major observations. First, the mutational profile of SARS-CoV-2 highlights intra-host single nucleotide variant (iSNV) and SNP similarity, albeit with differences in C > U changes. Second, iSNV and SNP patterns in SARS-CoV-2 are more similar to MERS-CoV than SARS-CoV-1. Third, a significant fraction of insertions and deletions contribute to the genetic diversity of SARS-CoV-2. Altogether, our findings provide insight into SARS-CoV-2 genomic diversity, inform the design of detection tests, and highlight the potential of iSNVs for tracking the transmission of SARS-CoV-2.


Subject(s)
COVID-19/diagnosis , COVID-19/transmission , Genetic Variation , Genome, Viral , Real-Time Polymerase Chain Reaction/methods , SARS-CoV-2/genetics , COVID-19/virology , Host-Pathogen Interactions , Humans , Polymorphism, Single Nucleotide
3.
Clin Infect Dis ; 75(1): e241-e248, 2022 08 24.
Article in English | MEDLINE | ID: mdl-34519774

ABSTRACT

BACKGROUND: Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) epidemiology implicates airborne transmission; aerosol infectiousness and impacts of masks and variants on aerosol shedding are not well understood. METHODS: We recruited coronavirus disease 2019 (COVID-19) cases to give blood, saliva, mid-turbinate and fomite (phone) swabs, and 30-minute breath samples while vocalizing into a Gesundheit-II, with and without masks at up to 2 visits 2 days apart. We quantified and sequenced viral RNA, cultured virus, and assayed serum samples for anti-spike and anti-receptor binding domain antibodies. RESULTS: We enrolled 49 seronegative cases (mean days post onset 3.8 ±â€…2.1), May 2020 through April 2021. We detected SARS-CoV-2 RNA in 36% of fine (≤5 µm), 26% of coarse (>5 µm) aerosols, and 52% of fomite samples overall and in all samples from 4 alpha variant cases. Masks reduced viral RNA by 48% (95% confidence interval [CI], 3 to 72%) in fine and by 77% (95% CI, 51 to 89%) in coarse aerosols; cloth and surgical masks were not significantly different. The alpha variant was associated with a 43-fold (95% CI, 6.6- to 280-fold) increase in fine aerosol viral RNA, compared with earlier viruses, that remained a significant 18-fold (95% CI, 3.4- to 92-fold) increase adjusting for viral RNA in saliva, swabs, and other potential confounders. Two fine aerosol samples, collected while participants wore masks, were culture-positive. CONCLUSIONS: SARS-CoV-2 is evolving toward more efficient aerosol generation and loose-fitting masks provide significant but only modest source control. Therefore, until vaccination rates are very high, continued layered controls and tight-fitting masks and respirators will be necessary.


Subject(s)
COVID-19 , SARS-CoV-2 , COVID-19/prevention & control , Humans , Masks , RNA, Viral , Respiratory Aerosols and Droplets
5.
Pac Symp Biocomput ; 29: 506-520, 2024.
Article in English | MEDLINE | ID: mdl-38160303

ABSTRACT

The microbes present in the human gastrointestinal tract are regularly linked to human health and disease outcomes. Thanks to technological and methodological advances in recent years, metagenomic sequencing data, and computational methods designed to analyze metagenomic data, have contributed to improved understanding of the link between the human gut microbiome and disease. However, while numerous methods have been recently developed to extract quantitative and qualitative results from host-associated microbiome data, improved computational tools are still needed to track microbiome dynamics with short-read sequencing data. Previously we have proposed KOMB as a de novo tool for identifying copy number variations in metagenomes for characterizing microbial genome dynamics in response to perturbations. In this work, we present KombOver (KO), which includes four key contributions with respect to our previous work: (i) it scales to large microbiome study cohorts, (ii) it includes both k-core and K-truss based analysis, (iii) we provide the foundation of a theoretical understanding of the relation between various graph-based metagenome representations, and (iv) we provide an improved user experience with easier-to-run code and more descriptive outputs/results. To highlight the aforementioned benefits, we applied KO to nearly 1000 human microbiome samples, requiring less than 10 minutes and 10 GB RAM per sample to process these data. Furthermore, we highlight how graph-based approaches such as k-core and K-truss can be informative for pinpointing microbial community dynamics within a myalgic encephalomyelitis/chronic fatigue syndrome (ME/CFS) cohort. KO is open source and available for download/use at: https://github.com/treangenlab/komb.


Subject(s)
Gastrointestinal Microbiome , Microbiota , Humans , DNA Copy Number Variations , Computational Biology , Microbiota/genetics , Metagenome , Metagenomics/methods
6.
Nat Commun ; 15(1): 4545, 2024 May 28.
Article in English | MEDLINE | ID: mdl-38806450

ABSTRACT

Wastewater surveillance for SARS-CoV-2 provides early warnings of emerging variants of concerns and can be used to screen for novel cryptic linked-read mutations, which are co-occurring single nucleotide mutations that are rare, or entirely missing, in existing SARS-CoV-2 databases. While previous approaches have focused on specific regions of the SARS-CoV-2 genome, there is a need for computational tools capable of efficiently tracking cryptic mutations across the entire genome and investigating their potential origin. We present Crykey, a tool for rapidly identifying rare linked-read mutations across the genome of SARS-CoV-2. We evaluated the utility of Crykey on over 3,000 wastewater and over 22,000 clinical samples; our findings are three-fold: i) we identify hundreds of cryptic mutations that cover the entire SARS-CoV-2 genome, ii) we track the presence of these cryptic mutations across multiple wastewater treatment plants and over three years of sampling in Houston, and iii) we find a handful of cryptic mutations in wastewater mirror cryptic mutations in clinical samples and investigate their potential to represent real cryptic lineages. In summary, Crykey enables large-scale detection of cryptic mutations in wastewater that represent potential circulating cryptic lineages, serving as a new computational tool for wastewater surveillance of SARS-CoV-2.


Subject(s)
COVID-19 , Genome, Viral , Mutation , SARS-CoV-2 , Wastewater , SARS-CoV-2/genetics , SARS-CoV-2/isolation & purification , Wastewater/virology , COVID-19/virology , COVID-19/epidemiology , COVID-19/diagnosis , Humans , Genome, Viral/genetics , Computational Biology/methods
7.
bioRxiv ; 2024 Jun 03.
Article in English | MEDLINE | ID: mdl-38895276

ABSTRACT

Taxonomic profiling is a ubiquitous task in the analysis of clinical and environmental microbiomes. The advent of long-read sequencing of microbiomes necessitates the development of new taxonomic profilers tailored to long-read shotgun metagenomic datasets. Here, we introduce Lemur and Magnet, a pair of tools optimized for lightweight and accurate taxonomic profiling from long-read shotgun metagenomic datasets. Lemur is a marker-gene based method that leverages an EM algorithm to reduce false positive calls while preserving true positives; Magnet makes detailed presence/absence calls for bacterial genomes based on whole-genome read mapping. The tools work in sequence: Lemur estimates abundances conservatively, and Magnet operates on the genomes of identified organisms to filter out likely false positive taxa. The result is an increase in precision of as much as 70%, which far exceeds competing methods. By operating only on marker genes, Lemur is a comparatively lightweight software. We demonstrate that it can run in minutes to hours on a laptop with 32 GB of RAM, even for large inputs - a crucial feature given the portability of long-read sequencing machines. Furthermore, the marker gene database used by Lemur is only 4 GB and contains information from over 300,000 RefSeq genomes. The reference is available at https://zenodo.org/records/10802546, and the software is open-source and available at https://github.com/treangenlab/lemur.

8.
Nat Commun ; 14(1): 2834, 2023 05 17.
Article in English | MEDLINE | ID: mdl-37198181

ABSTRACT

As clinical testing declines, wastewater monitoring can provide crucial surveillance on the emergence of SARS-CoV-2 variant of concerns (VoCs) in communities. In this paper we present QuaID, a novel bioinformatics tool for VoC detection based on quasi-unique mutations. The benefits of QuaID are three-fold: (i) provides up to 3-week earlier VoC detection, (ii) accurate VoC detection (>95% precision on simulated benchmarks), and (iii) leverages all mutational signatures (including insertions & deletions).


Subject(s)
COVID-19 , Humans , COVID-19/diagnosis , SARS-CoV-2/genetics , Wastewater , Benchmarking
9.
medRxiv ; 2023 Nov 12.
Article in English | MEDLINE | ID: mdl-37986916

ABSTRACT

We present Crykey, a computational tool for rapidly identifying cryptic mutations of SARS-CoV-2. Specifically, we identify co-occurring single nucleotide mutations on the same sequencing read, called linked-read mutations, that are rare or entirely missing in existing databases, and have the potential to represent novel cryptic lineages found in wastewater. While previous approaches exist for identifying cryptic linked-read mutations from specific regions of the SARS-CoV-2 genome, there is a need for computational tools capable of efficiently tracking cryptic mutations across the entire genome and for tens of thousands of samples and with increased scrutiny, given their potential to represent either artifacts or hidden SARS-CoV-2 lineages. Crykey fills this gap by identifying rare linked-read mutations that pass stringent computational filters to limit the potential for artifacts. We evaluate the utility of Crykey on >3,000 wastewater and >22,000 clinical samples; our findings are three-fold: i) we identify hundreds of cryptic mutations that cover the entire SARS-CoV-2 genome, ii) we track the presence of these cryptic mutations across multiple wastewater treatment plants and over a three years of sampling in Houston, and iii) we find a handful of cryptic mutations in wastewater mirror cryptic mutations in clinical samples and investigate their potential to represent real cryptic lineages. In summary, Crykey enables large-scale detection of cryptic mutations representing potential cryptic lineages in wastewater.

10.
bioRxiv ; 2023 Sep 30.
Article in English | MEDLINE | ID: mdl-36824759

ABSTRACT

Tiled amplicon sequencing has served as an essential tool for tracking the spread and evolution of pathogens. Over 2 million complete SARS-CoV-2 genomes are now publicly available, most sequenced and assembled via tiled amplicon sequencing. While computational tools for tiled amplicon design exist, they require downstream manual optimization both computationally and experimentally, which is slow and costly. Here we present Olivar, a first step towards a fully automated, variant-aware design of tiled amplicons for pathogen genomes. Olivar converts each nucleotide of the target genome into a numeric risk score, capturing undesired sequence features that should be avoided. In a direct comparison with PrimalScheme, we show that Olivar has fewer SNPs overlapping with primers and predicted PCR byproducts. We also compared Olivar head-to-head with ARTIC v4.1, the most widely used primer set for SARS-CoV-2 sequencing, and show Olivar yields similar read mapping rates (~90%) and better coverage to the manually designed ARTIC v4.1 amplicons. We also evaluated Olivar on real wastewater samples and found that Olivar had up to 3-fold higher mapping rates while retaining similar coverage. In summary, Olivar automates and accelerates the generation of tiled amplicons, even in situations of high mutation frequency and/or density. Olivar is available as a web application at https://olivar.rice.edu. Olivar can also be installed locally as a command line tool with Bioconda. Source code, installation guide and usage are available at https://github.com/treangenlab/Olivar.

11.
medRxiv ; 2022 Aug 04.
Article in English | MEDLINE | ID: mdl-35898338

ABSTRACT

As clinical testing declines, wastewater monitoring can provide crucial surveillance on the emergence of SARS-CoV-2 variants of concern (VoC) in communities. Multiple recent studies support that wastewater-based SARS-CoV-2 detection of circulating VoC can precede clinical cases by up to two weeks. Furthermore, wastewater based epidemiology enables wide population-based screening and study of viral evolutionary dynamics. However, highly sensitive detection of emerging variants remains a complex task due to the pooled nature of environmental samples and genetic material degradation. In this paper we propose quasi-unique mutations for VoC identification, implemented in a novel bioinformatics tool (QuaID) for VoC detection based on quasi-unique mutations. The benefits of QuaID are three-fold: (i) provides up to 3 week earlier VoC detection compared to existing approaches, (ii) enables more sensitive VoC detection, which is shown to be tolerant of >50% mutation drop-out, and (iii) leverages all mutational signatures, including insertions & deletions.

12.
Comput Struct Biotechnol J ; 20: 3208-3222, 2022.
Article in English | MEDLINE | ID: mdl-35832621

ABSTRACT

Characterizing metagenomes via kmer-based, database-dependent taxonomic classification has yielded key insights into underlying microbiome dynamics. However, novel approaches are needed to track community dynamics and genomic flux within metagenomes, particularly in response to perturbations. We describe KOMB, a novel method for tracking genome level dynamics within microbiomes. KOMB utilizes K-core decomposition to identify Structural variations (SVs), specifically, population-level Copy Number Variation (CNV) within microbiomes. K-core decomposition partitions the graph into shells containing nodes of induced degree at least K, yielding reduced computational complexity compared to prior approaches. Through validation on a synthetic community, we show that KOMB recovers and profiles repetitive genomic regions in the sample. KOMB is shown to identify functionally-important regions in Human Microbiome Project datasets, and was used to analyze longitudinal data and identify keystone taxa in Fecal Microbiota Transplantation (FMT) samples. In summary, KOMB represents a novel graph-based, taxonomy-oblivious, and reference-free approach for tracking CNV within microbiomes. KOMB is open source and available for download at https://gitlab.com/treangenlab/komb.

13.
Virus Evol ; 8(1): veac008, 2022.
Article in English | MEDLINE | ID: mdl-35242361

ABSTRACT

A detailed understanding of how and when severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) transmission occurs is crucial for designing effective prevention measures. Other than contact tracing, genome sequencing provides information to help infer who infected whom. However, the effectiveness of the genomic approach in this context depends on both (high enough) mutation and (low enough) transmission rates. Today, the level of resolution that we can obtain when describing SARS-CoV-2 outbreaks using just genomic information alone remains unclear. In order to answer this question, we sequenced forty-nine SARS-CoV-2 patient samples from ten local clusters in NW Spain for which partial epidemiological information was available and inferred transmission history using genomic variants. Importantly, we obtained high-quality genomic data, sequencing each sample twice and using unique barcodes to exclude cross-sample contamination. Phylogenetic and cluster analyses showed that consensus genomes were generally sufficient to discriminate among independent transmission clusters. However, levels of intrahost variation were low, which prevented in most cases the unambiguous identification of direct transmission events. After filtering out recurrent variants across clusters, the genomic data were generally compatible with the epidemiological information but did not support specific transmission events over possible alternatives. We estimated the effective transmission bottleneck size to be one to two viral particles for sample pairs whose donor-recipient relationship was likely. Our analyses suggest that intrahost genomic variation in SARS-CoV-2 might be generally limited and that homoplasy and recurrent errors complicate identifying shared intrahost variants. Reliable reconstruction of direct SARS-CoV-2 transmission based solely on genomic data seems hindered by a slow mutation rate, potential convergent events, and technical artifacts. Detailed contact tracing seems essential in most cases to study SARS-CoV-2 transmission at high resolution.

14.
Nat Commun ; 13(1): 1728, 2022 04 01.
Article in English | MEDLINE | ID: mdl-35365602

ABSTRACT

Deep Learning (DL) has recently enabled unprecedented advances in one of the grand challenges in computational biology: the half-century-old problem of protein structure prediction. In this paper we discuss recent advances, limitations, and future perspectives of DL on five broad areas: protein structure prediction, protein function prediction, genome engineering, systems biology and data integration, and phylogenetic inference. We discuss each application area and cover the main bottlenecks of DL approaches, such as training data, problem scope, and the ability to leverage existing DL architectures in new contexts. To conclude, we provide a summary of the subject-specific and general challenges for DL across the biosciences.


Subject(s)
Deep Learning , Computational Biology , Phylogeny , Proteins , Systems Biology
15.
Sci Total Environ ; 833: 155059, 2022 Aug 10.
Article in English | MEDLINE | ID: mdl-35395314

ABSTRACT

Over the course of the COVID-19 pandemic, variants of SARS-CoV-2 have emerged that are more contagious and more likely to cause breakthrough infections. Targeted amplicon sequencing approach is a gold standard for identification and analysis of variants. However, when applied to environmental samples such as wastewater, it remains unclear how sensitive this method is for detecting variant-associated mutations in environmental samples. Here we directly compare a targeted amplicon sequencing approach (using ARTIC v3; hereafter referred to as sequencing) with RT-ddPCR quantification for the detection of five mutations that are characteristic of variants of concern (VoCs) in wastewater samples. In total, 547 wastewater samples were analyzed using both methods in parallel. When we observed positive mutation detections by RT-ddPCR, 42.6% of the detection events were missed by sequencing, due to negative detection or the limited read coverage at the mutation position. Further, when sequencing reported negative or depth-limited mutation detections, 26.7% of those events were instead positive detections by RT-ddPCR, highlighting the relatively poor sensitivity of sequencing. No or weak associations were observed between quantitative measurements of target mutations determined by RT-ddPCR and sequencing. These findings caution the use of quantitative measurements of SARS-CoV-2 variants in wastewater samples determined solely based on sequencing.


Subject(s)
COVID-19 , SARS-CoV-2 , Humans , Mutation , Pandemics , SARS-CoV-2/genetics , Wastewater
16.
F1000Res ; 11: 530, 2022.
Article in English | MEDLINE | ID: mdl-36262335

ABSTRACT

In October 2021, 59 scientists from 14 countries and 13 U.S. states collaborated virtually in the Third Annual Baylor College of Medicine & DNANexus Structural Variation hackathon. The goal of the hackathon was to advance research on structural variants (SVs) by prototyping and iterating on open-source software. This led to nine hackathon projects focused on diverse genomics research interests, including various SV discovery and genotyping methods, SV sequence reconstruction, and clinically relevant structural variation, including SARS-CoV-2 variants. Repositories for the projects that participated in the hackathon are available at https://github.com/collaborativebioinformatics.


Subject(s)
COVID-19 , SARS-CoV-2 , Humans , SARS-CoV-2/genetics , Genomics , Software
17.
ArXiv ; 2021 May 07.
Article in English | MEDLINE | ID: mdl-33972927

ABSTRACT

With recent advances in sequencing technology it has become affordable and practical to sequence genomes to very high depth-of-coverage, allowing researchers to discover low-frequency variants in the genome. However, due to the errors in sequencing it is an active area of research to develop algorithms that can separate noise from the true variants. LoFreq is a state of the art algorithm for low-frequency variant detection but has a relatively long runtime compared to other tools. In addition to this, the interface for running in parallel could be simplified, allowing for multithreading as well as distributing jobs to a cluster. In this work we describe some specific contributions to LoFreq that remedy these issues.

18.
F1000Res ; 10: 246, 2021.
Article in English | MEDLINE | ID: mdl-34621504

ABSTRACT

In October 2020, 62 scientists from nine nations worked together remotely in the Second Baylor College of Medicine & DNAnexus hackathon, focusing on different related topics on Structural Variation, Pan-genomes, and SARS-CoV-2 related research.   The overarching focus was to assess the current status of the field and identify the remaining challenges. Furthermore, how to combine the strengths of the different interests to drive research and method development forward. Over the four days, eight groups each designed and developed new open-source methods to improve the identification and analysis of variations among species, including humans and SARS-CoV-2. These included improvements in SV calling, genotyping, annotations and filtering. Together with advancements in benchmarking existing methods. Furthermore, groups focused on the diversity of SARS-CoV-2. Daily discussion summary and methods are available publicly at  https://github.com/collaborativebioinformatics provides valuable insights for both participants and the research community.


Subject(s)
COVID-19 , SARS-CoV-2 , Animals , Genome, Viral , Humans , Vertebrates
19.
Cell Rep ; 37(3): 109839, 2021 10 19.
Article in English | MEDLINE | ID: mdl-34624208

ABSTRACT

MicroRNAs (miRNAs) are small non-coding RNAs involved in post-transcriptional gene regulation that have a major impact on many diseases and provide an exciting avenue toward antiviral therapeutics. From patient transcriptomic data, we determined that a circulating miRNA, miR-2392, is directly involved with severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) machinery during host infection. Specifically, we show that miR-2392 is key in driving downstream suppression of mitochondrial gene expression, increasing inflammation, glycolysis, and hypoxia, as well as promoting many symptoms associated with coronavirus disease 2019 (COVID-19) infection. We demonstrate that miR-2392 is present in the blood and urine of patients positive for COVID-19 but is not present in patients negative for COVID-19. These findings indicate the potential for developing a minimally invasive COVID-19 detection method. Lastly, using in vitro human and in vivo hamster models, we design a miRNA-based antiviral therapeutic that targets miR-2392, significantly reduces SARS-CoV-2 viability in hamsters, and may potentially inhibit a COVID-19 disease state in humans.


Subject(s)
COVID-19/genetics , COVID-19/immunology , MicroRNAs/genetics , SARS-CoV-2/genetics , Adult , Aged , Aged, 80 and over , Animals , Antiviral Agents/pharmacology , Biomarkers/metabolism , Cricetinae , Female , Ferrets , Gene Expression Regulation , Glycolysis , Healthy Volunteers , Humans , Hypoxia , Inflammation , Male , Mice , Middle Aged , Proteomics/methods , ROC Curve , Rats , COVID-19 Drug Treatment
20.
bioRxiv ; 2021 Aug 18.
Article in English | MEDLINE | ID: mdl-33948587

ABSTRACT

MicroRNAs (miRNAs) are small non-coding RNAs involved in post-transcriptional gene regulation that have a major impact on many diseases and provides an exciting avenue towards antiviral therapeutics. From patient transcriptomic data, we have discovered a circulating miRNA, miR-2392, that is directly involved with SARS-CoV-2 machinery during host infection. Specifically, we show that miR-2392 is key in driving downstream suppression of mitochondrial gene expression, increasing inflammation, glycolysis, and hypoxia as well as promoting many symptoms associated with COVID-19 infection. We demonstrate miR-2392 is present in the blood and urine of COVID-19 positive patients, but not detected in COVID-19 negative patients. These findings indicate the potential for developing a novel, minimally invasive, COVID-19 detection method. Lastly, using in vitro human and in vivo hamster models, we have developed a novel miRNA-based antiviral therapeutic that targets miR-2392, significantly reduces SARS-CoV-2 viability in hamsters and may potentially inhibit a COVID-19 disease state in humans.

SELECTION OF CITATIONS
SEARCH DETAIL