Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 25
Filter
Add more filters










Publication year range
1.
Microbiol Spectr ; 12(4): e0315723, 2024 Apr 02.
Article in English | MEDLINE | ID: mdl-38385740

ABSTRACT

Chronic Pseudomonas aeruginosa lung infections are a feature of cystic fibrosis (CF) that many patients experience even with the advent of highly effective modulator therapies. Identifying factors that impact P. aeruginosa in the CF lung could yield novel strategies to eradicate infection or otherwise improve outcomes. To complement published P. aeruginosa studies using laboratory models or RNA isolated from sputum, we analyzed transcripts of strain PAO1 after incubation in sputum from different CF donors prior to RNA extraction. We compared PAO1 gene expression in this "spike-in" sputum model to that for P. aeruginosa grown in synthetic cystic fibrosis sputum medium to determine key genes, which are among the most differentially expressed or most highly expressed. Using the key genes, gene sets with correlated expression were determined using the gene expression analysis tool eADAGE. Gene sets were used to analyze the activity of specific pathways in P. aeruginosa grown in sputum from different individuals. Gene sets that we found to be more active in sputum showed similar activation in published data that included P. aeruginosa RNA isolated from sputum relative to corresponding in vitro reference cultures. In the ex vivo samples, P. aeruginosa had increased levels of genes related to zinc and iron acquisition which were suppressed by metal amendment of sputum. We also found a significant correlation between expression of the H1-type VI secretion system and CFTR corrector use by the sputum donor. An ex vivo sputum model or synthetic sputum medium formulation that imposes metal restriction may enhance future CF-related studies.IMPORTANCEIdentifying the gene expression programs used by Pseudomonas aeruginosa to colonize the lungs of people with cystic fibrosis (CF) will illuminate new therapeutic strategies. To capture these transcriptional programs, we cultured the common P. aeruginosa laboratory strain PAO1 in expectorated sputum from CF patient donors. Through bioinformatic analysis, we defined sets of genes that are more transcriptionally active in real CF sputum compared to a synthetic cystic fibrosis sputum medium. Many of the most differentially active gene sets contained genes related to metal acquisition, suggesting that these gene sets play an active role in scavenging for metals in the CF lung environment which may be inadequately represented in some models. Future studies of P. aeruginosa transcript abundance in CF may benefit from the use of an expectorated sputum model or media supplemented with factors that induce metal restriction.


Subject(s)
Cystic Fibrosis , Pseudomonas Infections , Humans , Pseudomonas aeruginosa/metabolism , Sputum , Gene Expression Profiling , Metals , Culture Media/metabolism , RNA/metabolism
2.
mBio ; : e0167623, 2023 Nov 10.
Article in English | MEDLINE | ID: mdl-37947402

ABSTRACT

Metagenomics is a powerful method for interpreting the ecological roles and physiological capabilities of mixed microbial communities. Yet, many tools for processing metagenomic data are neither designed to consider eukaryotes nor are they built for an increasing amount of sequence data. EukHeist is an automated pipeline to retrieve eukaryotic and prokaryotic metagenome-assembled genomes (MAGs) from large-scale metagenomic sequence data sets. We developed the EukHeist workflow to specifically process large amounts of both metagenomic and/or metatranscriptomic sequence data in an automated and reproducible fashion. Here, we applied EukHeist to the large-size fraction data (0.8-2,000 µm) from Tara Oceans to recover both eukaryotic and prokaryotic MAGs, which we refer to as TOPAZ (Tara Oceans Particle-Associated MAGs). The TOPAZ MAGs consisted of >900 environmentally relevant eukaryotic MAGs and >4,000 bacterial and archaeal MAGs. The bacterial and archaeal TOPAZ MAGs expand upon the phylogenetic diversity of likely particle- and host-associated taxa. We use these MAGs to demonstrate an approach to infer the putative trophic mode of the recovered eukaryotic MAGs. We also identify ecological cohorts of co-occurring MAGs, which are driven by specific environmental factors and putative host-microbe associations. These data together add to a number of growing resources of environmentally relevant eukaryotic genomic information. Complementary and expanded databases of MAGs, such as those provided through scalable pipelines like EukHeist, stand to advance our understanding of eukaryotic diversity through increased coverage of genomic representatives across the tree of life.IMPORTANCESingle-celled eukaryotes play ecologically significant roles in the marine environment, yet fundamental questions about their biodiversity, ecological function, and interactions remain. Environmental sequencing enables researchers to document naturally occurring protistan communities, without culturing bias, yet metagenomic and metatranscriptomic sequencing approaches cannot separate individual species from communities. To more completely capture the genomic content of mixed protistan populations, we can create bins of sequences that represent the same organism (metagenome-assembled genomes [MAGs]). We developed the EukHeist pipeline, which automates the binning of population-level eukaryotic and prokaryotic genomes from metagenomic reads. We show exciting insight into what protistan communities are present and their trophic roles in the ocean. Scalable computational tools, like EukHeist, may accelerate the identification of meaningful genetic signatures from large data sets and complement researchers' efforts to leverage MAG databases for addressing ecological questions, resolving evolutionary relationships, and discovering potentially novel biodiversity.

3.
bioRxiv ; 2023 Aug 21.
Article in English | MEDLINE | ID: mdl-37662412

ABSTRACT

Chronic Pseudomonas aeruginosa lung infections are a distinctive feature of cystic fibrosis (CF) pathology, that challenge adults with CF even with the advent of highly effective modulator therapies. Characterizing P. aeruginosa transcription in the CF lung and identifying factors that drive gene expression could yield novel strategies to eradicate infection or otherwise improve outcomes. To complement published P. aeruginosa gene expression studies in laboratory culture models designed to model the CF lung environment, we employed an ex vivo sputum model in which laboratory strain PAO1 was incubated in sputum from different CF donors. As part of the analysis, we compared PAO1 gene expression in this "spike-in" sputum model to that for P. aeruginosa grown in artificial sputum medium (ASM). Analyses focused on genes that were differentially expressed between sputum and ASM and genes that were most highly expressed in sputum. We present a new approach that used sets of genes with correlated expression, identified by the gene expression analysis tool eADAGE, to analyze the differential activity of pathways in P. aeruginosa grown in CF sputum from different individuals. A key characteristic of P. aeruginosa grown in expectorated CF sputum was related to zinc and iron acquisition, but this signal varied by donor sputum. In addition, a significant correlation between P. aeruginosa expression of the H1-type VI secretion system and corrector use by the sputum donor was observed. These methods may be broadly useful in looking for variable signals across clinical samples.

4.
mSystems ; 8(1): e0034122, 2023 02 23.
Article in English | MEDLINE | ID: mdl-36541761

ABSTRACT

Thousands of Pseudomonas aeruginosa RNA sequencing (RNA-seq) gene expression profiles are publicly available via the National Center for Biotechnology Information (NCBI) Sequence Read Archive (SRA). In this work, the transcriptional profiles from hundreds of studies performed by over 75 research groups were reanalyzed in aggregate to create a powerful tool for hypothesis generation and testing. Raw sequence data were uniformly processed using the Salmon pseudoaligner, and this read mapping method was validated by comparison to a direct alignment method. We developed filtering criteria to exclude samples with aberrant levels of housekeeping gene expression or an unexpected number of genes with no reported values and normalized the filtered compendia using the ratio-of-medians method. The filtering and normalization steps greatly improved gene expression correlations for genes within the same operon or regulon across the 2,333 samples. Since the RNA-seq data were generated using diverse strains, we report the effects of mapping samples to noncognate reference genomes by separately analyzing all samples mapped to cDNA reference genomes for strains PAO1 and PA14, two divergent strains that were used to generate most of the samples. Finally, we developed an algorithm to incorporate new data as they are deposited into the SRA. Our processing and quality control methods provide a scalable framework for taking advantage of the troves of biological information hibernating in the depths of microbial gene expression data and yield useful tools for P. aeruginosa RNA-seq data to be leveraged for diverse research goals. IMPORTANCE Pseudomonas aeruginosa is a causative agent of a wide range of infections, including chronic infections associated with cystic fibrosis. These P. aeruginosa infections are difficult to treat and often have negative outcomes. To aid in the study of this problematic pathogen, we mapped, filtered for quality, and normalized thousands of P. aeruginosa RNA-seq gene expression profiles that were publicly available via the National Center for Biotechnology Information (NCBI) Sequence Read Archive (SRA). The resulting compendia facilitate analyses across experiments, strains, and conditions. Ultimately, the workflow that we present could be applied to analyses of other microbial species.


Subject(s)
Cystic Fibrosis , Pseudomonas aeruginosa , Humans , Pseudomonas aeruginosa/genetics , Transcriptome , RNA , Cystic Fibrosis/complications
5.
mSystems ; 8(1): e0034222, 2023 02 23.
Article in English | MEDLINE | ID: mdl-36541762

ABSTRACT

Pseudomonas aeruginosa is an opportunistic pathogen that causes difficult-to-treat infections. Two well-studied divergent P. aeruginosa strain types, PAO1 and PA14, have significant genomic heterogeneity, including diverse accessory genes present in only some strains. Genome content comparisons find core genes that are conserved across both PAO1 and PA14 strains and accessory genes that are present in only a subset of PAO1 and PA14 strains. Here, we use recently assembled transcriptome compendia of publicly available P. aeruginosa RNA sequencing (RNA-seq) samples to create two smaller compendia consisting of only strain PAO1 or strain PA14 samples with each aligned to their cognate reference genome. We confirmed strain annotations and identified other samples for inclusion by assessing each sample's median expression of PAO1-only or PA14-only accessory genes. We then compared the patterns of core gene expression in each strain. To do so, we developed a method by which we analyzed genes in terms of which genes showed similar expression patterns across strain types. We found that some core genes had consistent correlated expression patterns across both compendia, while others were less stable in an interstrain comparison. For each accessory gene, we also determined core genes with correlated expression patterns. We found that stable core genes had fewer coexpressed neighbors that were accessory genes. Overall, this approach for analyzing expression patterns across strain types can be extended to other groups of genes, like phage genes, or applied for analyzing patterns beyond groups of strains, such as samples with different traits, to reveal a deeper understanding of regulation. IMPORTANCE Pseudomonas aeruginosa is a ubiquitous pathogen. There is much diversity among P. aeruginosa strains, including two divergent but well-studied strains, PAO1 and PA14. Understanding how these different strain-level traits manifest is important for identifying targets that regulate different traits of interest. With the availability of thousands of PAO1 and PA14 samples, we created two strain-specific RNA-seq compendia where each one contains hundreds of samples from PAO1 or PA14 strains and used them to compare the expression patterns of core genes that are conserved in both strain types and to determine which core genes have expression patterns that are similar to those of accessory genes that are unique to one strain or the other using an approach that we developed. We found a subset of core genes with different transcriptional patterns across PAO1 and PA14 strains and identified those core genes with expression patterns similar to those of strain-specific accessory genes.


Subject(s)
Genomics , Pseudomonas aeruginosa , Pseudomonas aeruginosa/genetics , Base Sequence
7.
Mol Cell ; 82(20): 3856-3871.e6, 2022 10 20.
Article in English | MEDLINE | ID: mdl-36220102

ABSTRACT

To determine which transcripts should reach the cytoplasm for translation, eukaryotic cells have established mechanisms to regulate selective mRNA export through the nuclear pore complex (NPC). The nuclear basket, a substructure of the NPC protruding into the nucleoplasm, is thought to function as a stable platform where mRNA-protein complexes (mRNPs) are rearranged and undergo quality control prior to export, ensuring that only mature mRNAs reach the cytoplasm. Here, we use proteomic, genetic, live-cell, and single-molecule resolution microscopy approaches in budding yeast to demonstrate that basket formation is dependent on RNA polymerase II transcription and subsequent mRNP processing. We further show that while all NPCs can bind Mlp1, baskets assemble only on a subset of nucleoplasmic NPCs, and these basket-containing NPCs associate a distinct protein and RNA interactome. Taken together, our data point toward NPC heterogeneity and an RNA-dependent mechanism for functionalization of NPCs in budding yeast through nuclear basket assembly.


Subject(s)
Nuclear Pore , Saccharomycetales , Nuclear Pore/genetics , Nuclear Pore/metabolism , Saccharomycetales/genetics , Saccharomycetales/metabolism , RNA Polymerase II/genetics , RNA Polymerase II/metabolism , Proteomics , Active Transport, Cell Nucleus/physiology , Cell Nucleus/genetics , Cell Nucleus/metabolism , RNA, Messenger/genetics , RNA, Messenger/metabolism , Nuclear Pore Complex Proteins/genetics , Nuclear Pore Complex Proteins/metabolism
8.
Comput Struct Biotechnol J ; 20: 4315-4324, 2022.
Article in English | MEDLINE | ID: mdl-36016717

ABSTRACT

A gene expression compendium is a heterogeneous collection of gene expression experiments assembled from data collected for diverse purposes. The widely varied experimental conditions and genetic backgrounds across samples creates a tremendous opportunity for gaining a systems level understanding of the transcriptional responses that influence phenotypes. Variety in experimental design is particularly important for studying microbes, where the transcriptional responses integrate many signals and demonstrate plasticity across strains including response to what nutrients are available and what microbes are present. Advances in high-throughput measurement technology have made it feasible to construct compendia for many microbes. In this review we discuss how these compendia are constructed and analyzed to reveal transcriptional patterns.

9.
Nat Microbiol ; 7(2): 193-194, 2022 02.
Article in English | MEDLINE | ID: mdl-34980920
10.
Front Immunol ; 12: 670309, 2021.
Article in English | MEDLINE | ID: mdl-34594320

ABSTRACT

Natural killer (NK) cells are key effectors of the innate immune system, but major differences between human and murine NK cells have impeded translation. Outbred dogs offer an important link for studies of NK biology and immunotherapy. We analyzed gene expression of putative NK populations from healthy dogs and dogs with naturally-occurring cancers examining differential gene expression across multiple conditions, including steady-state, in vitro activation with cytokines and co-culture, and in vivo activation with inhaled IL-15 in dogs receiving IL-15 immunotherapy. We also compared dog, mouse and human CD3-NKp46+ NK cells using a novel orthologous transcriptome. Distinct transcriptional profiles between NK populations exist between conditions and in vitro versus in vivo treatments. In cross-species analysis, canine NK cells were globally more similar to human NK cells than mice. These data define canine NK cell gene expression under multiple conditions and across species, filling an important gap in translational NK studies.


Subject(s)
Bone Neoplasms , Dog Diseases , Immunotherapy , Killer Cells, Natural , Lung Neoplasms , Melanoma , Osteosarcoma , Transcriptome , Adult , Aged , Animals , Dogs , Female , Humans , Male , Mice , Middle Aged , Young Adult , Administration, Inhalation , Blood Donors , Bone Neoplasms/genetics , Bone Neoplasms/immunology , Bone Neoplasms/pathology , Bone Neoplasms/veterinary , Dog Diseases/genetics , Dog Diseases/immunology , Dog Diseases/therapy , Gene Expression Regulation, Neoplastic/immunology , Healthy Volunteers , Immunologic Factors/administration & dosage , Immunotherapy/methods , Interleukin-15/administration & dosage , K562 Cells , Killer Cells, Natural/immunology , Lung Neoplasms/immunology , Lung Neoplasms/secondary , Lung Neoplasms/therapy , Lung Neoplasms/veterinary , Melanoma/genetics , Melanoma/immunology , Melanoma/pathology , Melanoma/veterinary , Mice, Inbred C57BL , Osteosarcoma/genetics , Osteosarcoma/immunology , Osteosarcoma/pathology , Osteosarcoma/veterinary , Treatment Outcome
11.
mBio ; 12(5): e0234521, 2021 10 26.
Article in English | MEDLINE | ID: mdl-34607457

ABSTRACT

During fermentation, Saccharomyces cerevisiae metabolizes sugars and other nutrients to obtain energy for growth and survival, while also modulating these activities in response to cell-environment interactions. Here, differences in S. cerevisiae gene expression were explored over a time course of fermentation and used to differentiate fermentations, using Pinot noir grapes from 15 unique sites. Data analysis was complicated by the fact that the fermentations proceeded at different rates, making a direct comparison of time series gene expression data difficult with conventional differential expression tools. This led to the development of a novel approach combining diffusion mapping with continuous differential expression analysis (termed DMap-DE). Using this method, site-specific deviations in gene expression were identified, including changes in gene expression correlated with the non-Saccharomyces yeast Hanseniaspora uvarum, as well as initial nitrogen concentrations in grape musts. These results highlight novel relationships between site-specific variables and Saccharomyces cerevisiae gene expression that are linked to repeated fermentation outcomes. It was also demonstrated that DMap-DE can extract biologically relevant gene expression patterns from other contexts (e.g., hypoxic response of Saccharomyces cerevisiae) and offers advantages over other data dimensionality reduction approaches, indicating that DMap-DE offers a robust method for investigating asynchronous time series gene expression data. IMPORTANCE In this work, Saccharomyces cerevisiae gene expression was used as a biosensor to capture differences across and between fermentations of Pinot noir grapes from 15 unique sites representing eight American Viticultural Areas. This required development of a novel analysis method, DMap-DE, for investigation of asynchronous gene expression data. It was demonstrated that DMap-DE reveals biologically relevant shifts in gene expression related to cell-environment interactions in the context of hypoxia and fermentation. Using these data, it was discovered that gene expression by non-Saccharomyces yeasts and initial nitrogen content in grape musts are correlated with differences in gene expression among fermentations. These findings highlight important relationships between site-specific variables and gene expression that may be used to understand why foods and beverages, including wine, possess sensory characteristics associated with or derived from their place of origin.


Subject(s)
Computational Biology/methods , Saccharomyces cerevisiae Proteins/genetics , Saccharomyces cerevisiae/genetics , Fermentation , Gene Expression Regulation, Fungal , Hanseniaspora/genetics , Hanseniaspora/growth & development , Hanseniaspora/metabolism , RNA-Seq , Saccharomyces cerevisiae/growth & development , Saccharomyces cerevisiae/metabolism , Saccharomyces cerevisiae Proteins/metabolism , Vitis/microbiology
12.
mSystems ; 6(2)2021 Apr 13.
Article in English | MEDLINE | ID: mdl-33850038

ABSTRACT

Ribosomal DNA amplicon sequencing of grape musts has demonstrated that microorganisms occur nonrandomly and are associated with the vineyard of origin, suggesting a role for the vineyard, grape, and wine microbiome in shaping wine fermentation outcomes. Here, ribosomal DNA amplicon sequencing from grape musts and RNA sequencing of eukaryotic transcripts from primary fermentations inoculated with the wine yeast Saccharomyces cerevisiae RC212 were used to profile fermentations from 15 vineyards in California and Oregon across two vintages. These data demonstrate that the relative abundance of fungal organisms detected by ribosomal DNA amplicon sequencing correlated with neither transcript abundance from those same organisms within the RNA sequencing data nor gene expression of the inoculated RC212 yeast strain. These data suggest that the majority of the fungi detected in must by ribosomal DNA amplicon sequencing were not active during the primary stage of these inoculated fermentations and were not a major factor in determining RC212 gene expression. However, unique genetic signatures were detected within the ribosomal DNA amplicon and eukaryotic transcriptomic sequencing that were predictive of vineyard site and region. These signatures included S. cerevisiae gene expression patterns linked to nitrogen, sulfur, and thiamine metabolism. These genetic signatures of site offer insight into specific environmental factors to consider with respect to fermentation outcomes and vineyard site and regional wine characteristics.IMPORTANCE The wine industry generates billions of dollars of revenue annually, and economic productivity is in part associated with regional distinctiveness of wine sensory attributes. Microorganisms associated with grapes and wineries are influenced by region of origin, and given that some microorganisms play a role in fermentation, it is thought that microbes may contribute to the regional distinctiveness of wine. In this work, as in previous studies, it is demonstrated that specific bacteria and fungi are associated with individual wine regions and vineyard sites. However, this work further shows that their presence is not associated with detectable fungal gene expression during the primary fermentation or the expression of specific genes by the inoculate Saccharomyces cerevisiae strain RC212. The detected RC212 gene expression signatures associated with region and vineyard site also allowed the identification of flavor-associated metabolic processes and environmental factors that could impact primary fermentation outcomes. These data offer novel insights into the complexities and subtleties of vineyard-specific inoculated wine fermentation and starting points for future investigations into factors that contribute to regional wine distinctiveness.

13.
Food Res Int ; 141: 110045, 2021 03.
Article in English | MEDLINE | ID: mdl-33641957

ABSTRACT

The reproducibility of elemental profile in wines produced across vintages of 2015 and 2016 has been studied using grapes from a single scion clone of Vitis vinifera L. cv. Pinot noir. Grapevines were grown on fourteen different vineyard sites, from Oregon to southern California in the U.S.A., which span distances from approximately hundreds of meters to 1450 km, while elevations range from near sea level to nearly 500 m. The number of elements quantified in the wines made from the 2016 vintage was thirty, by using inductively coupled plasma mass spectrometry (ICP-MS). These data were compared with the twenty-seven elements quantified and previously reported in wines made from 2015 vintage, including twenty-four elements reported in both vintages. The composition of each element was analyzed by analysis of variance with main effect of vineyard. Wines were classified according to vineyard origin and environmental growing site with a combination of factors correlated with the wine elemental profile. The low variability (< 25%) of certain elements in wines from at least eight sites across both vintages, including Group 1 (Cs, K, Na and Rb), Group 2 (Ba, Ca, Mg and Sr), Group 3B (Eu), Group 13 (Al, B and Ga), Group 15 (As and P) and Co, Fe, Mn, Ni and V, demonstrated the reproducibility over the seasons analyzed (2015 and 2016). The comparison of elemental profile of wines across growing seasons demonstrates the opportunity to reproduce one key aspect of wine chemistry across vintages.


Subject(s)
Vitis , Wine , Farms , Reproducibility of Results , Wine/analysis
14.
Appl Environ Microbiol ; 87(11)2021 05 11.
Article in English | MEDLINE | ID: mdl-33741633

ABSTRACT

Saccharomyces cerevisiae metabolism produces ethanol and other compounds during the fermentation of grape must into wine. Thousands of genes change expression over the course of a wine fermentation, allowing S. cerevisiae to adapt to and dominate the fermentation environment. Investigations into these gene expression patterns previously revealed genes that underlie cellular adaptation to the grape must and wine environments, involving metabolic specialization and ethanol tolerance. However, the majority of studies detailing gene expression patterns have occurred in controlled environments that may not recapitulate the biological and chemical complexity of fermentations performed at production scale. Here, an analysis of the S. cerevisiae RC212 gene expression program is presented, drawing from 40 pilot-scale fermentations (150 liters) using Pinot noir grapes from 10 California vineyards across two vintages. A core gene expression program was observed across all fermentations irrespective of vintage, similar to that of laboratory fermentations, in addition to novel gene expression patterns likely related to the presence of non-Saccharomyces microorganisms and oxygen availability during fermentation. These gene expression patterns, both common and diverse, provide insight into Saccharomyces cerevisiae biology critical to fermentation outcomes under industry-relevant conditions.IMPORTANCE This study characterized Saccharomyces cerevisiae RC212 gene expression during Pinot noir fermentation at pilot scale (150 liters) using industry-relevant conditions. The reported gene expression patterns of RC212 are generally similar to those observed under laboratory fermentation conditions but also contain gene expression signatures related to yeast-environment interactions found in a production setting (e.g., the presence of non-Saccharomyces microorganisms). Key genes and pathways highlighted by this work remain undercharacterized, indicating the need for further research to understand the roles of these genes and their impact on industrial wine fermentation outcomes.


Subject(s)
Gene Expression , Genes, Fungal , Saccharomyces cerevisiae Proteins/genetics , Saccharomyces cerevisiae/genetics , Wine/microbiology , Fermentation , Saccharomyces cerevisiae/metabolism , Saccharomyces cerevisiae Proteins/metabolism
15.
Gigascience ; 10(1)2021 01 13.
Article in English | MEDLINE | ID: mdl-33438730

ABSTRACT

As the scale of biological data generation has increased, the bottleneck of research has shifted from data generation to analysis. Researchers commonly need to build computational workflows that include multiple analytic tools and require incremental development as experimental insights demand tool and parameter modifications. These workflows can produce hundreds to thousands of intermediate files and results that must be integrated for biological insight. Data-centric workflow systems that internally manage computational resources, software, and conditional execution of analysis steps are reshaping the landscape of biological data analysis and empowering researchers to conduct reproducible analyses at scale. Adoption of these tools can facilitate and expedite robust data analysis, but knowledge of these techniques is still lacking. Here, we provide a series of strategies for leveraging workflow systems with structured project, data, and resource management to streamline large-scale biological analysis. We present these practices in the context of high-throughput sequencing data analysis, but the principles are broadly applicable to biologists working beyond this field.


Subject(s)
Computational Biology , Software , Data Analysis , High-Throughput Nucleotide Sequencing , Workflow
16.
mSystems ; 6(1)2021 Jan 12.
Article in English | MEDLINE | ID: mdl-33436519

ABSTRACT

RNA sequencing (RNA-seq) has matured into a reliable and low-cost assay for transcriptome profiling and has been deployed across a range of systems. The computational tool space for the analysis of RNA-seq data has kept pace with advances in sequencing. Yet tool development has largely centered around the human transcriptome. While eukaryotic and prokaryotic transcriptomes are similar, key differences in transcribed units limit the transfer of wet-lab and computational tools between the two domains. The article by M. Chung, R. S. Adkins, J. S. A. Mattick, K. R. Bradwell, et al. (mSystems 6:e00917-20, 2021, https://doi.org/10.1128/mSystems.00917-20), demonstrates that integrating prokaryote-specific strategies into existing RNA-seq analyses improves read quantification. Unlike in eukaryotes, polycistronic transcripts derived from operons lead to sequencing reads that span multiple neighboring genes. Chung et al. introduce FADU, a software tool that performs a correction for such reads and thereby improves read quantification and biological interpretation of prokaryotic RNA sequencing.

17.
Nucleic Acids Res ; 48(20): 11675-11694, 2020 11 18.
Article in English | MEDLINE | ID: mdl-33137177

ABSTRACT

RNA-binding proteins (RBPs) are key mediators of RNA metabolism. Whereas some RBPs exhibit narrow transcript specificity, others function broadly across both coding and non-coding RNAs. Here, in Saccharomyces cerevisiae, we demonstrate that changes in RBP availability caused by disruptions to distinct cellular processes promote a common global breakdown in RNA metabolism and nuclear RNA homeostasis. Our data shows that stabilization of aberrant ribosomal RNA (rRNA) precursors in an enp1-1 mutant causes phenotypes similar to RNA exosome mutants due to nucleolar sequestration of the poly(A)-binding protein (PABP) Nab2. Decreased nuclear PABP availability is accompanied by genome-wide changes in RNA metabolism, including increased pervasive transcripts levels and snoRNA processing defects. These phenotypes are mitigated by overexpression of PABPs, inhibition of rDNA transcription, or alterations in TRAMP activity. Our results highlight the need for cells to maintain poly(A)-RNA levels in balance with PABPs and other RBPs with mutable substrate specificity across nucleoplasmic and nucleolar RNA processes.


Subject(s)
Cell Nucleus/metabolism , Nucleocytoplasmic Transport Proteins/metabolism , RNA Processing, Post-Transcriptional , RNA, Ribosomal/metabolism , RNA-Binding Proteins/metabolism , Saccharomyces cerevisiae Proteins/metabolism , Cell Nucleus/genetics , Exosome Multienzyme Ribonuclease Complex/genetics , Guanine Nucleotide Exchange Factors/genetics , Homeostasis , Mutation , Nuclear Proteins/genetics , Polyadenylation , RNA Precursors/metabolism , Saccharomyces cerevisiae/genetics , Saccharomyces cerevisiae Proteins/genetics , Transcriptome
19.
Genome Biol ; 21(1): 164, 2020 07 06.
Article in English | MEDLINE | ID: mdl-32631445

ABSTRACT

Genomes computationally inferred from large metagenomic data sets are often incomplete and may be missing functionally important content and strain variation. We introduce an information retrieval system for large metagenomic data sets that exploits the sparsity of DNA assembly graphs to efficiently extract subgraphs surrounding an inferred genome. We apply this system to recover missing content from genome bins and show that substantial genomic sequence variation is present in a real metagenome. Our software implementation is available at https://github.com/spacegraphcats/spacegraphcats under the 3-Clause BSD License.


Subject(s)
Algorithms , Genetic Variation , Genome , Metagenomics/methods , Software
20.
F1000Res ; 8: 1006, 2019.
Article in English | MEDLINE | ID: mdl-31508216

ABSTRACT

The sourmash software package uses MinHash-based sketching to create "signatures", compressed representations of DNA, RNA, and protein sequences, that can be stored, searched, explored, and taxonomically annotated. sourmash signatures can be used to estimate sequence similarity between very large data sets quickly and in low memory, and can be used to search large databases of genomes for matches to query genomes and metagenomes. sourmash is implemented in C++, Rust, and Python, and is freely available under the BSD license at http://github.com/dib-lab/sourmash.


Subject(s)
Genome , Software , Databases, Factual
SELECTION OF CITATIONS
SEARCH DETAIL