Your browser doesn't support javascript.
loading
: 20 | 50 | 100
1 - 6 de 6
1.
Bioinformatics ; 40(4)2024 Mar 29.
Article En | MEDLINE | ID: mdl-38569896

MOTIVATION: Long-read sequencing technologies, an attractive solution for many applications, often suffer from higher error rates. Alignment of multiple reads can improve base-calling accuracy, but some applications, e.g. sequencing mutagenized libraries where multiple distinct clones differ by one or few variants, require the use of barcodes or unique molecular identifiers. Unfortunately, sequencing errors can interfere with correct barcode identification, and a given barcode sequence may be linked to multiple independent clones within a given library. RESULTS: Here we focus on the target application of sequencing mutagenized libraries in the context of multiplexed assays of variant effects (MAVEs). MAVEs are increasingly used to create comprehensive genotype-phenotype maps that can aid clinical variant interpretation. Many MAVE methods use long-read sequencing of barcoded mutant libraries for accurate association of barcode with genotype. Existing long-read sequencing pipelines do not account for inaccurate sequencing or nonunique barcodes. Here, we describe Pacybara, which handles these issues by clustering long reads based on the similarities of (error-prone) barcodes while also detecting barcodes that have been associated with multiple genotypes. Pacybara also detects recombinant (chimeric) clones and reduces false positive indel calls. In three example applications, we show that Pacybara identifies and correctly resolves these issues. AVAILABILITY AND IMPLEMENTATION: Pacybara, freely available at https://github.com/rothlab/pacybara, is implemented using R, Python, and bash for Linux. It runs on GNU/Linux HPC clusters via Slurm, PBS, or GridEngine schedulers. A single-machine simplex version is also available.


High-Throughput Nucleotide Sequencing , Software , Sequence Analysis, DNA/methods , High-Throughput Nucleotide Sequencing/methods , Gene Library , Genotype , Cluster Analysis
2.
G3 (Bethesda) ; 13(7)2023 07 05.
Article En | MEDLINE | ID: mdl-37267226

The COVID-19 pandemic has catalyzed unprecedented scientific data and reagent sharing and collaboration, which enabled understanding the virology of the SARS-CoV-2 virus and vaccine development at record speed. The pandemic, however, has also raised awareness of the danger posed by the family of coronaviruses, of which 7 are known to infect humans and dozens have been identified in reservoir species, such as bats, rodents, or livestock. To facilitate understanding the commonalities and specifics of coronavirus infections and aspects of viral biology that determine their level of lethality to the human host, we have generated a collection of freely available clones encoding nearly all human coronavirus proteins known to date. We hope that this flexible, Gateway-compatible vector collection will encourage further research into the interactions of coronaviruses with their human host, to increase preparedness for future zoonotic viral outbreaks.


COVID-19 , Humans , COVID-19/epidemiology , SARS-CoV-2/genetics , Pandemics
3.
bioRxiv ; 2023 Dec 07.
Article En | MEDLINE | ID: mdl-36865234

Long read sequencing technologies, an attractive solution for many applications, often suffer from higher error rates. Alignment of multiple reads can improve base-calling accuracy, but some applications, e.g. sequencing mutagenized libraries where multiple distinct clones differ by one or few variants, require the use of barcodes or unique molecular identifiers. Unfortunately, sequencing errors can interfere with correct barcode identification, and a given barcode sequence may be linked to multiple independent clones within a given library. Here we focus on the target application of sequencing mutagenized libraries in the context of multiplexed assays of variant effects (MAVEs). MAVEs are increasingly used to create comprehensive genotype-phenotype maps that can aid clinical variant interpretation. Many MAVE methods use long-read sequencing of barcoded mutant libraries for accurate association of barcode with genotype. Existing long-read sequencing pipelines do not account for inaccurate sequencing or non-unique barcodes. Here, we describe Pacybara, which handles these issues by clustering long reads based on the similarities of (error-prone) barcodes while also detecting barcodes that have been associated with multiple genotypes. Pacybara also detects recombinant (chimeric) clones and reduces false positive indel calls. In three example applications, we show that Pacybara identifies and correctly resolves these issues.

4.
Nat Biotechnol ; 41(1): 140-149, 2023 01.
Article En | MEDLINE | ID: mdl-36217029

Understanding the mechanisms of coronavirus disease 2019 (COVID-19) disease severity to efficiently design therapies for emerging virus variants remains an urgent challenge of the ongoing pandemic. Infection and immune reactions are mediated by direct contacts between viral molecules and the host proteome, and the vast majority of these virus-host contacts (the 'contactome') have not been identified. Here, we present a systematic contactome map of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) with the human host encompassing more than 200 binary virus-host and intraviral protein-protein interactions. We find that host proteins genetically associated with comorbidities of severe illness and long COVID are enriched in SARS-CoV-2 targeted network communities. Evaluating contactome-derived hypotheses, we demonstrate that viral NSP14 activates nuclear factor κB (NF-κB)-dependent transcription, even in the presence of cytokine signaling. Moreover, for several tested host proteins, genetic knock-down substantially reduces viral replication. Additionally, we show for USP25 that this effect is phenocopied by the small-molecule inhibitor AZ1. Our results connect viral proteins to human genetic architecture for COVID-19 severity and offer potential therapeutic targets.


COVID-19 , SARS-CoV-2 , Humans , SARS-CoV-2/genetics , COVID-19/genetics , Proteome/genetics , Post-Acute COVID-19 Syndrome , Virus Replication/genetics , Ubiquitin Thiolesterase/pharmacology
5.
G3 (Bethesda) ; 10(9): 3399-3402, 2020 09 02.
Article En | MEDLINE | ID: mdl-32763951

The world is facing a global pandemic of COVID-19 caused by the SARS-CoV-2 coronavirus. Here we describe a collection of codon-optimized coding sequences for SARS-CoV-2 cloned into Gateway-compatible entry vectors, which enable rapid transfer into a variety of expression and tagging vectors. The collection is freely available. We hope that widespread availability of this SARS-CoV-2 resource will enable many subsequent molecular studies to better understand the viral life cycle and how to block it.


Betacoronavirus/genetics , Open Reading Frames/genetics , Betacoronavirus/isolation & purification , COVID-19 , Cloning, Molecular , Coronavirus Infections/pathology , Coronavirus Infections/virology , Escherichia coli/metabolism , Humans , Pandemics , Plasmids/genetics , Plasmids/metabolism , Pneumonia, Viral/pathology , Pneumonia, Viral/virology , Potyvirus/genetics , SARS-CoV-2
6.
Biol Open ; 7(1)2018 Jan 17.
Article En | MEDLINE | ID: mdl-29343513

Tuberous sclerosis complex is an autosomal dominant disorder characterized by benign tumors arising from the abnormal activation of mTOR signaling in cells lacking TSC1 (hamartin) or TSC2 (tuberin) activity. To expand the genetic framework surrounding this group of growth regulators, we utilized the model eukaryote Schizosaccharomyces pombe to uncover and characterize genes that buffer the phenotypic effects of mutations in the orthologous tsc1 or tsc2 loci. Our study identified two genes: fft3 (encoding a DNA helicase) and ypa1 (encoding a peptidyle-prolyl cis/trans isomerase). While the deletion of fft3 or ypa1 has little effect in wild-type fission yeast cells, their loss in tsc1Δ or tsc2Δ backgrounds results in severe growth inhibition. These data suggest that the inhibition of Ypa1p or Fft3p might represent an 'Achilles' heel' of cells defective in hamartin/tuberin function. Furthermore, we demonstrate that the interaction between tsc1/tsc2 and ypa1 can be rescued through treatment with the mTOR inhibitor, torin-1, and that ypa1Δ cells are resistant to the glycolytic inhibitor, 2-deoxyglucose. This identifies ypa1 as a novel upstream regulator of mTOR and suggests that the effects of ypa1 loss, together with mTOR activation, combine to result in a cellular maladaptation in energy metabolism that is profoundly inhibitory to growth.

...