Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 6 de 6
Filtrar
Mais filtros

Base de dados
Tipo de documento
Intervalo de ano de publicação
1.
Bioinformatics ; 40(4)2024 Mar 29.
Artigo em Inglês | MEDLINE | ID: mdl-38569896

RESUMO

MOTIVATION: Long-read sequencing technologies, an attractive solution for many applications, often suffer from higher error rates. Alignment of multiple reads can improve base-calling accuracy, but some applications, e.g. sequencing mutagenized libraries where multiple distinct clones differ by one or few variants, require the use of barcodes or unique molecular identifiers. Unfortunately, sequencing errors can interfere with correct barcode identification, and a given barcode sequence may be linked to multiple independent clones within a given library. RESULTS: Here we focus on the target application of sequencing mutagenized libraries in the context of multiplexed assays of variant effects (MAVEs). MAVEs are increasingly used to create comprehensive genotype-phenotype maps that can aid clinical variant interpretation. Many MAVE methods use long-read sequencing of barcoded mutant libraries for accurate association of barcode with genotype. Existing long-read sequencing pipelines do not account for inaccurate sequencing or nonunique barcodes. Here, we describe Pacybara, which handles these issues by clustering long reads based on the similarities of (error-prone) barcodes while also detecting barcodes that have been associated with multiple genotypes. Pacybara also detects recombinant (chimeric) clones and reduces false positive indel calls. In three example applications, we show that Pacybara identifies and correctly resolves these issues. AVAILABILITY AND IMPLEMENTATION: Pacybara, freely available at https://github.com/rothlab/pacybara, is implemented using R, Python, and bash for Linux. It runs on GNU/Linux HPC clusters via Slurm, PBS, or GridEngine schedulers. A single-machine simplex version is also available.


Assuntos
Sequenciamento de Nucleotídeos em Larga Escala , Software , Análise de Sequência de DNA/métodos , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Biblioteca Gênica , Genótipo , Análise por Conglomerados
2.
Bioinformatics ; 38(10): 2927-2929, 2022 05 13.
Artigo em Inglês | MEDLINE | ID: mdl-35561209

RESUMO

SUMMARY: Use of PacBio sequencing for characterizing barcoded libraries of genetic variants is on the rise. However, current approaches in resolving PacBio sequencing artifacts can result in a high number of incorrectly identified or unusable reads. Here, we developed a PacBio Read Alignment Tool (PacRAT) that improves the accuracy of barcode-variant mapping through several steps of read alignment and consensus calling. To quantify the performance of our approach, we simulated PacBio reads from eight variant libraries of various lengths and showed that PacRAT improves the accuracy in pairing barcodes and variants across these libraries. Analysis of real (non-simulated) libraries also showed an increase in the number of reads that can be used for downstream analyses when using PacRAT. AVAILABILITY AND IMPLEMENTATION: PacRAT is written in Python and is freely available (https://github.com/dunhamlab/PacRAT). SUPPLEMENTARY INFORMATION: Supplemental data are available at Bioinformatics online.


Assuntos
Sequenciamento de Nucleotídeos em Larga Escala , Software , Algoritmos , Alinhamento de Sequência , Análise de Sequência de DNA
3.
PLoS Genet ; 15(9): e1008383, 2019 09.
Artigo em Inglês | MEDLINE | ID: mdl-31525194

RESUMO

Interspecific hybridization can introduce genetic variation that aids in adaptation to new or changing environments. Here, we investigate how hybrid adaptation to temperature and nutrient limitation may alter parental genome representation over time. We evolved Saccharomyces cerevisiae x Saccharomyces uvarum hybrids in nutrient-limited continuous culture at 15°C for 200 generations. In comparison to previous evolution experiments at 30°C, we identified a number of responses only observed in the colder temperature regime, including the loss of the S. cerevisiae allele in favor of the cryotolerant S. uvarum allele for several portions of the hybrid genome. In particular, we discovered a genotype by environment interaction in the form of a loss of heterozygosity event on chromosome XIII; which species' haplotype is lost or maintained is dependent on the parental species' temperature preference and the temperature at which the hybrid was evolved. We show that a large contribution to this directionality is due to a temperature dependent fitness benefit at a single locus, the high affinity phosphate transporter gene PHO84. This work helps shape our understanding of what forces impact genome evolution after hybridization, and how environmental conditions may promote or disfavor the persistence of hybrids over time.


Assuntos
Adaptação Biológica/genética , Hibridização Genética/genética , Simportadores de Próton-Fosfato/metabolismo , Proteínas de Saccharomyces cerevisiae/metabolismo , Adaptação Fisiológica/genética , Evolução Biológica , Quimera/genética , Temperatura Baixa , Aptidão Genética/genética , Variação Genética/genética , Genoma Fúngico/genética , Genótipo , Simportadores de Próton-Fosfato/genética , Saccharomyces/genética , Saccharomyces cerevisiae/genética , Proteínas de Saccharomyces cerevisiae/genética , Temperatura
4.
bioRxiv ; 2023 Dec 07.
Artigo em Inglês | MEDLINE | ID: mdl-36865234

RESUMO

Long read sequencing technologies, an attractive solution for many applications, often suffer from higher error rates. Alignment of multiple reads can improve base-calling accuracy, but some applications, e.g. sequencing mutagenized libraries where multiple distinct clones differ by one or few variants, require the use of barcodes or unique molecular identifiers. Unfortunately, sequencing errors can interfere with correct barcode identification, and a given barcode sequence may be linked to multiple independent clones within a given library. Here we focus on the target application of sequencing mutagenized libraries in the context of multiplexed assays of variant effects (MAVEs). MAVEs are increasingly used to create comprehensive genotype-phenotype maps that can aid clinical variant interpretation. Many MAVE methods use long-read sequencing of barcoded mutant libraries for accurate association of barcode with genotype. Existing long-read sequencing pipelines do not account for inaccurate sequencing or non-unique barcodes. Here, we describe Pacybara, which handles these issues by clustering long reads based on the similarities of (error-prone) barcodes while also detecting barcodes that have been associated with multiple genotypes. Pacybara also detects recombinant (chimeric) clones and reduces false positive indel calls. In three example applications, we show that Pacybara identifies and correctly resolves these issues.

5.
Curr Opin Genet Dev ; 76: 101979, 2022 10.
Artigo em Inglês | MEDLINE | ID: mdl-36075138

RESUMO

Expansion of sequencing efforts to include thousands of genomes is providing a fundamental resource for determining the genetic diversity that exists in a population. Now, high-throughput approaches are necessary to begin to understand the role these genotypic changes play in affecting phenotypic variation. Saccharomyces cerevisiae maintains its position as an excellent model system to determine the function of unknown variants with its exceptional genetic diversity, phenotypic diversity, and reliable genetic manipulation tools. Here, we review strategies and techniques developed in yeast that scale classic approaches of assessing variant function. These approaches improve our ability to better map quantitative trait loci at a higher resolution, even for rare variants, and are already providing greater insight into the role that different types of mutations play in phenotypic variation and evolution not just in yeast but across taxa.


Assuntos
Proteínas de Saccharomyces cerevisiae , Saccharomyces cerevisiae , Mapeamento Cromossômico , Variação Genética/genética , Fenótipo , Locos de Características Quantitativas/genética , Saccharomyces cerevisiae/genética , Proteínas de Saccharomyces cerevisiae/genética
6.
Genetics ; 218(2)2021 06 24.
Artigo em Inglês | MEDLINE | ID: mdl-33848333

RESUMO

Despite the fundamental importance of mutation rate as a driving force in evolution and disease risk, common methods to assay mutation rate are time-consuming and tedious. Established methods such as fluctuation tests and mutation accumulation experiments are low-throughput and often require significant optimization to ensure accuracy. We established a new method to determine the mutation rate of many strains simultaneously by tracking mutation events in a chemostat continuous culture device and applying deep sequencing to link mutations to alleles of a DNA-repair gene. We applied this method to assay the mutation rate of hundreds of Saccharomyces cerevisiae strains carrying mutations in the gene encoding Msh2, a DNA repair enzyme in the mismatch repair pathway. Loss-of-function mutations in MSH2 are associated with hereditary nonpolyposis colorectal cancer, an inherited disorder that increases risk for many different cancers. However, the vast majority of MSH2 variants found in human populations have insufficient evidence to be classified as either pathogenic or benign. We first benchmarked our method against Luria-Delbrück fluctuation tests using a collection of published MSH2 missense variants. Our pooled screen successfully identified previously characterized nonfunctional alleles as high mutators. We then created an additional 185 human missense variants in the yeast ortholog, including both characterized and uncharacterized alleles curated from ClinVar and other clinical testing data. In a set of alleles of known pathogenicity, our assay recapitulated ClinVar's classification; we then estimated pathogenicity for 157 variants classified as uncertain or conflicting reports of significance. This method is capable of studying the mutation rate of many microbial species and can be applied to problems ranging from the generation of high-fidelity polymerases to measuring the frequency of antibiotic resistance emergence.


Assuntos
Análise Mutacional de DNA/métodos , Ensaios de Triagem em Larga Escala/métodos , Proteína 2 Homóloga a MutS/genética , Proteínas de Saccharomyces cerevisiae/genética , Saccharomyces cerevisiae/genética , DNA Fúngico/genética , DNA Fúngico/isolamento & purificação , Resistência Microbiana a Medicamentos/genética , Biblioteca Gênica , Humanos , Taxa de Mutação , Mutação de Sentido Incorreto
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA