Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 7 de 7
Filtrar
1.
BMC Genomics ; 16: 589, 2015 Aug 07.
Artigo em Inglês | MEDLINE | ID: mdl-26248467

RESUMO

BACKGROUND: PCR amplicon sequencing has been widely used as a targeted approach for both DNA and RNA sequence analysis. High multiplex PCR has further enabled the enrichment of hundreds of amplicons in one simple reaction. At the same time, the performance of PCR amplicon sequencing can be negatively affected by issues such as high duplicate reads, polymerase artifacts and PCR amplification bias. Recently researchers have made some good progress in addressing these shortcomings by incorporating molecular barcodes into PCR primer design. So far, most work has been demonstrated using one to a few pairs of primers, which limits the size of the region one can analyze. RESULTS: We developed a simple protocol, which enables the use of molecular barcodes in high multiplex PCR with hundreds of amplicons. Using this protocol and reference materials, we demonstrated the applications in accurate variant calling at very low fraction over a large region and in targeted RNA quantification. We also evaluated the protocol's utility in profiling FFPE samples. CONCLUSIONS: We demonstrated the successful implementation of molecular barcodes in high multiplex PCR, with multiplex scale many times higher than earlier work. We showed that the new protocol combines the benefits of both high multiplex PCR and molecular barcodes, i.e. the analysis of a very large region, low DNA input requirement, very good reproducibility and the ability to detect as low as 1% mutations with minimal false positives (FP).


Assuntos
Código de Barras de DNA Taxonômico/métodos , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Reação em Cadeia da Polimerase Multiplex/métodos , Artefatos , Primers do DNA/genética , Humanos , RNA/genética , Reprodutibilidade dos Testes , Análise de Sequência/métodos
2.
PLoS Comput Biol ; 8(5): e1002534, 2012 May.
Artigo em Inglês | MEDLINE | ID: mdl-22693440

RESUMO

The RNA world hypothesis views modern organisms as descendants of RNA molecules. The earliest RNA molecules must have been random sequences, from which the first genomes that coded for polymerase ribozymes emerged. The quasispecies theory by Eigen predicts the existence of an error threshold limiting genomic stability during such transitions, but does not address the spontaneity of changes. Following a recent theoretical approach, we applied the quasispecies theory combined with kinetic/thermodynamic descriptions of RNA replication to analyze the collective behavior of RNA replicators based on known experimental kinetics data. We find that, with increasing fidelity (relative rate of base-extension for Watson-Crick versus mismatched base pairs), replications without enzymes, with ribozymes, and with protein-based polymerases are above, near, and below a critical point, respectively. The prebiotic evolution therefore must have crossed this critical region. Over large regions of the phase diagram, fitness increases with increasing fidelity, biasing random drifts in sequence space toward 'crystallization.' This region encloses the experimental nonenzymatic fidelity value, favoring evolutions toward polymerase sequences with ever higher fidelity, despite error rates above the error catastrophe threshold. Our work shows that experimentally characterized kinetics and thermodynamics of RNA replication allow us to determine the physicochemical conditions required for the spontaneous crystallization of biological information. Our findings also suggest that among many potential oligomers capable of templated replication, RNAs may have evolved to form prebiotic genomes due to the value of their nonenzymatic fidelity.


Assuntos
Evolução Molecular , Genoma , Origem da Vida , RNA/metabolismo , DNA Polimerase Dirigida por DNA/metabolismo , Modelos Genéticos , RNA Catalítico/metabolismo , Processos Estocásticos , Termodinâmica
3.
BMC Bioinformatics ; 11: 340, 2010 Jun 23.
Artigo em Inglês | MEDLINE | ID: mdl-20573238

RESUMO

BACKGROUND: Pathogen diagnostic assays based on polymerase chain reaction (PCR) technology provide high sensitivity and specificity. However, the design of these diagnostic assays is computationally intensive, requiring high-throughput methods to identify unique PCR signatures in the presence of an ever increasing availability of sequenced genomes. RESULTS: We present the Tool for PCR Signature Identification (TOPSI), a high-performance computing pipeline for the design of PCR-based pathogen diagnostic assays. The TOPSI pipeline efficiently designs PCR signatures common to multiple bacterial genomes by obtaining the shared regions through pairwise alignments between the input genomes. TOPSI successfully designed PCR signatures common to 18 Staphylococcus aureus genomes in less than 14 hours using 98 cores on a high-performance computing system. CONCLUSIONS: TOPSI is a computationally efficient, fully integrated tool for high-throughput design of PCR signatures common to multiple bacterial genomes. TOPSI is freely available for download at http://www.bhsai.org/downloads/topsi.tar.gz.


Assuntos
Biologia Computacional/métodos , Genoma Bacteriano , Reação em Cadeia da Polimerase/métodos , Staphylococcus aureus/genética , Sequência de Bases , Burkholderia mallei/genética , Burkholderia pseudomallei/genética , Mapeamento Cromossômico , Sensibilidade e Especificidade , Staphylococcus aureus/classificação
4.
Nat Med ; 25(12): 1928-1937, 2019 12.
Artigo em Inglês | MEDLINE | ID: mdl-31768066

RESUMO

Accurate identification of tumor-derived somatic variants in plasma circulating cell-free DNA (cfDNA) requires understanding of the various biological compartments contributing to the cfDNA pool. We sought to define the technical feasibility of a high-intensity sequencing assay of cfDNA and matched white blood cell DNA covering a large genomic region (508 genes; 2 megabases; >60,000× raw depth) in a prospective study of 124 patients with metastatic cancer, with contemporaneous matched tumor tissue biopsies, and 47 controls without cancer. The assay displayed high sensitivity and specificity, allowing for de novo detection of tumor-derived mutations and inference of tumor mutational burden, microsatellite instability, mutational signatures and sources of somatic mutations identified in cfDNA. The vast majority of cfDNA mutations (81.6% in controls and 53.2% in patients with cancer) had features consistent with clonal hematopoiesis. This cfDNA sequencing approach revealed that clonal hematopoiesis constitutes a pervasive biological phenomenon, emphasizing the importance of matched cfDNA-white blood cell sequencing for accurate variant interpretation.


Assuntos
Ácidos Nucleicos Livres/sangue , DNA Tumoral Circulante/sangue , Genômica , Neoplasias/sangue , Adulto , Biomarcadores Tumorais/sangue , DNA Tumoral Circulante/genética , Análise Mutacional de DNA , DNA de Neoplasias/sangue , Feminino , Regulação Neoplásica da Expressão Gênica , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Masculino , Instabilidade de Microssatélites , Pessoa de Meia-Idade , Mutação , Neoplasias/genética , Neoplasias/patologia
5.
BMC Bioinformatics ; 9: 185, 2008 Apr 10.
Artigo em Inglês | MEDLINE | ID: mdl-18402679

RESUMO

BACKGROUND: We present a methodology for high-throughput design of oligonucleotide fingerprints for microarray-based pathogen diagnostic assays. The oligonucleotide fingerprints, or DNA microarray probes, are designed for identifying target organisms in environmental or clinical samples. The design process is implemented in a high-performance computing software pipeline that incorporates major algorithmic improvements over a previous version to both reduce computation time and improve specificity assessment. RESULTS: The algorithmic improvements result in significant reduction in runtimes, with the updated pipeline being nearly up to five-times faster than the previous version. The improvements in specificity assessment, based on multiple specificity criteria, result in robust and consistent evaluation of cross-hybridization with nontarget sequences. In addition, the multiple criteria provide finer control on the number of resulting fingerprints, which helps in obtaining a larger number of fingerprints with high specificity. Simulation tests for Francisella tularensis and Yersinia pestis, using a well-established hybridization model to estimate cross-hybridization with nontarget sequences, show that the improved specificity criteria yield a larger number of fingerprints as compared to using a single specificity criterion. CONCLUSION: The faster runtimes, achieved as the result of algorithmic improvements, are critical for extending the pipeline to process multiple target genomes. The larger numbers of identified fingerprints, obtained by considering broader specificity criteria, are essential for designing probes for hard-to-distinguish target sequences.


Assuntos
Bactérias/genética , Bactérias/isolamento & purificação , Impressões Digitais de DNA/métodos , Sondas de DNA/genética , DNA Bacteriano/genética , Ilhas Genômicas/genética , Análise de Sequência com Séries de Oligonucleotídeos/métodos , Algoritmos , Mapeamento Cromossômico/métodos
6.
BMC Genomics ; 9: 496, 2008 Oct 21.
Artigo em Inglês | MEDLINE | ID: mdl-18940003

RESUMO

BACKGROUND: With multiple strains of various pathogens being sequenced, it is necessary to develop high-throughput methods that can simultaneously process multiple bacterial or viral genomes to find common fingerprints as well as fingerprints that are unique to each individual genome. We present algorithmic enhancements to an existing single-genome pipeline that allows for efficient design of microarray probes common to groups of target genomes. The enhanced pipeline takes advantage of the similarities in the input genomes to narrow the search to short, nonredundant regions of the target genomes and, thereby, significantly reduces the computation time. The pipeline also computes a three-state hybridization matrix, which gives the expected hybridization of each probe with each target. RESULTS: Design of microarray probes for eight pathogenic Burkholderia genomes shows that the multiple-genome pipeline is nearly four-times faster than the single-genome pipeline for this application. The probes designed for these eight genomes were experimentally tested with one non-target and three target genomes. Hybridization experiments show that less than 10% of the designed probes cross hybridize with non-targets. Also, more than 65% of the probes designed to identify all Burkholderia mallei and B. pseudomallei strains successfully hybridize with a B. pseudomallei strain not used for probe design. CONCLUSION: The savings in runtime suggest that the enhanced pipeline can be used to design fingerprints for tens or even hundreds of related genomes in a single run. Hybridization results with an unsequenced B. pseudomallei strain indicate that the designed probes might be useful in identifying unsequenced strains of B. mallei and B. pseudomallei.


Assuntos
Burkholderia/genética , Impressões Digitais de DNA/métodos , Sondas de DNA , Genoma Bacteriano , Análise de Sequência com Séries de Oligonucleotídeos/métodos , Algoritmos , Técnicas de Tipagem Bacteriana , Burkholderia/classificação , Biologia Computacional , DNA Bacteriano/genética , Sensibilidade e Especificidade , Análise de Sequência de DNA
7.
Source Code Biol Med ; 6: 14, 2011 Sep 08.
Artigo em Inglês | MEDLINE | ID: mdl-21902825

RESUMO

With ever-increasing numbers of microbial genomes being sequenced, efficient tools are needed to perform strain-level identification of any newly sequenced genome. Here, we present the SNP identification for strain typing (SNIT) pipeline, a fast and accurate software system that compares a newly sequenced bacterial genome with other genomes of the same species to identify single nucleotide polymorphisms (SNPs) and small insertions/deletions (indels). Based on this information, the pipeline analyzes the polymorphic loci present in all input genomes to identify the genome that has the fewest differences with the newly sequenced genome. Similarly, for each of the other genomes, SNIT identifies the input genome with the fewest differences. Results from five bacterial species show that the SNIT pipeline identifies the correct closest neighbor with 75% to 100% accuracy. The SNIT pipeline is available for download at http://www.bhsai.org/snit.html.

SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA