Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 7 de 7
Filtrar
1.
Nucleic Acids Res ; 49(17): e102, 2021 09 27.
Artículo en Inglés | MEDLINE | ID: mdl-34214168

RESUMEN

Rapidly evolving RNA viruses continuously produce minority haplotypes that can become dominant if they are drug-resistant or can better evade the immune system. Therefore, early detection and identification of minority viral haplotypes may help to promptly adjust the patient's treatment plan preventing potential disease complications. Minority haplotypes can be identified using next-generation sequencing, but sequencing noise hinders accurate identification. The elimination of sequencing noise is a non-trivial task that still remains open. Here we propose CliqueSNV based on extracting pairs of statistically linked mutations from noisy reads. This effectively reduces sequencing noise and enables identifying minority haplotypes with the frequency below the sequencing error rate. We comparatively assess the performance of CliqueSNV using an in vitro mixture of nine haplotypes that were derived from the mutation profile of an existing HIV patient. We show that CliqueSNV can accurately assemble viral haplotypes with frequencies as low as 0.1% and maintains consistent performance across short and long bases sequencing platforms.


Asunto(s)
Algoritmos , Biología Computacional/métodos , Haplotipos , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Infecciones por Virus ARN/diagnóstico , Virus ARN/genética , COVID-19/diagnóstico , COVID-19/virología , Frecuencia de los Genes , Infecciones por VIH/diagnóstico , Infecciones por VIH/virología , VIH-1/genética , Humanos , Mutación , Polimorfismo de Nucleótido Simple , Infecciones por Virus ARN/virología , Reproducibilidad de los Resultados , SARS-CoV-2/genética , Sensibilidad y Especificidad
2.
BMC Genomics ; 18(Suppl 10): 918, 2017 Dec 06.
Artículo en Inglés | MEDLINE | ID: mdl-29244009

RESUMEN

BACKGROUND: RNA viruses such as HCV and HIV mutate at extremely high rates, and as a result, they exist in infected hosts as populations of genetically related variants. Recent advances in sequencing technologies make possible to identify such populations at great depth. In particular, these technologies provide new opportunities for inference of relatedness between viral samples, identification of transmission clusters and sources of infection, which are crucial tasks for viral outbreaks investigations. RESULTS: We present (i) an evolutionary simulation algorithm Viral Outbreak InferenCE (VOICE) inferring genetic relatedness, (ii) an algorithm MinDistB detecting possible transmission using minimal distances between intra-host viral populations and sizes of their relative borders, and (iii) a non-parametric recursive clustering algorithm Relatedness Depth (ReD) analyzing clusters' structure to infer possible transmissions and their directions. All proposed algorithms were validated using real sequencing data from HCV outbreaks. CONCLUSIONS: All algorithms are applicable to the analysis of outbreaks of highly heterogeneous RNA viruses. Our experimental validation shows that they can successfully identify genetic relatedness between viral populations, as well as infer transmission clusters and outbreak sources.


Asunto(s)
Biología Computacional , Hepacivirus/genética , Filogenia , Cuasiespecies/genética , Análisis de Secuencia de ARN , Algoritmos , Análisis por Conglomerados , Genoma Viral/genética , ARN Viral/genética
3.
Bioinformatics ; 31(5): 682-90, 2015 Mar 01.
Artículo en Inglés | MEDLINE | ID: mdl-25359889

RESUMEN

MOTIVATION: Next-generation sequencing (NGS) allows for analyzing a large number of viral sequences from infected patients, providing an opportunity to implement large-scale molecular surveillance of viral diseases. However, despite improvements in technology, traditional protocols for NGS of large numbers of samples are still highly cost and labor intensive. One of the possible cost-effective alternatives is combinatorial pooling. Although a number of pooling strategies for consensus sequencing of DNA samples and detection of SNPs have been proposed, these strategies cannot be applied to sequencing of highly heterogeneous viral populations. RESULTS: We developed a cost-effective and reliable protocol for sequencing of viral samples, that combines NGS using barcoding and combinatorial pooling and a computational framework including algorithms for optimal virus-specific pools design and deconvolution of individual samples from sequenced pools. Evaluation of the framework on experimental and simulated data for hepatitis C virus showed that it substantially reduces the sequencing costs and allows deconvolution of viral populations with a high accuracy. AVAILABILITY AND IMPLEMENTATION: The source code and experimental data sets are available at http://alan.cs.gsu.edu/NGS/?q=content/pooling.


Asunto(s)
Algoritmos , Biología Computacional/métodos , ADN Viral/genética , Genoma Viral , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Análisis de Secuencia de ADN/métodos , Virus/clasificación , Virus/genética , Variación Genética , Hepacivirus/clasificación , Hepacivirus/genética , Humanos
4.
BMC Bioinformatics ; 14 Suppl 9: S2, 2013.
Artículo en Inglés | MEDLINE | ID: mdl-23902469

RESUMEN

BACKGROUND: Highly mutable RNA viruses exist in infected hosts as heterogeneous populations of genetically close variants known as quasispecies. Next-generation sequencing (NGS) allows for analysing a large number of viral sequences from infected patients, presenting a novel opportunity for studying the structure of a viral population and understanding virus evolution, drug resistance and immune escape. Accurate reconstruction of genetic composition of intra-host viral populations involves assembling the NGS short reads into whole-genome sequences and estimating frequencies of individual viral variants. Although a few approaches were developed for this task, accurate reconstruction of quasispecies populations remains greatly unresolved. RESULTS: Two new methods, AmpMCF and ShotMCF, for reconstruction of the whole-genome intra-host viral variants and estimation of their frequencies were developed, based on Multicommodity Flows (MCFs). AmpMCF was designed for NGS reads obtained from individual PCR amplicons and ShotMCF for NGS shotgun reads. While AmpMCF, based on covering formulation, identifies a minimal set of quasispecies explaining all observed reads, ShotMCS, based on packing formulation, engages the maximal number of reads to generate the most probable set of quasispecies. Both methods were evaluated on simulated data in comparison to Maximum Bandwidth and ViSpA, previously developed state-of-the-art algorithms for estimating quasispecies spectra from the NGS amplicon and shotgun reads, respectively. Both algorithms were accurate in estimation of quasispecies frequencies, especially from large datasets. CONCLUSIONS: The problem of viral population reconstruction from amplicon or shotgun NGS reads was solved using the MCF formulation. The two methods, ShotMCF and AmpMCF, developed here afford accurate reconstruction of the structure of intra-host viral population from NGS reads. The implementations of the algorithms are available at http://alan.cs.gsu.edu/vira.html (AmpMCF) and http://alan.cs.gsu.edu/NGS/?q=content/shotmcf (ShotMCF).


Asunto(s)
Algoritmos , Variación Genética , Genoma Viral , Virus ARN/genética , Análisis de Secuencia de ARN/métodos , Hepacivirus/clasificación , Hepacivirus/genética , Virus ARN/clasificación
5.
Bone ; 168: 116642, 2023 03.
Artículo en Inglés | MEDLINE | ID: mdl-36526263

RESUMEN

OBJECTIVES: To assess the reliability and validity of age-specific versions of the Fibrodysplasia Ossificans Progressiva Physical Function Questionnaire (FOP-PFQ), developed to measure the impact of FOP on physical function and activities of daily living. METHODS: FOP-PFQ development included a literature review, two iterative phases of qualitative work involving individuals with FOP, and clinical expert review. The analysis used pooled FOP-PFQ data from an FOP natural history study (NCT02322255), a patient registry (NCT02745158), and phase II trials (NCT02190747; NCT02279095; NCT02979769). Item-level and factor analysis informed item retention and determined factor structure. Reliability was evaluated using Cronbach's alpha and intraclass correlation coefficients. Convergent validity was assessed by comparing scores with age, the Cumulative Analogue Joint Involvement Scale (CAJIS), the Patient-Reported Outcomes Measurement Information System Global Health Scale (PROMIS), and heterotopic ossification (HO) volume. Known-groups validity assessment used age, CAJIS, and HO volume. RESULTS: Factor analysis confirmed a two-factor solution: Mobility and Upper Extremity. Results reflected high internal consistency and were supportive of test-retest reliability; correlation coefficients >0.90 demonstrated FOP-PFQ scores were stable over a one- to three-week period. The majority of scores were moderately (r = 0.30-0.50) to highly (r ≥ 0.50) correlated with CAJIS and HO volume, supporting convergent validity. With the exception of some age-based and functional groups, FOP-PFQ scores were significantly worse in groups with more severe disease, demonstrating known-groups validity. CONCLUSION: The FOP-PFQ was demonstrated to be a reliable, valid measure that may be responsive to change in individuals with FOP, although some results were inconclusive for pediatric versions.


Asunto(s)
Miositis Osificante , Osificación Heterotópica , Humanos , Niño , Miositis Osificante/diagnóstico , Reproducibilidad de los Resultados , Actividades Cotidianas , Encuestas y Cuestionarios , Medición de Resultados Informados por el Paciente
6.
Clin Cancer Res ; 25(23): 7035-7045, 2019 12 01.
Artículo en Inglés | MEDLINE | ID: mdl-31383735

RESUMEN

PURPOSE: To analytically and clinically validate microsatellite instability (MSI) detection using cell-free DNA (cfDNA) sequencing. EXPERIMENTAL DESIGN: Pan-cancer MSI detection using Guardant360 was analytically validated according to established guidelines and clinically validated using 1,145 cfDNA samples for which tissue MSI status based on standard-of-care tissue testing was available. The landscape of cfDNA-based MSI across solid tumor types was investigated in a cohort of 28,459 clinical plasma samples. Clinical outcomes for 16 patients with cfDNA MSI-H gastric cancer treated with immunotherapy were evaluated. RESULTS: cfDNA MSI evaluation was shown to have high specificity, precision, and sensitivity, with a limit of detection of 0.1% tumor content. In evaluable patients, cfDNA testing accurately detected 87% (71/82) of tissue MSI-H and 99.5% of tissue microsatellite stable (863/867) for an overall accuracy of 98.4% (934/949) and a positive predictive value of 95% (71/75). Concordance of cfDNA MSI with tissue PCR and next-generation sequencing was significantly higher than IHC. Prevalence of cfDNA MSI for major cancer types was consistent with those reported for tissue. Finally, robust clinical activity of immunotherapy treatment was seen in patients with advanced gastric cancer positive for MSI by cfDNA, with 63% (10/16) of patients achieving complete or partial remission with sustained clinical benefit. CONCLUSIONS: cfDNA-based MSI detection using Guardant360 is highly concordant with tissue-based testing, enabling highly accurate detection of MSI status concurrent with comprehensive genomic profiling and expanding access to immunotherapy for patients with advanced cancer for whom current testing practices are inadequate.See related commentary by Wang and Ajani, p. 6887.


Asunto(s)
Biomarcadores de Tumor/genética , ADN Tumoral Circulante/sangre , ADN Tumoral Circulante/genética , Inestabilidad de Microsatélites , Neoplasias/genética , Biomarcadores de Tumor/sangre , Estudios de Casos y Controles , Estudios de Seguimiento , Genotipo , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Neoplasias/sangre , Neoplasias/patología , Pronóstico
7.
J Comput Biol ; 24(6): 558-570, 2017 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-27901586

RESUMEN

As a result of a high rate of mutations and recombination events, an RNA-virus exists as a heterogeneous "swarm" of mutant variants. The long read length offered by single-molecule sequencing technologies allows each mutant variant to be sequenced in a single pass. However, high error rate limits the ability to reconstruct heterogeneous viral population composed of rare, related mutant variants. In this article, we present two single-nucleotide variants (2SNV), a method able to tolerate the high error rate of the single-molecule protocol and reconstruct mutant variants. 2SNV uses linkage between single-nucleotide variations to efficiently distinguish them from read errors. To benchmark the sensitivity of 2SNV, we performed a single-molecule sequencing experiment on a sample containing a titrated level of known viral mutant variants. Our method is able to accurately reconstruct clone with frequency of 0.2% and distinguish clones that differed in only two nucleotides distantly located on the genome. 2SNV outperforms existing methods for full-length viral mutant reconstruction.


Asunto(s)
Algoritmos , Biología Computacional/métodos , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Mutación , Orthomyxoviridae/genética , Análisis de Secuencia de ADN/métodos , Análisis por Conglomerados , Genoma Viral , Humanos
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA