Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 48
Filtrar
Más filtros

Tipo del documento
Intervalo de año de publicación
1.
Proc Natl Acad Sci U S A ; 120(24): e2220294120, 2023 06 13.
Artículo en Inglés | MEDLINE | ID: mdl-37276424

RESUMEN

A hepatitis C virus (HCV) vaccine is urgently needed. Vaccine development has been hindered by HCV's genetic diversity, particularly within the immunodominant hypervariable region 1 (HVR1). Here, we developed a strategy to elicit broadly neutralizing antibodies to HVR1, which had previously been considered infeasible. We first applied a unique information theory-based measure of genetic distance to evaluate phenotypic relatedness between HVR1 variants. These distances were used to model the structure of HVR1's sequence space, which was found to have five major clusters. Variants from each cluster were used to immunize mice individually, and as a pentavalent mixture. Sera obtained following immunization neutralized every variant in a diverse HCVpp panel (n = 10), including those resistant to monovalent immunization, and at higher mean titers (1/ID50 = 435) than a glycoprotein E2 (1/ID50 = 205) vaccine. This synergistic immune response offers a unique approach to overcoming antigenic variability and may be applicable to other highly mutable viruses.


Asunto(s)
Hepacivirus , Hepatitis C , Animales , Ratones , Proteínas del Envoltorio Viral/genética , Inmunización , Inmunidad , Anticuerpos contra la Hepatitis C , Anticuerpos Neutralizantes
2.
BMC Bioinformatics ; 23(1): 62, 2022 Feb 08.
Artículo en Inglés | MEDLINE | ID: mdl-35135469

RESUMEN

BACKGROUND: Investigation of outbreaks to identify the primary case is crucial for the interruption and prevention of transmission of infectious diseases. These individuals may have a higher risk of participating in near future transmission events when compared to the other patients in the outbreak, so directing more transmission prevention resources towards these individuals is a priority. Although the genetic characterization of intra-host viral populations can aid the identification of transmission clusters, it is not trivial to determine the directionality of transmissions during outbreaks, owing to complexity of viral evolution. Here, we present a new computational framework, PYCIVO: primary case inference in viral outbreaks. This framework expands upon our earlier work in development of QUENTIN, which builds a probabilistic disease transmission tree based on simulation of evolution of intra-host hepatitis C virus (HCV) variants between cases involved in direct transmission during an outbreak. PYCIVO improves upon QUENTIN by also adding a custom heterogeneity index and identifying the scenario when the primary case may have not been sampled. RESULTS: These approaches were validated using a set of 105 sequence samples from 11 distinct HCV transmission clusters identified during outbreak investigations, in which the primary case was epidemiologically verified. Both models can detect the correct primary case in 9 out of 11 transmission clusters (81.8%). However, while QUENTIN issues erroneous predictions on the remaining 2 transmission clusters, PYCIVO issues a null output for these clusters, giving it an effective prediction accuracy of 100%. To further evaluate accuracy of the inference, we created 10 modified transmission clusters in which the primary case had been removed. In this scenario, PYCIVO was able to correctly identify that there was no primary case in 8/10 (80%) of these modified clusters. This model was validated with HCV; however, this approach may be applicable to other microbial pathogens. CONCLUSIONS: PYCIVO improves upon QUENTIN by also implementing a custom heterogeneity index which empowers PYCIVO to make the important 'No primary case' prediction. One or more samples, possibly including the primary case, may have not been sampled, and this designation is meant to account for these scenarios.


Asunto(s)
Enfermedades Transmisibles , Hepatitis C , Biología Computacional , Brotes de Enfermedades , Hepacivirus/genética , Hepatitis C/epidemiología , Humanos , Filogenia
3.
BMC Bioinformatics ; 21(Suppl 18): 482, 2020 Dec 30.
Artículo en Inglés | MEDLINE | ID: mdl-33375937

RESUMEN

BACKGROUND: In molecular epidemiology, comparison of intra-host viral variants among infected persons is frequently used for tracing transmissions in human population and detecting viral infection outbreaks. Application of Ultra-Deep Sequencing (UDS) immensely increases the sensitivity of transmission detection but brings considerable computational challenges when comparing all pairs of sequences. We developed a new population comparison method based on convex hulls in hamming space. We applied this method to a large set of UDS samples obtained from unrelated cases infected with hepatitis C virus (HCV) and compared its performance with three previously published methods. RESULTS: The convex hull in hamming space is a data structure that provides information on: (1) average hamming distance within the set, (2) average hamming distance between two sets; (3) closeness centrality of each sequence; and (4) lower and upper bound of all the pairwise distances among the members of two sets. This filtering strategy rapidly and correctly removes 96.2% of all pairwise HCV sample comparisons, outperforming all previous methods. The convex hull distance (CHD) algorithm showed variable performance depending on sequence heterogeneity of the studied populations in real and simulated datasets, suggesting the possibility of using clustering methods to improve the performance. To address this issue, we developed a new clustering algorithm, k-hulls, that reduces heterogeneity of the convex hull. This efficient algorithm is an extension of the k-means algorithm and can be used with any type of categorical data. It is 6.8-times more accurate than k-mode, a previously developed clustering algorithm for categorical data. CONCLUSIONS: CHD is a fast and efficient filtering strategy for massively reducing the computational burden of pairwise comparison among large samples of sequences, and thus, aiding the calculation of transmission links among infected individuals using threshold-based methods. In addition, the convex hull efficiently obtains important summary metrics for intra-host viral populations.


Asunto(s)
Algoritmos , Genómica/métodos , Análisis por Conglomerados , Hepacivirus/genética , Humanos
4.
Bioinformatics ; 34(1): 163-170, 2018 01 01.
Artículo en Inglés | MEDLINE | ID: mdl-29304222

RESUMEN

Motivation: Genomic analysis has become one of the major tools for disease outbreak investigations. However, existing computational frameworks for inference of transmission history from viral genomic data often do not consider intra-host diversity of pathogens and heavily rely on additional epidemiological data, such as sampling times and exposure intervals. This impedes genomic analysis of outbreaks of highly mutable viruses associated with chronic infections, such as human immunodeficiency virus and hepatitis C virus, whose transmissions are often carried out through minor intra-host variants, while the additional epidemiological information often is either unavailable or has a limited use. Results: The proposed framework QUasispecies Evolution, Network-based Transmission INference (QUENTIN) addresses the above challenges by evolutionary analysis of intra-host viral populations sampled by deep sequencing and Bayesian inference using general properties of social networks relevant to infection dissemination. This method allows inference of transmission direction even without the supporting case-specific epidemiological information, identify transmission clusters and reconstruct transmission history. QUENTIN was validated on experimental and simulated data, and applied to investigate HCV transmission within a community of hosts with high-risk behavior. It is available at https://github.com/skumsp/QUENTIN. Contact: pskums@gsu.edu or alexz@cs.gsu.edu or rahul@sfsu.edu or yek0@cdc.gov. Supplementary information: Supplementary data are available at Bioinformatics online.


Asunto(s)
Genoma Viral , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Cuasiespecies , Análisis de Secuencia de ARN/métodos , Programas Informáticos , Teorema de Bayes , Brotes de Enfermedades , Genómica/métodos , Hepacivirus/genética , Humanos , Análisis de Secuencia de ADN/métodos
5.
Scand J Med Sci Sports ; 29(5): 766-775, 2019 May.
Artículo en Inglés | MEDLINE | ID: mdl-30632640

RESUMEN

INTRODUCTION: This study examined the impact of a multicomponent physical activity (PA) intervention (MOVI-KIDS) on improving cognition in schoolchildren. This paper also analyzed the mediator role of motor fitness between MOVI-KIDS and cognition. METHODS: Propensity score analysis of data from a cluster randomized controlled trial (MOVI-KIDS study). This analysis including 240 5-7 years old children from nine schools in the provinces of Cuenca and Ciudad Real, Spain. MOVI-KIDS program consisted of: (a) three weekly after-school sessions of recreational non-competitive PA lasting 60 minutes during one academic year, (b) educational materials for parents and teachers, and (c) school playground modifications. Changes in cognition (logical reasoning, verbal factor, numerical factor, spatial factor, and general intelligence) were measured. A propensity score cross-cluster matching procedure and mediation analysis (Hayes's PROCESS macro) were conducted. RESULTS: All cognitive variables pre-post mean changes were significantly higher (P ≤ 0.05) in children from intervention schools than those from control schools (effect size ranged from 0.33 to 1.48). The effect of the intervention on the spatial factor and general intelligence was partially mediated by motor fitness (indirect effect = 0.92, 95% CI: 0.36; 1.65; and indirect effect = 1.21, 95% CI: 0.06; 2.62, respectively). CONCLUSIONS: This study shows that a one-school-year multicomponent intervention consisting of a recreational non-competitive PA program, educational materials for parents and teachers, and school playground modifications improved the cognition of first-grade children. Further, our results suggest that the effect of the intervention on cognition was mediated by changes in motor fitness.


Asunto(s)
Cognición , Ejercicio Físico , Educación y Entrenamiento Físico/métodos , Aptitud Física , Niño , Preescolar , Femenino , Humanos , Masculino , Clase Social , España
6.
BMC Bioinformatics ; 19(Suppl 11): 360, 2018 Oct 22.
Artículo en Inglés | MEDLINE | ID: mdl-30343669

RESUMEN

BACKGROUND: Many biological analysis tasks require extraction of families of genetically similar sequences from large datasets produced by Next-generation Sequencing (NGS). Such tasks include detection of viral transmissions by analysis of all genetically close pairs of sequences from viral datasets sampled from infected individuals or studying of evolution of viruses or immune repertoires by analysis of network of intra-host viral variants or antibody clonotypes formed by genetically close sequences. The most obvious naïeve algorithms to extract such sequence families are impractical in light of the massive size of modern NGS datasets. RESULTS: In this paper, we present fast and scalable k-mer-based framework to perform such sequence similarity queries efficiently, which specifically targets data produced by deep sequencing of heterogeneous populations such as viruses. It shows better filtering quality and time performance when comparing to other tools. The tool is freely available for download at https://github.com/vyacheslav-tsivina/signature-sj CONCLUSION: The proposed tool allows for efficient detection of genetic relatedness between genomic samples produced by deep sequencing of heterogeneous populations. It should be especially useful for analysis of relatedness of genomes of viruses with unevenly distributed variable genomic regions, such as HIV and HCV. For the future we envision, that besides applications in molecular epidemiology the tool can also be adapted to immunosequencing and metagenomics data.


Asunto(s)
Algoritmos , Variación Genética , Genoma , Filogenia , Secuencia de Bases , Entropía , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Metagenómica , Reproducibilidad de los Resultados , Factores de Tiempo
7.
BMC Bioinformatics ; 19(Suppl 11): 358, 2018 Oct 22.
Artículo en Inglés | MEDLINE | ID: mdl-30343674

RESUMEN

BACKGROUND: Molecular surveillance and outbreak investigation are important for elimination of hepatitis C virus (HCV) infection in the United States. A web-based system, Global Hepatitis Outbreak and Surveillance Technology (GHOST), has been developed using Illumina MiSeq-based amplicon sequence data derived from the HCV E1/E2-junction genomic region to enable public health institutions to conduct cost-effective and accurate molecular surveillance, outbreak detection and strain characterization. However, as there are many factors that could impact input data quality to which the GHOST system is not completely immune, accuracy of epidemiological inferences generated by GHOST may be affected. Here, we analyze the data submitted to the GHOST system during its pilot phase to assess the nature of the data and to identify common quality concerns that can be detected and corrected automatically. RESULTS: The GHOST quality control filters were individually examined, and quality failure rates were measured for all samples, including negative controls. New filters were developed and introduced to detect primer dimers, loss of specimen-specific product, or short products. The genotyping tool was adjusted to improve the accuracy of subtype calls. The identification of "chordless" cycles in a transmission network from data generated with known laboratory-based quality concerns allowed for further improvement of transmission detection by GHOST in surveillance settings. Parameters derived to detect actionable common quality control anomalies were incorporated into the automatic quality control module that rejects data depending on the magnitude of a quality problem, and warns and guides users in performing correctional actions. The guiding responses generated by the system are tailored to the GHOST laboratory protocol. CONCLUSIONS: Several new quality control problems were identified in MiSeq data submitted to GHOST and used to improve protection of the system from erroneous data and users from erroneous inferences. The GHOST system was upgraded to include identification of causes of erroneous data and recommendation of corrective actions to laboratory users.


Asunto(s)
Brotes de Enfermedades/prevención & control , Vigilancia de la Población/métodos , Automatización , Técnicas de Genotipaje , Hepacivirus/fisiología , Hepatitis C/epidemiología , Hepatitis C/virología , Humanos , Control de Calidad , Estándares de Referencia , Estados Unidos
8.
BMC Genomics ; 18(Suppl 10): 881, 2017 Dec 06.
Artículo en Inglés | MEDLINE | ID: mdl-29244001

RESUMEN

BACKGROUND: Intra-host hepatitis C virus (HCV) populations are genetically heterogeneous and organized in subpopulations. With the exception of blood transfusions, transmission of HCV occurs via a small number of genetic variants, the effect of which is frequently described as a bottleneck. Stochasticity of transmission associated with the bottleneck is usually used to explain genetic differences among HCV populations identified in the source and recipient cases, which may be further exacerbated by intra-host HCV evolution and differential biological capacity of HCV variants to successfully establish a population in a new host. RESULTS: Transmissibility was formulated as a property that can be measured from experimental Ultra-Deep Sequencing (UDS) data. The UDS data were obtained from one large hepatitis C outbreak involving an epidemiologically defined source and 18 recipient cases. k-Step networks of HCV variants were constructed and used to identify a potential association between transmissibility and network centrality of individual HCV variants from the source. An additional dataset obtained from nine other HCV outbreaks with known directionality of transmission was used for validation. Transmissibility was not found to be dependent on high frequency of variants in the source, supporting the earlier observations of transmission of minority variants. Among all tested measures of centrality, the highest correlation of transmissibility was found with Hamming centrality (r = 0.720; p = 1.57 E-71). Correlation between genetic distances and differences in transmissibility among HCV variants from the source was found to be 0.3276 (Mantel Test, p = 9.99 E-5), indicating association between genetic proximity and transmissibility. A strong correlation ranging from 0.565-0.947 was observed between Hamming centrality and transmissibility in 7 of the 9 additional transmission clusters (p < 0.05). CONCLUSIONS: Transmission is not an exclusively stochastic process. Transmissibility, as formally measured in this study, is associated with certain biological properties that also define location of variants in the genetic space occupied by the HCV strain from the source. The measure may also be applicable to other highly heterogeneous viruses. Besides improving accuracy of outbreak investigations, this finding helps with the understanding of molecular mechanisms contributing to establishment of chronic HCV infection.


Asunto(s)
Variación Genética , Hepacivirus/genética , Hepacivirus/fisiología , Brotes de Enfermedades , Evolución Molecular , Genotipo , Hepatitis C/epidemiología , Hepatitis C/transmisión , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos
9.
BMC Genomics ; 18(Suppl 4): 372, 2017 05 24.
Artículo en Inglés | MEDLINE | ID: mdl-28589864

RESUMEN

BACKGROUND: Hepatitis C is a major public health problem in the United States and worldwide. Outbreaks of hepatitis C virus (HCV) infections associated with unsafe injection practices, drug diversion, and other exposures to blood are difficult to detect and investigate. Molecular analysis has been frequently used in the study of HCV outbreaks and transmission chains; helping identify a cluster of sequences as linked by transmission if their genetic distances are below a previously defined threshold. However, HCV exists as a population of numerous variants in each infected individual and it has been observed that minority variants in the source are often the ones responsible for transmission, a situation that precludes the use of a single sequence per individual because many such transmissions would be missed. The use of Next-Generation Sequencing immensely increases the sensitivity of transmission detection but brings a considerable computational challenge because all sequences need to be compared among all pairs of samples. METHODS: We developed a three-step strategy that filters pairs of samples according to different criteria: (i) a k-mer bloom filter, (ii) a Levenhstein filter and (iii) a filter of identical sequences. We applied these three filters on a set of samples that cover the spectrum of genetic relationships among HCV cases, from being part of the same transmission cluster, to belonging to different subtypes. RESULTS: Our three-step filtering strategy rapidly removes 85.1% of all the pairwise sample comparisons and 91.0% of all pairwise sequence comparisons, accurately establishing which pairs of HCV samples are below the relatedness threshold. CONCLUSIONS: We present a fast and efficient three-step filtering strategy that removes most sequence comparisons and accurately establishes transmission links of any threshold-based method. This highly efficient workflow will allow a faster response and molecular detection capacity, improving the rate of detection of viral transmissions with molecular data.


Asunto(s)
Hepacivirus/genética , Hepacivirus/fisiología , Secuenciación de Nucleótidos de Alto Rendimiento , Algoritmos , Estadística como Asunto
10.
BMC Genomics ; 18(Suppl 10): 916, 2017 Dec 06.
Artículo en Inglés | MEDLINE | ID: mdl-29244005

RESUMEN

BACKGROUND: Hepatitis C is a major public health problem in the United States and worldwide. Outbreaks of hepatitis C virus (HCV) infections associated with unsafe injection practices, drug diversion, and other exposures to blood are difficult to detect and investigate. Effective HCV outbreak investigation requires comprehensive surveillance and robust case investigation. We previously developed and validated a methodology for the rapid and cost-effective identification of HCV transmission clusters. Global Hepatitis Outbreak and Surveillance Technology (GHOST) is a cloud-based system enabling users, regardless of computational expertise, to analyze and visualize transmission clusters in an independent, accurate and reproducible way. RESULTS: We present and explore performance of several GHOST implemented algorithms using next-generation sequencing data experimentally obtained from hypervariable region 1 of genetically related and unrelated HCV strains. GHOST processes data from an entire MiSeq run in approximately 3 h. A panel of seven specimens was used for preparation of six repeats of MiSeq libraries. Testing sequence data from these libraries by GHOST showed a consistent transmission linkage detection, testifying to high reproducibility of the system. Lack of linkage among genetically unrelated HCV strains and constant detection of genetic linkage between HCV strains from known transmission pairs and from follow-up specimens at different levels of MiSeq-read sampling indicate high specificity and sensitivity of GHOST in accurate detection of HCV transmission. CONCLUSIONS: GHOST enables automatic extraction of timely and relevant public health information suitable for guiding effective intervention measures. It is designed as a virtual diagnostic system intended for use in molecular surveillance and outbreak investigations rather than in research. The system produces accurate and reproducible information on HCV transmission clusters for all users, irrespective of their level of bioinformatics expertise. Improvement in molecular detection capacity will contribute to increasing the rate of transmission detection, thus providing opportunity for rapid, accurate and effective response to outbreaks of hepatitis C. Although GHOST was originally developed for hepatitis C surveillance, its modular structure is readily applicable to other infectious diseases. Worldwide availability of GHOST for the detection of HCV transmissions will foster deeper involvement of public health researchers and practitioners in hepatitis C outbreak investigation.


Asunto(s)
Nube Computacional , Biología Computacional/métodos , Brotes de Enfermedades/estadística & datos numéricos , Monitoreo Epidemiológico , Hepatitis C/epidemiología , Internacionalidad , Algoritmos , Humanos , Programas Informáticos , Interfaz Usuario-Computador
11.
J Gen Virol ; 98(5): 1048-1057, 2017 May.
Artículo en Inglés | MEDLINE | ID: mdl-28537543

RESUMEN

Despite the significant public health problems associated with hepatitis B virus (HBV) in sub-Saharan Africa, many countries in this region do not have systematic HBV surveillance or genetic information on HBV circulating locally. Here, we report on the genetic characterization of 772 HBV strains from Tanzania. Phylogenetic analysis of the S-gene sequences showed prevalence of HBV genotype A (HBV/A, n=671, 86.9 %), followed by genotypes D (HBV/D, n=95, 12.3 %) and E (HBV/E, n=6, 0.8 %). All HBV/A sequences were further classified into subtype A1, while the HBV/D sequences were assigned to a new cluster. Among the Tanzanian sequences, 84 % of HBV/A1 and 94 % of HBV/D were unique. The Tanzanian and global HBV/A1 sequences were compared and were completely intermixed in the phylogenetic tree, with the Tanzanian sequences frequently generating long terminal branches, indicating a long history of HBV/A1 infections in the country. The time to the most recent common ancestor was estimated to be 188 years ago [95 % highest posterior density (HPD): 132 to 265 years] for HBV/A1 and 127 years ago (95 % HPD: 79 to 192 years) for HBV/D. The Bayesian skyline plot showed that the number of transmissions 'exploded' exponentially between 1960-1970 for HBV/A1 and 1970-1990 for HBV/D, with the effective population of HBV/A1 having expanded twice as much as that of HBV/D. The data suggest that Tanzania is at least a part of the geographic origin of the HBV/A1 subtype. A recent increase in the transmission rate and significant HBV genetic diversity should be taken into consideration when devising public health interventions to control HBV infections in Tanzania.

12.
BMC Palliat Care ; 16(1): 75, 2017 Dec 19.
Artículo en Inglés | MEDLINE | ID: mdl-29258495

RESUMEN

BACKGROUND: Amyotrophic lateral sclerosis (ALS) is an incurable neurodegenerative disease that dramatically affects patients' quality of life (QoL) and dignity of life (DoL). We aimed to study the impact of ALS on QoL and DoL and how these evolve throughout the duration of the disease. METHODS: First, we performed an observational, descriptive study of 43 patients with ALS recruited from the ALS unit at our center and compared them with 20 healthy age- and sex-matched controls. Second, we performed a prospective cohort study, following up 23 patients with ALS over 3 months. All participants completed questionnaires about their functional status, QoL, and DoL. RESULTS: QoL and DoL were significantly worse in the ALS group than in controls (both p < 0.001). During the three-month follow-up in the ALS cohort, statistically significant declines were observed in clinical status and QoL. For clinical status, median scores on the ALS Functional Rating scale changed from 30.95 points at baseline to 27.24 points after 3 months (p = 0.0003). For QoL, median scores on the ALS Assessment Questionnaire changed from 124.19 points at baseline to 131.81 at 3 months (p = 0.0062). However, no significant differences were found between the DoL scores at baseline (48.14 points) and 3 months (45 points) (p-value = 0.12). CONCLUSIONS: ALS is a neurodegenerative disease that affects QoL and DoL alike. We found that clinical status and QoL both deteriorated in patients with ALS as the disease progressed, but that DoL was preserved. However, our findings are limited by small sample sizes. The preservation of DoL may be due to multiple factors, including the therapies provided by the ALS unit. These findings suggest that alongside QoL, DoL may be an important target in the management and care of ALS patients.


Asunto(s)
Esclerosis Amiotrófica Lateral/psicología , Estado de Salud , Calidad de Vida/psicología , Adulto , Anciano , Femenino , Humanos , Masculino , Persona de Mediana Edad , Estudios Prospectivos , Psicometría/instrumentación , Psicometría/métodos , España , Encuestas y Cuestionarios
13.
J Infect Dis ; 213(6): 957-65, 2016 Mar 15.
Artículo en Inglés | MEDLINE | ID: mdl-26582955

RESUMEN

Hepatitis C is a major public health problem in the United States and worldwide. Outbreaks of hepatitis C virus (HCV) infections are associated with unsafe injection practices, drug diversion, and other exposures to blood and are difficult to detect and investigate. Here, we developed and validated a simple approach for molecular detection of HCV transmissions in outbreak settings. We obtained sequences from the HCV hypervariable region 1 (HVR1), using end-point limiting-dilution (EPLD) technique, from 127 cases involved in 32 epidemiologically defined HCV outbreaks and 193 individuals with unrelated HCV strains. We compared several types of genetic distances and calculated a threshold, using minimal Hamming distances, that identifies transmission clusters in all tested outbreaks with 100% accuracy. The approach was also validated on sequences obtained using next-generation sequencing from HCV strains recovered from 239 individuals, and findings showed the same accuracy as that for EPLD. On average, the nucleotide diversity of the intrahost population was 6.2 times greater in the source case than in any incident case, allowing the correct detection of transmission direction in 8 outbreaks for which source cases were known. A simple and accurate distance-based approach developed here for detecting HCV transmissions streamlines molecular investigation of outbreaks, thus improving the public health capacity for rapid and effective control of hepatitis C.


Asunto(s)
Brotes de Enfermedades , Ligamiento Genético , Hepacivirus/genética , Hepacivirus/aislamiento & purificación , Hepatitis C/transmisión , Hepatitis C/virología , Análisis por Conglomerados , Variación Genética , Genotipo , Hepatitis C/epidemiología , Humanos , Reproducibilidad de los Resultados
14.
Bioinformatics ; 31(5): 682-90, 2015 Mar 01.
Artículo en Inglés | MEDLINE | ID: mdl-25359889

RESUMEN

MOTIVATION: Next-generation sequencing (NGS) allows for analyzing a large number of viral sequences from infected patients, providing an opportunity to implement large-scale molecular surveillance of viral diseases. However, despite improvements in technology, traditional protocols for NGS of large numbers of samples are still highly cost and labor intensive. One of the possible cost-effective alternatives is combinatorial pooling. Although a number of pooling strategies for consensus sequencing of DNA samples and detection of SNPs have been proposed, these strategies cannot be applied to sequencing of highly heterogeneous viral populations. RESULTS: We developed a cost-effective and reliable protocol for sequencing of viral samples, that combines NGS using barcoding and combinatorial pooling and a computational framework including algorithms for optimal virus-specific pools design and deconvolution of individual samples from sequenced pools. Evaluation of the framework on experimental and simulated data for hepatitis C virus showed that it substantially reduces the sequencing costs and allows deconvolution of viral populations with a high accuracy. AVAILABILITY AND IMPLEMENTATION: The source code and experimental data sets are available at http://alan.cs.gsu.edu/NGS/?q=content/pooling.


Asunto(s)
Algoritmos , Biología Computacional/métodos , ADN Viral/genética , Genoma Viral , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Análisis de Secuencia de ADN/métodos , Virus/clasificación , Virus/genética , Variación Genética , Hepacivirus/clasificación , Hepacivirus/genética , Humanos
15.
J Infect Dis ; 212(12): 1962-9, 2015 Dec 15.
Artículo en Inglés | MEDLINE | ID: mdl-26155829

RESUMEN

BACKGROUND: Up to 30% of acute viral hepatitis has no known etiology. To determine the disease etiology in patients with acute hepatitis of unknown etiology (HUE), serum specimens were obtained from 38 patients residing in the United Kingdom and Vietnam and from 26 healthy US blood donors. All specimens tested negative for known viral infections causing hepatitis, using commercially available serological and nucleic acid assays. METHODS: Specimens were processed by sequence-independent complementary DNA amplification and next-generation sequencing (NGS). Sufficient material for individual NGS libraries was obtained from 12 HUE cases and 26 blood donors; the remaining HUE cases were sequenced as a pool. Read mapping was done by targeted and de novo assembly. RESULTS: Sequences from hepatitis B virus (HBV) were detected in 7 individuals with HUE (58.3%) and the pooled library, and hepatitis E virus (HEV) was detected in 2 individuals with HUE (16.7%) and the pooled library. Both HEV-positive cases were coinfected with HBV. HBV sequences belonged to genotypes A, D, or G, and HEV sequences belonged to genotype 3. No known hepatotropic viruses were detected in the tested normal human sera. CONCLUSIONS: NGS-based detection of HBV and HEV infections is more sensitive than using commercially available assays. HBV and HEV may be cryptically associated with HUE.


Asunto(s)
Sangre/virología , Pruebas Diagnósticas de Rutina/métodos , Virus de la Hepatitis B/aislamiento & purificación , Virus de la Hepatitis E/aislamiento & purificación , Hepatitis Viral Humana/diagnóstico , Hepatitis Viral Humana/etiología , Adulto , Anciano , Coinfección/virología , Femenino , Virus de la Hepatitis B/genética , Virus de la Hepatitis E/genética , Humanos , Masculino , Persona de Mediana Edad , Sensibilidad y Especificidad , Análisis de Secuencia de ADN , Reino Unido , Estados Unidos , Vietnam , Adulto Joven
16.
J Virol ; 88(24): 13971-80, 2014 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-25187549

RESUMEN

UNLABELLED: The recent epidemic history of hepatitis B virus (HBV) infections in the United States is complex, as indicated by current disparity in HBV genotype distribution between acute and chronic hepatitis B cases and the rapid decline in hepatitis B incidence since the 1990s. We report temporal changes in the genetic composition of the HBV population using whole-genome sequences (n = 179) from acute hepatitis B cases (n = 1,206) identified through the Sentinel County Surveillance for Acute Hepatitis (1998 to 2006). HBV belonged mainly to subtypes A2 (75%) and D3 (18%), with times of their most recent common ancestors being 1979 and 1987, respectively. A2 underwent rapid population expansions in ca. 1995 and ca. 2002, coinciding with transient rises in acute hepatitis B notification rates among adults; D3 underwent expansion in ca. 1998. A2 strains from cases identified after 2002, compared to those before 2002, tended to cluster phylogenetically, indicating selective expansion of specific strains, and were significantly reduced in genetic diversity (P = 0.001) and frequency of drug resistance mutations (P = 0.001). The expansion of genetically close HBV A2 strains was associated with risk of infection among male homosexuals (P = 0.03). Incident HBV strains circulating in the United States were recent in origin and restricted in genetic diversity. Disparate transmission dynamics among phylogenetic lineages affected the genetic composition of HBV populations and their capacity to maintain drug resistance mutations. The tendency of selectively expanding HBV strains to be transmitted among male homosexuals highlights the need to improve hepatitis B vaccination coverage among at-risk adults. IMPORTANCE: Hepatitis B virus (HBV) remains an important cause of acute and chronic liver disease globally and in the United States. Genetic analysis of HBV whole genomes from cases of acute hepatitis B identified from 1998 to 2006 in the United States showed dominance of genotype A2 (75%), followed by D3 (18%). Strains of both subtypes were recent in origin and underwent rapid population expansions from 1995 to 2000, indicating increase in transmission rate for certain HBV strains during a period of decline in the reported incidence of acute hepatitis B in the United States. HBV A2 strains from a particular cluster that experienced the most recent population expansion were more commonly detected among men who have sex with men. Vaccination needs to be stepped up to protect persons who remain at risk of HBV infection.


Asunto(s)
Variación Genética , Virus de la Hepatitis B/clasificación , Virus de la Hepatitis B/genética , Hepatitis B/epidemiología , Hepatitis B/virología , Adulto , Análisis por Conglomerados , ADN Viral/química , ADN Viral/genética , Femenino , Genoma Viral , Genotipo , Hepatitis B/transmisión , Humanos , Masculino , Epidemiología Molecular , Datos de Secuencia Molecular , Filogenia , Análisis de Secuencia de ADN , Estados Unidos/epidemiología
17.
BMC Genomics ; 15 Suppl 5: S4, 2014.
Artículo en Inglés | MEDLINE | ID: mdl-25081811

RESUMEN

BACKGROUND: Next-generation sequencing (NGS) allows for sampling numerous viral variants from infected patients. This provides a novel opportunity to represent and study the mutational landscape of Hepatitis C Virus (HCV) within a single host. RESULTS: Intra-host variants of the HCV E1/E2 region were extensively sampled from 58 chronically infected patients. After NGS error correction, the average number of reads and variants obtained from each sample were 3202 and 464, respectively. The distance between each pair of variants was calculated and networks were created for each patient, where each node is a variant and two nodes are connected by a link if the nucleotide distance between them is 1. The work focused on large components having > 5% of all reads, which in average account for 93.7% of all reads found in a patient. CONCLUSIONS: Most intra-host variants are organized into distinct single-mutation components that are: well separated from each other, represent genetic distances between viral variants, robust to sampling, reproducible and likely seeded during transmission events. Facilitated by NGS, large components offer a novel evolutionary framework for genetic analysis of intra-host viral populations and understanding transmission, immune escape and drug resistance.


Asunto(s)
Variación Genética , Hepacivirus/genética , Secuenciación de Nucleótidos de Alto Rendimiento , Mutación , Simulación por Computador , Genotipo , Hepatitis C/transmisión , Humanos , Compartición de Agujas , ARN Viral/genética , Análisis de Secuencia de ADN
18.
J Med Virol ; 86(5): 765-71, 2014 May.
Artículo en Inglés | MEDLINE | ID: mdl-24519518

RESUMEN

Hepatitis C virus (HCV) infection presents an important, but underappreciated public health problem in Africa. In Côte d'Ivoire, very little is known about the molecular dynamics of HCV infection. Plasma samples (n = 608) from pregnant women collected in 1995 from Côte d'Ivoire were analyzed in this study. Only 18 specimens (∼3%) were found to be HCV PCR-positive. Phylogenetic analysis of the HCV NS5b sequences showed that the HCV variants belong to genotype 1 (HCV1) (n = 12, 67%) and genotype 2 (HCV2) (n = 6, 33%), with a maximum genetic diversity among HCV variants in each genotype being 20.7% and 24.0%, respectively. Although all HCV2 variants were genetically distant from each other, six HCV1 variants formed two tight sub-clusters belonging to HCV1a and HCV1b. Analysis of molecular variance (AMOVA) showed that the genetic structure of HCV isolates from West Africa with Côte d'Ivoire included were significantly different from Central African strains (P = 0.0001). Examination of intra-host viral populations using next-generation sequencing of the HCV HVR1 showed a significant variation in intra-host genetic diversity among infected individuals, with some strains composed of sub-populations as distant from each other as viral populations from different hosts. Collectively, the results indicate a complex HCV evolution in Côte d'Ivoire, similar to the rest of West Africa, and suggest a unique HCV epidemic history in the country.


Asunto(s)
Enfermedades Endémicas , Evolución Molecular , Variación Genética , Hepacivirus/clasificación , Hepacivirus/genética , Hepatitis C Crónica/epidemiología , Hepatitis C Crónica/virología , África , África Occidental , Análisis por Conglomerados , Côte d'Ivoire/epidemiología , Femenino , Genotipo , Hepacivirus/aislamiento & purificación , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Datos de Secuencia Molecular , Filogenia , Embarazo , Complicaciones Infecciosas del Embarazo/epidemiología , Complicaciones Infecciosas del Embarazo/virología , ARN Viral/genética , Proteínas no Estructurales Virales/genética
19.
J Infect Dis ; 207(6): 999-1006, 2013 Mar 15.
Artículo en Inglés | MEDLINE | ID: mdl-23300164

RESUMEN

The molecular detection of transmission of rapidly mutating pathogens such as hepatitis C virus (HCV) is commonly achieved by assessing the genetic relatedness of strains among infected patients. We describe the development of a novel mass spectrometry (MS)-based approach to identify HCV transmission. MS was used to detect products of base-specific cleavage of RNA molecules obtained from HCV polymerase chain reaction fragments. The MS-peak profiles were found to reflect variation in the HCV genomic sequence and the intrahost composition of the HCV population. Serum specimens originating from 60 case patients from 14 epidemiologically confirmed outbreaks and 25 unrelated controls were tested. Neighbor-joining trees constructed using MS-peak profile-based Hamming distances showed 100% accuracy, and linkage networks constructed using a threshold established from the Hamming distances between epidemiologically unrelated cases showed 100% sensitivity and 99.93% specificity in transmission detection. This MS-based approach is rapid, robust, reproducible, cost-effective, and applicable to investigating transmissions of other pathogens.


Asunto(s)
ADN Viral/aislamiento & purificación , Hepacivirus/aislamiento & purificación , Hepatitis C/epidemiología , Hepatitis C/transmisión , Espectrometría de Masas/métodos , Análisis de Varianza , ADN Viral/sangre , Hepacivirus/genética , Hepatitis C/sangre , Humanos , Epidemiología Molecular , Filogenia , Reacción en Cadena de la Polimerasa , ARN Viral/sangre , Sensibilidad y Especificidad , Estados Unidos/epidemiología
20.
J Comput Biol ; 30(4): 420-431, 2023 04.
Artículo en Inglés | MEDLINE | ID: mdl-36602524

RESUMEN

Application of genetic distances to measure phenotypic relatedness is a challenging task, reflecting the complex relationship between genotype and phenotype. Accurate assessment of proximity among sequences with different phenotypic traits depends on how strongly the chosen distance is associated with structural and functional properties. In this study, we present a new distance measure Mutual Information and Entropy H (MIH) for categorical data such as nucleotide or amino acid sequences. MIH applies an information matrix (IM), which is calculated from the data and captures heterogeneity of individual positions as measured by Shannon entropy and coordinated substitutions among positions as measured by mutual information. In general, MIH assigns low weights to differences occurring at high entropy positions or at dependent positions. MIH distance was compared with other common distances on two experimental and two simulated data sets. MIH showed the best ability to distinguish cross-immunoreactive sequence pairs from non-cross-immunoreactive pairs of variants of the hepatitis C virus hypervariable region 1 (26,883 pairwise comparisons), and Major Histocompatibility Complex (MHC) binding peptides (n = 181) from non-binding peptides (n = 129). Analysis of 74 simulated RNA secondary structures also showed that the ratio between MIH distance of sequences from the same RNA structure and MIH of sequences from different structures is three orders of magnitude greater than for Hamming distances. These findings indicate that lower MIH between two sequences is associated with greater probability of the sequences to belong to the same phenotype. Examination of rule-based phenotypes generated in silico showed that (1) MIH is strongly associated with phenotypic differences, (2) IM of sequences under selection is very different from IM generated under random scenarios, and (3) IM is robust to sampling. In conclusion, MIH strongly approximates structural/functional distances and should have important applications to a wide range of biological problems, including evolution, artificial selection of biological functions and structures, and measuring phenotypic similarity.


Asunto(s)
Péptidos , ARN , Secuencia de Aminoácidos , Fenotipo
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA