Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 36
Filtrar
Más filtros

Tipo del documento
Intervalo de año de publicación
1.
Proc Natl Acad Sci U S A ; 120(24): e2220294120, 2023 06 13.
Artículo en Inglés | MEDLINE | ID: mdl-37276424

RESUMEN

A hepatitis C virus (HCV) vaccine is urgently needed. Vaccine development has been hindered by HCV's genetic diversity, particularly within the immunodominant hypervariable region 1 (HVR1). Here, we developed a strategy to elicit broadly neutralizing antibodies to HVR1, which had previously been considered infeasible. We first applied a unique information theory-based measure of genetic distance to evaluate phenotypic relatedness between HVR1 variants. These distances were used to model the structure of HVR1's sequence space, which was found to have five major clusters. Variants from each cluster were used to immunize mice individually, and as a pentavalent mixture. Sera obtained following immunization neutralized every variant in a diverse HCVpp panel (n = 10), including those resistant to monovalent immunization, and at higher mean titers (1/ID50 = 435) than a glycoprotein E2 (1/ID50 = 205) vaccine. This synergistic immune response offers a unique approach to overcoming antigenic variability and may be applicable to other highly mutable viruses.


Asunto(s)
Hepacivirus , Hepatitis C , Animales , Ratones , Proteínas del Envoltorio Viral/genética , Inmunización , Inmunidad , Anticuerpos contra la Hepatitis C , Anticuerpos Neutralizantes
2.
BMC Bioinformatics ; 23(1): 62, 2022 Feb 08.
Artículo en Inglés | MEDLINE | ID: mdl-35135469

RESUMEN

BACKGROUND: Investigation of outbreaks to identify the primary case is crucial for the interruption and prevention of transmission of infectious diseases. These individuals may have a higher risk of participating in near future transmission events when compared to the other patients in the outbreak, so directing more transmission prevention resources towards these individuals is a priority. Although the genetic characterization of intra-host viral populations can aid the identification of transmission clusters, it is not trivial to determine the directionality of transmissions during outbreaks, owing to complexity of viral evolution. Here, we present a new computational framework, PYCIVO: primary case inference in viral outbreaks. This framework expands upon our earlier work in development of QUENTIN, which builds a probabilistic disease transmission tree based on simulation of evolution of intra-host hepatitis C virus (HCV) variants between cases involved in direct transmission during an outbreak. PYCIVO improves upon QUENTIN by also adding a custom heterogeneity index and identifying the scenario when the primary case may have not been sampled. RESULTS: These approaches were validated using a set of 105 sequence samples from 11 distinct HCV transmission clusters identified during outbreak investigations, in which the primary case was epidemiologically verified. Both models can detect the correct primary case in 9 out of 11 transmission clusters (81.8%). However, while QUENTIN issues erroneous predictions on the remaining 2 transmission clusters, PYCIVO issues a null output for these clusters, giving it an effective prediction accuracy of 100%. To further evaluate accuracy of the inference, we created 10 modified transmission clusters in which the primary case had been removed. In this scenario, PYCIVO was able to correctly identify that there was no primary case in 8/10 (80%) of these modified clusters. This model was validated with HCV; however, this approach may be applicable to other microbial pathogens. CONCLUSIONS: PYCIVO improves upon QUENTIN by also implementing a custom heterogeneity index which empowers PYCIVO to make the important 'No primary case' prediction. One or more samples, possibly including the primary case, may have not been sampled, and this designation is meant to account for these scenarios.


Asunto(s)
Enfermedades Transmisibles , Hepatitis C , Biología Computacional , Brotes de Enfermedades , Hepacivirus/genética , Hepatitis C/epidemiología , Humanos , Filogenia
3.
BMC Bioinformatics ; 21(Suppl 18): 482, 2020 Dec 30.
Artículo en Inglés | MEDLINE | ID: mdl-33375937

RESUMEN

BACKGROUND: In molecular epidemiology, comparison of intra-host viral variants among infected persons is frequently used for tracing transmissions in human population and detecting viral infection outbreaks. Application of Ultra-Deep Sequencing (UDS) immensely increases the sensitivity of transmission detection but brings considerable computational challenges when comparing all pairs of sequences. We developed a new population comparison method based on convex hulls in hamming space. We applied this method to a large set of UDS samples obtained from unrelated cases infected with hepatitis C virus (HCV) and compared its performance with three previously published methods. RESULTS: The convex hull in hamming space is a data structure that provides information on: (1) average hamming distance within the set, (2) average hamming distance between two sets; (3) closeness centrality of each sequence; and (4) lower and upper bound of all the pairwise distances among the members of two sets. This filtering strategy rapidly and correctly removes 96.2% of all pairwise HCV sample comparisons, outperforming all previous methods. The convex hull distance (CHD) algorithm showed variable performance depending on sequence heterogeneity of the studied populations in real and simulated datasets, suggesting the possibility of using clustering methods to improve the performance. To address this issue, we developed a new clustering algorithm, k-hulls, that reduces heterogeneity of the convex hull. This efficient algorithm is an extension of the k-means algorithm and can be used with any type of categorical data. It is 6.8-times more accurate than k-mode, a previously developed clustering algorithm for categorical data. CONCLUSIONS: CHD is a fast and efficient filtering strategy for massively reducing the computational burden of pairwise comparison among large samples of sequences, and thus, aiding the calculation of transmission links among infected individuals using threshold-based methods. In addition, the convex hull efficiently obtains important summary metrics for intra-host viral populations.


Asunto(s)
Algoritmos , Genómica/métodos , Análisis por Conglomerados , Hepacivirus/genética , Humanos
4.
BMC Bioinformatics ; 19(Suppl 11): 360, 2018 Oct 22.
Artículo en Inglés | MEDLINE | ID: mdl-30343669

RESUMEN

BACKGROUND: Many biological analysis tasks require extraction of families of genetically similar sequences from large datasets produced by Next-generation Sequencing (NGS). Such tasks include detection of viral transmissions by analysis of all genetically close pairs of sequences from viral datasets sampled from infected individuals or studying of evolution of viruses or immune repertoires by analysis of network of intra-host viral variants or antibody clonotypes formed by genetically close sequences. The most obvious naïeve algorithms to extract such sequence families are impractical in light of the massive size of modern NGS datasets. RESULTS: In this paper, we present fast and scalable k-mer-based framework to perform such sequence similarity queries efficiently, which specifically targets data produced by deep sequencing of heterogeneous populations such as viruses. It shows better filtering quality and time performance when comparing to other tools. The tool is freely available for download at https://github.com/vyacheslav-tsivina/signature-sj CONCLUSION: The proposed tool allows for efficient detection of genetic relatedness between genomic samples produced by deep sequencing of heterogeneous populations. It should be especially useful for analysis of relatedness of genomes of viruses with unevenly distributed variable genomic regions, such as HIV and HCV. For the future we envision, that besides applications in molecular epidemiology the tool can also be adapted to immunosequencing and metagenomics data.


Asunto(s)
Algoritmos , Variación Genética , Genoma , Filogenia , Secuencia de Bases , Entropía , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Metagenómica , Reproducibilidad de los Resultados , Factores de Tiempo
5.
BMC Bioinformatics ; 19(Suppl 11): 358, 2018 Oct 22.
Artículo en Inglés | MEDLINE | ID: mdl-30343674

RESUMEN

BACKGROUND: Molecular surveillance and outbreak investigation are important for elimination of hepatitis C virus (HCV) infection in the United States. A web-based system, Global Hepatitis Outbreak and Surveillance Technology (GHOST), has been developed using Illumina MiSeq-based amplicon sequence data derived from the HCV E1/E2-junction genomic region to enable public health institutions to conduct cost-effective and accurate molecular surveillance, outbreak detection and strain characterization. However, as there are many factors that could impact input data quality to which the GHOST system is not completely immune, accuracy of epidemiological inferences generated by GHOST may be affected. Here, we analyze the data submitted to the GHOST system during its pilot phase to assess the nature of the data and to identify common quality concerns that can be detected and corrected automatically. RESULTS: The GHOST quality control filters were individually examined, and quality failure rates were measured for all samples, including negative controls. New filters were developed and introduced to detect primer dimers, loss of specimen-specific product, or short products. The genotyping tool was adjusted to improve the accuracy of subtype calls. The identification of "chordless" cycles in a transmission network from data generated with known laboratory-based quality concerns allowed for further improvement of transmission detection by GHOST in surveillance settings. Parameters derived to detect actionable common quality control anomalies were incorporated into the automatic quality control module that rejects data depending on the magnitude of a quality problem, and warns and guides users in performing correctional actions. The guiding responses generated by the system are tailored to the GHOST laboratory protocol. CONCLUSIONS: Several new quality control problems were identified in MiSeq data submitted to GHOST and used to improve protection of the system from erroneous data and users from erroneous inferences. The GHOST system was upgraded to include identification of causes of erroneous data and recommendation of corrective actions to laboratory users.


Asunto(s)
Brotes de Enfermedades/prevención & control , Vigilancia de la Población/métodos , Automatización , Técnicas de Genotipaje , Hepacivirus/fisiología , Hepatitis C/epidemiología , Hepatitis C/virología , Humanos , Control de Calidad , Estándares de Referencia , Estados Unidos
6.
BMC Genomics ; 18(Suppl 10): 881, 2017 Dec 06.
Artículo en Inglés | MEDLINE | ID: mdl-29244001

RESUMEN

BACKGROUND: Intra-host hepatitis C virus (HCV) populations are genetically heterogeneous and organized in subpopulations. With the exception of blood transfusions, transmission of HCV occurs via a small number of genetic variants, the effect of which is frequently described as a bottleneck. Stochasticity of transmission associated with the bottleneck is usually used to explain genetic differences among HCV populations identified in the source and recipient cases, which may be further exacerbated by intra-host HCV evolution and differential biological capacity of HCV variants to successfully establish a population in a new host. RESULTS: Transmissibility was formulated as a property that can be measured from experimental Ultra-Deep Sequencing (UDS) data. The UDS data were obtained from one large hepatitis C outbreak involving an epidemiologically defined source and 18 recipient cases. k-Step networks of HCV variants were constructed and used to identify a potential association between transmissibility and network centrality of individual HCV variants from the source. An additional dataset obtained from nine other HCV outbreaks with known directionality of transmission was used for validation. Transmissibility was not found to be dependent on high frequency of variants in the source, supporting the earlier observations of transmission of minority variants. Among all tested measures of centrality, the highest correlation of transmissibility was found with Hamming centrality (r = 0.720; p = 1.57 E-71). Correlation between genetic distances and differences in transmissibility among HCV variants from the source was found to be 0.3276 (Mantel Test, p = 9.99 E-5), indicating association between genetic proximity and transmissibility. A strong correlation ranging from 0.565-0.947 was observed between Hamming centrality and transmissibility in 7 of the 9 additional transmission clusters (p < 0.05). CONCLUSIONS: Transmission is not an exclusively stochastic process. Transmissibility, as formally measured in this study, is associated with certain biological properties that also define location of variants in the genetic space occupied by the HCV strain from the source. The measure may also be applicable to other highly heterogeneous viruses. Besides improving accuracy of outbreak investigations, this finding helps with the understanding of molecular mechanisms contributing to establishment of chronic HCV infection.


Asunto(s)
Variación Genética , Hepacivirus/genética , Hepacivirus/fisiología , Brotes de Enfermedades , Evolución Molecular , Genotipo , Hepatitis C/epidemiología , Hepatitis C/transmisión , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos
7.
BMC Genomics ; 18(Suppl 4): 372, 2017 05 24.
Artículo en Inglés | MEDLINE | ID: mdl-28589864

RESUMEN

BACKGROUND: Hepatitis C is a major public health problem in the United States and worldwide. Outbreaks of hepatitis C virus (HCV) infections associated with unsafe injection practices, drug diversion, and other exposures to blood are difficult to detect and investigate. Molecular analysis has been frequently used in the study of HCV outbreaks and transmission chains; helping identify a cluster of sequences as linked by transmission if their genetic distances are below a previously defined threshold. However, HCV exists as a population of numerous variants in each infected individual and it has been observed that minority variants in the source are often the ones responsible for transmission, a situation that precludes the use of a single sequence per individual because many such transmissions would be missed. The use of Next-Generation Sequencing immensely increases the sensitivity of transmission detection but brings a considerable computational challenge because all sequences need to be compared among all pairs of samples. METHODS: We developed a three-step strategy that filters pairs of samples according to different criteria: (i) a k-mer bloom filter, (ii) a Levenhstein filter and (iii) a filter of identical sequences. We applied these three filters on a set of samples that cover the spectrum of genetic relationships among HCV cases, from being part of the same transmission cluster, to belonging to different subtypes. RESULTS: Our three-step filtering strategy rapidly removes 85.1% of all the pairwise sample comparisons and 91.0% of all pairwise sequence comparisons, accurately establishing which pairs of HCV samples are below the relatedness threshold. CONCLUSIONS: We present a fast and efficient three-step filtering strategy that removes most sequence comparisons and accurately establishes transmission links of any threshold-based method. This highly efficient workflow will allow a faster response and molecular detection capacity, improving the rate of detection of viral transmissions with molecular data.


Asunto(s)
Hepacivirus/genética , Hepacivirus/fisiología , Secuenciación de Nucleótidos de Alto Rendimiento , Algoritmos , Estadística como Asunto
8.
BMC Genomics ; 18(Suppl 10): 916, 2017 Dec 06.
Artículo en Inglés | MEDLINE | ID: mdl-29244005

RESUMEN

BACKGROUND: Hepatitis C is a major public health problem in the United States and worldwide. Outbreaks of hepatitis C virus (HCV) infections associated with unsafe injection practices, drug diversion, and other exposures to blood are difficult to detect and investigate. Effective HCV outbreak investigation requires comprehensive surveillance and robust case investigation. We previously developed and validated a methodology for the rapid and cost-effective identification of HCV transmission clusters. Global Hepatitis Outbreak and Surveillance Technology (GHOST) is a cloud-based system enabling users, regardless of computational expertise, to analyze and visualize transmission clusters in an independent, accurate and reproducible way. RESULTS: We present and explore performance of several GHOST implemented algorithms using next-generation sequencing data experimentally obtained from hypervariable region 1 of genetically related and unrelated HCV strains. GHOST processes data from an entire MiSeq run in approximately 3 h. A panel of seven specimens was used for preparation of six repeats of MiSeq libraries. Testing sequence data from these libraries by GHOST showed a consistent transmission linkage detection, testifying to high reproducibility of the system. Lack of linkage among genetically unrelated HCV strains and constant detection of genetic linkage between HCV strains from known transmission pairs and from follow-up specimens at different levels of MiSeq-read sampling indicate high specificity and sensitivity of GHOST in accurate detection of HCV transmission. CONCLUSIONS: GHOST enables automatic extraction of timely and relevant public health information suitable for guiding effective intervention measures. It is designed as a virtual diagnostic system intended for use in molecular surveillance and outbreak investigations rather than in research. The system produces accurate and reproducible information on HCV transmission clusters for all users, irrespective of their level of bioinformatics expertise. Improvement in molecular detection capacity will contribute to increasing the rate of transmission detection, thus providing opportunity for rapid, accurate and effective response to outbreaks of hepatitis C. Although GHOST was originally developed for hepatitis C surveillance, its modular structure is readily applicable to other infectious diseases. Worldwide availability of GHOST for the detection of HCV transmissions will foster deeper involvement of public health researchers and practitioners in hepatitis C outbreak investigation.


Asunto(s)
Nube Computacional , Biología Computacional/métodos , Brotes de Enfermedades/estadística & datos numéricos , Monitoreo Epidemiológico , Hepatitis C/epidemiología , Internacionalidad , Algoritmos , Humanos , Programas Informáticos , Interfaz Usuario-Computador
9.
J Gen Virol ; 98(5): 1048-1057, 2017 May.
Artículo en Inglés | MEDLINE | ID: mdl-28537543

RESUMEN

Despite the significant public health problems associated with hepatitis B virus (HBV) in sub-Saharan Africa, many countries in this region do not have systematic HBV surveillance or genetic information on HBV circulating locally. Here, we report on the genetic characterization of 772 HBV strains from Tanzania. Phylogenetic analysis of the S-gene sequences showed prevalence of HBV genotype A (HBV/A, n=671, 86.9 %), followed by genotypes D (HBV/D, n=95, 12.3 %) and E (HBV/E, n=6, 0.8 %). All HBV/A sequences were further classified into subtype A1, while the HBV/D sequences were assigned to a new cluster. Among the Tanzanian sequences, 84 % of HBV/A1 and 94 % of HBV/D were unique. The Tanzanian and global HBV/A1 sequences were compared and were completely intermixed in the phylogenetic tree, with the Tanzanian sequences frequently generating long terminal branches, indicating a long history of HBV/A1 infections in the country. The time to the most recent common ancestor was estimated to be 188 years ago [95 % highest posterior density (HPD): 132 to 265 years] for HBV/A1 and 127 years ago (95 % HPD: 79 to 192 years) for HBV/D. The Bayesian skyline plot showed that the number of transmissions 'exploded' exponentially between 1960-1970 for HBV/A1 and 1970-1990 for HBV/D, with the effective population of HBV/A1 having expanded twice as much as that of HBV/D. The data suggest that Tanzania is at least a part of the geographic origin of the HBV/A1 subtype. A recent increase in the transmission rate and significant HBV genetic diversity should be taken into consideration when devising public health interventions to control HBV infections in Tanzania.

10.
J Infect Dis ; 213(6): 957-65, 2016 Mar 15.
Artículo en Inglés | MEDLINE | ID: mdl-26582955

RESUMEN

Hepatitis C is a major public health problem in the United States and worldwide. Outbreaks of hepatitis C virus (HCV) infections are associated with unsafe injection practices, drug diversion, and other exposures to blood and are difficult to detect and investigate. Here, we developed and validated a simple approach for molecular detection of HCV transmissions in outbreak settings. We obtained sequences from the HCV hypervariable region 1 (HVR1), using end-point limiting-dilution (EPLD) technique, from 127 cases involved in 32 epidemiologically defined HCV outbreaks and 193 individuals with unrelated HCV strains. We compared several types of genetic distances and calculated a threshold, using minimal Hamming distances, that identifies transmission clusters in all tested outbreaks with 100% accuracy. The approach was also validated on sequences obtained using next-generation sequencing from HCV strains recovered from 239 individuals, and findings showed the same accuracy as that for EPLD. On average, the nucleotide diversity of the intrahost population was 6.2 times greater in the source case than in any incident case, allowing the correct detection of transmission direction in 8 outbreaks for which source cases were known. A simple and accurate distance-based approach developed here for detecting HCV transmissions streamlines molecular investigation of outbreaks, thus improving the public health capacity for rapid and effective control of hepatitis C.


Asunto(s)
Brotes de Enfermedades , Ligamiento Genético , Hepacivirus/genética , Hepacivirus/aislamiento & purificación , Hepatitis C/transmisión , Hepatitis C/virología , Análisis por Conglomerados , Variación Genética , Genotipo , Hepatitis C/epidemiología , Humanos , Reproducibilidad de los Resultados
11.
Bioinformatics ; 31(5): 682-90, 2015 Mar 01.
Artículo en Inglés | MEDLINE | ID: mdl-25359889

RESUMEN

MOTIVATION: Next-generation sequencing (NGS) allows for analyzing a large number of viral sequences from infected patients, providing an opportunity to implement large-scale molecular surveillance of viral diseases. However, despite improvements in technology, traditional protocols for NGS of large numbers of samples are still highly cost and labor intensive. One of the possible cost-effective alternatives is combinatorial pooling. Although a number of pooling strategies for consensus sequencing of DNA samples and detection of SNPs have been proposed, these strategies cannot be applied to sequencing of highly heterogeneous viral populations. RESULTS: We developed a cost-effective and reliable protocol for sequencing of viral samples, that combines NGS using barcoding and combinatorial pooling and a computational framework including algorithms for optimal virus-specific pools design and deconvolution of individual samples from sequenced pools. Evaluation of the framework on experimental and simulated data for hepatitis C virus showed that it substantially reduces the sequencing costs and allows deconvolution of viral populations with a high accuracy. AVAILABILITY AND IMPLEMENTATION: The source code and experimental data sets are available at http://alan.cs.gsu.edu/NGS/?q=content/pooling.


Asunto(s)
Algoritmos , Biología Computacional/métodos , ADN Viral/genética , Genoma Viral , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Análisis de Secuencia de ADN/métodos , Virus/clasificación , Virus/genética , Variación Genética , Hepacivirus/clasificación , Hepacivirus/genética , Humanos
12.
J Infect Dis ; 212(12): 1962-9, 2015 Dec 15.
Artículo en Inglés | MEDLINE | ID: mdl-26155829

RESUMEN

BACKGROUND: Up to 30% of acute viral hepatitis has no known etiology. To determine the disease etiology in patients with acute hepatitis of unknown etiology (HUE), serum specimens were obtained from 38 patients residing in the United Kingdom and Vietnam and from 26 healthy US blood donors. All specimens tested negative for known viral infections causing hepatitis, using commercially available serological and nucleic acid assays. METHODS: Specimens were processed by sequence-independent complementary DNA amplification and next-generation sequencing (NGS). Sufficient material for individual NGS libraries was obtained from 12 HUE cases and 26 blood donors; the remaining HUE cases were sequenced as a pool. Read mapping was done by targeted and de novo assembly. RESULTS: Sequences from hepatitis B virus (HBV) were detected in 7 individuals with HUE (58.3%) and the pooled library, and hepatitis E virus (HEV) was detected in 2 individuals with HUE (16.7%) and the pooled library. Both HEV-positive cases were coinfected with HBV. HBV sequences belonged to genotypes A, D, or G, and HEV sequences belonged to genotype 3. No known hepatotropic viruses were detected in the tested normal human sera. CONCLUSIONS: NGS-based detection of HBV and HEV infections is more sensitive than using commercially available assays. HBV and HEV may be cryptically associated with HUE.


Asunto(s)
Sangre/virología , Pruebas Diagnósticas de Rutina/métodos , Virus de la Hepatitis B/aislamiento & purificación , Virus de la Hepatitis E/aislamiento & purificación , Hepatitis Viral Humana/diagnóstico , Hepatitis Viral Humana/etiología , Adulto , Anciano , Coinfección/virología , Femenino , Virus de la Hepatitis B/genética , Virus de la Hepatitis E/genética , Humanos , Masculino , Persona de Mediana Edad , Sensibilidad y Especificidad , Análisis de Secuencia de ADN , Reino Unido , Estados Unidos , Vietnam , Adulto Joven
13.
J Virol ; 88(24): 13971-80, 2014 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-25187549

RESUMEN

UNLABELLED: The recent epidemic history of hepatitis B virus (HBV) infections in the United States is complex, as indicated by current disparity in HBV genotype distribution between acute and chronic hepatitis B cases and the rapid decline in hepatitis B incidence since the 1990s. We report temporal changes in the genetic composition of the HBV population using whole-genome sequences (n = 179) from acute hepatitis B cases (n = 1,206) identified through the Sentinel County Surveillance for Acute Hepatitis (1998 to 2006). HBV belonged mainly to subtypes A2 (75%) and D3 (18%), with times of their most recent common ancestors being 1979 and 1987, respectively. A2 underwent rapid population expansions in ca. 1995 and ca. 2002, coinciding with transient rises in acute hepatitis B notification rates among adults; D3 underwent expansion in ca. 1998. A2 strains from cases identified after 2002, compared to those before 2002, tended to cluster phylogenetically, indicating selective expansion of specific strains, and were significantly reduced in genetic diversity (P = 0.001) and frequency of drug resistance mutations (P = 0.001). The expansion of genetically close HBV A2 strains was associated with risk of infection among male homosexuals (P = 0.03). Incident HBV strains circulating in the United States were recent in origin and restricted in genetic diversity. Disparate transmission dynamics among phylogenetic lineages affected the genetic composition of HBV populations and their capacity to maintain drug resistance mutations. The tendency of selectively expanding HBV strains to be transmitted among male homosexuals highlights the need to improve hepatitis B vaccination coverage among at-risk adults. IMPORTANCE: Hepatitis B virus (HBV) remains an important cause of acute and chronic liver disease globally and in the United States. Genetic analysis of HBV whole genomes from cases of acute hepatitis B identified from 1998 to 2006 in the United States showed dominance of genotype A2 (75%), followed by D3 (18%). Strains of both subtypes were recent in origin and underwent rapid population expansions from 1995 to 2000, indicating increase in transmission rate for certain HBV strains during a period of decline in the reported incidence of acute hepatitis B in the United States. HBV A2 strains from a particular cluster that experienced the most recent population expansion were more commonly detected among men who have sex with men. Vaccination needs to be stepped up to protect persons who remain at risk of HBV infection.


Asunto(s)
Variación Genética , Virus de la Hepatitis B/clasificación , Virus de la Hepatitis B/genética , Hepatitis B/epidemiología , Hepatitis B/virología , Adulto , Análisis por Conglomerados , ADN Viral/química , ADN Viral/genética , Femenino , Genoma Viral , Genotipo , Hepatitis B/transmisión , Humanos , Masculino , Epidemiología Molecular , Datos de Secuencia Molecular , Filogenia , Análisis de Secuencia de ADN , Estados Unidos/epidemiología
14.
BMC Genomics ; 15 Suppl 5: S4, 2014.
Artículo en Inglés | MEDLINE | ID: mdl-25081811

RESUMEN

BACKGROUND: Next-generation sequencing (NGS) allows for sampling numerous viral variants from infected patients. This provides a novel opportunity to represent and study the mutational landscape of Hepatitis C Virus (HCV) within a single host. RESULTS: Intra-host variants of the HCV E1/E2 region were extensively sampled from 58 chronically infected patients. After NGS error correction, the average number of reads and variants obtained from each sample were 3202 and 464, respectively. The distance between each pair of variants was calculated and networks were created for each patient, where each node is a variant and two nodes are connected by a link if the nucleotide distance between them is 1. The work focused on large components having > 5% of all reads, which in average account for 93.7% of all reads found in a patient. CONCLUSIONS: Most intra-host variants are organized into distinct single-mutation components that are: well separated from each other, represent genetic distances between viral variants, robust to sampling, reproducible and likely seeded during transmission events. Facilitated by NGS, large components offer a novel evolutionary framework for genetic analysis of intra-host viral populations and understanding transmission, immune escape and drug resistance.


Asunto(s)
Variación Genética , Hepacivirus/genética , Secuenciación de Nucleótidos de Alto Rendimiento , Mutación , Simulación por Computador , Genotipo , Hepatitis C/transmisión , Humanos , Compartición de Agujas , ARN Viral/genética , Análisis de Secuencia de ADN
15.
J Med Virol ; 86(5): 765-71, 2014 May.
Artículo en Inglés | MEDLINE | ID: mdl-24519518

RESUMEN

Hepatitis C virus (HCV) infection presents an important, but underappreciated public health problem in Africa. In Côte d'Ivoire, very little is known about the molecular dynamics of HCV infection. Plasma samples (n = 608) from pregnant women collected in 1995 from Côte d'Ivoire were analyzed in this study. Only 18 specimens (∼3%) were found to be HCV PCR-positive. Phylogenetic analysis of the HCV NS5b sequences showed that the HCV variants belong to genotype 1 (HCV1) (n = 12, 67%) and genotype 2 (HCV2) (n = 6, 33%), with a maximum genetic diversity among HCV variants in each genotype being 20.7% and 24.0%, respectively. Although all HCV2 variants were genetically distant from each other, six HCV1 variants formed two tight sub-clusters belonging to HCV1a and HCV1b. Analysis of molecular variance (AMOVA) showed that the genetic structure of HCV isolates from West Africa with Côte d'Ivoire included were significantly different from Central African strains (P = 0.0001). Examination of intra-host viral populations using next-generation sequencing of the HCV HVR1 showed a significant variation in intra-host genetic diversity among infected individuals, with some strains composed of sub-populations as distant from each other as viral populations from different hosts. Collectively, the results indicate a complex HCV evolution in Côte d'Ivoire, similar to the rest of West Africa, and suggest a unique HCV epidemic history in the country.


Asunto(s)
Enfermedades Endémicas , Evolución Molecular , Variación Genética , Hepacivirus/clasificación , Hepacivirus/genética , Hepatitis C Crónica/epidemiología , Hepatitis C Crónica/virología , África , África Occidental , Análisis por Conglomerados , Côte d'Ivoire/epidemiología , Femenino , Genotipo , Hepacivirus/aislamiento & purificación , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Datos de Secuencia Molecular , Filogenia , Embarazo , Complicaciones Infecciosas del Embarazo/epidemiología , Complicaciones Infecciosas del Embarazo/virología , ARN Viral/genética , Proteínas no Estructurales Virales/genética
16.
J Infect Dis ; 207(6): 999-1006, 2013 Mar 15.
Artículo en Inglés | MEDLINE | ID: mdl-23300164

RESUMEN

The molecular detection of transmission of rapidly mutating pathogens such as hepatitis C virus (HCV) is commonly achieved by assessing the genetic relatedness of strains among infected patients. We describe the development of a novel mass spectrometry (MS)-based approach to identify HCV transmission. MS was used to detect products of base-specific cleavage of RNA molecules obtained from HCV polymerase chain reaction fragments. The MS-peak profiles were found to reflect variation in the HCV genomic sequence and the intrahost composition of the HCV population. Serum specimens originating from 60 case patients from 14 epidemiologically confirmed outbreaks and 25 unrelated controls were tested. Neighbor-joining trees constructed using MS-peak profile-based Hamming distances showed 100% accuracy, and linkage networks constructed using a threshold established from the Hamming distances between epidemiologically unrelated cases showed 100% sensitivity and 99.93% specificity in transmission detection. This MS-based approach is rapid, robust, reproducible, cost-effective, and applicable to investigating transmissions of other pathogens.


Asunto(s)
ADN Viral/aislamiento & purificación , Hepacivirus/aislamiento & purificación , Hepatitis C/epidemiología , Hepatitis C/transmisión , Espectrometría de Masas/métodos , Análisis de Varianza , ADN Viral/sangre , Hepacivirus/genética , Hepatitis C/sangre , Humanos , Epidemiología Molecular , Filogenia , Reacción en Cadena de la Polimerasa , ARN Viral/sangre , Sensibilidad y Especificidad , Estados Unidos/epidemiología
17.
J Comput Biol ; 30(4): 420-431, 2023 04.
Artículo en Inglés | MEDLINE | ID: mdl-36602524

RESUMEN

Application of genetic distances to measure phenotypic relatedness is a challenging task, reflecting the complex relationship between genotype and phenotype. Accurate assessment of proximity among sequences with different phenotypic traits depends on how strongly the chosen distance is associated with structural and functional properties. In this study, we present a new distance measure Mutual Information and Entropy H (MIH) for categorical data such as nucleotide or amino acid sequences. MIH applies an information matrix (IM), which is calculated from the data and captures heterogeneity of individual positions as measured by Shannon entropy and coordinated substitutions among positions as measured by mutual information. In general, MIH assigns low weights to differences occurring at high entropy positions or at dependent positions. MIH distance was compared with other common distances on two experimental and two simulated data sets. MIH showed the best ability to distinguish cross-immunoreactive sequence pairs from non-cross-immunoreactive pairs of variants of the hepatitis C virus hypervariable region 1 (26,883 pairwise comparisons), and Major Histocompatibility Complex (MHC) binding peptides (n = 181) from non-binding peptides (n = 129). Analysis of 74 simulated RNA secondary structures also showed that the ratio between MIH distance of sequences from the same RNA structure and MIH of sequences from different structures is three orders of magnitude greater than for Hamming distances. These findings indicate that lower MIH between two sequences is associated with greater probability of the sequences to belong to the same phenotype. Examination of rule-based phenotypes generated in silico showed that (1) MIH is strongly associated with phenotypic differences, (2) IM of sequences under selection is very different from IM generated under random scenarios, and (3) IM is robust to sampling. In conclusion, MIH strongly approximates structural/functional distances and should have important applications to a wide range of biological problems, including evolution, artificial selection of biological functions and structures, and measuring phenotypic similarity.


Asunto(s)
Péptidos , ARN , Secuencia de Aminoácidos , Fenotipo
18.
BMC Bioinformatics ; 13 Suppl 10: S6, 2012 Jun 25.
Artículo en Inglés | MEDLINE | ID: mdl-22759430

RESUMEN

BACKGROUND: Next-generation sequencing allows the analysis of an unprecedented number of viral sequence variants from infected patients, presenting a novel opportunity for understanding virus evolution, drug resistance and immune escape. However, sequencing in bulk is error prone. Thus, the generated data require error identification and correction. Most error-correction methods to date are not optimized for amplicon analysis and assume that the error rate is randomly distributed. Recent quality assessment of amplicon sequences obtained using 454-sequencing showed that the error rate is strongly linked to the presence and size of homopolymers, position in the sequence and length of the amplicon. All these parameters are strongly sequence specific and should be incorporated into the calibration of error-correction algorithms designed for amplicon sequencing. RESULTS: In this paper, we present two new efficient error correction algorithms optimized for viral amplicons: (i) k-mer-based error correction (KEC) and (ii) empirical frequency threshold (ET). Both were compared to a previously published clustering algorithm (SHORAH), in order to evaluate their relative performance on 24 experimental datasets obtained by 454-sequencing of amplicons with known sequences. All three algorithms show similar accuracy in finding true haplotypes. However, KEC and ET were significantly more efficient than SHORAH in removing false haplotypes and estimating the frequency of true ones. CONCLUSIONS: Both algorithms, KEC and ET, are highly suitable for rapid recovery of error-free haplotypes obtained by 454-sequencing of amplicons from heterogeneous viruses.The implementations of the algorithms and data sets used for their testing are available at: http://alan.cs.gsu.edu/NGS/?q=content/pyrosequencing-error-correction-algorithm.


Asunto(s)
Algoritmos , Biología Computacional/métodos , Análisis de Secuencia de ADN/métodos , Virus/genética , Análisis por Conglomerados , ADN Viral/genética , Haplotipos
19.
J Gen Virol ; 93(Pt 7): 1410-1421, 2012 Jul.
Artículo en Inglés | MEDLINE | ID: mdl-22456613

RESUMEN

We investigated the molecular epidemiology and population dynamics of HCV infection among indigenes of two semi-isolated communities in North-Central Nigeria. Despite remoteness and isolation, ~15% of the population had serological or molecular markers of hepatitis C virus (HCV) infection. Phylogenetic analysis of the NS5b sequences obtained from 60 HCV-infected residents showed that HCV variants belonged to genotype 1 (n=51; 85%) and genotype 2 (n=9; 15%). All sequences were unique and intermixed in the phylogenetic tree with HCV sequences from people infected from other West African countries. The high-throughput 454 pyrosequencing of the HCV hypervariable region 1 and an empirical threshold error correction algorithm were used to evaluate intra-host heterogeneity of HCV strains of genotype 1 (n=43) and genotype 2 (n=6) from residents of the communities. Analysis revealed a rare detectable intermixing of HCV intra-host variants among residents. Identification of genetically close HCV variants among all known groups of relatives suggests a common intra-familial HCV transmission in the communities. Applying Bayesian coalescent analysis to the NS5b sequences, the most recent common ancestors for genotype 1 and 2 variants were estimated to have existed 675 and 286 years ago, respectively. Bayesian skyline plots suggest that HCV lineages of both genotypes identified in the Nigerian communities experienced epidemic growth for 200-300 years until the mid-20th century. The data suggest a massive introduction of numerous HCV variants to the communities during the 20th century in the background of a dynamic evolutionary history of the hepatitis C epidemic in Nigeria over the past three centuries.


Asunto(s)
Epidemias/historia , Hepacivirus/clasificación , Hepacivirus/genética , Hepatitis C/epidemiología , Hepatitis C/virología , ARN Viral/genética , Proteínas no Estructurales Virales/genética , Adulto , África Occidental/epidemiología , Análisis por Conglomerados , Femenino , Genotipo , Hepacivirus/aislamiento & purificación , Hepatitis C/historia , Secuenciación de Nucleótidos de Alto Rendimiento , Historia del Siglo XV , Historia del Siglo XVI , Historia del Siglo XVII , Historia del Siglo XVIII , Historia del Siglo XIX , Historia del Siglo XX , Historia del Siglo XXI , Humanos , Masculino , Epidemiología Molecular , Datos de Secuencia Molecular , Nigeria/epidemiología , Filogenia , Polimorfismo Genético , Grupos de Población , Prevalencia
20.
J Virol ; 85(13): 6369-80, 2011 Jul.
Artículo en Inglés | MEDLINE | ID: mdl-21525348

RESUMEN

The intrahost evolution of hepatitis C virus (HCV) holds keys to understanding mechanisms responsible for the establishment of chronic infections and to development of a vaccine and therapeutics. In this study, intrahost variants of two variable HCV genomic regions, HVR1 and NS5A, were sequenced from four treatment-naïve chronically infected patients who were followed up from the acute stage of infection for 9 to 18 years. Median-joining network analysis indicated that the majority of the HCV intrahost variants were observed only at certain time points, but some variants were detectable at more than one time point. In all patients, these variants were found organized into communities or subpopulations. We hypothesize that HCV intrahost evolution is defined by two processes: incremental changes within communities through random mutation and alternations between coexisting communities. The HCV population was observed to incrementally evolve within a single community during approximately the first 3 years of infection, followed by dispersion into several subpopulations. Two patients demonstrated this pattern of dispersion for the rest of the observation period, while HCV variants in the other two patients converged into another single subpopulation after ∼9 to 12 years of dispersion. The final subpopulation in these two patients was under purifying selection. Intrahost HCV evolution in all four patients was characterized by a consistent increase in negative selection over time, suggesting the increasing HCV adaptation to the host late in infection. The data suggest specific staging of HCV intrahost evolution.


Asunto(s)
Evolución Molecular , Variación Genética , Hepacivirus/genética , Hepatitis C Crónica/virología , Interacciones Huésped-Patógeno , Proteínas no Estructurales Virales/genética , Proteínas Virales/genética , Sangre/virología , Genotipo , Hepacivirus/clasificación , Hepacivirus/aislamiento & purificación , Interacciones Huésped-Patógeno/genética , Humanos , Datos de Secuencia Molecular , Filogenia , Análisis de Secuencia de ADN , Factores de Tiempo , Proteínas no Estructurales Virales/química , Proteínas Virales/química
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA