RESUMEN
BACKGROUND: Investigation of outbreaks to identify the primary case is crucial for the interruption and prevention of transmission of infectious diseases. These individuals may have a higher risk of participating in near future transmission events when compared to the other patients in the outbreak, so directing more transmission prevention resources towards these individuals is a priority. Although the genetic characterization of intra-host viral populations can aid the identification of transmission clusters, it is not trivial to determine the directionality of transmissions during outbreaks, owing to complexity of viral evolution. Here, we present a new computational framework, PYCIVO: primary case inference in viral outbreaks. This framework expands upon our earlier work in development of QUENTIN, which builds a probabilistic disease transmission tree based on simulation of evolution of intra-host hepatitis C virus (HCV) variants between cases involved in direct transmission during an outbreak. PYCIVO improves upon QUENTIN by also adding a custom heterogeneity index and identifying the scenario when the primary case may have not been sampled. RESULTS: These approaches were validated using a set of 105 sequence samples from 11 distinct HCV transmission clusters identified during outbreak investigations, in which the primary case was epidemiologically verified. Both models can detect the correct primary case in 9 out of 11 transmission clusters (81.8%). However, while QUENTIN issues erroneous predictions on the remaining 2 transmission clusters, PYCIVO issues a null output for these clusters, giving it an effective prediction accuracy of 100%. To further evaluate accuracy of the inference, we created 10 modified transmission clusters in which the primary case had been removed. In this scenario, PYCIVO was able to correctly identify that there was no primary case in 8/10 (80%) of these modified clusters. This model was validated with HCV; however, this approach may be applicable to other microbial pathogens. CONCLUSIONS: PYCIVO improves upon QUENTIN by also implementing a custom heterogeneity index which empowers PYCIVO to make the important 'No primary case' prediction. One or more samples, possibly including the primary case, may have not been sampled, and this designation is meant to account for these scenarios.
Asunto(s)
Enfermedades Transmisibles , Hepatitis C , Biología Computacional , Brotes de Enfermedades , Hepacivirus/genética , Hepatitis C/epidemiología , Humanos , FilogeniaRESUMEN
Hepatitis A virus (HAV) genotype IA was most common among strains tested in US outbreak investigations and surveillance during 1996-2015. However, HAV genotype IB gained prominence during 2016-2019 person-to-person multistate outbreaks. Detection of previously uncommon strains highlights the changing molecular epidemiology of HAV infection in the United States.
Asunto(s)
Virus de la Hepatitis A , Hepatitis A , Brotes de Enfermedades , Genotipo , Hepatitis A/epidemiología , Virus de la Hepatitis A/genética , Humanos , Epidemiología Molecular , Filogenia , ARN Viral , Estados UnidosRESUMEN
Motivation: Genomic analysis has become one of the major tools for disease outbreak investigations. However, existing computational frameworks for inference of transmission history from viral genomic data often do not consider intra-host diversity of pathogens and heavily rely on additional epidemiological data, such as sampling times and exposure intervals. This impedes genomic analysis of outbreaks of highly mutable viruses associated with chronic infections, such as human immunodeficiency virus and hepatitis C virus, whose transmissions are often carried out through minor intra-host variants, while the additional epidemiological information often is either unavailable or has a limited use. Results: The proposed framework QUasispecies Evolution, Network-based Transmission INference (QUENTIN) addresses the above challenges by evolutionary analysis of intra-host viral populations sampled by deep sequencing and Bayesian inference using general properties of social networks relevant to infection dissemination. This method allows inference of transmission direction even without the supporting case-specific epidemiological information, identify transmission clusters and reconstruct transmission history. QUENTIN was validated on experimental and simulated data, and applied to investigate HCV transmission within a community of hosts with high-risk behavior. It is available at https://github.com/skumsp/QUENTIN. Contact: pskums@gsu.edu or alexz@cs.gsu.edu or rahul@sfsu.edu or yek0@cdc.gov. Supplementary information: Supplementary data are available at Bioinformatics online.
Asunto(s)
Genoma Viral , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Cuasiespecies , Análisis de Secuencia de ARN/métodos , Programas Informáticos , Teorema de Bayes , Brotes de Enfermedades , Genómica/métodos , Hepacivirus/genética , Humanos , Análisis de Secuencia de ADN/métodosRESUMEN
BACKGROUND: Hepatitis C is a major public health problem in the United States and worldwide. Outbreaks of hepatitis C virus (HCV) infections associated with unsafe injection practices, drug diversion, and other exposures to blood are difficult to detect and investigate. Effective HCV outbreak investigation requires comprehensive surveillance and robust case investigation. We previously developed and validated a methodology for the rapid and cost-effective identification of HCV transmission clusters. Global Hepatitis Outbreak and Surveillance Technology (GHOST) is a cloud-based system enabling users, regardless of computational expertise, to analyze and visualize transmission clusters in an independent, accurate and reproducible way. RESULTS: We present and explore performance of several GHOST implemented algorithms using next-generation sequencing data experimentally obtained from hypervariable region 1 of genetically related and unrelated HCV strains. GHOST processes data from an entire MiSeq run in approximately 3 h. A panel of seven specimens was used for preparation of six repeats of MiSeq libraries. Testing sequence data from these libraries by GHOST showed a consistent transmission linkage detection, testifying to high reproducibility of the system. Lack of linkage among genetically unrelated HCV strains and constant detection of genetic linkage between HCV strains from known transmission pairs and from follow-up specimens at different levels of MiSeq-read sampling indicate high specificity and sensitivity of GHOST in accurate detection of HCV transmission. CONCLUSIONS: GHOST enables automatic extraction of timely and relevant public health information suitable for guiding effective intervention measures. It is designed as a virtual diagnostic system intended for use in molecular surveillance and outbreak investigations rather than in research. The system produces accurate and reproducible information on HCV transmission clusters for all users, irrespective of their level of bioinformatics expertise. Improvement in molecular detection capacity will contribute to increasing the rate of transmission detection, thus providing opportunity for rapid, accurate and effective response to outbreaks of hepatitis C. Although GHOST was originally developed for hepatitis C surveillance, its modular structure is readily applicable to other infectious diseases. Worldwide availability of GHOST for the detection of HCV transmissions will foster deeper involvement of public health researchers and practitioners in hepatitis C outbreak investigation.
Asunto(s)
Nube Computacional , Biología Computacional/métodos , Brotes de Enfermedades/estadística & datos numéricos , Monitoreo Epidemiológico , Hepatitis C/epidemiología , Internacionalidad , Algoritmos , Humanos , Programas Informáticos , Interfaz Usuario-ComputadorRESUMEN
Hepatitis C is a major public health problem in the United States and worldwide. Outbreaks of hepatitis C virus (HCV) infections are associated with unsafe injection practices, drug diversion, and other exposures to blood and are difficult to detect and investigate. Here, we developed and validated a simple approach for molecular detection of HCV transmissions in outbreak settings. We obtained sequences from the HCV hypervariable region 1 (HVR1), using end-point limiting-dilution (EPLD) technique, from 127 cases involved in 32 epidemiologically defined HCV outbreaks and 193 individuals with unrelated HCV strains. We compared several types of genetic distances and calculated a threshold, using minimal Hamming distances, that identifies transmission clusters in all tested outbreaks with 100% accuracy. The approach was also validated on sequences obtained using next-generation sequencing from HCV strains recovered from 239 individuals, and findings showed the same accuracy as that for EPLD. On average, the nucleotide diversity of the intrahost population was 6.2 times greater in the source case than in any incident case, allowing the correct detection of transmission direction in 8 outbreaks for which source cases were known. A simple and accurate distance-based approach developed here for detecting HCV transmissions streamlines molecular investigation of outbreaks, thus improving the public health capacity for rapid and effective control of hepatitis C.
Asunto(s)
Brotes de Enfermedades , Ligamiento Genético , Hepacivirus/genética , Hepacivirus/aislamiento & purificación , Hepatitis C/transmisión , Hepatitis C/virología , Análisis por Conglomerados , Variación Genética , Genotipo , Hepatitis C/epidemiología , Humanos , Reproducibilidad de los ResultadosRESUMEN
UNLABELLED: Hypervariable region 1 (HVR1) of hepatitis C virus (HCV) comprises the first 27 N-terminal amino acid residues of E2. It is classically seen as the most heterogeneous region of the HCV genome. In this study, we assessed HVR1 evolution by using ultradeep pyrosequencing for a cohort of treatment-naive, chronically infected patients over a short, 16-week period. Organization of the sequence set into connected components that represented single nucleotide substitution events revealed a network dominated by highly connected, centrally positioned master sequences. HVR1 phenotypes were observed to be under strong purifying (stationary) and strong positive (antigenic drift) selection pressures, which were coincident with advancing patient age and cirrhosis of the liver. It followed that stationary viromes were dominated by a single HVR1 variant surrounded by minor variants comprised from conservative single amino acid substitution events. We present evidence to suggest that neutralization antibody efficacy was diminished for stationary-virome HVR1 variants. Our results identify the HVR1 network structure during chronic infection as the preferential dominance of a single variant within a narrow sequence space. IMPORTANCE: HCV infection is often asymptomatic, and chronic infection is generally well established in advance of initial diagnosis and subsequent treatment. HVR1 can undergo rapid sequence evolution during acute infection, and the variant pool is typically seen to diverge away from ancestral sequences as infection progresses from the acute to the chronic phase. In this report, we describe HVR1 viromes in chronically infected patients that are defined by a dominant epitope located centrally within a narrow variant pool. Our findings suggest that weakened humoral immune activity, as a consequence of persistent chronic infection, allows for the acquisition and maintenance of host-specific adaptive mutations at HVR1 that reflect virus fitness.
Asunto(s)
Anticuerpos Neutralizantes/inmunología , Hepacivirus/inmunología , Anticuerpos contra la Hepatitis C/inmunología , Hepatitis C Crónica/virología , Proteínas Virales/inmunología , Adulto , Anciano , Envejecimiento , Secuencia de Aminoácidos , Sustitución de Aminoácidos , Secuencia de Bases , Femenino , Hepacivirus/genética , Hepatitis C Crónica/inmunología , Humanos , Inmunidad Humoral/inmunología , Inmunoglobulina G/inmunología , Cirrosis Hepática/patología , Masculino , Persona de Mediana Edad , Datos de Secuencia Molecular , Análisis de Secuencia de ARN , Proteínas del Envoltorio Viral/genética , Proteínas Virales/genética , Adulto JovenRESUMEN
MOTIVATION: Next-generation sequencing (NGS) allows for analyzing a large number of viral sequences from infected patients, providing an opportunity to implement large-scale molecular surveillance of viral diseases. However, despite improvements in technology, traditional protocols for NGS of large numbers of samples are still highly cost and labor intensive. One of the possible cost-effective alternatives is combinatorial pooling. Although a number of pooling strategies for consensus sequencing of DNA samples and detection of SNPs have been proposed, these strategies cannot be applied to sequencing of highly heterogeneous viral populations. RESULTS: We developed a cost-effective and reliable protocol for sequencing of viral samples, that combines NGS using barcoding and combinatorial pooling and a computational framework including algorithms for optimal virus-specific pools design and deconvolution of individual samples from sequenced pools. Evaluation of the framework on experimental and simulated data for hepatitis C virus showed that it substantially reduces the sequencing costs and allows deconvolution of viral populations with a high accuracy. AVAILABILITY AND IMPLEMENTATION: The source code and experimental data sets are available at http://alan.cs.gsu.edu/NGS/?q=content/pooling.
Asunto(s)
Algoritmos , Biología Computacional/métodos , ADN Viral/genética , Genoma Viral , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Análisis de Secuencia de ADN/métodos , Virus/clasificación , Virus/genética , Variación Genética , Hepacivirus/clasificación , Hepacivirus/genética , HumanosRESUMEN
UNLABELLED: The extent of provider-to-patient hepatitis C virus (HCV) transmission from diversion, self-injection, and substitution ("tampering") of anesthetic opioids is unknown. To quantify the contribution of opioid tampering to nosocomial HCV outbreaks, data from health care-related HCV outbreaks occurring in developed countries from 1990 to 2012 were collated, grouped, and compared. Tampering was associated with 17% (8 of 46) of outbreaks, but 53% (438 of 833) of cases. Of the tampering outbreaks, six (75%) involved fentanyl, five (63%) occurred in the United States, and one each in Australia, Israel, and Spain. Case counts ranged from 5 to 275 in the tampering outbreaks (mean, 54.8; median, 25), and 1-99 in the nontampering outbreaks (mean, 10.4; median, 5); between them, the difference in mean ranks of counts was significant (P < 0.01). To estimate HCV transmission risks from tampering, risk-assessment models were constructed, and these risks compared with those from surgery. HCV transmission risk from exposure to an opioid preparation tampered by a provider of unknown HCV infection status who is a person who injects drugs (PWID; 0.62%; standard error [SE] = 0.38%) exceeds 16,757 times the risk from surgery by a surgeon of unknown HCV infection status (0.000037%; SE = 0.000029%) and 135 times by an HCV-infected surgeon (0.0046%; SE = 0.0033%). To pose a 50% patient transmission risk, an infected surgeon may take 30 years, compared to <1 year for a PWID tamperer, and weeks or days for a PWID tamperer who intensifies access to opioids. CONCLUSION: Disproportionately, many cases of HCV infection from nosocomial outbreaks were attributable to provider tampering of anesthetic opioids. Transmission risk from tampering is substantially higher than from surgery.
Asunto(s)
Analgésicos Opioides/administración & dosificación , Anestésicos , Infección Hospitalaria/transmisión , Contaminación de Medicamentos , Consumidores de Drogas , Hepatitis C/transmisión , Brotes de Enfermedades , Humanos , Medición de Riesgo , Procedimientos Quirúrgicos Operativos/efectos adversosRESUMEN
BACKGROUND: Up to 30% of acute viral hepatitis has no known etiology. To determine the disease etiology in patients with acute hepatitis of unknown etiology (HUE), serum specimens were obtained from 38 patients residing in the United Kingdom and Vietnam and from 26 healthy US blood donors. All specimens tested negative for known viral infections causing hepatitis, using commercially available serological and nucleic acid assays. METHODS: Specimens were processed by sequence-independent complementary DNA amplification and next-generation sequencing (NGS). Sufficient material for individual NGS libraries was obtained from 12 HUE cases and 26 blood donors; the remaining HUE cases were sequenced as a pool. Read mapping was done by targeted and de novo assembly. RESULTS: Sequences from hepatitis B virus (HBV) were detected in 7 individuals with HUE (58.3%) and the pooled library, and hepatitis E virus (HEV) was detected in 2 individuals with HUE (16.7%) and the pooled library. Both HEV-positive cases were coinfected with HBV. HBV sequences belonged to genotypes A, D, or G, and HEV sequences belonged to genotype 3. No known hepatotropic viruses were detected in the tested normal human sera. CONCLUSIONS: NGS-based detection of HBV and HEV infections is more sensitive than using commercially available assays. HBV and HEV may be cryptically associated with HUE.
Asunto(s)
Sangre/virología , Pruebas Diagnósticas de Rutina/métodos , Virus de la Hepatitis B/aislamiento & purificación , Virus de la Hepatitis E/aislamiento & purificación , Hepatitis Viral Humana/diagnóstico , Hepatitis Viral Humana/etiología , Adulto , Anciano , Coinfección/virología , Femenino , Virus de la Hepatitis B/genética , Virus de la Hepatitis E/genética , Humanos , Masculino , Persona de Mediana Edad , Sensibilidad y Especificidad , Análisis de Secuencia de ADN , Reino Unido , Estados Unidos , Vietnam , Adulto JovenRESUMEN
UNLABELLED: Hepatitis C virus (HCV) causes chronic infection in up to 50% to 80% of infected individuals. Hypervariable region 1 (HVR1) variability is frequently studied to gain an insight into the mechanisms of HCV adaptation during chronic infection, but the changes to and persistence of HCV subpopulations during intrahost evolution are poorly understood. In this study, we used ultradeep pyrosequencing (UDPS) to map the viral heterogeneity of a single patient over 9.6 years of chronic HCV genotype 4a infection. Informed error correction of the raw UDPS data was performed using a temporally matched clonal data set. The resultant data set reported the detection of low-frequency recombinants throughout the study period, implying that recombination is an active mechanism through which HCV can explore novel sequence space. The data indicate that polyvirus infection of hepatocytes has occurred but that the fitness quotients of recombinant daughter virions are too low for the daughter virions to compete against the parental genomes. The subpopulations of parental genomes contributing to the recombination events highlighted a dynamic virome where subpopulations of variants are in competition. In addition, we provide direct evidence that demonstrates the growth of subdominant populations to dominance in the absence of a detectable humoral response. IMPORTANCE: Analysis of ultradeep pyrosequencing data sets derived from virus amplicons frequently relies on software tools that are not optimized for amplicon analysis, assume random incorporation of sequencing errors, and are focused on achieving higher specificity at the expense of sensitivity. Such analysis is further complicated by the presence of hypervariable regions. In this study, we made use of a temporally matched reference sequence data set to inform error correction algorithms. Using this methodology, we were able to (i) detect multiple instances of hepatitis C virus intrasubtype recombination at the E1/E2 junction (a phenomenon rarely reported in the literature) and (ii) interrogate the longitudinal quasispecies complexity of the virome. Parallel to the UDPS, isolation of IgG-bound virions was found to coincide with the collapse of specific viral subpopulations.
Asunto(s)
Variación Genética , Hepacivirus/clasificación , Hepacivirus/genética , Hepatitis C Crónica/virología , ARN Viral/genética , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Estudios Longitudinales , Datos de Secuencia MolecularRESUMEN
UNLABELLED: The recent epidemic history of hepatitis B virus (HBV) infections in the United States is complex, as indicated by current disparity in HBV genotype distribution between acute and chronic hepatitis B cases and the rapid decline in hepatitis B incidence since the 1990s. We report temporal changes in the genetic composition of the HBV population using whole-genome sequences (n = 179) from acute hepatitis B cases (n = 1,206) identified through the Sentinel County Surveillance for Acute Hepatitis (1998 to 2006). HBV belonged mainly to subtypes A2 (75%) and D3 (18%), with times of their most recent common ancestors being 1979 and 1987, respectively. A2 underwent rapid population expansions in ca. 1995 and ca. 2002, coinciding with transient rises in acute hepatitis B notification rates among adults; D3 underwent expansion in ca. 1998. A2 strains from cases identified after 2002, compared to those before 2002, tended to cluster phylogenetically, indicating selective expansion of specific strains, and were significantly reduced in genetic diversity (P = 0.001) and frequency of drug resistance mutations (P = 0.001). The expansion of genetically close HBV A2 strains was associated with risk of infection among male homosexuals (P = 0.03). Incident HBV strains circulating in the United States were recent in origin and restricted in genetic diversity. Disparate transmission dynamics among phylogenetic lineages affected the genetic composition of HBV populations and their capacity to maintain drug resistance mutations. The tendency of selectively expanding HBV strains to be transmitted among male homosexuals highlights the need to improve hepatitis B vaccination coverage among at-risk adults. IMPORTANCE: Hepatitis B virus (HBV) remains an important cause of acute and chronic liver disease globally and in the United States. Genetic analysis of HBV whole genomes from cases of acute hepatitis B identified from 1998 to 2006 in the United States showed dominance of genotype A2 (75%), followed by D3 (18%). Strains of both subtypes were recent in origin and underwent rapid population expansions from 1995 to 2000, indicating increase in transmission rate for certain HBV strains during a period of decline in the reported incidence of acute hepatitis B in the United States. HBV A2 strains from a particular cluster that experienced the most recent population expansion were more commonly detected among men who have sex with men. Vaccination needs to be stepped up to protect persons who remain at risk of HBV infection.
Asunto(s)
Variación Genética , Virus de la Hepatitis B/clasificación , Virus de la Hepatitis B/genética , Hepatitis B/epidemiología , Hepatitis B/virología , Adulto , Análisis por Conglomerados , ADN Viral/química , ADN Viral/genética , Femenino , Genoma Viral , Genotipo , Hepatitis B/transmisión , Humanos , Masculino , Epidemiología Molecular , Datos de Secuencia Molecular , Filogenia , Análisis de Secuencia de ADN , Estados Unidos/epidemiologíaRESUMEN
BACKGROUND: Next-generation sequencing (NGS) allows for sampling numerous viral variants from infected patients. This provides a novel opportunity to represent and study the mutational landscape of Hepatitis C Virus (HCV) within a single host. RESULTS: Intra-host variants of the HCV E1/E2 region were extensively sampled from 58 chronically infected patients. After NGS error correction, the average number of reads and variants obtained from each sample were 3202 and 464, respectively. The distance between each pair of variants was calculated and networks were created for each patient, where each node is a variant and two nodes are connected by a link if the nucleotide distance between them is 1. The work focused on large components having > 5% of all reads, which in average account for 93.7% of all reads found in a patient. CONCLUSIONS: Most intra-host variants are organized into distinct single-mutation components that are: well separated from each other, represent genetic distances between viral variants, robust to sampling, reproducible and likely seeded during transmission events. Facilitated by NGS, large components offer a novel evolutionary framework for genetic analysis of intra-host viral populations and understanding transmission, immune escape and drug resistance.
Asunto(s)
Variación Genética , Hepacivirus/genética , Secuenciación de Nucleótidos de Alto Rendimiento , Mutación , Simulación por Computador , Genotipo , Hepatitis C/transmisión , Humanos , Compartición de Agujas , ARN Viral/genética , Análisis de Secuencia de ADNRESUMEN
Hepatitis C virus (HCV) infection presents an important, but underappreciated public health problem in Africa. In Côte d'Ivoire, very little is known about the molecular dynamics of HCV infection. Plasma samples (n = 608) from pregnant women collected in 1995 from Côte d'Ivoire were analyzed in this study. Only 18 specimens (â¼3%) were found to be HCV PCR-positive. Phylogenetic analysis of the HCV NS5b sequences showed that the HCV variants belong to genotype 1 (HCV1) (n = 12, 67%) and genotype 2 (HCV2) (n = 6, 33%), with a maximum genetic diversity among HCV variants in each genotype being 20.7% and 24.0%, respectively. Although all HCV2 variants were genetically distant from each other, six HCV1 variants formed two tight sub-clusters belonging to HCV1a and HCV1b. Analysis of molecular variance (AMOVA) showed that the genetic structure of HCV isolates from West Africa with Côte d'Ivoire included were significantly different from Central African strains (P = 0.0001). Examination of intra-host viral populations using next-generation sequencing of the HCV HVR1 showed a significant variation in intra-host genetic diversity among infected individuals, with some strains composed of sub-populations as distant from each other as viral populations from different hosts. Collectively, the results indicate a complex HCV evolution in Côte d'Ivoire, similar to the rest of West Africa, and suggest a unique HCV epidemic history in the country.
Asunto(s)
Enfermedades Endémicas , Evolución Molecular , Variación Genética , Hepacivirus/clasificación , Hepacivirus/genética , Hepatitis C Crónica/epidemiología , Hepatitis C Crónica/virología , África , África Occidental , Análisis por Conglomerados , Côte d'Ivoire/epidemiología , Femenino , Genotipo , Hepacivirus/aislamiento & purificación , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Datos de Secuencia Molecular , Filogenia , Embarazo , Complicaciones Infecciosas del Embarazo/epidemiología , Complicaciones Infecciosas del Embarazo/virología , ARN Viral/genética , Proteínas no Estructurales Virales/genéticaRESUMEN
The molecular detection of transmission of rapidly mutating pathogens such as hepatitis C virus (HCV) is commonly achieved by assessing the genetic relatedness of strains among infected patients. We describe the development of a novel mass spectrometry (MS)-based approach to identify HCV transmission. MS was used to detect products of base-specific cleavage of RNA molecules obtained from HCV polymerase chain reaction fragments. The MS-peak profiles were found to reflect variation in the HCV genomic sequence and the intrahost composition of the HCV population. Serum specimens originating from 60 case patients from 14 epidemiologically confirmed outbreaks and 25 unrelated controls were tested. Neighbor-joining trees constructed using MS-peak profile-based Hamming distances showed 100% accuracy, and linkage networks constructed using a threshold established from the Hamming distances between epidemiologically unrelated cases showed 100% sensitivity and 99.93% specificity in transmission detection. This MS-based approach is rapid, robust, reproducible, cost-effective, and applicable to investigating transmissions of other pathogens.
Asunto(s)
ADN Viral/aislamiento & purificación , Hepacivirus/aislamiento & purificación , Hepatitis C/epidemiología , Hepatitis C/transmisión , Espectrometría de Masas/métodos , Análisis de Varianza , ADN Viral/sangre , Hepacivirus/genética , Hepatitis C/sangre , Humanos , Epidemiología Molecular , Filogenia , Reacción en Cadena de la Polimerasa , ARN Viral/sangre , Sensibilidad y Especificidad , Estados Unidos/epidemiologíaRESUMEN
BACKGROUND: Next-generation sequencing allows the analysis of an unprecedented number of viral sequence variants from infected patients, presenting a novel opportunity for understanding virus evolution, drug resistance and immune escape. However, sequencing in bulk is error prone. Thus, the generated data require error identification and correction. Most error-correction methods to date are not optimized for amplicon analysis and assume that the error rate is randomly distributed. Recent quality assessment of amplicon sequences obtained using 454-sequencing showed that the error rate is strongly linked to the presence and size of homopolymers, position in the sequence and length of the amplicon. All these parameters are strongly sequence specific and should be incorporated into the calibration of error-correction algorithms designed for amplicon sequencing. RESULTS: In this paper, we present two new efficient error correction algorithms optimized for viral amplicons: (i) k-mer-based error correction (KEC) and (ii) empirical frequency threshold (ET). Both were compared to a previously published clustering algorithm (SHORAH), in order to evaluate their relative performance on 24 experimental datasets obtained by 454-sequencing of amplicons with known sequences. All three algorithms show similar accuracy in finding true haplotypes. However, KEC and ET were significantly more efficient than SHORAH in removing false haplotypes and estimating the frequency of true ones. CONCLUSIONS: Both algorithms, KEC and ET, are highly suitable for rapid recovery of error-free haplotypes obtained by 454-sequencing of amplicons from heterogeneous viruses.The implementations of the algorithms and data sets used for their testing are available at: http://alan.cs.gsu.edu/NGS/?q=content/pyrosequencing-error-correction-algorithm.
Asunto(s)
Algoritmos , Biología Computacional/métodos , Análisis de Secuencia de ADN/métodos , Virus/genética , Análisis por Conglomerados , ADN Viral/genética , HaplotiposRESUMEN
We investigated the molecular epidemiology and population dynamics of HCV infection among indigenes of two semi-isolated communities in North-Central Nigeria. Despite remoteness and isolation, ~15% of the population had serological or molecular markers of hepatitis C virus (HCV) infection. Phylogenetic analysis of the NS5b sequences obtained from 60 HCV-infected residents showed that HCV variants belonged to genotype 1 (n=51; 85%) and genotype 2 (n=9; 15%). All sequences were unique and intermixed in the phylogenetic tree with HCV sequences from people infected from other West African countries. The high-throughput 454 pyrosequencing of the HCV hypervariable region 1 and an empirical threshold error correction algorithm were used to evaluate intra-host heterogeneity of HCV strains of genotype 1 (n=43) and genotype 2 (n=6) from residents of the communities. Analysis revealed a rare detectable intermixing of HCV intra-host variants among residents. Identification of genetically close HCV variants among all known groups of relatives suggests a common intra-familial HCV transmission in the communities. Applying Bayesian coalescent analysis to the NS5b sequences, the most recent common ancestors for genotype 1 and 2 variants were estimated to have existed 675 and 286 years ago, respectively. Bayesian skyline plots suggest that HCV lineages of both genotypes identified in the Nigerian communities experienced epidemic growth for 200-300 years until the mid-20th century. The data suggest a massive introduction of numerous HCV variants to the communities during the 20th century in the background of a dynamic evolutionary history of the hepatitis C epidemic in Nigeria over the past three centuries.
Asunto(s)
Epidemias/historia , Hepacivirus/clasificación , Hepacivirus/genética , Hepatitis C/epidemiología , Hepatitis C/virología , ARN Viral/genética , Proteínas no Estructurales Virales/genética , Adulto , África Occidental/epidemiología , Análisis por Conglomerados , Femenino , Genotipo , Hepacivirus/aislamiento & purificación , Hepatitis C/historia , Secuenciación de Nucleótidos de Alto Rendimiento , Historia del Siglo XV , Historia del Siglo XVI , Historia del Siglo XVII , Historia del Siglo XVIII , Historia del Siglo XIX , Historia del Siglo XX , Historia del Siglo XXI , Humanos , Masculino , Epidemiología Molecular , Datos de Secuencia Molecular , Nigeria/epidemiología , Filogenia , Polimorfismo Genético , Grupos de Población , PrevalenciaRESUMEN
The intrahost evolution of hepatitis C virus (HCV) holds keys to understanding mechanisms responsible for the establishment of chronic infections and to development of a vaccine and therapeutics. In this study, intrahost variants of two variable HCV genomic regions, HVR1 and NS5A, were sequenced from four treatment-naïve chronically infected patients who were followed up from the acute stage of infection for 9 to 18 years. Median-joining network analysis indicated that the majority of the HCV intrahost variants were observed only at certain time points, but some variants were detectable at more than one time point. In all patients, these variants were found organized into communities or subpopulations. We hypothesize that HCV intrahost evolution is defined by two processes: incremental changes within communities through random mutation and alternations between coexisting communities. The HCV population was observed to incrementally evolve within a single community during approximately the first 3 years of infection, followed by dispersion into several subpopulations. Two patients demonstrated this pattern of dispersion for the rest of the observation period, while HCV variants in the other two patients converged into another single subpopulation after â¼9 to 12 years of dispersion. The final subpopulation in these two patients was under purifying selection. Intrahost HCV evolution in all four patients was characterized by a consistent increase in negative selection over time, suggesting the increasing HCV adaptation to the host late in infection. The data suggest specific staging of HCV intrahost evolution.
Asunto(s)
Evolución Molecular , Variación Genética , Hepacivirus/genética , Hepatitis C Crónica/virología , Interacciones Huésped-Patógeno , Proteínas no Estructurales Virales/genética , Proteínas Virales/genética , Sangre/virología , Genotipo , Hepacivirus/clasificación , Hepacivirus/aislamiento & purificación , Interacciones Huésped-Patógeno/genética , Humanos , Datos de Secuencia Molecular , Filogenia , Análisis de Secuencia de ADN , Factores de Tiempo , Proteínas no Estructurales Virales/química , Proteínas Virales/químicaRESUMEN
Hepatitis C virus (HCV) infections are public health problem across the globe, particularly in developing countries. Pakistan has the second highest prevalence of HCV infection worldwide. Limited data exist from Pakistan about persons who inject drugs (PWID) and are at significant risk of exposure to HCV infection and transmission. Serum specimens (n = 110) collected from PWID residing in four provinces were tested for molecular markers of HCV infection. Next generation sequencing (NGS) of the hypervariable region (HVR1) of HCV and Global Hepatitis Outbreak and Surveillance Technology (GHOST) were used to determine HCV genotype, genetic heterogeneity, and construct transmission networks. Among tested specimens, 47.3% were found anti-HCV positive and 34.6% were HCV RNA-positive and belonged to four genotypes, with 3a most prevalent followed by 1a, 1b and 4a. Variants sampled from five cases formed phylogenetic cluster and a transmission network. One case harbored infection with two different genotypes. High prevalence of infections and presence of various genotypes indicate frequent introduction and transmission of HCV among PWID in Pakistan. Identification of a transmission cluster across three provinces, involving 20% of all cases, suggests the existence of a countrywide transmission network among PWIDs. Understanding the structure of this network should assist in devising effective public health strategies to eliminate HCV infection in Pakistan.
Asunto(s)
Consumidores de Drogas , Hepatitis C , Abuso de Sustancias por Vía Intravenosa , Genotipo , Hepacivirus/genética , Humanos , Pakistán/epidemiología , Filogenia , Prevalencia , Abuso de Sustancias por Vía Intravenosa/epidemiologíaRESUMEN
Hepatitis C virus (HCV) is a major cause of liver disease world-wide. Current interferon and ribavirin (IFN/RBV) therapy is effective in 50%-60% of patients. HCV exists in infected patients as a large viral population of intra-host variants (quasispecies), which may be differentially resistant to interferon treatment. We present a method for measuring differential interferon resistance of HCV quasispecies based on mathematical modeling and analysis of HCV population dynamics during the first hours of interferon therapy. The mathematical models showed that individual intra-host HCV variants have a wide range of resistance to IFN treatment in each patient. Analysis of differential IFN resistance among intra-host HCV variants allows for accurate prediction of response to IFN therapy. The models strongly suggest that resistance to interferon may vary broadly among closely related variants in infected hosts and therapy outcome may be defined by a single or a few variants irrespective of their frequency in the intra-host HCV population before treatment.
Asunto(s)
Antivirales/farmacología , Hepacivirus/genética , Hepatitis C Crónica/tratamiento farmacológico , Interferones/farmacología , Algoritmos , Antivirales/uso terapéutico , Farmacorresistencia Viral , Hepacivirus/efectos de los fármacos , Hepatitis C Crónica/virología , Humanos , Interferones/uso terapéutico , Modelos TeóricosRESUMEN
Six immunoassays for detecting immunoglobulin M antibodies to hepatitis E virus were evaluated. Serum samples representing acute infection by each of the 4 viral genotypes as well as nonacute hepatitis E virus infection constituted the test panels. Diagnostic sensitivities and specificities as well as interassay agreement varied widely. Analytical sensitivity limits also were determined and were found to be particularly disparate.