Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 13 de 13
Filter
1.
Methods Mol Biol ; 939: 253-61, 2013.
Article in English | MEDLINE | ID: mdl-23192551

ABSTRACT

One of the challenges of genetic data analysis is to combine information from sources that are distributed around the world and accessible through a wide array of different methods and interfaces. The HIV database and its footsteps, the hepatitis C virus (HCV) and hemorrhagic fever virus (HFV) databases, have made it their mission to make different data types easily available to their users. This involves a large amount of behind-the-scenes processing, including quality control and analysis of the sequences and their annotation. Gene and protein sequences are distilled from the sequences that are stored in GenBank; to this end, both submitter annotation and script-generated sequences are used. Alignments of both nucleotide and amino acid sequences are generated, manually curated, distilled into an alignment model, and regenerated in an iterative cycle that results in ever better new alignments. Annotation of epidemiological and clinical information is parsed, checked, and added to the database. User interfaces are updated, and new interfaces are added based upon user requests. Vital for its success, the database staff are heavy users of the system, which enables them to fix bugs and find opportunities for improvement. In this chapter we describe some of the infrastructure that keeps these heavily used analysis platforms alive and vital after nearly 25 years of use. The database/analysis platforms described in this chapter can be accessed at http://hiv.lanl.gov http://hcv.lanl.gov http://hfv.lanl.gov.


Subject(s)
Computational Biology/methods , Genome, Viral , Genomics/methods , Knowledge Management , Arenavirus/classification , Arenavirus/genetics , Databases, Genetic , HIV/classification , HIV/genetics , Hepacivirus/classification , Hepacivirus/genetics , Sequence Alignment , Sequence Analysis, RNA , Software
2.
PLoS Pathog ; 7(9): e1002209, 2011 Sep.
Article in English | MEDLINE | ID: mdl-21980282

ABSTRACT

Here we have identified HIV-1 B clade Envelope (Env) amino acid signatures from early in infection that may be favored at transmission, as well as patterns of recurrent mutation in chronic infection that may reflect common pathways of immune evasion. To accomplish this, we compared thousands of sequences derived by single genome amplification from several hundred individuals that were sampled either early in infection or were chronically infected. Samples were divided at the outset into hypothesis-forming and validation sets, and we used phylogenetically corrected statistical strategies to identify signatures, systematically scanning all of Env. Signatures included single amino acids, glycosylation motifs, and multi-site patterns based on functional or structural groupings of amino acids. We identified signatures near the CCR5 co-receptor-binding region, near the CD4 binding site, and in the signal peptide and cytoplasmic domain, which may influence Env expression and processing. Two signatures patterns associated with transmission were particularly interesting. The first was the most statistically robust signature, located in position 12 in the signal peptide. The second was the loss of an N-linked glycosylation site at positions 413-415; the presence of this site has been recently found to be associated with escape from potent and broad neutralizing antibodies, consistent with enabling a common pathway for immune escape during chronic infection. Its recurrent loss in early infection suggests it may impact fitness at the time of transmission or during early viral expansion. The signature patterns we identified implicate Env expression levels in selection at viral transmission or in early expansion, and suggest that immune evasion patterns that recur in many individuals during chronic infection when antibodies are present can be selected against when the infection is being established prior to the adaptive immune response.


Subject(s)
HIV Infections/genetics , HIV-1/genetics , Mutation, Missense , Protein Sorting Signals/genetics , env Gene Products, Human Immunodeficiency Virus/genetics , Adaptive Immunity , Amino Acid Motifs , Amino Acid Substitution , Antibodies, Viral/immunology , Binding Sites/genetics , CD4 Antigens/genetics , CD4 Antigens/immunology , Chronic Disease , Gene Expression Regulation, Viral/physiology , Glycosylation , HIV Infections/immunology , HIV-1/immunology , HIV-1/pathogenicity , Receptors, CCR5/genetics , Receptors, CCR5/immunology , Retrospective Studies , env Gene Products, Human Immunodeficiency Virus/biosynthesis
3.
J Theor Biol ; 261(2): 341-60, 2009 Nov 21.
Article in English | MEDLINE | ID: mdl-19660475

ABSTRACT

We describe a mathematical model and Monte Carlo (MC) simulation of viral evolution during acute infection. We consider both synchronous and asynchronous processes of viral infection of new target cells. The model enables an assessment of the expected sequence diversity in new HIV-1 infections originating from a single transmitted viral strain, estimation of the most recent common ancestor (MRCA) of the transmitted viral lineage, and estimation of the time to coalesce back to the MRCA. We also calculate the probability of the MRCA being the transmitted virus or an evolved variant. Excluding insertions and deletions, we assume HIV-1 evolves by base substitution without selection pressure during the earliest phase of HIV-1 infection prior to the immune response. Unlike phylogenetic methods that follow a lineage backwards to coalescence, we compare the observed data to a model of the diversification of a viral population forward in time. To illustrate the application of these methods, we provide detailed comparisons of the model and simulations results to 306 envelope sequences obtained from eight newly infected subjects at a single time point. The data from 68 patients were in good agreement with model predictions, and hence compatible with a single-strain infection evolving under no selection pressure. The diversity of the samples from the other two patients was too great to be explained by the model, suggesting multiple HIV-1-strains were transmitted. The model can also be applied to longitudinal patient data to estimate within-host viral evolutionary parameters.


Subject(s)
Evolution, Molecular , HIV Infections/virology , HIV-1/genetics , Models, Genetic , Acute Disease , Female , Genetic Variation , Humans , Male , Monte Carlo Method , Phylogeny
4.
PLoS Pathog ; 5(5): e1000414, 2009 May.
Article in English | MEDLINE | ID: mdl-19424423

ABSTRACT

The pattern of viral diversification in newly infected individuals provides information about the host environment and immune responses typically experienced by the newly transmitted virus. For example, sites that tend to evolve rapidly across multiple early-infection patients could be involved in enabling escape from common early immune responses, could represent adaptation for rapid growth in a newly infected host, or could represent reversion from less fit forms of the virus that were selected for immune escape in previous hosts. Here we investigated the diversification of HIV-1 env coding sequences in 81 very early B subtype infections previously shown to have resulted from transmission or expansion of single viruses (n = 78) or two closely related viruses (n = 3). In these cases, the sequence of the infecting virus can be estimated accurately, enabling inference of both the direction of substitutions as well as distinction between insertion and deletion events. By integrating information across multiple acutely infected hosts, we find evidence of adaptive evolution of HIV-1 env and identify a subset of codon sites that diversified more rapidly than can be explained by a model of neutral evolution. Of 24 such rapidly diversifying sites, 14 were either i) clustered and embedded in CTL epitopes that were verified experimentally or predicted based on the individual's HLA or ii) in a nucleotide context indicative of APOBEC-mediated G-to-A substitutions, despite having excluded heavily hypermutated sequences prior to the analysis. In several cases, a rapidly evolving site was embedded both in an APOBEC motif and in a CTL epitope, suggesting that APOBEC may facilitate early immune escape. Ten rapidly diversifying sites could not be explained by CTL escape or APOBEC hypermutation, including the most frequently mutated site, in the fusion peptide of gp41. We also examined the distribution, extent, and sequence context of insertions and deletions, and we provide evidence that the length variation seen in hypervariable loop regions of the envelope glycoprotein is a consequence of selection and not of mutational hotspots. Our results provide a detailed view of the process of diversification of HIV-1 following transmission, highlighting the role of CTL escape and hypermutation in shaping viral evolution during the establishment of new infections.


Subject(s)
Cytidine Deaminase/metabolism , Evolution, Molecular , HIV Infections/genetics , HIV-1/genetics , env Gene Products, Human Immunodeficiency Virus/genetics , Base Sequence , Cross-Sectional Studies , Epitopes, T-Lymphocyte/genetics , Genes, Viral/physiology , HIV Infections/virology , Humans , INDEL Mutation , Models, Genetic , Molecular Sequence Data , Phylogeny , T-Lymphocytes, Cytotoxic/virology
5.
PLoS One ; 4(5): e5683, 2009 May 25.
Article in English | MEDLINE | ID: mdl-19479085

ABSTRACT

High-throughput sequencing platforms provide an approach for detecting rare HIV-1 variants and documenting more fully quasispecies diversity. We applied this technology to the V3 loop-coding region of env in samples collected from 4 chronically HIV-infected subjects in whom CCR5 antagonist (vicriviroc [VVC]) therapy failed. Between 25,000-140,000 amplified sequences were obtained per sample. Profound baseline V3 loop sequence heterogeneity existed; predicted CXCR4-using populations were identified in a largely CCR5-using population. The V3 loop forms associated with subsequent virologic failure, either through CXCR4 use or the emergence of high-level VVC resistance, were present as minor variants at 0.8-2.8% of baseline samples. Extreme, rapid shifts in population frequencies toward these forms occurred, and deep sequencing provided a detailed view of the rapid evolutionary impact of VVC selection. Greater V3 diversity was observed post-selection. This previously unreported degree of V3 loop sequence diversity has implications for viral pathogenesis, vaccine design, and the optimal use of HIV-1 CCR5 antagonists.


Subject(s)
CCR5 Receptor Antagonists , HIV Infections/drug therapy , HIV Infections/virology , HIV-1/drug effects , HIV-1/physiology , Piperazines/pharmacology , Pyrimidines/pharmacology , Sequence Analysis, DNA/methods , Base Sequence , Biological Evolution , Entropy , HIV-1/genetics , Humans , Likelihood Functions , Nucleic Acid Conformation , Piperazines/therapeutic use , Pyrimidines/therapeutic use , RNA, Viral/blood , RNA, Viral/chemistry , Reproducibility of Results , Selection, Genetic , Sequence Alignment , Time Factors , Treatment Failure
6.
Proc Natl Acad Sci U S A ; 105(21): 7552-7, 2008 May 27.
Article in English | MEDLINE | ID: mdl-18490657

ABSTRACT

The precise identification of the HIV-1 envelope glycoprotein (Env) responsible for productive clinical infection could be instrumental in elucidating the molecular basis of HIV-1 transmission and in designing effective vaccines. Here, we developed a mathematical model of random viral evolution and, together with phylogenetic tree construction, used it to analyze 3,449 complete env sequences derived by single genome amplification from 102 subjects with acute HIV-1 (clade B) infection. Viral env genes evolving from individual transmitted or founder viruses generally exhibited a Poisson distribution of mutations and star-like phylogeny, which coalesced to an inferred consensus sequence at or near the estimated time of virus transmission. Overall, 78 of 102 subjects had evidence of productive clinical infection by a single virus, and 24 others had evidence of productive clinical infection by a minimum of two to five viruses. Phenotypic analysis of transmitted or early founder Envs revealed a consistent pattern of CCR5 dependence, masking of coreceptor binding regions, and equivalent or modestly enhanced resistance to the fusion inhibitor T1249 and broadly neutralizing antibodies compared with Envs from chronically infected subjects. Low multiplicity infection and limited viral evolution preceding peak viremia suggest a finite window of potential vulnerability of HIV-1 to vaccine-elicited immune responses, although phenotypic properties of transmitted Envs pose a formidable defense.


Subject(s)
Disease Transmission, Infectious , Evolution, Molecular , HIV Infections/transmission , HIV Infections/virology , HIV-1/genetics , env Gene Products, Human Immunodeficiency Virus/genetics , AIDS Vaccines/immunology , Base Sequence , Genetic Variation , HIV Antibodies/immunology , HIV Infections/immunology , HIV-1/isolation & purification , HIV-1/physiology , Humans , Models, Biological , Molecular Sequence Data , Mutation , Phylogeny , RNA, Viral/blood , RNA, Viral/genetics , Receptors, CCR5/metabolism , Sequence Analysis, RNA , env Gene Products, Human Immunodeficiency Virus/immunology
7.
Science ; 315(5818): 1583-6, 2007 Mar 16.
Article in English | MEDLINE | ID: mdl-17363674

ABSTRACT

Escape from T cell-mediated immune responses affects the ongoing evolution of rapidly evolving viruses such as HIV. By applying statistical approaches that account for phylogenetic relationships among viral sequences, we show that viral lineage effects rather than immune escape often explain apparent human leukocyte antigen (HLA)-mediated immune-escape mutations defined by older analysis methods. Phylogenetically informed methods identified immune-susceptible locations with greatly improved accuracy, and the associations we identified with these methods were experimentally validated. This approach has practical implications for understanding the impact of host immunity on pathogen evolution and for defining relevant variants for inclusion in vaccine antigens.


Subject(s)
Founder Effect , HIV Infections/virology , HIV-1/genetics , HIV-1/immunology , HLA Antigens/genetics , Polymorphism, Genetic , Alleles , Antigen Presentation , Epitopes , Evolution, Molecular , Genes, MHC Class I , Genes, Viral , HIV Infections/immunology , HIV-1/classification , HLA Antigens/immunology , HLA-C Antigens/genetics , Humans , Immune Tolerance , Likelihood Functions , Mutation , Phenotype , Phylogeny , T-Lymphocytes, Cytotoxic/immunology
8.
J Acquir Immune Defic Syndr ; 41(4): 399-404, 2006 Apr 01.
Article in English | MEDLINE | ID: mdl-16652046

ABSTRACT

New York City (NYC) is one of the original foci of the HIV-1 epidemic and has a greater number of AIDS cases than any other city in the United States. NYC also hosts the highest number of immigrants among the nation's cities: more than 2 million among a total population of 8 million. Such a high rate of immigration could act as a potential source for introducing and disseminating novel HIV-1 strains into the United States. Our current study focuses on the genetic characterization of HIV-1 strains circulating in an immigrant population in NYC. Of the 505 HIV-1-positive specimens obtained, 196 were available for viral sequencing from the C2 to V3 region of env. Phylogenetic analysis using maximum-likelihood and neighbor-joining methods demonstrated that non-B subtypes and circulating recombinant forms (CRFs) accounted for 43.4% (85 of 196 cases), whereas the remaining 56.6% (111 of 196) cases had viral variants similar to the typical North American subtype B virus. Of those non-B subtypes and CRFs, subtype A and CRF02 dominated (63.5% combined); other subtypes, including C, D, F1, G, CRF01_AE, and CRF06_cpx, were also detected. Two HIV-1 sequences do not cluster with any known subtypes or CRFs. Furthermore, the distribution of non-B subtypes and CRFs was consistent with the countries of origin, suggesting that many of the study subjects were likely infected in their home country before they entered the United States. Subtype B viruses identified in the immigrant population showed no significant differences from the typical North American B subtype, however, indicating that a significant proportion of the immigrants must have been infected after they came to the United States. Public health officials and physicians should be aware of the growing genetic diversity of HIV-1 in this country, particularly in areas with sizable immigrant populations.


Subject(s)
Emigration and Immigration , Genome, Viral , HIV Infections/virology , HIV-1/classification , HIV-1/genetics , Adolescent , Adult , Aged , Child , Child, Preschool , Genetic Variation , Genotype , HIV Envelope Protein gp160/genetics , HIV-1/isolation & purification , Humans , Middle Aged , New York City , Phylogeny , Polymerase Chain Reaction , Recombination, Genetic , Sequence Analysis, DNA , Sequence Homology
9.
AIDS Res Hum Retroviruses ; 21(11): 955-60, 2005 Nov.
Article in English | MEDLINE | ID: mdl-16386113

ABSTRACT

A nearly full-length genome sequence of an HIV-1 isolate originating from Cameroon, 02CM.3226MN, was found to cluster together with previously reported CRF13 sequences 96CM-4164 and 96CM-1849. Similarity plotting, bootscanning, breakpoint analysis, and phylogenetic trees confirmed similar genomic structures with almost identical breakpoint positions among these three isolates. Thus, CRF13 now fulfills the HIV-1 nomenclature requirements. A X2 analysis across all three genomes simultaneously was applied to more accurately determine breakpoints and address the uncertainty in such estimates. Some fragments were found to be difficult to classify, as indicated by a low branching index (BI), due to limited knowledge about parental and reference subtype sequences. One fragment with low BI association to reference subtype J sequences (BI = 0.27, cut-off for subtype classification >0.55) was found to be closer to J fragments of CRF11 similar to the way that A1-A2 and F1-F2 subsubtypes associate. This suggests that subtype J may need to be reclassified into subsubtypes J1 and J2. The CRF13 genome consists of fragments from subtypes A1, G, and both J1 and J2 as well as CRF01 and one region that was left unclassified.


Subject(s)
HIV Infections/virology , HIV-1/classification , HIV-1/genetics , Recombination, Genetic , Cameroon , Genome, Viral , Genotype , HIV-1/isolation & purification , Humans , Phylogeny , Sequence Analysis, DNA , Sequence Homology
10.
J Infect Dis ; 190(4): 697-701, 2004 Aug 15.
Article in English | MEDLINE | ID: mdl-15272396

ABSTRACT

To investigate African long-term survivors (LTSs) infected with non-subtype B human immunodeficiency virus type 1 (HIV-1), we obtained full-length HIV-1 RNA sequences and immunogenetic profiles from 6 untreated women enrolled in the Pumwani Sex Worker Cohort in Nairobi, Kenya. There were no discernible sequence changes likely to cause attenuation. CCR2-V64I, an immunogenetic polymorphism linked to LTSs, was detected in 4 women, all of whom carried the HLA B58 allele. Further investigation of 99 HIV-1-infected Nairobi women found an association between CCR2-V64I and HLA B58 (P=.0048). Studying the interaction among immunogenetics, immune responses, and viral sequences from all HIV-1 subtypes may increase our understanding of slow HIV-1 disease progression.


Subject(s)
HIV Infections/epidemiology , HIV Infections/genetics , HIV-1/genetics , Occupational Diseases/epidemiology , RNA, Viral/genetics , Sex Work , Adult , Alleles , Chemokine CCL2/genetics , Cohort Studies , Female , Genotype , HIV Infections/blood , HIV-1/pathogenicity , HLA Antigens/genetics , Humans , Kenya/epidemiology , Molecular Sequence Data , Polymorphism, Genetic , RNA, Viral/blood , Receptors, CCR2 , Receptors, Chemokine/genetics
11.
Glycobiology ; 14(12): 1229-46, 2004 Dec.
Article in English | MEDLINE | ID: mdl-15175256

ABSTRACT

Human and simian immunodeficiency viruses (HIV and SIV), influenza virus, and hepatitis C virus (HCV) have heavily glycosylated, highly variable surface proteins. Here we explore N-linked glycosylation site (sequon) variation at the population level in these viruses, using a new Web-based program developed to facilitate the sequon tracking and to define patterns (www.hiv.lanl.gov). This tool allowed rapid visualization of the two distinctive patterns of sequon variation found in HIV-1, HIV-2, and SIV CPZ. The first pattern (fixed) describes readily aligned sites that are either simply present or absent. These sites tend to be occupied by high-mannose glycans. The second pattern (shifting) refers to sites embedded in regions of extreme local length variation and is characterized by shifts in terms of the relative position and local density of sequons; these sites tend to be populated by complex carbohydrates. HIV, with its extreme variation in number and precise location of sequons, does not have a net increase in the number of sites over time at the population level. Primate lentiviral lineages have host species-dependent levels of sequon shifting, with HIV-1 in humans the most extreme. HCV E1 and E2 proteins, despite evolving extremely rapidly through point mutation, show limited sequon variation, although two shifting sites were identified. Human influenza A hemagglutinin H3 HA1 is accumulating sequons over time, but this trend is not evident in any other avian or human influenza A serotypes.


Subject(s)
Glycoproteins/chemistry , HIV/chemistry , Hemagglutinin Glycoproteins, Influenza Virus/chemistry , Hepacivirus/chemistry , Simian Immunodeficiency Virus/chemistry , Viral Envelope Proteins/chemistry , Animals , Binding Sites/genetics , Evolution, Molecular , Genetic Variation , Glycoproteins/genetics , Glycosylation , HIV Envelope Protein gp120/chemistry , HIV Envelope Protein gp120/genetics , Hemagglutinin Glycoproteins, Influenza Virus/genetics , Humans , Phenotype , Phylogeny , Species Specificity , Time , Viral Envelope Proteins/genetics
12.
J Virol ; 76(17): 8757-68, 2002 Sep.
Article in English | MEDLINE | ID: mdl-12163596

ABSTRACT

The human cytotoxic T-lymphocyte (CTL) response to human immunodeficiency virus type 1 (HIV-1) has been intensely studied, and hundreds of CTL epitopes have been experimentally defined, published, and compiled in the HIV Molecular Immunology Database. Maps of CTL epitopes on HIV-1 protein sequences reveal that defined epitopes tend to cluster. Here we integrate the global sequence and immunology databases to systematically explore the relationship between HIV-1 amino acid sequences and CTL epitope distributions. CTL responses to five HIV-1 proteins, Gag p17, Gag p24, reverse transcriptase (RT), Env, and Nef, have been particularly well characterized in the literature to date. Through comparing CTL epitope distributions in these five proteins to global protein sequence alignments, we identified distinct characteristics of HIV amino acid sequences that correlate with CTL epitope localization. First, experimentally defined HIV CTL epitopes are concentrated in relatively conserved regions. Second, the highly variable regions that lack epitopes bear cumulative evidence of past immune escape that may make them relatively refractive to CTLs: a paucity of predicted proteasome processing sites and an enrichment for amino acids that do not serve as C-terminal anchor residues. Finally, CTL epitopes are more highly concentrated in alpha-helical regions of proteins. Based on amino acid sequence characteristics, in a blinded fashion, we predicted regions in HIV regulatory and accessory proteins that would be likely to contain CTL epitopes; these predictions were then validated by comparison to new sets of experimentally defined epitopes in HIV-1 Rev, Tat, Vif, and Vpr.


Subject(s)
Epitopes, T-Lymphocyte/immunology , Genetic Variation , HIV Infections/immunology , HIV-1/immunology , T-Lymphocytes, Cytotoxic/immunology , Viral Proteins/genetics , Amino Acid Sequence , Antigenic Variation , Cysteine Endopeptidases/metabolism , Epitope Mapping , Epitopes, T-Lymphocyte/chemistry , HIV Infections/virology , HIV-1/classification , HIV-1/genetics , HIV-1/pathogenicity , Humans , Multienzyme Complexes/metabolism , Proteasome Endopeptidase Complex , Sequence Alignment , Viral Proteins/classification , Viral Proteins/immunology
13.
Science ; 296(5577): 2354-60, 2002 Jun 28.
Article in English | MEDLINE | ID: mdl-12089434

ABSTRACT

Globally, human immunodeficiency virus-type 1 (HIV-1) is extraordinarily variable, and this diversity poses a major obstacle to AIDS vaccine development. Currently, candidate vaccines are derived from isolates, with the hope that they will be sufficiently cross-reactive to protect against circulating viruses. This may be overly optimistic, however, given that HIV-1 envelope proteins can differ in more than 30% of their amino acids. To contend with the diversity, country-specific vaccines are being considered, but evolutionary relationships may be more useful than regional considerations. Consensus or ancestor sequences could be used in vaccine design to minimize the genetic differences between vaccine strains and contemporary isolates, effectively reducing the extent of diversity by half.


Subject(s)
AIDS Vaccines , Genetic Variation , HIV-1/genetics , HIV-1/immunology , Phylogeny , AIDS Vaccines/immunology , Base Sequence , Consensus Sequence , Cross Reactions , Evolution, Molecular , Gene Products, env/chemistry , Gene Products, env/genetics , Gene Products, env/immunology , Genome, Viral , HIV Antibodies/biosynthesis , HIV Antibodies/immunology , HIV Antigens/genetics , HIV Antigens/immunology , HIV Infections/epidemiology , HIV Infections/prevention & control , HIV Infections/virology , HIV-1/classification , Humans , Likelihood Functions
SELECTION OF CITATIONS
SEARCH DETAIL
...