RESUMO
Viral hemorrhagic septicemia virus (VHSV), a rhabdovirus infecting teleost fish, has repeatedly crossed the boundary from marine fish species to freshwater cultured rainbow trout. These naturally replicated cross-species transmission events permit the study of general and repeatable evolutionary events occurring in connection with viral emergence in a novel host species. The purpose of the present study was to investigate the adaptive molecular evolution of the VHSV glycoprotein, one of the key virus proteins involved in viral emergence, following emergence from marine species into freshwater cultured rainbow trout. A comprehensive phylogenetic reconstruction of the complete coding region of the VHSV glycoprotein was conducted, and adaptive molecular evolution was investigated using a maximum likelihood approach to compare different codon substitution models allowing for heterogeneous substitution rate ratios among amino acid sites. Evidence of positive selection was detected at six amino acid sites of the VHSV glycoprotein, within the signal peptide, the confirmation-dependent major neutralizing epitope, and the intracellular tail. Evidence of positive selection was found exclusively in rainbow trout-adapted virus isolates, and amino acid combinations found at the six sites under positive selection pressure differentiated rainbow trout- from non-rainbow trout-adapted isolates. Furthermore, four adaptive sites revealed signs of recurring identical changes across phylogenetic groups of rainbow trout-adapted isolates, suggesting that repeated VHSV emergence in freshwater cultured rainbow trout was established through convergent routes of evolution that are associated with immune escape.IMPORTANCE This study is the first to demonstrate that VHSV emergence from marine species into freshwater cultured rainbow trout has been accompanied by bursts of adaptive evolution in the VHSV glycoprotein. Furthermore, repeated detection of the same adaptive amino acid sites across phylogenetic groups of rainbow trout-adapted isolates indicates that adaptation to rainbow trout was established through parallel evolution. In addition, signals of convergent evolution toward the maintenance of genetic variation were detected in the conformation-dependent neutralizing epitope or in close proximity to disulfide bonds involved in the structural conformation of the neutralizing epitope, indicating adaptation to immune response-related genetic variation across freshwater cultured rainbow trout.
Assuntos
Doenças dos Peixes/transmissão , Glicoproteínas/genética , Septicemia Hemorrágica Viral/transmissão , Novirhabdovirus/genética , Oncorhynchus mykiss/virologia , Infecções por Rhabdoviridae/veterinária , Adaptação Biológica/genética , Substituição de Aminoácidos/genética , Animais , Evolução Molecular , Doenças dos Peixes/virologia , Septicemia Hemorrágica Viral/virologia , Novirhabdovirus/patogenicidade , Infecções por Rhabdoviridae/transmissão , Infecções por Rhabdoviridae/virologia , Especificidade da EspécieRESUMO
Existing computational methods for RNA secondary-structure prediction tacitly assume RNA to only encode functional RNA structures. However, experimental studies have revealed that some RNA sequences, e.g. compact viral genomes, can simultaneously encode functional RNA structures as well as proteins, and evidence is accumulating that this phenomenon may also be found in Eukaryotes. We here present the first comparative method, called RNA-DECODER, which explicitly takes the known protein-coding context of an RNA-sequence alignment into account in order to predict evolutionarily conserved secondary-structure elements, which may span both coding and non-coding regions. RNA-DECODER employs a stochastic context-free grammar together with a set of carefully devised phylogenetic substitution-models, which can disentangle and evaluate the different kinds of overlapping evolutionary constraints which arise. We show that RNA-DECODER's parameters can be automatically trained to successfully fold known secondary structures within the HCV genome. We scan the genomes of HCV and polio virus for conserved secondary-structure elements, and analyze performance as a function of available evolutionary information. On known secondary structures, RNA-DECODER shows a sensitivity similar to the programs MFOLD, PFOLD and RNAALIFOLD. When scanning the entire genomes of HCV and polio virus for structure elements, RNA-DECODER's results indicate a markedly higher specificity than MFOLD, PFOLD and RNAALIFOLD.
Assuntos
Códon/química , RNA/química , Análise de Sequência de RNA/métodos , Evolução Molecular , Hepacivirus/genética , Conformação de Ácido Nucleico , Filogenia , Poliovirus/genética , Proteínas/genética , RNA Viral/química , Alinhamento de Sequência , Software , Processos EstocásticosRESUMO
BACKGROUND: Time-structured genetic samples are a valuable source of information in population genetics because they provide several correlated observations of the underlying evolutionary processes. In this paper we study basic properties of the genetic variation in time-structured samples as reflected in the genealogies relating individuals and the number of segregating sites observed. Our emphasis is on "measurably evolving populations" i.e. populations from which it is possible to obtain time-structured samples that span a significant interval of evolutionary time. RESULTS: We use results from the coalescent process to derive properties of time-structured samples. In the first section we extend existing results to attain measures on coalescent trees relating time-structured samples. These include the expected time to a most recent common ancestor, the expected total branch length and the expected length of branches subtending only ancient individuals. The effect of different sampling schemes on the latter measure is studied. In the second section we study the special case where the full sample consists of a group of contemporary extant samples and a group of contemporary ancient samples. As regards this case, we present results and applications concerning the probability distribution of the number of segregating sites where a mutation is unique to the ancient individuals and the number of segregating sites where a mutation is shared between ancient and extant individuals. CONCLUSION: The methodology and results presented here is of use to the design and interpretation of ancient DNA experiments. Furthermore, the results may be useful in further development of statistical tests of e.g. population dynamics and selection, which include temporal information.
Assuntos
Evolução Biológica , Modelos Genéticos , Mutação , Animais , DNA Mitocondrial/genética , Humanos , Linhagem , Probabilidade , TempoRESUMO
This study presents a new computer program for assessing the effects of different factors and sequencing strategies on de novo sequence assembly. The program uses reads from actual sequencing studies or from simulations with a reference genome that may also be real or simulated. The simulated reads can be created with our read simulator. They can be of differing length and coverage, consist of paired reads with varying distance, and include sequencing errors such as color space miscalls to imitate SOLiD data. The simulated or real reads are mapped to their reference genome and our assembly simulator is then used to obtain optimal assemblies that are limited only by the distribution of repeats. By way of this mapping, the assembly simulator determines which contigs are theoretically possible, or conversely (and perhaps more importantly), which are not. We illustrate the application and utility of our new simulation tools with several experiments that test the effects of genome complexity (repeats), read length and coverage, word size in De Bruijn graph assembly, and alternative sequencing strategies (e.g., BAC pooling) on sequence assemblies. These experiments highlight just some of the uses of our simulators in the experimental design of sequencing projects and in the further development of assembly algorithms.
RESUMO
Porcine reproductive and respiratory syndrome virus (PRRSV) recently emerged in domestic pigs of Western Europe and North America. Although time of emergence was identical on the two continents, genetic composition was markedly different with a clear geographical subtype structure, indicating that subtypes diverged in separate reservoirs prior to emergence. Genetic analyses have shown that the most recent common ancestor (MRCA) of Western European isolates existed around 1980 and that these originate from Eastern European pigs. These findings are challenged by a study of Hanada et al. who place the MRCA of all PRRSV isolates around 1980 and find that no significant subtype divergence occurred before emergence. Here, I discuss problems of information content, methodology, and biological plausibility associated with this study. Using alternative methodology, I reanalyze the existing data and conclude that the MRCA of all PRRSV isolates existed around 1880, 100 years before the date estimated by Hanada et al.
Assuntos
Evolução Molecular , Variação Genética , Filogenia , Vírus da Síndrome Respiratória e Reprodutiva Suína/genética , Sus scrofa/virologia , Animais , Europa (Continente) , Geografia , América do Norte , Especificidade da EspécieRESUMO
Parasites sometimes expand their host range by acquiring a new host species. After a host change event, the selective regime acting on a given parasite gene may change as a result of host-specific adaptive alterations of protein functionality or host-specific immune-mediated selection. We present a codon-based model that attempts to include these effects by allowing the position-specific substitution process to change in conjunction with a host change event. Following maximum-likelihood parameter estimation, we employ an empirical Bayesian procedure to identify candidate sites potentially involved in host-specific adaptation. We discuss the applicability of the model to the more general problem of ascertaining whether the selective regime differs in two groups of related organisms. The utility of the model is illustrated on a data set of nucleoprotein sequences from the influenza A virus obtained from avian and human hosts.
Assuntos
Códon/genética , Evolução Molecular , Vírus da Influenza A/genética , Filogenia , Seleção Genética , Animais , Variação Genética , Humanos , Funções Verossimilhança , Modelos GenéticosRESUMO
Viral haemorrhagic septicaemia (VHS) caused by the rhabdovirus VHSV is economically the most important viral disease in European rainbow trout farming. Until 1989, this virus was mainly isolated from freshwater salmonids but in the last decade, it has also been isolated from an increasing number of free-living marine fish species. To study the genetic evolution of VHSV, the entire G gene from 74 isolates was analysed. VHSV from wild marine species caught in the Baltic Sea, Skagerrak, Kattegat, North Sea, and English Channel and European freshwater isolates, appeared to share a recent common ancestor. Based on the estimated nucleotide substitution rate, the ancestor of the European fresh water isolates was dated some 50 years ago. This finding fits with the initial reports in the 1950s on clinical observations of VHS in Danish freshwater rainbow trout farms. The study also indicates that European marine VHSV and the North American marine line separated approx. 500 years ago. The codon substitution rate among the freshwater VHSV isolates was found to be 2.5 times faster than among marine isolates. The data support the hypothesis of the marine environment being the original reservoir of VHSV and that the change in host range (to include rainbow trout) may have occurred several times. Virus from the marine environment will therefore continue to represent a threat to the trout aquaculture industry.
Assuntos
Doenças dos Peixes/virologia , Peixes/virologia , Novirhabdovirus/genética , Infecções por Rhabdoviridae/veterinária , Animais , Aquicultura , Evolução Biológica , Códon , Europa (Continente) , Indústria Alimentícia , Água Doce , Novirhabdovirus/isolamento & purificação , Oceanos e Mares , Oncorhynchus mykiss/virologia , Filogenia , Infecções por Rhabdoviridae/virologiaRESUMO
Here we present a model of nucleotide substitution in protein-coding regions that also encode the formation of conserved RNA structures. In such regions, apparent evolutionary context dependencies exist, both between nucleotides occupying the same codon and between nucleotides forming a base pair in the RNA structure. The overlap of these fundamental dependencies is sufficient to cause "contagious" context dependencies which cascade across many nucleotide sites. Such large-scale dependencies challenge the use of traditional phylogenetic models in evolutionary inference because they explicitly assume evolutionary independence between short nucleotide tuples. In our model we address this by replacing context dependencies within codons by annotation-specific heterogeneity in the substitution process. Through a general procedure, we fragment the alignment into sets of short nucleotide tuples based on both the protein coding and the structural annotation. These individual tuples are assumed to evolve independently, and the different tuple sets are assigned different annotation-specific substitution models shared between their members. This allows us to build a composite model of the substitution process from components of traditional phylogenetic models. We applied this to a data set of full-genome sequences from the hepatitis C virus where five RNA structures are mapped within the coding region. This allowed us to partition the effects of selection on different structural elements and to test various hypotheses concerning the relation of these effects. Of particular interest, we found evidence of a functional role of loop and bulge regions, as these were shown to evolve according to a different and more constrained selective regime than the nonpairing regions outside the RNA structures. Other potential applications of the model include comparative RNA structure prediction in coding regions and RNA virus phylogenetics.
Assuntos
Sequência Conservada , Evolução Molecular , Proteínas/genética , RNA/genética , Análise de Sequência de RNA , Sequência de Bases , Modelos Genéticos , Conformação de Ácido Nucleico , Filogenia , Alinhamento de SequênciaRESUMO
A live attenuated porcine reproductive and respiratory syndrome (PRRS) vaccine virus has been shown to revert to virulence under field conditions. In order to identify genetic virulence determinants, ORF1 from the attenuated vaccine virus and three Danish vaccine-derived field isolates was sequenced and compared with the parental strain of the vaccine virus (VR2332). This revealed five mutations that had occurred independently in all three vaccine-derived field isolates, indicating strong parallel selective pressure on these positions in the vaccine virus when used in swine herds. Two of these parallel mutations were direct reversions to the parental VR2332 sequence and were situated in a papain-like cysteine protease domain and in the helicase domain. The remaining parallel mutations might be seen as second-site compensatory mutations for one or more of the mutations that accumulated in the vaccine virus sequence during cell-culture adaptation. Evaluation of the remaining mutations in the ORF1 sequence revealed stronger selective pressure for amino acid conservation during spread in pigs than during vaccine production. Furthermore, it was found that the selective pressure did not change during the time period studied. The implications of these findings for PRRS vaccine attenuation and reversion are discussed.
Assuntos
Evolução Molecular , Síndrome Respiratória e Reprodutiva Suína/virologia , Vírus da Síndrome Respiratória e Reprodutiva Suína/genética , Vírus da Síndrome Respiratória e Reprodutiva Suína/patogenicidade , Supressão Genética/genética , Vacinas Atenuadas/genética , Vacinas Virais/genética , Sequência de Aminoácidos , Animais , Sequência de Bases , DNA Helicases/química , DNA Helicases/genética , Análise Mutacional de DNA , Genes Virais/genética , Dados de Sequência Molecular , Fases de Leitura Aberta/genética , Papaína/química , Filogenia , Mutação Puntual/genética , Reação em Cadeia da Polimerase , Síndrome Respiratória e Reprodutiva Suína/imunologia , Vírus da Síndrome Respiratória e Reprodutiva Suína/enzimologia , Vírus da Síndrome Respiratória e Reprodutiva Suína/imunologia , Estrutura Terciária de Proteína , Seleção Genética , Suínos , Vacinas Atenuadas/efeitos adversos , Vacinas Atenuadas/imunologia , Proteínas Virais/química , Proteínas Virais/genética , Vacinas Virais/efeitos adversos , Vacinas Virais/imunologia , Virulência/genéticaRESUMO
The estimation of evolutionary rates from serially sampled sequences has recently been the focus of several studies. In this paper, we extend these analyzes to allow the estimation of a joint rate of substitution, omega, from several evolving populations from which serial samples are drawn. In the case of viruses evolving in different hosts, therapy may halt replication and therefore the accumulation of substitutions in the population. In such cases, it may be that only a proportion, p, of subjects are nonresponders who have viral populations that continue to evolve. We develop two likelihood-based procedures to jointly estimate p and omega, and empirical Bayes' tests of whether an individual should be classified as a responder or nonresponder. An example data set comprising HIV-1 partial envelope sequences from six patients on highly active antiretroviral therapy is analyzed.
Assuntos
Evolução Molecular , Algoritmos , Terapia Antirretroviral de Alta Atividade/estatística & dados numéricos , Teorema de Bayes , HIV-1/genética , Humanos , Funções Verossimilhança , Modelos Genéticos , FilogeniaRESUMO
Porcine reproductive and respiratory syndrome virus (PRRSV) is a recently emerged pathogen. Two PRRSV genotypes exist, North American and European, which are only 55-70% identical at the nucleotide level. Previous studies have shown high nucleotide diversity in the North American genotype and low nucleotide diversity in the European genotype. Here, we analyzed the ORF5 and ORF7 genes for a large number of new European type PRRSV isolates in conjunction with existing database sequences. This new analysis showed that contrary to previous assumptions, genetic diversity is at least as high in the European genotype as in the North American genotype. Furthermore, we showed that genetic diversity of European type PRRSV has a marked geographical pattern, with exceptionally high genetic diversity among Italian sequences. The geographical pattern of diversity in relation to the epidemiology of PRRSV in Europe is discussed. Discrepancies between ORF5- and ORF7-based genealogies were observed, and further analysis of the data set confirmed the presence of recombination. We were therefore able to report the first observation of recombination in wild-type isolates of European genotype PRRSV.