RESUMO
All vertebrate genomes have been colonized by retroviruses along their evolutionary trajectory. Although endogenous retroviruses (ERVs) can contribute important physiological functions to contemporary hosts, such benefits are attributed to long-term coevolution of ERV and host because germline infections are rare and expansion is slow, and because the host effectively silences them. The genomes of several outbred species including mule deer (Odocoileus hemionus) are currently being colonized by ERVs, which provides an opportunity to study ERV dynamics at a time when few are fixed. We previously established the locus-specific distribution of cervid ERV (CrERV) in populations of mule deer. In this study, we determine the molecular evolutionary processes acting on CrERV at each locus in the context of phylogenetic origin, genome location, and population prevalence. A mule deer genome was de novo assembled from short- and long-insert mate pair reads and CrERV sequence generated at each locus. We report that CrERV composition and diversity have recently measurably increased by horizontal acquisition of a new retrovirus lineage. This new lineage has further expanded CrERV burden and CrERV genomic diversity by activating and recombining with existing CrERV. Resulting interlineage recombinants then endogenize and subsequently expand. CrERV loci are significantly closer to genes than expected if integration were random and gene proximity might explain the recent expansion of one recombinant CrERV lineage. Thus, in mule deer, retroviral colonization is a dynamic period in the molecular evolution of CrERV that also provides a burst of genomic diversity to the host population.
Assuntos
Cervos , Retrovirus Endógenos , Animais , Evolução Biológica , Cervos/genética , Retrovirus Endógenos/genética , Evolução Molecular , Filogenia , Recombinação GenéticaRESUMO
T-cell large granular lymphocyte (T-LGL) leukaemia is characterized by a clonal proliferation of cytotoxic T cells and is frequently associated with rheumatoid arthritis. Sera from some LGL leukaemia patients react to a portion of the human T-cell leukaemia virus (HTLV-1/2) transmembrane envelope protein, BA21, although HTLV-1/2 infection is rare in LGL leukaemia patients. Here we show that family members, including spouses, of an LGL leukaemia patient had elevated LGL counts, BA21 reactivity and, additionally, recognition of HIV-1 gp41. Thus, both LGL leukaemia patients and clinically normal contacts sharing the same environment have evidence of exposure to a retrovirus.
Assuntos
Proteína gp41 do Envelope de HIV , HIV-1 , Vírus Linfotrópico T Tipo 1 Humano , Vírus Linfotrópico T Tipo 2 Humano , Leucemia Linfocítica Granular Grande , Linfócitos T Citotóxicos , Feminino , Proteína gp41 do Envelope de HIV/sangue , Proteína gp41 do Envelope de HIV/imunologia , HIV-1/imunologia , HIV-1/metabolismo , Vírus Linfotrópico T Tipo 1 Humano/imunologia , Vírus Linfotrópico T Tipo 1 Humano/metabolismo , Vírus Linfotrópico T Tipo 2 Humano/imunologia , Vírus Linfotrópico T Tipo 2 Humano/metabolismo , Humanos , Leucemia Linfocítica Granular Grande/sangue , Leucemia Linfocítica Granular Grande/imunologia , Masculino , Linfócitos T Citotóxicos/imunologia , Linfócitos T Citotóxicos/metabolismoRESUMO
Human Endogenous Retrovirus type K (HERV-K) is the only HERV known to be insertionally polymorphic; not all individuals have a retrovirus at a specific genomic location. It is possible that HERV-Ks contribute to human disease because people differ in both number and genomic location of these retroviruses. Indeed viral transcripts, proteins, and antibody against HERV-K are detected in cancers, auto-immune, and neurodegenerative diseases. However, attempts to link a polymorphic HERV-K with any disease have been frustrated in part because population prevalence of HERV-K provirus at each polymorphic site is lacking and it is challenging to identify closely related elements such as HERV-K from short read sequence data. We present an integrated and computationally robust approach that uses whole genome short read data to determine the occupation status at all sites reported to contain a HERV-K provirus. Our method estimates the proportion of fixed length genomic sequence (k-mers) from whole genome sequence data matching a reference set of k-mers unique to each HERV-K locus and applies mixture model-based clustering of these values to account for low depth sequence data. Our analysis of 1000 Genomes Project Data (KGP) reveals numerous differences among the five KGP super-populations in the prevalence of individual and co-occurring HERV-K proviruses; we provide a visualization tool to easily depict the proportion of the KGP populations with any combination of polymorphic HERV-K provirus. Further, because HERV-K is insertionally polymorphic, the genome burden of known polymorphic HERV-K is variable in humans; this burden is lowest in East Asian (EAS) individuals. Our study identifies population-specific sequence variation for HERV-K proviruses at several loci. We expect these resources will advance research on HERV-K contributions to human diseases.
Assuntos
Retrovirus Endógenos/genética , Genética Populacional/métodos , Genômica/métodos , Provírus/genética , Grupos Raciais/genética , Algoritmos , Genoma Humano/genética , Genoma Viral/genética , Humanos , Epidemiologia Molecular , SoftwareRESUMO
Uncovering the genetic and evolutionary basis of local adaptation is a major focus of evolutionary biology. The recent development of cost-effective methods for obtaining high-quality genome-scale data makes it possible to identify some of the loci responsible for adaptive differences among populations. Two basic approaches for identifying putatively locally adaptive loci have been developed and are broadly used: one that identifies loci with unusually high genetic differentiation among populations (differentiation outlier methods) and one that searches for correlations between local population allele frequencies and local environments (genetic-environment association methods). Here, we review the promises and challenges of these genome scan methods, including correcting for the confounding influence of a species' demographic history, biases caused by missing aspects of the genome, matching scales of environmental data with population structure, and other statistical considerations. In each case, we make suggestions for best practices for maximizing the accuracy and efficiency of genome scans to detect the underlying genetic basis of local adaptation. With attention to their current limitations, genome scan methods can be an important tool in finding the genetic basis of adaptive evolutionary change.
Assuntos
Adaptação Fisiológica , Frequência do Gene , Genética Populacional , Animais , Genoma , Genômica , Seleção GenéticaRESUMO
BACKGROUND: Infection with feline immunodeficiency virus (FIV) causes an immunosuppressive disease whose consequences are less severe if cats are co-infected with an attenuated FIV strain (PLV). We use virus diversity measurements, which reflect replication ability and the virus response to various conditions, to test whether diversity of virulent FIV in lymphoid tissues is altered in the presence of PLV. Our data consisted of the 3' half of the FIV genome from three tissues of animals infected with FIV alone, or with FIV and PLV, sequenced by 454 technology. RESULTS: Since rare variants dominate virus populations, we had to carefully distinguish sequence variation from errors due to experimental protocols and sequencing. We considered an exponential-normal convolution model used for background correction of microarray data, and modified it to formulate an error correction approach for minor allele frequencies derived from high-throughput sequencing. Similar to accounting for over-dispersion in counts, this accounts for error-inflated variability in frequencies - and quite effectively reproduces empirically observed distributions. After obtaining error-corrected minor allele frequencies, we applied ANalysis Of VAriance (ANOVA) based on a linear mixed model and found that conserved sites and transition frequencies in FIV genes differ among tissues of dual and single infected cats. Furthermore, analysis of minor allele frequencies at individual FIV genome sites revealed 242 sites significantly affected by infection status (dual vs. single) or infection status by tissue interaction. All together, our results demonstrated a decrease in FIV diversity in bone marrow in the presence of PLV. Importantly, these effects were weakened or undetectable when error correction was performed with other approaches (thresholding of minor allele frequencies; probabilistic clustering of reads). We also queried the data for cytidine deaminase activity on the viral genome, which causes an asymmetric increase in G to A substitutions, but found no evidence for this host defense strategy. CONCLUSIONS: Our error correction approach for minor allele frequencies (more sensitive and computationally efficient than other algorithms) and our statistical treatment of variation (ANOVA) were critical for effective use of high-throughput sequencing data in understanding viral diversity. We found that co-infection with PLV shifts FIV diversity from bone marrow to lymph node and spleen.
Assuntos
Doenças do Gato/imunologia , Interpretação Estatística de Dados , Síndrome de Imunodeficiência Adquirida Felina/imunologia , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Vírus da Imunodeficiência Felina/classificação , Vírus da Imunodeficiência Felina/genética , Modelos Estatísticos , Algoritmos , Animais , Doenças do Gato/genética , Doenças do Gato/transmissão , Doenças do Gato/virologia , Gatos , DNA Viral/genética , Síndrome de Imunodeficiência Adquirida Felina/genética , Síndrome de Imunodeficiência Adquirida Felina/virologia , Vírus da Imunodeficiência Felina/patogenicidadeRESUMO
Mobile elements are powerful agents of genomic evolution and can be exceptionally informative markers for investigating species and population-level evolutionary history. While several studies have utilized retrotransposon-based insertional polymorphisms to resolve phylogenies, few population studies exist outside of humans. Endogenous retroviruses are LTR-retrotransposons derived from retroviruses that have become stably integrated in the host genome during past infections and transmitted vertically to subsequent generations. They offer valuable insight into host-virus co-evolution and a unique perspective on host evolutionary history because they integrate into the genome at a discrete point in time. We examined the evolutionary history of a cervid endogenous gammaretrovirus (CrERVγ) in mule deer (Odocoileus hemionus). We sequenced 14 CrERV proviruses (CrERV-in1 to -in14), and examined the prevalence and distribution of 13 proviruses in 262 deer among 15 populations from Montana, Wyoming, and Utah. CrERV absence in white-tailed deer (O. virginianus), identical 5' and 3' long terminal repeat (LTR) sequences, insertional polymorphism, and CrERV divergence time estimates indicated that most endogenization events occurred within the last 200000 years. Population structure inferred from CrERVs (F ST = 0.008) and microsatellites (θ = 0.01) was low, but significant, with Utah, northwestern Montana, and a Helena herd being particularly differentiated. Clustering analyses indicated regional structuring, and non-contiguous clustering could often be explained by known translocations. Cluster ensemble results indicated spatial localization of viruses, specifically in deer from northeastern and western Montana. This study demonstrates the utility of endogenous retroviruses to elucidate and provide novel insight into both ERV evolutionary history and the history of contemporary host populations.
Assuntos
DNA Viral/isolamento & purificação , Cervos/virologia , Retrovirus Endógenos/genética , Retrovirus Endógenos/isolamento & purificação , Genoma Viral , Animais , Análise por Conglomerados , DNA Viral/genética , Cervos/classificação , Evolução Molecular , Marcadores Genéticos , Repetições de Microssatélites , Montana , Mutagênese Insercional , Filogenia , Polimorfismo Genético , Proteínas Recombinantes , Seleção Genética , Análise de Sequência de DNA , Utah , WyomingRESUMO
We present ViPRA-Haplo, a de novo strain-specific assembly workflow for reconstructing viral haplotypes in a viral population from paired-end next generation sequencing (NGS) data. The proposed Viral Path Reconstruction Algorithm (ViPRA) generates a subset of paths from a De Bruijn graph of reads using the pairing information of reads. The paths generated by ViPRA are an over-estimation of the true contigs. We propose two refinement methods to obtain an optimal set of contigs representing viral haplotypes. The first method clusters paths reconstructed by ViPRA using VSEARCH Deorowicz et al. 2015 based on sequence similarity, while the second method, MLEHaplo, generates a maximum likelihood estimate of viral populations. We evaluated our pipeline on both simulated and real viral quasispecies data from HIV (and real data from SARS-COV-2). Experimental results show that ViPRA-Haplo, although still an overestimation in the number of true contigs, outperforms the existing tool, PEHaplo, providing up to 9% better genome coverage on HIV real data. In addition, ViPRA-Haplo also retains higher diversity of the viral population as demonstrated by the presence of a higher percentage of contigs less than 1000 base pairs (bps), which also contain k-mers with counts less than 100 (representing rarer sequences), which are absent in PEHaplo. For SARS-CoV-2 sequencing data, ViPRA-Haplo reconstructs contigs that cover more than 90% of the reference genome and were able to validate known SARS-CoV-2 strains in the sequencing data.
Assuntos
Algoritmos , Genoma Viral , Sequenciamento de Nucleotídeos em Larga Escala , SARS-CoV-2 , Sequenciamento de Nucleotídeos em Larga Escala/métodos , SARS-CoV-2/genética , Genoma Viral/genética , Humanos , Haplótipos/genética , COVID-19/virologia , HIV/genética , Biologia Computacional/métodosRESUMO
Endogenous retroviruses constitute a significant genomic fraction in all mammalian species. Typically they are evolutionarily old and fixed in the host species population. Here we report on a novel endogenous gammaretrovirus (CrERVγ; for cervid endogenous gammaretrovirus) in the mule deer (Odocoileus hemionus) that is insertionally polymorphic among individuals from the same geographical location, suggesting that it has a more recent evolutionary origin. Using PCR-based methods, we identified seven CrERVγ proviruses and demonstrated that they show various levels of insertional polymorphism in mule deer individuals. One CrERVγ provirus was detected in all mule deer sampled but was absent from white-tailed deer, indicating that this virus originally integrated after the split of the two species, which occurred approximately one million years ago. There are, on average, 100 CrERVγ copies in the mule deer genome based on quantitative PCR analysis. A CrERVγ provirus was sequenced and contained intact open reading frames (ORFs) for three virus genes. Transcripts were identified covering the entire provirus. CrERVγ forms a distinct branch of the gammaretrovirus phylogeny, with the closest relatives of CrERVγ being endogenous gammaretroviruses from sheep and pig. We demonstrated that white-tailed deer (Odocoileus virginianus) and elk (Cervus canadensis) DNA contain proviruses that are closely related to mule deer CrERVγ in a conserved region of pol; more distantly related sequences can be identified in the genome of another member of the Cervidae, the muntjac (Muntiacus muntjak). The discovery of a novel transcriptionally active and insertionally polymorphic retrovirus in mammals could provide a useful model system to study the dynamic interaction between the host genome and an invading retrovirus.
Assuntos
Cervos/virologia , Retrovirus Endógenos/fisiologia , Gammaretrovirus/fisiologia , Polimorfismo Genético , Integração Viral , Animais , Cervos/genética , Retrovirus Endógenos/classificação , Retrovirus Endógenos/genética , Retrovirus Endógenos/isolamento & purificação , Gammaretrovirus/classificação , Gammaretrovirus/genética , Gammaretrovirus/isolamento & purificação , Dosagem de Genes , Genoma , Dados de Sequência Molecular , FilogeniaRESUMO
Newcastle Disease Virus (NDV) is a pathogenic strain of avian paramyxovirus (aPMV-1) that is among the most serious of disease threats to the poultry industry worldwide. Viral diversity is high in aPMV-1; eight genotypes are recognized based on phylogenetic reconstruction of gene sequences. Modified live vaccines have been developed to decrease the economic losses caused by this virus. Vaccines derived from avirulent genotype II strains were developed in the 1950s and are in use globally, whereas Australian strains belonging to genotype I were developed as vaccines in the 1970s and are used mainly in Asia. In this study, we evaluated the consequences of attenuated live virus vaccination on the evolution of aPMV-1 genotypes. There was phylogenetic incongruence among trees based on individual genes and complete coding region of 54 full length aPMV-1 genomes, suggesting that recombinant sequences were present in the data set. Subsequently, five recombinant genomes were identified, four of which contained sequences from either genotype I or II. The population history of vaccine-related genotype II strains was distinct from other aPMV-1 genotypes; genotype II emerged in the late 19(th) century and is evolving more slowly than other genotypes, which emerged in the 1960s. Despite vaccination efforts, genotype II viruses have experienced constant population growth to the present. In contrast, other contemporary genotypes showed population declines in the late 1990s. Additionally, genotype I and II viruses, which are circulating in the presence of homotypic vaccine pressure, have unique selection profiles compared to nonvaccine-related strains. Collectively, these data show that vaccination with live attenuated viruses has changed the evolution of aPMV-1 by maintaining a large effective population size of a vaccine-related genotype, allowing for coinfection and recombination of vaccine and wild type strains, and by applying unique selective pressures on viral glycoproteins.
Assuntos
Doença de Newcastle/prevenção & controle , Vírus da Doença de Newcastle/genética , Vacinação , Animais , Sequência de Bases , Evolução Biológica , Genoma Viral , Genótipo , Dados de Sequência Molecular , Filogenia , Dinâmica Populacional , Aves Domésticas , Doenças das Aves Domésticas/prevenção & controle , Vacinação/métodos , Vacinas AtenuadasRESUMO
We report the existence of two distinct sublineages of avian metapneumovirus (MPV) subtype C, a virus which has caused serious economic loss in commercial turkey farms in the United States. This subtype is closely related to human MPV, infects multiple avian species, and is globally distributed. The evolutionary rates of this virus are estimated to be 1.3 x 10(-3) to 7 x 10(-3) substitutions per site per year, and coalescent estimates place its emergence between 1991 and 1996. The four genes examined show a concordant demographic pattern which is characterized by a rapid increase in population size followed by stable population grown until the present.
Assuntos
Evolução Molecular , Metapneumovirus/classificação , Metapneumovirus/genética , Infecções por Paramyxoviridae/veterinária , Doenças das Aves Domésticas/virologia , RNA Viral/genética , Animais , Análise por Conglomerados , Genótipo , Metapneumovirus/isolamento & purificação , Perus , Estados UnidosRESUMO
BACKGROUND: One of the goals of computational immunology is to facilitate the study of infectious diseases. Dynamic modeling is a powerful tool to integrate empirical data from independent sources, make novel predictions, and to foresee the gaps in the current knowledge. Dynamic models constructed to study the interactions between pathogens and hosts' immune responses have revealed key regulatory processes in the infection. OPTIMUM COMPLEXITY AND DYNAMIC MODELING: We discuss the usability of various deterministic dynamic modeling approaches to study the progression of infectious diseases. The complexity of these models is dependent on the number of components and the temporal resolution in the model. We comment on the specific use of simple and complex models in the study of the progression of infectious diseases. CONCLUSIONS: Models of sub-systems or simplified immune response can be used to hypothesize phenomena of host-pathogen interactions and to estimate rates and parameters. Nevertheless, to study the pathogenesis of an infection we need to develop models describing the dynamics of the immune components involved in the progression of the disease. Incorporation of the large number and variety of immune processes involved in pathogenesis requires tradeoffs in modeling.
Assuntos
Imunidade/imunologia , Modelos Imunológicos , Animais , Infecções/etiologia , Infecções/imunologiaRESUMO
A key hurdle in understanding the spread and control of infectious diseases is to capture appropriately the dynamics of pathogen transmission. As people and goods travel increasingly rapidly around the world, so do pathogens; we must be prepared to understand their spread, in terms of the contact network between hosts, viral life history and within-host dynamics. This will require collaborative work that takes into account viral life history, strategy and evolution, and host genetics, demographics and immunodynamics. Mathematical models are a useful tool for integrating the data and analyses from diverse fields that contribute to our understanding of viral transmission dynamics in heterogeneous host populations.
Assuntos
Interações Hospedeiro-Patógeno , Modelos Biológicos , Viroses/transmissão , Vírus/patogenicidade , Animais , Surtos de Doenças , Humanos , Viroses/epidemiologia , Viroses/virologiaRESUMO
BACKGROUND: Large granular lymphocyte (LGL) leukemia is an uncommon cancer characterized by sustained clonal proliferation of LGL cells. Antibodies reactive to retroviruses have been documented in the serum of patients with LGL leukemia. Culture or molecular approaches have to date not been successful in identifying a retrovirus. METHODS: Because a retrovirus must integrate into the genome of an infected cell, we focused our efforts on detecting a novel retrovirus integration site in the clonally expanded LGL cells. We present a new computational tool that uses long-insert mate pair sequence data to search the genome of LGL leukemia cells for retrovirus integration sites. We also utilize recently published methods to interrogate the status of polymorphic human endogenous retrovirus type K (HERV-K) provirus in patient genomes. RESULTS: Our data show that there are no new retrovirus insertions in LGL genomes of LGL leukemia patients. However, our insertion call tool did detect four HERV-K provirus integration sites that are polymorphic in the human population but absent from the human reference genome, hg19. To determine if the prevalence of these or other polymorphic proviral HERV-Ks differed between LGL leukemia patients and the general population, we used a recently developed tool that reports sites in the human genome occupied by a known proviral HERV-K. We report that there are significant differences in the number of polymorphic HERV-Ks in the genomes of LGL leukemia patients of European origin compared to individuals with European ancestry in the 1000 genomes (KGP) data. CONCLUSIONS: Our study confirms that the clonal expansion of LGL cells in LGL leukemia is not driven by the integration of a new infectious or endogenous retrovirus, although we do not rule out that these cells are responding to retroviral antigens produced in other cell types. However, our computational analyses revealed that the genomes of LGL leukemia patients carry a higher burden of polymorphic HERV-K proviruses compare to individuals from KGP of European ancestry. Our research emphasizes the merits of comprehensive genomic assessment of HERV-K in cancer samples and suggests that further analyses to determine contributions of HERV-K to LGL leukemia are warranted.
Assuntos
Genoma Humano/genética , Leucemia Linfocítica Granular Grande/genética , Leucemia Linfocítica Granular Grande/virologia , Provírus/fisiologia , Retroviridae/fisiologia , Integração Viral/genética , HumanosRESUMO
HIV-1 and parasitic infections co-circulate in many populations, and in a few well-studied examples HIV-1 co-infection is known to amplify parasite transmission. There are indications that HIV-1 interacts significantly with many other parasitic infections within individual hosts, but the population-level impacts of co-infection are not well-characterized. Here we consider how alteration of host immune status due to HIV-1 infection may influence the emergence of novel parasite strains. We review clinical and epidemiological evidence from five parasitic diseases (malaria, leishmaniasis, schistosomiasis, trypanosomiasis and strongyloidiasis) with emphasis on how HIV-1 co-infection alters individual susceptibility and infectiousness for the parasites. We then introduce a simple modelling framework that allows us to project how these individual-level properties might influence population-level dynamics. We find that HIV-1 can facilitate invasion by parasite strains in many circumstances and we identify threshold values of HIV-1 prevalence that allow otherwise unsustainable parasite strains to invade successfully. Definitive evidence to test these predicted effects is largely lacking, and we conclude by discussing challenges in interpreting available data and priorities for future studies.
Assuntos
Infecções por HIV/epidemiologia , HIV-1 , Interações Hospedeiro-Parasita , Doenças Parasitárias/epidemiologia , Animais , Comorbidade , Feminino , Saúde Global , Infecções por HIV/transmissão , Humanos , Hospedeiro Imunocomprometido , Leishmaniose/epidemiologia , Leishmaniose/transmissão , Malária/epidemiologia , Malária/transmissão , Masculino , Matemática , Doenças Parasitárias/transmissão , Dinâmica Populacional , Prevalência , Esquistossomose/epidemiologia , Esquistossomose/transmissão , Tripanossomíase/epidemiologia , Tripanossomíase/transmissão , VirulênciaRESUMO
Feline immunodeficiency virus (FIV) is a lentivirus that has been identified in many members of the family Felidae but domestic cats are the only FIV host in which infection results in disease. We studied FIVpco infection of cougars (Puma concolor) as a model for asymptomatic lentivirus infections to understand the mechanisms of host-virus coexistence. Several natural cougar populations were evaluated to determine if there are any consequences of FIVpco infection on cougar fecundity, survival, or susceptibility to other infections. We have sequenced full-length viral genomes and conducted a detailed analysis of viral molecular evolution on these sequences and on genome fragments of serially sampled animals to determine the evolutionary forces experienced by this virus in cougars. In addition, we have evaluated the molecular genetics of FIVpco in a new host, domestic cats, to determine the evolutionary consequences to a host-adapted virus associated with cross-species infection. Our results indicate that there are no significant differences in survival, fecundity or susceptibility to other infections between FIVpco-infected and uninfected cougars. The molecular evolution of FIVpco is characterized by a slower evolutionary rate and an absence of positive selection, but also by proviral and plasma viral loads comparable to those of epidemic lentiviruses such as HIV-1 or FIVfca. Evolutionary and recombination rates and selection profiles change significantly when FIVpco replicates in a new host.
Assuntos
Evolução Molecular , Vírus da Imunodeficiência Felina/genética , Puma/virologia , Animais , Gatos , Feminino , Variação Genética , Genoma Viral , Masculino , Carga ViralRESUMO
We propose a random forest classifier for detecting rare variants from sequencing errors in Next Generation Sequencing (NGS) data from viral populations. The method utilizes counts of varying length of k-mers from the reads of a viral population to train a Random forest classifier, called MultiRes, that classifies k-mers as erroneous or rare variants. Our algorithm is rooted in concepts from signal processing and uses a frame-based representation of k-mers. Frames are sets of non-orthogonal basis functions that were traditionally used in signal processing for noise removal. We define discrete spatial signals for genomes and sequenced reads, and show that k-mers of a given size constitute a frame. We evaluate MultiRes on simulated and real viral population datasets, which consist of many low frequency variants, and compare it to the error detection methods used in correction tools known in the literature. MultiRes has 4 to 500 times less false positives k-mer predictions compared to other methods, essential for accurate estimation of viral population diversity and their de-novo assembly. It has high recall of the true k-mers, comparable to other error correction methods. MultiRes also has greater than 95% recall for detecting single nucleotide polymorphisms (SNPs) and fewer false positive SNPs, while detecting higher number of rare variants compared to other variant calling methods for viral populations. The software is available freely from the GitHub link https://github.com/raunaq-m/MultiRes.
RESUMO
Serological and genetic material collected over 15 years (1990-2004) from 207 cougars (Puma concolor) in four populations in the Rocky Mountains were examined for evidence of current or prior exposure to feline immunodeficiency virus (FIV), feline parvovirus (FPV), feline coronavirus (FCoV), feline calicivirus (FCV), canine distemper virus (CDV), feline herpesvirus (FHV), and Yersinia pestis. Serologic data were analyzed for annual variation in seroconversions to assess whether these pathogens are epidemic or endemic in cougars, and to determine whether family membership, age, sex, or location influence risk of exposure. FIV and FPV were clearly endemic in the studied populations, whereas exposure to FCoV, FCV, CDV, and Y. pestis was more sporadic. No evidence was found for FHV. Age was the most consistent predictor of increased exposure risk, often with no other important factors emerging. Evidence for transmission within family groups was limited to FIV and FCoV, whereas some indication for host sex affecting exposure probability was found for FIV and Y. pestis. Overall, cougar populations exhibited few differences in terms of pathogen presence and prevalence, suggesting the presence of similar risk factors throughout the study region.
Assuntos
Anticorpos Antivirais/sangue , Peste/veterinária , Puma/microbiologia , Puma/virologia , Viroses/veterinária , Fatores Etários , Animais , Animais Selvagens/microbiologia , Animais Selvagens/virologia , Anticorpos Antibacterianos/sangue , Calicivirus Felino/imunologia , Coronavirus Felino/imunologia , Vírus da Cinomose Canina/imunologia , Feminino , Herpesviridae/imunologia , Vírus da Imunodeficiência Felina/imunologia , Masculino , Parvovirus/imunologia , Peste/epidemiologia , Peste/transmissão , Fatores de Risco , Estudos Soroepidemiológicos , Fatores Sexuais , Estados Unidos/epidemiologia , Viroses/epidemiologia , Viroses/transmissão , Yersinia pestis/imunologiaRESUMO
Neonatal sepsis (NS) is responsible for over 1 million yearly deaths worldwide. In the developing world, NS is often treated without an identified microbial pathogen. Amplicon sequencing of the bacterial 16S rRNA gene can be used to identify organisms that are difficult to detect by routine microbiological methods. However, contaminating bacteria are ubiquitous in both hospital settings and research reagents and must be accounted for to make effective use of these data. In this study, we sequenced the bacterial 16S rRNA gene obtained from blood and cerebrospinal fluid (CSF) of 80 neonates presenting with NS to the Mbarara Regional Hospital in Uganda. Assuming that patterns of background contamination would be independent of pathogenic microorganism DNA, we applied a novel quantitative approach using principal orthogonal decomposition to separate background contamination from potential pathogens in sequencing data. We designed our quantitative approach contrasting blood, CSF, and control specimens and employed a variety of statistical random matrix bootstrap hypotheses to estimate statistical significance. These analyses demonstrate that Leptospira appears present in some infants presenting within 48 h of birth, indicative of infection in utero, and up to 28 days of age, suggesting environmental exposure. This organism cannot be cultured in routine bacteriological settings and is enzootic in the cattle that often live in close proximity to the rural peoples of western Uganda. Our findings demonstrate that statistical approaches to remove background organisms common in 16S sequence data can reveal putative pathogens in small volume biological samples from newborns. This computational analysis thus reveals an important medical finding that has the potential to alter therapy and prevention efforts in a critically ill population.
RESUMO
Multihost parasites have been implicated in the emergence of new diseases in humans and wildlife, yet little is known about factors that influence the host range of parasites in natural populations. We used a comprehensive data set of 415 micro- and macroparasites reported from 119 wild primate hosts to investigate broad patterns of host specificity. The majority (68%) of primate parasites were reported to infect multiple host species, including animals from multiple families or orders. This pattern corresponds to previous studies of parasites found in humans and domesticated animals. Within three parasite groups (viruses, protozoans and helminths), we examined parasite taxonomy and transmission strategy in relation to measures of host specificity. Relative to other parasite groups, helminths were associated with the greatest levels of host specificity, whereas most viruses were reported to infect hosts from multiple families or orders. Highly significant associations between the degree of host specificity and transmission strategy arose within each parasite group, but not always in the same direction, suggesting that unique constraints influence the host range of parasites within each taxonomic group. Finally characteristics of over 100 parasite species shared between wild primates and humans, including those recognised as emerging in humans, revealed that most of these shared parasites were reported from multiple host orders. Furthermore, nearly all viruses that were reported to infect both humans and non-human primates were classified as emerging in humans.
Assuntos
Primatas/parasitologia , Zoonoses/transmissão , Animais , Animais Domésticos/parasitologia , Animais Selvagens/parasitologia , Eucariotos/fisiologia , Helmintíase/transmissão , Helmintos/fisiologia , Interações Hospedeiro-Parasita , Humanos , Filogenia , Infecções por Protozoários/transmissão , Especificidade da Espécie , Viroses/transmissão , Fenômenos Fisiológicos Virais , Zoonoses/parasitologiaRESUMO
Endogenous retroviruses (ERVs) were acquired during evolution of their host organisms after infection and mendelian inheritance in the germline by their exogenous counterparts. The ERVs can spread in the host genome and in some cases they affect the host phenotype. The cervid endogenous gammaretrovirus (CrERV) is one of only a few well-defined examples of evolutionarily recent invasion of mammalian genome by retroviruses. Thousands of insertionally polymorphic CrERV integration sites have been detected in wild ranging mule deer (Odocoileus hemionus) host populations. Here, we describe for the first time induction of replication competent CrERV by cocultivation of deer and human cells. We characterize the physical properties and tropism of the induced virus. The genomic sequence of the induced virus is phylogenetically related to the evolutionarily young endogenous CrERVs described so far. We also describe the level of replication block of CrERV on deer cells and its capacity to establish superinfection interference.