Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 11 de 11
Filtrar
1.
Nucleic Acids Res ; 45(D1): D482-D490, 2017 01 04.
Artigo em Inglês | MEDLINE | ID: mdl-27899678

RESUMO

The Virus Variation Resource is a value-added viral sequence data resource hosted by the National Center for Biotechnology Information. The resource is located at http://www.ncbi.nlm.nih.gov/genome/viruses/variation/ and includes modules for seven viral groups: influenza virus, Dengue virus, West Nile virus, Ebolavirus, MERS coronavirus, Rotavirus A and Zika virus Each module is supported by pipelines that scan newly released GenBank records, annotate genes and proteins and parse sample descriptors and then map them to controlled vocabulary. These processes in turn support a purpose-built search interface where users can select sequences based on standardized gene, protein and metadata terms. Once sequences are selected, a suite of tools for downloading data, multi-sequence alignment and tree building supports a variety of user directed activities. This manuscript describes a series of features and functionalities recently added to the Virus Variation Resource.


Assuntos
Biologia Computacional/métodos , Surtos de Doenças , Variação Genética , Software , Viroses/epidemiologia , Viroses/virologia , Vírus/genética , Bases de Dados Genéticas
2.
Nucleic Acids Res ; 44(D1): D733-45, 2016 Jan 04.
Artigo em Inglês | MEDLINE | ID: mdl-26553804

RESUMO

The RefSeq project at the National Center for Biotechnology Information (NCBI) maintains and curates a publicly available database of annotated genomic, transcript, and protein sequence records (http://www.ncbi.nlm.nih.gov/refseq/). The RefSeq project leverages the data submitted to the International Nucleotide Sequence Database Collaboration (INSDC) against a combination of computation, manual curation, and collaboration to produce a standard set of stable, non-redundant reference sequences. The RefSeq project augments these reference sequences with current knowledge including publications, functional features and informative nomenclature. The database currently represents sequences from more than 55,000 organisms (>4800 viruses, >40,000 prokaryotes and >10,000 eukaryotes; RefSeq release 71), ranging from a single record to complete genomes. This paper summarizes the current status of the viral, prokaryotic, and eukaryotic branches of the RefSeq project, reports on improvements to data access and details efforts to further expand the taxonomic representation of the collection. We also highlight diverse functional curation initiatives that support multiple uses of RefSeq data including taxonomic validation, genome annotation, comparative genomics, and clinical testing. We summarize our approach to utilizing available RNA-Seq and other data types in our manual curation process for vertebrate, plant, and other species, and describe a new direction for prokaryotic genomes and protein name management.


Assuntos
Bases de Dados Genéticas , Genômica , Animais , Bovinos , Perfilação da Expressão Gênica , Genoma Fúngico , Genoma Humano , Genoma Microbiano , Genoma de Planta , Genoma Viral , Genômica/normas , Humanos , Invertebrados/genética , Camundongos , Anotação de Sequência Molecular , Nematoides/genética , Filogenia , RNA Longo não Codificante/genética , Ratos , Padrões de Referência , Análise de Sequência de Proteína , Análise de Sequência de RNA , Vertebrados/genética
3.
Nucleic Acids Res ; 43(Database issue): D571-7, 2015 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-25428358

RESUMO

Recent technological innovations have ignited an explosion in virus genome sequencing that promises to fundamentally alter our understanding of viral biology and profoundly impact public health policy. Yet, any potential benefits from the billowing cloud of next generation sequence data hinge upon well implemented reference resources that facilitate the identification of sequences, aid in the assembly of sequence reads and provide reference annotation sources. The NCBI Viral Genomes Resource is a reference resource designed to bring order to this sequence shockwave and improve usability of viral sequence data. The resource can be accessed at http://www.ncbi.nlm.nih.gov/genome/viruses/ and catalogs all publicly available virus genome sequences and curates reference genome sequences. As the number of genome sequences has grown, so too have the difficulties in annotating and maintaining reference sequences. The rapid expansion of the viral sequence universe has forced a recalibration of the data model to better provide extant sequence representation and enhanced reference sequence products to serve the needs of the various viral communities. This, in turn, has placed increased emphasis on leveraging the knowledge of individual scientific communities to identify important viral sequences and develop well annotated reference virus genome sets.


Assuntos
Bases de Dados de Ácidos Nucleicos , Genoma Viral , Sequenciamento de Nucleotídeos em Larga Escala , Internet , Anotação de Sequência Molecular , Software , Vírus/classificação
4.
J Gen Virol ; 91(Pt 1): 74-86, 2010 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-19759238

RESUMO

Viral particles in stool samples from wild-living chimpanzees were analysed using random PCR amplification and sequencing. Sequences encoding proteins distantly related to the replicase protein of single-stranded circular DNA viruses were identified. Inverse PCR was used to amplify and sequence multiple small circular DNA viral genomes. The viral genomes were related in size and genome organization to vertebrate circoviruses and plant geminiviruses but with a different location for the stem-loop structure involved in rolling circle DNA replication. The replicase genes of these viruses were most closely related to those of the much smaller (approximately 1 kb) plant nanovirus circular DNA chromosomes. Because the viruses have characteristics of both animal and plant viruses, we named them chimpanzee stool-associated circular viruses (ChiSCV). Further metagenomic studies of animal samples will greatly increase our knowledge of viral diversity and evolution.


Assuntos
Animais Selvagens/virologia , Infecções por Vírus de DNA/veterinária , Vírus de DNA/isolamento & purificação , DNA Circular/genética , DNA Viral/genética , Fezes/virologia , Pan troglodytes/virologia , Sequência de Aminoácidos , Animais , Circovirus/genética , Infecções por Vírus de DNA/virologia , Vírus de DNA/genética , Geminiviridae/genética , Genes Virais , Modelos Moleculares , Dados de Sequência Molecular , Nanovirus/genética , Conformação de Ácido Nucleico , Filogenia , Reação em Cadeia da Polimerase/métodos , Alinhamento de Sequência , Análise de Sequência de DNA , Homologia de Sequência
5.
J Virol ; 83(9): 4642-51, 2009 May.
Artigo em Inglês | MEDLINE | ID: mdl-19211756

RESUMO

We analyzed viral nucleic acids in stool samples collected from 35 South Asian children with nonpolio acute flaccid paralysis (AFP). Sequence-independent reverse transcription and PCR amplification of capsid-protected, nuclease-resistant viral nucleic acids were followed by DNA sequencing and sequence similarity searches. Limited Sanger sequencing (35 to 240 subclones per sample) identified an average of 1.4 distinct eukaryotic viruses per sample, while pyrosequencing yielded 2.6 viruses per sample. In addition to bacteriophage and plant viruses, we detected known enteric viruses, including rotavirus, adenovirus, picobirnavirus, and human enterovirus species A (HEV-A) to HEV-C, as well as numerous other members of the Picornaviridae family, including parechovirus, Aichi virus, rhinovirus, and human cardiovirus. The viruses with the most divergent sequences relative to those of previously reported viruses included members of a novel Picornaviridae genus and four new viral species (members of the Dicistroviridae, Nodaviridae, and Circoviridae families and the Bocavirus genus). Samples from six healthy contacts of AFP patients were similarly analyzed and also contained numerous viruses, particularly HEV-C, including a potentially novel Enterovirus genotype. Determining the prevalences and pathogenicities of the novel genotypes, species, genera, and potential new viral families identified in this study in different demographic groups will require further studies with different demographic and patient groups, now facilitated by knowledge of these viral genomes.


Assuntos
Fezes/virologia , Genoma Viral/genética , Neurossífilis/virologia , Doença Aguda , Adolescente , Ásia/epidemiologia , Estudos de Casos e Controles , Criança , Pré-Escolar , Enterovirus/classificação , Enterovirus/genética , Infecções por Enterovirus/epidemiologia , Infecções por Enterovirus/virologia , Feminino , Saúde , Humanos , Lactente , Masculino , Neurossífilis/sangue , Neurossífilis/epidemiologia , Filogenia , Análise de Sequência de DNA
6.
J Virol ; 83(9): 4631-41, 2009 May.
Artigo em Inglês | MEDLINE | ID: mdl-19193786

RESUMO

Cardioviruses cause enteric infections in mice and rats which when disseminated have been associated with myocarditis, type 1 diabetes, encephalitis, and multiple sclerosis-like symptoms. Cardioviruses have also been detected at lower frequencies in other mammals. The Cardiovirus genus within the Picornaviridae family is currently made up of two viral species, Theilovirus and Encephalomyocarditis virus. Until recently, only a single strain of cardioviruses (Vilyuisk virus within the Theilovirus species) associated with a geographically restricted and prevalent encephalitis-like condition had been reported to occur in humans. A second theilovirus-related cardiovirus (Saffold virus [SAFV]) was reported in 2007 and subsequently found in respiratory secretions from children with respiratory problems and in stools of both healthy and diarrheic children. Using viral metagenomics, we identified RNA fragments related to SAFV in the stools of Pakistani and Afghani children with nonpolio acute flaccid paralysis (AFP). We sequenced three near-full-length genomes, showing the presence of divergent strains of SAFV and preliminary evidence of a distant recombination event between the ancestors of the Theiler-like viruses of rats and those of human SAFV. Further VP1 sequencing showed the presence of five new SAFV genotypes, doubling the reported genetic diversity of human and animal theiloviruses combined. Both AFP patients and healthy children in Pakistan were found to be excreting SAFV at high frequencies of 9 and 12%, respectively. Further studies are needed to examine the roles of these highly common and diverse SAFV genotypes in nonpolio AFP and other human diseases.


Assuntos
Infecções por Cardiovirus/epidemiologia , Infecções por Cardiovirus/virologia , Cardiovirus/genética , Cardiovirus/isolamento & purificação , Variação Genética/genética , Enteropatias/epidemiologia , Enteropatias/virologia , Doença Aguda , Sequência de Aminoácidos , Animais , Ásia/epidemiologia , Proteínas do Capsídeo/química , Proteínas do Capsídeo/classificação , Proteínas do Capsídeo/genética , Proteínas do Capsídeo/metabolismo , Cardiovirus/classificação , Cardiovirus/metabolismo , Estudos de Casos e Controles , Pré-Escolar , Genoma Viral/genética , Genótipo , Saúde , Humanos , Dados de Sequência Molecular , Hipotonia Muscular/virologia , Filogenia , Recombinação Genética/genética , Alinhamento de Sequência , Análise de Sequência , Homologia de Sequência de Aminoácidos
7.
J Virol ; 83(22): 12002-6, 2009 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-19759142

RESUMO

A novel picornavirus genome was sequenced, showing 42.6%, 35.2%, and 44.6% of deduced amino acid identities corresponding to the P1, P2, and P3 regions, respectively, of the Aichi virus. Divergent strains of this new virus, which we named salivirus, were detected in 18 stool samples from Nigeria, Tunisia, Nepal, and the United States. A statistical association was seen between virus shedding and unexplained cases of gastroenteritis in Nepal (P = 0.0056). Viruses with approximately 90% nucleotide similarity, named klassevirus, were also recently reported in three cases of unexplained diarrhea from the United States and Australia and in sewage from Spain, reflecting a global distribution and supporting a pathogenic role for this new group of picornaviruses.


Assuntos
Gastroenterite/virologia , Infecções por Picornaviridae/virologia , Picornaviridae/genética , Sequência de Aminoácidos , Sequência de Bases , Genoma Viral/genética , Humanos , Dados de Sequência Molecular , Filogenia , Reação em Cadeia da Polimerase Via Transcriptase Reversa , Proteínas Virais/genética
8.
PLoS One ; 13(10): e0202513, 2018.
Artigo em Inglês | MEDLINE | ID: mdl-30339683

RESUMO

Overlapping genes represent a fascinating evolutionary puzzle, since they encode two functionally unrelated proteins from the same DNA sequence. They originate by a mechanism of overprinting, in which point mutations in an existing frame allow the expression (the "birth") of a completely new protein from a second frame. In viruses, in which overlapping genes are abundant, these new proteins often play a critical role in infection, yet they are frequently overlooked during genome annotation. This results in erroneous interpretation of mutational studies and in a significant waste of resources. Therefore, overlapping genes need to be correctly detected, especially since they are now thought to be abundant also in eukaryotes. Developing better detection methods and conducting systematic evolutionary studies require a large, reliable benchmark dataset of known cases. We thus assembled a high-quality dataset of 80 viral overlapping genes whose expression is experimentally proven. Many of them were not present in databases. We found that overall, overlapping genes differ significantly from non-overlapping genes in their nucleotide and amino acid composition. In particular, the proteins they encode are enriched in high-degeneracy amino acids and depleted in low-degeneracy ones, which may alleviate the evolutionary constraints acting on overlapping genes. Principal component analysis revealed that the vast majority of overlapping genes follow a similar composition bias, despite their heterogeneity in length and function. Six proven mammalian overlapping genes also followed this bias. We propose that this apparently near-universal composition bias may either favour the birth of overlapping genes, or/and result from selection pressure acting on them.


Assuntos
Evolução Molecular , Homologia de Genes/genética , Proteínas/genética , Sequência de Aminoácidos/genética , Animais , Genes Virais/genética , Mamíferos/genética , Mutação , Fases de Leitura Aberta/genética , Análise de Componente Principal
9.
Viruses ; 6(11): 4760-99, 2014 Nov 24.
Artigo em Inglês | MEDLINE | ID: mdl-25421896

RESUMO

In 2014, Ebola virus (EBOV) was identified as the etiological agent of a large and still expanding outbreak of Ebola virus disease (EVD) in West Africa and a much more confined EVD outbreak in Middle Africa. Epidemiological and evolutionary analyses confirmed that all cases of both outbreaks are connected to a single introduction each of EBOV into human populations and that both outbreaks are not directly connected. Coding-complete genomic sequence analyses of isolates revealed that the two outbreaks were caused by two novel EBOV variants, and initial clinical observations suggest that neither of them should be considered strains. Here we present consensus decisions on naming for both variants (West Africa: "Makona", Middle Africa: "Lomela") and provide database-compatible full, shortened, and abbreviated names that are in line with recently established filovirus sub-species nomenclatures.


Assuntos
Ebolavirus/classificação , Doença pelo Vírus Ebola/virologia , Terminologia como Assunto , República Democrática do Congo/epidemiologia , Surtos de Doenças , Ebolavirus/genética , Ebolavirus/isolamento & purificação , Guiné/epidemiologia , Doença pelo Vírus Ebola/epidemiologia , Humanos , Filogenia , RNA Viral/genética , Análise de Sequência de DNA
10.
Viruses ; 6(9): 3663-82, 2014 Sep 26.
Artigo em Inglês | MEDLINE | ID: mdl-25256396

RESUMO

Sequence determination of complete or coding-complete genomes of viruses is becoming common practice for supporting the work of epidemiologists, ecologists, virologists, and taxonomists. Sequencing duration and costs are rapidly decreasing, sequencing hardware is under modification for use by non-experts, and software is constantly being improved to simplify sequence data management and analysis. Thus, analysis of virus disease outbreaks on the molecular level is now feasible, including characterization of the evolution of individual virus populations in single patients over time. The increasing accumulation of sequencing data creates a management problem for the curators of commonly used sequence databases and an entry retrieval problem for end users. Therefore, utilizing the data to their fullest potential will require setting nomenclature and annotation standards for virus isolates and associated genomic sequences. The National Center for Biotechnology Information's (NCBI's) RefSeq is a non-redundant, curated database for reference (or type) nucleotide sequence records that supplies source data to numerous other databases. Building on recently proposed templates for filovirus variant naming [ ()////-], we report consensus decisions from a majority of past and currently active filovirus experts on the eight filovirus type variants and isolates to be represented in RefSeq, their final designations, and their associated sequences.


Assuntos
Bases de Dados de Ácidos Nucleicos , Filoviridae/genética , Evolução Molecular , Filoviridae/classificação , Humanos , Seleção Genética
11.
Virus Res ; 160(1-2): 256-63, 2011 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-21762736

RESUMO

Viruses are most frequently discovered because they cause disease in organisms of importance to humans. To expand knowledge of plant-associated viruses beyond these narrow constraints, non-cultivated plants of the Tallgrass Prairie Preserve, Osage County, Oklahoma, USA were systematically surveyed for evidence of the presence of viruses. This report discusses viruses of the family Tombusviridae putatively identified by the survey. Evidence of two carmoviruses, a tombusvirus, a panicovirus and an unclassifiable tombusvirid was found. The complete genome sequence was obtained for putative TGP carmovirus 1 from the legume Lespedeza procumbens, and the virus was detected in several other plant species including the fern Pellaea atropurpurea. Phylogenetic analysis of the sequence and partial sequence of a related virus supported strongly the placement of these viruses in the genus Carmovirus. Polymorphisms in the sequences suggested existence of two populations of TGP carmovirus 1 in the study area and year-to-year variations in infection by TGP carmovirus 3.


Assuntos
Doenças das Plantas/virologia , Tombusviridae/classificação , Tombusviridae/isolamento & purificação , Análise por Conglomerados , Lespedeza/virologia , Modelos Moleculares , Dados de Sequência Molecular , Conformação de Ácido Nucleico , Oklahoma , Filogenia , Pteridaceae/virologia , RNA Viral/genética , Análise de Sequência de DNA , Tombusviridae/genética
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA