Pesquisa | Prevenção e Controle de Câncer

1.

Author Correction: A general approach for detecting expressed mutations in AML cells using single cell RNA-sequencing.

Petti, Allegra A; Williams, Stephen R; Miller, Christopher A; Fiddes, Ian T; Srivatsan, Sridhar N; Chen, David Y; Fronick, Catrina C; Fulton, Robert S; Church, Deanna M; Ley, Timothy J.

Nat Commun ; 13(1): 4216, 2022 Jul 21.

Artigo em Inglês | MEDLINE | ID: mdl-35864110

2.

A general approach for detecting expressed mutations in AML cells using single cell RNA-sequencing.

Petti, Allegra A; Williams, Stephen R; Miller, Christopher A; Fiddes, Ian T; Srivatsan, Sridhar N; Chen, David Y; Fronick, Catrina C; Fulton, Robert S; Church, Deanna M; Ley, Timothy J.

Nat Commun ; 10(1): 3660, 2019 08 14.

Artigo em Inglês | MEDLINE | ID: mdl-31413257

RESUMO

Virtually all tumors are genetically heterogeneous, containing mutationally-defined subclonal cell populations that often have distinct phenotypes. Single-cell RNA-sequencing has revealed that a variety of tumors are also transcriptionally heterogeneous, but the relationship between expression heterogeneity and subclonal architecture is unclear. Here, we address this question in the context of Acute Myeloid Leukemia (AML) by integrating whole genome sequencing with single-cell RNA-sequencing (using the 10x Genomics Chromium Single Cell 5' Gene Expression workflow). Applying this approach to five cryopreserved AML samples, we identify hundreds to thousands of cells containing tumor-specific mutations in each case, and use the results to distinguish AML cells (including normal-karyotype AML cells) from normal cells, identify expression signatures associated with subclonal mutations, and find cell surface markers that could be used to purify subclones for further study. This integrative approach for connecting genotype to phenotype is broadly applicable to any sample that is phenotypically and genetically heterogeneous.

Assuntos

Leucemia Mieloide Aguda/genética , RNA/genética , Análise de Sequência de RNA/métodos , Análise de Célula Única/métodos , Perfilação da Expressão Gênica/métodos , Genômica , Genótipo , Humanos , Mutação , Fenótipo , Polimorfismo de Nucleotídeo Único , Sequenciamento Completo do Genoma

3.

A variant by any name: quantifying annotation discordance across tools and clinical databases.

Yen, Jennifer L; Garcia, Sarah; Montana, Aldrin; Harris, Jason; Chervitz, Stephen; Morra, Massimo; West, John; Chen, Richard; Church, Deanna M.

Genome Med ; 9(1): 7, 2017 01 26.

Artigo em Inglês | MEDLINE | ID: mdl-28122645

RESUMO

BACKGROUND: Clinical genomic testing is dependent on the robust identification and reporting of variant-level information in relation to disease. With the shift to high-throughput sequencing, a major challenge for clinical diagnostics is the cross-identification of variants called on their genomic position to resources that rely on transcript- or protein-based descriptions. METHODS: We evaluated the accuracy of three tools (SnpEff, Variant Effect Predictor, and Variation Reporter) that generate transcript and protein-based variant nomenclature from genomic coordinates according to guidelines by the Human Genome Variation Society (HGVS). Our evaluation was based on transcript-controlled comparisons to a manually curated set of 126 test variants of various types drawn from data sources, each with HGVS-compliant transcript and protein descriptors. We further evaluated the concordance between annotations generated by Snpeff and Variant Effect Predictor and those in major germline and cancer databases: ClinVar and COSMIC, respectively. RESULTS: We find that there is substantial discordance between the annotation tools and databases in the description of insertions and/or deletions. Using our ground truth set of variants, constructed specifically to identify challenging events, accuracy was between 80 and 90% for coding and 50 and 70% for protein changes for 114 to 126 variants. Exact concordance for SNV syntax was over 99.5% between ClinVar and Variant Effect Predictor and SnpEff, but less than 90% for non-SNV variants. For COSMIC, exact concordance for coding and protein SNVs was between 65 and 88% and less than 15% for insertions. Across the tools and datasets, there was a wide range of different but equivalent expressions describing protein variants. CONCLUSIONS: Our results reveal significant inconsistency in variant representation across tools and databases. While some of these syntax differences may be clear to a clinician, they can confound variant matching, an important step in variant classification. These results highlight the urgent need for the adoption and adherence to uniform standards in variant annotation, with consistent reporting on the genomic reference, to enable accurate and efficient data-driven clinical care.

Assuntos

Confiabilidade dos Dados , Variação Genética , Genoma Humano , Anotação de Sequência Molecular/normas , Software/normas , Biologia Computacional/normas , Bases de Dados Genéticas , Humanos , Mutação INDEL

4.

Single haplotype assembly of the human genome from a hydatidiform mole.

Steinberg, Karyn Meltz; Schneider, Valerie A; Graves-Lindsay, Tina A; Fulton, Robert S; Agarwala, Richa; Huddleston, John; Shiryev, Sergey A; Morgulis, Aleksandr; Surti, Urvashi; Warren, Wesley C; Church, Deanna M; Eichler, Evan E; Wilson, Richard K.

Genome Res ; 24(12): 2066-76, 2014 12.

Artigo em Inglês | MEDLINE | ID: mdl-25373144

RESUMO

A complete reference assembly is essential for accurately interpreting individual genomes and associating variation with phenotypes. While the current human reference genome sequence is of very high quality, gaps and misassemblies remain due to biological and technical complexities. Large repetitive sequences and complex allelic diversity are the two main drivers of assembly error. Although increasing the length of sequence reads and library fragments can improve assembly, even the longest available reads do not resolve all regions. In order to overcome the issue of allelic diversity, we used genomic DNA from an essentially haploid hydatidiform mole, CHM1. We utilized several resources from this DNA including a set of end-sequenced and indexed BAC clones and 100× Illumina whole-genome shotgun (WGS) sequence coverage. We used the WGS sequence and the GRCh37 reference assembly to create an assembly of the CHM1 genome. We subsequently incorporated 382 finished BAC clone sequences to generate a draft assembly, CHM1_1.1 (NCBI AssemblyDB GCA_000306695.2). Analysis of gene, repetitive element, and segmental duplication content show this assembly to be of excellent quality and contiguity. However, comparison to assembly-independent resources, such as BAC clone end sequences and PacBio long reads, indicate misassembled regions. Most of these regions are enriched for structural variation and segmental duplication, and can be resolved in the future. This publicly available assembly will be integrated into the Genome Reference Consortium curation framework for further improvement, with the ultimate goal being a completely finished gap-free assembly.

Assuntos

Genoma Humano , Haplótipos , Mola Hidatiforme/genética , Alelos , Mapeamento Cromossômico , Cromossomos Artificiais Bacterianos , Biologia Computacional/métodos , Feminino , Genômica/métodos , Heterozigoto , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Polimorfismo de Nucleotídeo Único , Gravidez , Sequências Repetitivas de Ácido Nucleico , Duplicações Segmentares Genômicas , Análise de Sequência de DNA

5.

DbVar and DGVa: public archives for genomic structural variation.

Lappalainen, Ilkka; Lopez, John; Skipper, Lisa; Hefferon, Timothy; Spalding, J Dylan; Garner, John; Chen, Chao; Maguire, Michael; Corbett, Matt; Zhou, George; Paschall, Justin; Ananiev, Victor; Flicek, Paul; Church, Deanna M.

Nucleic Acids Res ; 41(Database issue): D936-41, 2013 Jan.

Artigo em Inglês | MEDLINE | ID: mdl-23193291

RESUMO

Much has changed in the last two years at DGVa (http://www.ebi.ac.uk/dgva) and dbVar (http://www.ncbi.nlm.nih.gov/dbvar). We are now processing direct submissions rather than only curating data from the literature and our joint study catalog includes data from over 100 studies in 11 organisms. Studies from human dominate with data from control and case populations, tumor samples as well as three large curated studies derived from multiple sources. During the processing of these data, we have made improvements to our data model, submission process and data representation. Additionally, we have made significant improvements in providing access to these data via web and FTP interfaces.

Assuntos

Bases de Dados de Ácidos Nucleicos , Variação Estrutural do Genoma , Genótipo , Humanos , Internet , Fenótipo

6.

Online resources for genomic structural variation.

Sneddon, Tam P; Church, Deanna M.

Methods Mol Biol ; 838: 273-89, 2012.

Artigo em Inglês | MEDLINE | ID: mdl-22228017

RESUMO

Genomic structural variation (SV) can be thought of on a continuum from a single base pair insertion/deletion (INDEL) to large megabase-scale rearrangements involving insertions, deletions, duplications, inversions, or translocations of whole chromosomes or chromosome arms. These variants can occur in coding or noncoding DNA, they can be inherited or arise sporadically in the germline or somatic cells. Many of these events are segregating in the population and can be considered common alleles while others are new alleles and thus rare events. All species studied to date harbor structural variants and these may be benign, contributing to phenotypes such as sensory perception and immunity, or pathogenic resulting in genomic disorders including DiGeorge/velocardiofacial, Smith-Margenis, Williams-Beuren, and Prader-Willi syndromes. As structural variants are identified, validated, and their significance, origin, and prevalence are elucidated, it is of critical importance that these data be collected and collated in a way that can be easily accessed and analyzed. This chapter describes current structural variation online resources (see Fig. 1 and Table 1), highlights the challenges in capturing, storing, and displaying SV data, and discusses how dbVar and DGVa, the genomic structural variation databases developed at NCBI and EBI, respectively, were designed to address these issues.

Assuntos

Variações do Número de Cópias de DNA , Genoma Humano , Internet , Transtorno Autístico/genética , Pareamento de Bases , Aberrações Cromossômicas , Inversão Cromossômica , Bases de Dados Genéticas , Deleção de Genes , Dosagem de Genes , Duplicação Gênica , Variação Genética , Genômica/métodos , Humanos , Neoplasias/genética , Fenótipo , Alinhamento de Sequência

7.

Database resources of the National Center for Biotechnology Information.

Sayers, Eric W; Barrett, Tanya; Benson, Dennis A; Bolton, Evan; Bryant, Stephen H; Canese, Kathi; Chetvernin, Vyacheslav; Church, Deanna M; DiCuccio, Michael; Federhen, Scott; Feolo, Michael; Fingerman, Ian M; Geer, Lewis Y; Helmberg, Wolfgang; Kapustin, Yuri; Landsman, David; Lipman, David J; Lu, Zhiyong; Madden, Thomas L; Madej, Tom; Maglott, Donna R; Marchler-Bauer, Aron; Miller, Vadim; Mizrachi, Ilene; Ostell, James; Panchenko, Anna; Phan, Lon; Pruitt, Kim D; Schuler, Gregory D; Sequeira, Edwin; Sherry, Stephen T; Shumway, Martin; Sirotkin, Karl; Slotta, Douglas; Souvorov, Alexandre; Starchenko, Grigory; Tatusova, Tatiana A; Wagner, Lukas; Wang, Yanli; Wilbur, W John; Yaschenko, Eugene; Ye, Jian.

Nucleic Acids Res ; 39(Database issue): D38-51, 2011 Jan.

Artigo em Inglês | MEDLINE | ID: mdl-21097890

RESUMO

In addition to maintaining the GenBank® nucleic acid sequence database, the National Center for Biotechnology Information (NCBI) provides analysis and retrieval resources for the data in GenBank and other biological data made available through the NCBI Web site. NCBI resources include Entrez, the Entrez Programming Utilities, MyNCBI, PubMed, PubMed Central (PMC), Entrez Gene, the NCBI Taxonomy Browser, BLAST, BLAST Link (BLink), Primer-BLAST, COBALT, Electronic PCR, OrfFinder, Splign, ProSplign, RefSeq, UniGene, HomoloGene, ProtEST, dbMHC, dbSNP, dbVar, Epigenomics, Cancer Chromosomes, Entrez Genomes and related tools, the Map Viewer, Model Maker, Evidence Viewer, Trace Archive, Sequence Read Archive, Retroviral Genotyping Tools, HIV-1/Human Protein Interaction Database, Gene Expression Omnibus (GEO), Entrez Probe, GENSAT, Online Mendelian Inheritance in Man (OMIM), Online Mendelian Inheritance in Animals (OMIA), the Molecular Modeling Database (MMDB), the Conserved Domain Database (CDD), the Conserved Domain Architecture Retrieval Tool (CDART), IBIS, Biosystems, Peptidome, OMSSA, Protein Clusters and the PubChem suite of small molecule databases. Augmenting many of the Web applications are custom implementations of the BLAST program optimized to search specialized data sets. All of these resources can be accessed through the NCBI home page at www.ncbi.nlm.nih.gov.

Assuntos

Bases de Dados Genéticas , Bases de Dados de Proteínas , Expressão Gênica , Genômica , National Library of Medicine (U.S.) , Estrutura Terciária de Proteína , PubMed , Alinhamento de Sequência , Análise de Sequência de DNA , Análise de Sequência de RNA , Software , Integração de Sistemas , Estados Unidos

8.

Database resources of the National Center for Biotechnology Information.

Sayers, Eric W; Barrett, Tanya; Benson, Dennis A; Bolton, Evan; Bryant, Stephen H; Canese, Kathi; Chetvernin, Vyacheslav; Church, Deanna M; Dicuccio, Michael; Federhen, Scott; Feolo, Michael; Geer, Lewis Y; Helmberg, Wolfgang; Kapustin, Yuri; Landsman, David; Lipman, David J; Lu, Zhiyong; Madden, Thomas L; Madej, Tom; Maglott, Donna R; Marchler-Bauer, Aron; Miller, Vadim; Mizrachi, Ilene; Ostell, James; Panchenko, Anna; Pruitt, Kim D; Schuler, Gregory D; Sequeira, Edwin; Sherry, Stephen T; Shumway, Martin; Sirotkin, Karl; Slotta, Douglas; Souvorov, Alexandre; Starchenko, Grigory; Tatusova, Tatiana A; Wagner, Lukas; Wang, Yanli; John Wilbur, W; Yaschenko, Eugene; Ye, Jian.

Nucleic Acids Res ; 38(Database issue): D5-16, 2010 Jan.

Artigo em Inglês | MEDLINE | ID: mdl-19910364

RESUMO

In addition to maintaining the GenBank nucleic acid sequence database, the National Center for Biotechnology Information (NCBI) provides analysis and retrieval resources for the data in GenBank and other biological data made available through the NCBI web site. NCBI resources include Entrez, the Entrez Programming Utilities, MyNCBI, PubMed, PubMed Central, Entrez Gene, the NCBI Taxonomy Browser, BLAST, BLAST Link (BLink), Electronic PCR, OrfFinder, Spidey, Splign, Reference Sequence, UniGene, HomoloGene, ProtEST, dbMHC, dbSNP, Cancer Chromosomes, Entrez Genomes and related tools, the Map Viewer, Model Maker, Evidence Viewer, Trace Archive, Sequence Read Archive, Retroviral Genotyping Tools, HIV-1/Human Protein Interaction Database, Gene Expression Omnibus, Entrez Probe, GENSAT, Online Mendelian Inheritance in Man, Online Mendelian Inheritance in Animals, the Molecular Modeling Database, the Conserved Domain Database, the Conserved Domain Architecture Retrieval Tool, Biosystems, Peptidome, Protein Clusters and the PubChem suite of small molecule databases. Augmenting many of the web applications are custom implementations of the BLAST program optimized to search specialized data sets. All these resources can be accessed through the NCBI home page at www.ncbi.nlm.nih.gov.

Assuntos

Biologia Computacional/métodos , Bases de Dados Genéticas , Bases de Dados de Ácidos Nucleicos , Algoritmos , Animais , Biologia Computacional/tendências , Bases de Dados de Proteínas , Genoma Bacteriano , Genoma Viral , Humanos , Armazenamento e Recuperação da Informação/métodos , Internet , National Institutes of Health (U.S.) , National Library of Medicine (U.S.) , Software , Estados Unidos

9.

Database resources of the National Center for Biotechnology Information.

Sayers, Eric W; Barrett, Tanya; Benson, Dennis A; Bryant, Stephen H; Canese, Kathi; Chetvernin, Vyacheslav; Church, Deanna M; DiCuccio, Michael; Edgar, Ron; Federhen, Scott; Feolo, Michael; Geer, Lewis Y; Helmberg, Wolfgang; Kapustin, Yuri; Landsman, David; Lipman, David J; Madden, Thomas L; Maglott, Donna R; Miller, Vadim; Mizrachi, Ilene; Ostell, James; Pruitt, Kim D; Schuler, Gregory D; Sequeira, Edwin; Sherry, Stephen T; Shumway, Martin; Sirotkin, Karl; Souvorov, Alexandre; Starchenko, Grigory; Tatusova, Tatiana A; Wagner, Lukas; Yaschenko, Eugene; Ye, Jian.

Nucleic Acids Res ; 37(Database issue): D5-15, 2009 Jan.

Artigo em Inglês | MEDLINE | ID: mdl-18940862

RESUMO

In addition to maintaining the GenBank nucleic acid sequence database, the National Center for Biotechnology Information (NCBI) provides analysis and retrieval resources for the data in GenBank and other biological data made available through the NCBI web site. NCBI resources include Entrez, the Entrez Programming Utilities, MyNCBI, PubMed, PubMed Central, Entrez Gene, the NCBI Taxonomy Browser, BLAST, BLAST Link (BLink), Electronic PCR, OrfFinder, Spidey, Splign, RefSeq, UniGene, HomoloGene, ProtEST, dbMHC, dbSNP, Cancer Chromosomes, Entrez Genomes and related tools, the Map Viewer, Model Maker, Evidence Viewer, Clusters of Orthologous Groups (COGs), Retroviral Genotyping Tools, HIV-1/Human Protein Interaction Database, Gene Expression Omnibus (GEO), Entrez Probe, GENSAT, Online Mendelian Inheritance in Man (OMIM), Online Mendelian Inheritance in Animals (OMIA), the Molecular Modeling Database (MMDB), the Conserved Domain Database (CDD), the Conserved Domain Architecture Retrieval Tool (CDART) and the PubChem suite of small molecule databases. Augmenting many of the web applications is custom implementation of the BLAST program optimized to search specialized data sets. All of the resources can be accessed through the NCBI home page at www.ncbi.nlm.nih.gov.

Assuntos

Bases de Dados Genéticas , Expressão Gênica , Genes , Genômica , Genótipo , National Library of Medicine (U.S.) , Fenótipo , Estrutura Terciária de Proteína , Proteômica , PubMed , Homologia de Sequência , Integração de Sistemas , Estados Unidos

10.

Database resources of the National Center for Biotechnology Information.

Wheeler, David L; Barrett, Tanya; Benson, Dennis A; Bryant, Stephen H; Canese, Kathi; Chetvernin, Vyacheslav; Church, Deanna M; Dicuccio, Michael; Edgar, Ron; Federhen, Scott; Feolo, Michael; Geer, Lewis Y; Helmberg, Wolfgang; Kapustin, Yuri; Khovayko, Oleg; Landsman, David; Lipman, David J; Madden, Thomas L; Maglott, Donna R; Miller, Vadim; Ostell, James; Pruitt, Kim D; Schuler, Gregory D; Shumway, Martin; Sequeira, Edwin; Sherry, Steven T; Sirotkin, Karl; Souvorov, Alexandre; Starchenko, Grigory; Tatusov, Roman L; Tatusova, Tatiana A; Wagner, Lukas; Yaschenko, Eugene.

Nucleic Acids Res ; 36(Database issue): D13-21, 2008 Jan.

Artigo em Inglês | MEDLINE | ID: mdl-18045790

RESUMO

In addition to maintaining the GenBank(R) nucleic acid sequence database, the National Center for Biotechnology Information (NCBI) provides analysis and retrieval resources for the data in GenBank and other biological data available through NCBI's web site. NCBI resources include Entrez, the Entrez Programming Utilities, My NCBI, PubMed, PubMed Central, Entrez Gene, the NCBI Taxonomy Browser, BLAST, BLAST Link, Electronic PCR, OrfFinder, Spidey, Splign, RefSeq, UniGene, HomoloGene, ProtEST, dbMHC, dbSNP, Cancer Chromosomes, Entrez Genome, Genome Project and related tools, the Trace, Assembly, and Short Read Archives, the Map Viewer, Model Maker, Evidence Viewer, Clusters of Orthologous Groups, Influenza Viral Resources, HIV-1/Human Protein Interaction Database, Gene Expression Omnibus, Entrez Probe, GENSAT, Database of Genotype and Phenotype, Online Mendelian Inheritance in Man, Online Mendelian Inheritance in Animals, the Molecular Modeling Database, the Conserved Domain Database, the Conserved Domain Architecture Retrieval Tool and the PubChem suite of small molecule databases. Augmenting the web applications are custom implementations of the BLAST program optimized to search specialized data sets. These resources can be accessed through the NCBI home page at www.ncbi.nlm.nih.gov.

Assuntos

Bases de Dados Genéticas , National Library of Medicine (U.S.) , Animais , Bases de Dados de Ácidos Nucleicos , Expressão Gênica , Genômica , Genótipo , Humanos , Internet , Modelos Moleculares , Fenótipo , Proteômica , Alinhamento de Sequência , Estados Unidos

11.

Database resources of the National Center for Biotechnology Information.

Wheeler, David L; Barrett, Tanya; Benson, Dennis A; Bryant, Stephen H; Canese, Kathi; Chetvernin, Vyacheslav; Church, Deanna M; DiCuccio, Michael; Edgar, Ron; Federhen, Scott; Geer, Lewis Y; Kapustin, Yuri; Khovayko, Oleg; Landsman, David; Lipman, David J; Madden, Thomas L; Maglott, Donna R; Ostell, James; Miller, Vadim; Pruitt, Kim D; Schuler, Gregory D; Sequeira, Edwin; Sherry, Steven T; Sirotkin, Karl; Souvorov, Alexandre; Starchenko, Grigory; Tatusov, Roman L; Tatusova, Tatiana A; Wagner, Lukas; Yaschenko, Eugene.

Nucleic Acids Res ; 35(Database issue): D5-12, 2007 Jan.

Artigo em Inglês | MEDLINE | ID: mdl-17170002

RESUMO

In addition to maintaining the GenBank nucleic acid sequence database, the National Center for Biotechnology Information (NCBI) provides analysis and retrieval resources for the data in GenBank and other biological data made available through NCBI's Web site. NCBI resources include Entrez, the Entrez Programming Utilities, My NCBI, PubMed, PubMed Central, Entrez Gene, the NCBI Taxonomy Browser, BLAST, BLAST Link(BLink), Electronic PCR, OrfFinder, Spidey, Splign, RefSeq, UniGene, HomoloGene, ProtEST, dbMHC, dbSNP, Cancer Chromosomes, Entrez Genome, Genome Project and related tools, the Trace and Assembly Archives, the Map Viewer, Model Maker, Evidence Viewer, Clusters of Orthologous Groups (COGs), Viral Genotyping Tools, Influenza Viral Resources, HIV-1/Human Protein Interaction Database, Gene Expression Omnibus (GEO), Entrez Probe, GENSAT, Online Mendelian Inheritance in Man (OMIM), Online Mendelian Inheritance in Animals (OMIA), the Molecular Modeling Database (MMDB), the Conserved Domain Database (CDD), the Conserved Domain Architecture Retrieval Tool (CDART) and the PubChem suite of small molecule databases. Augmenting many of the Web applications are custom implementations of the BLAST program optimized to search specialized data sets. These resources can be accessed through the NCBI home page at www.ncbi.nlm.nih.gov.

Assuntos

Bases de Dados Genéticas , National Library of Medicine (U.S.) , Animais , Bases de Dados de Ácidos Nucleicos , Bases de Dados de Proteínas , Expressão Gênica , Genômica , Humanos , Internet , Fenótipo , Proteômica , PubMed , Alinhamento de Sequência , Software , Estados Unidos

12.

Database resources of the National Center for Biotechnology Information.

Wheeler, David L; Barrett, Tanya; Benson, Dennis A; Bryant, Stephen H; Canese, Kathi; Chetvernin, Vyacheslav; Church, Deanna M; DiCuccio, Michael; Edgar, Ron; Federhen, Scott; Geer, Lewis Y; Helmberg, Wolfgang; Kapustin, Yuri; Kenton, David L; Khovayko, Oleg; Lipman, David J; Madden, Thomas L; Maglott, Donna R; Ostell, James; Pruitt, Kim D; Schuler, Gregory D; Schriml, Lynn M; Sequeira, Edwin; Sherry, Stephen T; Sirotkin, Karl; Souvorov, Alexandre; Starchenko, Grigory; Suzek, Tugba O; Tatusov, Roman; Tatusova, Tatiana A; Wagner, Lukas; Yaschenko, Eugene.

Nucleic Acids Res ; 34(Database issue): D173-80, 2006 Jan 01.

Artigo em Inglês | MEDLINE | ID: mdl-16381840

RESUMO

In addition to maintaining the GenBank nucleic acid sequence database, the National Center for Biotechnology Information (NCBI) provides analysis and retrieval resources for the data in GenBank and other biological data made available through NCBI's Web site. NCBI resources include Entrez, the Entrez Programming Utilities, MyNCBI, PubMed, PubMed Central, Entrez Gene, the NCBI Taxonomy Browser, BLAST, BLAST Link (BLink), Electronic PCR, OrfFinder, Spidey, Splign, RefSeq, UniGene, HomoloGene, ProtEST, dbMHC, dbSNP, Cancer Chromosomes, Entrez Genomes and related tools, the Map Viewer, Model Maker, Evidence Viewer, Clusters of Orthologous Groups, Retroviral Genotyping Tools, HIV-1, Human Protein Interaction Database, SAGEmap, Gene Expression Omnibus, Entrez Probe, GENSAT, Online Mendelian Inheritance in Man, Online Mendelian Inheritance in Animals, the Molecular Modeling Database, the Conserved Domain Database, the Conserved Domain Architecture Retrieval Tool and the PubChem suite of small molecule databases. Augmenting many of the Web applications are custom implementations of the BLAST program optimized to search specialized datasets. All of the resources can be accessed through the NCBI home page at: http://www.ncbi.nlm.nih.gov.

Assuntos

Bases de Dados Genéticas , National Library of Medicine (U.S.) , Bases de Dados de Ácidos Nucleicos , Bases de Dados de Proteínas , Regulação da Expressão Gênica , Genes , Genômica , Humanos , Internet , PubMed , Alinhamento de Sequência , Análise de Sequência de DNA , Software , Estados Unidos

13.

Database resources of the National Center for Biotechnology Information.

Wheeler, David L; Barrett, Tanya; Benson, Dennis A; Bryant, Stephen H; Canese, Kathi; Church, Deanna M; DiCuccio, Michael; Edgar, Ron; Federhen, Scott; Helmberg, Wolfgang; Kenton, David L; Khovayko, Oleg; Lipman, David J; Madden, Thomas L; Maglott, Donna R; Ostell, James; Pontius, Joan U; Pruitt, Kim D; Schuler, Gregory D; Schriml, Lynn M; Sequeira, Edwin; Sherry, Steven T; Sirotkin, Karl; Starchenko, Grigory; Suzek, Tugba O; Tatusov, Roman; Tatusova, Tatiana A; Wagner, Lukas; Yaschenko, Eugene.

Nucleic Acids Res ; 33(Database issue): D39-45, 2005 Jan 01.

Artigo em Inglês | MEDLINE | ID: mdl-15608222

RESUMO

In addition to maintaining the GenBank nucleic acid sequence database, the National Center for Biotechnology Information (NCBI) provides data retrieval systems and computational resources for the analysis of data in GenBank and other biological data made available through NCBI's website. NCBI resources include Entrez, Entrez Programming Utilities, PubMed, PubMed Central, Entrez Gene, the NCBI Taxonomy Browser, BLAST, BLAST Link (BLink), Electronic PCR, OrfFinder, Spidey, RefSeq, UniGene, HomoloGene, ProtEST, dbMHC, dbSNP, Cancer Chromosomes, Entrez Genomes and related tools, the Map Viewer, Model Maker, Evidence Viewer, Clusters of Orthologous Groups (COGs), Retroviral Genotyping Tools, HIV-1/Human Protein Interaction Database, SAGEmap, Gene Expression Omnibus (GEO), Online Mendelian Inheritance in Man (OMIM), the Molecular Modeling Database (MMDB), the Conserved Domain Database (CDD) and the Conserved Domain Architecture Retrieval Tool (CDART). Augmenting many of the Web applications are custom implementations of the BLAST program optimized to search specialized datasets. All of the resources can be accessed through the NCBI home page at http://www.ncbi.nlm.nih.gov.

Assuntos

Bases de Dados Genéticas , National Library of Medicine (U.S.) , Sequência de Aminoácidos , Animais , Biologia Computacional , Sequência Conservada , Bases de Dados Factuais , Perfilação da Expressão Gênica , Genes , Genômica , Humanos , Modelos Moleculares , Fenótipo , Mapeamento de Interação de Proteínas , Estrutura Terciária de Proteína , Alinhamento de Sequência , Software , Estados Unidos

14.

Analysis of segmental duplications and genome assembly in the mouse.

Bailey, Jeffrey A; Church, Deanna M; Ventura, Mario; Rocchi, Mariano; Eichler, Evan E.

Genome Res ; 14(5): 789-801, 2004 May.

Artigo em Inglês | MEDLINE | ID: mdl-15123579

RESUMO

Limited comparative studies suggest that the human genome is particularly enriched for recent segmental duplications. The extent of segmental duplications in other mammalian genomes is unknown and confounded by methodological differences in genome assembly. Here, we present a detailed analysis of recent duplication content within the mouse genome using a whole-genome assembly comparison method and a novel assembly independent method, designed to take advantage of the reduced allelic variation of the C57BL/6J strain. We conservatively estimate that approximately 57% of all highly identical segmental duplications (>or=90%) were misassembled or collapsed within the working draft WGS assembly. The WGS approach often leaves duplications fragmented and unassigned to a chromosome when compared with the clone-ordered-based approach. Our preliminary analysis suggests that 1.7%-2.0% of the mouse genome is part of recent large segmental duplications (about half of what is observed for the human genome). We have constructed a mouse segmental duplication database to aid in the characterization of these regions and their integration into the final mouse genome assembly. This work suggests significant biological differences in the architecture of recent segmental duplications between human and mouse. In addition, our unique method provides the means for improving whole-genome shotgun sequence assembly of mouse and future mammalian genomes.

Assuntos

Duplicação Gênica , Genoma , Análise de Sequência de DNA/métodos , Animais , Composição de Bases/genética , Células da Medula Óssea/química , Células da Medula Óssea/metabolismo , Núcleo Celular/genética , Mapeamento Cromossômico/métodos , Mapeamento Cromossômico/normas , Mapeamento Cromossômico/estatística & dados numéricos , Biologia Computacional/métodos , DNA/genética , Bases de Dados Genéticas , Genes , Genoma Humano , Humanos , Hibridização in Situ Fluorescente/métodos , Camundongos , Camundongos Endogâmicos C57BL , Alinhamento de Sequência/métodos

15.

Database resources of the National Center for Biotechnology Information: update.

Wheeler, David L; Church, Deanna M; Edgar, Ron; Federhen, Scott; Helmberg, Wolfgang; Madden, Thomas L; Pontius, Joan U; Schuler, Gregory D; Schriml, Lynn M; Sequeira, Edwin; Suzek, Tugba O; Tatusova, Tatiana A; Wagner, Lukas.

Nucleic Acids Res ; 32(Database issue): D35-40, 2004 Jan 01.

Artigo em Inglês | MEDLINE | ID: mdl-14681353

RESUMO

In addition to maintaining the GenBank(R) nucleic acid sequence database, the National Center for Biotechnology Information (NCBI) provides data analysis and retrieval resources for the data in GenBank and other biological data made available through NCBI's website. NCBI resources include Entrez, PubMed, PubMed Central, LocusLink, the NCBI Taxonomy Browser, BLAST, BLAST Link (BLink), Electronic PCR, OrfFinder, Spidey, RefSeq, UniGene, HomoloGene, ProtEST, dbMHC, dbSNP, Cancer Chromosome Aberration Project (CCAP), Entrez Genomes and related tools, the Map Viewer, Model Maker, Evidence Viewer, Clusters of Orthologous Groups (COGs) database, Retroviral Genotyping Tools, SARS Coronavirus Resource, SAGEmap, Gene Expression Omnibus (GEO), Online Mendelian Inheritance in Man (OMIM), the Molecular Modeling Database (MMDB), the Conserved Domain Database (CDD) and the Conserved Domain Architecture Retrieval Tool (CDART). Augmenting many of the web applications are custom implementations of the BLAST program optimized to search specialized data sets. All of the resources can be accessed through the NCBI home page at: http://www.ncbi.nlm.nih.gov.

Assuntos

Biologia Computacional , Bases de Dados Factuais , National Institutes of Health (U.S.) , Animais , Classificação , Perfilação da Expressão Gênica , Genes , Genoma , Genômica , Humanos , Armazenamento e Recuperação da Informação , Fases de Leitura Aberta , Polimorfismo Genético , PubMed , Software , Estados Unidos

16.

Database resources of the National Center for Biotechnology.

Wheeler, David L; Church, Deanna M; Federhen, Scott; Lash, Alex E; Madden, Thomas L; Pontius, Joan U; Schuler, Gregory D; Schriml, Lynn M; Sequeira, Edwin; Tatusova, Tatiana A; Wagner, Lukas.

Nucleic Acids Res ; 31(1): 28-33, 2003 Jan 01.

Artigo em Inglês | MEDLINE | ID: mdl-12519941

RESUMO

In addition to maintaining the GenBank(R) nucleic acid sequence database, the National Center for Biotechnology Information (NCBI) provides data analysis and retrieval resources for the data in GenBank and other biological data made available through NCBI's Web site. NCBI resources include Entrez, PubMed, PubMed Central (PMC), LocusLink, the NCBITaxonomy Browser, BLAST, BLAST Link (BLink), Electronic PCR (e-PCR), Open Reading Frame (ORF) Finder, References Sequence (RefSeq), UniGene, HomoloGene, ProtEST, Database of Single Nucleotide Polymorphisms (dbSNP), Human/Mouse Homology Map, Cancer Chromosome Aberration Project (CCAP), Entrez Genomes and related tools, the Map Viewer, Model Maker (MM), Evidence Viewer (EV), Clusters of Orthologous Groups (COGs) database, Retroviral Genotyping Tools, SAGEmap, Gene Expression Omnibus (GEO), Online Mendelian Inheritance in Man (OMIM), the Molecular Modeling Database (MMDB), the Conserved Domain Database (CDD), and the Conserved Domain Architecture Retrieval Tool (CDART). Augmenting many of the Web applications are custom implementations of the BLAST program optimized to search specialized data sets. All of the resources can be accessed through the NCBI home page at: http://www.ncbi.nlm.nih.gov.

Assuntos

Biotecnologia , Bases de Dados Genéticas , Animais , Mapeamento Cromossômico , Perfilação da Expressão Gênica , Genes , Genoma , Humanos , Armazenamento e Recuperação da Informação , Camundongos , Modelos Moleculares , Fenótipo , Estrutura Terciária de Proteína , Alinhamento de Sequência/métodos , Homologia de Sequência , Estados Unidos

17.

Initial sequencing and comparative analysis of the mouse genome.

Waterston, Robert H; Lindblad-Toh, Kerstin; Birney, Ewan; Rogers, Jane; Abril, Josep F; Agarwal, Pankaj; Agarwala, Richa; Ainscough, Rachel; Alexandersson, Marina; An, Peter; Antonarakis, Stylianos E; Attwood, John; Baertsch, Robert; Bailey, Jonathon; Barlow, Karen; Beck, Stephan; Berry, Eric; Birren, Bruce; Bloom, Toby; Bork, Peer; Botcherby, Marc; Bray, Nicolas; Brent, Michael R; Brown, Daniel G; Brown, Stephen D; Bult, Carol; Burton, John; Butler, Jonathan; Campbell, Robert D; Carninci, Piero; Cawley, Simon; Chiaromonte, Francesca; Chinwalla, Asif T; Church, Deanna M; Clamp, Michele; Clee, Christopher; Collins, Francis S; Cook, Lisa L; Copley, Richard R; Coulson, Alan; Couronne, Olivier; Cuff, James; Curwen, Val; Cutts, Tim; Daly, Mark; David, Robert; Davies, Joy; Delehaunty, Kimberly D; Deri, Justin; Dermitzakis, Emmanouil T.

Nature ; 420(6915): 520-62, 2002 Dec 05.

Artigo em Inglês | MEDLINE | ID: mdl-12466850

RESUMO

The sequence of the mouse genome is a key informational tool for understanding the contents of the human genome and a key experimental tool for biomedical research. Here, we report the results of an international collaboration to produce a high-quality draft sequence of the mouse genome. We also present an initial comparative analysis of the mouse and human genomes, describing some of the insights that can be gleaned from the two sequences. We discuss topics including the analysis of the evolutionary forces shaping the size, structure and sequence of the genomes; the conservation of large-scale synteny across most of the genomes; the much lower extent of sequence orthology covering less than half of the genomes; the proportions of the genomes under selection; the number of protein-coding genes; the expansion of gene families related to reproduction and immunity; the evolution of proteins; and the identification of intraspecies polymorphism.

Assuntos

Cromossomos de Mamíferos/genética , Evolução Molecular , Genoma , Camundongos/genética , Mapeamento Físico do Cromossomo , Animais , Composição de Bases , Sequência Conservada/genética , Ilhas de CpG/genética , Regulação da Expressão Gênica , Genes/genética , Variação Genética/genética , Genoma Humano , Genômica , Humanos , Camundongos/classificação , Camundongos Knockout , Camundongos Transgênicos , Modelos Animais , Família Multigênica/genética , Mutagênese , Neoplasias/genética , Proteoma/genética , Pseudogenes/genética , Locos de Características Quantitativas/genética , RNA não Traduzido/genética , Sequências Repetitivas de Ácido Nucleico/genética , Seleção Genética , Análise de Sequência de DNA , Cromossomos Sexuais/genética , Especificidade da Espécie , Sintenia

18.

Database resources of the National Center for Biotechnology Information: 2002 update.

Wheeler, David L; Church, Deanna M; Lash, Alex E; Leipe, Detlef D; Madden, Thomas L; Pontius, Joan U; Schuler, Gregory D; Schriml, Lynn M; Tatusova, Tatiana A; Wagner, Lukas; Rapp, Barbara A.

Nucleic Acids Res ; 30(1): 13-6, 2002 Jan 01.

Artigo em Inglês | MEDLINE | ID: mdl-11752242

RESUMO

In addition to maintaining the GenBank nucleic acid sequence database, the National Center for Biotechnology Information (NCBI) provides data analysis and retrieval resources that operate on the data in GenBank and a variety of other biological data made available through NCBI's web site. NCBI data retrieval resources include Entrez, PubMed, LocusLink and the Taxonomy Browser. Data analysis resources include BLAST, Electronic PCR, OrfFinder, RefSeq, UniGene, HomoloGene, Database of Single Nucleotide Polymorphisms (dbSNP), Human Genome Sequencing, Human MapViewer, Human inverted exclamation markVMouse Homology Map, Cancer Chromosome Aberration Project (CCAP), Entrez Genomes, Clusters of Orthologous Groups (COGs) database, Retroviral Genotyping Tools, SAGEmap, Gene Expression Omnibus (GEO), Online Mendelian Inheritance in Man (OMIM), the Molecular Modeling Database (MMDB) and the Conserved Domain Database (CDD). Augmenting many of the web applications are custom implementations of the BLAST program optimized to search specialized data sets. All of the resources can be accessed through the NCBI home page at http://www.ncbi.nlm.nih.gov.

Assuntos

Biotecnologia , Bases de Dados Genéticas , Sequência de Aminoácidos , Animais , Sequência de Bases , Aberrações Cromossômicas , Cromossomos , Sequência Conservada , Perfilação da Expressão Gênica , Genoma , Genoma Humano , Humanos , Armazenamento e Recuperação da Informação , National Library of Medicine (U.S.) , Polimorfismo de Nucleotídeo Único , Estrutura Terciária de Proteína , RNA Mensageiro/genética , Homologia de Sequência , Estados Unidos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA