Search | BVS España Search Portal

1.

GenBank 2024 Update.

Sayers, Eric W; Cavanaugh, Mark; Clark, Karen; Pruitt, Kim D; Sherry, Stephen T; Yankie, Linda; Karsch-Mizrachi, Ilene.

Nucleic Acids Res ; 52(D1): D134-D137, 2024 Jan 05.

Article in English | MEDLINE | ID: mdl-37889039

ABSTRACT

GenBank® (https://www.ncbi.nlm.nih.gov/genbank/) is a comprehensive, public database that contains 25 trillion base pairs from over 3.7 billion nucleotide sequences for 557 000 formally described species. Daily data exchange with the European Nucleotide Archive (ENA) and the DNA Data Bank of Japan (DDBJ) ensures worldwide coverage. Recent updates include policies for including spatio-temporal metadata, clarified documentation for GenBank data processing, enhanced foreign contamination screening tools, new processes in the Submission Portal, migration of Entrez Genome and Assembly displays into NCBI Datasets, and the impending retirement of tbl2asn, replaced by table2asn.

Subject(s)

Databases, Nucleic Acid; Genomics; Base Sequence; Internet; Humans

2.

Database resources of the National Center for Biotechnology Information.

Sayers, Eric W; Beck, Jeff; Bolton, Evan E; Brister, J Rodney; Chan, Jessica; Comeau, Donald C; Connor, Ryan; DiCuccio, Michael; Farrell, Catherine M; Feldgarden, Michael; Fine, Anna M; Funk, Kathryn; Hatcher, Eneida; Hoeppner, Marilu; Kane, Megan; Kannan, Sivakumar; Katz, Kenneth S; Kelly, Christopher; Klimke, William; Kim, Sunghwan; Kimchi, Avi; Landrum, Melissa; Lathrop, Stacy; Lu, Zhiyong; Malheiro, Adriana; Marchler-Bauer, Aron; Murphy, Terence D; Phan, Lon; Prasad, Arjun B; Pujar, Shashikant; Sawyer, Amanda; Schmieder, Erin; Schneider, Valerie A; Schoch, Conrad L; Sharma, Shobha; Thibaud-Nissen, Françoise; Trawick, Barton W; Venkatapathi, Thilakam; Wang, Jiyao; Pruitt, Kim D; Sherry, Stephen T.

Nucleic Acids Res ; 52(D1): D33-D43, 2024 Jan 05.

Article in English | MEDLINE | ID: mdl-37994677

ABSTRACT

The National Center for Biotechnology Information (NCBI) provides online information resources for biology, including the GenBank® nucleic acid sequence database and the PubMed® database of citations and abstracts published in life science journals. NCBI provides search and retrieval operations for most of these data from 35 distinct databases. The E-utilities serve as the programming interface for most of these databases. Resources receiving significant updates in the past year include PubMed, PMC, Bookshelf, SciENcv, the NIH Comparative Genomics Resource (CGR), NCBI Virus, SRA, RefSeq, foreign contamination screening tools, Taxonomy, iCn3D, ClinVar, GTR, MedGen, dbSNP, ALFA, ClinicalTrials.gov, Pathogen Detection, antimicrobial resistance resources, and PubChem. These resources can be accessed through the NCBI home page at https://www.ncbi.nlm.nih.gov.

Subject(s)

Databases, Genetic; National Library of Medicine (U.S.); Biotechnology/instrumentation; Databases, Nucleic Acid; Internet; United States
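
The abstract above names the E-utilities as the programming interface to most NCBI databases. As a minimal sketch of what a call looks like, the Python below only builds an ESearch request URL and sends nothing. The helper name `build_esearch_url` and the example query are illustrative assumptions, not taken from the article; the base URL and the `db`, `term`, `retmax` and `retmode` parameters are the publicly documented ones.

```python
from urllib.parse import urlencode

# Public base URL of the NCBI E-utilities.
EUTILS_BASE = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils"


def build_esearch_url(db: str, term: str, retmax: int = 20) -> str:
    """Return an ESearch URL for `term` in Entrez database `db`.

    Hypothetical helper for illustration: it only assembles the URL,
    so no network request is made here.
    """
    params = {
        "db": db,           # Entrez database name, e.g. "pubmed" or "nuccore"
        "term": term,       # Entrez query string
        "retmax": retmax,   # maximum number of IDs to return
        "retmode": "json",  # ask for a JSON response
    }
    return f"{EUTILS_BASE}/esearch.fcgi?{urlencode(params)}"


if __name__ == "__main__":
    # Example query (assumed): PubMed records with "GenBank" in the title.
    print(build_esearch_url("pubmed", "GenBank[Title]", retmax=5))
```

The resulting URL can be fetched with any HTTP client. NCBI asks heavy users to stay within its published request-rate limits, so a real script would also throttle its calls.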

3.

GenBank 2023 update.

Sayers, Eric W; Cavanaugh, Mark; Clark, Karen; Pruitt, Kim D; Sherry, Stephen T; Yankie, Linda; Karsch-Mizrachi, Ilene.

Nucleic Acids Res ; 51(D1): D141-D144, 2023 Jan 06.

Article in English | MEDLINE | ID: mdl-36350640

ABSTRACT

GenBank® (https://www.ncbi.nlm.nih.gov/genbank/) is a comprehensive, public database that contains 19.6 trillion base pairs from over 2.9 billion nucleotide sequences for 504 000 formally described species. Daily data exchange with the European Nucleotide Archive (ENA) and the DNA Data Bank of Japan (DDBJ) ensures worldwide coverage. Recent updates include resources for data from the SARS-CoV-2 virus, NCBI Datasets, BLAST ClusteredNR, the Submission Portal, table2asn, a Foreign Contamination Screening tool and BioSample.

Subject(s)

Databases, Nucleic Acid; Humans; COVID-19/genetics; Genomics; SARS-CoV-2/genetics

4.

Database resources of the National Center for Biotechnology Information in 2023.

Sayers, Eric W; Bolton, Evan E; Brister, J Rodney; Canese, Kathi; Chan, Jessica; Comeau, Donald C; Farrell, Catherine M; Feldgarden, Michael; Fine, Anna M; Funk, Kathryn; Hatcher, Eneida; Kannan, Sivakumar; Kelly, Christopher; Kim, Sunghwan; Klimke, William; Landrum, Melissa J; Lathrop, Stacy; Lu, Zhiyong; Madden, Thomas L; Malheiro, Adriana; Marchler-Bauer, Aron; Murphy, Terence D; Phan, Lon; Pujar, Shashikant; Rangwala, Sanjida H; Schneider, Valerie A; Tse, Tony; Wang, Jiyao; Ye, Jian; Trawick, Barton W; Pruitt, Kim D; Sherry, Stephen T.

Nucleic Acids Res ; 51(D1): D29-D38, 2023 Jan 06.

Article in English | MEDLINE | ID: mdl-36370100

ABSTRACT

The National Center for Biotechnology Information (NCBI) provides online information resources for biology, including the GenBank® nucleic acid sequence database and the PubMed® database of citations and abstracts published in life science journals. NCBI provides search and retrieval operations for most of these data from 35 distinct databases. The E-utilities serve as the programming interface for most of these databases. New resources include the Comparative Genome Resource (CGR) and the BLAST ClusteredNR database. Resources receiving significant updates in the past year include PubMed, PMC, Bookshelf, IgBLAST, GDV, RefSeq, NCBI Virus, GenBank type assemblies, iCn3D, ClinVar, GTR, dbGaP, ALFA, ClinicalTrials.gov, Pathogen Detection, antimicrobial resistance resources, and PubChem. These resources can be accessed through the NCBI home page at https://www.ncbi.nlm.nih.gov.

Subject(s)

Databases, Genetic; Databases, Nucleic Acid; United States; National Library of Medicine (U.S.); Sequence Alignment; Biotechnology; Internet

5.

GenBank.

Sayers, Eric W; Cavanaugh, Mark; Clark, Karen; Pruitt, Kim D; Schoch, Conrad L; Sherry, Stephen T; Karsch-Mizrachi, Ilene.

Nucleic Acids Res ; 50(D1): D161-D164, 2022 Jan 07.

Article in English | MEDLINE | ID: mdl-34850943

ABSTRACT

GenBank® (https://www.ncbi.nlm.nih.gov/genbank/) is a comprehensive, public database that contains 15.3 trillion base pairs from over 2.5 billion nucleotide sequences for 504 000 formally described species. Recent updates include resources for data from the SARS-CoV-2 virus, including a SARS-CoV-2 landing page, NCBI Datasets, NCBI Virus and the Submission Portal. We also discuss upcoming changes to GI identifiers, a new data management interface for BioProject, and advice for providing contextual metadata in submissions.

Subject(s)

Databases, Nucleic Acid; Viruses/genetics; Genome, Viral; National Library of Medicine (U.S.); SARS-CoV-2/genetics; United States; User-Computer Interface

6.

Database resources of the National Center for Biotechnology Information.

Sayers, Eric W; Bolton, Evan E; Brister, J Rodney; Canese, Kathi; Chan, Jessica; Comeau, Donald C; Connor, Ryan; Funk, Kathryn; Kelly, Chris; Kim, Sunghwan; Madej, Tom; Marchler-Bauer, Aron; Lanczycki, Christopher; Lathrop, Stacy; Lu, Zhiyong; Thibaud-Nissen, Francoise; Murphy, Terence; Phan, Lon; Skripchenko, Yuri; Tse, Tony; Wang, Jiyao; Williams, Rebecca; Trawick, Barton W; Pruitt, Kim D; Sherry, Stephen T.

Nucleic Acids Res ; 50(D1): D20-D26, 2022 Jan 07.

Article in English | MEDLINE | ID: mdl-34850941

ABSTRACT

The National Center for Biotechnology Information (NCBI) produces a variety of online information resources for biology, including the GenBank® nucleic acid sequence database and the PubMed® database of citations and abstracts published in life science journals. NCBI provides search and retrieval operations for most of these data from 35 distinct databases. The E-utilities serve as the programming interface for most of these databases. Resources receiving significant updates in the past year include PubMed, PMC, Bookshelf, RefSeq, SRA, Virus, dbSNP, dbVar, ClinicalTrials.gov, MMDB, iCn3D and PubChem. These resources can be accessed through the NCBI home page at https://www.ncbi.nlm.nih.gov.

Subject(s)

Biotechnology/trends; Databases, Genetic/trends; Databases, Chemical; Databases, Nucleic Acid; Databases, Protein; Humans; Internet; National Library of Medicine (U.S.); PubMed; United States

7.

GenBank.

Sayers, Eric W; Cavanaugh, Mark; Clark, Karen; Pruitt, Kim D; Schoch, Conrad L; Sherry, Stephen T; Karsch-Mizrachi, Ilene.

Nucleic Acids Res ; 49(D1): D92-D96, 2021 Jan 08.

Article in English | MEDLINE | ID: mdl-33196830

ABSTRACT

GenBank® (https://www.ncbi.nlm.nih.gov/genbank/) is a comprehensive, public database that contains 9.9 trillion base pairs from over 2.1 billion nucleotide sequences for 478 000 formally described species. Daily data exchange with the European Nucleotide Archive and the DNA Data Bank of Japan ensures worldwide coverage. Recent updates include new resources for data from the SARS-CoV-2 virus, updates to the NCBI Submission Portal and associated submission wizards for dengue and SARS-CoV-2 viruses, new taxonomy queries for viruses and prokaryotes, and simplified submission processes for EST and GSS sequences.

Subject(s)

Computational Biology/statistics & numerical data; Databases, Nucleic Acid; Genomics/methods; SARS-CoV-2/genetics; Sequence Analysis, DNA/methods; Animals; COVID-19/epidemiology; COVID-19/virology; Computational Biology/methods; Humans; Information Storage and Retrieval/methods; Internet; Molecular Sequence Annotation/methods; Pandemics

8.

Database resources of the National Center for Biotechnology Information.

Sayers, Eric W; Beck, Jeffrey; Bolton, Evan E; Bourexis, Devon; Brister, James R; Canese, Kathi; Comeau, Donald C; Funk, Kathryn; Kim, Sunghwan; Klimke, William; Marchler-Bauer, Aron; Landrum, Melissa; Lathrop, Stacy; Lu, Zhiyong; Madden, Thomas L; O'Leary, Nuala; Phan, Lon; Rangwala, Sanjida H; Schneider, Valerie A; Skripchenko, Yuri; Wang, Jiyao; Ye, Jian; Trawick, Barton W; Pruitt, Kim D; Sherry, Stephen T.

Nucleic Acids Res ; 49(D1): D10-D17, 2021 Jan 08.

Article in English | MEDLINE | ID: mdl-33095870

ABSTRACT

The National Center for Biotechnology Information (NCBI) provides a large suite of online resources for biological information and data, including the GenBank® nucleic acid sequence database and the PubMed® database of citations and abstracts published in life science journals. The Entrez system provides search and retrieval operations for most of these data from 34 distinct databases. The E-utilities serve as the programming interface for the Entrez system. Custom implementations of the BLAST program provide sequence-based searching of many specialized datasets. New resources released in the past year include a new PubMed interface and NCBI datasets. Additional resources that were updated in the past year include PMC, Bookshelf, Genome Data Viewer, SRA, ClinVar, dbSNP, dbVar, Pathogen Detection, BLAST, Primer-BLAST, IgBLAST, iCn3D and PubChem. All of these resources can be accessed through the NCBI home page at https://www.ncbi.nlm.nih.gov.

Subject(s)

Databases, Genetic; National Library of Medicine (U.S.); Computational Biology/methods; Databases, Chemical; Databases, Nucleic Acid; Databases, Protein; Genomics/methods; Humans; PubMed; United States

9.

The dbGaP data browser: a new tool for browsing dbGaP controlled-access genomic data.

Wong, Kira M; Langlais, Kristofor; Tobias, Geoffrey S; Fletcher-Hoppe, Colette; Krasnewich, Donna; Leeds, Hilary S; Rodriguez, Laura Lyman; Godynskiy, Georgy; Schneider, Valerie A; Ramos, Erin M; Sherry, Stephen T.

Nucleic Acids Res ; 45(D1): D819-D826, 2017 Jan 04.

Article in English | MEDLINE | ID: mdl-27899644

ABSTRACT

The database of Genotypes and Phenotypes (dbGaP) Data Browser (https://www.ncbi.nlm.nih.gov/gap/ddb/) was developed in response to requests from the scientific community for a resource that enables view-only access to summary-level information and individual-level genotype and sequence data associated with phenotypic features maintained in the controlled-access tier of dbGaP. Until now, the dbGaP controlled-access environment required investigators to submit a data access request, wait for Data Access Committee review, download each data set, and examine it locally for potentially relevant information. Existing unrestricted-access genomic data browsing resources (e.g. http://evs.gs.washington.edu/EVS/, http://exac.broadinstitute.org/) provide only summary statistics or aggregate allele frequencies. The dbGaP Data Browser serves as a third solution, providing researchers with view-only access to a compilation of individual-level data from general research use (GRU) studies through a simplified controlled-access process. The National Institutes of Health (NIH) will continue to improve the Browser in response to user feedback and believes that this tool may decrease unnecessary download requests, while still facilitating responsible genomic data-sharing.

Subject(s)

Databases, Genetic; Genomics/methods; Genotype; Phenotype; Software; Web Browser; Computational Biology/methods; Genetic Association Studies/methods

10.

Consent Codes: Upholding Standard Data Use Conditions.

Dyke, Stephanie O M; Philippakis, Anthony A; Rambla De Argila, Jordi; Paltoo, Dina N; Luetkemeier, Erin S; Knoppers, Bartha M; Brookes, Anthony J; Spalding, J Dylan; Thompson, Mark; Roos, Marco; Boycott, Kym M; Brudno, Michael; Hurles, Matthew; Rehm, Heidi L; Matern, Andreas; Fiume, Marc; Sherry, Stephen T.

PLoS Genet ; 12(1): e1005772, 2016 Jan.

Article in English | MEDLINE | ID: mdl-26796797

ABSTRACT

A systematic way of recording data use conditions that are based on consent permissions as found in the datasets of the main public genome archives (NCBI dbGaP and EMBL-EBI/CRG EGA).

Subject(s)

Databases, Nucleic Acid; Genome; Genomic Library; Health Services Research

11.

ClinGen--the Clinical Genome Resource.

Rehm, Heidi L; Berg, Jonathan S; Brooks, Lisa D; Bustamante, Carlos D; Evans, James P; Landrum, Melissa J; Ledbetter, David H; Maglott, Donna R; Martin, Christa Lese; Nussbaum, Robert L; Plon, Sharon E; Ramos, Erin M; Sherry, Stephen T; Watson, Michael S.

N Engl J Med ; 372(23): 2235-42, 2015 Jun 04.

Article in English | MEDLINE | ID: mdl-26014595

ABSTRACT

On autopsy, a patient is found to have hypertrophic cardiomyopathy. The patient's family pursues genetic testing that shows a "likely pathogenic" variant for the condition on the basis of a study in an original research publication. Given the dominant inheritance of the condition and the risk of sudden cardiac death, other family members are tested for the genetic variant to determine their risk. Several family members test negative and are told that they are not at risk for hypertrophic cardiomyopathy and sudden cardiac death, and those who test positive are told that they need to be regularly monitored for cardiomyopathy on echocardiography. Five years later, during a routine clinic visit of one of the genotype-positive family members, the cardiologist queries a database for current knowledge on the genetic variant and discovers that the variant is now interpreted as "likely benign" by another laboratory that uses more recently derived population-frequency data. A newly available testing panel for additional genes that are implicated in hypertrophic cardiomyopathy is initiated on an affected family member, and a different variant is found that is determined to be pathogenic. Family members are retested, and one member who previously tested negative is now found to be positive for this new variant. An immediate clinical workup detects evidence of cardiomyopathy, and an intracardiac defibrillator is implanted to reduce the risk of sudden cardiac death.

Subject(s)

Databases, Genetic; Genetic Diseases, Inborn/genetics; Genetic Predisposition to Disease/genetics; Genetic Variation; Genome, Human; Genetic Testing; Humans; National Library of Medicine (U.S.); United States

12.

Assessing and managing risk when sharing aggregate genetic variant data.

Craig, David W; Goor, Robert M; Wang, Zhenyuan; Paschall, Justin; Ostell, Jim; Feolo, Michael; Sherry, Stephen T; Manolio, Teri A.

Nat Rev Genet ; 12(10): 730-6, 2011 Sep 16.

Article in English | MEDLINE | ID: mdl-21921928

ABSTRACT

Access to genetic data across studies is an important aspect of identifying new genetic associations through genome-wide association studies (GWASs). Meta-analysis across multiple GWASs with combined cohort sizes of tens of thousands of individuals often uncovers many more genome-wide associated loci than the original individual studies; this emphasizes the importance of tools and mechanisms for data sharing. However, even sharing summary-level data, such as allele frequencies, inherently carries some degree of privacy risk to study participants. Here we discuss mechanisms and resources for sharing data from GWASs, particularly focusing on approaches for assessing and quantifying the privacy risks to participants that result from the sharing of summary-level data.

Subject(s)

Data Collection; Genetic Variation; Genome-Wide Association Study; Information Dissemination/methods; Cohort Studies; Confidentiality; Data Collection/legislation & jurisprudence; Databases, Genetic; Genetic Variation/physiology; Genome-Wide Association Study/methods; Genome-Wide Association Study/statistics & numerical data; Humans; Information Dissemination/legislation & jurisprudence; Meta-Analysis as Topic; Polymorphism, Single Nucleotide; Risk Assessment

13.

A mechanism for controlled access to GWAS data: experience of the GAIN Data Access Committee.

Ramos, Erin M; Din-Lovinescu, Corina; Bookman, Ebony B; McNeil, Lisa J; Baker, Carl C; Godynskiy, Georgy; Harris, Emily L; Lehner, Thomas; McKeon, Catherine; Moss, Joel; Starks, Vaurice L; Sherry, Stephen T; Manolio, Teri A; Rodriguez, Laura Lyman.

Am J Hum Genet ; 92(4): 479-88, 2013 Apr 04.

Article in English | MEDLINE | ID: mdl-23561843

ABSTRACT

The Genetic Association Information Network (GAIN) Data Access Committee was established in June 2007 to provide prompt and fair access to data from six genome-wide association studies through the database of Genotypes and Phenotypes (dbGaP). Of 945 project requests received through 2011, 749 (79%) have been approved; median receipt-to-approval time decreased from 14 days in 2007 to 8 days in 2011. Over half (54%) of the proposed research uses were for GAIN-specific phenotypes; other uses were for method development (26%) and adding controls to other studies (17%). Eight data-management incidents, defined as compromises of any of the data-use conditions, occurred among nine approved users; most were procedural violations, and none violated participant confidentiality. Over 5 years of experience with GAIN data access has demonstrated substantial use of GAIN data by investigators from academic, nonprofit, and for-profit institutions with relatively few and contained policy violations. The availability of GAIN data has allowed for advances in both the understanding of the genetic underpinnings of mental-health disorders, diabetes, and psoriasis and the development and refinement of statistical methods for identifying genetic and environmental factors related to complex common diseases.

Subject(s)

Access to Information; Biomedical Research; Databases, Factual; Genome-Wide Association Study; Information Services; Humans; Research Personnel

14.

New models of collaboration in genome-wide association studies: the Genetic Association Information Network.

Manolio, Teri A; Rodriguez, Laura Lyman; Brooks, Lisa; Abecasis, Gonçalo; Ballinger, Dennis; Daly, Mark; Donnelly, Peter; Faraone, Stephen V; Frazer, Kelly; Gabriel, Stacey; Gejman, Pablo; Guttmacher, Alan; Harris, Emily L; Insel, Thomas; Kelsoe, John R; Lander, Eric; McCowin, Norma; Mailman, Matthew D; Nabel, Elizabeth; Ostell, James; Pugh, Elizabeth; Sherry, Stephen; Sullivan, Patrick F; Thompson, John F; Warram, James; Wholley, David; Milos, Patrice M; Collins, Francis S.

Nat Genet ; 39(9): 1045-51, 2007 Sep.

Article in English | MEDLINE | ID: mdl-17728769

ABSTRACT

The Genetic Association Information Network (GAIN) is a public-private partnership established to investigate the genetic basis of common diseases through a series of collaborative genome-wide association studies. GAIN has used new approaches for project selection, data deposition and distribution, collaborative analysis, publication and protection from premature intellectual property claims. These demonstrate a new commitment to shared scientific knowledge that should facilitate rapid advances in understanding the genetics of complex diseases.

Subject(s)

Biomedical Research/methods; Genetic Predisposition to Disease; Genome, Human/genetics; Information Services/organization & administration; Attention Deficit Disorder with Hyperactivity/genetics; Bipolar Disorder/genetics; Humans; International Cooperation; Models, Organizational; Psoriasis/genetics

15.

Recommendations of the 2006 Human Variome Project meeting.

Cotton, Richard G H; Appelbe, William; Auerbach, Arleen D; Becker, Kevin; Bodmer, Walter; Boone, D Joe; Boulyjenkov, Victor; Brahmachari, Samir; Brody, Lawrence; Brookes, Anthony; Brown, Alastair F; Byers, Peter; Cantu, Jose Maria; Cassiman, Jean-Jacques; Claustres, Mireille; Concannon, Patrick; Cotton, Richard G H; den Dunnen, Johan T; Flicek, Paul; Gibbs, Richard; Hall, Judith; Hasler, Julia; Katz, Michael; Kwok, Pui-Yan; Laradi, Sandrine; Lindblom, Annika; Maglott, Donna; Marsh, Steven; Masimirembwa, Collen Muto; Minoshima, Shinsei; de Ramirez, Ana Maria Oller; Pagon, Roberta; Ramesar, Raj; Ravine, David; Richards, Sue; Rimoin, David; Ring, Huijun Z; Scriver, Charles R; Sherry, Stephen; Shimizu, Nobuyoshi; Stein, Lincoln; Tadmouri, Ghazi Omar; Taylor, Graham; Watson, Michael.

Nat Genet ; 39(4): 433-6, 2007 Apr.

Article in English | MEDLINE | ID: mdl-17392799

ABSTRACT

Lists of variations in genomic DNA and their effects have been kept for some time and have been used in diagnostics and research. Although these lists have been carefully gathered and curated, there has been little standardization and coordination, complicating their use. Given the myriad possible variations in the estimated 24,000 genes in the human genome, it would be useful to have standard criteria for databases of variation. Incomplete collection and ascertainment of variants demonstrates a need for a universally accessible system. These and other problems led to the World Health Organization-cosponsored meeting on June 20-23, 2006 in Melbourne, Australia, which launched the Human Variome Project. This meeting addressed all areas of human genetics relevant to collection of information on variation and its effects. Members of each of eight sessions (the clinic and phenotype, the diagnostic laboratory, the research laboratory, curation and collection, informatics, relevance to the emerging world, integration and federation and funding and sustainability) developed a number of recommendations that were then organized into a total of 96 recommendations to act as a foundation for future work worldwide. Here we summarize the background of the project, the meeting and its recommendations.

Subject(s)

Genome, Human; Guidelines as Topic; Polymorphism, Genetic; Genetic Diseases, Inborn/classification; Genetic Diseases, Inborn/genetics; Human Genome Project; Humans; World Health Organization

16.

The 1000 Genomes Project: data management and community access.

Clarke, Laura; Zheng-Bradley, Xiangqun; Smith, Richard; Kulesha, Eugene; Xiao, Chunlin; Toneva, Iliana; Vaughan, Brendan; Preuss, Don; Leinonen, Rasko; Shumway, Martin; Sherry, Stephen; Flicek, Paul.

Nat Methods ; 9(5): 459-62, 2012 Apr 27.

Article in English | MEDLINE | ID: mdl-22543379

ABSTRACT

The 1000 Genomes Project was launched as one of the largest distributed data collection and analysis projects ever undertaken in biology. In addition to the primary scientific goals of creating both a deep catalog of human genetic variation and extensive methods to accurately discover and characterize variation using new sequencing technologies, the project makes all of its data publicly available. Members of the project data coordination center have developed and deployed several tools to enable widespread data access.

Subject(s)

Databases, Genetic; Genome, Human; Genomics/methods; Sequence Analysis, DNA/methods; Computational Biology/methods; Genetic Variation; Humans

17.

Characterizing genetic variants for clinical action.

Ramos, Erin M; Din-Lovinescu, Corina; Berg, Jonathan S; Brooks, Lisa D; Duncanson, Audrey; Dunn, Michael; Good, Peter; Hubbard, Tim J P; Jarvik, Gail P; O'Donnell, Christopher; Sherry, Stephen T; Aronson, Naomi; Biesecker, Leslie G; Blumberg, Bruce; Calonge, Ned; Colhoun, Helen M; Epstein, Robert S; Flicek, Paul; Gordon, Erynn S; Green, Eric D; Green, Robert C; Hurles, Matthew; Kawamoto, Kensaku; Knaus, William; Ledbetter, David H; Levy, Howard P; Lyon, Elaine; Maglott, Donna; McLeod, Howard L; Rahman, Nazneen; Randhawa, Gurvaneet; Wicklund, Catherine; Manolio, Teri A; Chisholm, Rex L; Williams, Marc S.

Am J Med Genet C Semin Med Genet ; 166C(1): 93-104, 2014 Mar.

Article in English | MEDLINE | ID: mdl-24634402

ABSTRACT

Genome-wide association studies, DNA sequencing studies, and other genomic studies are finding an increasing number of genetic variants associated with clinical phenotypes that may be useful in developing diagnostic, preventive, and treatment strategies for individual patients. However, few variants have been integrated into routine clinical practice. The reasons for this are several, but two of the most significant are limited evidence about the clinical implications of the variants and a lack of a comprehensive knowledge base that captures genetic variants, their phenotypic associations, and other pertinent phenotypic information that is openly accessible to clinical groups attempting to interpret sequencing data. As the field of medicine begins to incorporate genome-scale analysis into clinical care, approaches need to be developed for collecting and characterizing data on the clinical implications of variants, developing consensus on their actionability, and making this information available for clinical use. The National Human Genome Research Institute (NHGRI) and the Wellcome Trust thus convened a workshop to consider the processes and resources needed to: (1) identify clinically valid genetic variants; (2) decide whether they are actionable and what the action should be; and (3) provide this information for clinical use. This commentary outlines the key discussion points and recommendations from the workshop.

Subject(s)

Genetic Variation/genetics; Medical Informatics/methods; Phenotype; Precision Medicine/methods; Education; Humans; Information Dissemination/methods; National Human Genome Research Institute (U.S.); Precision Medicine/trends; United States

18.

Database resources of the National Center for Biotechnology Information.

Sayers, Eric W; Barrett, Tanya; Benson, Dennis A; Bolton, Evan; Bryant, Stephen H; Canese, Kathi; Chetvernin, Vyacheslav; Church, Deanna M; Dicuccio, Michael; Federhen, Scott; Feolo, Michael; Fingerman, Ian M; Geer, Lewis Y; Helmberg, Wolfgang; Kapustin, Yuri; Krasnov, Sergey; Landsman, David; Lipman, David J; Lu, Zhiyong; Madden, Thomas L; Madej, Tom; Maglott, Donna R; Marchler-Bauer, Aron; Miller, Vadim; Karsch-Mizrachi, Ilene; Ostell, James; Panchenko, Anna; Phan, Lon; Pruitt, Kim D; Schuler, Gregory D; Sequeira, Edwin; Sherry, Stephen T; Shumway, Martin; Sirotkin, Karl; Slotta, Douglas; Souvorov, Alexandre; Starchenko, Grigory; Tatusova, Tatiana A; Wagner, Lukas; Wang, Yanli; Wilbur, W John; Yaschenko, Eugene; Ye, Jian.

Nucleic Acids Res ; 40(Database issue): D13-25, 2012 Jan.

Article in English | MEDLINE | ID: mdl-22140104

ABSTRACT

In addition to maintaining the GenBank® nucleic acid sequence database, the National Center for Biotechnology Information (NCBI) provides analysis and retrieval resources for the data in GenBank and other biological data made available through the NCBI Website. NCBI resources include Entrez, the Entrez Programming Utilities, MyNCBI, PubMed, PubMed Central (PMC), Gene, the NCBI Taxonomy Browser, BLAST, BLAST Link (BLink), Primer-BLAST, COBALT, Splign, RefSeq, UniGene, HomoloGene, ProtEST, dbMHC, dbSNP, dbVar, Epigenomics, Genome and related tools, the Map Viewer, Model Maker, Evidence Viewer, Trace Archive, Sequence Read Archive, BioProject, BioSample, Retroviral Genotyping Tools, HIV-1/Human Protein Interaction Database, Gene Expression Omnibus (GEO), Probe, Online Mendelian Inheritance in Animals (OMIA), the Molecular Modeling Database (MMDB), the Conserved Domain Database (CDD), the Conserved Domain Architecture Retrieval Tool (CDART), Biosystems, Protein Clusters and the PubChem suite of small molecule databases. Augmenting many of the Web applications are custom implementations of the BLAST program optimized to search specialized data sets. All of these resources can be accessed through the NCBI home page at www.ncbi.nlm.nih.gov.

Subject(s)

Databases as Topic; Databases, Genetic; Databases, Protein; Gene Expression; Genomics; Internet; Models, Molecular; National Library of Medicine (U.S.); Periodicals as Topic; PubMed; Sequence Alignment; Sequence Analysis, DNA; Sequence Analysis, Protein; Sequence Analysis, RNA; Small Molecule Libraries; United States

19.

Database resources of the National Center for Biotechnology Information.

Sayers, Eric W; Barrett, Tanya; Benson, Dennis A; Bolton, Evan; Bryant, Stephen H; Canese, Kathi; Chetvernin, Vyacheslav; Church, Deanna M; DiCuccio, Michael; Federhen, Scott; Feolo, Michael; Fingerman, Ian M; Geer, Lewis Y; Helmberg, Wolfgang; Kapustin, Yuri; Landsman, David; Lipman, David J; Lu, Zhiyong; Madden, Thomas L; Madej, Tom; Maglott, Donna R; Marchler-Bauer, Aron; Miller, Vadim; Mizrachi, Ilene; Ostell, James; Panchenko, Anna; Phan, Lon; Pruitt, Kim D; Schuler, Gregory D; Sequeira, Edwin; Sherry, Stephen T; Shumway, Martin; Sirotkin, Karl; Slotta, Douglas; Souvorov, Alexandre; Starchenko, Grigory; Tatusova, Tatiana A; Wagner, Lukas; Wang, Yanli; Wilbur, W John; Yaschenko, Eugene; Ye, Jian.

Nucleic Acids Res ; 39(Database issue): D38-51, 2011 Jan.

Article in English | MEDLINE | ID: mdl-21097890

ABSTRACT

In addition to maintaining the GenBank® nucleic acid sequence database, the National Center for Biotechnology Information (NCBI) provides analysis and retrieval resources for the data in GenBank and other biological data made available through the NCBI Web site. NCBI resources include Entrez, the Entrez Programming Utilities, MyNCBI, PubMed, PubMed Central (PMC), Entrez Gene, the NCBI Taxonomy Browser, BLAST, BLAST Link (BLink), Primer-BLAST, COBALT, Electronic PCR, OrfFinder, Splign, ProSplign, RefSeq, UniGene, HomoloGene, ProtEST, dbMHC, dbSNP, dbVar, Epigenomics, Cancer Chromosomes, Entrez Genomes and related tools, the Map Viewer, Model Maker, Evidence Viewer, Trace Archive, Sequence Read Archive, Retroviral Genotyping Tools, HIV-1/Human Protein Interaction Database, Gene Expression Omnibus (GEO), Entrez Probe, GENSAT, Online Mendelian Inheritance in Man (OMIM), Online Mendelian Inheritance in Animals (OMIA), the Molecular Modeling Database (MMDB), the Conserved Domain Database (CDD), the Conserved Domain Architecture Retrieval Tool (CDART), IBIS, Biosystems, Peptidome, OMSSA, Protein Clusters and the PubChem suite of small molecule databases. Augmenting many of the Web applications are custom implementations of the BLAST program optimized to search specialized data sets. All of these resources can be accessed through the NCBI home page at www.ncbi.nlm.nih.gov.

Subject(s)

Databases, Genetic; Databases, Protein; Gene Expression; Genomics; National Library of Medicine (U.S.); Protein Structure, Tertiary; PubMed; Sequence Alignment; Sequence Analysis, DNA; Sequence Analysis, RNA; Software; Systems Integration; United States

20.

The variant call format and VCFtools.

Danecek, Petr; Auton, Adam; Abecasis, Goncalo; Albers, Cornelis A; Banks, Eric; DePristo, Mark A; Handsaker, Robert E; Lunter, Gerton; Marth, Gabor T; Sherry, Stephen T; McVean, Gilean; Durbin, Richard.

Bioinformatics ; 27(15): 2156-8, 2011 Aug 01.

Article in English | MEDLINE | ID: mdl-21653522

ABSTRACT

SUMMARY: The variant call format (VCF) is a generic format for storing DNA polymorphism data such as SNPs, insertions, deletions and structural variants, together with rich annotations. VCF is usually stored in a compressed manner and can be indexed for fast data retrieval of variants from a range of positions on the reference genome. The format was developed for the 1000 Genomes Project, and has also been adopted by other projects such as UK10K, dbSNP and the NHLBI Exome Project. VCFtools is a software suite that implements various utilities for processing VCF files, including validation, merging, comparing and also provides a general Perl API. AVAILABILITY: http://vcftools.sourceforge.net

Subject(s)

Genetic Variation; Genomics/methods; Information Storage and Retrieval/methods; Software; Alleles; Genome, Human; Genotype; Humans
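
The abstract above describes VCF as a text format that stores each variant together with its annotations. The sketch below shows one way to read a single tab-separated data line into its eight fixed columns (CHROM, POS, ID, REF, ALT, QUAL, FILTER, INFO). It is a rough Python illustration, not VCFtools itself (which the abstract says offers a Perl API); the function name `parse_vcf_line` and the sample record are assumptions made for the example, and header lines and per-sample genotype columns are ignored.

```python
def parse_vcf_line(line: str) -> dict:
    """Parse one VCF data line (not a '#' header line) into a dict.

    Hypothetical helper for illustration; only the eight fixed columns
    are handled, and any genotype columns after INFO are ignored.
    """
    fields = line.rstrip("\n").split("\t")
    if len(fields) < 8:
        raise ValueError("a VCF data line needs at least 8 tab-separated columns")
    chrom, pos, variant_id, ref, alt, qual, filt, info = fields[:8]

    # INFO is a ';'-separated list of KEY=VALUE pairs or bare flags.
    info_dict = {}
    if info != ".":
        for item in info.split(";"):
            key, sep, value = item.partition("=")
            info_dict[key] = value if sep else True  # a bare key is a flag

    return {
        "chrom": chrom,
        "pos": int(pos),                            # 1-based position
        "id": None if variant_id == "." else variant_id,
        "ref": ref,
        "alt": [] if alt == "." else alt.split(","),  # several ALT alleles allowed
        "qual": None if qual == "." else float(qual),
        "filter": filt,
        "info": info_dict,
    }


if __name__ == "__main__":
    # Illustrative record (assumed, not taken from the article).
    sample = "20\t14370\trs6054257\tG\tA\t29\tPASS\tNS=3;DP=14;AF=0.5;DB;H2"
    record = parse_vcf_line(sample)
    print(record["chrom"], record["pos"], record["alt"], record["info"]["DP"])
```

INFO values are kept as strings because their types are declared in the file header, which this sketch does not read. For real files, which are usually compressed and indexed, an established VCF library is the safer choice.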
