Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 16 de 16
Filtrar
Mais filtros

Base de dados
País/Região como assunto
Tipo de documento
Intervalo de ano de publicação
1.
Nature ; 604(7905): 310-315, 2022 04.
Artigo em Inglês | MEDLINE | ID: mdl-35388217

RESUMO

Comprehensive genome annotation is essential to understand the impact of clinically relevant variants. However, the absence of a standard for clinical reporting and browser display complicates the process of consistent interpretation and reporting. To address these challenges, Ensembl/GENCODE1 and RefSeq2 launched a joint initiative, the Matched Annotation from NCBI and EMBL-EBI (MANE) collaboration, to converge on human gene and transcript annotation and to jointly define a high-value set of transcripts and corresponding proteins. Here, we describe the MANE transcript sets for use as universal standards for variant reporting and browser display. The MANE Select set identifies a representative transcript for each human protein-coding gene, whereas the MANE Plus Clinical set provides additional transcripts at loci where the Select transcripts alone are not sufficient to report all currently known clinical variants. Each MANE transcript represents an exact match between the exonic sequences of an Ensembl/GENCODE transcript and its counterpart in RefSeq such that the identifiers can be used synonymously. We have now released MANE Select transcripts for 97% of human protein-coding genes, including all American College of Medical Genetics and Genomics Secondary Findings list v3.0 (ref. 3) genes. MANE transcripts are accessible from major genome browsers and key resources. Widespread adoption of these transcript sets will increase the consistency of reporting, facilitate the exchange of data regardless of the annotation source and help to streamline clinical interpretation.


Assuntos
Biologia Computacional , Bases de Dados Genéticas , Genômica , Genoma , Humanos , Disseminação de Informação , Anotação de Sequência Molecular , National Library of Medicine (U.S.) , Estados Unidos
2.
Nature ; 583(7814): 96-102, 2020 07.
Artigo em Inglês | MEDLINE | ID: mdl-32581362

RESUMO

Most patients with rare diseases do not receive a molecular diagnosis and the aetiological variants and causative genes for more than half such disorders remain to be discovered1. Here we used whole-genome sequencing (WGS) in a national health system to streamline diagnosis and to discover unknown aetiological variants in the coding and non-coding regions of the genome. We generated WGS data for 13,037 participants, of whom 9,802 had a rare disease, and provided a genetic diagnosis to 1,138 of the 7,065 extensively phenotyped participants. We identified 95 Mendelian associations between genes and rare diseases, of which 11 have been discovered since 2015 and at least 79 are confirmed to be aetiological. By generating WGS data of UK Biobank participants2, we found that rare alleles can explain the presence of some individuals in the tails of a quantitative trait for red blood cells. Finally, we identified four novel non-coding variants that cause disease through the disruption of transcription of ARPC1B, GATA1, LRBA and MPL. Our study demonstrates a synergy by using WGS for diagnosis and aetiological discovery in routine healthcare.


Assuntos
Internacionalidade , Programas Nacionais de Saúde , Doenças Raras/diagnóstico , Doenças Raras/genética , Sequenciamento Completo do Genoma , Complexo 2-3 de Proteínas Relacionadas à Actina/genética , Proteínas Adaptadoras de Transdução de Sinal/genética , Alelos , Bases de Dados Factuais , Eritrócitos/metabolismo , Fator de Transcrição GATA1/genética , Humanos , Fenótipo , Locos de Características Quantitativas , Receptores de Trombopoetina/genética , Medicina Estatal , Reino Unido
3.
Am J Hum Genet ; 108(6): 1083-1094, 2021 06 03.
Artigo em Inglês | MEDLINE | ID: mdl-34022131

RESUMO

Clinical genetic testing of protein-coding regions identifies a likely causative variant in only around half of developmental disorder (DD) cases. The contribution of regulatory variation in non-coding regions to rare disease, including DD, remains very poorly understood. We screened 9,858 probands from the Deciphering Developmental Disorders (DDD) study for de novo mutations in the 5' untranslated regions (5' UTRs) of genes within which variants have previously been shown to cause DD through a dominant haploinsufficient mechanism. We identified four single-nucleotide variants and two copy-number variants upstream of MEF2C in a total of ten individual probands. We developed multiple bespoke and orthogonal experimental approaches to demonstrate that these variants cause DD through three distinct loss-of-function mechanisms, disrupting transcription, translation, and/or protein function. These non-coding region variants represent 23% of likely diagnoses identified in MEF2C in the DDD cohort, but these would all be missed in standard clinical genetics approaches. Nonetheless, these variants are readily detectable in exome sequence data, with 30.7% of 5' UTR bases across all genes well covered in the DDD dataset. Our analyses show that non-coding variants upstream of genes within which coding variants are known to cause DD are an important cause of severe disease and demonstrate that analyzing 5' UTRs can increase diagnostic yield. We also show how non-coding variants can help inform both the disease-causing mechanism underlying protein-coding variants and dosage tolerance of the gene.


Assuntos
Regiões 5' não Traduzidas , Deficiências do Desenvolvimento/etiologia , Predisposição Genética para Doença , Mutação com Perda de Função , Criança , Estudos de Coortes , Variações do Número de Cópias de DNA , Deficiências do Desenvolvimento/patologia , Humanos , Fatores de Transcrição MEF2/genética , Sequenciamento do Exoma
4.
Nucleic Acids Res ; 47(D1): D1005-D1012, 2019 01 08.
Artigo em Inglês | MEDLINE | ID: mdl-30445434

RESUMO

The GWAS Catalog delivers a high-quality curated collection of all published genome-wide association studies enabling investigations to identify causal variants, understand disease mechanisms, and establish targets for novel therapies. The scope of the Catalog has also expanded to targeted and exome arrays with 1000 new associations added for these technologies. As of September 2018, the Catalog contains 5687 GWAS comprising 71673 variant-trait associations from 3567 publications. New content includes 284 full P-value summary statistics datasets for genome-wide and new targeted array studies, representing 6 × 109 individual variant-trait statistics. In the last 12 months, the Catalog's user interface was accessed by ∼90000 unique users who viewed >1 million pages. We have improved data access with the release of a new RESTful API to support high-throughput programmatic access, an improved web interface and a new summary statistics database. Summary statistics provision is supported by a new format proposed as a community standard for summary statistics data representation. This format was derived from our experience in standardizing heterogeneous submissions, mapping formats and in harmonizing content. Availability: https://www.ebi.ac.uk/gwas/.


Assuntos
Bases de Dados Genéticas , Estudo de Associação Genômica Ampla , Doença/genética , Variação Genética , Humanos , Análise em Microsséries , Publicações , Software , Interface Usuário-Computador
5.
Nucleic Acids Res ; 47(D1): D745-D751, 2019 01 08.
Artigo em Inglês | MEDLINE | ID: mdl-30407521

RESUMO

The Ensembl project (https://www.ensembl.org) makes key genomic data sets available to the entire scientific community without restrictions. Ensembl seeks to be a fundamental resource driving scientific progress by creating, maintaining and updating reference genome annotation and comparative genomics resources. This year we describe our new and expanded gene, variant and comparative annotation capabilities, which led to a 50% increase in the number of vertebrate genomes we support. We have also doubled the number of available human variants and added regulatory regions for many mouse cell types and developmental stages. Our data sets and tools are available via the Ensembl website as well as a through a RESTful webservice, Perl application programming interface and as data files for download.


Assuntos
Bases de Dados Genéticas , Genoma/genética , Genômica , Vertebrados/genética , Animais , Biologia Computacional/tendências , Humanos , Camundongos , Anotação de Sequência Molecular , Software
6.
Nucleic Acids Res ; 45(D1): D896-D901, 2017 01 04.
Artigo em Inglês | MEDLINE | ID: mdl-27899670

RESUMO

The NHGRI-EBI GWAS Catalog has provided data from published genome-wide association studies since 2008. In 2015, the database was redesigned and relocated to EMBL-EBI. The new infrastructure includes a new graphical user interface (www.ebi.ac.uk/gwas/), ontology supported search functionality and an improved curation interface. These developments have improved the data release frequency by increasing automation of curation and providing scaling improvements. The range of available Catalog data has also been extended with structured ancestry and recruitment information added for all studies. The infrastructure improvements also support scaling for larger arrays, exome and sequencing studies, allowing the Catalog to adapt to the needs of evolving study design, genotyping technologies and user needs in the future.


Assuntos
Bases de Dados de Ácidos Nucleicos , Estudo de Associação Genômica Ampla/métodos , Software , Mineração de Dados , Genômica/métodos , Humanos , Anotação de Sequência Molecular , National Human Genome Research Institute (U.S.) , Estados Unidos , Interface Usuário-Computador , Navegador
7.
Nucleic Acids Res ; 42(Database issue): D873-8, 2014 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-24285302

RESUMO

Locus Reference Genomic (LRG; http://www.lrg-sequence.org/) records contain internationally recognized stable reference sequences designed specifically for reporting clinically relevant sequence variants. Each LRG is contained within a single file consisting of a stable 'fixed' section and a regularly updated 'updatable' section. The fixed section contains stable genomic DNA sequence for a genomic region, essential transcripts and proteins for variant reporting and an exon numbering system. The updatable section contains mapping information, annotation of all transcripts and overlapping genes in the region and legacy exon and amino acid numbering systems. LRGs provide a stable framework that is vital for reporting variants, according to Human Genome Variation Society (HGVS) conventions, in genomic DNA, transcript or protein coordinates. To enable translation of information between LRG and genomic coordinates, LRGs include mapping to the human genome assembly. LRGs are compiled and maintained by the National Center for Biotechnology Information (NCBI) and European Bioinformatics Institute (EBI). LRG reference sequences are selected in collaboration with the diagnostic and research communities, locus-specific database curators and mutation consortia. Currently >700 LRGs have been created, of which >400 are publicly available. The aim is to create an LRG for every locus with clinical implications.


Assuntos
Bases de Dados Genéticas , Variação Genética , Genoma Humano , Éxons , Loci Gênicos , Genômica/normas , Humanos , Internet , Proteínas/genética , RNA Mensageiro/química , Padrões de Referência
8.
Nucleic Acids Res ; 42(Database issue): D1001-6, 2014 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-24316577

RESUMO

The National Human Genome Research Institute (NHGRI) Catalog of Published Genome-Wide Association Studies (GWAS) Catalog provides a publicly available manually curated collection of published GWAS assaying at least 100,000 single-nucleotide polymorphisms (SNPs) and all SNP-trait associations with P <1 × 10(-5). The Catalog includes 1751 curated publications of 11 912 SNPs. In addition to the SNP-trait association data, the Catalog also publishes a quarterly diagram of all SNP-trait associations mapped to the SNPs' chromosomal locations. The Catalog can be accessed via a tabular web interface, via a dynamic visualization on the human karyotype, as a downloadable tab-delimited file and as an OWL knowledge base. This article presents a number of recent improvements to the Catalog, including novel ways for users to interact with the Catalog and changes to the curation infrastructure.


Assuntos
Bases de Dados de Ácidos Nucleicos , Estudo de Associação Genômica Ampla , Polimorfismo de Nucleotídeo Único , Ontologia Genética , Genoma Humano , Humanos , Internet , Cariótipo
9.
Clin Transl Sci ; 17(9): e70017, 2024 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-39206687

RESUMO

Frequencies of pharmacogenetic (PGx) variants are known to differ substantially across populations but much of the available PGx literature focuses on one or a few population groups, often defined in nonstandardized ways, or on a specific gene or variant. Guidelines produced by the Clinical Pharmacogenetic Implementation Consortium (CPIC) provide consistent methods of literature extraction, curation, and reporting, including comprehensive curation of allele frequency data across nine defined "biogeographic groups" from the PGx literature. We extracted data from 23 CPIC guidelines encompassing 19 genes to compare the sizes of the populations from each group and allele frequencies of altered function alleles across groups. The European group was the largest in the curated literature for 16 of the 19 genes, while the American and Oceanian groups were the smallest. Nearly 200 alleles were detected in nonreference groups that were not reported in the largest (reference) group. The genes CYP2B6 and CYP2C9 were more likely to have higher frequencies of altered function alleles in nonreference groups compared to the reference group, while the genes CYP4F2, DPYD, SLCO1B1, and UGT1A1 were less likely to have higher frequencies in nonreference groups. PGx allele frequencies and function differ substantially across nine biogeographic groups, all but two of which are underrepresented in available PGx data. Awareness of these differences and increased efforts to characterize the breadth of global PGx variation are needed to ensure that implementation of PGx-guided drug selection does not further widen existing health disparities among populations currently underrepresented in PGx data.


Assuntos
Frequência do Gene , Variantes Farmacogenômicos , Humanos , Farmacogenética/métodos , Citocromo P-450 CYP2C9/genética , Transportador 1 de Ânion Orgânico Específico do Fígado/genética , Genética Populacional/métodos , Testes Farmacogenômicos/métodos , Citocromo P-450 CYP2B6/genética , Alelos
10.
bioRxiv ; 2024 Oct 23.
Artigo em Inglês | MEDLINE | ID: mdl-39484403

RESUMO

The NHGRI-EBI GWAS Catalog serves as a vital resource for the genetic research community, providing access to the most comprehensive database of human GWAS results. Currently, it contains close to 7,000 publications for more than 15,000 traits, from which more than 625,000 lead associations have been curated. Additionally, 85,000 full genome-wide summary statistics datasets - containing association data for all variants in the analysis - are available for downstream analyses such as meta-analysis, fine-mapping, Mendelian randomisation or development of polygenic risk scores. As a centralised repository for GWAS results, the GWAS Catalog sets and implements standards for data submission and harmonisation, and encourages the use of consistent descriptors for traits, samples and methodologies. We share processes and vocabulary with the PGS Catalog, improving interoperability for a growing user group. Here, we describe the latest changes in data content, improvements in our user interface, and the implementation of the GWAS-SSF standard format for summary statistics. We address the challenges of handling the rapid increase in large-scale molecular quantitative trait GWAS and the need for sensitivity in the use of population and cohort descriptors while maintaining data interoperability and reusability.

11.
Mol Genet Genomic Med ; 9(12): e1786, 2021 12.
Artigo em Inglês | MEDLINE | ID: mdl-34435752

RESUMO

BACKGROUND: Variant interpretation is dependent on transcript annotation and remains time consuming and challenging. There are major obstacles for historical data reuse and for interpretation of new variants. First, both RefSeq and Ensembl/GENCODE produce transcript sets in common use, but there is currently no easy way to translate between the two. Second, the resources often used for variant interpretation (e.g. ClinVar, gnomAD, UniProt) do not use the same transcript set, nor default transcript or protein sequence. METHOD: Ensembl ran a survey in 2018 to sample attitudes to choosing one default transcript per locus, and to gather data on reference sequences used by the scientific community. This was publicised on the Ensembl and UCSC genome browsers, by email and on social media. RESULTS: The survey had 788 responses from 32 different countries, the results of which we report here. CONCLUSIONS: We present our roadmap to create an effective default set of transcripts for resources, and for reporting interpretation of clinical variants.


Assuntos
Biomarcadores , Biologia Computacional , Genômica , RNA Mensageiro/genética , Animais , Biologia Computacional/métodos , Bases de Dados Genéticas , Genômica/métodos , Humanos , Software , Navegador
12.
J Neurosci ; 28(41): 10200-5, 2008 Oct 08.
Artigo em Inglês | MEDLINE | ID: mdl-18842880

RESUMO

Fragile X syndrome (FXS) is the most common form of hereditary mental retardation. FXS patients have a deficit for the fragile X mental retardation protein (FMRP) that results in abnormal neuronal dendritic spine morphology and behavioral phenotypes, including sleep abnormalities. In a Drosophila model of FXS, flies lacking the dfmr1 protein (dFMRP) have abnormal circadian rhythms apparently as a result of altered clock output. In this study, we present biochemical and genetic evidence that dFMRP interacts with a known clock output component, the LARK RNA-binding protein. Our studies demonstrate physical interactions between dFMRP and LARK, that the two proteins are present in a complex in vivo, and that LARK promotes the stability of dFMRP. Furthermore, we show genetic interactions between the corresponding genes indicating that dFMRP and LARK function together to regulate eye development and circadian behavior.


Assuntos
Comportamento Animal/fisiologia , Ritmo Circadiano/fisiologia , Proteínas de Drosophila/metabolismo , Drosophila/fisiologia , Olho/crescimento & desenvolvimento , Proteína do X Frágil da Deficiência Intelectual/metabolismo , Proteínas de Ligação a RNA/metabolismo , Animais , Ritmo Circadiano/genética , Modelos Animais de Doenças , Drosophila/crescimento & desenvolvimento , Proteínas de Drosophila/genética , Proteína do X Frágil da Deficiência Intelectual/genética , Síndrome do Cromossomo X Frágil , Larva/metabolismo , Proteínas de Ligação a RNA/genética
13.
Neuron ; 34(6): 961-72, 2002 Jun 13.
Artigo em Inglês | MEDLINE | ID: mdl-12086643

RESUMO

Mental retardation is a pervasive societal problem, 25 times more common than blindness for example. Fragile X syndrome, the most common form of inherited mental retardation, is caused by mutations in the FMR1 gene. Fragile X patients display neurite morphology defects in the brain, suggesting that this may be the basis of their mental retardation. Drosophila contains a single homolog of FMR1, dfxr (also called dfmr1). We analyzed the role of dfxr in neurite development in three distinct neuronal classes. We find that DFXR is required for normal neurite extension, guidance, and branching. dfxr mutants also display strong eclosion failure and circadian rhythm defects. Interestingly, distinct neuronal cell types show different phenotypes, suggesting that dfxr differentially regulates diverse targets in the brain.


Assuntos
Encéfalo/fisiologia , Proteínas de Drosophila/fisiologia , Síndrome do Cromossomo X Frágil/genética , Proteínas do Tecido Nervoso/fisiologia , Neurônios/citologia , Proteínas de Ligação a RNA , Sequência de Aminoácidos , Animais , Encéfalo/patologia , Ritmo Circadiano/genética , Ritmo Circadiano/fisiologia , Proteína do X Frágil da Deficiência Intelectual , Dados de Sequência Molecular , Atividade Motora/genética , Atividade Motora/fisiologia , Mutação , Proteínas do Tecido Nervoso/deficiência , Proteínas do Tecido Nervoso/genética , Neuroglia/metabolismo , Neuroglia/patologia , Neuroglia/fisiologia , Neurônios/patologia , Neurônios/fisiologia , Homologia de Sequência de Aminoácidos
14.
Epigenetics ; 13(2): 117-121, 2018.
Artigo em Inglês | MEDLINE | ID: mdl-27911167

RESUMO

The analysis of DNA methylation has become routine in the pipeline for diagnosis of imprinting disorders, with many publications reporting aberrant methylation associated with imprinted differentially methylated regions (DMRs). However, comparisons between these studies are routinely hampered by the lack of consistency in reporting sites of methylation evaluated. To avoid confusion surrounding nomenclature, special care is needed to communicate results accurately, especially between scientists and other health care professionals. Within the European Network for Human Congenital Imprinting Disorders we have discussed these issues and designed a nomenclature for naming imprinted DMRs as well as for reporting methylation values. We apply these recommendations for imprinted DMRs that are commonly assayed in clinical laboratories and show how they support standardized database submission. The recommendations are in line with existing recommendations, most importantly the Human Genome Variation Society nomenclature, and should facilitate accurate reporting and data exchange among laboratories and thereby help to avoid future confusion.


Assuntos
Metilação de DNA , Epigenômica/normas , Impressão Genômica , Terminologia como Assunto , Animais , Humanos , Polimorfismo Genético , Guias de Prática Clínica como Assunto
15.
Genome Biol ; 19(1): 21, 2018 02 15.
Artigo em Inglês | MEDLINE | ID: mdl-29448949

RESUMO

The accurate description of ancestry is essential to interpret, access, and integrate human genomics data, and to ensure that these benefit individuals from all ancestral backgrounds. However, there are no established guidelines for the representation of ancestry information. Here we describe a framework for the accurate and standardized description of sample ancestry, and validate it by application to the NHGRI-EBI GWAS Catalog. We confirm known biases and gaps in diversity, and find that African and Hispanic or Latin American ancestry populations contribute a disproportionately high number of associations. It is our hope that widespread adoption of this framework will lead to improved analysis, interpretation, and integration of human genomics data.


Assuntos
Estudo de Associação Genômica Ampla/normas , Genômica/normas , Variação Genética , Humanos , Grupos Raciais
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA