Search | VHL Regional Portal

1.

The complete sequence of a human Y chromosome.

Rhie, Arang; Nurk, Sergey; Cechova, Monika; Hoyt, Savannah J; Taylor, Dylan J; Altemose, Nicolas; Hook, Paul W; Koren, Sergey; Rautiainen, Mikko; Alexandrov, Ivan A; Allen, Jamie; Asri, Mobin; Bzikadze, Andrey V; Chen, Nae-Chyun; Chin, Chen-Shan; Diekhans, Mark; Flicek, Paul; Formenti, Giulio; Fungtammasan, Arkarachai; Garcia Giron, Carlos; Garrison, Erik; Gershman, Ariel; Gerton, Jennifer L; Grady, Patrick G S; Guarracino, Andrea; Haggerty, Leanne; Halabian, Reza; Hansen, Nancy F; Harris, Robert; Hartley, Gabrielle A; Harvey, William T; Haukness, Marina; Heinz, Jakob; Hourlier, Thibaut; Hubley, Robert M; Hunt, Sarah E; Hwang, Stephen; Jain, Miten; Kesharwani, Rupesh K; Lewis, Alexandra P; Li, Heng; Logsdon, Glennis A; Lucas, Julian K; Makalowski, Wojciech; Markovic, Christopher; Martin, Fergal J; Mc Cartney, Ann M; McCoy, Rajiv C; McDaniel, Jennifer; McNulty, Brandy M.

Nature ; 621(7978): 344-354, 2023 Sep.

Article in English | MEDLINE | ID: mdl-37612512

ABSTRACT

The human Y chromosome has been notoriously difficult to sequence and assemble because of its complex repeat structure that includes long palindromes, tandem repeats and segmental duplications [1-3]. As a result, more than half of the Y chromosome is missing from the GRCh38 reference sequence and it remains the last human chromosome to be finished [4,5]. Here, the Telomere-to-Telomere (T2T) consortium presents the complete 62,460,029-base-pair sequence of a human Y chromosome from the HG002 genome (T2T-Y) that corrects multiple errors in GRCh38-Y and adds over 30 million base pairs of sequence to the reference, showing the complete ampliconic structures of gene families TSPY, DAZ and RBMY; 41 additional protein-coding genes, mostly from the TSPY family; and an alternating pattern of human satellite 1 and 3 blocks in the heterochromatic Yq12 region. We have combined T2T-Y with a previous assembly of the CHM13 genome [4] and mapped available population variation, clinical variants and functional genomics data to produce a complete and comprehensive reference sequence for all 24 human chromosomes.

Subjects

Chromosomes, Human, Y; Genomics; Sequence Analysis, DNA; Humans; Base Sequence; Chromosomes, Human, Y/genetics; DNA, Satellite/genetics; Genetic Variation/genetics; Genetics, Population; Genomics/methods; Genomics/standards; Heterochromatin/genetics; Multigene Family/genetics; Reference Standards; Segmental Duplications, Genomic/genetics; Sequence Analysis, DNA/standards; Tandem Repeat Sequences/genetics; Telomere/genetics

2.

DECIPHER: Improving Genetic Diagnosis Through Dynamic Integration of Genomic and Clinical Data.

Foreman, Julia; Perrett, Daniel; Mazaika, Erica; Hunt, Sarah E; Ware, James S; Firth, Helen V.

Annu Rev Genomics Hum Genet ; 24: 151-176, 2023 08 25.

Article in English | MEDLINE | ID: mdl-37285546

ABSTRACT

DECIPHER (Database of Genomic Variation and Phenotype in Humans Using Ensembl Resources) shares candidate diagnostic variants and phenotypic data from patients with genetic disorders to facilitate research and improve the diagnosis, management, and therapy of rare diseases. The platform sits at the boundary between genomic research and the clinical community. DECIPHER aims to ensure that the most up-to-date data are made rapidly available within its interpretation interfaces to improve clinical care. Newly integrated cardiac case-control data that provide evidence of gene-disease associations and inform variant interpretation exemplify this mission. New research resources are presented in a format optimized for use by a broad range of professionals supporting the delivery of genomic medicine. The interfaces within DECIPHER integrate and contextualize variant and phenotypic data, helping to determine a robust clinico-molecular diagnosis for rare-disease patients, which combines both variant classification and clinical fit. DECIPHER supports discovery research, connecting individuals within the rare-disease community to pursue hypothesis-driven research.

Subjects

Genomics; Genomics/methods; Humans; Rare Diseases/genetics; Alleles; Practice Guidelines as Topic; DNA Copy Number Variations; Databases, Genetic

3.

SUsPECT: a pipeline for variant effect prediction based on custom long-read transcriptomes for improved clinical variant annotation.

Salz, Renee; Saraiva-Agostinho, Nuno; Vorsteveld, Emil; van der Made, Caspar I; Kersten, Simone; Stemerdink, Merel; Allen, Jamie; Volders, Pieter-Jan; Hunt, Sarah E; Hoischen, Alexander; 't Hoen, Peter A C.

BMC Genomics ; 24(1): 305, 2023 Jun 06.

Article in English | MEDLINE | ID: mdl-37280537

ABSTRACT

Our incomplete knowledge of the human transcriptome impairs the detection of disease-causing variants, in particular if they affect transcripts only expressed under certain conditions. These transcripts are often lacking from reference transcript sets, such as Ensembl/GENCODE and RefSeq, and could be relevant for establishing genetic diagnoses. We present SUsPECT (Solving Unsolved Patient Exomes/gEnomes using Custom Transcriptomes), a pipeline based on the Ensembl Variant Effect Predictor (VEP) to predict variant impact on custom transcript sets, such as those generated by long-read RNA-sequencing, for downstream prioritization. Our pipeline predicts the functional consequence and likely deleteriousness scores for missense variants in the context of novel open reading frames predicted from any transcriptome. We demonstrate the utility of SUsPECT by uncovering potential mutational mechanisms of pathogenic variants in ClinVar that are not predicted to be pathogenic using the reference transcript annotation. In further support of SUsPECT's utility, we identified an enrichment of immune-related variants predicted to have a more severe molecular consequence when annotating with a newly generated transcriptome from stimulated immune cells instead of the reference transcriptome. Our pipeline outputs crucial information for further prioritization of potentially disease-causing variants for any disease and will become increasingly useful as more long-read RNA sequencing datasets become available.

Subjects

Software; Transcriptome; Humans; Molecular Sequence Annotation; Sequence Analysis, RNA/methods; Exome; High-Throughput Nucleotide Sequencing

4.

EyeG2P: an automated variant filtering approach improves efficiency of diagnostic genomic testing for inherited ophthalmic disorders.

Lenassi, Eva; Carvalho, Ana; Thormann, Anja; Abrahams, Liam; Arno, Gavin; Fletcher, Tracy; Hardcastle, Claire; Lopez, Javier; Hunt, Sarah E; Short, Patrick; Sergouniotis, Panagiotis I; Michaelides, Michel; Webster, Andrew; Cunningham, Fiona; Ramsden, Simon C; Kasperaviciute, Dalia; Fitzpatrick, David R; Black, Graeme C; Ellingford, Jamie M.

J Med Genet ; 60(8): 810-818, 2023 08.

Article in English | MEDLINE | ID: mdl-36669873

ABSTRACT

BACKGROUND: Genomic variant prioritisation is one of the most significant bottlenecks to mainstream genomic testing in healthcare. Tools to improve precision while ensuring high recall are critical to successful mainstream clinical genomic testing, in particular for whole genome sequencing where millions of variants must be considered for each patient. METHODS: We developed EyeG2P, a publicly available database and web application using the Ensembl Variant Effect Predictor. EyeG2P is tailored for efficient variant prioritisation for individuals with inherited ophthalmic conditions. We assessed the sensitivity of EyeG2P in 1234 individuals with a broad range of eye conditions who had previously received a confirmed molecular diagnosis through routine genomic diagnostic approaches. For a prospective cohort of 83 individuals, we assessed the precision of EyeG2P in comparison with routine diagnostic approaches. For 10 additional individuals, we assessed the utility of EyeG2P for whole genome analysis. RESULTS: EyeG2P had 99.5% sensitivity for genomic variants previously identified as clinically relevant through routine diagnostic analysis (n=1234 individuals). Prospectively, EyeG2P enabled a significant increase in precision (35% on average) in comparison with routine testing strategies (p<0.001). We demonstrate that incorporation of EyeG2P into whole genome sequencing analysis strategies can reduce the number of variants for analysis to six variants, on average, while maintaining high diagnostic yield. CONCLUSION: Automated filtering of genomic variants through EyeG2P can increase the efficiency of diagnostic testing for individuals with a broad range of inherited ophthalmic disorders.
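The sensitivity and precision figures quoted in this abstract follow the standard definitions, which can be sketched as below. This is only an illustration of the arithmetic: the counts are hypothetical and are not taken from the study, and the function names are assumptions made for this example.

```python
def sensitivity(true_positives: int, false_negatives: int) -> float:
    """Fraction of truly relevant variants that the filter retains (recall)."""
    return true_positives / (true_positives + false_negatives)


def precision(true_positives: int, false_positives: int) -> float:
    """Fraction of retained variants that are truly relevant."""
    return true_positives / (true_positives + false_positives)


# Hypothetical counts, chosen only to illustrate the arithmetic.
example_sensitivity = sensitivity(true_positives=199, false_negatives=1)
example_precision = precision(true_positives=1, false_positives=5)
print(f"sensitivity = {example_sensitivity:.3f}")
print(f"precision   = {example_precision:.3f}")
```

A filter that keeps nearly every relevant variant while shrinking the list to review raises precision without a large cost in sensitivity, which is the trade-off the abstract reports.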

Subjects

Databases, Genetic; Eye Diseases; Genetic Testing; Genome, Human; Genomics; Eye Diseases/genetics; Humans; Genetic Variation

5.

DECIPHER: Supporting the interpretation and sharing of rare disease phenotype-linked variant data to advance diagnosis and research.

Foreman, Julia; Brent, Simon; Perrett, Daniel; Bevan, Andrew P; Hunt, Sarah E; Cunningham, Fiona; Hurles, Matthew E; Firth, Helen V.

Hum Mutat ; 43(6): 682-697, 2022 06.

Article in English | MEDLINE | ID: mdl-35143074

ABSTRACT

DECIPHER (https://www.deciphergenomics.org) is a free web platform for sharing anonymized phenotype-linked variant data from rare disease patients. Its dynamic interpretation interfaces contextualize genomic and phenotypic data to enable more informed variant interpretation, incorporating international standards for variant classification. DECIPHER supports almost all types of germline and mosaic variation in the nuclear and mitochondrial genome: sequence variants, short tandem repeats, copy-number variants, and large structural variants. Patient phenotypes are deposited using Human Phenotype Ontology (HPO) terms, supplemented by quantitative data, which is aggregated to derive gene-specific phenotypic summaries. It hosts data from >250 projects from ~40 countries, openly sharing >40,000 patient records containing >51,000 variants and >172,000 phenotype terms. The rich phenotype-linked variant data in DECIPHER drives rare disease research and diagnosis by enabling patient matching within DECIPHER and with other resources, and has been cited in >2,600 publications. In this study, we describe the types of data deposited to DECIPHER, the variant interpretation tools, and patient matching interfaces which make DECIPHER an invaluable rare disease resource.

Subjects

Databases, Genetic; Rare Diseases; Genomics; Humans; Phenotype; Rare Diseases/diagnosis; Rare Diseases/genetics; Software

6.

Scripting Analyses of Genomes in Ensembl Plants.

Contreras-Moreira, Bruno; Naamati, Guy; Rosello, Marc; Allen, James E; Hunt, Sarah E; Muffato, Matthieu; Gall, Astrid; Flicek, Paul.

Methods Mol Biol ; 2443: 27-55, 2022.

Article in English | MEDLINE | ID: mdl-35037199

ABSTRACT

Ensembl Plants ( http://plants.ensembl.org ) offers genome-scale information for plants, with four releases per year. As of release 47 (April 2020) it features 79 species and includes genome sequence, gene models, and functional annotation. Comparative analyses help reconstruct the evolutionary history of gene families, genomes, and components of polyploid genomes. Some species have gene expression baseline reports or variation across genotypes. While the data can be accessed through the Ensembl genome browser, here we review specifically how our plant genomes can be interrogated programmatically and the data downloaded in bulk. These access routes are generally consistent across Ensembl for other non-plant species, including plant pathogens, pests, and pollinators.

Subjects

Databases, Genetic; Genomics; Genome, Plant; Molecular Sequence Annotation; Plants/genetics; Software

7.

Annotating and prioritizing genomic variants using the Ensembl Variant Effect Predictor-A tutorial.

Hunt, Sarah E; Moore, Benjamin; Amode, Ridwan M; Armean, Irina M; Lemos, Diana; Mushtaq, Aleena; Parton, Andrew; Schuilenburg, Helen; Szpak, Michal; Thormann, Anja; Perry, Emily; Trevanion, Stephen J; Flicek, Paul; Yates, Andrew D; Cunningham, Fiona.

Hum Mutat ; 43(8): 986-997, 2022 08.

Article in English | MEDLINE | ID: mdl-34816521

ABSTRACT

The Ensembl Variant Effect Predictor (VEP) is a freely available, open-source tool for the annotation and filtering of genomic variants. It predicts variant molecular consequences using the Ensembl/GENCODE or RefSeq gene sets. It also reports phenotype associations from databases such as ClinVar, allele frequencies from studies including gnomAD, and predictions of deleteriousness from tools such as Sorting Intolerant From Tolerant and Combined Annotation Dependent Depletion. Ensembl VEP includes filtering options to customize variant prioritization. It is well supported and updated roughly quarterly to incorporate the latest gene, variant, and phenotype association information. Ensembl VEP analysis can be performed using a highly configurable, extensible command-line tool, a Representational State Transfer application programming interface, and a user-friendly web interface. These access methods are designed to suit different levels of bioinformatics experience and meet different needs in terms of data size, visualization, and flexibility. In this tutorial, we will describe performing variant annotation using the Ensembl VEP web tool, which enables sophisticated analysis through a simple interface.
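Of the three access routes named in this abstract, the REST interface is the easiest to script. The sketch below only builds the request URL for the public Ensembl REST server's `vep/:species/hgvs/:hgvs_notation` endpoint and does not send anything. The helper name `vep_hgvs_url` and the HGVS string are illustrative assumptions, not part of the tutorial.

```python
from urllib.parse import quote

REST_SERVER = "https://rest.ensembl.org"


def vep_hgvs_url(hgvs_notation: str, species: str = "human") -> str:
    """Build the URL for a GET request to the VEP HGVS REST endpoint."""
    # Keep ':' readable; percent-encode characters such as '>'.
    encoded = quote(hgvs_notation, safe=":")
    return f"{REST_SERVER}/vep/{quote(species)}/hgvs/{encoded}"


# Illustrative HGVS string; any valid notation could be substituted.
url = vep_hgvs_url("ENST00000366667:c.803C>T")
print(url)
```

Sending a GET request to this URL with the header `Content-Type: application/json` should return the predicted consequences as JSON. For large variant sets the command-line tool is usually the better fit.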

Subjects

Genomics; Software; Computational Biology; Databases, Genetic; Gene Frequency; Humans; Molecular Sequence Annotation; Phenotype

8.

The Ensembl COVID-19 resource: ongoing integration of public SARS-CoV-2 data.

De Silva, Nishadi H; Bhai, Jyothish; Chakiachvili, Marc; Contreras-Moreira, Bruno; Cummins, Carla; Frankish, Adam; Gall, Astrid; Genez, Thiago; Howe, Kevin L; Hunt, Sarah E; Martin, Fergal J; Moore, Benjamin; Ogeh, Denye; Parker, Anne; Parton, Andrew; Ruffier, Magali; Sakthivel, Manoj Pandian; Sheppard, Dan; Tate, John; Thormann, Anja; Thybert, David; Trevanion, Stephen J; Winterbottom, Andrea; Zerbino, Daniel R; Finn, Robert D; Flicek, Paul; Yates, Andrew D.

Nucleic Acids Res ; 50(D1): D765-D770, 2022 01 07.

Article in English | MEDLINE | ID: mdl-34634797

ABSTRACT

The COVID-19 pandemic has seen unprecedented use of SARS-CoV-2 genome sequencing for epidemiological tracking and identification of emerging variants. Understanding the potential impact of these variants on the infectivity of the virus and the efficacy of emerging therapeutics and vaccines has become a cornerstone of the fight against the disease. To support the maximal use of genomic information for SARS-CoV-2 research, we launched the Ensembl COVID-19 browser, making SARS-CoV-2 the first virus to be encompassed within the Ensembl platform. This resource incorporates a new Ensembl gene set, multiple variant sets, and annotation from several relevant resources aligned to the reference SARS-CoV-2 assembly. Since the first release in May 2020, the content has been regularly updated using our new rapid release workflow, and tools such as the Ensembl Variant Effect Predictor have been integrated. The Ensembl COVID-19 browser is freely available at https://covid-19.ensembl.org.

Subjects

COVID-19/virology; Databases, Genetic; SARS-CoV-2/genetics; Web Browser; Coronaviridae/genetics; Genetic Variation; Genome, Viral; Humans; Molecular Sequence Annotation

9.

The European Variation Archive: a FAIR resource of genomic variation for all species.

Cezard, Timothe; Cunningham, Fiona; Hunt, Sarah E; Koylass, Baron; Kumar, Nitin; Saunders, Gary; Shen, April; Silva, Andres F; Tsukanov, Kirill; Venkataraman, Sundararaman; Flicek, Paul; Parkinson, Helen; Keane, Thomas M.

Nucleic Acids Res ; 50(D1): D1216-D1220, 2022 01 07.

Article in English | MEDLINE | ID: mdl-34718739

ABSTRACT

The European Variation Archive (EVA; https://www.ebi.ac.uk/eva/) is a resource for sharing all types of genetic variation data (SNPs, indels, and structural variants) for all species. The EVA was created in 2014 to provide FAIR access to genetic variation data and has since grown to be a primary resource for genomic variants hosting >3 billion records. The EVA and dbSNP have established a compatible global system to assign unique identifiers to all submitted genetic variants. The EVA is active within the Global Alliance of Genomics and Health (GA4GH), maintaining, contributing and implementing standards such as VCF, Refget and Variant Representation Specification (VRS). In this article, we describe the submission and permanent accessioning services along with the different ways the data can be retrieved by the scientific community.

Subjects

Computational Biology; Databases, Genetic; Genetic Variation/genetics; Software; Animals; Genomic Structural Variation/genetics; Genomics; Humans; INDEL Mutation/genetics; Molecular Sequence Annotation; Polymorphism, Single Nucleotide/genetics

10.

The GA4GH Variation Representation Specification: A computational framework for variation representation and federated identification.

Wagner, Alex H; Babb, Lawrence; Alterovitz, Gil; Baudis, Michael; Brush, Matthew; Cameron, Daniel L; Cline, Melissa; Griffith, Malachi; Griffith, Obi L; Hunt, Sarah E; Kreda, David; Lee, Jennifer M; Li, Stephanie; Lopez, Javier; Moyer, Eric; Nelson, Tristan; Patel, Ronak Y; Riehle, Kevin; Robinson, Peter N; Rynearson, Shawn; Schuilenburg, Helen; Tsukanov, Kirill; Walsh, Brian; Konopko, Melissa; Rehm, Heidi L; Yates, Andrew D; Freimuth, Robert R; Hart, Reece K.

Cell Genom ; 1(2), 2021 Nov 10.

Article in English | MEDLINE | ID: mdl-35311178

ABSTRACT

Maximizing the personal, public, research, and clinical value of genomic information will require the reliable exchange of genetic variation data. We report here the Variation Representation Specification (VRS, pronounced "verse"), an extensible framework for the computable representation of variation that complements contemporary human-readable and flat file standards for genomic variation representation. VRS provides semantically precise representations of variation and leverages this design to enable federated identification of biomolecular variation with globally consistent and unique computed identifiers. The VRS framework includes a terminology and information model, machine-readable schema, data sharing conventions, and a reference implementation, each of which is intended to be broadly useful and freely available for community use. VRS was developed by a partnership among national information resource providers, public initiatives, and diagnostic testing laboratories under the auspices of the Global Alliance for Genomics and Health (GA4GH).
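The abstract does not say how the "computed identifiers" are derived. To the best of my knowledge, the VRS specification builds them from a truncated digest usually called `sha512t24u`: SHA-512 of a canonical serialization, truncated to 24 bytes and base64url-encoded. The sketch below shows only that digest step. The serialization rules and identifier prefixes defined by the specification are deliberately left out.

```python
import base64
import hashlib


def sha512t24u(blob: bytes) -> str:
    """Truncated digest: SHA-512, first 24 bytes, base64url-encoded."""
    digest = hashlib.sha512(blob).digest()
    return base64.urlsafe_b64encode(digest[:24]).decode("ascii")


print(sha512t24u(b""))  # → z4PhNX7vuL3xVChQ1m2AB9Yg5AULVxXc
```

Because the digest depends only on the serialized content, two independent parties that serialize the same variation object in the same way obtain the same identifier without a central registry. That property is what makes federated identification possible.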

11.

Ensembl Genomes 2020-enabling non-vertebrate genomic research.

Howe, Kevin L; Contreras-Moreira, Bruno; De Silva, Nishadi; Maslen, Gareth; Akanni, Wasiu; Allen, James; Alvarez-Jarreta, Jorge; Barba, Matthieu; Bolser, Dan M; Cambell, Lahcen; Carbajo, Manuel; Chakiachvili, Marc; Christensen, Mikkel; Cummins, Carla; Cuzick, Alayne; Davis, Paul; Fexova, Silvie; Gall, Astrid; George, Nancy; Gil, Laurent; Gupta, Parul; Hammond-Kosack, Kim E; Haskell, Erin; Hunt, Sarah E; Jaiswal, Pankaj; Janacek, Sophie H; Kersey, Paul J; Langridge, Nick; Maheswari, Uma; Maurel, Thomas; McDowall, Mark D; Moore, Ben; Muffato, Matthieu; Naamati, Guy; Naithani, Sushma; Olson, Andrew; Papatheodorou, Irene; Patricio, Mateus; Paulini, Michael; Pedro, Helder; Perry, Emily; Preece, Justin; Rosello, Marc; Russell, Matthew; Sitnik, Vasily; Staines, Daniel M; Stein, Joshua; Tello-Ruiz, Marcela K; Trevanion, Stephen J; Urban, Martin.

Nucleic Acids Res ; 48(D1): D689-D695, 2020 01 08.

Article in English | MEDLINE | ID: mdl-31598706

ABSTRACT

Ensembl Genomes (http://www.ensemblgenomes.org) is an integrating resource for genome-scale data from non-vertebrate species, complementing the resources for vertebrate genomics developed in the context of the Ensembl project (http://www.ensembl.org). Together, the two resources provide a consistent set of interfaces to genomic data across the tree of life, including reference genome sequence, gene models, transcriptional data, genetic variation and comparative analysis. Data may be accessed via our website, online tools platform and programmatic interfaces, with updates made four times per year (in synchrony with Ensembl). Here, we provide an overview of Ensembl Genomes, with a focus on recent developments. These include the continued growth, more robust and reproducible sets of orthologues and paralogues, and enriched views of gene expression and gene function in plants. Finally, we report on our continued deeper integration with the Ensembl project, which forms a key part of our future strategy for dealing with the increasing quantity of available genome-scale data across the tree of life.

Subjects

Computational Biology/methods; Databases, Genetic; Genetic Variation; Genome, Bacterial; Genome, Fungal; Genome, Plant; Algorithms; Animals; Caenorhabditis elegans/genetics; Genomics; Internet; Molecular Sequence Annotation; Phenotype; Plants/genetics; Reference Values; Software; User-Computer Interface

12.

Flexible and scalable diagnostic filtering of genomic variants using G2P with Ensembl VEP.

Thormann, Anja; Halachev, Mihail; McLaren, William; Moore, David J; Svinti, Victoria; Campbell, Archie; Kerr, Shona M; Tischkowitz, Marc; Hunt, Sarah E; Dunlop, Malcolm G; Hurles, Matthew E; Wright, Caroline F; Firth, Helen V; Cunningham, Fiona; FitzPatrick, David R.

Nat Commun ; 10(1): 2373, 2019 05 30.

Article in English | MEDLINE | ID: mdl-31147538

ABSTRACT

We aimed to develop an efficient, flexible and scalable approach to diagnostic genome-wide sequence analysis of genetically heterogeneous clinical presentations. Here we present G2P ( www.ebi.ac.uk/gene2phenotype ) as an online system to establish, curate and distribute datasets for diagnostic variant filtering via association of allelic requirement and mutational consequence at a defined locus with phenotypic terms, confidence level and evidence links. An extension to Ensembl Variant Effect Predictor (VEP), VEP-G2P was used to filter both disease-associated and control whole exome sequence (WES) with Developmental Disorders G2P (G2PDD; 2044 entries). VEP-G2PDD shows a sensitivity/precision of 97.3%/33% for de novo and 81.6%/22.7% for inherited pathogenic genotypes respectively. Many of the missing genotypes are likely false-positive pathogenic assignments. The expected number and discriminative features of background genotypes are defined using control WES. Using only human genetic data VEP-G2P performs well compared to other freely-available diagnostic systems and future phenotypic matching capabilities should further enhance performance.
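The idea of filtering on allelic requirement and mutational consequence at a locus can be illustrated with a deliberately simplified sketch. It is not the VEP-G2P plugin's logic. The panel, gene names, consequence set and the rule "a homozygous call counts as two alleles" are all assumptions made for this example.

```python
from collections import defaultdict

# Hypothetical panel: gene symbol -> allelic requirement.
PANEL = {"GENE_A": "monoallelic", "GENE_B": "biallelic"}
# Hypothetical set of consequence terms treated as potentially damaging.
DAMAGING = {"stop_gained", "frameshift_variant", "missense_variant"}


def genes_meeting_requirement(variants):
    """Return panel genes whose qualifying allele count meets the requirement.

    Each variant is a dict with keys: gene, consequence, zygosity ("het" or "hom").
    """
    allele_counts = defaultdict(int)
    for v in variants:
        if v["gene"] in PANEL and v["consequence"] in DAMAGING:
            allele_counts[v["gene"]] += 2 if v["zygosity"] == "hom" else 1
    needed = {"monoallelic": 1, "biallelic": 2}
    return sorted(g for g, n in allele_counts.items() if n >= needed[PANEL[g]])


variants = [
    {"gene": "GENE_A", "consequence": "stop_gained", "zygosity": "het"},
    {"gene": "GENE_B", "consequence": "missense_variant", "zygosity": "het"},
    {"gene": "GENE_B", "consequence": "synonymous_variant", "zygosity": "het"},
]
print(genes_meeting_requirement(variants))  # → ['GENE_A']
```

Here GENE_B is rejected because only one qualifying allele is present while the panel demands two. A real implementation would also need frequency thresholds, phasing and the confidence levels mentioned in the abstract.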

Subjects

Developmental Disabilities/genetics; Exome Sequencing; Genetic Testing; Genome, Human; Alleles; Genotype; Humans; Molecular Diagnostic Techniques; Mutation; Phenotype; Sequence Analysis, DNA; Whole Genome Sequencing

13.

A plugin for the Ensembl Variant Effect Predictor that uses MaxEntScan to predict variant spliceogenicity.

Shamsani, Jannah; Kazakoff, Stephen H; Armean, Irina M; McLaren, Will; Parsons, Michael T; Thompson, Bryony A; O'Mara, Tracy A; Hunt, Sarah E; Waddell, Nicola; Spurdle, Amanda B.

Bioinformatics ; 35(13): 2315-2317, 2019 07 01.

Article in English | MEDLINE | ID: mdl-30475984

ABSTRACT

SUMMARY: Assessing the pathogenicity of genetic variants can be a complex and challenging task. Spliceogenic variants, which alter mRNA splicing, may yield mature transcripts that encode non-functional protein products, an important predictor of Mendelian disease risk. However, most variant annotation tools do not adequately assess spliceogenicity outside the native splice site and thus the disease-causing potential of variants in other intronic and exonic regions is often overlooked. Here, we present a plugin for the Ensembl Variant Effect Predictor that packages MaxEntScan and extends its functionality to provide splice site predictions using a maximum entropy model. The plugin incorporates a sliding window algorithm to predict splice site loss or gain for any variant that overlaps a transcript feature. We also demonstrate the utility of the plugin by comparing our predictions to two mRNA splicing datasets containing several cancer-susceptibility genes. AVAILABILITY AND IMPLEMENTATION: Source code is freely available under the Apache License, Version 2.0: https://github.com/Ensembl/VEP_plugins. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
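The sliding-window idea mentioned in this summary can be sketched as follows: enumerate every fixed-length window that covers the variant position, score each window in the reference and in the altered sequence, and compare the best scores. The 9-base window matches the length MaxEntScan uses for 5' splice sites. The scoring function here is a toy placeholder and not the maximum entropy model, and all function names are assumptions for this illustration.

```python
def overlapping_windows(sequence: str, position: int, window: int):
    """Yield (start, k-mer) for every window of the given size covering position.

    position is a 0-based index into sequence.
    """
    first = max(0, position - window + 1)
    last = min(position, len(sequence) - window)
    for start in range(first, last + 1):
        yield start, sequence[start:start + window]


def toy_score(kmer: str) -> float:
    """Placeholder scorer (NOT the MaxEntScan model): rewards a GT at offset 3."""
    return 1.0 if kmer[3:5] == "GT" else 0.0


def max_score_change(ref_seq: str, position: int, alt_base: str, window: int = 9):
    """Compare the best window score before and after a single-base substitution."""
    alt_seq = ref_seq[:position] + alt_base + ref_seq[position + 1:]
    ref_best = max(toy_score(k) for _, k in overlapping_windows(ref_seq, position, window))
    alt_best = max(toy_score(k) for _, k in overlapping_windows(alt_seq, position, window))
    return ref_best, alt_best


# Toy sequence containing a donor-like CAG|GTAAGT motif; the G of GT is at index 13.
ref = "AAAAAAAAAACAGGTAAGTAAAAAAAAA"
print(max_score_change(ref, position=13, alt_base="A"))  # → (1.0, 0.0)
```

A drop in the best score suggests loss of a candidate site and a rise suggests gain of a cryptic one. Swapping `toy_score` for a real splice-site model would be the obvious next step.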

Subjects

RNA Splicing; Software; Algorithms; Exons; Introns

14.

Ensembl variation resources.

Hunt, Sarah E; McLaren, William; Gil, Laurent; Thormann, Anja; Schuilenburg, Helen; Sheppard, Dan; Parton, Andrew; Armean, Irina M; Trevanion, Stephen J; Flicek, Paul; Cunningham, Fiona.

Database (Oxford) ; 2018, 2018 01 01.

Article in English | MEDLINE | ID: mdl-30576484

ABSTRACT

The major goal of sequencing humans and many other species is to understand the link between genomic variation, phenotype and disease. There are numerous valuable and well-established variation resources, but collating and making sense of non-homogeneous, often large-scale data sets from disparate sources remains a challenge. Without a systematic catalogue of these data and appropriate query and annotation tools, understanding the genome sequence of an individual and assessing their disease risk is impossible. In Ensembl, we substantially solve this problem: we develop methods to facilitate data integration and broad access; aggregate information in a consistent manner and make it available in a variety of standard formats, both visually and programmatically; build analysis pipelines to compare variants to comprehensive genomic annotation sets; and make all tools and data publicly available.

Subjects

Database Management Systems; Databases, Genetic; Genomics/methods; Molecular Sequence Annotation/methods; Algorithms; Humans; Sequence Analysis, DNA; User-Computer Interface

15.

Ensembl 2018.

Zerbino, Daniel R; Achuthan, Premanand; Akanni, Wasiu; Amode, M Ridwan; Barrell, Daniel; Bhai, Jyothish; Billis, Konstantinos; Cummins, Carla; Gall, Astrid; Girón, Carlos García; Gil, Laurent; Gordon, Leo; Haggerty, Leanne; Haskell, Erin; Hourlier, Thibaut; Izuogu, Osagie G; Janacek, Sophie H; Juettemann, Thomas; To, Jimmy Kiang; Laird, Matthew R; Lavidas, Ilias; Liu, Zhicheng; Loveland, Jane E; Maurel, Thomas; McLaren, William; Moore, Benjamin; Mudge, Jonathan; Murphy, Daniel N; Newman, Victoria; Nuhn, Michael; Ogeh, Denye; Ong, Chuang Kee; Parker, Anne; Patricio, Mateus; Riat, Harpreet Singh; Schuilenburg, Helen; Sheppard, Dan; Sparrow, Helen; Taylor, Kieron; Thormann, Anja; Vullo, Alessandro; Walts, Brandon; Zadissa, Amonida; Frankish, Adam; Hunt, Sarah E; Kostadima, Myrto; Langridge, Nicholas; Martin, Fergal J; Muffato, Matthieu; Perry, Emily.

Nucleic Acids Res ; 46(D1): D754-D761, 2018 01 04.

Article in English | MEDLINE | ID: mdl-29155950

ABSTRACT

The Ensembl project has been aggregating, processing, integrating and redistributing genomic datasets since the initial releases of the draft human genome, with the aim of accelerating genomics research through rapid open distribution of public data. Large amounts of raw data are thus transformed into knowledge, which is made available via a multitude of channels, in particular our browser (http://www.ensembl.org). Over time, we have expanded in multiple directions. First, our resources describe multiple fields of genomics, in particular gene annotation, comparative genomics, genetics and epigenomics. Second, we cover a growing number of genome assemblies; Ensembl Release 90 contains exactly 100. Third, our databases feed simultaneously into an array of services designed around different use cases, ranging from quick browsing to genome-wide bioinformatic analysis. We present here the latest developments of the Ensembl project, with a focus on managing an increasing number of assemblies, supporting efforts in genome interpretation and improving our browser.

Subjects

Databases, Genetic; Datasets as Topic; Genome; Information Dissemination; Animals; Epigenomics; Genome, Human; Genome-Wide Association Study; Genomics; High-Throughput Nucleotide Sequencing; Humans; Molecular Sequence Annotation; Vertebrates/genetics; Web Browser

16.

Ensembl 2017.

Aken, Bronwen L; Achuthan, Premanand; Akanni, Wasiu; Amode, M Ridwan; Bernsdorff, Friederike; Bhai, Jyothish; Billis, Konstantinos; Carvalho-Silva, Denise; Cummins, Carla; Clapham, Peter; Gil, Laurent; Girón, Carlos García; Gordon, Leo; Hourlier, Thibaut; Hunt, Sarah E; Janacek, Sophie H; Juettemann, Thomas; Keenan, Stephen; Laird, Matthew R; Lavidas, Ilias; Maurel, Thomas; McLaren, William; Moore, Benjamin; Murphy, Daniel N; Nag, Rishi; Newman, Victoria; Nuhn, Michael; Ong, Chuang Kee; Parker, Anne; Patricio, Mateus; Riat, Harpreet Singh; Sheppard, Daniel; Sparrow, Helen; Taylor, Kieron; Thormann, Anja; Vullo, Alessandro; Walts, Brandon; Wilder, Steven P; Zadissa, Amonida; Kostadima, Myrto; Martin, Fergal J; Muffato, Matthieu; Perry, Emily; Ruffier, Magali; Staines, Daniel M; Trevanion, Stephen J; Cunningham, Fiona; Yates, Andrew; Zerbino, Daniel R; Flicek, Paul.

Nucleic Acids Res ; 45(D1): D635-D642, 2017 01 04.

Article in English | MEDLINE | ID: mdl-27899575

ABSTRACT

Ensembl (www.ensembl.org) is a database and genome browser for enabling research on vertebrate genomes. We import, analyse, curate and integrate a diverse collection of large-scale reference data to create a more comprehensive view of genome biology than would be possible from any individual dataset. Our extensive data resources include evidence-based gene and regulatory region annotation, genome variation and gene trees. An accompanying suite of tools, infrastructure and programmatic access methods ensure uniform data analysis and distribution for all supported species. Together, these provide a comprehensive solution for large-scale and targeted genomics applications alike. Among many other developments over the past year, we have improved our resources for gene regulation and comparative genomics, and added CRISPR/Cas9 target sites. We released new browser functionality and tools, including improved filtering and prioritization of genome variation, Manhattan plot visualization for linkage disequilibrium and eQTL data, and an ontology search for phenotypes, traits and disease. We have also enhanced data discovery and access with a track hub registry and a selection of new REST end points. All Ensembl data are freely released to the scientific community and our source code is available via the open source Apache 2.0 license.

Subjects

Computational Biology/methods; Databases, Genetic; Genomics/methods; Search Engine; Software; Web Browser; Animals; Data Mining; Evolution, Molecular; Gene Expression Regulation; Genetic Variation; Genome, Human; Humans; Molecular Sequence Annotation; Species Specificity; Vertebrates

17.

The Ensembl Variant Effect Predictor.

McLaren, William; Gil, Laurent; Hunt, Sarah E; Riat, Harpreet Singh; Ritchie, Graham R S; Thormann, Anja; Flicek, Paul; Cunningham, Fiona.

Genome Biol ; 17(1): 122, 2016 06 06.

Article in English | MEDLINE | ID: mdl-27268795

ABSTRACT

The Ensembl Variant Effect Predictor is a powerful toolset for the analysis, annotation, and prioritization of genomic variants in coding and non-coding regions. It provides access to an extensive collection of genomic annotation, with a variety of interfaces to suit different requirements, and simple options for configuring and extending analysis. It is open source, free to use, and supports full reproducibility of results. The Ensembl Variant Effect Predictor can simplify and accelerate variant interpretation in a wide range of study designs.

Subjects

Genetic Variation; Molecular Sequence Annotation/methods; Software; Computational Biology; Databases, Nucleic Acid; Genomics; Humans; Internet

18.

Polymorphism in a lincRNA Associates with a Doubled Risk of Pneumococcal Bacteremia in Kenyan Children.

Rautanen, Anna; Pirinen, Matti; Mills, Tara C; Rockett, Kirk A; Strange, Amy; Ndungu, Anne W; Naranbhai, Vivek; Gilchrist, James J; Bellenguez, Céline; Freeman, Colin; Band, Gavin; Bumpstead, Suzannah J; Edkins, Sarah; Giannoulatou, Eleni; Gray, Emma; Dronov, Serge; Hunt, Sarah E; Langford, Cordelia; Pearson, Richard D; Su, Zhan; Vukcevic, Damjan; Macharia, Alex W; Uyoga, Sophie; Ndila, Carolyne; Mturi, Neema; Njuguna, Patricia; Mohammed, Shebe; Berkley, James A; Mwangi, Isaiah; Mwarumba, Salim; Kitsao, Barnes S; Lowe, Brett S; Morpeth, Susan C; Khandwalla, Iqbal; Blackwell, Jenefer M; Bramon, Elvira; Brown, Matthew A; Casas, Juan P; Corvin, Aiden; Duncanson, Audrey; Jankowski, Janusz; Markus, Hugh S; Mathew, Christopher G; Palmer, Colin N A; Plomin, Robert; Sawcer, Stephen J; Trembath, Richard C; Viswanathan, Ananth C; Wood, Nicholas W; Deloukas, Panos.

Am J Hum Genet ; 98(6): 1092-1100, 2016 Jun 02.

Article in English | MEDLINE | ID: mdl-27236921

ABSTRACT

Bacteremia (bacterial bloodstream infection) is a major cause of illness and death in sub-Saharan Africa but little is known about the role of human genetics in susceptibility. We conducted a genome-wide association study of bacteremia susceptibility in more than 5,000 Kenyan children as part of the Wellcome Trust Case Control Consortium 2 (WTCCC2). Both the blood-culture-proven bacteremia case subjects and healthy infants as controls were recruited from Kilifi, on the east coast of Kenya. Streptococcus pneumoniae is the most common cause of bacteremia in Kilifi and was thus the focus of this study. We identified an association between polymorphisms in a long intergenic non-coding RNA (lincRNA) gene (AC011288.2) and pneumococcal bacteremia and replicated the results in the same population (p combined = 1.69 × 10⁻⁹; OR = 2.47, 95% CI = 1.84-3.31). The susceptibility allele is African specific, derived rather than ancestral, and occurs at low frequency (2.7% in control subjects and 6.4% in case subjects). Our further studies showed AC011288.2 expression only in neutrophils, a cell type that is known to play a major role in pneumococcal clearance. Identification of this novel association will further focus research on the role of lincRNAs in human infectious disease.

Subjects

Bacteremia/genetics; Pneumonia, Pneumococcal/genetics; Polymorphism, Genetic/genetics; RNA, Long Noncoding/genetics; Streptococcus pneumoniae/genetics; Adolescent; Bacteremia/microbiology; Bacteremia/pathology; Case-Control Studies; Child; Child, Preschool; Genome-Wide Association Study; Humans; Infant; Infant, Newborn; Kenya/epidemiology; Pneumonia, Pneumococcal/microbiology; Pneumonia, Pneumococcal/pathology; Risk Factors

19.

A computational model of flow and species transport in the mesangium.

Hunt, Sarah E; Dorfman, Kevin D; Segal, Yoav; Barocas, Victor H.

Am J Physiol Renal Physiol ; 310(3): F222-9, 2016 Feb 01.

Article in English | MEDLINE | ID: mdl-26831339

ABSTRACT

A variety of macromolecules accumulate in the glomerular mesangium in many different diseases, but the physics of the transport of these molecules within the mesangial matrix has not been extensively studied. We present a computational model of convection and diffusion within the porous mesangial matrix and apply this model to the specific instance of immunoglobulin A (IgA) transport in IgA nephropathy. We examine the influence of physiological factors including glomerular basement membrane (GBM) thickness and mesangial matrix density on the total accumulation of IgA. Our results suggest that IgA accumulation can be understood by relating convection and diffusion, thus demonstrating the importance of intrinsic glomerular factors.

Subjects

Computer Simulation; Glomerular Mesangium/metabolism; Glomerulonephritis, IGA/metabolism; Immunoglobulin A/metabolism; Models, Biological; Animals; Biological Transport; Diffusion; Glomerular Basement Membrane/metabolism; Glomerular Basement Membrane/pathology; Glomerular Mesangium/blood supply; Glomerular Mesangium/pathology; Glomerulonephritis, IGA/pathology; Humans; Motion; Osmotic Pressure; Particle Size; Porosity; Pressure; Renal Circulation; Time Factors

20.

Ensembl 2016.

Yates, Andrew; Akanni, Wasiu; Amode, M Ridwan; Barrell, Daniel; Billis, Konstantinos; Carvalho-Silva, Denise; Cummins, Carla; Clapham, Peter; Fitzgerald, Stephen; Gil, Laurent; Girón, Carlos García; Gordon, Leo; Hourlier, Thibaut; Hunt, Sarah E; Janacek, Sophie H; Johnson, Nathan; Juettemann, Thomas; Keenan, Stephen; Lavidas, Ilias; Martin, Fergal J; Maurel, Thomas; McLaren, William; Murphy, Daniel N; Nag, Rishi; Nuhn, Michael; Parker, Anne; Patricio, Mateus; Pignatelli, Miguel; Rahtz, Matthew; Riat, Harpreet Singh; Sheppard, Daniel; Taylor, Kieron; Thormann, Anja; Vullo, Alessandro; Wilder, Steven P; Zadissa, Amonida; Birney, Ewan; Harrow, Jennifer; Muffato, Matthieu; Perry, Emily; Ruffier, Magali; Spudich, Giulietta; Trevanion, Stephen J; Cunningham, Fiona; Aken, Bronwen L; Zerbino, Daniel R; Flicek, Paul.

Nucleic Acids Res ; 44(D1): D710-6, 2016 Jan 04.

Article in English | MEDLINE | ID: mdl-26687719

ABSTRACT

The Ensembl project (http://www.ensembl.org) is a system for genome annotation, analysis, storage and dissemination designed to facilitate the access of genomic annotation from chordates and key model organisms. It provides access to data from 87 species across our main and early access Pre! websites. This year we introduced three newly annotated species and released numerous updates across our supported species with a concentration on data for the latest genome assemblies of human, mouse, zebrafish and rat. We also provided two data updates for the previous human assembly, GRCh37, through a dedicated website (http://grch37.ensembl.org). Our tools, in particular the VEP, have been improved significantly through integration of additional third party data. REST is now capable of larger-scale analysis and our regulatory data BioMart can deliver faster results. The website is now capable of displaying long-range interactions such as those found in cis-regulated datasets. Finally we have launched a website optimized for mobile devices providing views of genes, variants and phenotypes. Our data is made available without restriction and all code is available from our GitHub organization site (http://github.com/Ensembl) under an Apache 2.0 license.

Subjects

Databases, Genetic; Genomics; Molecular Sequence Annotation; Animals; Genes; Genetic Variation; Humans; Internet; Mice; Proteins/genetics; Rats; Regulatory Sequences, Nucleic Acid; Software
