Search | VHL Regional Portal

1.

Open Targets Platform: supporting systematic drug-target identification and prioritisation.

Ochoa, David; Hercules, Andrew; Carmona, Miguel; Suveges, Daniel; Gonzalez-Uriarte, Asier; Malangone, Cinzia; Miranda, Alfredo; Fumis, Luca; Carvalho-Silva, Denise; Spitzer, Michaela; Baker, Jarrod; Ferrer, Javier; Raies, Arwa; Razuvayevskaya, Olesya; Faulconbridge, Adam; Petsalaki, Eirini; Mutowo, Prudence; Machlitt-Northen, Sandra; Peat, Gareth; McAuley, Elaine; Ong, Chuang Kee; Mountjoy, Edward; Ghoussaini, Maya; Pierleoni, Andrea; Papa, Eliseo; Pignatelli, Miguel; Koscielny, Gautier; Karim, Mohd; Schwartzentruber, Jeremy; Hulcoop, David G; Dunham, Ian; McDonagh, Ellen M.

Nucleic Acids Res ; 49(D1): D1302-D1310, 2021 01 08.

Article in English | MEDLINE | ID: mdl-33196847

ABSTRACT

The Open Targets Platform (https://www.targetvalidation.org/) provides users with a queryable knowledgebase and user interface to aid systematic target identification and prioritisation for drug discovery based upon underlying evidence. It is publicly available and the underlying code is open source. Since our last update two years ago, we have had 10 releases to maintain and continuously improve evidence for target-disease relationships from 20 different data sources. In addition, we have integrated new evidence from key datasets, including prioritised targets identified from genome-wide CRISPR knockout screens in 300 cancer models (Project Score), and GWAS/UK BioBank statistical genetic analysis evidence from the Open Targets Genetics Portal. We have evolved our evidence scoring framework to improve target identification. To aid the prioritisation of targets and inform on the potential impact of modulating a given target, we have added evaluation of post-marketing adverse drug reactions and new curated information on target tractability and safety. We have also developed the user interface and backend technologies to improve performance and usability. In this article, we describe the latest enhancements to the Platform, to address the fundamental challenge that developing effective and safe drugs is difficult and expensive.

Subject(s)

Antineoplastic Agents/therapeutic use , Drugs, Investigational/therapeutic use , Knowledge Bases , Molecular Targeted Therapy/methods , Neoplasms/drug therapy , Software , Antineoplastic Agents/chemistry , Databases, Factual , Datasets as Topic , Drug Discovery/methods , Drugs, Investigational/chemistry , Humans , Internet , Neoplasms/classification , Neoplasms/genetics , Neoplasms/pathology

2.

Open Targets Genetics: systematic identification of trait-associated genes using large-scale genetics and functional genomics.

Ghoussaini, Maya; Mountjoy, Edward; Carmona, Miguel; Peat, Gareth; Schmidt, Ellen M; Hercules, Andrew; Fumis, Luca; Miranda, Alfredo; Carvalho-Silva, Denise; Buniello, Annalisa; Burdett, Tony; Hayhurst, James; Baker, Jarrod; Ferrer, Javier; Gonzalez-Uriarte, Asier; Jupp, Simon; Karim, Mohd Anisul; Koscielny, Gautier; Machlitt-Northen, Sandra; Malangone, Cinzia; Pendlington, Zoe May; Roncaglia, Paola; Suveges, Daniel; Wright, Daniel; Vrousgou, Olga; Papa, Eliseo; Parkinson, Helen; MacArthur, Jacqueline A L; Todd, John A; Barrett, Jeffrey C; Schwartzentruber, Jeremy; Hulcoop, David G; Ochoa, David; McDonagh, Ellen M; Dunham, Ian.

Nucleic Acids Res ; 49(D1): D1311-D1320, 2021 01 08.

Article in English | MEDLINE | ID: mdl-33045747

ABSTRACT

Open Targets Genetics (https://genetics.opentargets.org) is an open-access integrative resource that aggregates human GWAS and functional genomics data including gene expression, protein abundance, chromatin interaction and conformation data from a wide range of cell types and tissues to make robust connections between GWAS-associated loci, variants and likely causal genes. This enables systematic identification and prioritisation of likely causal variants and genes across all published trait-associated loci. In this paper, we describe the public resources we aggregate, the technology and analyses we use, and the functionality that the portal offers. Open Targets Genetics can be searched by variant, gene or study/phenotype. It offers tools that enable users to prioritise causal variants and genes at disease-associated loci and access systematic cross-disease and disease-molecular trait colocalization analysis across 92 cell types and tissues including the eQTL Catalogue. Data visualizations such as Manhattan-like plots, regional plots, credible sets overlap between studies and PheWAS plots enable users to explore GWAS signals in depth. The integrated data is made available through the web portal, for bulk download and via a GraphQL API, and the software is open source. Applications of this integrated data include identification of novel targets for drug discovery and drug repurposing.

Subject(s)

Databases, Genetic , Genome, Human , Inflammatory Bowel Diseases/genetics , Molecular Targeted Therapy/methods , Quantitative Trait Loci , Software , Chromatin/chemistry , Chromatin/metabolism , Datasets as Topic , Drug Discovery/methods , Drug Repositioning/methods , Genome-Wide Association Study , Genotype , Humans , Inflammatory Bowel Diseases/drug therapy , Inflammatory Bowel Diseases/metabolism , Inflammatory Bowel Diseases/pathology , Internet , Phenotype , Quantitative Trait, Heritable

3.

Open Targets Platform: new developments and updates two years on.

Carvalho-Silva, Denise; Pierleoni, Andrea; Pignatelli, Miguel; Ong, ChuangKee; Fumis, Luca; Karamanis, Nikiforos; Carmona, Miguel; Faulconbridge, Adam; Hercules, Andrew; McAuley, Elaine; Miranda, Alfredo; Peat, Gareth; Spitzer, Michaela; Barrett, Jeffrey; Hulcoop, David G; Papa, Eliseo; Koscielny, Gautier; Dunham, Ian.

Nucleic Acids Res ; 47(D1): D1056-D1065, 2019 01 08.

Article in English | MEDLINE | ID: mdl-30462303

ABSTRACT

The Open Targets Platform integrates evidence from genetics, genomics, transcriptomics, drugs, animal models and scientific literature to score and rank target-disease associations for drug target identification. The associations are displayed in an intuitive user interface (https://www.targetvalidation.org), and are available through a REST-API (https://api.opentargets.io/v3/platform/docs/swagger-ui) and a bulk download (https://www.targetvalidation.org/downloads/data). In addition to target-disease associations, we also aggregate and display data at the target and disease levels to aid target prioritisation. Since our first publication two years ago, we have made eight releases, added new data sources for target-disease associations, started including causal genetic variants from non genome-wide targeted arrays, added new target and disease annotations, launched new visualisations and improved existing ones and released a new web tool for batch search of up to 200 targets. We have a new URL for the Open Targets Platform REST-API, new REST endpoints and also removed the need for authorisation for API fair use. Here, we present the latest developments of the Open Targets Platform, expanding the evidence and target-disease associations with new and improved data sources, refining data quality, enhancing website usability, and increasing our user base with our training workshops, user support, social media and bioinformatics forum engagement.

Subject(s)

Computational Biology/methods , Databases, Genetic , Genomics/methods , Information Storage and Retrieval/methods , Molecular Targeted Therapy/methods , Computational Biology/trends , Gene Expression Profiling/methods , Genomics/trends , Humans , Information Storage and Retrieval/trends , Internet , Reproducibility of Results , Software

4.

Uncovering novel repositioning opportunities using the Open Targets platform.

Khaladkar, Mugdha; Koscielny, Gautier; Hasan, Samiul; Agarwal, Pankaj; Dunham, Ian; Rajpal, Deepak; Sanseau, Philippe.

Drug Discov Today ; 22(12): 1800-1807, 2017 12.

Article in English | MEDLINE | ID: mdl-28919242

ABSTRACT

The recently developed Open Targets platform consolidates a wide range of comprehensive evidence associating known and potential drug targets with human diseases. We have harnessed the integrated data from this platform for novel drug repositioning opportunities. Our computational workflow systematically mines data from various evidence categories and presents potential repositioning opportunities for drugs that are marketed or being investigated in ongoing human clinical trials, based on evidence strength on target-disease pairing. We classified these novel target-disease opportunities in several ways: (i) number of independent counts of evidence; (ii) broad therapy area of origin; and (iii) repositioning within or across therapy areas. Finally, we elaborate on one example that was identified by this approach.

Subject(s)

Computational Biology/methods , Drug Repositioning , Animals , Humans , Rare Diseases/drug therapy , Receptor, Melanocortin, Type 1/metabolism , Vitiligo/drug therapy , Vitiligo/metabolism

5.

Open Targets: a platform for therapeutic target identification and validation.

Koscielny, Gautier; An, Peter; Carvalho-Silva, Denise; Cham, Jennifer A; Fumis, Luca; Gasparyan, Rippa; Hasan, Samiul; Karamanis, Nikiforos; Maguire, Michael; Papa, Eliseo; Pierleoni, Andrea; Pignatelli, Miguel; Platt, Theo; Rowland, Francis; Wankar, Priyanka; Bento, A Patrícia; Burdett, Tony; Fabregat, Antonio; Forbes, Simon; Gaulton, Anna; Gonzalez, Cristina Yenyxe; Hermjakob, Henning; Hersey, Anne; Jupe, Steven; Kafkas, Senay; Keays, Maria; Leroy, Catherine; Lopez, Francisco-Javier; Magarinos, Maria Paula; Malone, James; McEntyre, Johanna; Munoz-Pomer Fuentes, Alfonso; O'Donovan, Claire; Papatheodorou, Irene; Parkinson, Helen; Palka, Barbara; Paschall, Justin; Petryszak, Robert; Pratanwanich, Naruemon; Sarntivijal, Sirarat; Saunders, Gary; Sidiropoulos, Konstantinos; Smith, Thomas; Sondka, Zbyslaw; Stegle, Oliver; Tang, Y Amy; Turner, Edward; Vaughan, Brendan; Vrousgou, Olga; Watkins, Xavier.

Nucleic Acids Res ; 45(D1): D985-D994, 2017 01 04.

Article in English | MEDLINE | ID: mdl-27899665

ABSTRACT

We have designed and developed a data integration and visualization platform that provides evidence about the association of known and potential drug targets with diseases. The platform is designed to support identification and prioritization of biological targets for follow-up. Each drug target is linked to a disease using integrated genome-wide data from a broad range of data sources. The platform provides either a target-centric workflow to identify diseases that may be associated with a specific target, or a disease-centric workflow to identify targets that may be associated with a specific disease. Users can easily transition between these target- and disease-centric workflows. The Open Targets Validation Platform is accessible at https://www.targetvalidation.org.

Subject(s)

Computational Biology/methods , Molecular Targeted Therapy , Search Engine , Software , Databases, Factual , Humans , Molecular Targeted Therapy/methods , Reproducibility of Results , Web Browser , Workflow

6.

Linking rare and common disease: mapping clinical disease-phenotypes to ontologies in therapeutic target validation.

Sarntivijai, Sirarat; Vasant, Drashtti; Jupp, Simon; Saunders, Gary; Bento, A Patrícia; Gonzalez, Daniel; Betts, Joanna; Hasan, Samiul; Koscielny, Gautier; Dunham, Ian; Parkinson, Helen; Malone, James.

J Biomed Semantics ; 7: 8, 2016.

Article in English | MEDLINE | ID: mdl-27011785

ABSTRACT

BACKGROUND: The Centre for Therapeutic Target Validation (CTTV - https://www.targetvalidation.org/) was established to generate therapeutic target evidence from genome-scale experiments and analyses. CTTV aims to support the validity of therapeutic targets by integrating existing and newly-generated data. Data integration has been achieved in some resources by mapping metadata such as disease and phenotypes to the Experimental Factor Ontology (EFO). Additionally, the relationship between ontology descriptions of rare and common diseases and their phenotypes can offer insights into shared biological mechanisms and potential drug targets. Ontologies are not ideal for representing the sometimes associated type relationship required. This work addresses two challenges; annotation of diverse big data, and representation of complex, sometimes associated relationships between concepts. METHODS: Semantic mapping uses a combination of custom scripting, our annotation tool 'Zooma', and expert curation. Disease-phenotype associations were generated using literature mining on Europe PubMed Central abstracts, which were manually verified by experts for validity. Representation of the disease-phenotype association was achieved by the Ontology of Biomedical AssociatioN (OBAN), a generic association representation model. OBAN represents associations between a subject and object i.e., disease and its associated phenotypes and the source of evidence for that association. The indirect disease-to-disease associations are exposed through shared phenotypes. This was applied to the use case of linking rare to common diseases at the CTTV. RESULTS: EFO yields an average of over 80% of mapping coverage in all data sources. A 42% precision is obtained from the manual verification of the text-mined disease-phenotype associations. This results in 1452 and 2810 disease-phenotype pairs for IBD and autoimmune disease and contributes towards 11,338 rare diseases associations (merged with existing published work [Am J Hum Genet 97:111-24, 2015]). An OBAN result file is downloadable at http://sourceforge.net/p/efo/code/HEAD/tree/trunk/src/efoassociations/. Twenty common diseases are linked to 85 rare diseases by shared phenotypes. A generalizable OBAN model for association representation is presented in this study. CONCLUSIONS: Here we present solutions to large-scale annotation-ontology mapping in the CTTV knowledge base, a process for disease-phenotype mining, and propose a generic association model, 'OBAN', as a means to integrate disease using shared phenotypes. AVAILABILITY: EFO is released monthly and available for download at http://www.ebi.ac.uk/efo/.

Subject(s)

Biological Ontologies , Molecular Targeted Therapy , Phenotype , Rare Diseases/drug therapy , Data Mining , Databases, Factual , Humans , Inflammatory Bowel Diseases/drug therapy , Reproducibility of Results

7.

The International Mouse Phenotyping Consortium Web Portal, a unified point of access for knockout mice and related phenotyping data.

Koscielny, Gautier; Yaikhom, Gagarine; Iyer, Vivek; Meehan, Terrence F; Morgan, Hugh; Atienza-Herrero, Julian; Blake, Andrew; Chen, Chao-Kung; Easty, Richard; Di Fenza, Armida; Fiegel, Tanja; Grifiths, Mark; Horne, Alan; Karp, Natasha A; Kurbatova, Natalja; Mason, Jeremy C; Matthews, Peter; Oakley, Darren J; Qazi, Asfand; Regnart, Jack; Retha, Ahmad; Santos, Luis A; Sneddon, Duncan J; Warren, Jonathan; Westerberg, Henrik; Wilson, Robert J; Melvin, David G; Smedley, Damian; Brown, Steve D M; Flicek, Paul; Skarnes, William C; Mallon, Ann-Marie; Parkinson, Helen.

Nucleic Acids Res ; 42(Database issue): D802-9, 2014 Jan.

Article in English | MEDLINE | ID: mdl-24194600

ABSTRACT

The International Mouse Phenotyping Consortium (IMPC) web portal (http://www.mousephenotype.org) provides the biomedical community with a unified point of access to mutant mice and rich collection of related emerging and existing mouse phenotype data. IMPC mouse clinics worldwide follow rigorous highly structured and standardized protocols for the experimentation, collection and dissemination of data. Dedicated 'data wranglers' work with each phenotyping center to collate data and perform quality control of data. An automated statistical analysis pipeline has been developed to identify knockout strains with a significant change in the phenotype parameters. Annotation with biomedical ontologies allows biologists and clinicians to easily find mouse strains with phenotypic traits relevant to their research. Data integration with other resources will provide insights into mammalian gene function and human disease. As phenotype data become available for every gene in the mouse, the IMPC web portal will become an invaluable tool for researchers studying the genetic contributions of genes to human diseases.

Subject(s)

Databases, Genetic , Mice, Knockout , Phenotype , Animals , Biological Ontologies , Internet , Mice

8.

Analysis of variation at transcription factor binding sites in Drosophila and humans.

Spivakov, Mikhail; Akhtar, Junaid; Kheradpour, Pouya; Beal, Kathryn; Girardot, Charles; Koscielny, Gautier; Herrero, Javier; Kellis, Manolis; Furlong, Eileen E M; Birney, Ewan.

Genome Biol ; 13(9): R49, 2012 Sep 28.

Article in English | MEDLINE | ID: mdl-22950968

ABSTRACT

BACKGROUND: Advances in sequencing technology have boosted population genomics and made it possible to map the positions of transcription factor binding sites (TFBSs) with high precision. Here we investigate TFBS variability by combining transcription factor binding maps generated by ENCODE, modENCODE, our previously published data and other sources with genomic variation data for human individuals and Drosophila isogenic lines. RESULTS: We introduce a metric of TFBS variability that takes into account changes in motif match associated with mutation and makes it possible to investigate TFBS functional constraints instance-by-instance as well as in sets that share common biological properties. We also take advantage of the emerging per-individual transcription factor binding data to show evidence that TFBS mutations, particularly at evolutionarily conserved sites, can be efficiently buffered to ensure coherent levels of transcription factor binding. CONCLUSIONS: Our analyses provide insights into the relationship between individual and interspecies variation and show evidence for the functional buffering of TFBS mutations in both humans and flies. In a broad perspective, these results demonstrate the potential of combining functional genomics and population genetics approaches for understanding gene regulation.

Subject(s)

Drosophila/genetics , Genetic Variation , Genome, Human , Genome, Insect , Transcription Factors/metabolism , Analysis of Variance , Animals , Binding Sites , Humans , Models, Genetic , Molecular Sequence Annotation , Mutation , Nucleotide Motifs , Position-Specific Scoring Matrices , Regulatory Sequences, Nucleic Acid , Sequence Analysis, DNA/methods

9.

Ensembl 2012.

Flicek, Paul; Amode, M Ridwan; Barrell, Daniel; Beal, Kathryn; Brent, Simon; Carvalho-Silva, Denise; Clapham, Peter; Coates, Guy; Fairley, Susan; Fitzgerald, Stephen; Gil, Laurent; Gordon, Leo; Hendrix, Maurice; Hourlier, Thibaut; Johnson, Nathan; Kähäri, Andreas K; Keefe, Damian; Keenan, Stephen; Kinsella, Rhoda; Komorowska, Monika; Koscielny, Gautier; Kulesha, Eugene; Larsson, Pontus; Longden, Ian; McLaren, William; Muffato, Matthieu; Overduin, Bert; Pignatelli, Miguel; Pritchard, Bethan; Riat, Harpreet Singh; Ritchie, Graham R S; Ruffier, Magali; Schuster, Michael; Sobral, Daniel; Tang, Y Amy; Taylor, Kieron; Trevanion, Stephen; Vandrovcova, Jana; White, Simon; Wilson, Mark; Wilder, Steven P; Aken, Bronwen L; Birney, Ewan; Cunningham, Fiona; Dunham, Ian; Durbin, Richard; Fernández-Suarez, Xosé M; Harrow, Jennifer; Herrero, Javier; Hubbard, Tim J P.

Nucleic Acids Res ; 40(Database issue): D84-90, 2012 Jan.

Article in English | MEDLINE | ID: mdl-22086963

ABSTRACT

The Ensembl project (http://www.ensembl.org) provides genome resources for chordate genomes with a particular focus on human genome data as well as data for key model organisms such as mouse, rat and zebrafish. Five additional species were added in the last year including gibbon (Nomascus leucogenys) and Tasmanian devil (Sarcophilus harrisii) bringing the total number of supported species to 61 as of Ensembl release 64 (September 2011). Of these, 55 species appear on the main Ensembl website and six species are provided on the Ensembl preview site (Pre!Ensembl; http://pre.ensembl.org) with preliminary support. The past year has also seen improvements across the project.

Subject(s)

Databases, Genetic , Genomics , Animals , Gene Expression Regulation , Genetic Variation , Humans , Mice , Molecular Sequence Annotation , Rats

10.

VectorBase: improvements to a bioinformatics resource for invertebrate vector genomics.

Megy, Karine; Emrich, Scott J; Lawson, Daniel; Campbell, David; Dialynas, Emmanuel; Hughes, Daniel S T; Koscielny, Gautier; Louis, Christos; Maccallum, Robert M; Redmond, Seth N; Sheehan, Andrew; Topalis, Pantelis; Wilson, Derek.

Nucleic Acids Res ; 40(Database issue): D729-34, 2012 Jan.

Article in English | MEDLINE | ID: mdl-22135296

ABSTRACT

VectorBase (http://www.vectorbase.org) is a NIAID-supported bioinformatics resource for invertebrate vectors of human pathogens. It hosts data for nine genomes: mosquitoes (three Anopheles gambiae genomes, Aedes aegypti and Culex quinquefasciatus), tick (Ixodes scapularis), body louse (Pediculus humanus), kissing bug (Rhodnius prolixus) and tsetse fly (Glossina morsitans). Hosted data range from genomic features and expression data to population genetics and ontologies. We describe improvements and integration of new data that expand our taxonomic coverage. Releases are bi-monthly and include the delivery of preliminary data for emerging genomes. Frequent updates of the genome browser provide VectorBase users with increasing options for visualizing their own high-throughput data. One major development is a new population biology resource for storing genomic variations, insecticide resistance data and their associated metadata. It takes advantage of improved ontologies and controlled vocabularies. Combined, these new features ensure timely release of multiple types of data in the public domain while helping overcome the bottlenecks of bioinformatics and annotation by engaging with our user community.

Subject(s)

Databases, Genetic , Genome, Insect , Insect Vectors/genetics , Animals , Culicidae/genetics , Genetic Variation , Genomics , Insecticide Resistance , Ixodes/genetics , Pediculus/genetics , Rhodnius/genetics , Tsetse Flies/genetics

11.

Ensembl Genomes: an integrative resource for genome-scale data from non-vertebrate species.

Kersey, Paul J; Staines, Daniel M; Lawson, Daniel; Kulesha, Eugene; Derwent, Paul; Humphrey, Jay C; Hughes, Daniel S T; Keenan, Stephan; Kerhornou, Arnaud; Koscielny, Gautier; Langridge, Nicholas; McDowall, Mark D; Megy, Karine; Maheswari, Uma; Nuhn, Michael; Paulini, Michael; Pedro, Helder; Toneva, Iliana; Wilson, Derek; Yates, Andrew; Birney, Ewan.

Nucleic Acids Res ; 40(Database issue): D91-7, 2012 Jan.

Article in English | MEDLINE | ID: mdl-22067447

ABSTRACT

Ensembl Genomes (http://www.ensemblgenomes.org) is an integrative resource for genome-scale data from non-vertebrate species. The project exploits and extends technology (for genome annotation, analysis and dissemination) developed in the context of the (vertebrate-focused) Ensembl project and provides a complementary set of resources for non-vertebrate species through a consistent set of programmatic and interactive interfaces. These provide access to data including reference sequence, gene models, transcriptional data, polymorphisms and comparative analysis. Since its launch in 2009, Ensembl Genomes has undergone rapid expansion, with the goal of providing coverage of all major experimental organisms, and additionally including taxonomic reference points to provide the evolutionary context in which genes can be understood. Against the backdrop of a continuing increase in genome sequencing activities in all parts of the tree of life, we seek to work, wherever possible, with the communities actively generating and using data, and are participants in a growing range of collaborations involved in the annotation and analysis of genomes.

Subject(s)

Databases, Genetic , Genomics , Animals , Genome , Genome, Bacterial , Genome, Fungal , Genome, Plant , Invertebrates/genetics , Molecular Sequence Annotation , Systems Integration

12.

Ensembl's 10th year.

Flicek, Paul; Aken, Bronwen L; Ballester, Benoit; Beal, Kathryn; Bragin, Eugene; Brent, Simon; Chen, Yuan; Clapham, Peter; Coates, Guy; Fairley, Susan; Fitzgerald, Stephen; Fernandez-Banet, Julio; Gordon, Leo; Gräf, Stefan; Haider, Syed; Hammond, Martin; Howe, Kerstin; Jenkinson, Andrew; Johnson, Nathan; Kähäri, Andreas; Keefe, Damian; Keenan, Stephen; Kinsella, Rhoda; Kokocinski, Felix; Koscielny, Gautier; Kulesha, Eugene; Lawson, Daniel; Longden, Ian; Massingham, Tim; McLaren, William; Megy, Karine; Overduin, Bert; Pritchard, Bethan; Rios, Daniel; Ruffier, Magali; Schuster, Michael; Slater, Guy; Smedley, Damian; Spudich, Giulietta; Tang, Y Amy; Trevanion, Stephen; Vilella, Albert; Vogel, Jan; White, Simon; Wilder, Steven P; Zadissa, Amonida; Birney, Ewan; Cunningham, Fiona; Dunham, Ian; Durbin, Richard.

Nucleic Acids Res ; 38(Database issue): D557-62, 2010 Jan.

Article in English | MEDLINE | ID: mdl-19906699

ABSTRACT

Ensembl (http://www.ensembl.org) integrates genomic information for a comprehensive set of chordate genomes with a particular focus on resources for human, mouse, rat, zebrafish and other high-value sequenced genomes. We provide complete gene annotations for all supported species in addition to specific resources that target genome variation, function and evolution. Ensembl data is accessible in a variety of formats including via our genome browser, API and BioMart. This year marks the tenth anniversary of Ensembl and in that time the project has grown with advances in genome technology. As of release 56 (September 2009), Ensembl supports 51 species including marmoset, pig, zebra finch, lizard, gorilla and wallaby, which were added in the past year. Major additions and improvements to Ensembl since our previous report include the incorporation of the human GRCh37 assembly, enhanced visualisation and data-mining options for the Ensembl regulatory features and continued development of our software infrastructure.

Subject(s)

Computational Biology/methods , Databases, Genetic , Databases, Nucleic Acid , Access to Information , Animals , Computational Biology/trends , Databases, Protein , Genetic Variation , Genomics/methods , Humans , Information Storage and Retrieval/methods , Internet , Protein Structure, Tertiary , Software , Species Specificity

13.

ASTD: The Alternative Splicing and Transcript Diversity database.

Koscielny, Gautier; Le Texier, Vincent; Gopalakrishnan, Chellappa; Kumanduri, Vasudev; Riethoven, Jean-Jack; Nardone, Francesco; Stanley, Eleanor; Fallsehr, Christine; Hofmann, Oliver; Kull, Meelis; Harrington, Eoghan; Boué, Stéphanie; Eyras, Eduardo; Plass, Mireya; Lopez, Fabrice; Ritchie, William; Moucadel, Virginie; Ara, Takeshi; Pospisil, Heike; Herrmann, Alexander; G Reich, Jens; Guigó, Roderic; Bork, Peer; Doeberitz, Magnus von Knebel; Vilo, Jaak; Hide, Winston; Apweiler, Rolf; Thanaraj, Thangavel Alphonse; Gautheret, Daniel.

Genomics ; 93(3): 213-20, 2009 Mar.

Article in English | MEDLINE | ID: mdl-19059335

ABSTRACT

The Alternative Splicing and Transcript Diversity database (ASTD) gives access to a vast collection of alternative transcripts that integrate transcription initiation, polyadenylation and splicing variant data. Alternative transcripts are derived from the mapping of transcribed sequences to the complete human, mouse and rat genomes using an extension of the computational pipeline developed for the ASD (Alternative Splicing Database) and ATD (Alternative Transcript Diversity) databases, which are now superseded by ASTD. For the human genome, ASTD identifies splicing variants, transcription initiation variants and polyadenylation variants in 68%, 68% and 62% of the gene set, respectively, consistent with current estimates for transcription variation. Users can access ASTD through a variety of browsing and query tools, including expression state-based queries for the identification of tissue-specific isoforms. Participating laboratories have experimentally validated a subset of ASTD-predicted alternative splice forms and alternative polyadenylation forms that were not previously reported. The ASTD database can be accessed at http://www.ebi.ac.uk/astd.

Subject(s)

Alternative Splicing/genetics , Databases, Genetic , Animals , Database Management Systems , Humans , Information Storage and Retrieval/methods , Mice , Rats , Reproducibility of Results , Software , User-Computer Interface

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

SEND TO:

SELECTION OF CITATIONS

SEARCH DETAIL