Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 8 de 8
Filter
1.
Nucleic Acids Res ; 49(D1): D1302-D1310, 2021 01 08.
Article in English | MEDLINE | ID: mdl-33196847

ABSTRACT

The Open Targets Platform (https://www.targetvalidation.org/) provides users with a queryable knowledgebase and user interface to aid systematic target identification and prioritisation for drug discovery based upon underlying evidence. It is publicly available and the underlying code is open source. Since our last update two years ago, we have had 10 releases to maintain and continuously improve evidence for target-disease relationships from 20 different data sources. In addition, we have integrated new evidence from key datasets, including prioritised targets identified from genome-wide CRISPR knockout screens in 300 cancer models (Project Score), and GWAS/UK BioBank statistical genetic analysis evidence from the Open Targets Genetics Portal. We have evolved our evidence scoring framework to improve target identification. To aid the prioritisation of targets and inform on the potential impact of modulating a given target, we have added evaluation of post-marketing adverse drug reactions and new curated information on target tractability and safety. We have also developed the user interface and backend technologies to improve performance and usability. In this article, we describe the latest enhancements to the Platform, to address the fundamental challenge that developing effective and safe drugs is difficult and expensive.


Subject(s)
Antineoplastic Agents/therapeutic use , Drugs, Investigational/therapeutic use , Knowledge Bases , Molecular Targeted Therapy/methods , Neoplasms/drug therapy , Software , Antineoplastic Agents/chemistry , Databases, Factual , Datasets as Topic , Drug Discovery/methods , Drugs, Investigational/chemistry , Humans , Internet , Neoplasms/classification , Neoplasms/genetics , Neoplasms/pathology
2.
Nucleic Acids Res ; 46(D1): D802-D808, 2018 01 04.
Article in English | MEDLINE | ID: mdl-29092050

ABSTRACT

Ensembl Genomes (http://www.ensemblgenomes.org) is an integrating resource for genome-scale data from non-vertebrate species, complementing the resources for vertebrate genomics developed in the Ensembl project (http://www.ensembl.org). Together, the two resources provide a consistent set of programmatic and interactive interfaces to a rich range of data including genome sequence, gene models, transcript sequence, genetic variation, and comparative analysis. This paper provides an update to the previous publications about the resource, with a focus on recent developments and expansions. These include the incorporation of almost 20 000 additional genome sequences and over 35 000 tracks of RNA-Seq data, which have been aligned to genomic sequence and made available for visualization. Other advances since 2015 include the release of the database in Resource Description Framework (RDF) format, a large increase in community-derived curation, a new high-performance protein sequence search, additional cross-references, improved annotation of non-protein-coding genes, and the launch of pre-release and archival sites. Collectively, these changes are part of a continuing response to the increasing quantity of publicly-available genome-scale data, and the consequent need to archive, integrate, annotate and disseminate these using automated, scalable methods.


Subject(s)
Archaea/genetics , Bacteria/genetics , Databases, Genetic , Databases, Protein , Eukaryota/genetics , Genomics , Amino Acid Sequence , Animals , Base Sequence , Data Mining , Forecasting , Genome , Molecular Sequence Annotation , RNA/genetics , User-Computer Interface
3.
Nucleic Acids Res ; 46(D1): D754-D761, 2018 01 04.
Article in English | MEDLINE | ID: mdl-29155950

ABSTRACT

The Ensembl project has been aggregating, processing, integrating and redistributing genomic datasets since the initial releases of the draft human genome, with the aim of accelerating genomics research through rapid open distribution of public data. Large amounts of raw data are thus transformed into knowledge, which is made available via a multitude of channels, in particular our browser (http://www.ensembl.org). Over time, we have expanded in multiple directions. First, our resources describe multiple fields of genomics, in particular gene annotation, comparative genomics, genetics and epigenomics. Second, we cover a growing number of genome assemblies; Ensembl Release 90 contains exactly 100. Third, our databases feed simultaneously into an array of services designed around different use cases, ranging from quick browsing to genome-wide bioinformatic analysis. We present here the latest developments of the Ensembl project, with a focus on managing an increasing number of assemblies, supporting efforts in genome interpretation and improving our browser.


Subject(s)
Databases, Genetic , Datasets as Topic , Genome , Information Dissemination , Animals , Epigenomics , Genome, Human , Genome-Wide Association Study , Genomics , High-Throughput Nucleotide Sequencing , Humans , Molecular Sequence Annotation , Vertebrates/genetics , Web Browser
4.
Nucleic Acids Res ; 45(D1): D635-D642, 2017 01 04.
Article in English | MEDLINE | ID: mdl-27899575

ABSTRACT

Ensembl (www.ensembl.org) is a database and genome browser for enabling research on vertebrate genomes. We import, analyse, curate and integrate a diverse collection of large-scale reference data to create a more comprehensive view of genome biology than would be possible from any individual dataset. Our extensive data resources include evidence-based gene and regulatory region annotation, genome variation and gene trees. An accompanying suite of tools, infrastructure and programmatic access methods ensure uniform data analysis and distribution for all supported species. Together, these provide a comprehensive solution for large-scale and targeted genomics applications alike. Among many other developments over the past year, we have improved our resources for gene regulation and comparative genomics, and added CRISPR/Cas9 target sites. We released new browser functionality and tools, including improved filtering and prioritization of genome variation, Manhattan plot visualization for linkage disequilibrium and eQTL data, and an ontology search for phenotypes, traits and disease. We have also enhanced data discovery and access with a track hub registry and a selection of new REST end points. All Ensembl data are freely released to the scientific community and our source code is available via the open source Apache 2.0 license.


Subject(s)
Computational Biology/methods , Databases, Genetic , Genomics/methods , Search Engine , Software , Web Browser , Animals , Data Mining , Evolution, Molecular , Gene Expression Regulation , Genetic Variation , Genome, Human , Humans , Molecular Sequence Annotation , Species Specificity , Vertebrates
5.
Nucleic Acids Res ; 44(D1): D574-80, 2016 Jan 04.
Article in English | MEDLINE | ID: mdl-26578574

ABSTRACT

Ensembl Genomes (http://www.ensemblgenomes.org) is an integrating resource for genome-scale data from non-vertebrate species, complementing the resources for vertebrate genomics developed in the context of the Ensembl project (http://www.ensembl.org). Together, the two resources provide a consistent set of programmatic and interactive interfaces to a rich range of data including reference sequence, gene models, transcriptional data, genetic variation and comparative analysis. This paper provides an update to the previous publications about the resource, with a focus on recent developments. These include the development of new analyses and views to represent polyploid genomes (of which bread wheat is the primary exemplar); and the continued up-scaling of the resource, which now includes over 23 000 bacterial genomes, 400 fungal genomes and 100 protist genomes, in addition to 55 genomes from invertebrate metazoa and 39 genomes from plants. This dramatic increase in the number of included genomes is one part of a broader effort to automate the integration of archival data (genome sequence, but also associated RNA sequence data and variant calls) within the context of reference genomes and make it available through the Ensembl user interfaces.


Subject(s)
Databases, Genetic , Genome, Bacterial , Genome, Fungal , Genome, Plant , Invertebrates/genetics , Animals , Diploidy , Eukaryota/genetics , Genetic Variation , Genome , Polyploidy , Sequence Alignment
6.
BMC Genomics ; 18(1): 470, 2017 06 21.
Article in English | MEDLINE | ID: mdl-28637447

ABSTRACT

BACKGROUND: The oil yield trait of oil palm is expected to involve multiple genes, environmental influences and interactions. Many of the underlying mechanisms that contribute to oil yield are still poorly understood. In this study, we used a microarray approach to study the gene expression profiles of mesocarp tissue at different developmental stages, comparing genetically related high- and low- oil yielding palms to identify genes that contributed to the higher oil-yielding palm and might contribute to the wider genetic improvement of oil palm breeding populations. RESULTS: A total of 3412 (2001 annotated) gene candidates were found to be significantly differentially expressed between high- and low-yielding palms at at least one of the different stages of mesocarp development evaluated. Gene Ontologies (GO) enrichment analysis identified 28 significantly enriched GO terms, including regulation of transcription, fatty acid biosynthesis and metabolic processes. These differentially expressed genes comprise several transcription factors, such as, bHLH, Dof zinc finger proteins and MADS box proteins. Several genes involved in glycolysis, TCA, and fatty acid biosynthesis pathways were also found up-regulated in high-yielding oil palm, among them; pyruvate dehydrogenase E1 component Subunit Beta (PDH), ATP-citrate lyase, ß- ketoacyl-ACP synthases I (KAS I), ß- ketoacyl-ACP synthases III (KAS III) and ketoacyl-ACP reductase (KAR). Sucrose metabolism-related genes such as Invertase, Sucrose Synthase 2 and Sucrose Phosphatase 2 were found to be down-regulated in high-yielding oil palms, compared to the lower yield palms. CONCLUSIONS: Our findings indicate that a higher carbon flux (channeled through down-regulation of the Sucrose Synthase 2 pathway) was being utilized by up-regulated genes involved in glycolysis, TCA and fatty acid biosynthesis leading to enhanced oil production in the high-yielding oil palm. These findings are an important stepping stone to understand the processes that lead to production of high-yielding oil palms and have implications for breeding to maximize oil production.


Subject(s)
Arecaceae/growth & development , Arecaceae/genetics , Fruit/growth & development , Fruit/genetics , Gene Expression Profiling , Citric Acid Cycle/genetics , Fatty Acids/biosynthesis , Glycolysis/genetics , Lipid Metabolism/genetics , Transcription Factors/genetics
7.
Nucleic Acids Res ; 42(Database issue): D546-52, 2014 Jan.
Article in English | MEDLINE | ID: mdl-24163254

ABSTRACT

Ensembl Genomes (http://www.ensemblgenomes.org) is an integrating resource for genome-scale data from non-vertebrate species. The project exploits and extends technologies for genome annotation, analysis and dissemination, developed in the context of the vertebrate-focused Ensembl project, and provides a complementary set of resources for non-vertebrate species through a consistent set of programmatic and interactive interfaces. These provide access to data including reference sequence, gene models, transcriptional data, polymorphisms and comparative analysis. This article provides an update to the previous publications about the resource, with a focus on recent developments. These include the addition of important new genomes (and related data sets) including crop plants, vectors of human disease and eukaryotic pathogens. In addition, the resource has scaled up its representation of bacterial genomes, and now includes the genomes of over 9000 bacteria. Specific extensions to the web and programmatic interfaces have been developed to support users in navigating these large data sets. Looking forward, analytic tools to allow targeted selection of data for visualization and download are likely to become increasingly important in future as the number of available genomes increases within all domains of life, and some of the challenges faced in representing bacterial data are likely to become commonplace for eukaryotes in future.


Subject(s)
Databases, Genetic , Genome , Animals , Edible Grain/genetics , Genome, Bacterial , Genome, Fungal , Genome, Plant , Genomics , Internet , Molecular Sequence Annotation , Software
8.
Microarrays (Basel) ; 3(4): 263-81, 2014 Nov 13.
Article in English | MEDLINE | ID: mdl-27600348

ABSTRACT

Gene expression changes that occur during mesocarp development are a major research focus in oil palm research due to the economic importance of this tissue and the relatively rapid increase in lipid content to very high levels at fruit ripeness. Here, we report the development of a transcriptome-based 105,000-probe oil palm mesocarp microarray. The expression of genes involved in fatty acid (FA) and triacylglycerol (TAG) assembly, along with the tricarboxylic acid cycle (TCA) and glycolysis pathway at 16 Weeks After Anthesis (WAA) exhibited significantly higher signals compared to those obtained from a cross-species hybridization to the Arabidopsis (p-value < 0.01), and rice (p-value < 0.01) arrays. The oil palm microarray data also showed comparable correlation of expression (r² = 0.569, p < 0.01) throughout mesocarp development to transcriptome (RNA sequencing) data, and improved correlation over quantitative real-time PCR (qPCR) (r² = 0.721, p < 0.01) of the same RNA samples. The results confirm the advantage of the custom microarray over commercially available arrays derived from model species. We demonstrate the utility of this custom microarray to gain a better understanding of gene expression patterns in the oil palm mesocarp that may lead to increasing future oil yield.

SELECTION OF CITATIONS
SEARCH DETAIL