Search | Portal Regional da BVS

1.

The Reactome Pathway Knowledgebase 2024.

Milacic, Marija; Beavers, Deidre; Conley, Patrick; Gong, Chuqiao; Gillespie, Marc; Griss, Johannes; Haw, Robin; Jassal, Bijay; Matthews, Lisa; May, Bruce; Petryszak, Robert; Ragueneau, Eliot; Rothfels, Karen; Sevilla, Cristoffer; Shamovsky, Veronica; Stephan, Ralf; Tiwari, Krishna; Varusai, Thawfeek; Weiser, Joel; Wright, Adam; Wu, Guanming; Stein, Lincoln; Hermjakob, Henning; D'Eustachio, Peter.

Nucleic Acids Res ; 52(D1): D672-D678, 2024 Jan 05.

Article in English | MEDLINE | ID: mdl-37941124

ABSTRACT

The Reactome Knowledgebase (https://reactome.org), an Elixir and GCBR core biological data resource, provides manually curated molecular details of a broad range of normal and disease-related biological processes. Processes are annotated as an ordered network of molecular transformations in a single consistent data model. Reactome thus functions both as a digital archive of manually curated human biological processes and as a tool for discovering functional relationships in data such as gene expression profiles or somatic mutation catalogs from tumor cells. Here we review progress towards annotation of the entire human proteome, targeted annotation of disease-causing genetic variants of proteins and of small-molecule drugs in a pathway context, and towards supporting explicit annotation of cell- and tissue-specific pathways. Finally, we briefly discuss issues involved in making Reactome more fully interoperable with other related resources such as the Gene Ontology and maintaining the resulting community resource network.

Subjects

Knowledge Bases; Metabolic Networks and Pathways; Signal Transduction; Humans; Metabolic Networks and Pathways/genetics; Proteome/genetics

2.

Using the Reactome Database.

Rothfels, Karen; Milacic, Marija; Matthews, Lisa; Haw, Robin; Sevilla, Cristoffer; Gillespie, Marc; Stephan, Ralf; Gong, Chuqiao; Ragueneau, Eliot; May, Bruce; Shamovsky, Veronica; Wright, Adam; Weiser, Joel; Beavers, Deidre; Conley, Patrick; Tiwari, Krishna; Jassal, Bijay; Griss, Johannes; Senff-Ribeiro, Andrea; Brunson, Timothy; Petryszak, Robert; Hermjakob, Henning; D'Eustachio, Peter; Wu, Guanming; Stein, Lincoln.

Curr Protoc ; 3(4): e722, 2023 Apr.

Article in English | MEDLINE | ID: mdl-37053306

ABSTRACT

Pathway databases provide descriptions of the roles of proteins, nucleic acids, lipids, carbohydrates, and other molecular entities within their biological cellular contexts. Pathway-centric views of these roles may allow for the discovery of unexpected functional relationships in data such as gene expression profiles and somatic mutation catalogues from tumor cells. For this reason, there is a high demand for high-quality pathway databases and their associated tools. The Reactome project (a collaboration between the Ontario Institute for Cancer Research, New York University Langone Health, the European Bioinformatics Institute, and Oregon Health & Science University) is one such pathway database. Reactome collects detailed information on biological pathways and processes in humans from the primary literature. Reactome content is manually curated, expert-authored, and peer-reviewed and spans the gamut from simple intermediate metabolism to signaling pathways and complex cellular events. This information is supplemented with likely orthologous molecular reactions in mouse, rat, zebrafish, worm, and other model organisms. © 2023 The Authors. Current Protocols published by Wiley Periodicals LLC.

Basic Protocol 1: Browsing a Reactome pathway
Basic Protocol 2: Exploring Reactome annotations of disease and drugs
Basic Protocol 3: Finding the pathways involving a gene or protein
Alternate Protocol 1: Finding the pathways involving a gene or protein using UniProtKB (SwissProt), Ensembl, or Entrez gene identifier
Alternate Protocol 2: Using advanced search
Basic Protocol 4: Using the Reactome pathway analysis tool to identify statistically overrepresented pathways
Basic Protocol 5: Using the Reactome pathway analysis tool to overlay expression data onto Reactome pathway diagrams
Basic Protocol 6: Comparing inferred model organism and human pathways using the Species Comparison tool
Basic Protocol 7: Comparing tissue-specific expression using the Tissue Distribution tool

Subjects

Metabolic Networks and Pathways; Zebrafish; Humans; Animals; Mice; Rats; Zebrafish/metabolism; Databases, Protein; Proteins/metabolism; Signal Transduction

3.

A user guide for the online exploration and visualization of PCAWG data.

Goldman, Mary J; Zhang, Junjun; Fonseca, Nuno A; Cortés-Ciriano, Isidro; Xiang, Qian; Craft, Brian; Piñeiro-Yáñez, Elena; O'Connor, Brian D; Bazant, Wojciech; Barrera, Elisabet; Muñoz-Pomer, Alfonso; Petryszak, Robert; Füllgrabe, Anja; Al-Shahrour, Fatima; Keays, Maria; Haussler, David; Weinstein, John N; Huber, Wolfgang; Valencia, Alfonso; Park, Peter J; Papatheodorou, Irene; Zhu, Jingchun; Ferretti, Vincent; Vazquez, Miguel.

Nat Commun ; 11(1): 3400, 2020 07 07.

Article in English | MEDLINE | ID: mdl-32636365

ABSTRACT

The Pan-Cancer Analysis of Whole Genomes (PCAWG) project generated a vast amount of whole-genome cancer sequencing resource data. Here, as part of the ICGC/TCGA Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium, which aggregated whole genome sequencing data from 2658 cancers across 38 tumor types, we provide a user's guide to the five publicly available online data exploration and visualization tools introduced in the PCAWG marker paper. These tools are ICGC Data Portal, UCSC Xena, Chromothripsis Explorer, Expression Atlas, and PCAWG-Scout. We detail use cases and analyses for each tool, show how they incorporate outside resources from the larger genomics ecosystem, and demonstrate how the tools can be used together to understand the biology of cancers more deeply. Together, the tools enable researchers to query the complex genomic PCAWG data dynamically and integrate external information, enabling and enhancing interpretation.

Subjects

Computational Biology/methods; Genome, Human; Neoplasms/genetics; Chromothripsis; Data Analysis; Databases, Genetic; Genomics; Humans; Internet; Mutation; Software; User-Computer Interface; Whole Genome Sequencing

4.

The ELIXIR Core Data Resources: fundamental infrastructure for the life sciences.

Drysdale, Rachel; Cook, Charles E; Petryszak, Robert; Baillie-Gerritsen, Vivienne; Barlow, Mary; Gasteiger, Elisabeth; Gruhl, Franziska; Haas, Jürgen; Lanfear, Jerry; Lopez, Rodrigo; Redaschi, Nicole; Stockinger, Heinz; Teixeira, Daniel; Venkatesan, Aravind; Blomberg, Niklas; Durinx, Christine; McEntyre, Johanna.

Bioinformatics ; 36(8): 2636-2642, 2020 04 15.

Article in English | MEDLINE | ID: mdl-31950984

ABSTRACT

SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Subjects

Biological Science Disciplines; Computational Biology

5.

Expression Atlas update: from tissues to single cells.

Papatheodorou, Irene; Moreno, Pablo; Manning, Jonathan; Fuentes, Alfonso Muñoz-Pomer; George, Nancy; Fexova, Silvie; Fonseca, Nuno A; Füllgrabe, Anja; Green, Matthew; Huang, Ni; Huerta, Laura; Iqbal, Haider; Jianu, Monica; Mohammed, Suhaib; Zhao, Lingyun; Jarnuczak, Andrew F; Jupp, Simon; Marioni, John; Meyer, Kerstin; Petryszak, Robert; Prada Medina, Cesar Augusto; Talavera-López, Carlos; Teichmann, Sarah; Vizcaino, Juan Antonio; Brazma, Alvis.

Nucleic Acids Res ; 48(D1): D77-D83, 2020 01 08.

Article in English | MEDLINE | ID: mdl-31665515

ABSTRACT

Expression Atlas is EMBL-EBI's resource for gene and protein expression. It sources and compiles data on the abundance and localisation of RNA and proteins in various biological systems and contexts and provides open access to this data for the research community. With the increased availability of single cell RNA-Seq datasets in the public archives, we have now extended Expression Atlas with a new added-value service to display gene expression in single cells. Single Cell Expression Atlas was launched in 2018 and currently includes 123 single cell RNA-Seq studies from 12 species. The website can be searched by genes within or across species to reveal experiments, tissues and cell types where this gene is expressed or under which conditions it is a marker gene. Within each study, cells can be visualized using a pre-calculated t-SNE plot and can be coloured by different features or by cell clusters based on gene expression. Within each experiment, there are links to downloadable files, such as RNA quantification matrices, clustering results, reports on protocols and associated metadata, such as assigned cell types.

Subjects

Computational Biology/methods; Databases, Nucleic Acid; Gene Expression Profiling; Software; Gene Expression Profiling/methods; Organ Specificity; Single-Cell Analysis/methods; User-Computer Interface

6.

Re-annotation of 191 developmental and epileptic encephalopathy-associated genes unmasks de novo variants in SCN1A.

Steward, Charles A; Roovers, Jolien; Suner, Marie-Marthe; Gonzalez, Jose M; Uszczynska-Ratajczak, Barbara; Pervouchine, Dmitri; Fitzgerald, Stephen; Viola, Margarida; Stamberger, Hannah; Hamdan, Fadi F; Ceulemans, Berten; Leroy, Patricia; Nava, Caroline; Lepine, Anne; Tapanari, Electra; Keiller, Don; Abbs, Stephen; Sanchis-Juan, Alba; Grozeva, Detelina; Rogers, Anthony S; Diekhans, Mark; Guigó, Roderic; Petryszak, Robert; Minassian, Berge A; Cavalleri, Gianpiero; Vitsios, Dimitrios; Petrovski, Slavé; Harrow, Jennifer; Flicek, Paul; Lucy Raymond, F; Lench, Nicholas J; Jonghe, Peter De; Mudge, Jonathan M; Weckhuysen, Sarah; Sisodiya, Sanjay M; Frankish, Adam.

NPJ Genom Med ; 4: 31, 2019.

Article in English | MEDLINE | ID: mdl-31814998

ABSTRACT

The developmental and epileptic encephalopathies (DEE) are a group of rare, severe neurodevelopmental disorders, where even the most thorough sequencing studies leave 60-65% of patients without a molecular diagnosis. Here, we explore the incompleteness of transcript models used for exome and genome analysis as one potential explanation for a lack of current diagnoses. Therefore, we have updated the GENCODE gene annotation for 191 epilepsy-associated genes, using human brain-derived transcriptomic libraries and other data to build 3,550 putative transcript models. Our annotations increase the transcriptional 'footprint' of these genes by over 674 kb. Using SCN1A as a case study, due to its close phenotype/genotype correlation with Dravet syndrome, we screened 122 people with Dravet syndrome or a similar phenotype with a panel of exon sequences representing eight established genes and identified two de novo SCN1A variants that now - through improved gene annotation - are ascribed to residing among our exons. These two (from 122 screened people, 1.6%) molecular diagnoses carry significant clinical implications. Furthermore, we identified a previously classified SCN1A intronic Dravet syndrome-associated variant that now lies within a deeply conserved exon. Our findings illustrate the potential gains of thorough gene annotation in improving diagnostic yields for genetic disorders.

7.

Quantifying the impact of public omics data.

Perez-Riverol, Yasset; Zorin, Andrey; Dass, Gaurhari; Vu, Manh-Tu; Xu, Pan; Glont, Mihai; Vizcaíno, Juan Antonio; Jarnuczak, Andrew F; Petryszak, Robert; Ping, Peipei; Hermjakob, Henning.

Nat Commun ; 10(1): 3512, 2019 08 05.

Article in English | MEDLINE | ID: mdl-31383865

ABSTRACT

The amount of omics data in the public domain is increasing every year. Modern science has become a data-intensive discipline. Innovative solutions for data management, data sharing, and for discovering novel datasets are therefore increasingly required. In 2016, we released the first version of the Omics Discovery Index (OmicsDI) as a light-weight system to aggregate datasets across multiple public omics data resources. OmicsDI aggregates genomics, transcriptomics, proteomics, metabolomics and multiomics datasets, as well as computational models of biological processes. Here, we propose a set of novel metrics to quantify the attention and impact of biomedical datasets. A complete framework (now integrated into OmicsDI) has been implemented in order to provide and evaluate those metrics. Finally, we propose a set of recommendations for authors, journals and data resources to promote an optimal quantification of the impact of datasets.

Subjects

Access to Information; Datasets as Topic; Information Dissemination; Computational Biology/statistics & numerical data; Gene Expression Profiling/statistics & numerical data; Genomics/statistics & numerical data; Humans; Metabolomics/statistics & numerical data; Proteomics/statistics & numerical data

8.

ArrayExpress update - from bulk to single-cell expression data.

Athar, Awais; Füllgrabe, Anja; George, Nancy; Iqbal, Haider; Huerta, Laura; Ali, Ahmed; Snow, Catherine; Fonseca, Nuno A; Petryszak, Robert; Papatheodorou, Irene; Sarkans, Ugis; Brazma, Alvis.

Nucleic Acids Res ; 47(D1): D711-D715, 2019 01 08.

Article in English | MEDLINE | ID: mdl-30357387

ABSTRACT

ArrayExpress (https://www.ebi.ac.uk/arrayexpress) is an archive of functional genomics data from a variety of technologies assaying functional modalities of a genome, such as gene expression or promoter occupancy. The number of experiments based on sequencing technologies, in particular RNA-seq experiments, has been increasing over the last few years and submissions of sequencing data have overtaken microarray experiments in the last 12 months. Additionally, there is a significant increase in experiments investigating single cells, rather than bulk samples, known as single-cell RNA-seq. To accommodate these trends, we have substantially changed our submission tool Annotare which, along with raw and processed data, collects all metadata necessary to interpret these experiments. Selected datasets are re-processed and loaded into our sister resource, the value-added Expression Atlas (and its component Single Cell Expression Atlas), which not only enables users to interpret the data easily but also serves as a test for data quality. With an increasing number of studies that combine different assay modalities (multi-omics experiments), a new more general archival resource the BioStudies Database has been developed, which will eventually supersede ArrayExpress. Data submissions will continue unchanged; all existing ArrayExpress data will be incorporated into BioStudies and the existing accession numbers and application programming interfaces will be maintained.

Subjects

Oligonucleotide Array Sequence Analysis/methods; Single-Cell Analysis/methods; Software; Databases, Genetic; RNA-Seq/methods

9.

Expression Atlas: gene and protein expression across multiple studies and organisms.

Papatheodorou, Irene; Fonseca, Nuno A; Keays, Maria; Tang, Y Amy; Barrera, Elisabet; Bazant, Wojciech; Burke, Melissa; Füllgrabe, Anja; Fuentes, Alfonso Muñoz-Pomer; George, Nancy; Huerta, Laura; Koskinen, Satu; Mohammed, Suhaib; Geniza, Matthew; Preece, Justin; Jaiswal, Pankaj; Jarnuczak, Andrew F; Huber, Wolfgang; Stegle, Oliver; Vizcaino, Juan Antonio; Brazma, Alvis; Petryszak, Robert.

Nucleic Acids Res ; 46(D1): D246-D251, 2018 01 04.

Article in English | MEDLINE | ID: mdl-29165655

ABSTRACT

Expression Atlas (http://www.ebi.ac.uk/gxa) is an added value database that provides information about gene and protein expression in different species and contexts, such as tissue, developmental stage, disease or cell type. The available public and controlled access data sets from different sources are curated and re-analysed using standardized, open source pipelines and made available for queries, download and visualization. As of August 2017, Expression Atlas holds data from 3,126 studies across 33 different species, including 731 from plants. Data from large-scale RNA sequencing studies including Blueprint, PCAWG, ENCODE, GTEx and HipSci can be visualized next to each other. In Expression Atlas, users can query genes or gene-sets of interest and explore their expression across or within species, tissues, developmental stages in a constitutive or differential context, representing the effects of diseases, conditions or experimental interventions. All processed data matrices are available for direct download in tab-delimited format or as R-data. In addition to the web interface, data sets can now be searched and downloaded through the Expression Atlas R package. Novel features and visualizations include the on-the-fly analysis of gene set overlaps and the option to view gene co-expression in experiments investigating constitutive gene expression across tissues or other conditions.

Subjects

Databases, Genetic; Animals; Gene Expression Profiling; Humans; Mammals/genetics; Mammals/metabolism; Oligonucleotide Array Sequence Analysis; Plants/genetics; Plants/metabolism; Proteomics; Sequence Analysis, RNA; Species Specificity; User-Computer Interface

10.

Gramene 2018: unifying comparative genomics and pathway resources for plant research.

Tello-Ruiz, Marcela K; Naithani, Sushma; Stein, Joshua C; Gupta, Parul; Campbell, Michael; Olson, Andrew; Wei, Sharon; Preece, Justin; Geniza, Matthew J; Jiao, Yinping; Lee, Young Koung; Wang, Bo; Mulvaney, Joseph; Chougule, Kapeel; Elser, Justin; Al-Bader, Noor; Kumari, Sunita; Thomason, James; Kumar, Vivek; Bolser, Daniel M; Naamati, Guy; Tapanari, Electra; Fonseca, Nuno; Huerta, Laura; Iqbal, Haider; Keays, Maria; Munoz-Pomer Fuentes, Alfonso; Tang, Amy; Fabregat, Antonio; D'Eustachio, Peter; Weiser, Joel; Stein, Lincoln D; Petryszak, Robert; Papatheodorou, Irene; Kersey, Paul J; Lockhart, Patti; Taylor, Crispin; Jaiswal, Pankaj; Ware, Doreen.

Nucleic Acids Res ; 46(D1): D1181-D1189, 2018 01 04.

Article in English | MEDLINE | ID: mdl-29165610

ABSTRACT

Gramene (http://www.gramene.org) is a knowledgebase for comparative functional analysis in major crops and model plant species. The current release, #54, includes over 1.7 million genes from 44 reference genomes, most of which were organized into 62,367 gene families through orthologous and paralogous gene classification, whole-genome alignments, and synteny. Additional gene annotations include ontology-based protein structure and function; genetic, epigenetic, and phenotypic diversity; and pathway associations. Gramene's Plant Reactome provides a knowledgebase of cellular-level plant pathway networks. Specifically, it uses curated rice reference pathways to derive pathway projections for an additional 66 species based on gene orthology, and facilitates display of gene expression, gene-gene interactions, and user-defined omics data in the context of these pathways. As a community portal, Gramene integrates best-of-class software and infrastructure components including the Ensembl genome browser, Reactome pathway browser, and Expression Atlas widgets, and undergoes periodic data and software upgrades. Via powerful, intuitive search interfaces, users can easily query across various portals and interactively analyze search results by clicking on diverse features such as genomic context, highly augmented gene trees, gene expression anatomograms, associated pathways, and external informatics resources. All data in Gramene are accessible through both visual and programmatic interfaces.

Subjects

Databases, Genetic; Gene Expression Regulation, Plant; Genomics/methods; Knowledge Bases; Plants/genetics; Epigenesis, Genetic; Gene Ontology; Genetic Research; Genetic Variation; Genome, Plant; Metabolic Networks and Pathways/genetics; Molecular Sequence Annotation; Plants/metabolism; Software; User-Computer Interface

11.

Discovering and linking public omics data sets using the Omics Discovery Index.

Perez-Riverol, Yasset; Bai, Mingze; da Veiga Leprevost, Felipe; Squizzato, Silvano; Park, Young Mi; Haug, Kenneth; Carroll, Adam J; Spalding, Dylan; Paschall, Justin; Wang, Mingxun; Del-Toro, Noemi; Ternent, Tobias; Zhang, Peng; Buso, Nicola; Bandeira, Nuno; Deutsch, Eric W; Campbell, David S; Beavis, Ronald C; Salek, Reza M; Sarkans, Ugis; Petryszak, Robert; Keays, Maria; Fahy, Eoin; Sud, Manish; Subramaniam, Shankar; Barbera, Ariana; Jiménez, Rafael C; Nesvizhskii, Alexey I; Sansone, Susanna-Assunta; Steinbeck, Christoph; Lopez, Rodrigo; Vizcaíno, Juan A; Ping, Peipei; Hermjakob, Henning.

Nat Biotechnol ; 35(5): 406-409, 2017 05 09.

Article in English | MEDLINE | ID: mdl-28486464

Subjects

Data Mining/methods; Genomics; Information Storage and Retrieval/methods; Proteomics; Computational Biology; Humans

12.

The RNASeq-er API-a gateway to systematically updated analysis of public RNA-seq data.

Petryszak, Robert; Fonseca, Nuno A; Füllgrabe, Anja; Huerta, Laura; Keays, Maria; Tang, Y Amy; Brazma, Alvis.

Bioinformatics ; 33(14): 2218-2220, 2017 Jul 15.

Article in English | MEDLINE | ID: mdl-28369191

ABSTRACT

MOTIVATION: The exponential growth of publicly available RNA-sequencing (RNA-Seq) data poses an increasing challenge to researchers wishing to discover, analyse and store such data, particularly those based in institutions with limited computational resources. EMBL-EBI is in an ideal position to address these challenges and to allow the scientific community easy access to not just raw, but also processed RNA-Seq data. We present a Web service to access the results of a systematically and continually updated standardized alignment as well as gene and exon expression quantification of all public bulk (and in the near future also single-cell) RNA-Seq runs in 264 species in European Nucleotide Archive, using Representational State Transfer. RESULTS: The RNASeq-er API (Application Programming Interface) enables ontology-powered search for and retrieval of CRAM, bigwig and bedGraph files, gene and exon expression quantification matrices (Fragments Per Kilobase Of Exon Per Million Fragments Mapped, Transcripts Per Million, raw counts) as well as sample attributes annotated with ontology terms. To date over 270 000 RNA-Seq runs in nearly 10 000 studies (1 PB of raw FASTQ data) in 264 species in ENA have been processed and made available via the API. AVAILABILITY AND IMPLEMENTATION: The RNASeq-er API can be accessed at http://www.ebi.ac.uk/fg/rnaseq/api. The commands used to analyse the data are available in supplementary materials and at https://github.com/nunofonseca/irap/wiki/iRAP-single-library. CONTACT: rnaseq@ebi.ac.uk; rpetry@ebi.ac.uk. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Subjects

Computational Biology/methods; Eukaryota/genetics; Sequence Analysis, RNA/methods; Software; Transcriptome; Animals; Databases, Genetic; Gene Expression; Gene Ontology; Humans; Internet

13.

Plant Reactome: a resource for plant pathways and comparative analysis.

Naithani, Sushma; Preece, Justin; D'Eustachio, Peter; Gupta, Parul; Amarasinghe, Vindhya; Dharmawardhana, Palitha D; Wu, Guanming; Fabregat, Antonio; Elser, Justin L; Weiser, Joel; Keays, Maria; Fuentes, Alfonso Munoz-Pomer; Petryszak, Robert; Stein, Lincoln D; Ware, Doreen; Jaiswal, Pankaj.

Nucleic Acids Res ; 45(D1): D1029-D1039, 2017 01 04.

Article in English | MEDLINE | ID: mdl-27799469

ABSTRACT

Plant Reactome (http://plantreactome.gramene.org/) is a free, open-source, curated plant pathway database portal, provided as part of the Gramene project. The database provides intuitive bioinformatics tools for the visualization, analysis and interpretation of pathway knowledge to support genome annotation, genome analysis, modeling, systems biology, basic research and education. Plant Reactome employs the structural framework of a plant cell to show metabolic, transport, genetic, developmental and signaling pathways. We manually curate molecular details of pathways in these domains for reference species Oryza sativa (rice) supported by published literature and annotation of well-characterized genes. Two hundred twenty-two rice pathways, 1025 reactions associated with 1173 proteins, 907 small molecules and 256 literature references have been curated to date. These reference annotations were used to project pathways for 62 model, crop and evolutionarily significant plant species based on gene homology. Database users can search and browse various components of the database, visualize curated baseline expression of pathway-associated genes provided by the Expression Atlas and upload and analyze their Omics datasets. The database also offers data access via Application Programming Interfaces (APIs) and in various standardized pathway formats, such as SBML and BioPAX.

Subjects

Computational Biology/methods; Databases, Genetic; Plants/genetics; Plants/metabolism; Search Engine; Genomics/methods; Metabolic Networks and Pathways; Signal Transduction; Systems Biology/methods; User-Computer Interface; Web Browser

14.

Open Targets: a platform for therapeutic target identification and validation.

Koscielny, Gautier; An, Peter; Carvalho-Silva, Denise; Cham, Jennifer A; Fumis, Luca; Gasparyan, Rippa; Hasan, Samiul; Karamanis, Nikiforos; Maguire, Michael; Papa, Eliseo; Pierleoni, Andrea; Pignatelli, Miguel; Platt, Theo; Rowland, Francis; Wankar, Priyanka; Bento, A Patrícia; Burdett, Tony; Fabregat, Antonio; Forbes, Simon; Gaulton, Anna; Gonzalez, Cristina Yenyxe; Hermjakob, Henning; Hersey, Anne; Jupe, Steven; Kafkas, Senay; Keays, Maria; Leroy, Catherine; Lopez, Francisco-Javier; Magarinos, Maria Paula; Malone, James; McEntyre, Johanna; Munoz-Pomer Fuentes, Alfonso; O'Donovan, Claire; Papatheodorou, Irene; Parkinson, Helen; Palka, Barbara; Paschall, Justin; Petryszak, Robert; Pratanwanich, Naruemon; Sarntivijal, Sirarat; Saunders, Gary; Sidiropoulos, Konstantinos; Smith, Thomas; Sondka, Zbyslaw; Stegle, Oliver; Tang, Y Amy; Turner, Edward; Vaughan, Brendan; Vrousgou, Olga; Watkins, Xavier.

Nucleic Acids Res ; 45(D1): D985-D994, 2017 01 04.

Article in English | MEDLINE | ID: mdl-27899665

ABSTRACT

We have designed and developed a data integration and visualization platform that provides evidence about the association of known and potential drug targets with diseases. The platform is designed to support identification and prioritization of biological targets for follow-up. Each drug target is linked to a disease using integrated genome-wide data from a broad range of data sources. The platform provides either a target-centric workflow to identify diseases that may be associated with a specific target, or a disease-centric workflow to identify targets that may be associated with a specific disease. Users can easily transition between these target- and disease-centric workflows. The Open Targets Validation Platform is accessible at https://www.targetvalidation.org.

Subjects

Computational Biology/methods; Molecular Targeted Therapy; Search Engine; Software; Databases, Factual; Humans; Molecular Targeted Therapy/methods; Reproducibility of Results; Web Browser; Workflow

15.

Gramene Database: Navigating Plant Comparative Genomics Resources.

Gupta, Parul; Naithani, Sushma; Tello-Ruiz, Marcela Karey; Chougule, Kapeel; D'Eustachio, Peter; Fabregat, Antonio; Jiao, Yinping; Keays, Maria; Lee, Young Koung; Kumari, Sunita; Mulvaney, Joseph; Olson, Andrew; Preece, Justin; Stein, Joshua; Wei, Sharon; Weiser, Joel; Huerta, Laura; Petryszak, Robert; Kersey, Paul; Stein, Lincoln D; Ware, Doreen; Jaiswal, Pankaj.

Curr Plant Biol ; 7-8: 10-15, 2016 Nov.

Article in English | MEDLINE | ID: mdl-28713666

ABSTRACT

Gramene (http://www.gramene.org) is an online, open source, curated resource for plant comparative genomics and pathway analysis designed to support researchers working in plant genomics, breeding, evolutionary biology, system biology, and metabolic engineering. It exploits phylogenetic relationships to enrich the annotation of genomic data and provides tools to perform powerful comparative analyses across a wide spectrum of plant species. It consists of an integrated portal for querying, visualizing and analyzing data for 44 plant reference genomes, genetic variation data sets for 12 species, expression data for 16 species, curated rice pathways and orthology-based pathway projections for 66 plant species including various crops. Here we briefly describe the functions and uses of the Gramene database.

16.

Expression Atlas update--an integrated database of gene and protein expression in humans, animals and plants.

Petryszak, Robert; Keays, Maria; Tang, Y Amy; Fonseca, Nuno A; Barrera, Elisabet; Burdett, Tony; Füllgrabe, Anja; Fuentes, Alfonso Muñoz-Pomer; Jupp, Simon; Koskinen, Satu; Mannion, Oliver; Huerta, Laura; Megy, Karine; Snow, Catherine; Williams, Eleanor; Barzine, Mitra; Hastings, Emma; Weisser, Hendrik; Wright, James; Jaiswal, Pankaj; Huber, Wolfgang; Choudhary, Jyoti; Parkinson, Helen E; Brazma, Alvis.

Nucleic Acids Res ; 44(D1): D746-52, 2016 Jan 04.

Article in English | MEDLINE | ID: mdl-26481351

ABSTRACT

Expression Atlas (http://www.ebi.ac.uk/gxa) provides information about gene and protein expression in animal and plant samples of different cell types, organism parts, developmental stages, diseases and other conditions. It consists of selected microarray and RNA-sequencing studies from ArrayExpress, which have been manually curated, annotated with ontology terms, checked for high quality and processed using standardised analysis methods. Since the last update, Atlas has grown seven-fold (1572 studies as of August 2015), and incorporates baseline expression profiles of tissues from Human Protein Atlas, GTEx and FANTOM5, and of cancer cell lines from ENCODE, CCLE and Genentech projects. Plant studies constitute a quarter of Atlas data. For genes of interest, the user can view baseline expression in tissues, and differential expression for biologically meaningful pairwise comparisons-estimated using consistent methodology across all of Atlas. Our first proteomics study in human tissues is now displayed alongside transcriptomics data in the same tissues. Novel analyses and visualisations include: 'enrichment' in each differential comparison of GO terms, Reactome, Plant Reactome pathways and InterPro domains; hierarchical clustering (by baseline expression) of most variable genes and experimental conditions; and, for a given gene-condition, distribution of baseline expression across biological replicates.

Subjects

Databases, Genetic; Gene Expression Profiling; Plants/metabolism; Proteins/metabolism; Proteomics; Animals; Cell Line, Tumor; Humans; Plants/genetics; User-Computer Interface

17.

Gramene 2016: comparative plant genomics and pathway resources.

Tello-Ruiz, Marcela K; Stein, Joshua; Wei, Sharon; Preece, Justin; Olson, Andrew; Naithani, Sushma; Amarasinghe, Vindhya; Dharmawardhana, Palitha; Jiao, Yinping; Mulvaney, Joseph; Kumari, Sunita; Chougule, Kapeel; Elser, Justin; Wang, Bo; Thomason, James; Bolser, Daniel M; Kerhornou, Arnaud; Walts, Brandon; Fonseca, Nuno A; Huerta, Laura; Keays, Maria; Tang, Y Amy; Parkinson, Helen; Fabregat, Antonio; McKay, Sheldon; Weiser, Joel; D'Eustachio, Peter; Stein, Lincoln; Petryszak, Robert; Kersey, Paul J; Jaiswal, Pankaj; Ware, Doreen.

Nucleic Acids Res ; 44(D1): D1133-40, 2016 Jan 04.

Article in English | MEDLINE | ID: mdl-26553803

ABSTRACT

Gramene (http://www.gramene.org) is an online resource for comparative functional genomics in crops and model plant species. Its two main frameworks are genomes (collaboration with Ensembl Plants) and pathways (The Plant Reactome and archival BioCyc databases). Since our last NAR update, the database website adopted a new Drupal management platform. The genomes section features 39 fully assembled reference genomes that are integrated using ontology-based annotation and comparative analyses, and accessed through both visual and programmatic interfaces. Additional community data, such as genetic variation, expression and methylation, are also mapped for a subset of genomes. The Plant Reactome pathway portal (http://plantreactome.gramene.org) provides a reference resource for analyzing plant metabolic and regulatory pathways. In addition to ~200 curated rice reference pathways, the portal hosts gene homology-based pathway projections for 33 plant species. Both the genome and pathway browsers interface with the EMBL-EBI's Expression Atlas to enable the projection of baseline and differential expression data from curated expression studies in plants. Gramene's archive website (http://archive.gramene.org) continues to provide previously reported resources on comparative maps, markers and QTL. To further aid our users, we have also introduced a live monthly educational webinar series and a Gramene YouTube channel carrying video tutorials.

Subjects

Databases, Genetic; Genome, Plant; Plants/metabolism; Gene Expression; Genetic Variation; Genomics; Internet; Metabolic Networks and Pathways; Molecular Sequence Annotation; Plants/genetics

18.

Comparison of GENCODE and RefSeq gene annotation and the impact of reference geneset on variant effect prediction.

Frankish, Adam; Uszczynska, Barbara; Ritchie, Graham R S; Gonzalez, Jose M; Pervouchine, Dmitri; Petryszak, Robert; Mudge, Jonathan M; Fonseca, Nuno; Brazma, Alvis; Guigo, Roderic; Harrow, Jennifer.

BMC Genomics ; 16 Suppl 8: S2, 2015.

Article in English | MEDLINE | ID: mdl-26110515

ABSTRACT

BACKGROUND: A vast amount of DNA variation is being identified by increasingly large-scale exome and genome sequencing projects. To be useful, variants require accurate functional annotation and a wide range of tools are available to this end. McCarthy et al recently demonstrated the large differences in prediction of loss-of-function (LoF) variation when RefSeq and Ensembl transcripts are used for annotation, highlighting the importance of the reference transcripts on which variant functional annotation is based. RESULTS: We describe a detailed analysis of the similarities and differences between the gene and transcript annotation in the GENCODE and RefSeq genesets. We demonstrate that the GENCODE Comprehensive set is richer in alternative splicing, novel CDSs, novel exons and has higher genomic coverage than RefSeq, while the GENCODE Basic set is very similar to RefSeq. Using RNAseq data we show that exons and introns unique to one geneset are expressed at a similar level to those common to both. We present evidence that the differences in gene annotation lead to large differences in variant annotation where GENCODE and RefSeq are used as reference transcripts, although this is predominantly confined to non-coding transcripts and UTR sequence, with at most ~30% of LoF variants annotated discordantly. We also describe an investigation of dominant transcript expression, showing that it both supports the utility of the GENCODE Basic set in providing a smaller set of more highly expressed transcripts and provides a useful, biologically-relevant filter for further reducing the complexity of the transcriptome. CONCLUSIONS: The reference transcripts selected for variant functional annotation do have a large effect on the outcome. 
The GENCODE Comprehensive transcripts contain more exons, have greater genomic coverage and capture many more variants than RefSeq in both genome and exome datasets, while the GENCODE Basic set shows a higher degree of concordance with RefSeq and has fewer unique features. We propose that the GENCODE Comprehensive set has great utility for the discovery of new variants with functional potential, while the GENCODE Basic set is more suitable for applications demanding less complex interpretation of functional variants.

Subjects

Computational Biology; Genome, Human; Molecular Sequence Annotation; Protein Isoforms/metabolism; Software; Alternative Splicing; Databases, Genetic; Humans; Protein Isoforms/genetics; Transcriptome

19.

ArrayExpress update--simplifying data submissions.

Kolesnikov, Nikolay; Hastings, Emma; Keays, Maria; Melnichuk, Olga; Tang, Y Amy; Williams, Eleanor; Dylag, Miroslaw; Kurbatova, Natalja; Brandizi, Marco; Burdett, Tony; Megy, Karyn; Pilicheva, Ekaterina; Rustici, Gabriella; Tikhonov, Andrew; Parkinson, Helen; Petryszak, Robert; Sarkans, Ugis; Brazma, Alvis.

Nucleic Acids Res ; 43(Database issue): D1113-6, 2015 Jan.

Article in English | MEDLINE | ID: mdl-25361974

ABSTRACT

The ArrayExpress Archive of Functional Genomics Data (http://www.ebi.ac.uk/arrayexpress) is an international functional genomics database at the European Bioinformatics Institute (EMBL-EBI) recommended by most journals as a repository for data supporting peer-reviewed publications. It contains data from over 7000 public sequencing and 42,000 array-based studies comprising over 1.5 million assays in total. The proportion of sequencing-based submissions has grown significantly over the last few years and has doubled in the last 18 months, whilst the rate of microarray submissions is growing slightly. All data in ArrayExpress are available in the MAGE-TAB format, which allows robust linking to data analysis and visualization tools and standardized analysis. The main development over the last two years has been the release of a new data submission tool Annotare, which has reduced the average submission time almost 3-fold. In the near future, Annotare will become the only submission route into ArrayExpress, alongside MAGE-TAB format-based pipelines. ArrayExpress is a stable and highly accessed resource. Our future tasks include automation of data flows and further integration with other EMBL-EBI resources for the representation of multi-omics data.

Subjects

Databases, Genetic; Gene Expression Profiling; Oligonucleotide Array Sequence Analysis; Genomics; High-Throughput Nucleotide Sequencing; Internet; Software

20.

Expression Atlas update--a database of gene and transcript expression from microarray- and sequencing-based functional genomics experiments.

Petryszak, Robert; Burdett, Tony; Fiorelli, Benedetto; Fonseca, Nuno A; Gonzalez-Porta, Mar; Hastings, Emma; Huber, Wolfgang; Jupp, Simon; Keays, Maria; Kryvych, Nataliya; McMurry, Julie; Marioni, John C; Malone, James; Megy, Karine; Rustici, Gabriella; Tang, Amy Y; Taubert, Jan; Williams, Eleanor; Mannion, Oliver; Parkinson, Helen E; Brazma, Alvis.

Nucleic Acids Res ; 42(Database issue): D926-32, 2014 Jan.

Article in English | MEDLINE | ID: mdl-24304889

ABSTRACT

Expression Atlas (http://www.ebi.ac.uk/gxa) is a value-added database providing information about gene, protein and splice variant expression in different cell types, organism parts, developmental stages, diseases and other biological and experimental conditions. The database consists of selected high-quality microarray and RNA-sequencing experiments from ArrayExpress that have been manually curated, annotated with Experimental Factor Ontology terms and processed using standardized microarray and RNA-sequencing analysis methods. The new version of Expression Atlas introduces the concept of 'baseline' expression, i.e. gene and splice variant abundance levels in healthy or untreated conditions, such as tissues or cell types. Differential gene expression data benefit from an in-depth curation of experimental intent, resulting in biologically meaningful 'contrasts', i.e. instances of differential pairwise comparisons between two sets of biological replicates. Other novel aspects of Expression Atlas are its strict quality control of raw experimental data, up-to-date RNA-sequencing analysis methods, expression data at the level of gene sets, as well as genes and a more powerful search interface designed to maximize the biological value provided to the user.

Subjects

Databases, Genetic; Gene Expression Profiling; Genomics; Humans; Internet; Oligonucleotide Array Sequence Analysis; Proteins/genetics; Proteins/metabolism; RNA Isoforms/metabolism; Sequence Analysis, RNA
