Pesquisa | BVS Integralidade em Saúde

1.

Mouse Phenome Database: towards a more FAIR-compliant and TRUST-worthy data repository and tool suite for phenotypes and genotypes.

Bogue, Molly A; Ball, Robyn L; Philip, Vivek M; Walton, David O; Dunn, Matthew H; Kolishovski, Georgi; Lamoureux, Anna; Gerring, Matthew; Liang, Hongping; Emerson, Jake; Stearns, Tim; He, Hao; Mukherjee, Gaurab; Bluis, John; Desai, Sejal; Sundberg, Beth; Kadakkuzha, Beena; Kunde-Ramamoorthy, Govindarajan; Chesler, Elissa J.

Nucleic Acids Res ; 51(D1): D1067-D1074, 2023 01 06.

Artigo em Inglês | MEDLINE | ID: mdl-36330959

RESUMO

The Mouse Phenome Database (MPD; https://phenome.jax.org; RRID:SCR_003212), supported by the US National Institutes of Health, is a Biomedical Data Repository listed in the Trans-NIH Biomedical Informatics Coordinating Committee registry. As an increasingly FAIR-compliant and TRUST-worthy data repository, MPD accepts phenotype and genotype data from mouse experiments and curates, organizes, integrates, archives, and distributes those data using community standards. Data are accompanied by rich metadata, including widely used ontologies and detailed protocols. Data are from all over the world and represent genetic, behavioral, morphological, and physiological disease-related characteristics in mice at baseline or those exposed to drugs or other treatments. MPD houses data from over 6000 strains and populations, representing many reproducible strain types and heterogenous populations such as the Diversity Outbred where each mouse is unique but can be genotyped throughout the genome. A suite of analysis tools is available to aggregate, visualize, and analyze these data within and across studies and populations in an increasingly traceable and reproducible manner. We have refined existing resources and developed new tools to continue to provide users with access to consistent, high-quality data that has translational relevance in a modernized infrastructure that enables interaction with a suite of bioinformatics analytic and data services.

Assuntos

Bases de Dados Genéticas , Fenômica , Camundongos , Animais , Camundongos Endogâmicos , Fenótipo , Genótipo

2.

Mouse phenome database: curated data repository with interactive multi-population and multi-trait analyses.

Bogue, Molly A; Ball, Robyn L; Walton, David O; Dunn, Matthew H; Kolishovski, Georgi; Berger, Alexander; Lamoureux, Anna; Grubb, Stephen C; Gerring, Matthew; Kim, Matthew; Liang, Hongping; Emerson, Jake; Stearns, Timothy; He, Hao; Mukherjee, Gaurab; Bluis, John; Davis, Sara; Desai, Sejal; Sundberg, Beth; Kadakkuzha, Beena; Kunde-Ramamoorthy, Govindarajan; Philip, Vivek M; Chesler, Elissa J.

Mamm Genome ; 34(4): 509-519, 2023 12.

Artigo em Inglês | MEDLINE | ID: mdl-37581698

RESUMO

The Mouse Phenome Database continues to serve as a curated repository and analysis suite for measured attributes of members of diverse mouse populations. The repository includes annotation to community standard ontologies and guidelines, a database of allelic states for 657 mouse strains, a collection of protocols, and analysis tools for flexible, interactive, user directed analyses that increasingly integrates data across traits and populations. The database has grown from its initial focus on a standard set of inbred strains to include heterogeneous mouse populations such as the Diversity Outbred and mapping crosses and well as Collaborative Cross, Hybrid Mouse Diversity Panel, and recombinant inbred strains. Most recently the system has expanded to include data from the International Mouse Phenotyping Consortium. Collectively these data are accessible by API and provided with an interactive tool suite that enables users' persistent selection, storage, and operation on collections of measures. The tool suite allows basic analyses, advanced functions with dynamic visualization including multi-population meta-analysis, multivariate outlier detection, trait pattern matching, correlation analyses and other functions. The data resources and analysis suite provide users a flexible environment in which to explore the basis of phenotypic variation in health and disease across the lifespan.

Assuntos

Fenômica , Camundongos , Animais , Camundongos Endogâmicos , Fenótipo

3.

Mouse Phenome Database: a data repository and analysis suite for curated primary mouse phenotype data.

Bogue, Molly A; Philip, Vivek M; Walton, David O; Grubb, Stephen C; Dunn, Matthew H; Kolishovski, Georgi; Emerson, Jake; Mukherjee, Gaurab; Stearns, Timothy; He, Hao; Sinha, Vinita; Kadakkuzha, Beena; Kunde-Ramamoorthy, Govindarajan; Chesler, Elissa J.

Nucleic Acids Res ; 48(D1): D716-D723, 2020 01 08.

Artigo em Inglês | MEDLINE | ID: mdl-31696236

RESUMO

The Mouse Phenome Database (MPD; https://phenome.jax.org) is a widely accessed and highly functional data repository housing primary phenotype data for the laboratory mouse accessible via APIs and providing tools to analyze and visualize those data. Data come from investigators around the world and represent a broad scope of phenotyping endpoints and disease-related traits in naïve mice and those exposed to drugs, environmental agents or other treatments. MPD houses rigorously curated per-animal data with detailed protocols. Public ontologies and controlled vocabularies are used for annotation. In addition to phenotype tools, genetic analysis tools enable users to integrate and interpret genome-phenome relations across the database. Strain types and populations include inbred, recombinant inbred, F1 hybrid, transgenic, targeted mutants, chromosome substitution, Collaborative Cross, Diversity Outbred and other mapping populations. Our new analysis tools allow users to apply selected data in an integrated fashion to address problems in trait associations, reproducibility, polygenic syndrome model selection and multi-trait modeling. As we refine these tools and approaches, we will continue to provide users a means to identify consistent, quality studies that have high translational relevance.

Assuntos

Biologia Computacional/métodos , Bases de Dados Genéticas , Genoma , Fenômica , Fenótipo , Algoritmos , Animais , Modelos Animais de Doenças , Camundongos , Camundongos Endogâmicos , Camundongos Transgênicos , Mutação , Linguagens de Programação , Ferramenta de Busca , Software , Especificidade da Espécie , Navegador

4.

GENCODE: the reference human genome annotation for The ENCODE Project.

Harrow, Jennifer; Frankish, Adam; Gonzalez, Jose M; Tapanari, Electra; Diekhans, Mark; Kokocinski, Felix; Aken, Bronwen L; Barrell, Daniel; Zadissa, Amonida; Searle, Stephen; Barnes, If; Bignell, Alexandra; Boychenko, Veronika; Hunt, Toby; Kay, Mike; Mukherjee, Gaurab; Rajan, Jeena; Despacio-Reyes, Gloria; Saunders, Gary; Steward, Charles; Harte, Rachel; Lin, Michael; Howald, Cédric; Tanzer, Andrea; Derrien, Thomas; Chrast, Jacqueline; Walters, Nathalie; Balasubramanian, Suganthi; Pei, Baikang; Tress, Michael; Rodriguez, Jose Manuel; Ezkurdia, Iakes; van Baren, Jeltje; Brent, Michael; Haussler, David; Kellis, Manolis; Valencia, Alfonso; Reymond, Alexandre; Gerstein, Mark; Guigó, Roderic; Hubbard, Tim J.

Genome Res ; 22(9): 1760-74, 2012 Sep.

Artigo em Inglês | MEDLINE | ID: mdl-22955987

RESUMO

The GENCODE Consortium aims to identify all gene features in the human genome using a combination of computational analysis, manual annotation, and experimental validation. Since the first public release of this annotation data set, few new protein-coding loci have been added, yet the number of alternative splicing transcripts annotated has steadily increased. The GENCODE 7 release contains 20,687 protein-coding and 9640 long noncoding RNA loci and has 33,977 coding transcripts not represented in UCSC genes and RefSeq. It also has the most comprehensive annotation of long noncoding RNA (lncRNA) loci publicly available with the predominant transcript form consisting of two exons. We have examined the completeness of the transcript annotation and found that 35% of transcriptional start sites are supported by CAGE clusters and 62% of protein-coding genes have annotated polyA sites. Over one-third of GENCODE protein-coding genes are supported by peptide hits derived from mass spectrometry spectra submitted to Peptide Atlas. New models derived from the Illumina Body Map 2.0 RNA-seq data identify 3689 new loci not currently in GENCODE, of which 3127 consist of two exon models indicating that they are possibly unannotated long noncoding loci. GENCODE 7 is publicly available from gencodegenes.org and via the Ensembl and UCSC Genome Browsers.

Assuntos

Bases de Dados Genéticas , Genoma Humano , Genômica/métodos , Anotação de Sequência Molecular , Animais , Biologia Computacional/métodos , DNA Complementar/química , DNA Complementar/genética , Evolução Molecular , Éxons , Loci Gênicos , Humanos , Internet , Modelos Moleculares , Fases de Leitura Aberta , Pseudogenes , Controle de Qualidade , Sítios de Splice de RNA , RNA Longo não Codificante , Reprodutibilidade dos Testes , Regiões não Traduzidas

5.

Petabyte-scale innovations at the European Nucleotide Archive.

Cochrane, Guy; Akhtar, Ruth; Bonfield, James; Bower, Lawrence; Demiralp, Fehmi; Faruque, Nadeem; Gibson, Richard; Hoad, Gemma; Hubbard, Tim; Hunter, Christopher; Jang, Mikyung; Juhos, Szilveszter; Leinonen, Rasko; Leonard, Steven; Lin, Quan; Lopez, Rodrigo; Lorenc, Dariusz; McWilliam, Hamish; Mukherjee, Gaurab; Plaister, Sheila; Radhakrishnan, Rajesh; Robinson, Stephen; Sobhany, Siamak; Hoopen, Petra Ten; Vaughan, Robert; Zalunin, Vadim; Birney, Ewan.

Nucleic Acids Res ; 37(Database issue): D19-25, 2009 Jan.

Artigo em Inglês | MEDLINE | ID: mdl-18978013

RESUMO

Dramatic increases in the throughput of nucleotide sequencing machines, and the promise of ever greater performance, have thrust bioinformatics into the era of petabyte-scale data sets. Sequence repositories, which provide the feed for these data sets into the worldwide computational infrastructure, are challenged by the impact of these data volumes. The European Nucleotide Archive (ENA; http://www.ebi.ac.uk/embl), comprising the EMBL Nucleotide Sequence Database and the Ensembl Trace Archive, has identified challenges in the storage, movement, analysis, interpretation and visualization of petabyte-scale data sets. We present here our new repository for next generation sequence data, a brief summary of contents of the ENA and provide details of major developments to submission pipelines, high-throughput rule-based validation infrastructure and data integration approaches.

Assuntos

Bases de Dados de Ácidos Nucleicos , Análise de Sequência/tendências , Internet , Integração de Sistemas

6.

Priorities for nucleotide trace, sequence and annotation data capture at the Ensembl Trace Archive and the EMBL Nucleotide Sequence Database.

Cochrane, Guy; Akhtar, Ruth; Aldebert, Philippe; Althorpe, Nicola; Baldwin, Alastair; Bates, Kirsty; Bhattacharyya, Sumit; Bonfield, James; Bower, Lawrence; Browne, Paul; Castro, Matias; Cox, Tony; Demiralp, Fehmi; Eberhardt, Ruth; Faruque, Nadeem; Hoad, Gemma; Jang, Mikyung; Kulikova, Tamara; Labarga, Alberto; Leinonen, Rasko; Leonard, Steven; Lin, Quan; Lopez, Rodrigo; Lorenc, Dariusz; McWilliam, Hamish; Mukherjee, Gaurab; Nardone, Francesco; Plaister, Sheila; Robinson, Stephen; Sobhany, Siamak; Vaughan, Robert; Wu, Dan; Zhu, Weimin; Apweiler, Rolf; Hubbard, Tim; Birney, Ewan.

Nucleic Acids Res ; 36(Database issue): D5-12, 2008 Jan.

Artigo em Inglês | MEDLINE | ID: mdl-18039715

RESUMO

The Ensembl Trace Archive (http://trace.ensembl.org/) and the EMBL Nucleotide Sequence Database (http://www.ebi.ac.uk/embl/), known together as the European Nucleotide Archive, continue to see growth in data volume and diversity. Selected major developments of 2007 are presented briefly, along with data submission and retrieval information. In the face of increasing requirements for nucleotide trace, sequence and annotation data archiving, data capture priority decisions have been taken at the European Nucleotide Archive. Priorities are discussed in terms of how reliably information can be captured, the long-term benefits of its capture and the ease with which it can be captured.

Assuntos

Bases de Dados de Ácidos Nucleicos , Análise de Sequência de DNA , Animais , Arquivos , Genômica , Internet

7.

Investigation of COVID-19 comorbidities reveals genes and pathways coincident with the SARS-CoV-2 viral disease.

Dolan, Mary E; Hill, David P; Mukherjee, Gaurab; McAndrews, Monica S; Chesler, Elissa J; Blake, Judith A.

Sci Rep ; 10(1): 20848, 2020 11 30.

Artigo em Inglês | MEDLINE | ID: mdl-33257774

RESUMO

The emergence of the SARS-CoV-2 virus and subsequent COVID-19 pandemic initiated intense research into the mechanisms of action for this virus. It was quickly noted that COVID-19 presents more seriously in conjunction with other human disease conditions such as hypertension, diabetes, and lung diseases. We conducted a bioinformatics analysis of COVID-19 comorbidity-associated gene sets, identifying genes and pathways shared among the comorbidities, and evaluated current knowledge about these genes and pathways as related to current information about SARS-CoV-2 infection. We performed our analysis using GeneWeaver (GW), Reactome, and several biomedical ontologies to represent and compare common COVID-19 comorbidities. Phenotypic analysis of shared genes revealed significant enrichment for immune system phenotypes and for cardiovascular-related phenotypes, which might point to alleles and phenotypes in mouse models that could be evaluated for clues to COVID-19 severity. Through pathway analysis, we identified enriched pathways shared by comorbidity datasets and datasets associated with SARS-CoV-2 infection.

Assuntos

COVID-19/mortalidade , COVID-19/patologia , Biologia Computacional/métodos , Animais , Doenças Cardiovasculares/epidemiologia , Doenças Cardiovasculares/genética , Comorbidade , Síndrome da Liberação de Citocina/mortalidade , Bases de Dados Genéticas , Diabetes Mellitus/epidemiologia , Diabetes Mellitus/genética , Modelos Animais de Doenças , Hepatite/epidemiologia , Hepatite/genética , Humanos , Nefropatias/epidemiologia , Nefropatias/genética , Pneumopatias/epidemiologia , Pneumopatias/genética , Camundongos , Síndrome do Desconforto Respiratório/mortalidade , SARS-CoV-2 , Índice de Gravidade de Doença

8.

Investigation of COVID-19 comorbidities reveals genes and pathways coincident with the SARS-CoV-2 viral disease.

Dolan, Mary E; Hill, David P; Mukherjee, Gaurab; McAndrews, Monica S; Chesler, Elissa J; Blake, Judith A.

bioRxiv ; 2020 Sep 21.

Artigo em Inglês | MEDLINE | ID: mdl-32995795

RESUMO

The emergence of the SARS-CoV-2 virus and subsequent COVID-19 pandemic initiated intense research into the mechanisms of action for this virus. It was quickly noted that COVID-19 presents more seriously in conjunction with other human disease conditions such as hypertension, diabetes, and lung diseases. We conducted a bioinformatics analysis of COVID-19 comorbidity-associated gene sets, identifying genes and pathways shared among the comorbidities, and evaluated current knowledge about these genes and pathways as related to current information about SARS-CoV-2 infection. We performed our analysis using GeneWeaver (GW), Reactome, and several biomedical ontologies to represent and compare common COVID-19 comorbidities. Phenotypic analysis of shared genes revealed significant enrichment for immune system phenotypes and for cardiovascular-related phenotypes, which might point to alleles and phenotypes in mouse models that could be evaluated for clues to COVID-19 severity. Through pathway analysis, we identified enriched pathways shared by comorbidity datasets and datasets associated with SARS-CoV-2 infection.

9.

EMBL Nucleotide Sequence Database in 2006.

Kulikova, Tamara; Akhtar, Ruth; Aldebert, Philippe; Althorpe, Nicola; Andersson, Mikael; Baldwin, Alastair; Bates, Kirsty; Bhattacharyya, Sumit; Bower, Lawrence; Browne, Paul; Castro, Matias; Cochrane, Guy; Duggan, Karyn; Eberhardt, Ruth; Faruque, Nadeem; Hoad, Gemma; Kanz, Carola; Lee, Charles; Leinonen, Rasko; Lin, Quan; Lombard, Vincent; Lopez, Rodrigo; Lorenc, Dariusz; McWilliam, Hamish; Mukherjee, Gaurab; Nardone, Francesco; Pastor, Maria Pilar Garcia; Plaister, Sheila; Sobhany, Siamak; Stoehr, Peter; Vaughan, Robert; Wu, Dan; Zhu, Weimin; Apweiler, Rolf.

Nucleic Acids Res ; 35(Database issue): D16-20, 2007 Jan.

Artigo em Inglês | MEDLINE | ID: mdl-17148479

RESUMO

The EMBL Nucleotide Sequence Database (http://www.ebi.ac.uk/embl) at the EMBL European Bioinformatics Institute, UK, offers a large and freely accessible collection of nucleotide sequences and accompanying annotation. The database is maintained in collaboration with DDBJ and GenBank. Data are exchanged between the collaborating databases on a daily basis to achieve optimal synchrony. Webin is the preferred tool for individual submissions of nucleotide sequences, including Third Party Annotation, alignments and bulk data. Automated procedures are provided for submissions from large-scale sequencing projects and data from the European Patent Office. In 2006, the volume of data has continued to grow exponentially. Access to the data is provided via SRS, ftp and variety of other methods. Extensive external and internal cross-references enable users to search for related information across other databases and within the database. All available resources can be accessed via the EBI home page at http://www.ebi.ac.uk/. Changes over the past year include changes to the file format, further development of the EMBLCDS dataset and developments to the XML format.

Assuntos

Bases de Dados de Ácidos Nucleicos , Sequência de Bases , Bases de Dados de Ácidos Nucleicos/tendências , Internet , Interface Usuário-Computador

10.

Curating gene sets: challenges and opportunities for integrative analysis.

Bubier, Jason; Hill, David; Mukherjee, Gaurab; Reynolds, Timothy; Baker, Erich J; Berger, Alexander; Emerson, Jake; Blake, Judith A; Chesler, Elissa J.

Database (Oxford) ; 20192019 01 01.

Artigo em Inglês | MEDLINE | ID: mdl-30888410

RESUMO

Genomic data interpretation often requires analyses that move from a gene-by-gene focus to a focus on sets of genes that are associated with biological phenomena such as molecular processes, phenotypes, diseases, drug interactions or environmental conditions. Unique challenges exist in the curation of gene sets beyond the challenges in curation of individual genes. Here we highlight a literature curation workflow whereby gene sets are curated from peer-reviewed published data into GeneWeaver (GW), a data repository and analysis platform. We describe the system features that allow for a flexible yet precise curation procedure. We illustrate the value of curation by gene sets through analysis of independently curated sets that relate to the integrated stress response, showing that sets curated from independent sources all share significant Jaccard similarity. A suite of reproducible analysis tools is provided in GW as services to carry out interactive functional investigation of user-submitted gene sets within the context of over 150 000 gene sets constructed from publicly available resources and published gene lists. A curation interface supports the ability of users to design and maintain curation workflows of gene sets, including assigning, reviewing and releasing gene sets within a curation project context.

Assuntos

Curadoria de Dados , Bases de Dados Genéticas , Genes , Fenômenos Biológicos , Software , Estresse Fisiológico , Fluxo de Trabalho

11.

EMBL Nucleotide Sequence Database: developments in 2005.

Cochrane, Guy; Aldebert, Philippe; Althorpe, Nicola; Andersson, Mikael; Baker, Wendy; Baldwin, Alastair; Bates, Kirsty; Bhattacharyya, Sumit; Browne, Paul; van den Broek, Alexandra; Castro, Matias; Duggan, Karyn; Eberhardt, Ruth; Faruque, Nadeem; Gamble, John; Kanz, Carola; Kulikova, Tamara; Lee, Charles; Leinonen, Rasko; Lin, Quan; Lombard, Vincent; Lopez, Rodrigo; McHale, Michelle; McWilliam, Hamish; Mukherjee, Gaurab; Nardone, Francesco; Pastor, Maria Pilar Garcia; Sobhany, Siamak; Stoehr, Peter; Tzouvara, Katerina; Vaughan, Robert; Wu, Dan; Zhu, Weimin; Apweiler, Rolf.

Nucleic Acids Res ; 34(Database issue): D10-5, 2006 Jan 01.

Artigo em Inglês | MEDLINE | ID: mdl-16381823

RESUMO

The EMBL Nucleotide Sequence Database (www.ebi.ac.uk/embl) at the EMBL European Bioinformatics Institute, UK, offers a comprehensive set of publicly available nucleotide sequence and annotation, freely accessible to all. Maintained in collaboration with partners DDBJ and GenBank, coverage includes whole genome sequencing project data, directly submitted sequence, sequence recorded in support of patent applications and much more. The database continues to offer submission tools, data retrieval facilities and user support. In 2005, the volume of data offered has continued to grow exponentially. In addition to the newly presented data, the database encompasses a range of new data types generated by novel technologies, offers enhanced presentation and searchability of the data and has greater integration with other data resources offered at the EBI and elsewhere. In stride with these developing data types, the database has continued to develop submission and retrieval tools to maximise the information content of submitted data and to offer the simplest possible submission routes for data producers. New developments, the submission process, data retrieval and access to support are presented in this paper, along with links to sources of further information.

Assuntos

Bases de Dados de Ácidos Nucleicos , Animais , Sequência de Bases , Genômica , Internet , Software , Interface Usuário-Computador

12.

ArrayExpress: a public database of gene expression data at EBI.

Rocca-Serra, Philippe; Brazma, Alvis; Parkinson, Helen; Sarkans, Ugis; Shojatalab, Mohammadreza; Contrino, Sergio; Vilo, Jaak; Abeygunawardena, Niran; Mukherjee, Gaurab; Holloway, Ele; Kapushesky, Misha; Kemmeren, Patrick; Lara, Gonzalo Garcia; Oezcimen, Ahmet; Sansone, Susanna-Assunta.

C R Biol ; 326(10-11): 1075-8, 2003.

Artigo em Inglês | MEDLINE | ID: mdl-14744115

RESUMO

ArrayExpress is a public repository for microarray-based gene expression data, resulting from the implementation of the MAGE object model to ensure accurate data structuring and the MIAME standard, which defines the annotation requirements. ArrayExpress accepts data as MAGE-ML files for direct submissions or data from MIAMExpress, the MIAME compliant web-based annotation and submission tool of EBI. A team of curators supports the submission process, providing assistance in data annotation. Data retrieval is performed through a dedicated web interface. Relevant results may be exported to ExpressionProfiler, the EBI based expression analysis tool available online (http://www.ebi.ac.uk/arrayexpress).

Assuntos

Biologia Computacional , Bases de Dados Genéticas , Expressão Gênica , Análise de Sequência com Séries de Oligonucleotídeos

13.

MIAME/Plant - adding value to plant microarrray experiments.

Zimmermann, Philip; Schildknecht, Beatrice; Craigon, David; Garcia-Hernandez, Margarita; Gruissem, Wilhelm; May, Sean; Mukherjee, Gaurab; Parkinson, Helen; Rhee, Seung; Wagner, Ulrich; Hennig, Lars.

Plant Methods ; 2: 1, 2006 Jan 09.

Artigo em Inglês | MEDLINE | ID: mdl-16401339

RESUMO

Appropriate biological interpretation of microarray data calls for relevant experimental annotation. The widely accepted MIAME guidelines provide a generic, organism-independant standard for minimal information about microarray experiments. In its overall structure, MIAME is very general and specifications cover mostly technical aspects, while relevant organism-specific information useful to understand the underlying experiments is largely missing. If plant biologists want to use results from published microarray experiments, they need detailed information about biological aspects, such as growth conditions, harvesting time or harvested organ(s). Here, we propose MIAME/Plant, a standard describing which biological details to be captured for describing microarray experiments involving plants. We expect that a more detailed and more systematic annotation of microarray experiments will greatly increase the use of transcriptome data sets for the scientific community. The power and value of systematic annotation of microarray data is convincingly demonstrated by data warehouses such as Genevestigator(R) or NASCArrays, and better experimental annotation will make these applications even more powerful.

14.

Plant-based microarray data at the European Bioinformatics Institute. Introducing AtMIAMExpress, a submission tool for Arabidopsis gene expression data to ArrayExpress.

Mukherjee, Gaurab; Abeygunawardena, Niran; Parkinson, Helen; Contrino, Sergio; Durinck, Steffen; Farne, Anna; Holloway, Ele; Lilja, Per; Moreau, Yves; Oezcimen, Ahmet; Rayner, Tim; Sharma, Anjan; Brazma, Alvis; Sarkans, Ugis; Shojatalab, Mohammadreza.

Plant Physiol ; 139(2): 632-6, 2005 Oct.

Artigo em Inglês | MEDLINE | ID: mdl-16219923

RESUMO

ArrayExpress is a public microarray repository founded on the Minimum Information About a Microarray Experiment (MIAME) principles that stores MIAME-compliant gene expression data. Plant-based data sets represent approximately one-quarter of the experiments in ArrayExpress. The majority are based on Arabidopsis (Arabidopsis thaliana); however, there are other data sets based on Triticum aestivum, Hordeum vulgare, and Populus subsp. AtMIAMExpress is an open-source Web-based software application for the submission of Arabidopsis-based microarray data to ArrayExpress. AtMIAMExpress exports data in MAGE-ML format for upload to any MAGE-ML-compliant application, such as J-Express and ArrayExpress. It was designed as a tool for users with minimal bioinformatics expertise, has comprehensive help and user support, and represents a simple solution to meeting the MIAME guidelines for the Arabidopsis community. Plant data are queryable both in ArrayExpress and in the Data Warehouse databases, which support queries based on gene-centric and sample-centric annotation. The AtMIAMExpress submission tool is available at http://www.ebi.ac.uk/at-miamexpress/. The software is open source and is available from http://sourceforge.net/projects/miamexpress/. For information, contact miamexpress@ebi.ac.uk.

Assuntos

Arabidopsis/genética , Bases de Dados Genéticas , Academias e Institutos , Biologia Computacional , Europa (Continente) , Perfilação da Expressão Gênica , Internet , Análise de Sequência com Séries de Oligonucleotídeos , Software , Triticum/genética

15.

The ArrayExpress gene expression database: a software engineering and implementation perspective.

Sarkans, Ugis; Parkinson, Helen; Lara, Gonzalo Garcia; Oezcimen, Ahmet; Sharma, Anjan; Abeygunawardena, Niran; Contrino, Sergio; Holloway, Ele; Rocca-Serra, Philippe; Mukherjee, Gaurab; Shojatalab, Mohammadreza; Kapushesky, Misha; Sansone, Susanna-Assunta; Farne, Anna; Rayner, Tim; Brazma, Alvis.

Bioinformatics ; 21(8): 1495-501, 2005 Apr 15.

Artigo em Inglês | MEDLINE | ID: mdl-15564302

RESUMO

MOTIVATION: The lack of microarray data management systems and databases is still one of the major problems faced by many life sciences laboratories. While developing the public repository for microarray data ArrayExpress we had to find novel solutions to many non-trivial software engineering problems. Our experience will be both relevant and useful for most bioinformaticians involved in developing information systems for a wide range of high-throughput technologies. RESULTS: ArrayExpress has been online since February 2002, growing exponentially to well over 10,000 hybridizations (as of September 2004). It has been demonstrated that our chosen design and implementation works for databases aimed at storage, access and sharing of high-throughput data. AVAILABILITY: The ArrayExpress database is available at http://www.ebi.ac.uk/arrayexpress/. The software is open source. CONTACT: ugis@ebi.ac.uk.

Assuntos

Sistemas de Gerenciamento de Base de Dados , Bases de Dados Genéticas , Perfilação da Expressão Gênica/métodos , Armazenamento e Recuperação da Informação/métodos , Análise de Sequência com Séries de Oligonucleotídeos/métodos , Proteínas/genética , Proteínas/metabolismo , Software , Algoritmos , Disseminação de Informação/métodos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

RESUMO

Assuntos

RESUMO

Assuntos

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

Detalhe da pesquisa