Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 14 de 14
Filtrar
Más filtros










Base de datos
Intervalo de año de publicación
1.
Genetics ; 227(1)2024 05 07.
Artículo en Inglés | MEDLINE | ID: mdl-38531069

RESUMEN

Mouse Genome Informatics (MGI) is a federation of expertly curated information resources designed to support experimental and computational investigations into genetic and genomic aspects of human biology and disease using the laboratory mouse as a model system. The Mouse Genome Database (MGD) and the Gene Expression Database (GXD) are core MGI databases that share data and system architecture. MGI serves as the central community resource of integrated information about mouse genome features, variation, expression, gene function, phenotype, and human disease models acquired from peer-reviewed publications, author submissions, and major bioinformatics resources. To facilitate integration and standardization of data, biocuration scientists annotate using terms from controlled metadata vocabularies and biological ontologies (e.g. Mammalian Phenotype Ontology, Mouse Developmental Anatomy, Disease Ontology, Gene Ontology, etc.), and by applying international community standards for gene, allele, and mouse strain nomenclature. MGI serves basic scientists, translational researchers, and data scientists by providing access to FAIR-compliant data in both human-readable and compute-ready formats. The MGI resource is accessible at https://informatics.jax.org. Here, we present an overview of the core data types represented in MGI and highlight recent enhancements to the resource with a focus on new data and functionality for MGD and GXD.


Asunto(s)
Bases de Datos Genéticas , Genoma , Animales , Ratones , Bases del Conocimiento , Genómica/métodos , Biología Computacional/métodos , Humanos
2.
Mamm Genome ; 33(1): 4-18, 2022 03.
Artículo en Inglés | MEDLINE | ID: mdl-34698891

RESUMEN

The Mouse Genome Informatics (MGI) database system combines multiple expertly curated community data resources into a shared knowledge management ecosystem united by common metadata annotation standards. MGI's mission is to facilitate the use of the mouse as an experimental model for understanding the genetic and genomic basis of human health and disease. MGI is the authoritative source for mouse gene, allele, and strain nomenclature and is the primary source of mouse phenotype annotations, functional annotations, developmental gene expression information, and annotations of mouse models with human diseases. MGI maintains mouse anatomy and phenotype ontologies and contributes to the development of the Gene Ontology and Disease Ontology and uses these ontologies as standard terminologies for annotation. The Mouse Genome Database (MGD) and the Gene Expression Database (GXD) are MGI's two major knowledgebases. Here, we highlight some of the recent changes and enhancements to MGD and GXD that have been implemented in response to changing needs of the biomedical research community and to improve the efficiency of expert curation. MGI can be accessed freely at http://www.informatics.jax.org .


Asunto(s)
Bases de Datos Genéticas , Ecosistema , Alelos , Animales , Ontología de Genes , Genómica , Ratones
3.
Mamm Genome ; 33(1): 44-54, 2022 03.
Artículo en Inglés | MEDLINE | ID: mdl-34448927

RESUMEN

The assembled and annotated genomes for 16 inbred mouse strains (Lilue et al., Nat Genet 50:1574-1583, 2018) and two wild-derived strains (CAROLI/EiJ and PAHARI/EiJ) (Thybert et al., Genome Res 28:448-459, 2018) are valuable resources for mouse genetics and comparative genomics. We developed the multiple genome viewer (MGV; http://www.informatics.jax.org/mgv ) to support visualization, exploration, and comparison of genome annotations within and across these genomes. MGV displays chromosomal regions of user-selected genomes as horizontal tracks. Equivalent features across the genome tracks are highlighted using vertical 'swim lane' connectors. Navigation across the genomes is synchronized as a researcher uses the scroll and zoom functions. Researchers can generate custom sets of genes and other genome features to be displayed in MGV by entering genome coordinates, function, phenotype, disease, and/or pathway terms. MGV was developed to be genome agnostic and can be used to display homologous features across genomes of different organisms.


Asunto(s)
Genoma , Genómica , Animales , Bases de Datos Genéticas , Genoma/genética , Internet , Ratones , Ratones Endogámicos , Programas Informáticos
4.
Nucleic Acids Res ; 49(D1): D981-D987, 2021 01 08.
Artículo en Inglés | MEDLINE | ID: mdl-33231642

RESUMEN

The Mouse Genome Database (MGD; http://www.informatics.jax.org) is the community model organism knowledgebase for the laboratory mouse, a widely used animal model for comparative studies of the genetic and genomic basis for human health and disease. MGD is the authoritative source for biological reference data related to mouse genes, gene functions, phenotypes and mouse models of human disease. MGD is the primary source for official gene, allele, and mouse strain nomenclature based on the guidelines set by the International Committee on Standardized Nomenclature for Mice. MGD's biocuration scientists curate information from the biomedical literature and from large and small datasets contributed directly by investigators. In this report we describe significant enhancements to the content and interfaces at MGD, including (i) improvements in the Multi Genome Viewer for exploring the genomes of multiple mouse strains, (ii) inclusion of many more mouse strains and new mouse strain pages with extended query options and (iii) integration of extensive data about mouse strain variants. We also describe improvements to the efficiency of literature curation processes and the implementation of an information portal focused on mouse models and genes for the study of COVID-19.


Asunto(s)
COVID-19/prevención & control , Bases de Datos Genéticas , Genoma/genética , Genómica/métodos , Bases del Conocimiento , SARS-CoV-2/genética , Animales , COVID-19/epidemiología , COVID-19/virología , Curaduría de Datos/métodos , Modelos Animales de Enfermedad , Epidemias , Ontología de Genes , Humanos , Almacenamiento y Recuperación de la Información/métodos , Internet , Ratones , SARS-CoV-2/fisiología
5.
Nucleic Acids Res ; 49(D1): D924-D931, 2021 01 08.
Artículo en Inglés | MEDLINE | ID: mdl-33104772

RESUMEN

The Gene Expression Database (GXD; www.informatics.jax.org/expression.shtml) is an extensive and well-curated community resource of mouse developmental gene expression information. For many years, GXD has collected and integrated data from RNA in situ hybridization, immunohistochemistry, RT-PCR, northern blot, and western blot experiments through curation of the scientific literature and by collaborations with large-scale expression projects. Since our last report in 2019, we have continued to acquire these classical types of expression data; developed a searchable index of RNA-Seq and microarray experiments that allows users to quickly and reliably find specific mouse expression studies in ArrayExpress (https://www.ebi.ac.uk/arrayexpress/) and GEO (https://www.ncbi.nlm.nih.gov/geo/); and expanded GXD to include RNA-Seq data. Uniformly processed RNA-Seq data are imported from the EBI Expression Atlas and then integrated with the other types of expression data in GXD, and with the genetic, functional, phenotypic and disease-related information in Mouse Genome Informatics (MGI). This integration has made the RNA-Seq data accessible via GXD's enhanced searching and filtering capabilities. Further, we have embedded the Morpheus heat map utility into the GXD user interface to provide additional tools for display and analysis of RNA-Seq data, including heat map visualization, sorting, filtering, hierarchical clustering, nearest neighbors analysis and visual enrichment.


Asunto(s)
Bases de Datos Genéticas , Perfilación de la Expresión Génica/métodos , Expresión Génica , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Animales , Análisis por Conglomerados , Internet , Ratones , Proteínas/genética , Proteínas/metabolismo , Interfaz Usuario-Computador
6.
Database (Oxford) ; 20202020 01 01.
Artículo en Inglés | MEDLINE | ID: mdl-32140729

RESUMEN

The Gene Expression Database (GXD), an extensive community resource of curated expression information for the mouse, has developed an RNA-Seq and Microarray Experiment Search (http://www.informatics.jax.org/gxd/htexp_index). This tool allows users to quickly and reliably find specific experiments in ArrayExpress and the Gene Expression Omnibus (GEO) that study endogenous gene expression in wild-type and mutant mice. Standardized metadata annotations, curated by GXD, allow users to specify the anatomical structure, developmental stage, mutated gene, strain and sex of samples of interest, as well as the study type and key parameters of the experiment. These searches, powered by controlled vocabularies and ontologies, can be combined with free text searching of experiment titles and descriptions. Search result summaries include link-outs to ArrayExpress and GEO, providing easy access to the expression data itself. Links to the PubMed entries for accompanying publications are also included. More information about this tool and GXD can be found at the GXD home page (http://www.informatics.jax.org/expression.shtml). Database URL: http://www.informatics.jax.org/expression.shtml.


Asunto(s)
Curaduría de Datos/métodos , Bases de Datos Genéticas , Perfilación de la Expresión Génica/métodos , Metadatos , Análisis de Secuencia por Matrices de Oligonucleótidos/métodos , RNA-Seq/métodos , Animales , Almacenamiento y Recuperación de la Información/métodos , Internet , Ratones , Interfaz Usuario-Computador
7.
Nucleic Acids Res ; 47(D1): D774-D779, 2019 01 08.
Artículo en Inglés | MEDLINE | ID: mdl-30335138

RESUMEN

The mouse Gene Expression Database (GXD) is an extensive, well-curated community resource freely available at www.informatics.jax.org/expression.shtml. Covering all developmental stages, GXD includes data from RNA in situ hybridization, immunohistochemistry, RT-PCR, northern blot and western blot experiments in wild-type and mutant mice. GXD's gene expression information is integrated with the other data in Mouse Genome Informatics and interconnected with other databases, placing these data in the larger biological and biomedical context. Since the last report, the ability of GXD to provide insights into the molecular mechanisms of development and disease has been greatly enhanced by the addition of new data and by the implementation of new web features. These include: improvements to the Differential Gene Expression Data Search, facilitating searches for genes that have been shown to be exclusively expressed in a specified structure and/or developmental stage; an enhanced anatomy browser that now provides access to expression data and phenotype data for a given anatomical structure; direct access to the wild-type gene expression data for the tissues affected in a specific mutant; and a comparison matrix that juxtaposes tissues where a gene is normally expressed against tissues, where mutations in that gene cause abnormalities.


Asunto(s)
Bases de Datos Genéticas , Genoma/genética , Transcriptoma/genética , Animales , Internet , Ratones , Interfaz Usuario-Computador
8.
Nucleic Acids Res ; 45(D1): D730-D736, 2017 01 04.
Artículo en Inglés | MEDLINE | ID: mdl-27899677

RESUMEN

The Gene Expression Database (GXD; www.informatics.jax.org/expression.shtml) is an extensive and well-curated community resource of mouse developmental expression information. Through curation of the scientific literature and by collaborations with large-scale expression projects, GXD collects and integrates data from RNA in situ hybridization, immunohistochemistry, RT-PCR, northern blot and western blot experiments. Expression data from both wild-type and mutant mice are included. The expression data are combined with genetic and phenotypic data in Mouse Genome Informatics (MGI) and made readily accessible to many types of database searches. At present, GXD includes over 1.5 million expression results and more than 300 000 images, all annotated with detailed and standardized metadata. Since our last report in 2014, we have added a large amount of data, we have enhanced data and database infrastructure, and we have implemented many new search and display features. Interface enhancements include: a new Mouse Developmental Anatomy Browser; interactive tissue-by-developmental stage and tissue-by-gene matrix views; capabilities to filter and sort expression data summaries; a batch search utility; gene-based expression overviews; and links to expression data from other species.


Asunto(s)
Biología Computacional/métodos , Bases de Datos Genéticas , Perfilación de la Expresión Génica/métodos , Expresión Génica , Genómica/métodos , Animales , Ontología de Genes , Ratones , Especificidad de Órganos , Motor de Búsqueda , Interfaz Usuario-Computador , Navegador Web
9.
Mamm Genome ; 26(7-8): 305-13, 2015 Aug.
Artículo en Inglés | MEDLINE | ID: mdl-26223881

RESUMEN

The mouse genome database (MGD) is the model organism database component of the mouse genome informatics system at The Jackson Laboratory. MGD is the international data resource for the laboratory mouse and facilitates the use of mice in the study of human health and disease. Since its beginnings, MGD has included comparative genomics data with a particular focus on human-mouse orthology, an essential component of the use of mouse as a model organism. Over the past 25 years, novel algorithms and addition of orthologs from other model organisms have enriched comparative genomics in MGD data, extending the use of orthology data to support the laboratory mouse as a model of human biology. Here, we describe current comparative data in MGD and review the history and refinement of orthology representation in this resource.


Asunto(s)
Bases de Datos Genéticas/historia , Genoma , Genómica/métodos , Homología de Secuencia de Aminoácido , Alelos , Animales , Modelos Animales de Enfermedad , Genómica/historia , Genotipo , Historia del Siglo XX , Historia del Siglo XXI , Humanos , Ratones , Anotación de Secuencia Molecular , Fenotipo , Filogenia
11.
Genome Res ; 13(6B): 1478-87, 2003 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-12819146

RESUMEN

The identification of coding sequences (CDS) is an important step in the functional annotation of genes. CDS prediction for mammalian genes from genomic sequence is complicated by the vast abundance of intergenic sequence in the genome, and provides little information about how different parts of potential CDS regions are expressed. In contrast, mammalian gene CDS prediction from cDNA sequence offers obvious advantages, yet encounters a different set of complexities when performed on high-throughput cDNA (HTC) sequences, such as the set of 60,770 cDNAs isolated from full-length enriched libraries of the FANTOM2 project. We developed a CDS annotation strategy that uses a variety of different CDS prediction programs to annotate the CDS regions of FANTOM2 cDNAs. These include rsCDS, which uses sequence similarity to known proteins; ProCrest; Longest-ORF and Truncated-ORF, which are ab initio based predictors; and finally, DECODER and NCBI CDS predictor, which use a combination of both principles. Aided by graphical displays of these CDS prediction results in the context of other sequence similarity results for each cDNA, FANTOM2 CDS inspection by curators and follow-up quality control procedures resulted in high quality CDS predictions for a total of 14,345 FANTOM2 clones.


Asunto(s)
Codón/análisis , ADN Complementario/análisis , Proteínas/fisiología , Animales , Secuencia de Bases , Biología Computacional/métodos , Biología Computacional/estadística & datos numéricos , Bases de Datos Genéticas/estadística & datos numéricos , Predicción , Genoma , Humanos , Ratones , Sistemas de Lectura Abierta/genética , Biosíntesis de Proteínas/genética , Proteínas/química , Proteínas/genética , Empalme del ARN/genética , Proyectos de Investigación , Selenocisteína/genética , Diseño de Software
12.
Genome Res ; 13(6B): 1505-19, 2003 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-12819150

RESUMEN

The Mouse Genome Sequencing Consortium and the RIKEN Genome Exploration Research grouphave generated large sets of sequence data representing the mouse genome and transcriptome, respectively. These data provide a valuable foundation for genomic research. The challenges for the informatics community are how to integrate these data with the ever-expanding knowledge about the roles of genes and gene products in biological processes, and how to provide useful views to the scientific community. Public resources, such as the National Center for Biotechnology Information (NCBI; http://www.ncbi.nih.gov), and model organism databases, such as the Mouse Genome Informatics database (MGI; http://www.informatics.jax.org), maintain the primary data and provide connections between sequence and biology. In this paper, we describe how the partnership of MGI and NCBI LocusLink contributes to the integration of sequence and biology, especially in the context of the large-scale genome and transcriptome data now available for the laboratory mouse. In particular, we describe the methods and results of integration of 60,770 FANTOM2 mouse cDNAs with gene records in the databases of MGI and LocusLink.


Asunto(s)
Secuencia de Bases/genética , Biología Computacional/métodos , Animales , Secuencia de Bases/fisiología , Biología Computacional/estadística & datos numéricos , Gráficos por Computador/estadística & datos numéricos , Gráficos por Computador/tendencias , ADN Complementario/genética , ADN Complementario/fisiología , Bases de Datos Genéticas/estadística & datos numéricos , Bases de Datos Genéticas/tendencias , Genes/genética , Genes/fisiología , Genoma , Internet/estadística & datos numéricos , Internet/tendencias , Ratones
13.
Genome Res ; 13(6B): 1542-51, 2003 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-12819153

RESUMEN

Manual curation has long been held to be the "gold standard" for functional annotation of DNA sequence. Our experience with the annotation of more than 20,000 full-length cDNA sequences revealed problems with this approach, including inaccurate and inconsistent assignment of gene names, as well as many good assignments that were difficult to reproduce using only computational methods. For the FANTOM2 annotation of more than 60,000 cDNA clones, we developed a number of methods and tools to circumvent some of these problems, including an automated annotation pipeline that provides high-quality preliminary annotation for each sequence by introducing an "uninformative filter" that eliminates uninformative annotations, controlled vocabularies to accurately reflect both the functional assignments and the evidence supporting them, and a highly refined, Web-based manual annotation tool that allows users to view a wide array of sequence analyses and to assign gene names and putative functions using a consistent nomenclature. The ultimate utility of our approach is reflected in the low rate of reassignment of automated assignments by manual curation. Based on these results, we propose a new standard for large-scale annotation, in which the initial automated annotations are manually investigated and then computational methods are iteratively modified and improved based on the results of manual curation.


Asunto(s)
Biología Computacional/métodos , ADN Complementario/clasificación , ADN Complementario/genética , Bases de Datos Genéticas/tendencias , Diseño de Software , Animales , Biología Computacional/normas , Mapeo Contig/clasificación , Mapeo Contig/métodos , Mapeo Contig/normas , Procesamiento Automatizado de Datos/métodos , Procesamiento Automatizado de Datos/tendencias , Genes/genética , Genes/fisiología , Humanos , Gestión de la Información/métodos , Gestión de la Información/tendencias , Ratones , Control de Calidad , Estándares de Referencia , Interfaz Usuario-Computador
14.
Mol Biol Cell ; 13(12): 4111-3, 2002 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-12475938

RESUMEN

There are 10 known mammalian septin genes, some of which produce multiple splice variants. The current nomenclature for the genes and gene products is very confusing, with several different names having been given to the same gene product and distinct names given to splice variants of the same gene. Moreover, some names are based on those of yeast or Drosophila septins that are not the closest homologues. Therefore, we suggest that the mammalian septin field adopt a common nomenclature system, based on that adopted by the Mouse Genomic Nomenclature Committee and accepted by the Human Genome Organization Gene Nomenclature Committee. The human and mouse septin genes will be named SEPT1-SEPT10 and Sept1-Sept10, respectively. Splice variants will be designated by an underscore followed by a lowercase "v" and a number, e.g., SEPT4_v1.


Asunto(s)
GTP Fosfohidrolasas/clasificación , Terminología como Asunto , Empalme Alternativo , Animales , Proteínas del Citoesqueleto , Proteínas Fúngicas/genética , GTP Fosfohidrolasas/genética , Proteínas de Unión al GTP/genética , Humanos , Filogenia , Estructura Terciaria de Proteína , Septinas
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...