Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 15 de 15
Filtrar
2.
Nature ; 537(7621): 508-514, 2016 09 22.
Artículo en Inglés | MEDLINE | ID: mdl-27626380

RESUMEN

Approximately one-third of all mammalian genes are essential for life. Phenotypes resulting from knockouts of these genes in mice have provided tremendous insight into gene function and congenital disorders. As part of the International Mouse Phenotyping Consortium effort to generate and phenotypically characterize 5,000 knockout mouse lines, here we identify 410 lethal genes during the production of the first 1,751 unique gene knockouts. Using a standardized phenotyping platform that incorporates high-resolution 3D imaging, we identify phenotypes at multiple time points for previously uncharacterized genes and additional phenotypes for genes with previously reported mutant phenotypes. Unexpectedly, our analysis reveals that incomplete penetrance and variable expressivity are common even on a defined genetic background. In addition, we show that human disease genes are enriched for essential genes, thus providing a dataset that facilitates the prioritization and validation of mutations identified in clinical sequencing efforts.


Asunto(s)
Embrión de Mamíferos/embriología , Embrión de Mamíferos/metabolismo , Genes Esenciales/genética , Genes Letales/genética , Mutación/genética , Fenotipo , Animales , Secuencia Conservada/genética , Enfermedad , Estudio de Asociación del Genoma Completo , Ensayos Analíticos de Alto Rendimiento , Humanos , Imagenología Tridimensional , Ratones , Ratones Endogámicos C57BL , Ratones Noqueados , Penetrancia , Polimorfismo de Nucleótido Simple/genética , Homología de Secuencia
3.
Mamm Genome ; 26(9-10): 574-83, 2015 Oct.
Artículo en Inglés | MEDLINE | ID: mdl-26141960

RESUMEN

The Gene Ontology (GO) is an important component of modern biological knowledge representation with great utility for computational analysis of genomic and genetic data. The Gene Ontology Consortium (GOC) consists of a large team of contributors including curation teams from most model organism database groups as well as curation teams focused on representation of data relevant to specific human diseases. Key to the generation of consistent and comprehensive annotations is the development and use of shared standards and measures of curation quality. The GOC engages all contributors to work to a defined standard of curation that is presented here in the context of annotation of genes in the laboratory mouse. Comprehensive understanding of the origin, epistemology, and coverage of GO annotations is essential for most effective use of GO resources. Here the application of comparative approaches to capturing functional data in the mouse system is described.


Asunto(s)
Bases de Datos Genéticas , Ontología de Genes , Anotación de Secuencia Molecular , Animales , Biología Computacional , Genómica , Humanos , Ratones , Análisis de Secuencia de ADN
4.
Mamm Genome ; 26(7-8): 305-13, 2015 Aug.
Artículo en Inglés | MEDLINE | ID: mdl-26223881

RESUMEN

The mouse genome database (MGD) is the model organism database component of the mouse genome informatics system at The Jackson Laboratory. MGD is the international data resource for the laboratory mouse and facilitates the use of mice in the study of human health and disease. Since its beginnings, MGD has included comparative genomics data with a particular focus on human-mouse orthology, an essential component of the use of mouse as a model organism. Over the past 25 years, novel algorithms and addition of orthologs from other model organisms have enriched comparative genomics in MGD data, extending the use of orthology data to support the laboratory mouse as a model of human biology. Here, we describe current comparative data in MGD and review the history and refinement of orthology representation in this resource.


Asunto(s)
Bases de Datos Genéticas/historia , Genoma , Genómica/métodos , Homología de Secuencia de Aminoácido , Alelos , Animales , Modelos Animales de Enfermedad , Genómica/historia , Genotipo , Historia del Siglo XX , Historia del Siglo XXI , Humanos , Ratones , Anotación de Secuencia Molecular , Fenotipo , Filogenia
5.
BMC Bioinformatics ; 15: 405, 2014 Dec 12.
Artículo en Inglés | MEDLINE | ID: mdl-25495798

RESUMEN

BACKGROUND: Biomedical ontologies are increasingly instrumental in the advancement of biological research primarily through their use to efficiently consolidate large amounts of data into structured, accessible sets. However, ontology development and usage can be hampered by the segregation of knowledge by domain that occurs due to independent development and use of the ontologies. The ability to infer data associated with one ontology to data associated with another ontology would prove useful in expanding information content and scope. We here focus on relating two ontologies: the Gene Ontology (GO), which encodes canonical gene function, and the Mammalian Phenotype Ontology (MP), which describes non-canonical phenotypes, using statistical methods to suggest GO functional annotations from existing MP phenotype annotations. This work is in contrast to previous studies that have focused on inferring gene function from phenotype primarily through lexical or semantic similarity measures. RESULTS: We have designed and tested a set of algorithms that represents a novel methodology to define rules for predicting gene function by examining the emergent structure and relationships between the gene functions and phenotypes rather than inspecting the terms semantically. The algorithms inspect relationships among multiple phenotype terms to deduce if there are cases where they all arise from a single gene function. We apply this methodology to data about genes in the laboratory mouse that are formally represented in the Mouse Genome Informatics (MGI) resource. From the data, 7444 rule instances were generated from five generalized rules, resulting in 4818 unique GO functional predictions for 1796 genes. CONCLUSIONS: We show that our method is capable of inferring high-quality functional annotations from curated phenotype data. As well as creating inferred annotations, our method has the potential to allow for the elucidation of unforeseen, biologically significant associations between gene function and phenotypes that would be overlooked by a semantics-based approach. Future work will include the implementation of the described algorithms for a variety of other model organism databases, taking full advantage of the abundance of available high quality curated data.


Asunto(s)
Algoritmos , Redes Reguladoras de Genes , Anotación de Secuencia Molecular , Fenotipo , Animales , Bases de Datos Factuales , Ratones
6.
bioRxiv ; 2020 Sep 21.
Artículo en Inglés | MEDLINE | ID: mdl-32995795

RESUMEN

The emergence of the SARS-CoV-2 virus and subsequent COVID-19 pandemic initiated intense research into the mechanisms of action for this virus. It was quickly noted that COVID-19 presents more seriously in conjunction with other human disease conditions such as hypertension, diabetes, and lung diseases. We conducted a bioinformatics analysis of COVID-19 comorbidity-associated gene sets, identifying genes and pathways shared among the comorbidities, and evaluated current knowledge about these genes and pathways as related to current information about SARS-CoV-2 infection. We performed our analysis using GeneWeaver (GW), Reactome, and several biomedical ontologies to represent and compare common COVID-19 comorbidities. Phenotypic analysis of shared genes revealed significant enrichment for immune system phenotypes and for cardiovascular-related phenotypes, which might point to alleles and phenotypes in mouse models that could be evaluated for clues to COVID-19 severity. Through pathway analysis, we identified enriched pathways shared by comorbidity datasets and datasets associated with SARS-CoV-2 infection.

7.
Sci Rep ; 10(1): 20848, 2020 11 30.
Artículo en Inglés | MEDLINE | ID: mdl-33257774

RESUMEN

The emergence of the SARS-CoV-2 virus and subsequent COVID-19 pandemic initiated intense research into the mechanisms of action for this virus. It was quickly noted that COVID-19 presents more seriously in conjunction with other human disease conditions such as hypertension, diabetes, and lung diseases. We conducted a bioinformatics analysis of COVID-19 comorbidity-associated gene sets, identifying genes and pathways shared among the comorbidities, and evaluated current knowledge about these genes and pathways as related to current information about SARS-CoV-2 infection. We performed our analysis using GeneWeaver (GW), Reactome, and several biomedical ontologies to represent and compare common COVID-19 comorbidities. Phenotypic analysis of shared genes revealed significant enrichment for immune system phenotypes and for cardiovascular-related phenotypes, which might point to alleles and phenotypes in mouse models that could be evaluated for clues to COVID-19 severity. Through pathway analysis, we identified enriched pathways shared by comorbidity datasets and datasets associated with SARS-CoV-2 infection.


Asunto(s)
COVID-19/mortalidad , COVID-19/patología , Biología Computacional/métodos , Animales , Enfermedades Cardiovasculares/epidemiología , Enfermedades Cardiovasculares/genética , Comorbilidad , Síndrome de Liberación de Citoquinas/mortalidad , Bases de Datos Genéticas , Diabetes Mellitus/epidemiología , Diabetes Mellitus/genética , Modelos Animales de Enfermedad , Hepatitis/epidemiología , Hepatitis/genética , Humanos , Enfermedades Renales/epidemiología , Enfermedades Renales/genética , Enfermedades Pulmonares/epidemiología , Enfermedades Pulmonares/genética , Ratones , Síndrome de Dificultad Respiratoria/mortalidad , SARS-CoV-2 , Índice de Severidad de la Enfermedad
8.
Database (Oxford) ; 20202020 01 01.
Artículo en Inglés | MEDLINE | ID: mdl-32559296

RESUMEN

Short paragraphs that describe gene function, referred to as gene summaries, are valued by users of biological knowledgebases for the ease with which they convey key aspects of gene function. Manual curation of gene summaries, while desirable, is difficult for knowledgebases to sustain. We developed an algorithm that uses curated, structured gene data at the Alliance of Genome Resources (Alliance; www.alliancegenome.org) to automatically generate gene summaries that simulate natural language. The gene data used for this purpose include curated associations (annotations) to ontology terms from the Gene Ontology, Disease Ontology, model organism knowledgebase (MOK)-specific anatomy ontologies and Alliance orthology data. The method uses sentence templates for each data category included in the gene summary in order to build a natural language sentence from the list of terms associated with each gene. To improve readability of the summaries when numerous gene annotations are present, we developed a new algorithm that traverses ontology graphs in order to group terms by their common ancestors. The algorithm optimizes the coverage of the initial set of terms and limits the length of the final summary, using measures of information content of each ontology term as a criterion for inclusion in the summary. The automated gene summaries are generated with each Alliance release, ensuring that they reflect current data at the Alliance. Our method effectively leverages category-specific curation efforts of the Alliance member databases to create modular, structured and standardized gene summaries for seven member species of the Alliance. These automatically generated gene summaries make cross-species gene function comparisons tenable and increase discoverability of potential models of human disease. In addition to being displayed on Alliance gene pages, these summaries are also included on several MOK gene pages.


Asunto(s)
Bases de Datos Genéticas , Genómica , Anotación de Secuencia Molecular/métodos , Ontología de Genes , Almacenamiento y Recuperación de la Información
9.
BMC Bioinformatics ; 7: 416, 2006 Sep 19.
Artículo en Inglés | MEDLINE | ID: mdl-16984652

RESUMEN

BACKGROUND: Many commonly used genome browsers display sequence annotations and related attributes as horizontal data tracks that can be toggled on and off according to user preferences. Most genome browsers use only simple keyword searches and limit the display of detailed annotations to one chromosomal region of the genome at a time. We have employed concepts, methodologies, and tools that were developed for the display of geographic data to develop a Genome Spatial Information System (GenoSIS) for displaying genomes spatially, and interacting with genome annotations and related attribute data. In contrast to the paradigm of horizontally stacked data tracks used by most genome browsers, GenoSIS uses the concept of registered spatial layers composed of spatial objects for integrated display of diverse data. In addition to basic keyword searches, GenoSIS supports complex queries, including spatial queries, and dynamically generates genome maps. Our adaptation of the geographic information system (GIS) model in a genome context supports spatial representation of genome features at multiple scales with a versatile and expressive query capability beyond that supported by existing genome browsers. RESULTS: We implemented an interactive genome sequence feature map for the mouse genome in GenoSIS, an application that uses ArcGIS, a commercially available GIS software system. The genome features and their attributes are represented as spatial objects and data layers that can be toggled on and off according to user preferences or displayed selectively in response to user queries. GenoSIS supports the generation of custom genome maps in response to complex queries about genome features based on both their attributes and locations. Our example application of GenoSIS to the mouse genome demonstrates the powerful visualization and query capability of mature GIS technology applied in a novel domain. CONCLUSION: Mapping tools developed specifically for geographic data can be exploited to display, explore and interact with genome data. The approach we describe here is organism independent and is equally useful for linear and circular chromosomes. One of the unique capabilities of GenoSIS compared to existing genome browsers is the capacity to generate genome feature maps dynamically in response to complex attribute and spatial queries.


Asunto(s)
Mapeo Cromosómico/métodos , Genoma/genética , Sistemas de Información Geográfica , Animales , Bases de Datos Genéticas , Regulación de la Expresión Génica , Ratones
10.
BMC Genomics ; 7: 229, 2006 Sep 08.
Artículo en Inglés | MEDLINE | ID: mdl-16961921

RESUMEN

BACKGROUND: Many agricultural species and their pathogens have sequenced genomes and more are in progress. Agricultural species provide food, fiber, xenotransplant tissues, biopharmaceuticals and biomedical models. Moreover, many agricultural microorganisms are human zoonoses. However, systems biology from functional genomics data is hindered in agricultural species because agricultural genome sequences have relatively poor structural and functional annotation and agricultural research communities are smaller with limited funding compared to many model organism communities. DESCRIPTION: To facilitate systems biology in these traditionally agricultural species we have established "AgBase", a curated, web-accessible, public resource http://www.agbase.msstate.edu for structural and functional annotation of agricultural genomes. The AgBase database includes a suite of computational tools to use GO annotations. We use standardized nomenclature following the Human Genome Organization Gene Nomenclature guidelines and are currently functionally annotating chicken, cow and sheep gene products using the Gene Ontology (GO). The computational tools we have developed accept and batch process data derived from different public databases (with different accession codes), return all existing GO annotations, provide a list of products without GO annotation, identify potential orthologs, model functional genomics data using GO and assist proteomics analysis of ESTs and EST assemblies. Our journal database helps prevent redundant manual GO curation. We encourage and publicly acknowledge GO annotations from researchers and provide a service for researchers interested in GO and analysis of functional genomics data. CONCLUSION: The AgBase database is the first database dedicated to functional genomics and systems biology analysis for agriculturally important species and their pathogens. We use experimental data to improve structural annotation of genomes and to functionally characterize gene products. AgBase is also directly relevant for researchers in fields as diverse as agricultural production, cancer biology, biopharmaceuticals, human health and evolutionary biology. Moreover, the experimental methods and bioinformatics tools we provide are widely applicable to many other species including model organisms.


Asunto(s)
Agricultura , Bases de Datos Genéticas , Genómica , Animales , Bases de Datos de Proteínas , Genoma/genética , Humanos
11.
Artículo en Inglés | MEDLINE | ID: mdl-25717398

RESUMEN

We have developed an ontology, OncoCL, to classify cancer cells and provide a framework for consistent annotation of cancer-associated data from conventional surgical pathology and cancer molecular biology for the purpose of access, comparison, and analysis. The cell type ontology, CL, describes normal cell types and was not designed to capture the pathology of cancer cells. OncoCL builds upon CL, as a canonical cell (represented in CL) undergoes oncogenic change and tumorigenesis with the acquisition of the cancer hallmarks described by Hanahan and Weinberg.

12.
Database (Oxford) ; 2012: bar063, 2012.
Artículo en Inglés | MEDLINE | ID: mdl-22434831

RESUMEN

Optimal curation of human diseases requires an ontology or structured vocabulary that contains terms familiar to end users, is robust enough to support multiple levels of annotation granularity, is limited to disease terms and is stable enough to avoid extensive reannotation following updates. At Mouse Genome Informatics (MGI), we currently use disease terms from Online Mendelian Inheritance in Man (OMIM) to curate mouse models of human disease. While OMIM provides highly detailed disease records that are familiar to many in the medical community, it lacks structure to support multilevel annotation. To improve disease annotation at MGI, we evaluated the merged Medical Subject Headings (MeSH) and OMIM disease vocabulary created by the Comparative Toxicogenomics Database (CTD) project. Overlaying MeSH onto OMIM provides hierarchical access to broad disease terms, a feature missing from the OMIM. We created an extended version of the vocabulary to meet the genetic disease-specific curation needs at MGI. Here we describe our evaluation of the CTD application, the extensions made by MGI and discuss the strengths and weaknesses of this approach. DATABASE URL: http://www.informatics.jax.org/


Asunto(s)
Sistemas de Administración de Bases de Datos , Bases de Datos Factuales , Modelos Animales de Enfermedad , Genoma , Ratones/genética , Animales , Humanos , Anotación de Secuencia Molecular , Interfaz Usuario-Computador
13.
Genome Biol ; 10(8): R84, 2009.
Artículo en Inglés | MEDLINE | ID: mdl-19682380

RESUMEN

Linking biochemical genetic data to the reference genome for the laboratory mouse is important for comparative physiology and for developing mouse models of human biology and disease. We describe here a new database of curated metabolic pathways for the laboratory mouse called MouseCyc http://mousecyc.jax.org. MouseCyc has been integrated with genetic and genomic data for the laboratory mouse available from the Mouse Genome Informatics database and with pathway data from other organisms, including human.


Asunto(s)
Vías Biosintéticas , Bases de Datos Genéticas , Ratones/genética , Animales , Modelos Animales de Enfermedad , Humanos
14.
Bioinformatics ; 21 Suppl 1: i136-43, 2005 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-15961450

RESUMEN

MOTIVATION: The Gene Ontology (GO) is widely used to annotate molecular attributes of genes and gene products. Multiple groups undertaking functional annotations of genomes contribute their annotation sets to the GO database resource and these data are subsequently used in comparative functional analysis research. Although GO curators adhere to the same protocols and standards while assigning GO annotations, the specific procedure followed by each annotation group can vary. Since differences in application of annotation standards would dilute the effectiveness of comparative analysis, methods for assessing annotation consistency are essential. The development of methodologies that are broadly applicable for the assessment of GO annotation consistency is an important issue for the comparative genomics community. RESULTS: We have developed a methodology for assessing the consistency of GO annotations provided by different annotation groups. The method is completely general and can be applied to compare any two sets of GO annotations. This is the first attempt to assess cross-species GO annotation consistency. Our method compares annotation sets utilizing the hierarchical structure of the GO to compare GO annotations between orthologous gene pairs. The method produces a report on the annotation consistency and inconsistency for each orthologous pair. We present results obtained by comparing GO annotations for mouse and human gene sets. AVAILABILITY: The complete current MGI_GOA GO annotation consistency report is available online at http://www.spatial.maine.edu/~mdolan/


Asunto(s)
Biología Computacional/métodos , Animales , Bases de Datos Genéticas , Bases de Datos de Proteínas , Genes , Genoma , Genómica/métodos , Humanos , Ratones , Proteómica/métodos , Programas Informáticos , Especificidad de la Especie
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA