Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 18 de 18
Filtrar
Más filtros










Base de datos
Intervalo de año de publicación
1.
Res Sq ; 2023 Jul 19.
Artículo en Inglés | MEDLINE | ID: mdl-37503119

RESUMEN

The Encyclopedia of DNA elements (ENCODE) project is a collaborative effort to create a comprehensive catalog of functional elements in the human genome. The current database comprises more than 19000 functional genomics experiments across more than 1000 cell lines and tissues using a wide array of experimental techniques to study the chromatin structure, regulatory and transcriptional landscape of the Homo sapiens and Mus musculus genomes. All experimental data, metadata, and associated computational analyses created by the ENCODE consortium are submitted to the Data Coordination Center (DCC) for validation, tracking, storage, and distribution to community resources and the scientific community. The ENCODE project has engineered and distributed uniform processing pipelines in order to promote data provenance and reproducibility as well as allow interoperability between genomic resources and other consortia. All data files, reference genome versions, software versions, and parameters used by the pipelines are captured and available via the ENCODE Portal. The pipeline code, developed using Docker and Workflow Description Language (WDL; https://openwdl.org/) is publicly available in GitHub, with images available on Dockerhub (https://hub.docker.com), enabling access to a diverse range of biomedical researchers. ENCODE pipelines maintained and used by the DCC can be installed to run on personal computers, local HPC clusters, or in cloud computing environments via Cromwell. Access to the pipelines and data via the cloud allows small labs the ability to use the data or software without access to institutional compute clusters. Standardization of the computational methodologies for analysis and quality control leads to comparable results from different ENCODE collections - a prerequisite for successful integrative analyses.

2.
bioRxiv ; 2023 Apr 06.
Artículo en Inglés | MEDLINE | ID: mdl-37066421

RESUMEN

The Encyclopedia of DNA elements (ENCODE) project is a collaborative effort to create a comprehensive catalog of functional elements in the human genome. The current database comprises more than 19000 functional genomics experiments across more than 1000 cell lines and tissues using a wide array of experimental techniques to study the chromatin structure, regulatory and transcriptional landscape of the Homo sapiens and Mus musculus genomes. All experimental data, metadata, and associated computational analyses created by the ENCODE consortium are submitted to the Data Coordination Center (DCC) for validation, tracking, storage, and distribution to community resources and the scientific community. The ENCODE project has engineered and distributed uniform processing pipelines in order to promote data provenance and reproducibility as well as allow interoperability between genomic resources and other consortia. All data files, reference genome versions, software versions, and parameters used by the pipelines are captured and available via the ENCODE Portal. The pipeline code, developed using Docker and Workflow Description Language (WDL; https://openwdl.org/) is publicly available in GitHub, with images available on Dockerhub (https://hub.docker.com), enabling access to a diverse range of biomedical researchers. ENCODE pipelines maintained and used by the DCC can be installed to run on personal computers, local HPC clusters, or in cloud computing environments via Cromwell. Access to the pipelines and data via the cloud allows small labs the ability to use the data or software without access to institutional compute clusters. Standardization of the computational methodologies for analysis and quality control leads to comparable results from different ENCODE collections - a prerequisite for successful integrative analyses.

3.
J Gerontol A Biol Sci Med Sci ; 78(5): 762-770, 2023 05 11.
Artículo en Inglés | MEDLINE | ID: mdl-36708182

RESUMEN

Frailty indexes (FIs) provide quantitative measurements of nonspecific health decline and are particularly useful as longitudinal monitors of morbidity in aging studies. For mouse studies, frailty assessments can be taken noninvasively, but they require handling and direct observation that is labor-intensive to the scientist and stress inducing to the animal. Here, we implement, evaluate, and provide a refined digital FI composed entirely of computational analyses of home-cage video and compare it to manually obtained frailty scores in both C57BL/6 and genetically heterogeneous Diversity Outbred mice. We show that the frailty scores assigned by our digital index correlate with both manually obtained frailty scores and chronological age. Thus, we provide an automated tool for frailty assessment that can be collected reproducibly, at scale, without substantial labor cost.


Asunto(s)
Fragilidad , Animales , Ratones , Humanos , Anciano , Fragilidad/diagnóstico , Ratones de Colaboración Cruzada , Ratones Endogámicos C57BL , Envejecimiento , Anciano Frágil , Evaluación Geriátrica
5.
Nucleic Acids Res ; 46(D1): D794-D801, 2018 01 04.
Artículo en Inglés | MEDLINE | ID: mdl-29126249

RESUMEN

The Encyclopedia of DNA Elements (ENCODE) Data Coordinating Center has developed the ENCODE Portal database and website as the source for the data and metadata generated by the ENCODE Consortium. Two principles have motivated the design. First, experimental protocols, analytical procedures and the data themselves should be made publicly accessible through a coherent, web-based search and download interface. Second, the same interface should serve carefully curated metadata that record the provenance of the data and justify its interpretation in biological terms. Since its initial release in 2013 and in response to recommendations from consortium members and the wider community of scientists who use the Portal to access ENCODE data, the Portal has been regularly updated to better reflect these design principles. Here we report on these updates, including results from new experiments, uniformly-processed data from other projects, new visualization tools and more comprehensive metadata to describe experiments and analyses. Additionally, the Portal is now home to meta(data) from related projects including Genomics of Gene Regulation, Roadmap Epigenome Project, Model organism ENCODE (modENCODE) and modERN. The Portal now makes available over 13000 datasets and their accompanying metadata and can be accessed at: https://www.encodeproject.org/.


Asunto(s)
ADN/genética , Bases de Datos Genéticas , Componentes del Gen , Genómica , Secuenciación de Nucleótidos de Alto Rendimiento , Metadatos , Animales , Caenorhabditis elegans/genética , Presentación de Datos , Conjuntos de Datos como Asunto , Drosophila melanogaster/genética , Predicción , Genoma Humano , Humanos , Ratones/genética , Interfaz Usuario-Computador
6.
PLoS One ; 12(4): e0175310, 2017.
Artículo en Inglés | MEDLINE | ID: mdl-28403240

RESUMEN

The Encyclopedia of DNA elements (ENCODE) project is an ongoing collaborative effort to create a comprehensive catalog of functional elements initiated shortly after the completion of the Human Genome Project. The current database exceeds 6500 experiments across more than 450 cell lines and tissues using a wide array of experimental techniques to study the chromatin structure, regulatory and transcriptional landscape of the H. sapiens and M. musculus genomes. All ENCODE experimental data, metadata, and associated computational analyses are submitted to the ENCODE Data Coordination Center (DCC) for validation, tracking, storage, unified processing, and distribution to community resources and the scientific community. As the volume of data increases, the identification and organization of experimental details becomes increasingly intricate and demands careful curation. The ENCODE DCC has created a general purpose software system, known as SnoVault, that supports metadata and file submission, a database used for metadata storage, web pages for displaying the metadata and a robust API for querying the metadata. The software is fully open-source, code and installation instructions can be found at: http://github.com/ENCODE-DCC/snovault/ (for the generic database) and http://github.com/ENCODE-DCC/encoded/ to store genomic data in the manner of ENCODE. The core database engine, SnoVault (which is completely independent of ENCODE, genomic data, or bioinformatic data) has been released as a separate Python package.


Asunto(s)
Bases de Datos Genéticas , Genómica/métodos , Metadatos , Programas Informáticos , Animales , ADN/genética , Genoma , Humanos , Ratones
9.
Genome Biol ; 17: 74, 2016 Apr 23.
Artículo en Inglés | MEDLINE | ID: mdl-27107712

RESUMEN

Obtaining RNA-seq measurements involves a complex data analytical process with a large number of competing algorithms as options. There is much debate about which of these methods provides the best approach. Unfortunately, it is currently difficult to evaluate their performance due in part to a lack of sensitive assessment metrics. We present a series of statistical summaries and plots to evaluate the performance in terms of specificity and sensitivity, available as a R/Bioconductor package ( http://bioconductor.org/packages/rnaseqcomp ). Using two independent datasets, we assessed seven competing pipelines. Performance was generally poor, with two methods clearly underperforming and RSEM slightly outperforming the rest.


Asunto(s)
Algoritmos , Análisis de Secuencia de ARN/métodos , Animales , Humanos , Valores de Referencia , Sensibilidad y Especificidad , Análisis de Secuencia de ARN/normas
10.
Artículo en Inglés | MEDLINE | ID: mdl-26980513

RESUMEN

The Encyclopedia of DNA Elements (ENCODE) Data Coordinating Center (DCC) is responsible for organizing, describing and providing access to the diverse data generated by the ENCODE project. The description of these data, known as metadata, includes the biological sample used as input, the protocols and assays performed on these samples, the data files generated from the results and the computational methods used to analyze the data. Here, we outline the principles and philosophy used to define the ENCODE metadata in order to create a metadata standard that can be applied to diverse assays and multiple genomic projects. In addition, we present how the data are validated and used by the ENCODE DCC in creating the ENCODE Portal (https://www.encodeproject.org/). Database URL: www.encodeproject.org.


Asunto(s)
Biología Computacional/métodos , ADN/genética , Bases de Datos Genéticas , Algoritmos , Animales , Caenorhabditis elegans , Biología Computacional/normas , Recolección de Datos , Drosophila melanogaster , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Ratones , Ácidos Nucleicos/genética , Control de Calidad , Reproducibilidad de los Resultados , Alineación de Secuencia
11.
Nucleic Acids Res ; 44(D1): D726-32, 2016 Jan 04.
Artículo en Inglés | MEDLINE | ID: mdl-26527727

RESUMEN

The Encyclopedia of DNA Elements (ENCODE) Project is in its third phase of creating a comprehensive catalog of functional elements in the human genome. This phase of the project includes an expansion of assays that measure diverse RNA populations, identify proteins that interact with RNA and DNA, probe regions of DNA hypersensitivity, and measure levels of DNA methylation in a wide range of cell and tissue types to identify putative regulatory elements. To date, results for almost 5000 experiments have been released for use by the scientific community. These data are available for searching, visualization and download at the new ENCODE Portal (www.encodeproject.org). The revamped ENCODE Portal provides new ways to browse and search the ENCODE data based on the metadata that describe the assays as well as summaries of the assays that focus on data provenance. In addition, it is a flexible platform that allows integration of genomic data from multiple projects. The portal experience was designed to improve access to ENCODE data by relying on metadata that allow reusability and reproducibility of the experiments.


Asunto(s)
Bases de Datos Genéticas , Genoma Humano , Genómica , Animales , ADN/metabolismo , Genes , Humanos , Ratones , Proteínas/metabolismo , ARN/metabolismo
12.
Artículo en Inglés | MEDLINE | ID: mdl-25776021

RESUMEN

The Encyclopedia of DNA elements (ENCODE) project is an ongoing collaborative effort to create a catalog of genomic annotations. To date, the project has generated over 4000 experiments across more than 350 cell lines and tissues using a wide array of experimental techniques to study the chromatin structure, regulatory network and transcriptional landscape of the Homo sapiens and Mus musculus genomes. All ENCODE experimental data, metadata and associated computational analyses are submitted to the ENCODE Data Coordination Center (DCC) for validation, tracking, storage and distribution to community resources and the scientific community. As the volume of data increases, the organization of experimental details becomes increasingly complicated and demands careful curation to identify related experiments. Here, we describe the ENCODE DCC's use of ontologies to standardize experimental metadata. We discuss how ontologies, when used to annotate metadata, provide improved searching capabilities and facilitate the ability to find connections within a set of experiments. Additionally, we provide examples of how ontologies are used to annotate ENCODE metadata and how the annotations can be identified via ontology-driven searches at the ENCODE portal. As genomic datasets grow larger and more interconnected, standardization of metadata becomes increasingly vital to allow for exploration and comparison of data between different scientific projects.


Asunto(s)
Curaduría de Datos/métodos , Bases de Datos Genéticas , Ontología de Genes , Redes Reguladoras de Genes/fisiología , Anotación de Secuencia Molecular/métodos , Transcripción Genética/fisiología , Animales , Humanos , Ratones
13.
Nucleic Acids Res ; 42(Database issue): D764-70, 2014 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-24270787

RESUMEN

The University of California Santa Cruz (UCSC) Genome Browser (http://genome.ucsc.edu) offers online public access to a growing database of genomic sequence and annotations for a large collection of organisms, primarily vertebrates, with an emphasis on the human and mouse genomes. The Browser's web-based tools provide an integrated environment for visualizing, comparing, analysing and sharing both publicly available and user-generated genomic data sets. As of September 2013, the database contained genomic sequence and a basic set of annotation 'tracks' for ∼90 organisms. Significant new annotations include a 60-species multiple alignment conservation track on the mouse, updated UCSC Genes tracks for human and mouse, and several new sets of variation and ENCODE data. New software tools include a Variant Annotation Integrator that returns predicted functional effects of a set of variants uploaded as a custom track, an extension to UCSC Genes that displays haplotype alleles for protein-coding genes and an expansion of data hubs that includes the capability to display remotely hosted user-provided assembly sequence in addition to annotation data. To improve European access, we have added a Genome Browser mirror (http://genome-euro.ucsc.edu) hosted at Bielefeld University in Germany.


Asunto(s)
Bases de Datos Genéticas , Genoma , Genómica , Alelos , Animales , Genoma Humano , Humanos , Internet , Ratones , Anotación de Secuencia Molecular , Polimorfismo de Nucleótido Simple , Alineación de Secuencia , Programas Informáticos
14.
Nucleic Acids Res ; 41(Database issue): D56-63, 2013 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-23193274

RESUMEN

The Encyclopedia of DNA Elements (ENCODE), http://encodeproject.org, has completed its fifth year of scientific collaboration to create a comprehensive catalog of functional elements in the human genome, and its third year of investigations in the mouse genome. Since the last report in this journal, the ENCODE human data repertoire has grown by 898 new experiments (totaling 2886), accompanied by a major integrative analysis. In the mouse genome, results from 404 new experiments became available this year, increasing the total to 583, collected during the course of the project. The University of California, Santa Cruz, makes this data available on the public Genome Browser http://genome.ucsc.edu for visual browsing and data mining. Download of raw and processed data files are all supported. The ENCODE portal provides specialized tools and information about the ENCODE data sets.


Asunto(s)
Bases de Datos Genéticas , Genoma Humano , Genómica , Animales , Humanos , Internet , Ratones , Programas Informáticos
15.
Nucleic Acids Res ; 41(Database issue): D64-9, 2013 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-23155063

RESUMEN

The University of California Santa Cruz (UCSC) Genome Browser (http://genome.ucsc.edu) offers online public access to a growing database of genomic sequence and annotations for a wide variety of organisms. The Browser is an integrated tool set for visualizing, comparing, analysing and sharing both publicly available and user-generated genomic datasets. As of September 2012, genomic sequence and a basic set of annotation 'tracks' are provided for 63 organisms, including 26 mammals, 13 non-mammal vertebrates, 3 invertebrate deuterostomes, 13 insects, 6 worms, yeast and sea hare. In the past year 19 new genome assemblies have been added, and we anticipate releasing another 28 in early 2013. Further, a large number of annotation tracks have been either added, updated by contributors or remapped to the latest human reference genome. Among these are an updated UCSC Genes track for human and mouse assemblies. We have also introduced several features to improve usability, including new navigation menus. This article provides an update to the UCSC Genome Browser database, which has been previously featured in the Database issue of this journal.


Asunto(s)
Bases de Datos Genéticas , Genómica , Animales , Genoma Humano , Humanos , Internet , Ratones , Anotación de Secuencia Molecular , Programas Informáticos
16.
Nucleic Acids Res ; 40(Database issue): D918-23, 2012 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-22086951

RESUMEN

The University of California Santa Cruz Genome Browser (http://genome.ucsc.edu) offers online public access to a growing database of genomic sequence and annotations for a wide variety of organisms. The Browser is an integrated tool set for visualizing, comparing, analyzing and sharing both publicly available and user-generated genomic data sets. In the past year, the local database has been updated with four new species assemblies, and we anticipate another four will be released by the end of 2011. Further, a large number of annotation tracks have been either added, updated by contributors, or remapped to the latest human reference genome. Among these are new phenotype and disease annotations, UCSC genes, and a major dbSNP update, which required new visualization methods. Growing beyond the local database, this year we have introduced 'track data hubs', which allow the Genome Browser to provide access to remotely located sets of annotations. This feature is designed to significantly extend the number and variety of annotation tracks that are publicly available for visualization and analysis from within our site. We have also introduced several usability features including track search and a context-sensitive menu of options available with a right-click anywhere on the Browser's image.


Asunto(s)
Bases de Datos de Ácidos Nucleicos , Genoma , Animales , Enfermedad/genética , Genoma Humano , Genómica , Humanos , Internet , Anotación de Secuencia Molecular , Fenotipo
17.
Nucleic Acids Res ; 40(Database issue): D912-7, 2012 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-22075998

RESUMEN

The Encyclopedia of DNA Elements (ENCODE) Consortium is entering its 5th year of production-level effort generating high-quality whole-genome functional annotations of the human genome. The past year has brought the ENCODE compendium of functional elements to critical mass, with a diverse set of 27 biochemical assays now covering 200 distinct human cell types. Within the mouse genome, which has been under study by ENCODE groups for the past 2 years, 37 cell types have been assayed. Over 2000 individual experiments have been completed and submitted to the Data Coordination Center for public use. UCSC makes this data available on the quality-reviewed public Genome Browser (http://genome.ucsc.edu) and on an early-access Preview Browser (http://genome-preview.ucsc.edu). Visual browsing, data mining and download of raw and processed data files are all supported. An ENCODE portal (http://encodeproject.org) provides specialized tools and information about the ENCODE data sets.


Asunto(s)
Bases de Datos de Ácidos Nucleicos , Genoma Humano , Genoma , Ratones/genética , Animales , Humanos , Internet , Anotación de Secuencia Molecular , Programas Informáticos
18.
Nucleic Acids Res ; 39(Database issue): D871-5, 2011 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-21037257

RESUMEN

The ENCODE project is an international consortium with a goal of cataloguing all the functional elements in the human genome. The ENCODE Data Coordination Center (DCC) at the University of California, Santa Cruz serves as the central repository for ENCODE data. In this role, the DCC offers a collection of high-throughput, genome-wide data generated with technologies such as ChIP-Seq, RNA-Seq, DNA digestion and others. This data helps illuminate transcription factor-binding sites, histone marks, chromatin accessibility, DNA methylation, RNA expression, RNA binding and other cell-state indicators. It includes sequences with quality scores, alignments, signals calculated from the alignments, and in most cases, element or peak calls calculated from the signal data. Each data set is available for visualization and download via the UCSC Genome Browser (http://genome.ucsc.edu/). ENCODE data can also be retrieved using a metadata system that captures the experimental parameters of each assay. The ENCODE web portal at UCSC (http://encodeproject.org/) provides information about the ENCODE data and links for access.


Asunto(s)
Bases de Datos Genéticas , Genoma Humano , Regulación de la Expresión Génica , Genómica , Humanos , Internet , Programas Informáticos , Interfaz Usuario-Computador
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...