Your browser doesn't support javascript.
loading
Montrer: 20 | 50 | 100
Résultats 1 - 8 de 8
Filtrer
Plus de filtres










Base de données
Gamme d'année
1.
Nucleic Acids Res ; 48(D1): D882-D889, 2020 01 08.
Article de Anglais | MEDLINE | ID: mdl-31713622

RÉSUMÉ

The Encyclopedia of DNA Elements (ENCODE) is an ongoing collaborative research project aimed at identifying all the functional elements in the human and mouse genomes. Data generated by the ENCODE consortium are freely accessible at the ENCODE portal (https://www.encodeproject.org/), which is developed and maintained by the ENCODE Data Coordinating Center (DCC). Since the initial portal release in 2013, the ENCODE DCC has updated the portal to make ENCODE data more findable, accessible, interoperable and reusable. Here, we report on recent updates, including new ENCODE data and assays, ENCODE uniform data processing pipelines, new visualization tools, a dataset cart feature, unrestricted public access to ENCODE data on the cloud (Amazon Web Services open data registry, https://registry.opendata.aws/encode-project/) and more comprehensive tutorials and documentation.


Sujet(s)
ADN/génétique , Bases de données génétiques , Génome humain , Logiciel , Animaux , Génomique , Humains , Souris
2.
Curr Protoc Bioinformatics ; 68(1): e89, 2019 12.
Article de Anglais | MEDLINE | ID: mdl-31751002

RÉSUMÉ

The Encyclopedia of DNA Elements (ENCODE) web portal hosts genomic data generated by the ENCODE Consortium, Genomics of Gene Regulation, The NIH Roadmap Epigenomics Consortium, and the modENCODE and modERN projects. The goal of the ENCODE project is to build a comprehensive map of the functional elements of the human and mouse genomes. Currently, the portal database stores over 500 TB of raw and processed data from over 15,000 experiments spanning assays that measure gene expression, DNA accessibility, DNA and RNA binding, DNA methylation, and 3D chromatin structure across numerous cell lines, tissue types, and differentiation states with selected genetic and molecular perturbations. The ENCODE portal provides unrestricted access to the aforementioned data and relevant metadata as a service to the scientific community. The metadata model captures the details of the experiments, raw and processed data files, and processing pipelines in human and machine-readable form and enables the user to search for specific data either using a web browser or programmatically via REST API. Furthermore, ENCODE data can be freely visualized or downloaded for additional analyses. © 2019 The Authors. Basic Protocol: Query the portal Support Protocol 1: Batch downloading Support Protocol 2: Using the cart to download files Support Protocol 3: Visualize data Alternate Protocol: Query building and programmatic access.


Sujet(s)
Chromatine/métabolisme , ADN/génétique , Bases de données génétiques , Épigénomique/méthodes , Animaux , Méthylation de l'ADN , Génome humain , Humains , Internet , Métadonnées , Souris , Logiciel
4.
Nucleic Acids Res ; 46(D1): D794-D801, 2018 01 04.
Article de Anglais | MEDLINE | ID: mdl-29126249

RÉSUMÉ

The Encyclopedia of DNA Elements (ENCODE) Data Coordinating Center has developed the ENCODE Portal database and website as the source for the data and metadata generated by the ENCODE Consortium. Two principles have motivated the design. First, experimental protocols, analytical procedures and the data themselves should be made publicly accessible through a coherent, web-based search and download interface. Second, the same interface should serve carefully curated metadata that record the provenance of the data and justify its interpretation in biological terms. Since its initial release in 2013 and in response to recommendations from consortium members and the wider community of scientists who use the Portal to access ENCODE data, the Portal has been regularly updated to better reflect these design principles. Here we report on these updates, including results from new experiments, uniformly-processed data from other projects, new visualization tools and more comprehensive metadata to describe experiments and analyses. Additionally, the Portal is now home to meta(data) from related projects including Genomics of Gene Regulation, Roadmap Epigenome Project, Model organism ENCODE (modENCODE) and modERN. The Portal now makes available over 13000 datasets and their accompanying metadata and can be accessed at: https://www.encodeproject.org/.


Sujet(s)
ADN/génétique , Bases de données génétiques , Composants de gène , Génomique , Séquençage nucléotidique à haut débit , Métadonnées , Animaux , Caenorhabditis elegans/génétique , Affichage de données , Jeux de données comme sujet , Drosophila melanogaster/génétique , Prévision , Génome humain , Humains , Souris/génétique , Interface utilisateur
5.
PLoS One ; 11(5): e0156034, 2016.
Article de Anglais | MEDLINE | ID: mdl-27206019

RÉSUMÉ

Some cyanobacteria are capable of differentiating a variety of cell types in response to environmental factors. For instance, in low nitrogen conditions, some cyanobacteria form heterocysts, which are specialized for N2 fixation. Many heterocyst-forming cyanobacteria have DNA elements interrupting key N2 fixation genes, elements that are excised during heterocyst differentiation. While the mechanism for the excision of the element has been well-studied, many questions remain regarding the introduction of the elements into the cyanobacterial lineage and whether they have been retained ever since or have been lost and reintroduced. To examine the evolutionary relationships and possible function of DNA sequences that interrupt genes of heterocyst-forming cyanobacteria, we identified and compared 101 interruption element sequences within genes from 38 heterocyst-forming cyanobacterial genomes. The interruption element lengths ranged from about 1 kb (the minimum able to encode the recombinase responsible for element excision), up to nearly 1 Mb. The recombinase gene sequences served as genetic markers that were common across the interruption elements and were used to track element evolution. Elements were found that interrupted 22 different orthologs, only five of which had been previously observed to be interrupted by an element. Most of the newly identified interrupted orthologs encode proteins that have been shown to have heterocyst-specific activity. However, the presence of interruption elements within genes with no known role in N2 fixation, as well as in three non-heterocyst-forming cyanobacteria, indicates that the processes that trigger the excision of elements may not be limited to heterocyst development or that the elements move randomly within genomes. This comprehensive analysis provides the framework to study the history and behavior of these unique sequences, and offers new insight regarding the frequency and persistence of interruption elements in heterocyst-forming cyanobacteria.


Sujet(s)
Protéines bactériennes/génétique , Cyanobactéries/génétique , ADN/génétique , Fixation de l'azote/génétique , Phylogenèse
6.
Article de Anglais | MEDLINE | ID: mdl-26980513

RÉSUMÉ

The Encyclopedia of DNA Elements (ENCODE) Data Coordinating Center (DCC) is responsible for organizing, describing and providing access to the diverse data generated by the ENCODE project. The description of these data, known as metadata, includes the biological sample used as input, the protocols and assays performed on these samples, the data files generated from the results and the computational methods used to analyze the data. Here, we outline the principles and philosophy used to define the ENCODE metadata in order to create a metadata standard that can be applied to diverse assays and multiple genomic projects. In addition, we present how the data are validated and used by the ENCODE DCC in creating the ENCODE Portal (https://www.encodeproject.org/). Database URL: www.encodeproject.org.


Sujet(s)
Biologie informatique/méthodes , ADN/génétique , Bases de données génétiques , Algorithmes , Animaux , Caenorhabditis elegans , Biologie informatique/normes , Collecte de données , Drosophila melanogaster , Séquençage nucléotidique à haut débit , Humains , Souris , Acides nucléiques/génétique , Contrôle de qualité , Reproductibilité des résultats , Alignement de séquences
7.
ISME J ; 9(7): 1557-69, 2015 Jul.
Article de Anglais | MEDLINE | ID: mdl-25514535

RÉSUMÉ

Biological N2 fixation is an important nitrogen source for surface ocean microbial communities. However, nearly all information on the diversity and gene expression of organisms responsible for oceanic N2 fixation in the environment has come from targeted approaches that assay only a small number of genes and organisms. Using genomes of diazotrophic cyanobacteria to extract reads from extensive meta-genomic and -transcriptomic libraries, we examined diazotroph diversity and gene expression from the Amazon River plume, an area characterized by salinity and nutrient gradients. Diazotroph genome and transcript sequences were most abundant in the transitional waters compared with lower salinity or oceanic water masses. We were able to distinguish two genetically divergent phylotypes within the Hemiaulus-associated Richelia sequences, which were the most abundant diazotroph sequences in the data set. Photosystem (PS)-II transcripts in Richelia populations were much less abundant than those in Trichodesmium, and transcripts from several Richelia PS-II genes were absent, indicating a prominent role for cyclic electron transport in Richelia. In addition, there were several abundant regulatory transcripts, including one that targets a gene involved in PS-I cyclic electron transport in Richelia. High sequence coverage of the Richelia transcripts, as well as those from Trichodesmium populations, allowed us to identify expressed regions of the genomes that had been overlooked by genome annotations. High-coverage genomic and transcription analysis enabled the characterization of distinct phylotypes within diazotrophic populations, revealed a distinction in a core process between dominant populations and provided evidence for a prominent role for noncoding RNAs in microbial communities.


Sujet(s)
Cyanobactéries/métabolisme , Fixation de l'azote/physiologie , Azote/métabolisme , Rivières/microbiologie , Eau de mer/microbiologie , Cyanobactéries/génétique , Diatomées/métabolisme , Régulation de l'expression des gènes bactériens/physiologie , Fixation de l'azote/génétique , Rivières/composition chimique , Eau de mer/composition chimique , Transcriptome
8.
Nat Commun ; 4: 1767, 2013.
Article de Anglais | MEDLINE | ID: mdl-23612308

RÉSUMÉ

Diatoms with symbiotic N2-fixing cyanobacteria are often abundant in the oligotrophic open ocean gyres. The most abundant cyanobacterial symbionts form heterocysts (specialized cells for N2 fixation) and provide nitrogen (N) to their hosts, but their morphology, cellular locations and abundances differ depending on the host. Here we show that the location of the symbiont and its dependency on the host are linked to the evolution of the symbiont genome. The genome of Richelia (found inside the siliceous frustule of Hemiaulus) is reduced and lacks ammonium transporters, nitrate/nitrite reductases and glutamine:2-oxoglutarate aminotransferase. In contrast, the genome of the closely related Calothrix (found outside the frustule of Chaetoceros) is more similar to those of free-living heterocyst-forming cyanobacteria. The genome of Richelia is an example of metabolic streamlining that has implications for the evolution of N2-fixing symbiosis and potentially for manipulating plant-cyanobacterial interactions.


Sujet(s)
Cyanobactéries/génétique , Diatomées/microbiologie , Génome bactérien/génétique , Voies et réseaux métaboliques/génétique , Azote/métabolisme , Délétion de séquence/génétique , Symbiose/génétique , Cytométrie en flux , Gènes bactériens/génétique , Nitrogenase/génétique , Opéron/génétique , Phylogenèse
SÉLECTION CITATIONS
DÉTAIL DE RECHERCHE
...