Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 19 de 19
Filtrar
1.
Nucleic Acids Res ; 50(D1): D316-D325, 2022 01 07.
Artículo en Inglés | MEDLINE | ID: mdl-34751401

RESUMEN

ReMap (https://remap.univ-amu.fr) aims to provide manually curated, high-quality catalogs of regulatory regions resulting from a large-scale integrative analysis of DNA-binding experiments in Human, Mouse, Fly and Arabidopsis thaliana for hundreds of transcription factors and regulators. In this 2022 update, we have uniformly processed >11 000 DNA-binding sequencing datasets from public sources across four species. The updated Human regulatory atlas includes 8103 datasets covering a total of 1210 transcriptional regulators (TRs) with a catalog of 182 million (M) peaks, while the updated Arabidopsis atlas reaches 4.8M peaks, 423 TRs across 694 datasets. Also, this ReMap release is enriched by two new regulatory catalogs for Mus musculus and Drosophila melanogaster. First, the Mouse regulatory catalog consists of 123M peaks across 648 TRs as a result of the integration and validation of 5503 ChIP-seq datasets. Second, the Drosophila melanogaster catalog contains 16.6M peaks across 550 TRs from the integration of 1205 datasets. The four regulatory catalogs are browsable through track hubs at UCSC, Ensembl and NCBI genome browsers. Finally, ReMap 2022 comes with a new Cis Regulatory Module identification method, improved quality controls, faster search results, and better user experience with an interactive tour and video tutorials on browsing and filtering ReMap catalogs.


Asunto(s)
Arabidopsis/genética , Bases de Datos Genéticas , Drosophila melanogaster/genética , Elementos Reguladores de la Transcripción , Programas Informáticos , Factores de Transcripción/genética , Transcripción Genética , Animales , Arabidopsis/metabolismo , Atlas como Asunto , Secuencia de Bases , Sitios de Unión , ADN/genética , ADN/metabolismo , Conjuntos de Datos como Asunto , Drosophila melanogaster/metabolismo , Redes Reguladoras de Genes , Humanos , Internet , Ratones , Análisis de Secuencia de ADN , Factores de Transcripción/clasificación , Factores de Transcripción/metabolismo
2.
Nucleic Acids Res ; 48(D1): D180-D188, 2020 01 08.
Artículo en Inglés | MEDLINE | ID: mdl-31665499

RESUMEN

ReMap (http://remap.univ-amu.fr) aims to provide the largest catalogs of high-quality regulatory regions resulting from a large-scale integrative analysis of hundreds of transcription factors and regulators from DNA-binding experiments in Human and Arabidopsis (Arabidopsis thaliana). In this 2020 update of ReMap we have collected, analyzed and retained after quality control 2764 new human ChIP-seq and 208 ChIP-exo datasets available from public sources. The updated human atlas totalize 5798 datasets covering a total of 1135 transcriptional regulators (TRs) with a catalog of 165 million (M) peaks. This ReMap update comes with two unique Arabidopsis regulatory catalogs. First, a catalog of 372 Arabidopsis TRs across 2.6M peaks as a result of the integration of 509 ChIP-seq and DAP-seq datasets. Second, a catalog of 33 histone modifications and variants across 4.5M peaks from the integration of 286 ChIP-seq datasets. All catalogs are made available through track hubs at Ensembl and UCSC Genome Browsers. Additionally, this update comes with a new web framework providing an interactive user-interface, including improved search features. Finally, full programmatically access to the underlying data is available using a RESTful API together with a new R Shiny interface for a TRs binding enrichment analysis tool.


Asunto(s)
Arabidopsis/genética , Bases de Datos Genéticas , Elementos Reguladores de la Transcripción , Factores de Transcripción/metabolismo , Arabidopsis/metabolismo , Secuenciación de Inmunoprecipitación de Cromatina , Proteínas de Unión al ADN/metabolismo , Código de Histonas , Humanos , Interfaz Usuario-Computador
3.
Hum Reprod ; 36(3): 693-701, 2021 02 18.
Artículo en Inglés | MEDLINE | ID: mdl-33332558

RESUMEN

After the two meiotic divisions, haploid round spermatids undergo dramatic changes to become mature spermatozoa. One of the main transformations consists of compacting the cell nucleus to confer the sperm its remarkable hydrodynamic property and to protect its DNA from the oxidative stress it will encounter during its reproductive journey. Here, we studied an infertile subject with low sperm count, poor motility and highly abnormal spermatozoa with strikingly large heads due to highly uncondensed nuclear sperm DNA. Whole-exome sequencing was performed on the subject's DNA to identify the genetic defect responsible for this severe sperm anomaly. Bioinformatics analysis of exome sequence data uncovered a homozygous loss of function variant, ENST00000368559.7:c.718-1G>A, altering a consensus splice site expected to prevent the synthesis of the nucleoporin 210 like (NUP210L) protein. High-resolution mass spectrometry of sperm protein extracts did not reveal any NUP210L peptide sequence in the patient's sperm, contrary to what was observed in control donors, thus confirming the absence of NUP210L in the patient's sperm. Interestingly, homozygous Nup210l knock-out mice have been shown to be infertile due to a reduced sperm count, a high proportion of round-headed sperm, other head and flagella defects and a poor motility. NUP210L is almost exclusively expressed in the testis and sequence analogy suggests that it encodes a nuclear pore membrane glycoprotein. The protein might be crucial to regulate nuclear trafficking during and/or before spermiogenesis, its absence potentially impeding adequate nuclear compaction by preventing the entry of histone variants/transition proteins/protamines into the nucleus and/or by preventing the adequate replacement of core histones. This work describes a new gene necessary for male fertility, potentially improving the efficiency of the genetic diagnosis of male infertility. The function of NUP210L still remains to be resolved and its future investigation will help to understand the complex mechanisms necessary for sperm compaction.


Asunto(s)
Infertilidad Masculina , Poro Nuclear , Animales , Cromatina/genética , Humanos , Infertilidad Masculina/genética , Masculino , Glicoproteínas de Membrana , Ratones , Poro Nuclear/genética , Espermatogénesis , Espermatozoides
4.
Genes Dev ; 27(15): 1680-92, 2013 Aug 01.
Artículo en Inglés | MEDLINE | ID: mdl-23884607

RESUMEN

The conversion of male germ cell chromatin to a nucleoprotamine structure is fundamental to the life cycle, yet the underlying molecular details remain obscure. Here we show that an essential step is the genome-wide incorporation of TH2B, a histone H2B variant of hitherto unknown function. Using mouse models in which TH2B is depleted or C-terminally modified, we show that TH2B directs the final transformation of dissociating nucleosomes into protamine-packed structures. Depletion of TH2B induces compensatory mechanisms that permit histone removal by up-regulating H2B and programming nucleosome instability through targeted histone modifications, including lysine crotonylation and arginine methylation. Furthermore, after fertilization, TH2B reassembles onto the male genome during protamine-to-histone exchange. Thus, TH2B is a unique histone variant that plays a key role in the histone-to-protamine packing of the male genome and guides genome-wide chromatin transitions that both precede and follow transmission of the male genome to the egg.


Asunto(s)
Cromatina/metabolismo , Histonas/metabolismo , Protaminas/metabolismo , Animales , Epigénesis Genética , Femenino , Fertilización/fisiología , Regulación del Desarrollo de la Expresión Génica , Genoma , Histonas/genética , Masculino , Meiosis , Ratones , Nucleosomas , Espermatogénesis/genética , Testículo/metabolismo
5.
J Biol Chem ; 291(20): 10684-99, 2016 May 13.
Artículo en Inglés | MEDLINE | ID: mdl-27002148

RESUMEN

Glioblastomas are the most common primary brain tumors, highly vascularized, infiltrating, and resistant to current therapies. This cancer leads to a fatal outcome in less than 18 months. The aggressive behavior of glioblastomas, including resistance to current treatments and tumor recurrence, has been attributed to glioma stemlike/progenitor cells. The transcription factor EGR1 (early growth response 1), a member of a zinc finger transcription factor family, has been described as tumor suppressor in gliomas when ectopically overexpressed. Although EGR1 expression in human glioblastomas has been associated with patient survival, its precise location in tumor territories as well as its contribution to glioblastoma progression remain elusive. In the present study, we show that EGR1-expressing cells are more frequent in high grade gliomas where the nuclear expression of EGR1 is restricted to proliferating/progenitor cells. We show in primary cultures of glioma stemlike cells that EGR1 contributes to stemness marker expression and proliferation by orchestrating a PDGFA-dependent growth-stimulatory loop. In addition, we demonstrate that EGR1 acts as a positive regulator of several important genes, including SHH, GLI1, GLI2, and PDGFA, previously linked to the maintenance and proliferation of glioma stemlike cells.


Asunto(s)
Comunicación Autocrina , Neoplasias Encefálicas/metabolismo , Proliferación Celular , Proteína 1 de la Respuesta de Crecimiento Precoz/metabolismo , Regulación Neoplásica de la Expresión Génica , Glioblastoma/metabolismo , Proteínas de Neoplasias/metabolismo , Células Madre Neoplásicas/metabolismo , Factor de Crecimiento Derivado de Plaquetas/biosíntesis , Neoplasias Encefálicas/patología , Femenino , Glioblastoma/patología , Humanos , Masculino , Células Madre Neoplásicas/patología , Células Tumorales Cultivadas
6.
EMBO J ; 31(19): 3809-20, 2012 Oct 03.
Artículo en Inglés | MEDLINE | ID: mdl-22922464

RESUMEN

Male germ cell differentiation is a highly regulated multistep process initiated by the commitment of progenitor cells into meiosis and characterized by major chromatin reorganizations in haploid spermatids. We report here that a single member of the double bromodomain BET factors, Brdt, is a master regulator of both meiotic divisions and post-meiotic genome repackaging. Upon its activation at the onset of meiosis, Brdt drives and determines the developmental timing of a testis-specific gene expression program. In meiotic and post-meiotic cells, Brdt initiates a genuine histone acetylation-guided programming of the genome by activating essential genes and repressing a 'progenitor cells' gene expression program. At post-meiotic stages, a global chromatin hyperacetylation gives the signal for Brdt's first bromodomain to direct the genome-wide replacement of histones by transition proteins. Brdt is therefore a unique and essential regulator of male germ cell differentiation, which, by using various domains in a developmentally controlled manner, first drives a specific spermatogenic gene expression program, and later controls the tight packaging of the male genome.


Asunto(s)
Proteínas Nucleares/metabolismo , Espermatogénesis/fisiología , Animales , Perfilación de la Expresión Génica , Genoma/fisiología , Histona Acetiltransferasas/metabolismo , Histonas/metabolismo , Masculino , Meiosis/fisiología , Ratones , Espermatozoides/crecimiento & desarrollo , Espermatozoides/metabolismo
7.
Stem Cells ; 31(5): 979-91, 2013 May.
Artículo en Inglés | MEDLINE | ID: mdl-23362228

RESUMEN

Chromatin states are believed to play a key role in distinct patterns of gene expression essential for self-renewal and pluripotency of embryonic stem cells (ESCs); however, the genes governing the establishment and propagation of the chromatin signature characteristic of pluripotent cells are poorly understood. Here, we show that conditional deletion of the histone acetyltransferase cofactor Trrap in mouse ESCs triggers unscheduled differentiation associated with loss of histone acetylation, condensation of chromatin into distinct foci (heterochromatization), and uncoupling of H3K4 dimethylation and H3K27 trimethylation. Trrap loss results in downregulation of stemness master genes Nanog, Oct4, and Sox2 and marked upregulation of specific differentiation markers from the three germ layers. Chromatin immunoprecipitation-sequencing analysis of genome-wide binding revealed a significant overlap between Oct4 and Trrap binding in ESCs but not in differentiated mouse embryonic fibroblasts, further supporting a functional interaction between Trrap and Oct4 in the maintenance of stemness. Remarkably, failure to downregulate Trrap prevents differentiation of ESCs, suggesting that downregulation of Trrap may be a critical step guiding transcriptional reprogramming and differentiation of ESCs. These findings establish Trrap as a critical part of the mechanism that restricts differentiation and promotes the maintenance of key features of ESCs.


Asunto(s)
Proteínas Adaptadoras Transductoras de Señales/metabolismo , Células Madre Embrionarias/citología , Histona Acetiltransferasas/metabolismo , Proteínas Nucleares/metabolismo , Proteínas Adaptadoras Transductoras de Señales/genética , Animales , Apoptosis/fisiología , Diferenciación Celular/fisiología , Cromatina/metabolismo , Inmunoprecipitación de Cromatina , Regulación hacia Abajo , Células Madre Embrionarias/enzimología , Células Madre Embrionarias/metabolismo , Regulación del Desarrollo de la Expresión Génica , Histona Acetiltransferasas/genética , Histonas/genética , Histonas/metabolismo , Ratones , Ratones Noqueados , Proteínas Nucleares/genética , Factor 3 de Transcripción de Unión a Octámeros/genética , Factor 3 de Transcripción de Unión a Octámeros/metabolismo , Regiones Promotoras Genéticas
8.
Database (Oxford) ; 20232023 05 23.
Artículo en Inglés | MEDLINE | ID: mdl-37221041

RESUMEN

Chagas disease is a parasitical disease caused by Trypanosoma cruzi which affects ∼7 million people worldwide. Per year, ∼10 000 people die from this pathology. Indeed, ∼30% of humans develop severe chronic forms, including cardiac, digestive or neurological disorders, for which there is still no treatment. In order to facilitate research on Chagas disease, a manual curation of all papers corresponding to 'Chagas disease' referenced on PubMed has been performed. All deregulated molecules in hosts (all mammals, humans, mice or others) following T. cruzi infection were retrieved and included in a database, named ChagasDB. A website has been developed to make this database accessible to all. In this article, we detail the construction of this database, its contents and how to use it. Database URL https://chagasdb.tagc.univ-amu.fr.


Asunto(s)
Enfermedad de Chagas , Humanos , Animales , Ratones , Bases de Datos Factuales , PubMed , Mamíferos
9.
BMC Bioinformatics ; 13: 19, 2012 Jan 31.
Artículo en Inglés | MEDLINE | ID: mdl-22292669

RESUMEN

BACKGROUND: Deciphering gene regulatory networks by in silico approaches is a crucial step in the study of the molecular perturbations that occur in diseases. The development of regulatory maps is a tedious process requiring the comprehensive integration of various evidences scattered over biological databases. Thus, the research community would greatly benefit from having a unified database storing known and predicted molecular interactions. Furthermore, given the intrinsic complexity of the data, the development of new tools offering integrated and meaningful visualizations of molecular interactions is necessary to help users drawing new hypotheses without being overwhelmed by the density of the subsequent graph. RESULTS: We extend the previously developed TranscriptomeBrowser database with a set of tables containing 1,594,978 human and mouse molecular interactions. The database includes: (i) predicted regulatory interactions (computed by scanning vertebrate alignments with a set of 1,213 position weight matrices), (ii) potential regulatory interactions inferred from systematic analysis of ChIP-seq experiments, (iii) regulatory interactions curated from the literature, (iv) predicted post-transcriptional regulation by micro-RNA, (v) protein kinase-substrate interactions and (vi) physical protein-protein interactions. In order to easily retrieve and efficiently analyze these interactions, we developed In-teractomeBrowser, a graph-based knowledge browser that comes as a plug-in for Transcriptome-Browser. The first objective of InteractomeBrowser is to provide a user-friendly tool to get new insight into any gene list by providing a context-specific display of putative regulatory and physical interactions. To achieve this, InteractomeBrowser relies on a "cell compartments-based layout" that makes use of a subset of the Gene Ontology to map gene products onto relevant cell compartments. This layout is particularly powerful for visual integration of heterogeneous biological information and is a productive avenue in generating new hypotheses. The second objective of InteractomeBrowser is to fill the gap between interaction databases and dynamic modeling. It is thus compatible with the network analysis software Cytoscape and with the Gene Interaction Network simulation software (GINsim). We provide examples underlying the benefits of this visualization tool for large gene set analysis related to thymocyte differentiation. CONCLUSIONS: The InteractomeBrowser plugin is a powerful tool to get quick access to a knowledge database that includes both predicted and validated molecular interactions. InteractomeBrowser is available through the TranscriptomeBrowser framework and can be found at: http://tagc.univ-mrs.fr/tbrowser/. Our database is updated on a regular basis.


Asunto(s)
Perfilación de la Expresión Génica , Redes Reguladoras de Genes , Genómica/métodos , Programas Informáticos , Animales , Diferenciación Celular , Bases de Datos Genéticas , Perros , Humanos , Ratones , Análisis de Secuencia por Matrices de Oligonucleótidos , Proteínas/genética , Proteínas/metabolismo , Ratas , Timocitos/citología , Timocitos/metabolismo , Interfaz Usuario-Computador
10.
Sci Rep ; 11(1): 13691, 2021 07 01.
Artículo en Inglés | MEDLINE | ID: mdl-34211067

RESUMEN

Integrating -omics data with biological networks such as protein-protein interaction networks is a popular and useful approach to interpret expression changes of genes in changing conditions, and to identify relevant cellular pathways, active subnetworks or network communities. Yet, most -omics data integration tools are restricted to static networks and therefore cannot easily be used for analyzing time-series data. Determining regulations or exploring the network structure over time requires time-dependent networks which incorporate time as one component in their structure. Here, we present a method to project time-series data on sequential layers of a multilayer network, thus creating a temporal multilayer network (tMLN). We implemented this method as a Cytoscape app we named TimeNexus. TimeNexus allows to easily create, manage and visualize temporal multilayer networks starting from a combination of node and edge tables carrying the information on the temporal network structure. To allow further analysis of the tMLN, TimeNexus creates and passes on regular Cytoscape networks in form of static versions of the tMLN in three different ways: (i) over the entire set of layers, (ii) over two consecutive layers at a time, (iii) or on one single layer at a time. We combined TimeNexus with the Cytoscape apps PathLinker and AnatApp/ANAT to extract active subnetworks from tMLNs. To test the usability of our app, we applied TimeNexus together with PathLinker or ANAT on temporal expression data of the yeast cell cycle and were able to identify active subnetworks relevant for different cell cycle phases. We furthermore used TimeNexus on our own temporal expression data from a mouse pain assay inducing hindpaw inflammation and detected active subnetworks relevant for an inflammatory response to injury, including immune response, cell stress response and regulation of apoptosis. TimeNexus is freely available from the Cytoscape app store at https://apps.cytoscape.org/apps/TimeNexus .

11.
Genomics ; 93(3): 213-20, 2009 Mar.
Artículo en Inglés | MEDLINE | ID: mdl-19059335

RESUMEN

The Alternative Splicing and Transcript Diversity database (ASTD) gives access to a vast collection of alternative transcripts that integrate transcription initiation, polyadenylation and splicing variant data. Alternative transcripts are derived from the mapping of transcribed sequences to the complete human, mouse and rat genomes using an extension of the computational pipeline developed for the ASD (Alternative Splicing Database) and ATD (Alternative Transcript Diversity) databases, which are now superseded by ASTD. For the human genome, ASTD identifies splicing variants, transcription initiation variants and polyadenylation variants in 68%, 68% and 62% of the gene set, respectively, consistent with current estimates for transcription variation. Users can access ASTD through a variety of browsing and query tools, including expression state-based queries for the identification of tissue-specific isoforms. Participating laboratories have experimentally validated a subset of ASTD-predicted alternative splice forms and alternative polyadenylation forms that were not previously reported. The ASTD database can be accessed at http://www.ebi.ac.uk/astd.


Asunto(s)
Empalme Alternativo/genética , Bases de Datos Genéticas , Animales , Sistemas de Administración de Bases de Datos , Humanos , Almacenamiento y Recuperación de la Información/métodos , Ratones , Ratas , Reproducibilidad de los Resultados , Programas Informáticos , Interfaz Usuario-Computador
12.
Int J Antimicrob Agents ; 56(6): 106153, 2020 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-32911069

RESUMEN

OBJECTIVES: Fluoroquinolone (FQ)-resistant mutants were previously selected from the live vaccine strain (LVS) of Francisella tularensis (F. tularensis) subsp. holarctica. This study further characterised all genetic changes that occurred in these mutants during the evolutionary trajectory toward high-level FQ resistance, and their potential impact on F. tularensis antibiotic resistance and intracellular fitness. METHODS: The whole genomes of FQ-resistant mutants were determined and compared with those of their parental strain. All detected mutations were evaluated for their potential impact on FQ resistance and intracellular multiplication of F. tularensis. RESULTS: As compared with the parental LVS genome, 28 mutations were found in the derived FQ-resistant mutants. These mutations involved all genes encoding type II topoisomerases (i.e. gyrA, gyrB, parC, and parE). Interestingly, some of them were not previously associated with FQ resistance, warranting further characterisation. Mutations associated with FQ resistance were also found in other genes, including three encoding proteins involved in transport processes. Most of the detected mutations did not alter multiplication of the corresponding mutants in J774 cells. In contrast, all mutations at locus FTL_0439 encoding FupA/B, a membrane protein involved in iron transport, were associated with FQ resistance and fitness loss. CONCLUSION: FQ resistance in F. tularensis is complex and may involve single or combined mutations in genes encoding type II topoisomerases, transport systems and FupA/B. In vivo studies are now required to assess the potential role of these mutations in FQ treatment failures.


Asunto(s)
Antibacterianos/farmacología , Ciprofloxacina/farmacología , Farmacorresistencia Bacteriana/genética , Francisella/efectos de los fármacos , Francisella/genética , Transporte Biológico/genética , Proteínas Portadoras/antagonistas & inhibidores , ADN-Topoisomerasas/genética , Fluoroquinolonas/farmacología , Genoma Bacteriano/genética , Humanos , Pruebas de Sensibilidad Microbiana , Mutación/genética , Tularemia/tratamiento farmacológico , Tularemia/microbiología , Secuenciación Completa del Genoma
13.
Nucleic Acids Res ; 35(6): 1947-57, 2007.
Artículo en Inglés | MEDLINE | ID: mdl-17339231

RESUMEN

High throughput EST and full-length cDNA sequencing have revealed extensive variations at the 3' ends of mammalian transcripts. Whether all of these changes are biologically meaningful has been the subject of controversy, as such, results may reflect in part transcription or polyadenylation leakage. We selected here a set of tandem poly(A) sites predicted from EST/cDNA sequence analysis that (i) are conserved between human and mouse, (ii) produce alternative 3' isoforms with unusual size features and (iii) are not documented in current genome databases, and we submitted these sites to experimental validation in mouse tissues. Out of 86 tested poly(A) sites from 44 genes, 84 were individually confirmed using a specially devised RT-PCR strategy. We then focused on validating the exon structure between distant tandem poly(A) sites separated by over 3 kb, and between stop codons and alternative poly(A) sites located at 4.5 kb or more, using a long-distance RT-PCR strategy. In most cases, long transcripts spanning the whole poly(A)-poly(A) or stop-poly(A) distance were detected, confirming that tandem sites were part of the same transcription unit. Given the apparent conservation of these long alternative 3' ends, different regulatory functions can be foreseen, depending on the location where transcription starts.


Asunto(s)
Regiones no Traducidas 3'/química , Poli A/análisis , Animales , Secuencia de Bases , Células Cultivadas , Secuencia Conservada , ADN Complementario/química , Bases de Datos de Ácidos Nucleicos , Etiquetas de Secuencia Expresada/química , Humanos , Ratones , Ratones Endogámicos BALB C , Poliadenilación , Isoformas de Proteínas/biosíntesis , Isoformas de Proteínas/genética , Reacción en Cadena de la Polimerasa de Transcriptasa Inversa
14.
PLoS Comput Biol ; 2(5): e43, 2006 May.
Artículo en Inglés | MEDLINE | ID: mdl-16699595

RESUMEN

Alternative polyadenylation sites produce transcript isoforms with 3' untranslated regions (UTRs) of different lengths. If a microRNA (miRNA) target is present in the UTR, then only those target-containing isoforms should be sensitive to control by a cognate miRNA. We carried out a systematic examination of 3' UTRs containing multiple poly(A) sites and putative miRNA targets. Based on expressed sequence tag (EST) counts and EST library information, we observed that levels of isoforms containing targets for miR-1 or miR-124, two miRNAs causing downregulation of transcript levels, were reduced in tissues expressing the corresponding miRNA. This analysis was repeated for all conserved 7-mers in 3' UTRs, resulting in a selection of 312 motifs. We show that this set is significantly enriched in known miRNA targets and mRNA-destabilizing elements, which validates our initial hypothesis. We scanned the human genome for possible cognate miRNAs and identified phylogenetically conserved precursors matching our motifs. This analysis can help identify target-miRNA couples that went undetected in previous screens, but it may also reveal targets for other types of regulatory factors.


Asunto(s)
Regulación hacia Abajo/genética , MicroARNs/genética , Transcripción Genética/genética , Animales , Secuencia de Bases , Etiquetas de Secuencia Expresada , Humanos , Ratones , Especificidad de Órganos , Isoformas de Proteínas/genética , ARN Mensajero/genética
15.
BMC Bioinformatics ; 7: 169, 2006 Mar 23.
Artículo en Inglés | MEDLINE | ID: mdl-16556303

RESUMEN

BACKGROUND: The three major mechanisms that regulate transcript formation involve the selection of alternative sites for transcription start (TS), splicing, and polyadenylation. Currently there are efforts that collect data & annotation individually for each of these variants. It is important to take an integrated view of these data sets and to derive a data set of alternate transcripts along with consolidated annotation. We have been developing in the past computational pipelines that generate value-added data at genome-scale on individual variant types; these include AltSplice on splicing and AltPAS on polyadenylation. We now extend these pipelines and integrate the resultant data sets to facilitate an integrated view of the contributions from splicing and polyadenylation in the formation of transcript variants. DESCRIPTION: The AltSplice pipeline examines gene-transcript alignments and delineates alternative splice events and splice patterns; this pipeline is extended as AltTrans to delineate isoform transcript patterns for each of which both introns/exons and 'terminating' polyA site are delineated; EST/mRNA sequences that qualify the transcript pattern confirm both the underlying splicing and polyadenylation. The AltPAS pipeline examines gene-transcript alignments and delineates all potential polyA sites irrespective of underlying splicing patterns. Resultant polyA sites from both AltTrans and AltPAS are merged. The generated database reports data on alternative splicing, alternative polyadenylation and the resultant alternate transcript patterns; the basal data is annotated for various biological features. The data (named as integrated AltTrans data) generated for both the organisms of human and mouse is made available through the Alternate Transcript Diversity web site at http://www.ebi.ac.uk/atd/. CONCLUSION: The reported data set presents alternate transcript patterns that are annotated for both alternative splicing and alternative polyadenylation. Results based on current transcriptome data indicate that the contribution of alternative splicing is larger than that of alternative polyadenylation.


Asunto(s)
Empalme Alternativo/genética , Mapeo Cromosómico/métodos , Análisis Mutacional de ADN/métodos , Poliadenilación/genética , Programas Informáticos , Factores de Transcripción/genética , Variación Genética/genética
16.
BMC Genomics ; 7: 189, 2006 Jul 26.
Artículo en Inglés | MEDLINE | ID: mdl-16872498

RESUMEN

BACKGROUND: Alternative polyadenylation is a widespread mechanism contributing to transcript diversity in eukaryotes. Over half of mammalian genes are alternatively polyadenylated. Our understanding of poly(A) site evolution is limited by the lack of a reliable identification of conserved, equivalent poly(A) sites among species. We introduce here a working definition of conserved poly(A) sites as sites that are both (i) properly aligned in human and mouse orthologous 3' untranslated regions (UTRs) and (ii) supported by EST or cDNA data in both species. RESULTS: We identified about 4800 such conserved poly(A) sites covering one third of the orthologous gene set studied. Characteristics of conserved poly(A) sites such as processing efficiency and tissue-specificity were analyzed. Conserved sites show a higher processing efficiency but no difference in tissular distribution when compared to non-conserved sites. In general, alternative poly(A) sites are species-specific and involve minor, non-conserved sites that are unlikely to play essential roles. However, there are about 500 genes with conserved tandem poly(A) sites. A significant fraction of these conserved tandems display a conserved arrangement of major/minor sites in their 3' UTR, suggesting that these alternative 3' ends may be under selection. CONCLUSION: This analysis allows us to identify potential functional alternative poly(A) sites and provides clues on the selective mechanisms at play in the appearance of multiple poly(A) sites and their maintenance in the 3' UTRs of genes.


Asunto(s)
Empalme Alternativo/genética , Secuencia Conservada , Evolución Molecular , Mamíferos/genética , Poliadenilación/genética , Regiones no Traducidas 3'/genética , Animales , Eficiencia , Procesamiento Automatizado de Datos , Etiquetas de Secuencia Expresada , Humanos , Ratones , Modelos Teóricos , Especificidad de Órganos , Poli A/genética , Procesamiento Postranscripcional del ARN , Ratas , Distribución Tisular
17.
Mol Biosyst ; 5(12): 1787-96, 2009 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-19763337

RESUMEN

Systems biologists are facing the difficult challenge of modelling and analysing regulatory networks encompassing numerous and diverse components and interactions. Furthermore, available data sets are often qualitative, which complicates the definition of truly quantitative models. In order to build comprehensive and predictive models, there is clearly a need for incremental strategies, enabling the progression from relatively small to large scale models. Leaning on former models, we have defined a logical model for three regulatory modules involved in the control of the mitotic cell cycle in budding yeast, namely the core cell cycle module, the morphogenetic checkpoint, and a module controlling the exit from mitosis. Consistency with available data has been assessed through a systematic analysis of model behaviours for various genetic backgrounds and other perturbations. Next, we take advantage of compositional facilities of the logical formalism to combine these three models in order to generate a single comprehensive model involving over thirty regulatory components. The resulting logical model preserves all relevant characteristics of the original modules, while enabling the simulation of more sophisticated experiments.


Asunto(s)
Modelos Biológicos , Saccharomycetales/fisiología , Biología de Sistemas/métodos , Ciclo Celular/genética , Ciclo Celular/fisiología , Simulación por Computador , Proteínas Fúngicas/genética , Mutación , Saccharomycetales/citología , Saccharomycetales/genética , Transducción de Señal
18.
PLoS One ; 3(12): e4001, 2008.
Artículo en Inglés | MEDLINE | ID: mdl-19104654

RESUMEN

BACKGROUND: As public microarray repositories are constantly growing, we are facing the challenge of designing strategies to provide productive access to the available data. METHODOLOGY: We used a modified version of the Markov clustering algorithm to systematically extract clusters of co-regulated genes from hundreds of microarray datasets stored in the Gene Expression Omnibus database (n = 1,484). This approach led to the definition of 18,250 transcriptional signatures (TS) that were tested for functional enrichment using the DAVID knowledgebase. Over-representation of functional terms was found in a large proportion of these TS (84%). We developed a JAVA application, TBrowser that comes with an open plug-in architecture and whose interface implements a highly sophisticated search engine supporting several Boolean operators (http://tagc.univ-mrs.fr/tbrowser/). User can search and analyze TS containing a list of identifiers (gene symbols or AffyIDs) or associated with a set of functional terms. CONCLUSIONS/SIGNIFICANCE: As proof of principle, TBrowser was used to define breast cancer cell specific genes and to detect chromosomal abnormalities in tumors. Finally, taking advantage of our large collection of transcriptional signatures, we constructed a comprehensive map that summarizes gene-gene co-regulations observed through all the experiments performed on HGU133A Affymetrix platform. We provide evidences that this map can extend our knowledge of cellular signaling pathways.


Asunto(s)
Bases de Datos Genéticas , Perfilación de la Expresión Génica , Expresión Génica/fisiología , Programas Informáticos , Interfaz Usuario-Computador , Algoritmos , Animales , Análisis por Conglomerados , Eficiencia , Redes Reguladoras de Genes/fisiología , Humanos , Metaanálisis como Asunto , Ratones , Análisis de Secuencia por Matrices de Oligonucleótidos , Ratas , Transducción de Señal/genética
19.
RNA ; 12(10): 1794-801, 2006 Oct.
Artículo en Inglés | MEDLINE | ID: mdl-16931874

RESUMEN

The termination of mature eukaryotic mRNAs occurs at specific polyadenylation sites located downstream from stop codons in the 3'-untranslated region (UTR). An accurate delineation of these sites is essential for the study of 3'-UTR-based gene regulation and for the design of pertinent probes for transcriptome analysis. Although typical poly(A) sites are located between 0 and 2 kb from the stop codon, EST sequence analyses have identified sites located at unexpectedly long ranges (5-10 kb) in a number of genes. Here we perform a complete mapping of EST and full-length cDNA sequences on the mouse and human genome to observe putative poly(A) sites extending beyond annotated 3'-ends and into the intergenic regions. We introduce several quality parameters for poly(A) site prediction and train a classification tree to associate P-values to predicted sites. We observe a higher than background level of high-scoring sites up to 12-15 kb past the stop codon, both in human and mouse. This leads to an estimate of about 5000 human genes having unreported 3'-end extensions and about 3500 novel polyadenylated transcripts lying in present "intergenic" regions. These high-scoring, long-range poly(A) sites corresponding to novel transcripts and gene extensions should be incorporated into current human and mouse gene repositories.


Asunto(s)
ARN Mensajero/química , ARN Mensajero/genética , Regiones no Traducidas 3'/química , Regiones no Traducidas 3'/genética , Regiones no Traducidas 3'/metabolismo , Animales , Secuencia de Bases , Sitios de Unión/genética , Codón de Terminación/genética , Biología Computacional , ADN Intergénico/genética , Etiquetas de Secuencia Expresada , Humanos , Ratones , ARN Mensajero/metabolismo , Transcripción Genética
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA