Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 46
Filtrar
Mais filtros

Base de dados
País/Região como assunto
Tipo de documento
Intervalo de ano de publicação
1.
Hum Genet ; 142(12): 1721-1735, 2023 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-37889307

RESUMO

Episignatures are popular tools for the diagnosis of rare neurodevelopmental disorders. They are commonly based on a set of differentially methylated CpGs used in combination with a support vector machine model. DNA methylation (DNAm) data often include missing values due to changes in data generation technology and batch effects. While many normalization methods exist for DNAm data, their impact on episignature performance have never been assessed. In addition, technologies to quantify DNAm evolve quickly and this may lead to poor transposition of existing episignatures generated on deprecated array versions to new ones. Indeed, probe removal between array versions, technologies or during preprocessing leads to missing values. Thus, the effect of missing data on episignature performance must also be carefully evaluated and addressed through imputation or an innovative approach to episignatures design. In this paper, we used data from patients suffering from Kabuki and Sotos syndrome to evaluate the influence of normalization methods, classification models and missing data on the prediction performances of two existing episignatures. We compare how six popular normalization methods for methylarray data affect episignature classification performances in Kabuki and Sotos syndromes and provide best practice suggestions when building new episignatures. In this setting, we show that Illumina, Noob or Funnorm normalization methods achieved higher classification performances on the testing sets compared to Quantile, Raw and Swan normalization methods. We further show that penalized logistic regression and support vector machines perform best in the classification of Kabuki and Sotos syndrome patients. Then, we describe a new paradigm to build episignatures based on the detection of differentially methylated regions (DMRs) and evaluate their performance compared to classical differentially methylated cytosines (DMCs)-based episignatures in the presence of missing data. We show that the performance of classical DMC-based episignatures suffers from the presence of missing data more than the DMR-based approach. We present a comprehensive evaluation of how the normalization of DNA methylation data affects episignature performance, using three popular classification models. We further evaluate how missing data affect those models' predictions. Finally, we propose a novel methodology to develop episignatures based on differentially methylated regions identification and show how this method slightly outperforms classical episignatures in the presence of missing data.


Assuntos
Transtornos do Neurodesenvolvimento , Síndrome de Sotos , Humanos , Síndrome de Sotos/genética , Transtornos do Neurodesenvolvimento/diagnóstico , Transtornos do Neurodesenvolvimento/genética , Metilação de DNA
2.
Bioinformatics ; 38(4): 1037-1044, 2022 01 27.
Artigo em Inglês | MEDLINE | ID: mdl-34850828

RESUMO

MOTIVATION: Single-cell RNA sequencing (scRNA-seq) provides transcriptomic profiling for individual cells, allowing researchers to study the heterogeneity of tissues, recognize rare cell identities and discover new cellular subtypes. Clustering analysis is usually used to predict cell class assignments and infer cell identities. However, the high sparsity of scRNA-seq data, accentuated by dropout events generates challenges that have motivated the development of numerous dedicated clustering methods. Nevertheless, there is still no consensus on the best performing method. RESULTS: graph-sc is a new method leveraging a graph autoencoder network to create embeddings for scRNA-seq cell data. While this work analyzes the performance of clustering the embeddings with various clustering algorithms, other downstream tasks can also be performed. A broad experimental study has been performed on both simulated and scRNA-seq datasets. The results indicate that although there is no consistently best method across all the analyzed datasets, graph-sc compares favorably to competing techniques across all types of datasets. Furthermore, the proposed method is stable across consecutive runs, robust to input down-sampling, generally insensitive to changes in the network architecture or training parameters and more computationally efficient than other competing methods based on neural networks. Modeling the data as a graph provides increased flexibility to define custom features characterizing the genes, the cells and their interactions. Moreover, external data (e.g. gene network) can easily be integrated into the graph and used seamlessly under the same optimization task. AVAILABILITY AND IMPLEMENTATION: https://github.com/ciortanmadalina/graph-sc. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Análise de Célula Única , Análise da Expressão Gênica de Célula Única , Análise de Sequência de RNA/métodos , Análise de Célula Única/métodos , Perfilação da Expressão Gênica/métodos , Análise por Conglomerados
3.
Plant Cell Physiol ; 63(10): 1457-1473, 2022 Oct 31.
Artigo em Inglês | MEDLINE | ID: mdl-35799371

RESUMO

The identification of transcription factor (TF) target genes is central in biology. A popular approach is based on the location by pattern matching of potential cis-regulatory elements (CREs). During the last few years, tools integrating next-generation sequencing data have been developed to improve the performance of pattern matching. However, such tools have not yet been comprehensively evaluated in plants. Hence, we developed a new streamlined method aiming at predicting CREs and target genes of plant TFs in specific organs or conditions. Our approach implements a supervised machine learning strategy, which allows decision rule models to be learnt using TF ChIP-chip/seq experimental data. Different layers of genomic features were integrated in predictive models: the position on the gene, the DNA sequence conservation, the chromatin state and various CRE footprints. Among the tested features, the chromatin features were crucial for improving the accuracy of the method. Furthermore, we evaluated the transferability of predictive models across TFs, organs and species. Finally, we validated our method by correctly inferring the target genes of key TFs controlling metabolite biosynthesis at the organ level in Arabidopsis. We developed a tool-Wimtrap-to reproduce our approach in plant species and conditions/organs for which ChIP-chip/seq data are available. Wimtrap is a user-friendly R package that supports an R Shiny web interface and is provided with pre-built models that can be used to quickly get predictions of CREs and TF gene targets in different organs or conditions in Arabidopsis thaliana, Solanum lycopersicum, Oryza sativa and Zea mays.


Assuntos
Arabidopsis , Fatores de Transcrição , Fatores de Transcrição/genética , Fatores de Transcrição/metabolismo , Ligação Proteica/genética , Genômica , Plantas/metabolismo , Cromatina/genética , Arabidopsis/genética , Arabidopsis/metabolismo , Sítios de Ligação/genética
4.
Bioinformatics ; 37(17): 2738-2740, 2021 Sep 09.
Artigo em Inglês | MEDLINE | ID: mdl-33471071

RESUMO

MOTIVATION: Long-read sequencing technologies can be employed to detect and map DNA modifications at the nucleotide resolution on a genome-wide scale. However, published software packages neglect the integration of genomic annotation and comprehensive filtering when analyzing patterns of modified bases detected using Pacific Biosciences (PacBio) or Oxford Nanopore Technologies (ONT) data. Here, we present DNA Modification Annotation (DNAModAnnot), a R package designed for the global analysis of DNA modification patterns using adapted filtering and visualization tools. RESULTS: We tested our package using PacBio sequencing data to analyze patterns of the 6-methyladenine (6mA) in the ciliate Paramecium tetraurelia, in which high 6mA amounts were previously reported. We found P. tetraurelia 6mA genome-wide distribution to be similar to other ciliates. We also performed 5-methylcytosine (5mC) analysis in human lymphoblastoid cells using ONT data and confirmed previously known patterns of 5mC. DNAModAnnot provides a toolbox for the genome-wide analysis of different DNA modifications using PacBio and ONT long-read sequencing data. AVAILABILITY AND IMPLEMENTATION: DNAModAnnot is distributed as a R package available via GitHub (https://github.com/AlexisHardy/DNAModAnnot). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

5.
Nucleic Acids Res ; 48(13): 7119-7134, 2020 07 27.
Artigo em Inglês | MEDLINE | ID: mdl-32542321

RESUMO

Single-cell RNA-sequencing (scRNA-seq) of the Caenorhabditis elegans nervous system offers the unique opportunity to obtain a partial expression profile for each neuron within a known connectome. Building on recent scRNA-seq data and on a molecular atlas describing the expression pattern of ∼800 genes at the single cell resolution, we designed an iterative clustering analysis aiming to match each cell-cluster to the ∼100 anatomically defined neuron classes of C. elegans. This heuristic approach successfully assigned 97 of the 118 neuron classes to a cluster. Sixty two clusters were assigned to a single neuron class and 15 clusters grouped neuron classes sharing close molecular signatures. Pseudotime analysis revealed a maturation process occurring in some neurons (e.g. PDA) during the L2 stage. Based on the molecular profiles of all identified neurons, we predicted cell fate regulators and experimentally validated unc-86 for the normal differentiation of RMG neurons. Furthermore, we observed that different classes of genes functionally diversify sensory neurons, interneurons and motorneurons. Finally, we designed 15 new neuron class-specific promoters validated in vivo. Amongst them, 10 represent the only specific promoter reported to this day, expanding the list of neurons amenable to genetic manipulations.


Assuntos
Caenorhabditis elegans/genética , Neurônios/classificação , Neurônios/metabolismo , RNA/metabolismo , Animais , Sequência de Bases , Biomarcadores/metabolismo , Perfilação da Expressão Gênica , Neurônios/citologia , Análise de Célula Única/métodos
6.
BMC Bioinformatics ; 22(1): 280, 2021 May 27.
Artigo em Inglês | MEDLINE | ID: mdl-34044773

RESUMO

BACKGROUND: Single-cell RNA sequencing (scRNA-seq) has emerged has a main strategy to study transcriptional activity at the cellular level. Clustering analysis is routinely performed on scRNA-seq data to explore, recognize or discover underlying cell identities. The high dimensionality of scRNA-seq data and its significant sparsity accentuated by frequent dropout events, introducing false zero count observations, make the clustering analysis computationally challenging. Even though multiple scRNA-seq clustering techniques have been proposed, there is no consensus on the best performing approach. On a parallel research track, self-supervised contrastive learning recently achieved state-of-the-art results on images clustering and, subsequently, image classification. RESULTS: We propose contrastive-sc, a new unsupervised learning method for scRNA-seq data that perform cell clustering. The method consists of two consecutive phases: first, an artificial neural network learns an embedding for each cell through a representation training phase. The embedding is then clustered in the second phase with a general clustering algorithm (i.e. KMeans or Leiden community detection). The proposed representation training phase is a new adaptation of the self-supervised contrastive learning framework, initially proposed for image processing, to scRNA-seq data. contrastive-sc has been compared with ten state-of-the-art techniques. A broad experimental study has been conducted on both simulated and real-world datasets, assessing multiple external and internal clustering performance metrics (i.e. ARI, NMI, Silhouette, Calinski scores). Our experimental analysis shows that constastive-sc compares favorably with state-of-the-art methods on both simulated and real-world datasets. CONCLUSION: On average, our method identifies well-defined clusters in close agreement with ground truth annotations. Our method is computationally efficient, being fast to train and having a limited memory footprint. contrastive-sc maintains good performance when only a fraction of input cells is provided and is robust to changes in hyperparameters or network architecture. The decoupling between the creation of the embedding and the clustering phase allows the flexibility to choose a suitable clustering algorithm (i.e. KMeans when the number of expected clusters is known, Leiden otherwise) or to integrate the embedding with other existing techniques.


Assuntos
RNA Citoplasmático Pequeno , Análise de Célula Única , Algoritmos , Análise por Conglomerados , Análise de Sequência de RNA
7.
Mol Ecol ; 30(21): 5503-5516, 2021 11.
Artigo em Inglês | MEDLINE | ID: mdl-34415643

RESUMO

Over the last decade, increasing attention has been paid to the molecular adaptations used by organisms to cope with thermal stress. However, to date, few studies have focused on thermophilic species living in hot, arid climates. In this study, we explored molecular adaptations to heat stress in the thermophilic ant genus Cataglyphis, one of the world's most thermotolerant animal taxa. We compared heat tolerance and gene expression patterns across six Cataglyphis species from distinct phylogenetic groups that live in different habitats and experience different thermal regimes. We found that all six species had high heat tolerance levels with critical thermal maxima (CTmax ) ranging from 43℃ to 45℃ and a median lethal temperature (LT50) ranging from 44.5℃ to 46.8℃. Transcriptome analyses revealed that, although the number of differentially expressed genes varied widely for the six species (from 54 to 1118), many were also shared. Functional annotation of the differentially expressed and co-expressed genes showed that the biological pathways involved in heat-shock responses were similar among species and were associated with four major processes: the regulation of transcriptional machinery and DNA metabolism; the preservation of proteome stability; the elimination of toxic residues; and the maintenance of cellular integrity. Overall, our results suggest that molecular responses to heat stress have been evolutionarily conserved in the ant genus Cataglyphis and that their diversity may help workers withstand temperatures close to their physiological limits.


Assuntos
Formigas , Aclimatação , Adaptação Fisiológica/genética , Animais , Formigas/genética , Resposta ao Choque Térmico/genética , Temperatura Alta , Humanos , Filogenia
8.
BMC Ophthalmol ; 20(1): 106, 2020 Mar 17.
Artigo em Inglês | MEDLINE | ID: mdl-32183784

RESUMO

BACKGROUND: Blood-retinal barrier cells are known to exhibit a massive phenotypic change during experimental autoimmune uveitis (EAU) development. In an attempt to investigate the mechanisms of blood-retinal barrier (BRB) breakdown at a global level, we studied the gene regulation of total retinal cells and retinal endothelial cells during non-infectious uveitis. METHODS: Retinal endothelial cells were isolated by flow cytometry either in Tie2-GFP mice (CD31+ CD45- GFP+ cells), or in wild type C57BL/6 mice (CD31+ CD45- endoglin+ cells). EAU was induced in C57BL/6 mice by adoptive transfer of IRBP1-20-specific T cells. Total retinal cells and retinal endothelial cells from naïve and EAU mice were sorted and their gene expression compared by RNA-Seq. Protein expression of selected genes was validated by immunofluorescence on retinal wholemounts and cryosections and by flow cytometry. RESULTS: Retinal endothelial cell sorting in wild type C57BL/6 mice was validated by comparative transcriptome analysis with retinal endothelial cells sorted from Tie2-GFP mice, which express GFP under the control of the endothelial-specific receptor tyrosine kinase promoter Tie2. RNA-Seq analysis of total retinal cells mainly brought to light upregulation of genes involved in antigen presentation and T cell activation during EAU. Specific transcriptome analysis of retinal endothelial cells allowed us to identify 82 genes modulated in retinal endothelial cells during EAU development. Protein expression of 5 of those genes (serpina3n, lcn2, ackr1, lrg1 and lamc3) was validated at the level of inner BRB cells. CONCLUSION: Those data not only confirm the involvement of known pathogenic molecules but further provide a list of new candidate genes and pathways possibly implicated in inner BRB breakdown during non-infectious posterior uveitis.


Assuntos
Doenças Autoimunes/diagnóstico , Células Endoteliais/patologia , Imunidade Celular , Retina/patologia , Linfócitos T/imunologia , Uveíte/diagnóstico , Animais , Doenças Autoimunes/imunologia , Doenças Autoimunes/metabolismo , Barreira Hematorretiniana , Contagem de Células , Modelos Animais de Doenças , Feminino , Citometria de Fluxo , Masculino , Camundongos , Camundongos Endogâmicos C57BL , Uveíte/imunologia , Uveíte/metabolismo
9.
Eur J Immunol ; 47(1): 168-179, 2017 01.
Artigo em Inglês | MEDLINE | ID: mdl-27861791

RESUMO

The forkhead box P1 (FOXP1) transcription factor has been shown to regulate the generation and maintenance of quiescent naïve murine T cells. In humans, FOXP1 expression has been correlated with overall survival in patients with peripheral T-cell lymphoma (PTCL), although its regulatory role in T-cell function is currently unknown. We found that FOXP1 is normally expressed in all human leukocyte subpopulations. Focusing on primary human CD4+ T cells, we show that nuclear expression of FOXP1 predominates in naïve cells with significant downregulation detected in memory cells from blood and tonsils. FOXP1 is repressed following in vitro T-cell activation of naïve T cells, and later re-established in memory CD4+ T cells, albeit at lower levels. DNA methylation analysis revealed that epigenetic mechanisms participate in regulating the human FOXP1 gene. ShRNA-mediated FOXP1 repression induces CD4+ T cells to enter the cell cycle, acquire memory-like markers and upregulate helper T-cell differentiation genes. In patients with lymphoproliferative disorders, FOXP1 expression is constitutionally repressed in the clonal T cells in parallel with overexpression of helper T-cell differentiation genes. Collectively, these data identify FOXP1 as an essential transcriptional regulator for primary human CD4+ T cells and suggest its potential important role in the development of PTCL.


Assuntos
Linfócitos T CD4-Positivos/imunologia , Linfócitos T CD4-Positivos/metabolismo , Fatores de Transcrição Forkhead/metabolismo , Transtornos Linfoproliferativos/imunologia , Transtornos Linfoproliferativos/metabolismo , Proteínas Repressoras/metabolismo , Subpopulações de Linfócitos T/imunologia , Subpopulações de Linfócitos T/metabolismo , Biomarcadores , Ciclo Celular/genética , Linhagem Celular , Metilação de DNA , Epigênese Genética , Fatores de Transcrição Forkhead/genética , Expressão Gênica , Regulação da Expressão Gênica , Humanos , Imunofenotipagem , Leucócitos/imunologia , Leucócitos/metabolismo , Ativação Linfocitária/imunologia , Transtornos Linfoproliferativos/genética , Fenótipo , Regiões Promotoras Genéticas , Receptores de Antígenos de Linfócitos T/metabolismo , Proteínas Repressoras/genética
10.
Acta Neuropathol ; 135(2): 267-283, 2018 02.
Artigo em Inglês | MEDLINE | ID: mdl-29149419

RESUMO

Although a growing body of evidence indicates that phenotypic plasticity exhibited by glioblastoma cells plays a central role in tumor development and post-therapy recurrence, the master drivers of their aggressiveness remain elusive. Here we mapped the changes in active (H3K4me3) and repressive (H3K27me3) histone modifications accompanying the repression of glioblastoma stem-like cells tumorigenicity. Genes with changing histone marks delineated a network of transcription factors related to cancerous behavior, stem state, and neural development, highlighting a previously unsuspected association between repression of ARNT2 and loss of cell tumorigenicity. Immunohistochemistry confirmed ARNT2 expression in cell sub-populations within proliferative zones of patients' glioblastoma. Decreased ARNT2 expression was consistently observed in non-tumorigenic glioblastoma cells, compared to tumorigenic cells. Moreover, ARNT2 expression correlated with a tumorigenic molecular signature at both the tissue level within the tumor core and at the single cell level in the patients' tumors. We found that ARNT2 knockdown decreased the expression of SOX9, POU3F2 and OLIG2, transcription factors implicated in glioblastoma cell tumorigenicity, and repressed glioblastoma stem-like cell tumorigenic properties in vivo. Our results reveal ARNT2 as a pivotal component of the glioblastoma cell tumorigenic signature, located at a node of a transcription factor network controlling glioblastoma cell aggressiveness.


Assuntos
Translocador Nuclear Receptor Aril Hidrocarboneto/metabolismo , Fatores de Transcrição Hélice-Alça-Hélice Básicos/metabolismo , Neoplasias Encefálicas/metabolismo , Cromatina/metabolismo , Glioblastoma/metabolismo , Idoso , Animais , Translocador Nuclear Receptor Aril Hidrocarboneto/genética , Fatores de Transcrição Hélice-Alça-Hélice Básicos/genética , Neoplasias Encefálicas/genética , Neoplasias Encefálicas/patologia , Células Cultivadas , Epigênese Genética , Regulação Neoplásica da Expressão Gênica , Técnicas de Silenciamento de Genes , Glioblastoma/genética , Glioblastoma/patologia , Código das Histonas , Proteínas de Homeodomínio/metabolismo , Humanos , Camundongos Nus , Pessoa de Meia-Idade , Invasividade Neoplásica/genética , Invasividade Neoplásica/patologia , Invasividade Neoplásica/fisiopatologia , Transplante de Neoplasias , Células-Tronco Neoplásicas/metabolismo , Células-Tronco Neoplásicas/patologia , Fator de Transcrição 2 de Oligodendrócitos/metabolismo , Fatores do Domínio POU/metabolismo , Fatores de Transcrição SOX9/metabolismo
12.
EMBO J ; 32(5): 645-55, 2013 Mar 06.
Artigo em Inglês | MEDLINE | ID: mdl-23353889

RESUMO

TET proteins convert 5-methylcytosine to 5-hydroxymethylcytosine, an emerging dynamic epigenetic state of DNA that can influence transcription. Evidence has linked TET1 function to epigenetic repression complexes, yet mechanistic information, especially for the TET2 and TET3 proteins, remains limited. Here, we show a direct interaction of TET2 and TET3 with O-GlcNAc transferase (OGT). OGT does not appear to influence hmC activity, rather TET2 and TET3 promote OGT activity. TET2/3-OGT co-localize on chromatin at active promoters enriched for H3K4me3 and reduction of either TET2/3 or OGT activity results in a direct decrease in H3K4me3 and concomitant decreased transcription. Further, we show that Host Cell Factor 1 (HCF1), a component of the H3K4 methyltransferase SET1/COMPASS complex, is a specific GlcNAcylation target of TET2/3-OGT, and modification of HCF1 is important for the integrity of SET1/COMPASS. Additionally, we find both TET proteins and OGT activity promote binding of the SET1/COMPASS H3K4 methyltransferase, SETD1A, to chromatin. Finally, studies in Tet2 knockout mouse bone marrow tissue extend and support the data as decreases are observed of global GlcNAcylation and also of H3K4me3, notably at several key regulators of haematopoiesis. Together, our results unveil a step-wise model, involving TET-OGT interactions, promotion of GlcNAcylation, and influence on H3K4me3 via SET1/COMPASS, highlighting a novel means by which TETs may induce transcriptional activation.


Assuntos
Metilação de DNA , Proteínas de Ligação a DNA/metabolismo , Dioxigenases/metabolismo , Regulação da Expressão Gênica , Histona-Lisina N-Metiltransferase/metabolismo , N-Acetilglucosaminiltransferases/metabolismo , Proteínas Proto-Oncogênicas/metabolismo , Transcrição Gênica , 5-Metilcitosina/metabolismo , Sequência de Aminoácidos , Animais , Western Blotting , Proliferação de Células , Células Cultivadas , Imunoprecipitação da Cromatina , Ilhas de CpG , Citosina/análogos & derivados , Citosina/metabolismo , Epigênese Genética , Glicosilação , Histonas/metabolismo , Fator C1 de Célula Hospedeira/metabolismo , Humanos , Imunoprecipitação , Camundongos , Camundongos Knockout , Dados de Sequência Molecular , Regiões Promotoras Genéticas/genética
13.
J Neuroinflammation ; 14(1): 136, 2017 07 18.
Artigo em Inglês | MEDLINE | ID: mdl-28720143

RESUMO

BACKGROUND: Controversy exists regarding which cell types are responsible for autoantigen presentation in the retina during experimental autoimmune uveitis (EAU) development. In this study, we aimed to identify and characterize the retinal resident and infiltrating cells susceptible to express major histocompatibility complex (MHC) class II during EAU. METHODS: EAU was induced in C57BL/6 mice by adoptive transfer of autoreactive lymphocytes from IRBP1-20-immunized animals. MHC class II expression was studied by immunostainings on eye cryosections. For flow cytometry (FC) analysis, retinas were dissected and enzymatically digested into single-cell suspensions. Three MHC class II+ retinal cell populations were sorted by FC, and their RNA processed for RNA-Seq. RESULTS: Immunostainings demonstrate strong induction of MHC class II expression in EAU, especially in the inner retina at the level of inflamed vessels, extending to the outer retinal layers and the subretinal space in severely inflamed eyes. Most MHC class II+ cells express the hematopoietic marker IBA1. FC quantitative analyses demonstrate that MHC class II induction significantly correlates with disease severity and is associated with upregulation of co-stimulatory molecule expression. In particular, most MHC class IIhi cells express co-stimulatory molecules during EAU. Further phenotyping identified three MHC class II+ retinal cell populations: CD45-CD11b- non-hematopoietic cells with low MHC class II expression and CD45+CD11b+ hematopoietic cells with higher MHC class II expression, which can be further separated into Ly6C+ and Ly6C- cells, possibly corresponding to infiltrating macrophages and resident microglia. Transcriptome analysis of the three sorted populations leads to a clear sample clustering with some enrichment in macrophage markers and microglial cell markers in Ly6C+ and Ly6C- cells, respectively. Functional annotation analysis reveals that both hematopoietic cell populations are more competent in MHC class II-associated antigen presentation and in T cell activation than non-hematopoietic cells. CONCLUSION: Our results highlight the potential of cells of hematopoietic origin in local antigen presentation, whatever their Ly6C expression. Our work further provides a first transcriptomic study of MHC class II-expressing retinal cells during EAU and delivers a series of new candidate genes possibly implicated in the pathogenesis of retinal autoimmunity.


Assuntos
Células Apresentadoras de Antígenos/metabolismo , Doenças Autoimunes/metabolismo , Genes MHC da Classe II/fisiologia , Retina/metabolismo , Uveíte/metabolismo , Sequência de Aminoácidos , Animais , Células Apresentadoras de Antígenos/imunologia , Doenças Autoimunes/genética , Doenças Autoimunes/imunologia , Expressão Gênica , Humanos , Camundongos , Camundongos Endogâmicos C57BL , Retina/imunologia , Uveíte/genética , Uveíte/imunologia
14.
Nucleic Acids Res ; 43(W1): W50-6, 2015 Jul 01.
Artigo em Inglês | MEDLINE | ID: mdl-25904632

RESUMO

RSAT (Regulatory Sequence Analysis Tools) is a modular software suite for the analysis of cis-regulatory elements in genome sequences. Its main applications are (i) motif discovery, appropriate to genome-wide data sets like ChIP-seq, (ii) transcription factor binding motif analysis (quality assessment, comparisons and clustering), (iii) comparative genomics and (iv) analysis of regulatory variations. Nine new programs have been added to the 43 described in the 2011 NAR Web Software Issue, including a tool to extract sequences from a list of coordinates (fetch-sequences from UCSC), novel programs dedicated to the analysis of regulatory variants from GWAS or population genomics (retrieve-variation-seq and variation-scan), a program to cluster motifs and visualize the similarities as trees (matrix-clustering). To deal with the drastic increase of sequenced genomes, RSAT public sites have been reorganized into taxon-specific servers. The suite is well-documented with tutorials and published protocols. The software suite is available through Web sites, SOAP/WSDL Web services, virtual machines and stand-alone programs at http://www.rsat.eu/.


Assuntos
Elementos Reguladores de Transcrição , Software , Sítios de Ligação , Variação Genética , Genômica , Humanos , Internet , Motivos de Nucleotídeos , Fatores de Transcrição/metabolismo
15.
EMBO J ; 31(6): 1405-26, 2012 Mar 21.
Artigo em Inglês | MEDLINE | ID: mdl-22293752

RESUMO

In addition to genetic predisposition, environmental and lifestyle factors contribute to the pathogenesis of type 2 diabetes (T2D). Epigenetic changes may provide the link for translating environmental exposures into pathological mechanisms. In this study, we performed the first comprehensive DNA methylation profiling in pancreatic islets from T2D and non-diabetic donors. We uncovered 276 CpG loci affiliated to promoters of 254 genes displaying significant differential DNA methylation in diabetic islets. These methylation changes were not present in blood cells from T2D individuals nor were they experimentally induced in non-diabetic islets by exposure to high glucose. For a subgroup of the differentially methylated genes, concordant transcriptional changes were present. Functional annotation of the aberrantly methylated genes and RNAi experiments highlighted pathways implicated in ß-cell survival and function; some are implicated in cellular dysfunction while others facilitate adaptation to stressors. Together, our findings offer new insights into the intricate mechanisms of T2D pathogenesis, underscore the important involvement of epigenetic dysregulation in diabetic islets and may advance our understanding of T2D aetiology.


Assuntos
Metilação de DNA , Diabetes Mellitus Tipo 2/genética , Diabetes Mellitus Tipo 2/metabolismo , Ilhotas Pancreáticas/metabolismo , Idoso , Animais , Linhagem Celular , Ilhas de CpG , Impressões Digitais de DNA/métodos , Epigênese Genética , Loci Gênicos , Glucose/metabolismo , Humanos , Regiões Promotoras Genéticas , Ratos , Transcrição Gênica
16.
Brief Bioinform ; 15(6): 929-41, 2014 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-23990268

RESUMO

Infinium HumanMethylation450 beadarray is a popular technology to explore DNA methylomes in health and disease, and there is a current explosion in the use of this technique. Despite experience acquired from gene expression microarrays, analyzing Infinium Methylation arrays appeared more complex than initially thought and several difficulties have been encountered, as those arrays display specific features that need to be taken into consideration during data processing. Here, we review several issues that have been highlighted by the scientific community, and we present an overview of the general data processing scheme and an evaluation of the different normalization methods available to date to guide the 450K users in their analysis and data interpretation.


Assuntos
Metilação de DNA , Biologia Computacional , Ilhas de CpG , Interpretação Estatística de Dados , Genoma Humano , Humanos , Análise de Sequência com Séries de Oligonucleotídeos/estatística & dados numéricos , Sondas de Oligonucleotídeos , Polimorfismo de Nucleotídeo Único , Software
17.
Nucleic Acids Res ; 40(4): e31, 2012 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-22156162

RESUMO

ChIP-seq is increasingly used to characterize transcription factor binding and chromatin marks at a genomic scale. Various tools are now available to extract binding motifs from peak data sets. However, most approaches are only available as command-line programs, or via a website but with size restrictions. We present peak-motifs, a computational pipeline that discovers motifs in peak sequences, compares them with databases, exports putative binding sites for visualization in the UCSC genome browser and generates an extensive report suited for both naive and expert users. It relies on time- and memory-efficient algorithms enabling the treatment of several thousand peaks within minutes. Regarding time efficiency, peak-motifs outperforms all comparable tools by several orders of magnitude. We demonstrate its accuracy by analyzing data sets ranging from 4000 to 1,28,000 peaks for 12 embryonic stem cell-specific transcription factors. In all cases, the program finds the expected motifs and returns additional motifs potentially bound by cofactors. We further apply peak-motifs to discover tissue-specific motifs in peak collections for the p300 transcriptional co-activator. To our knowledge, peak-motifs is the only tool that performs a complete motif analysis and offers a user-friendly web interface without any restriction on sequence size or number of peaks.


Assuntos
Imunoprecipitação da Cromatina , Elementos Reguladores de Transcrição , Análise de Sequência de DNA , Software , Animais , Células-Tronco Embrionárias/metabolismo , Camundongos , Motivos de Nucleotídeos , Fatores de Transcrição/metabolismo , Interface Usuário-Computador , Fatores de Transcrição de p300-CBP/metabolismo
18.
Life Sci Alliance ; 7(2)2024 02.
Artigo em Inglês | MEDLINE | ID: mdl-38081640

RESUMO

High-throughput omics technologies have generated a wealth of large protein, gene, and transcript datasets that have exacerbated the need for new methods to analyse and compare big datasets. Rank-rank hypergeometric overlap is an important threshold-free method to combine and visualize two ranked lists of P-values or fold-changes, usually from differential gene expression analyses. Here, we introduce a new rank-rank hypergeometric overlap-based method aimed at gene level and alternative splicing analyses at transcript or exon level, hitherto unreachable as transcript numbers are an order of magnitude larger than gene numbers. We tested the tool on synthetic and real datasets at gene and transcript levels to detect correlation and anticorrelation patterns and found it to be fast and accurate, even on very large datasets thanks to an evolutionary algorithm-based minimal P-value search. The tool comes with a ready-to-use permutation scheme allowing the computation of adjusted P-values at low time cost. The package compatibility mode is a drop-in replacement to previous packages. RedRibbon holds the promise to accurately extricate detailed information from large comparative analyses.


Assuntos
Algoritmos , Perfilação da Expressão Gênica , Perfilação da Expressão Gênica/métodos , Éxons/genética , Processamento Alternativo/genética
19.
Nucleic Acids Res ; 39(Web Server issue): W86-91, 2011 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-21715389

RESUMO

RSAT (Regulatory Sequence Analysis Tools) comprises a wide collection of modular tools for the detection of cis-regulatory elements in genome sequences. Thirteen new programs have been added to the 30 described in the 2008 NAR Web Software Issue, including an automated sequence retrieval from EnsEMBL (retrieve-ensembl-seq), two novel motif discovery algorithms (oligo-diff and info-gibbs), a 100-times faster version of matrix-scan enabling the scanning of genome-scale sequence sets, and a series of facilities for random model generation and statistical evaluation (random-genome-fragments, random-motifs, random-sites, implant-sites, sequence-probability, permute-matrix). Our most recent work also focused on motif comparison (compare-matrices) and evaluation of motif quality (matrix-quality) by combining theoretical and empirical measures to assess the predictive capability of position-specific scoring matrices. To process large collections of peak sequences obtained from ChIP-seq or related technologies, RSAT provides a new program (peak-motifs) that combines several efficient motif discovery algorithms to predict transcription factor binding motifs, match them against motif databases and predict their binding sites. Availability (web site, stand-alone programs and SOAP/WSDL (Simple Object Access Protocol/Web Services Description Language) web services): http://rsat.ulb.ac.be/rsat/.


Assuntos
Sequências Reguladoras de Ácido Nucleico , Software , Sítios de Ligação , Genômica , Elementos Reguladores de Transcrição , Fatores de Transcrição/metabolismo
20.
Methods Mol Biol ; 2624: 87-114, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-36723811

RESUMO

Mapping DNA modifications at the base resolution is now possible at the genome level thanks to advances in sequencing technologies. Long-read sequencing data can be used to identify modified base patterns. However, the downstream analysis of Pacific Biosciences (PacBio) or Oxford Nanopore Technologies (ONT) data requires the integration of genomic annotation and comprehensive filtering to prevent the accumulation of artifact signals. We present in this chapter, a linear workflow to fully analyze modified base patterns using the DNA Modification Annotation (DNAModAnnot) package. This workflow includes a thorough filtering based on sequencing quality and false discovery rate estimation and provides tools for a global analysis of DNA modifications. Here, we provide an application example of this workflow with PacBio data and guide the user by explaining expected outputs via a fully integrated Rmarkdown script. This protocol is presented with tips showing how to adapt the provided code for annotating epigenomes of any organism according to the user needs.


Assuntos
DNA , Sequenciamento de Nucleotídeos em Larga Escala , Sequenciamento de Nucleotídeos em Larga Escala/métodos , DNA/genética , Genômica , Análise de Sequência de DNA/métodos , Genoma
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA