Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 48
Filtrar
Más filtros

Bases de datos
Tipo del documento
Intervalo de año de publicación
1.
Brief Bioinform ; 23(2)2022 03 10.
Artículo en Inglés | MEDLINE | ID: mdl-35134107

RESUMEN

Numerous cancer types have shown to present hypermethylation of CpG islands, also known as a CpG island methylator phenotype (CIMP), often associated with survival variation. Despite extensive research on CIMP, the etiology of this variability remains elusive, possibly due to lack of consistency in defining CIMP. In this work, we utilize a pan-cancer approach to further explore CIMP, focusing on 26 cancer types profiled in the Cancer Genome Atlas (TCGA). We defined CIMP systematically and agnostically, discarding any effects associated with age, gender or tumor purity. We then clustered samples based on their most variable DNA methylation values and analyzed resulting patient groups. Our results confirmed the existence of CIMP in 19 cancers, including gliomas and colorectal cancer. We further showed that CIMP was associated with survival differences in eight cancer types and, in five, represented a prognostic biomarker independent of clinical factors. By analyzing genetic and transcriptomic data, we further uncovered potential drivers of CIMP and classified them in four categories: mutations in genes directly involved in DNA demethylation; mutations in histone methyltransferases; mutations in genes not involved in methylation turnover, such as KRAS and BRAF; and microsatellite instability. Among the 19 CIMP-positive cancers, very few shared potential driver events, and those drivers were only IDH1 and SETD2 mutations. Finally, we found that CIMP was strongly correlated with tumor microenvironment characteristics, such as lymphocyte infiltration. Overall, our results indicate that CIMP does not exhibit a pan-cancer manifestation; rather, general dysregulation of CpG DNA methylation is caused by heterogeneous mechanisms.


Asunto(s)
Neoplasias Colorrectales , Neoplasias Colorrectales/genética , Islas de CpG , Metilación de ADN , Humanos , Inestabilidad de Microsatélites , Mutación , Fenotipo , Microambiente Tumoral
2.
Nucleic Acids Res ; 50(14): 7938-7958, 2022 08 12.
Artículo en Inglés | MEDLINE | ID: mdl-35871293

RESUMEN

Although originally described as transcriptional activator, SPI1/PU.1, a major player in haematopoiesis whose alterations are associated with haematological malignancies, has the ability to repress transcription. Here, we investigated the mechanisms underlying gene repression in the erythroid lineage, in which SPI1 exerts an oncogenic function by blocking differentiation. We show that SPI1 represses genes by binding active enhancers that are located in intergenic or gene body regions. HDAC1 acts as a cooperative mediator of SPI1-induced transcriptional repression by deacetylating SPI1-bound enhancers in a subset of genes, including those involved in erythroid differentiation. Enhancer deacetylation impacts on promoter acetylation, chromatin accessibility and RNA pol II occupancy. In addition to the activities of HDAC1, polycomb repressive complex 2 (PRC2) reinforces gene repression by depositing H3K27me3 at promoter sequences when SPI1 is located at enhancer sequences. Moreover, our study identified a synergistic relationship between PRC2 and HDAC1 complexes in mediating the transcriptional repression activity of SPI1, ultimately inducing synergistic adverse effects on leukaemic cell survival. Our results highlight the importance of the mechanism underlying transcriptional repression in leukemic cells, involving complex functional connections between SPI1 and the epigenetic regulators PRC2 and HDAC1.


Asunto(s)
Histona Desacetilasa 1 , Leucemia Eritroblástica Aguda , Complejo Represivo Polycomb 2 , Proteínas Proto-Oncogénicas , Transactivadores , Acetilación , Animales , Cromatina/genética , Histona Desacetilasa 1/genética , Leucemia Eritroblástica Aguda/genética , Ratones , Complejo Represivo Polycomb 2/genética , Complejo Represivo Polycomb 2/metabolismo , Regiones Promotoras Genéticas , Proteínas Proto-Oncogénicas/genética , Transactivadores/genética
3.
Mol Cell ; 53(2): 301-16, 2014 Jan 23.
Artículo en Inglés | MEDLINE | ID: mdl-24462204

RESUMEN

During X chromosome inactivation (XCI), the Polycomb Repressive Complex 2 (PRC2) is thought to participate in the early maintenance of the inactive state. Although Xist RNA is essential for the recruitment of PRC2 to the X chromosome, the precise mechanism remains unclear. Here, we demonstrate that the PRC2 cofactor Jarid2 is an important mediator of Xist-induced PRC2 targeting. The region containing the conserved B and F repeats of Xist is critical for Jarid2 recruitment via its unique N-terminal domain. Xist-induced Jarid2 recruitment occurs chromosome-wide independently of a functional PRC2 complex, unlike at other parts of the genome, such as CG-rich regions, where Jarid2 and PRC2 binding are interdependent. Conversely, we show that Jarid2 loss prevents efficient PRC2 and H3K27me3 enrichment to Xist-coated chromatin. Jarid2 thus represents an important intermediate between PRC2 and Xist RNA for the initial targeting of the PRC2 complex to the X chromosome during onset of XCI.


Asunto(s)
Complejo Represivo Polycomb 2/metabolismo , ARN Largo no Codificante/fisiología , Inactivación del Cromosoma X , Cromosoma X/metabolismo , Animales , Compensación de Dosificación (Genética) , Humanos , Ratones , Complejo Represivo Polycomb 2/genética , Complejo Represivo Polycomb 2/fisiología , ARN Largo no Codificante/metabolismo
4.
BMC Bioinformatics ; 22(1): 407, 2021 Aug 17.
Artículo en Inglés | MEDLINE | ID: mdl-34404353

RESUMEN

BACKGROUND: Multiple studies rely on ChIP-seq experiments to assess the effect of gene modulation and drug treatments on protein binding and chromatin structure. However, most methods commonly used for the normalization of ChIP-seq binding intensity signals across conditions, e.g., the normalization to the same number of reads, either assume a constant signal-to-noise ratio across conditions or base the estimates of correction factors on genomic regions with intrinsically different signals between conditions. Inaccurate normalization of ChIP-seq signal may, in turn, lead to erroneous biological conclusions. RESULTS: We developed a new R package, CHIPIN, that allows normalizing ChIP-seq signals across different conditions/samples when spike-in information is not available, but gene expression data are at hand. Our normalization technique is based on the assumption that, on average, no differences in ChIP-seq signals should be observed in the regulatory regions of genes whose expression levels are constant across samples/conditions. In addition to normalizing ChIP-seq signals, CHIPIN provides as output a number of graphs and calculates statistics allowing the user to assess the efficiency of the normalization and qualify the specificity of the antibody used. In addition to ChIP-seq, CHIPIN can be used without restriction on open chromatin ATAC-seq or DNase hypersensitivity data. We validated the CHIPIN method on several ChIP-seq data sets and documented its superior performance in comparison to several commonly used normalization techniques. CONCLUSIONS: The CHIPIN method provides a new way for ChIP-seq signal normalization across conditions when spike-in experiments are not available. The method is implemented in a user-friendly R package available on GitHub: https://github.com/BoevaLab/CHIPIN.


Asunto(s)
Secuenciación de Inmunoprecipitación de Cromatina , Cromatina , Inmunoprecipitación de Cromatina , Unión Proteica , Análisis de Secuencia de ADN
5.
Am J Hum Genet ; 102(5): 920-942, 2018 05 03.
Artículo en Inglés | MEDLINE | ID: mdl-29727691

RESUMEN

We describe a method based on a latent Dirichlet allocation model for predicting functional effects of noncoding genetic variants in a cell-type- and/or tissue-specific way (FUN-LDA). Using this unsupervised approach, we predict tissue-specific functional effects for every position in the human genome in 127 different tissues and cell types. We demonstrate the usefulness of our predictions by using several validation experiments. Using eQTL data from several sources, including the GTEx project, Geuvadis project, and TwinsUK cohort, we show that eQTLs in specific tissues tend to be most enriched among the predicted functional variants in relevant tissues in Roadmap. We further show how these integrated functional scores can be used for (1) deriving the most likely cell or tissue type causally implicated for a complex trait by using summary statistics from genome-wide association studies and (2) estimating a tissue-based correlation matrix of various complex traits. We found large enrichment of heritability in functional components of relevant tissues for various complex traits, and FUN-LDA yielded higher enrichment estimates than existing methods. Finally, using experimentally validated functional variants from the literature and variants possibly implicated in disease by previous studies, we rigorously compare FUN-LDA with state-of-the-art functional annotation methods and show that FUN-LDA has better prediction accuracy and higher resolution than these methods. In particular, our results suggest that tissue- and cell-type-specific functional prediction methods tend to have substantially better prediction accuracy than organism-level prediction methods. Scores for each position in the human genome and for each ENCODE and Roadmap tissue are available online (see Web Resources).


Asunto(s)
Algoritmos , ADN Intergénico/genética , Variación Genética , Modelos Genéticos , Especificidad de Órganos/genética , Estudio de Asociación del Genoma Completo , Humanos , Desequilibrio de Ligamiento/genética , Anotación de Secuencia Molecular , Polimorfismo de Nucleótido Simple/genética , Probabilidad , Sitios de Carácter Cuantitativo/genética , Reproducibilidad de los Resultados , Gemelos/genética
6.
Proc Natl Acad Sci U S A ; 115(52): E12265-E12274, 2018 12 26.
Artículo en Inglés | MEDLINE | ID: mdl-30541888

RESUMEN

Adrenal cortex steroids are essential for body homeostasis, and adrenal insufficiency is a life-threatening condition. Adrenal endocrine activity is maintained through recruitment of subcapsular progenitor cells that follow a unidirectional differentiation path from zona glomerulosa to zona fasciculata (zF). Here, we show that this unidirectionality is ensured by the histone methyltransferase EZH2. Indeed, we demonstrate that EZH2 maintains adrenal steroidogenic cell differentiation by preventing expression of GATA4 and WT1 that cause abnormal dedifferentiation to a progenitor-like state in Ezh2 KO adrenals. EZH2 further ensures normal cortical differentiation by programming cells for optimal response to adrenocorticotrophic hormone (ACTH)/PKA signaling. This is achieved by repression of phosphodiesterases PDE1B, 3A, and 7A and of PRKAR1B. Consequently, EZH2 ablation results in blunted zF differentiation and primary glucocorticoid insufficiency. These data demonstrate an all-encompassing role for EZH2 in programming steroidogenic cells for optimal response to differentiation signals and in maintaining their differentiated state.


Asunto(s)
Corteza Suprarrenal/enzimología , Subunidad RIbeta de la Proteína Quinasa Dependiente de AMP Cíclico/metabolismo , Proteína Potenciadora del Homólogo Zeste 2/metabolismo , Transducción de Señal , Corteza Suprarrenal/metabolismo , Animales , Diferenciación Celular , Subunidad RIbeta de la Proteína Quinasa Dependiente de AMP Cíclico/genética , Fosfodiesterasas de Nucleótidos Cíclicos Tipo 1/genética , Fosfodiesterasas de Nucleótidos Cíclicos Tipo 1/metabolismo , Fosfodiesterasas de Nucleótidos Cíclicos Tipo 3/genética , Fosfodiesterasas de Nucleótidos Cíclicos Tipo 3/metabolismo , Fosfodiesterasas de Nucleótidos Cíclicos Tipo 7/genética , Fosfodiesterasas de Nucleótidos Cíclicos Tipo 7/metabolismo , Proteína Potenciadora del Homólogo Zeste 2/genética , Femenino , Masculino , Ratones Endogámicos C57BL , Ratones Noqueados , Esteroides/metabolismo , Zona Fascicular/citología , Zona Fascicular/enzimología , Zona Fascicular/metabolismo , Zona Glomerular/citología , Zona Glomerular/enzimología , Zona Glomerular/metabolismo
7.
Genome Res ; 27(2): 259-268, 2017 02.
Artículo en Inglés | MEDLINE | ID: mdl-27965291

RESUMEN

Super-enhancers (SEs) are key transcriptional drivers of cellular, developmental, and disease states in mammals, yet the conservational and regulatory features of these enhancer elements in nonmammalian vertebrates are unknown. To define SEs in zebrafish and enable sequence and functional comparisons to mouse and human SEs, we used genome-wide histone H3 lysine 27 acetylation (H3K27ac) occupancy as a primary SE delineator. Our study determined the set of SEs in pluripotent state cells and adult zebrafish tissues and revealed both similarities and differences between zebrafish and mammalian SEs. Although the total number of SEs was proportional to the genome size, the genomic distribution of zebrafish SEs differed from that of the mammalian SEs. Despite the evolutionary distance separating zebrafish and mammals and the low overall SE sequence conservation, ∼42% of zebrafish SEs were located in close proximity to orthologs that also were associated with SEs in mouse and human. Compared to their nonassociated counterparts, higher sequence conservation was revealed for those SEs that have maintained orthologous gene associations. Functional dissection of two of these SEs identified conserved sequence elements and tissue-specific expression patterns, while chromatin accessibility analyses predicted transcription factors governing the function of pluripotent state zebrafish SEs. Our zebrafish annotations and comparative studies show the extent of SE usage and their conservation across vertebrates, permitting future gene regulatory studies in several tissues.


Asunto(s)
Cromatina/genética , Secuencia Conservada/genética , Elementos de Facilitación Genéticos , Pez Cebra/genética , Acetilación , Animales , Desarrollo Embrionario/genética , Regulación del Desarrollo de la Expresión Génica , Genómica , Histonas/genética , Humanos , Ratones , Factores de Transcripción/genética
8.
Bioinformatics ; 34(11): 1808-1816, 2018 06 01.
Artículo en Inglés | MEDLINE | ID: mdl-29342233

RESUMEN

Motivation: In cancer, clonal evolution is assessed based on information coming from single nucleotide variants and copy number alterations. Nonetheless, existing methods often fail to accurately combine information from both sources to truthfully reconstruct clonal populations in a given tumor sample or in a set of tumor samples coming from the same patient. Moreover, previously published methods detect clones from a single set of variants. As a result, compromises have to be done between stringent variant filtering [reducing dispersion in variant allele frequency estimates (VAFs)] and using all biologically relevant variants. Results: We present a framework for defining cancer clones using most reliable variants of high depth of coverage and assigning functional mutations to the detected clones. The key element of our framework is QuantumClone, a method for variant clustering into clones based on VAFs, genotypes of corresponding regions and information about tumor purity. We validated QuantumClone and our framework on simulated data. We then applied our framework to whole genome sequencing data for 19 neuroblastoma trios each including constitutional, diagnosis and relapse samples. We confirmed an enrichment of damaging variants within such pathways as MAPK (mitogen-activated protein kinases), neuritogenesis, epithelial-mesenchymal transition, cell survival and DNA repair. Most pathways had more damaging variants in the expanding clones compared to shrinking ones, which can be explained by the increased total number of variants between these two populations. Functional mutational rate varied for ancestral clones and clones shrinking or expanding upon treatment, suggesting changes in clone selection mechanisms at different time points of tumor evolution. Availability and implementation: Source code and binaries of the QuantumClone R package are freely available for download at https://CRAN.R-project.org/package=QuantumClone. Contact: gudrun.schleiermacher@curie.fr or valentina.boeva@inserm.fr. Supplementary information: Supplementary data are available at Bioinformatics online.


Asunto(s)
Evolución Clonal , Variaciones en el Número de Copia de ADN , Tipificación Molecular/métodos , Neoplasias/genética , Programas Informáticos , Secuenciación Completa del Genoma/métodos , Análisis por Conglomerados , Análisis Mutacional de ADN/métodos , Frecuencia de los Genes , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Humanos , Mutación , Neoplasias/diagnóstico
9.
Nucleic Acids Res ; 45(8): e58, 2017 05 05.
Artículo en Inglés | MEDLINE | ID: mdl-28053124

RESUMEN

Comparing histone modification profiles between cancer and normal states, or across different tumor samples, can provide insights into understanding cancer initiation, progression and response to therapy. ChIP-seq histone modification data of cancer samples are distorted by copy number variation innate to any cancer cell. We present HMCan-diff, the first method designed to analyze ChIP-seq data to detect changes in histone modifications between two cancer samples of different genetic backgrounds, or between a cancer sample and a normal control. HMCan-diff explicitly corrects for copy number bias, and for other biases in the ChIP-seq data, which significantly improves prediction accuracy compared to methods that do not consider such corrections. On in silico simulated ChIP-seq data generated using genomes with differences in copy number profiles, HMCan-diff shows a much better performance compared to other methods that have no correction for copy number bias. Additionally, we benchmarked HMCan-diff on four experimental datasets, characterizing two histone marks in two different scenarios. We correlated changes in histone modifications between a cancer and a normal control sample with changes in gene expression. On all experimental datasets, HMCan-diff demonstrated better performance compared to the other methods.


Asunto(s)
Regulación Neoplásica de la Expresión Génica , Código de Histonas , Histonas/genética , Neoplasias/genética , Programas Informáticos , Algoritmos , Inmunoprecipitación de Cromatina , Conjuntos de Datos como Asunto , Progresión de la Enfermedad , Dosificación de Gen , Histonas/metabolismo , Humanos , Cadenas de Markov , Neoplasias/metabolismo , Neoplasias/patología
10.
Bioinformatics ; 32(1): 136-9, 2016 Jan 01.
Artículo en Inglés | MEDLINE | ID: mdl-26353839

RESUMEN

MOTIVATION: Read simulators combined with alignment evaluation tools provide the most straightforward way to evaluate and compare mappers. Simulation of reads is accompanied by information about their positions in the source genome. This information is then used to evaluate alignments produced by the mapper. Finally, reports containing statistics of successful read alignments are created.In default of standards for encoding read origins, every evaluation tool has to be made explicitly compatible with the simulator used to generate reads. RESULTS: To solve this obstacle, we have created a generic format Read Naming Format (Rnf) for assigning read names with encoded information about original positions. Futhermore, we have developed an associated software package RnfTools containing two principal components. MIShmash applies one of popular read simulating tools (among DwgSim, Art, Mason, CuReSim, etc.) and transforms the generated reads into Rnf format. LAVEnder evaluates then a given read mapper using simulated reads in Rnf format. A special attention is payed to mapping qualities that serve for parametrization of Roc curves, and to evaluation of the effect of read sample contamination. AVAILABILITY AND IMPLEMENTATION: RnfTools: http://karel-brinda.github.io/rnftools Spec. of Rnf: http://karel-brinda.github.io/rnf-spec CONTACT: karel.brinda@univ-mlv.fr.


Asunto(s)
Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Programas Informáticos , Simulación por Computador , Genoma , Humanos
11.
Bioinformatics ; 32(7): 984-92, 2016 04 01.
Artículo en Inglés | MEDLINE | ID: mdl-26740523

RESUMEN

MOTIVATION: Whole genome sequencing of paired-end reads can be applied to characterize the landscape of large somatic rearrangements of cancer genomes. Several methods for detecting structural variants with whole genome sequencing data have been developed. So far, none of these methods has combined information about abnormally mapped read pairs connecting rearranged regions and associated global copy number changes automatically inferred from the same sequencing data file. Our aim was to create a computational method that could use both types of information, i.e. normal and abnormal reads, and demonstrate that by doing so we can highly improve both sensitivity and specificity rates of structural variant prediction. RESULTS: We developed a computational method, SV-Bay, to detect structural variants from whole genome sequencing mate-pair or paired-end data using a probabilistic Bayesian approach. This approach takes into account depth of coverage by normal reads and abnormalities in read pair mappings. To estimate the model likelihood, SV-Bay considers GC-content and read mappability of the genome, thus making important corrections to the expected read count. For the detection of somatic variants, SV-Bay makes use of a matched normal sample when it is available. We validated SV-Bay on simulated datasets and an experimental mate-pair dataset for the CLB-GA neuroblastoma cell line. The comparison of SV-Bay with several other methods for structural variant detection demonstrated that SV-Bay has better prediction accuracy both in terms of sensitivity and false-positive detection rate. AVAILABILITY AND IMPLEMENTATION: https://github.com/InstitutCurie/SV-Bay CONTACT: valentina.boeva@inserm.fr SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Teorema de Bayes , Estudio de Asociación del Genoma Completo , Variación Estructural del Genoma , Neoplasias/genética , Composición de Base , Genoma , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Metagenómica
12.
BMC Genomics ; 16: 873, 2015 Oct 28.
Artículo en Inglés | MEDLINE | ID: mdl-26510534

RESUMEN

BACKGROUND: Methylation of high-density CpG regions known as CpG Islands (CGIs) has been widely described as a mechanism associated with gene expression regulation. Aberrant promoter methylation is considered a hallmark of cancer involved in silencing of tumor suppressor genes and activation of oncogenes. However, recent studies have also challenged the simple model of gene expression control by promoter methylation in cancer, and the precise mechanism of and role played by changes in DNA methylation in carcinogenesis remains elusive. RESULTS: Using a large dataset of 672 matched cancerous and healthy methylomes, gene expression, and copy number profiles accross 3 types of tissues from The Cancer Genome Atlas (TCGA), we perform a detailed meta-analysis to clarify the interplay between promoter methylation and gene expression in normal and cancer samples. On the one hand, we recover the existence of a CpG island methylator phenotype (CIMP) with prognostic value in a subset of breast, colon and lung cancer samples, where a common subset of promoter CGIs hypomethylated in normal samples become hypermethylated. However, this hypermethylation is not accompanied by a decrease in expression of the corresponding genes, which are already lowly expressed in the normal genes. On the other hand, we identify tissue-specific sets of genes, different between normal and cancer samples, whose inter-individual variation in expression is significantly correlated with the variation in methylation of the 3' flanking regions of the promoter CGIs. These subsets of genes are not the same in the different tissues, nor between normal and cancerous samples, but transcription factors are over-represented in all subsets. CONCLUSION: Our results suggest that epigenetic reprogramming in cancer does not contribute to cancer development via direct inhibition of gene expression through promoter hypermethylation. It may instead modify how the expression of a few specific genes, particularly transcription factors, are associated with DNA methylation variations in a tissue-dependent manner.


Asunto(s)
Metilación de ADN/genética , Regulación Neoplásica de la Expresión Génica , Neoplasias/genética , Regiones Promotoras Genéticas/genética , Humanos
13.
Bioinformatics ; 30(24): 3443-50, 2014 Dec 15.
Artículo en Inglés | MEDLINE | ID: mdl-25016581

RESUMEN

MOTIVATION: Because of its low cost, amplicon sequencing, also known as ultra-deep targeted sequencing, is now becoming widely used in oncology for detection of actionable mutations, i.e. mutations influencing cell sensitivity to targeted therapies. Amplicon sequencing is based on the polymerase chain reaction amplification of the regions of interest, a process that considerably distorts the information on copy numbers initially present in the tumor DNA. Therefore, additional experiments such as single nucleotide polymorphism (SNP) or comparative genomic hybridization (CGH) arrays often complement amplicon sequencing in clinics to identify copy number status of genes whose amplification or deletion has direct consequences on the efficacy of a particular cancer treatment. So far, there has been no proven method to extract the information on gene copy number aberrations based solely on amplicon sequencing. RESULTS: Here we present ONCOCNV, a method that includes a multifactor normalization and annotation technique enabling the detection of large copy number changes from amplicon sequencing data. We validated our approach on high and low amplicon density datasets and demonstrated that ONCOCNV can achieve a precision comparable with that of array CGH techniques in detecting copy number aberrations. Thus, ONCOCNV applied on amplicon sequencing data would make the use of additional array CGH or SNP array experiments unnecessary.


Asunto(s)
Dosificación de Gen , Genes Relacionados con las Neoplasias , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Análisis de Secuencia de ADN/métodos , Hibridación Genómica Comparativa , ADN de Neoplasias/química , Exoma , Femenino , Humanos , Masculino , Reacción en Cadena de la Polimerasa , Polimorfismo de Nucleótido Simple
14.
Bioinformatics ; 30(11): 1539-46, 2014 Jun 01.
Artículo en Inglés | MEDLINE | ID: mdl-24493034

RESUMEN

MOTIVATION: DNA copy number profiles characterize regions of chromosome gains, losses and breakpoints in tumor genomes. Although many models have been proposed to detect these alterations, it is not clear which model is appropriate before visual inspection the signal, noise and models for a particular profile. RESULTS: We propose SegAnnDB, a Web-based computer vision system for genomic segmentation: first, visually inspect the profiles and manually annotate altered regions, then SegAnnDB determines the precise alteration locations using a mathematical model of the data and annotations. SegAnnDB facilitates collaboration between biologists and bioinformaticians, and uses the University of California, Santa Cruz genome browser to visualize copy number alterations alongside known genes. AVAILABILITY AND IMPLEMENTATION: The breakpoints project on INRIA GForge hosts the source code, an Amazon Machine Image can be launched and a demonstration Web site is http://bioviz.rocq.inria.fr.


Asunto(s)
Variaciones en el Número de Copia de ADN , Programas Informáticos , Algoritmos , Puntos de Rotura del Cromosoma , Genómica/métodos , Internet
15.
Bioinformatics ; 29(23): 2979-86, 2013 Dec 01.
Artículo en Inglés | MEDLINE | ID: mdl-24021381

RESUMEN

MOTIVATION: Cancer cells are often characterized by epigenetic changes, which include aberrant histone modifications. In particular, local or regional epigenetic silencing is a common mechanism in cancer for silencing expression of tumor suppressor genes. Though several tools have been created to enable detection of histone marks in ChIP-seq data from normal samples, it is unclear whether these tools can be efficiently applied to ChIP-seq data generated from cancer samples. Indeed, cancer genomes are often characterized by frequent copy number alterations: gains and losses of large regions of chromosomal material. Copy number alterations may create a substantial statistical bias in the evaluation of histone mark signal enrichment and result in underdetection of the signal in the regions of loss and overdetection of the signal in the regions of gain. RESULTS: We present HMCan (Histone modifications in cancer), a tool specially designed to analyze histone modification ChIP-seq data produced from cancer genomes. HMCan corrects for the GC-content and copy number bias and then applies Hidden Markov Models to detect the signal from the corrected data. On simulated data, HMCan outperformed several commonly used tools developed to analyze histone modification data produced from genomes without copy number alterations. HMCan also showed superior results on a ChIP-seq dataset generated for the repressive histone mark H3K27me3 in a bladder cancer cell line. HMCan predictions matched well with experimental data (qPCR validated regions) and included, for example, the previously detected H3K27me3 mark in the promoter of the DLEC1 gene, missed by other tools we tested.


Asunto(s)
Ensamble y Desensamble de Cromatina/genética , Inmunoprecipitación de Cromatina/métodos , Epigénesis Genética , Histonas/genética , Procesamiento Proteico-Postraduccional , Programas Informáticos , Neoplasias de la Vejiga Urinaria/genética , Composición de Base , Simulación por Computador , Variaciones en el Número de Copia de ADN/genética , Genoma Humano , Histonas/metabolismo , Humanos , Cadenas de Markov , Análisis de Secuencia por Matrices de Oligonucleótidos , Regiones Promotoras Genéticas/genética , Neoplasias de la Vejiga Urinaria/diagnóstico
16.
Nucleic Acids Res ; 40(18): 8927-41, 2012 Oct.
Artículo en Inglés | MEDLINE | ID: mdl-22790984

RESUMEN

Acute leukemias are characterized by deregulation of transcriptional networks that control the lineage specificity of gene expression. The aberrant overexpression of the Spi-1/PU.1 transcription factor leads to erythroleukemia. To determine how Spi-1 mechanistically influences the transcriptional program, we combined a ChIP-seq analysis with transcriptional profiling in cells from an erythroleukemic mouse model. We show that Spi-1 displays a selective DNA-binding that does not often cause transcriptional modulation. We report that Spi-1 controls transcriptional activation and repression partially through distinct Spi-1 recruitment to chromatin. We revealed several parameters impacting on Spi-1-mediated transcriptional activation. Gene activation is facilitated by Spi-1 occupancy close to transcriptional starting site of genes devoid of CGIs. Moreover, in those regions Spi-1 acts by binding to multiple motifs tightly clustered and with similar orientation. Finally, in contrast to the myeloid and lymphoid B cells in which Spi-1 exerts a physiological activity, in the erythroleukemic cells, lineage-specific cooperating factors do not play a prevalent role in Spi-1-mediated transcriptional activation. Thus, our work describes a new mechanism of gene activation through clustered site occupancy of Spi-1 particularly relevant in regard to the strong expression of Spi-1 in the erythroleukemic cells.


Asunto(s)
Leucemia Eritroblástica Aguda/genética , Proteínas Proto-Oncogénicas/metabolismo , Elementos Reguladores de la Transcripción , Transactivadores/metabolismo , Activación Transcripcional , Animales , Sitios de Unión , Línea Celular Tumoral , Inmunoprecipitación de Cromatina , Islas de CpG , ADN/química , ADN/metabolismo , Perfilación de la Expresión Génica , Regulación Neoplásica de la Expresión Génica , Genoma , Leucemia Eritroblástica Aguda/metabolismo , Ratones , Ratones Transgénicos , Motivos de Nucleótidos , Análisis de Secuencia de ADN , Sitio de Iniciación de la Transcripción
17.
J Natl Cancer Inst ; 116(6): 974-982, 2024 Jun 07.
Artículo en Inglés | MEDLINE | ID: mdl-38273663

RESUMEN

BACKGROUND: The phenomenon of field cancerization reflects the transition of normal cells into those predisposed to cancer. Assessing the scope and intensity of this process in the colon may support risk prediction and colorectal cancer prevention. METHODS: The Swiss Epigenetic Colorectal Cancer Study (SWEPIC) study, encompassing 1111 participants for DNA methylation analysis and a subset of 84 for RNA sequencing, was employed to detect field cancerization in individuals with adenomatous polyps (AP). Methylation variations were evaluated for their discriminative capability, including in external cohorts, genomic localization, clinical correlations, and associated RNA expression patterns. RESULTS: Normal cecal tissue of individuals harboring an AP in the proximal colon manifested dysregulated DNA methylation compared to tissue from healthy individuals at 558 unique loci. Leveraging these adenoma-related differentially variable and methylated CpGs (aDVMCs), our classifier discerned between healthy and AP-adjacent tissues across SWEPIC datasets (cross-validated area under the receiver operating characteristic curve [ROC AUC] = 0.63-0.81), including within age-stratified cohorts. This discriminative capacity was validated in 3 external sets, differentiating healthy from cancer-adjacent tissue (ROC AUC = 0.82-0.88). Notably, aDVMC dysregulation correlated with polyp multiplicity. More than 50% of aDVMCs were significantly associated with age. These aDVMCs were enriched in active regions of the genome (P < .001), and associated genes exhibited altered expression in AP-adjacent tissues. CONCLUSIONS: Our findings underscore the early onset of field cancerization in the right colon during the neoplastic transformation process. A more extensive validation of aDVMC dysregulation as a stratification tool could pave the way for enhanced surveillance approaches, especially given its linkage to adenoma emergence.


Asunto(s)
Pólipos Adenomatosos , Metilación de ADN , Humanos , Pólipos Adenomatosos/genética , Pólipos Adenomatosos/patología , Femenino , Masculino , Persona de Mediana Edad , Anciano , Biomarcadores de Tumor/genética , Mucosa Intestinal/patología , Mucosa Intestinal/metabolismo , Neoplasias Colorrectales/genética , Neoplasias Colorrectales/patología , Regulación Neoplásica de la Expresión Génica , Transformación Celular Neoplásica/genética , Islas de CpG/genética , Epigénesis Genética
18.
BMC Bioinformatics ; 14: 164, 2013 May 22.
Artículo en Inglés | MEDLINE | ID: mdl-23697330

RESUMEN

BACKGROUND: Many models have been proposed to detect copy number alterations in chromosomal copy number profiles, but it is usually not obvious to decide which is most effective for a given data set. Furthermore, most methods have a smoothing parameter that determines the number of breakpoints and must be chosen using various heuristics. RESULTS: We present three contributions for copy number profile smoothing model selection. First, we propose to select the model and degree of smoothness that maximizes agreement with visual breakpoint region annotations. Second, we develop cross-validation procedures to estimate the error of the trained models. Third, we apply these methods to compare 17 smoothing models on a new database of 575 annotated neuroblastoma copy number profiles, which we make available as a public benchmark for testing new algorithms. CONCLUSIONS: Whereas previous studies have been qualitative or limited to simulated data, our annotation-guided approach is quantitative and suggests which algorithms are fastest and most accurate in practice on real data. In the neuroblastoma data, the equivalent pelt.n and cghseg.k methods were the best breakpoint detectors, and exhibited reasonable computation times.


Asunto(s)
Puntos de Rotura del Cromosoma , Dosificación de Gen/genética , Perfilación de la Expresión Génica/métodos , Modelos Genéticos , Algoritmos , Mapeo Cromosómico/métodos , Humanos
19.
Bioinformatics ; 28(19): 2517-9, 2012 Oct 01.
Artículo en Inglés | MEDLINE | ID: mdl-22829625

RESUMEN

MOTIVATION: ChIP-seq consists of chromatin immunoprecipitation and deep sequencing of the extracted DNA fragments. It is the technique of choice for accurate characterization of the binding sites of transcription factors and other DNA-associated proteins. We present a web service, Nebula, which allows inexperienced users to perform a complete bioinformatics analysis of ChIP-seq data. RESULTS: Nebula was designed for both bioinformaticians and biologists. It is based on the Galaxy open source framework. Galaxy already includes a large number of functionalities for mapping reads and peak calling. We added the following to Galaxy: (i) peak calling with FindPeaks and a module for immunoprecipitation quality control, (ii) de novo motif discovery with ChIPMunk, (iii) calculation of the density and the cumulative distribution of peak locations relative to gene transcription start sites, (iv) annotation of peaks with genomic features and (v) annotation of genes with peak information. Nebula generates the graphs and the enrichment statistics at each step of the process. During Steps 3-5, Nebula optionally repeats the analysis on a control dataset and compares these results with those from the main dataset. Nebula can also incorporate gene expression (or gene modulation) data during these steps. In summary, Nebula is an innovative web service that provides an advanced ChIP-seq analysis pipeline providing ready-to-publish results. AVAILABILITY: Nebula is available at http://nebula.curie.fr/ SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Inmunoprecipitación de Cromatina/métodos , Biología Computacional/métodos , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Internet , Sitios de Unión , Programas Informáticos
20.
Bioinformatics ; 28(3): 423-5, 2012 Feb 01.
Artículo en Inglés | MEDLINE | ID: mdl-22155870

RESUMEN

SUMMARY: More and more cancer studies use next-generation sequencing (NGS) data to detect various types of genomic variation. However, even when researchers have such data at hand, single-nucleotide polymorphism arrays have been considered necessary to assess copy number alterations and especially loss of heterozygosity (LOH). Here, we present the tool Control-FREEC that enables automatic calculation of copy number and allelic content profiles from NGS data, and consequently predicts regions of genomic alteration such as gains, losses and LOH. Taking as input aligned reads, Control-FREEC constructs copy number and B-allele frequency profiles. The profiles are then normalized, segmented and analyzed in order to assign genotype status (copy number and allelic content) to each genomic region. When a matched normal sample is provided, Control-FREEC discriminates somatic from germline events. Control-FREEC is able to analyze overdiploid tumor samples and samples contaminated by normal cells. Low mappability regions can be excluded from the analysis using provided mappability tracks. AVAILABILITY: C++ source code is available at: http://bioinfo.curie.fr/projects/freec/ CONTACT: freec@curie.fr SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Variaciones en el Número de Copia de ADN , Programas Informáticos , Alelos , Humanos , Pérdida de Heterocigocidad , Neoplasias/genética , Polimorfismo de Nucleótido Simple
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA