Búsqueda | Portal Regional de la BVS

Analysis of 3760 hematologic malignancies reveals rare transcriptomic aberrations of driver genes.

Cao, Xueqi; Huber, Sandra; Ahari, Ata Jadid; Traube, Franziska R; Seifert, Marc; Oakes, Christopher C; Secheyko, Polina; Vilov, Sergey; Scheller, Ines F; Wagner, Nils; Yépez, Vicente A; Blombery, Piers; Haferlach, Torsten; Heinig, Matthias; Wachutka, Leonhard; Hutter, Stephan; Gagneur, Julien.

Genome Med ; 16(1): 70, 2024 May 20.

Artículo en Inglés | MEDLINE | ID: mdl-38769532

RESUMEN

BACKGROUND: Rare oncogenic driver events, particularly affecting the expression or splicing of driver genes, are suspected to substantially contribute to the large heterogeneity of hematologic malignancies. However, their identification remains challenging. METHODS: To address this issue, we generated the largest dataset to date of matched whole genome sequencing and total RNA sequencing of hematologic malignancies from 3760 patients spanning 24 disease entities. Taking advantage of our dataset size, we focused on discovering rare regulatory aberrations. Therefore, we called expression and splicing outliers using an extension of the workflow DROP (Detection of RNA Outliers Pipeline) and AbSplice, a variant effect predictor that identifies genetic variants causing aberrant splicing. We next trained a machine learning model integrating these results to prioritize new candidate disease-specific driver genes. RESULTS: We found a median of seven expression outlier genes, two splicing outlier genes, and two rare splice-affecting variants per sample. Each category showed significant enrichment for already well-characterized driver genes, with odds ratios exceeding three among genes called in more than five samples. On held-out data, our integrative modeling significantly outperformed modeling based solely on genomic data and revealed promising novel candidate driver genes. Remarkably, we found a truncated form of the low density lipoprotein receptor LRP1B transcript to be aberrantly overexpressed in about half of hairy cell leukemia variant (HCL-V) samples and, to a lesser extent, in closely related B-cell neoplasms. This observation, which was confirmed in an independent cohort, suggests LRP1B as a novel marker for a HCL-V subclass and a yet unreported functional role of LRP1B within these rare entities. CONCLUSIONS: Altogether, our census of expression and splicing outliers for 24 hematologic malignancy entities and the companion computational workflow constitute unique resources to deepen our understanding of rare oncogenic events in hematologic cancers.

Asunto(s)

Neoplasias Hematológicas , Transcriptoma , Humanos , Neoplasias Hematológicas/genética , Empalme del ARN , Regulación Neoplásica de la Expresión Génica , Oncogenes , Perfilación de la Expresión Génica , Receptores de LDL/genética

Transmicron: accurate prediction of insertion probabilities improves detection of cancer driver genes from transposon mutagenesis screens.

Bredthauer, Carl; Fischer, Anja; Ahari, Ata Jadid; Cao, Xueqi; Weber, Julia; Rad, Lena; Rad, Roland; Wachutka, Leonhard; Gagneur, Julien.

Nucleic Acids Res ; 51(4): e21, 2023 02 28.

Artículo en Inglés | MEDLINE | ID: mdl-36617985

RESUMEN

Transposon screens are powerful in vivo assays used to identify loci driving carcinogenesis. These loci are identified as Common Insertion Sites (CISs), i.e. regions with more transposon insertions than expected by chance. However, the identification of CISs is affected by biases in the insertion behaviour of transposon systems. Here, we introduce Transmicron, a novel method that differs from previous methods by (i) modelling neutral insertion rates based on chromatin accessibility, transcriptional activity and sequence context and (ii) estimating oncogenic selection for each genomic region using Poisson regression to model insertion counts while controlling for neutral insertion rates. To assess the benefits of our approach, we generated a dataset applying two different transposon systems under comparable conditions. Benchmarking for enrichment of known cancer genes showed improved performance of Transmicron against state-of-the-art methods. Modelling neutral insertion rates allowed for better control of false positives and stronger agreement of the results between transposon systems. Moreover, using Poisson regression to consider intra-sample and inter-sample information proved beneficial in small and moderately-sized datasets. Transmicron is open-source and freely available. Overall, this study contributes to the understanding of transposon biology and introduces a novel approach to use this knowledge for discovering cancer driver genes.

Asunto(s)

Elementos Transponibles de ADN , Neoplasias , Programas Informáticos , Humanos , Secuencia de Bases , Carcinogénesis , Mutagénesis Insercional , Oncogenes , Neoplasias/genética

Detection of aberrant gene expression events in RNA sequencing data.

Yépez, Vicente A; Mertes, Christian; Müller, Michaela F; Klaproth-Andrade, Daniela; Wachutka, Leonhard; Frésard, Laure; Gusic, Mirjana; Scheller, Ines F; Goldberg, Patricia F; Prokisch, Holger; Gagneur, Julien.

Nat Protoc ; 16(2): 1276-1296, 2021 02.

Artículo en Inglés | MEDLINE | ID: mdl-33462443

RESUMEN

RNA sequencing (RNA-seq) has emerged as a powerful approach to discover disease-causing gene regulatory defects in individuals affected by genetically undiagnosed rare disorders. Pioneering studies have shown that RNA-seq could increase the diagnosis rates over DNA sequencing alone by 8-36%, depending on the disease entity and tissue probed. To accelerate adoption of RNA-seq by human genetics centers, detailed analysis protocols are now needed. We present a step-by-step protocol that details how to robustly detect aberrant expression levels, aberrant splicing and mono-allelic expression in RNA-seq data using dedicated statistical methods. We describe how to generate and assess quality control plots and interpret the analysis results. The protocol is based on the detection of RNA outliers pipeline (DROP), a modular computational workflow that integrates all the analysis steps, can leverage parallel computing infrastructures and generates browsable web page reports.

Asunto(s)

Secuencia de Bases/genética , Expresión Génica/genética , Análisis de Secuencia de ARN/métodos , Diagnóstico , Técnicas y Procedimientos Diagnósticos , Enfermedad/genética , Perfilación de la Expresión Génica/métodos , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Humanos , ARN/genética , Programas Informáticos , Flujo de Trabajo

Global donor and acceptor splicing site kinetics in human cells.

Wachutka, Leonhard; Caizzi, Livia; Gagneur, Julien; Cramer, Patrick.

Elife ; 82019 04 26.

Artículo en Inglés | MEDLINE | ID: mdl-31025937

RESUMEN

RNA splicing is an essential part of eukaryotic gene expression. Although the mechanism of splicing has been extensively studied in vitro, in vivo kinetics for the two-step splicing reaction remain poorly understood. Here, we combine transient transcriptome sequencing (TT-seq) and mathematical modeling to quantify RNA metabolic rates at donor and acceptor splice sites across the human genome. Splicing occurs in the range of minutes and is limited by the speed of RNA polymerase elongation. Splicing kinetics strongly depends on the position and nature of nucleotides flanking splice sites, and on structural interactions between unspliced RNA and small nuclear RNAs in spliceosomal intermediates. Finally, we introduce the 'yield' of splicing as the efficiency of converting unspliced to spliced RNA and show that it is highest for mRNAs and independent of splicing kinetics. These results lead to quantitative models describing how splicing rates and yield are encoded in the human genome.

Asunto(s)

Genoma Humano , Empalme del ARN , Perfilación de la Expresión Génica , Humanos , Células K562 , Cinética , Modelos Teóricos , Análisis de Secuencia de ARN , Empalmosomas/metabolismo

OCR-Stats: Robust estimation and statistical testing of mitochondrial respiration activities using Seahorse XF Analyzer.

Yépez, Vicente A; Kremer, Laura S; Iuso, Arcangela; Gusic, Mirjana; Kopajtich, Robert; Konaríková, Eliska; Nadel, Agnieszka; Wachutka, Leonhard; Prokisch, Holger; Gagneur, Julien.

PLoS One ; 13(7): e0199938, 2018.

Artículo en Inglés | MEDLINE | ID: mdl-29995917

RESUMEN

The accurate quantification of cellular and mitochondrial bioenergetic activity is of great interest in medicine and biology. Mitochondrial stress tests performed with Seahorse Bioscience XF Analyzers allow the estimation of different bioenergetic measures by monitoring the oxygen consumption rates (OCR) of living cells in multi-well plates. However, studies of the statistical best practices for determining aggregated OCR measurements and comparisons have been lacking. Therefore, to understand how OCR behaves across different biological samples, wells, and plates, we performed mitochondrial stress tests in 126 96-well plates involving 203 fibroblast cell lines. We show that the noise of OCR is multiplicative, that outlier data points can concern individual measurements or all measurements of a well, and that the inter-plate variation is greater than the intra-plate variation. Based on these insights, we developed a novel statistical method, OCR-Stats, that: i) robustly estimates OCR levels modeling multiplicative noise and automatically identifying outlier data points and outlier wells; and ii) performs statistical testing between samples, taking into account the different magnitudes of the between- and within-plate variations. This led to a significant reduction of the coefficient of variation across plates of basal respiration by 45% and of maximal respiration by 29%. Moreover, using positive and negative controls, we show that our statistical test outperforms the existing methods, which suffer from an excess of either false positives (within-plate methods), or false negatives (between-plate methods). Altogether, this study provides statistical good practices to support experimentalists in designing, analyzing, testing, and reporting the results of mitochondrial stress tests using this high throughput platform.

Asunto(s)

Mitocondrias/metabolismo , Análisis de Matrices Tisulares/métodos , Línea Celular , Respiración de la Célula , Metabolismo Energético , Fibroblastos/citología , Modelos Estadísticos , Consumo de Oxígeno

Measures of RNA metabolism rates: Toward a definition at the level of single bonds.

Wachutka, Leonhard; Gagneur, Julien.

Transcription ; 8(2): 75-80, 2017 03 15.

Artículo en Inglés | MEDLINE | ID: mdl-27841720

RESUMEN

We give an overview of experimental and computational methods to estimate RNA metabolism rates genome-wide. We then advocate a local definition of RNA metabolism rate at the level of individual phosphodiester bonds. Rates of formation and disappearance of individual bonds are unambiguously defined, in contrast to rates of complete transcripts. We show that over previous approaches, the recently developed transient transcriptome sequencing (TT-seq) protocol allows for estimation of metabolism rates of individual bonds with least positional bias.

Asunto(s)

ARN/metabolismo , Exones , Genoma , Humanos , Cinética , ARN/química , Empalme del ARN , Análisis de Secuencia de ARN , Transcriptoma

Determinants of RNA metabolism in the Schizosaccharomyces pombe genome.

Eser, Philipp; Wachutka, Leonhard; Maier, Kerstin C; Demel, Carina; Boroni, Mariana; Iyer, Srignanakshi; Cramer, Patrick; Gagneur, Julien.

Mol Syst Biol ; 12(2): 857, 2016 Feb 16.

Artículo en Inglés | MEDLINE | ID: mdl-26883383

RESUMEN

To decrypt the regulatory code of the genome, sequence elements must be defined that determine the kinetics of RNA metabolism and thus gene expression. Here, we attempt such decryption in an eukaryotic model organism, the fission yeast S. pombe. We first derive an improved genome annotation that redefines borders of 36% of expressed mRNAs and adds 487 non-coding RNAs (ncRNAs). We then combine RNA labeling in vivo with mathematical modeling to obtain rates of RNA synthesis and degradation for 5,484 expressed RNAs and splicing rates for 4,958 introns. We identify functional sequence elements in DNA and RNA that control RNA metabolic rates and quantify the contributions of individual nucleotides to RNA synthesis, splicing, and degradation. Our approach reveals distinct kinetics of mRNA and ncRNA metabolism, separates antisense regulation by transcription interference from RNA interference, and provides a general tool for studying the regulatory code of genomes.

Asunto(s)

Regulación Fúngica de la Expresión Génica , Genoma Fúngico , ARN de Hongos/genética , ARN Mensajero/genética , Schizosaccharomyces/genética , Intrones , Interferencia de ARN , Empalme del ARN , ARN sin Sentido/genética , Análisis de Secuencia de ARN , Transcripción Genética

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

ENVIAR RESULTADO:

SELECCIÓN DE REFERENCIAS

DETALLE DE LA BÚSQUEDA