RESUMEN
Acute physical activity leads to several changes in metabolic, cardiovascular, and immune pathways. Although studies have examined selected changes in these pathways, the system-wide molecular response to an acute bout of exercise has not been fully characterized. We performed longitudinal multi-omic profiling of plasma and peripheral blood mononuclear cells including metabolome, lipidome, immunome, proteome, and transcriptome from 36 well-characterized volunteers, before and after a controlled bout of symptom-limited exercise. Time-series analysis revealed thousands of molecular changes and an orchestrated choreography of biological processes involving energy metabolism, oxidative stress, inflammation, tissue repair, and growth factor response, as well as regulatory pathways. Most of these processes were dampened and some were reversed in insulin-resistant participants. Finally, we discovered biological pathways involved in cardiopulmonary exercise response and developed prediction models revealing potential resting blood-based biomarkers of peak oxygen consumption.
Asunto(s)
Metabolismo Energético/fisiología , Ejercicio Físico/fisiología , Anciano , Biomarcadores/metabolismo , Femenino , Humanos , Insulina/metabolismo , Resistencia a la Insulina , Leucocitos Mononucleares/metabolismo , Estudios Longitudinales , Masculino , Metaboloma , Persona de Mediana Edad , Oxígeno/metabolismo , Consumo de Oxígeno , Proteoma , TranscriptomaRESUMEN
Cancer genomes often harbor hundreds of somatic DNA rearrangement junctions, many of which cannot be easily classified into simple (e.g., deletion) or complex (e.g., chromothripsis) structural variant classes. Applying a novel genome graph computational paradigm to analyze the topology of junction copy number (JCN) across 2,778 tumor whole-genome sequences, we uncovered three novel complex rearrangement phenomena: pyrgo, rigma, and tyfonas. Pyrgo are "towers" of low-JCN duplications associated with early-replicating regions, superenhancers, and breast or ovarian cancers. Rigma comprise "chasms" of low-JCN deletions enriched in late-replicating fragile sites and gastrointestinal carcinomas. Tyfonas are "typhoons" of high-JCN junctions and fold-back inversions associated with expressed protein-coding fusions, breakend hypermutation, and acral, but not cutaneous, melanomas. Clustering of tumors according to genome graph-derived features identified subgroups associated with DNA repair defects and poor prognosis.
Asunto(s)
Variación Estructural del Genoma/genética , Genómica/métodos , Neoplasias/genética , Inversión Cromosómica/genética , Cromotripsis , Variaciones en el Número de Copia de ADN/genética , Reordenamiento Génico/genética , Genoma Humano/genética , Humanos , Mutación/genética , Secuenciación Completa del Genoma/métodosRESUMEN
The splicing of pre-mRNAs into mature transcripts is remarkable for its precision, but the mechanisms by which the cellular machinery achieves such specificity are incompletely understood. Here, we describe a deep neural network that accurately predicts splice junctions from an arbitrary pre-mRNA transcript sequence, enabling precise prediction of noncoding genetic variants that cause cryptic splicing. Synonymous and intronic mutations with predicted splice-altering consequence validate at a high rate on RNA-seq and are strongly deleterious in the human population. De novo mutations with predicted splice-altering consequence are significantly enriched in patients with autism and intellectual disability compared to healthy controls and validate against RNA-seq in 21 out of 28 of these patients. We estimate that 9%-11% of pathogenic mutations in patients with rare genetic disorders are caused by this previously underappreciated class of disease variation.
Asunto(s)
Predicción/métodos , Precursores del ARN/genética , Empalme del ARN/genética , Algoritmos , Empalme Alternativo/genética , Trastorno Autístico/genética , Aprendizaje Profundo , Exones/genética , Humanos , Discapacidad Intelectual/genética , Intrones/genética , Redes Neurales de la Computación , Precursores del ARN/metabolismo , Sitios de Empalme de ARN/genética , Sitios de Empalme de ARN/fisiologíaRESUMEN
Characterizing cell-cell communication and tracking its variability over time are crucial for understanding the coordination of biological processes mediating normal development, disease progression, and responses to perturbations such as therapies. Existing tools fail to capture time-dependent intercellular interactions and primarily rely on databases compiled from limited contexts. We introduce DIISCO, a Bayesian framework designed to characterize the temporal dynamics of cellular interactions using single-cell RNA-sequencing data from multiple time points. Our method utilizes structured Gaussian process regression to unveil time-resolved interactions among diverse cell types according to their coevolution and incorporates prior knowledge of receptor-ligand complexes. We show the interpretability of DIISCO in simulated data and new data collected from T cells cocultured with lymphoma cells, demonstrating its potential to uncover dynamic cell-cell cross talk.
Asunto(s)
Teorema de Bayes , Comunicación Celular , Análisis de la Célula Individual , Análisis de la Célula Individual/métodos , Humanos , Linfocitos T/metabolismo , Análisis de Secuencia de ARN/métodosRESUMEN
It is a generally accepted model that environmental influences can exert their effects, at least in part, by changing the molecular regulators of transcription that are described as epigenetic. As there is biochemical evidence that some epigenetic regulators of transcription can maintain their states long term and through cell division, an epigenetic model encompasses the idea of maintenance of the effect of an exposure long after it is no longer present. The evidence supporting this model is mostly from the observation of alterations of molecular regulators of transcription following exposures. With the understanding that the interpretation of these associations is more complex than originally recognised, this model may be oversimplistic; therefore, adopting novel perspectives and experimental approaches when examining how environmental exposures are linked to phenotypes may prove worthwhile. In this review, we have chosen to use the example of nonalcoholic fatty liver disease (NAFLD), a common, complex human disease with strong environmental and genetic influences. We describe how epigenomic approaches combined with emerging functional genetic and single-cell genomic techniques are poised to generate new insights into the pathogenesis of environmentally influenced human disease phenotypes exemplified by NAFLD.
Asunto(s)
Enfermedad del Hígado Graso no Alcohólico , Humanos , Enfermedad del Hígado Graso no Alcohólico/genética , Epigénesis Genética , Epigenómica , Exposición a Riesgos Ambientales/efectos adversos , FenotipoRESUMEN
Modern population-scale biobanks contain simultaneous measurements of many phenotypes, providing unprecedented opportunity to study the relationship between biomarkers and disease. However, inferring causal effects from observational data is notoriously challenging. Mendelian randomization (MR) has recently received increased attention as a class of methods for estimating causal effects using genetic associations. However, standard methods result in pervasive false positives when two traits share a heritable, unobserved common cause. This is the problem of correlated pleiotropy. Here, we introduce a flexible framework for simulating traits with a common genetic confounder that generalizes recently proposed models, as well as a simple approach we call Welch-weighted Egger regression (WWER) for estimating causal effects. We show in comprehensive simulations that our method substantially reduces false positives due to correlated pleiotropy while being fast enough to apply to hundreds of phenotypes. We apply our method first to a subset of the UK Biobank consisting of blood traits and inflammatory disease, and then to a broader set of 411 heritable phenotypes. We detect many effects with strong literature support, as well as numerous behavioral effects that appear to stem from physician advice given to people at high risk for disease. We conclude that WWER is a powerful tool for exploratory data analysis in ever-growing databases of genotypes and phenotypes.
Asunto(s)
Reacciones Falso Positivas , Pleiotropía Genética , Análisis de la Aleatorización Mendeliana/métodos , Modelos Genéticos , Análisis de Regresión , Simulación por Computador , Femenino , Humanos , Inflamación/sangre , Inflamación/genética , Masculino , Análisis de la Aleatorización Mendeliana/normas , Fenotipo , Polimorfismo de Nucleótido SimpleRESUMEN
Complex traits and diseases can be influenced by both genetics and environment. However, given the large number of environmental stimuli and power challenges for gene-by-environment testing, it remains a critical challenge to identify and prioritize specific disease-relevant environmental exposures. We propose a framework for leveraging signals from transcriptional responses to environmental perturbations to identify disease-relevant perturbations that can modulate genetic risk for complex traits and inform the functions of genetic variants associated with complex traits. We perturbed human skeletal-muscle-, fat-, and liver-relevant cell lines with 21 perturbations affecting insulin resistance, glucose homeostasis, and metabolic regulation in humans and identified thousands of environmentally responsive genes. By combining these data with GWASs from 31 distinct polygenic traits, we show that the heritability of multiple traits is enriched in regions surrounding genes responsive to specific perturbations and, further, that environmentally responsive genes are enriched for associations with specific diseases and phenotypes from the GWAS Catalog. Overall, we demonstrate the advantages of large-scale characterization of transcriptional changes in diversely stimulated and pathologically relevant cells to identify disease-relevant perturbations.
Asunto(s)
Interacción Gen-Ambiente , Predisposición Genética a la Enfermedad , Estudio de Asociación del Genoma Completo , Herencia Multifactorial , Polimorfismo de Nucleótido Simple , Sitios de Carácter Cuantitativo , Enfermedades Autoinmunes/etiología , Enfermedades Autoinmunes/patología , Humanos , Trastornos Mentales/etiología , Trastornos Mentales/patología , Enfermedades Metabólicas/etiología , Enfermedades Metabólicas/patología , FenotipoRESUMEN
MOTIVATION: Linkage disequilibrium (LD) matrices derived from large populations are widely used in population genetics in fine-mapping, LD score regression, and linear mixed models for Genome-wide Association Studies (GWAS). However, these matrices can reach large sizes when they are derived from millions of individuals; hence, moving, sharing and extracting granular information from this large amount of data can be cumbersome. RESULTS: We sought to address the need for compressing and easily querying large LD matrices by developing LDmat. LDmat is a standalone tool to compress large LD matrices in an HDF5 file format and query these compressed matrices. It can extract submatrices corresponding to a sub-region of the genome, a list of select loci, and loci within a minor allele frequency range. LDmat can also rebuild the original file formats from the compressed files. AVAILABILITY AND IMPLEMENTATION: LDmat is implemented in python, and can be installed on Unix systems with the command 'pip install ldmat'. It can also be accessed through https://github.com/G2Lab/ldmat and https://pypi.org/project/ldmat/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Asunto(s)
Compresión de Datos , Programas Informáticos , Humanos , Desequilibrio de Ligamiento , Estudio de Asociación del Genoma Completo , GenomaRESUMEN
Multi-task deep learning (DL) models can accurately predict diverse genomic marks from sequence, but whether these models learn the causal relationships between genomic marks is unknown. Here, we describe Deep Mendelian Randomization (DeepMR), a method for estimating causal relationships between genomic marks learned by genomic DL models. By combining Mendelian randomization with in silico mutagenesis, DeepMR obtains local (locus specific) and global estimates of (an assumed) linear causal relationship between marks. In a simulation designed to test recovery of pairwise causal relations between transcription factors (TFs), DeepMR gives accurate and unbiased estimates of the 'true' global causal effect, but its coverage decays in the presence of sequence-dependent confounding. We then apply DeepMR to examine the global relationships learned by a state-of-the-art DL model, BPNet, between TFs involved in reprogramming. DeepMR's causal effect estimates validate previously hypothesized relationships between TFs and suggest new relationships for future investigation.
Asunto(s)
Aprendizaje Profundo , Análisis de la Aleatorización Mendeliana , Análisis de la Aleatorización Mendeliana/métodos , Causalidad , Proyectos de Investigación , GenómicaRESUMEN
To fully understand animal transcription networks, it is essential to accurately measure the spatial and temporal expression patterns of transcription factors and their targets. We describe a registration technique that takes image-based data from hundreds of Drosophila blastoderm embryos, each costained for a reference gene and one of a set of genes of interest, and builds a model VirtualEmbryo. This model captures in a common framework the average expression patterns for many genes in spite of significant variation in morphology and expression between individual embryos. We establish the method's accuracy by showing that relationships between a pair of genes' expression inferred from the model are nearly identical to those measured in embryos costained for the pair. We present a VirtualEmbryo containing data for 95 genes at six time cohorts. We show that known gene-regulatory interactions can be automatically recovered from this data set and predict hundreds of new interactions.
Asunto(s)
Drosophila melanogaster/genética , Redes Reguladoras de Genes , Modelos Genéticos , Animales , Blastodermo , Drosophila melanogaster/metabolismo , Embrión no Mamífero/metabolismo , Regulación del Desarrollo de la Expresión Génica , Genes de InsectoRESUMEN
Amyotrophic lateral sclerosis (ALS) is a rapidly progressing neurodegenerative disease that is characterized by motor neuron loss and that leads to paralysis and death 2-5 years after disease onset. Nearly all patients with ALS have aggregates of the RNA-binding protein TDP-43 in their brains and spinal cords, and rare mutations in the gene encoding TDP-43 can cause ALS. There are no effective TDP-43-directed therapies for ALS or related TDP-43 proteinopathies, such as frontotemporal dementia. Antisense oligonucleotides (ASOs) and RNA-interference approaches are emerging as attractive therapeutic strategies in neurological diseases. Indeed, treatment of a rat model of inherited ALS (caused by a mutation in Sod1) with ASOs against Sod1 has been shown to substantially slow disease progression. However, as SOD1 mutations account for only around 2-5% of ALS cases, additional therapeutic strategies are needed. Silencing TDP-43 itself is probably not appropriate, given its critical cellular functions. Here we present a promising alternative therapeutic strategy for ALS that involves targeting ataxin-2. A decrease in ataxin-2 suppresses TDP-43 toxicity in yeast and flies, and intermediate-length polyglutamine expansions in the ataxin-2 gene increase risk of ALS. We used two independent approaches to test whether decreasing ataxin-2 levels could mitigate disease in a mouse model of TDP-43 proteinopathy. First, we crossed ataxin-2 knockout mice with TDP-43 (also known as TARDBP) transgenic mice. The decrease in ataxin-2 reduced aggregation of TDP-43, markedly increased survival and improved motor function. Second, in a more therapeutically applicable approach, we administered ASOs targeting ataxin-2 to the central nervous system of TDP-43 transgenic mice. This single treatment markedly extended survival. Because TDP-43 aggregation is a component of nearly all cases of ALS, targeting ataxin-2 could represent a broadly effective therapeutic strategy.
Asunto(s)
Esclerosis Amiotrófica Lateral/genética , Esclerosis Amiotrófica Lateral/terapia , Ataxina-2/deficiencia , Proteínas de Unión al ADN/metabolismo , Longevidad , Oligonucleótidos Antisentido/uso terapéutico , Agregación Patológica de Proteínas/terapia , Esclerosis Amiotrófica Lateral/metabolismo , Esclerosis Amiotrófica Lateral/fisiopatología , Animales , Ataxina-2/genética , Sistema Nervioso Central/metabolismo , Gránulos Citoplasmáticos/metabolismo , Proteínas de Unión al ADN/química , Proteínas de Unión al ADN/genética , Progresión de la Enfermedad , Femenino , Técnicas de Silenciamiento del Gen , Humanos , Masculino , Ratones , Ratones Noqueados , Ratones Transgénicos , Destreza Motora/fisiología , Oligonucleótidos Antisentido/administración & dosificación , Oligonucleótidos Antisentido/genética , Agregación Patológica de Proteínas/genética , Estrés Fisiológico , Análisis de SupervivenciaRESUMEN
ABSTRACT: There is increasing reliance on computed tomography to evaluate fractures and dislocations following routine evaluation with plain radiography, critical in preoperative planning; computed tomography can provide multiplanar reformats and 3-dimensional volume-rendered imaging, providing a better global assessment for the orthopedic surgeon. The radiologist plays a critical role in appropriately reformatting the raw axial images to illustrate best the findings that will help determine further management. In addition, the radiologist must succinctly report the pertinent findings that will have the most significant bearing on treatment, assisting the surgeon in deciding between nonoperative and operative management. The radiologist should also carefully review imaging to look for ancillary findings in the setting of trauma beyond the bones and joints, including the lungs and rib cage when visualized.In this review article, we will systematically describe key features for fractures of the scapula, proximal humerus, distal humerus, radial head and neck, olecranon, coronoid process through a case-based approach, and distal radius. Although there are numerous detailed classification systems for each of these fractures, we aim to focus on the core descriptors that underpin these classification systems. The goal is to provide the radiologist with a checklist of critical structures they must assess and findings that they should mention in their report, emphasizing those descriptors that influence patient management.
Asunto(s)
Articulación del Codo , Fracturas Óseas , Hombro , Tomografía Computarizada por Rayos X , Adulto , Humanos , Articulación del Codo/diagnóstico por imagen , Fracturas Óseas/diagnóstico por imagen , Radiografía , Hombro/diagnóstico por imagen , Tomografía Computarizada por Rayos X/métodos , Escápula/diagnóstico por imagenRESUMEN
Previous studies have prioritized trait-relevant cell types by looking for an enrichment of genome-wide association study (GWAS) signal within functional regions. However, these studies are limited in cell resolution by the lack of functional annotations from difficult-to-characterize or rare cell populations. Measurement of single-cell gene expression has become a popular method for characterizing novel cell types, and yet limited work has linked single-cell RNA sequencing (RNA-seq) to phenotypes of interest. To address this deficiency, we present RolyPoly, a regression-based polygenic model that can prioritize trait-relevant cell types and genes from GWAS summary statistics and gene expression data. RolyPoly is designed to use expression data from either bulk tissue or single-cell RNA-seq. In this study, we demonstrated RolyPoly's accuracy through simulation and validated previously known tissue-trait associations. We discovered a significant association between microglia and late-onset Alzheimer disease and an association between schizophrenia and oligodendrocytes and replicating fetal cortical cells. Additionally, RolyPoly computes a trait-relevance score for each gene to reflect the importance of expression specific to a cell type. We found that differentially expressed genes in the prefrontal cortex of individuals with Alzheimer disease were significantly enriched with genes ranked highly by RolyPoly gene scores. Overall, our method represents a powerful framework for understanding the effect of common variants on cell types contributing to complex traits.
Asunto(s)
Enfermedad de Alzheimer/genética , Microglía/metabolismo , Oligodendroglía/metabolismo , Esquizofrenia/genética , Análisis de la Célula Individual/estadística & datos numéricos , Programas Informáticos , Enfermedad de Alzheimer/diagnóstico , Enfermedad de Alzheimer/patología , Simulación por Computador , Feto , Estudio de Asociación del Genoma Completo , Humanos , Microglía/patología , Modelos Genéticos , Oligodendroglía/patología , Corteza Prefrontal/metabolismo , Corteza Prefrontal/patología , Sitios de Carácter Cuantitativo , Esquizofrenia/diagnóstico , Esquizofrenia/patología , Análisis de la Célula Individual/métodos , TranscriptomaRESUMEN
Identifying interactions between genetics and the environment (GxE) remains challenging. We have developed EAGLE, a hierarchical Bayesian model for identifying GxE interactions based on associations between environmental variables and allele-specific expression. Combining whole-blood RNA-seq with extensive environmental annotations collected from 922 human individuals, we identified 35 GxE interactions, compared with only four using standard GxE interaction testing. EAGLE provides new opportunities for researchers to identify GxE interactions using functional genomic data.
Asunto(s)
Alelos , Epigénesis Genética , Regulación de la Expresión Génica , Variación Genética , Adulto , Estudios de Cohortes , Femenino , Humanos , Masculino , Modelos Genéticos , Sitios de Carácter CuantitativoRESUMEN
Drug screening studies typically involve assaying the sensitivity of a range of cancer cell lines across an array of anti-cancer therapeutics. Alongside these sensitivity measurements high dimensional molecular characterizations of the cell lines are typically available, including gene expression, copy number variation and genomic mutations. We propose a sparse multitask regression model which learns discriminative latent characteristics that predict drug sensitivity and are associated with specific molecular features. We use ideas from Bayesian nonparametrics to automatically infer the appropriate number of these latent characteristics. The resulting analysis couples high predictive performance with interpretability since each latent characteristic involves a typically small set of drugs, cell lines and genomic features. Our model uncovers a number of drug-gene sensitivity associations missed by single gene analyses. We functionally validate one such novel association: that increased expression of the cell-cycle regulator C/EBPδ decreases sensitivity to the histone deacetylase (HDAC) inhibitor panobinostat.
Asunto(s)
Predicción/métodos , Neoplasias/genética , Antineoplásicos/farmacología , Teorema de Bayes , Biomarcadores Farmacológicos , Proteína delta de Unión al Potenciador CCAAT/genética , Línea Celular Tumoral , Variaciones en el Número de Copia de ADN , Genoma , Genómica , Inhibidores de Histona Desacetilasas/farmacología , Humanos , Neoplasias/tratamiento farmacológico , Panobinostat/farmacología , Análisis de Regresión , Estadísticas no ParamétricasRESUMEN
Methods for multiple-testing correction in local expression quantitative trait locus (cis-eQTL) studies are a trade-off between statistical power and computational efficiency. Bonferroni correction, though computationally trivial, is overly conservative and fails to account for linkage disequilibrium between variants. Permutation-based methods are more powerful, though computationally far more intensive. We present an alternative correction method called eigenMT, which runs over 500 times faster than permutations and has adjusted p values that closely approximate empirical ones. To achieve this speed while also maintaining the accuracy of permutation-based methods, we estimate the effective number of independent variants tested for association with a particular gene, termed Meff, by using the eigenvalue decomposition of the genotype correlation matrix. We employ a regularized estimator of the correlation matrix to ensure Meff is robust and yields adjusted p values that closely approximate p values from permutations. Finally, using a common genotype matrix, we show that eigenMT can be applied with even greater efficiency to studies across tissues or conditions. Our method provides a simpler, more efficient approach to multiple-testing correction than existing methods and fits within existing pipelines for eQTL discovery.
Asunto(s)
Desequilibrio de Ligamiento , Sitios de Carácter Cuantitativo , HumanosRESUMEN
The X Chromosome, with its unique mode of inheritance, contributes to differences between the sexes at a molecular level, including sex-specific gene expression and sex-specific impact of genetic variation. Improving our understanding of these differences offers to elucidate the molecular mechanisms underlying sex-specific traits and diseases. However, to date, most studies have either ignored the X Chromosome or had insufficient power to test for the sex-specific impact of genetic variation. By analyzing whole blood transcriptomes of 922 individuals, we have conducted the first large-scale, genome-wide analysis of the impact of both sex and genetic variation on patterns of gene expression, including comparison between the X Chromosome and autosomes. We identified a depletion of expression quantitative trait loci (eQTL) on the X Chromosome, especially among genes under high selective constraint. In contrast, we discovered an enrichment of sex-specific regulatory variants on the X Chromosome. To resolve the molecular mechanisms underlying such effects, we generated chromatin accessibility data through ATAC-sequencing to connect sex-specific chromatin accessibility to sex-specific patterns of expression and regulatory variation. As sex-specific regulatory variants discovered in our study can inform sex differences in heritable disease prevalence, we integrated our data with genome-wide association study data for multiple immune traits identifying several traits with significant sex biases in genetic susceptibilities. Together, our study provides genome-wide insight into how genetic variation, the X Chromosome, and sex shape human gene regulation and disease.
Asunto(s)
Cromosomas Humanos X/genética , Transcriptoma , Femenino , Perfilación de la Expresión Génica , Regulación de la Expresión Génica , Predisposición Genética a la Enfermedad , Genoma Humano , Humanos , Masculino , Polimorfismo de Nucleótido Simple , Sitios de Carácter Cuantitativo , Caracteres SexualesRESUMEN
Recent and rapid human population growth has led to an excess of rare genetic variants that are expected to contribute to an individual's genetic burden of disease risk. To date, much of the focus has been on rare protein-coding variants, for which potential impact can be estimated from the genetic code, but determining the impact of rare noncoding variants has been more challenging. To improve our understanding of such variants, we combined high-quality genome sequencing and RNA sequencing data from a 17-individual, three-generation family to contrast expression quantitative trait loci (eQTLs) and splicing quantitative trait loci (sQTLs) within this family to eQTLs and sQTLs within a population sample. Using this design, we found that eQTLs and sQTLs with large effects in the family were enriched with rare regulatory and splicing variants (minor allele frequency < 0.01). They were also more likely to influence essential genes and genes involved in complex disease. In addition, we tested the capacity of diverse noncoding annotation to predict the impact of rare noncoding variants. We found that distance to the transcription start site, evolutionary constraint, and epigenetic annotation were considerably more informative for predicting the impact of rare variants than for predicting the impact of common variants. These results highlight that rare noncoding variants are important contributors to individual gene-expression profiles and further demonstrate a significant capability for genomic annotation to predict the impact of rare noncoding variants.
Asunto(s)
Genoma Humano , Polimorfismo de Nucleótido Simple/genética , Sitios de Carácter Cuantitativo , ARN no Traducido/genética , Análisis de Secuencia de ARN , Transcriptoma , Familia , Haplotipos/genética , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Linfocitos/metabolismo , Población Blanca/genéticaRESUMEN
Chronic lymphocytic leukemia (CLL) has heterogeneous clinical and biological behavior. Whole-genome and -exome sequencing has contributed to the characterization of the mutational spectrum of the disease, but the underlying transcriptional profile is still poorly understood. We have performed deep RNA sequencing in different subpopulations of normal B-lymphocytes and CLL cells from a cohort of 98 patients, and characterized the CLL transcriptional landscape with unprecedented resolution. We detected thousands of transcriptional elements differentially expressed between the CLL and normal B cells, including protein-coding genes, noncoding RNAs, and pseudogenes. Transposable elements are globally derepressed in CLL cells. In addition, two thousand genes-most of which are not differentially expressed-exhibit CLL-specific splicing patterns. Genes involved in metabolic pathways showed higher expression in CLL, while genes related to spliceosome, proteasome, and ribosome were among the most down-regulated in CLL. Clustering of the CLL samples according to RNA-seq derived gene expression levels unveiled two robust molecular subgroups, C1 and C2. C1/C2 subgroups and the mutational status of the immunoglobulin heavy variable (IGHV) region were the only independent variables in predicting time to treatment in a multivariate analysis with main clinico-biological features. This subdivision was validated in an independent cohort of patients monitored through DNA microarrays. Further analysis shows that B-cell receptor (BCR) activation in the microenvironment of the lymph node may be at the origin of the C1/C2 differences.
Asunto(s)
Linfocitos B , Regulación Neoplásica de la Expresión Génica , Secuenciación de Nucleótidos de Alto Rendimiento , Leucemia Linfocítica Crónica de Células B/genética , Anciano , Secuencia de Bases , Femenino , Perfilación de la Expresión Génica , Humanos , Región Variable de Inmunoglobulina , Leucemia Linfocítica Crónica de Células B/patología , Masculino , Persona de Mediana Edad , Mutación , Ribosomas/genética , Empalmosomas/genéticaRESUMEN
Retinitis pigmentosa (RP) is a rare genetic disease that causes gradual blindness through retinal degeneration. Intriguingly, seven of the 24 genes identified as responsible for the autosomal-dominant form (adRP) are ubiquitous spliceosome components whose impairment causes disease only in the retina. The fact that these proteins are essential in all organisms hampers genetic, genomic, and physiological studies, but we addressed these difficulties by using RNAi in Caenorhabditis elegans. Our study of worm phenotypes produced by RNAi of splicing-related adRP (s-adRP) genes functionally distinguishes between components of U4 and U5 snRNP complexes, because knockdown of U5 proteins produces a stronger phenotype. RNA-seq analyses of worms where s-adRP genes were partially inactivated by RNAi, revealed mild intron retention in developing animals but not in adults, suggesting a positive correlation between intron retention and transcriptional activity. Interestingly, RNAi of s-adRP genes produces an increase in the expression of atl-1 (homolog of human ATR), which is normally activated in response to replicative stress and certain DNA-damaging agents. The up-regulation of atl-1 correlates with the ectopic expression of the pro-apoptotic gene egl-1 and apoptosis in hypodermal cells, which produce the cuticle, but not in other cell types. Our model in C. elegans resembles s-adRP in two aspects: The phenotype caused by global knockdown of s-adRP genes is cell type-specific and associated with high transcriptional activity. Finally, along with a reduced production of mature transcripts, we propose a model in which the retina-specific cell death in s-adRP patients can be induced through genomic instability.