RESUMO
Autoimmune diseases disproportionately affect females more than males. The XX sex chromosome complement is strongly associated with susceptibility to autoimmunity. Xist long non-coding RNA (lncRNA) is expressed only in females to randomly inactivate one of the two X chromosomes to achieve gene dosage compensation. Here, we show that the Xist ribonucleoprotein (RNP) complex comprising numerous autoantigenic components is an important driver of sex-biased autoimmunity. Inducible transgenic expression of a non-silencing form of Xist in male mice introduced Xist RNP complexes and sufficed to produce autoantibodies. Male SJL/J mice expressing transgenic Xist developed more severe multi-organ pathology in a pristane-induced lupus model than wild-type males. Xist expression in males reprogrammed T and B cell populations and chromatin states to more resemble wild-type females. Human patients with autoimmune diseases displayed significant autoantibodies to multiple components of XIST RNP. Thus, a sex-specific lncRNA scaffolds ubiquitous RNP components to drive sex-biased immunity.
Assuntos
Autoanticorpos , Doenças Autoimunes , RNA Longo não Codificante , Animais , Feminino , Humanos , Masculino , Camundongos , Autoanticorpos/genética , Doenças Autoimunes/genética , Autoimunidade/genética , Ribonucleoproteínas/genética , Ribonucleoproteínas/metabolismo , RNA Longo não Codificante/genética , RNA Longo não Codificante/metabolismo , Cromossomo X/genética , Cromossomo X/metabolismo , Inativação do Cromossomo X , Caracteres SexuaisRESUMO
Single-cell RNA sequencing (scRNA-seq) is a recent technology that enables fine-grained discovery of cellular subtypes and specific cell states. Analysis of scRNA-seq data routinely involves machine learning methods, such as feature learning, clustering, and classification, to assist in uncovering novel information from scRNA-seq data. However, current methods are not well suited to deal with the substantial amount of noise that is created by the experiments or the variation that occurs due to differences in the cells of the same type. To address this, we developed a new hybrid approach, deep unsupervised single-cell clustering (DUSC), which integrates feature generation based on a deep learning architecture by using a new technique to estimate the number of latent features, with a model-based clustering algorithm, to find a compact and informative representation of the single-cell transcriptomic data generating robust clusters. We also include a technique to estimate an efficient number of latent features in the deep learning model. Our method outperforms both classical and state-of-the-art feature learning and clustering methods, approaching the accuracy of supervised learning. We applied DUSC to a single-cell transcriptomics data set obtained from a triple-negative breast cancer tumor to identify potential cancer subclones accentuated by copy-number variation and investigate the role of clonal heterogeneity. Our method is freely available to the community and will hopefully facilitate our understanding of the cellular atlas of living organisms as well as provide the means to improve patient diagnostics and treatment.
Assuntos
Perfilação da Expressão Gênica/métodos , RNA-Seq/métodos , Análise de Célula Única/métodos , Algoritmos , Animais , Análise por Conglomerados , Biologia Computacional , Humanos , Aprendizado de Máquina , Análise de Sequência de RNA/métodos , Transcriptoma/genéticaRESUMO
Raf kinases play vital roles in normal mitogenic signaling and cancer, however, the identities of functionally important Raf-proximal proteins throughout the cell are not fully known. Raf1 proximity proteomics/BioID in Raf1-dependent cancer cells unexpectedly identified Raf1-adjacent proteins known to reside in the mitochondrial matrix. Inner-mitochondrial localization of Raf1 was confirmed by mitochondrial purification and super-resolution microscopy. Inside mitochondria, Raf1 associated with glutaminase (GLS) in diverse human cancers and enabled glutaminolysis, an important source of biosynthetic precursors in cancer. These impacts required Raf1 kinase activity and were independent of canonical MAP kinase pathway signaling. Kinase-dead mitochondrial matrix-localized Raf1 impaired glutaminolysis and tumorigenesis in vivo. These data indicate that Raf1 localizes inside mitochondria where it interacts with GLS to engage glutamine catabolism and support tumorigenesis.
RESUMO
RNA binding proteins ( RBPs ) control varied processes, including RNA splicing, stability, transport, and translation 1-3 . Dysfunctional RNA-RBP interactions contribute to the pathogenesis of human disease 1,4,5 , however, characterizing the nature and dynamics of multiprotein assemblies on RNA has been challenging. To address this, non-isotopic ligation-based ultraviolet crosslinking immunoprecipitation 6 was combined with mass spectrometry ( irCLIP-RNP ) to identify RNA-dependent associated proteins ( RDAPs ) co-bound to RNA with any RBP of interest. irCLIP-RNP defined landscapes of multimeric protein assemblies on RNA, uncovering previously unknown patterns of RBP-RNA associations, including cell-type-selective combinatorial relationships between RDAPs and primary RBPs. irCLIP-RNP also defined dynamic RDAP remodeling in response to epidermal growth factor ( EGF ), uncovering EGF-induced recruitment of UPF1 adjacent to HNRNPC to effect splicing surveillance of cell proliferation mRNAs. To identify the RNAs simultaneously co-bound by multiple studied RBPs, a sequential immunoprecipitation irCLIP ( Re-CLIP ) method was also developed. Re-CLIP confirmed binding relationships seen in irCLIP-RNP and detected simultaneous HNRNPC and UPF1 co-binding on RND3 and DDX3X mRNAs. irCLIP-RNP and Re-CLIP provide a framework to identify and characterize dynamic RNA-protein assemblies in living cells.
RESUMO
Until recently, efforts in population genetics have been focused primarily on people of European ancestry. To attenuate this bias, global population studies, such as the 1000 Genomes Project, have revealed differences in genetic variation across ethnic groups. How many of these differences can be attributed to population-specific traits? To answer this question, the mutation data must be linked with functional outcomes. A new "edgotype" concept has been proposed, which emphasizes the interaction-specific, "edgetic", perturbations caused by mutations in the interacting proteins. In this work, we performed systematic in silico edgetic profiling of ~50,000 non-synonymous SNVs (nsSNVs) from the 1000 Genomes Project by leveraging our semi-supervised learning approach SNP-IN tool on a comprehensive set of over 10,000 protein interaction complexes. We interrogated the functional roles of the variants and their impact on the human interactome and compared the results with the pathogenic variants disrupting PPIs in the same interactome. Our results demonstrated that a considerable number of nsSNVs from healthy populations could rewire the interactome. We also showed that the proteins enriched with interaction-disrupting mutations were associated with diverse functions and had implications in a broad spectrum of diseases. Further analysis indicated that distinct gene edgetic profiles among major populations could shed light on the molecular mechanisms behind the population phenotypic variances. Finally, the network analysis revealed that the disease-associated modules surprisingly harbored a higher density of interaction-disrupting mutations from healthy populations. The variation in the cumulative network damage within these modules could potentially account for the observed disparities in disease susceptibility, which are distinctly specific to certain populations. Our work demonstrates the feasibility of a large-scale in silico edgetic study, and reveals insights into the orchestrated play of population-specific mutations in the human interactome.
Assuntos
Perfil Genético , Projetos de Pesquisa , Humanos , Mutação , Fenótipo , Aprendizado de Máquina SupervisionadoRESUMO
Background: Suicide is a leading cause of death, and rates of attempted suicide have increased during the COVID-19 pandemic. The under-diagnosed psychiatric phenotype of dissociation is associated with elevated suicidal self-injury; however, it has largely been left out of attempts to predict and prevent suicide.Objective: We designed an artificial intelligence approach to identify dissociative patients and predict prior suicide attempts in an unbiased, data-driven manner.Method: Participants were 30 controls and 93 treatment-seeking female patients with posttraumatic stress disorder (PTSD) and various levels of dissociation, including some with the PTSD dissociative subtype and some with dissociative identity disorder (DID).Results: Unsupervised learning models identified patients along a spectrum of dissociation. Moreover, supervised learning models accurately predicted prior suicide attempts with an F1 score up to 0.83. DID had the highest risk of prior suicide attempts, and distinct subtypes of dissociation predicted suicide attempts in PTSD and DID.Conclusions: These findings expand our understanding of the dissociative phenotype and underscore the urgent need to assess for dissociation to identify individuals at high-risk of suicidal self-injury.
Dissociation, feelings of detachment and disruption in one's sense of self and surroundings, is associated with an elevated risk of suicidal self-injury; however, it has largely been left out of attempts to predict and prevent suicide.Using machine learning techniques, we found dissociative identity disorder had the highest risk of prior suicide attempts, and distinct subtypes of dissociation predicted suicide attempts in posttraumatic stress disorder and dissociative identity disorder.These findings underscore the urgent need to assess for dissociation to identify individuals at high-risk of suicidal self-injury.
RESUMO
During its first two and a half months, the recently emerged 2019 novel coronavirus, SARS-CoV-2, has already infected over one-hundred thousand people worldwide and has taken more than four thousand lives. However, the swiftly spreading virus also caused an unprecedentedly rapid response from the research community facing the unknown health challenge of potentially enormous proportions. Unfortunately, the experimental research to understand the molecular mechanisms behind the viral infection and to design a vaccine or antivirals is costly and takes months to develop. To expedite the advancement of our knowledge, we leveraged data about the related coronaviruses that is readily available in public databases and integrated these data into a single computational pipeline. As a result, we provide comprehensive structural genomics and interactomics roadmaps of SARS-CoV-2 and use this information to infer the possible functional differences and similarities with the related SARS coronavirus. All data are made publicly available to the research community.
Assuntos
Betacoronavirus/genética , Proteínas Virais/genética , Animais , Betacoronavirus/química , Sítios de Ligação , Evolução Biológica , COVID-19 , Quirópteros/virologia , Biologia Computacional , Sequência Conservada , Infecções por Coronavirus , Proteínas do Nucleocapsídeo de Coronavírus , Genoma Viral , Genômica , Humanos , Ligantes , Modelos Moleculares , Proteínas do Nucleocapsídeo/química , Pandemias , Fosfoproteínas , Filogenia , Pneumonia Viral , Mapeamento de Interação de Proteínas , Estrutura Terciária de Proteína , Coronavírus Relacionado à Síndrome Respiratória Aguda Grave , SARS-CoV-2 , Alinhamento de Sequência , Glicoproteína da Espícula de Coronavírus/química , Proteínas do Envelope Viral/química , Proteínas da Matriz Viral/químicaRESUMO
Rapid progress in high-throughput -omics technologies moves us one step closer to the datacalypse in life sciences. In spite of the already generated volumes of data, our knowledge of the molecular mechanisms underlying complex genetic diseases remains limited. Increasing evidence shows that biological networks are essential, albeit not sufficient, for the better understanding of these mechanisms. The identification of disease-specific functional modules in the human interactome can provide a more focused insight into the mechanistic nature of the disease. However, carving a disease network module from the whole interactome is a difficult task. In this paper, we propose a computational framework, Discovering most IMpacted SUbnetworks in interactoMe (DIMSUM), which enables the integration of genome-wide association studies (GWAS) and functional effects of mutations into the protein-protein interaction (PPI) network to improve disease module detection. Specifically, our approach incorporates and propagates the functional impact of non-synonymous single nucleotide polymorphisms (nsSNPs) on PPIs to implicate the genes that are most likely influenced by the disruptive mutations, and to identify the module with the greatest functional impact. Comparison against state-of-the-art seed-based module detection methods shows that our approach could yield modules that are biologically more relevant and have stronger association with the studied disease. We expect for our method to become a part of the common toolbox for the disease module analysis, facilitating the discovery of new disease markers.