Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 15 de 15
Filter
1.
Mol Cell ; 81(10): 2246-2260.e12, 2021 05 20.
Article in English | MEDLINE | ID: mdl-33861991

ABSTRACT

Exitron splicing (EIS) creates a cryptic intron (called an exitron) within a protein-coding exon to increase proteome diversity. EIS is poorly characterized, but emerging evidence suggests a role for EIS in cancer. Through a systematic investigation of EIS across 33 cancers from 9,599 tumor transcriptomes, we discovered that EIS affected 63% of human coding genes and that 95% of those events were tumor specific. Notably, we observed a mutually exclusive pattern between EIS and somatic mutations in their affected genes. Functionally, we discovered that EIS altered known and novel cancer driver genes for causing gain- or loss-of-function, which promotes tumor progression. Importantly, we identified EIS-derived neoepitopes that bind to major histocompatibility complex (MHC) class I or II. Analysis of clinical data from a clear cell renal cell carcinoma cohort revealed an association between EIS-derived neoantigen load and checkpoint inhibitor response. Our findings establish the importance of considering EIS alterations when nominating cancer driver events and neoantigens.


Subject(s)
Epitopes/genetics , Exons/genetics , Gene Expression Profiling , Introns/genetics , Neoplasms/genetics , Oncogenes , RNA Splicing/genetics , Amino Acid Sequence , Cell Line , Cohort Studies , Humans , Mutation/genetics
2.
Brief Bioinform ; 25(4)2024 May 23.
Article in English | MEDLINE | ID: mdl-39041189

ABSTRACT

Studies have identified genes and molecular pathways regulating cancer metastasis. However, it remains largely unknown whether metastatic potentials of cancer cells from different lineage types are driven by the same or different gene networks. Here, we aim to address this question through integrative analyses of 493 human cancer cells' transcriptomic profiles and their metastatic potentials in vivo. Using an unsupervised approach and considering both gene coexpression and protein-protein interaction networks, we identify different gene networks associated with various biological pathways (i.e. inflammation, cell cycle, and RNA translation), the expression of which are correlated with metastatic potentials across subsets of lineage types. By developing a regularized random forest regression model, we show that the combination of the gene module features expressed in the native cancer cells can predict their metastatic potentials with an overall Pearson correlation coefficient of 0.90. By analyzing transcriptomic profile data from cancer patients, we show that these networks are conserved in vivo and contribute to cancer aggressiveness. The intrinsic expression levels of these networks are correlated with drug sensitivity. Altogether, our study provides novel comparative insights into cancer cells' intrinsic gene networks mediating metastatic potentials across different lineage types, and our results can potentially be useful for designing personalized treatments for metastatic cancers.


Subject(s)
Gene Expression Regulation, Neoplastic , Gene Regulatory Networks , Neoplasm Metastasis , Neoplasms , Humans , Neoplasms/genetics , Neoplasms/pathology , Neoplasms/metabolism , Protein Interaction Maps/genetics , Transcriptome , Gene Expression Profiling , Cell Lineage/genetics
3.
J Biol Chem ; 295(28): 9335-9348, 2020 07 10.
Article in English | MEDLINE | ID: mdl-32393580

ABSTRACT

The oncogene RAS is one of the most widely studied proteins in cancer biology, and mutant active RAS is a driver in many types of solid tumors and hematological malignancies. Yet the biological effects of different RAS mutations and the tissue-specific clinical implications are complex and nuanced. Here, we identified an internal tandem duplication (ITD) in the switch II domain of NRAS from a patient with extremely aggressive colorectal carcinoma. Results of whole-exome DNA sequencing of primary and metastatic tumors indicated that this mutation was present in all analyzed metastases and excluded the presence of any other clear oncogenic driver mutations. Biochemical analysis revealed increased interaction of the RAS ITD with Raf proto-oncogene Ser/Thr kinase (RAF), leading to increased phosphorylation of downstream MAPK/ERK kinase (MEK)/extracellular signal-regulated kinase (ERK). The ITD prevented interaction with neurofibromin 1 (NF1)-GTPase-activating protein (GAP), providing a mechanism for sustained activity of the RAS ITD protein. We present the first crystal structures of NRAS and KRAS ITD at 1.65-1.75 Å resolution, respectively, providing insight into the physical interactions of this class of RAS variants with its regulatory and effector proteins. Our in-depth bedside-to-bench analysis uncovers the molecular mechanism underlying a case of highly aggressive colorectal cancer and illustrates the importance of robust biochemical and biophysical approaches in the implementation of individualized medicine.


Subject(s)
Colorectal Neoplasms , GTP Phosphohydrolases , MAP Kinase Signaling System , Membrane Proteins , Mutation , Proto-Oncogene Proteins p21(ras) , Colorectal Neoplasms/enzymology , Colorectal Neoplasms/genetics , Colorectal Neoplasms/pathology , Crystallography, X-Ray , GTP Phosphohydrolases/chemistry , GTP Phosphohydrolases/genetics , GTP Phosphohydrolases/metabolism , HEK293 Cells , Humans , Membrane Proteins/chemistry , Membrane Proteins/genetics , Membrane Proteins/metabolism , Protein Domains , Proto-Oncogene Mas , Proto-Oncogene Proteins p21(ras)/chemistry , Proto-Oncogene Proteins p21(ras)/genetics , Proto-Oncogene Proteins p21(ras)/metabolism , Exome Sequencing , raf Kinases/genetics , raf Kinases/metabolism
4.
BMC Genomics ; 22(1): 574, 2021 Jul 27.
Article in English | MEDLINE | ID: mdl-34315441

ABSTRACT

BACKGROUND: Long non-coding RNAs (lncRNAs) are a growing focus in cancer research. Deciphering pathways influenced by lncRNAs is important to understand their role in cancer. Although knock-down or overexpression of lncRNAs followed by gene expression profiling in cancer cell lines are established approaches to address this problem, these experimental data are not available for a majority of the annotated lncRNAs. RESULTS: As a surrogate, we present lncGSEA, a convenient tool to predict the lncRNA associated pathways through Gene Set Enrichment Analysis of gene expression profiles from large-scale cancer patient samples. We demonstrate that lncGSEA is able to recapitulate lncRNA associated pathways supported by literature and experimental validations in multiple cancer types. CONCLUSIONS: LncGSEA allows researchers to infer lncRNA regulatory pathways directly from clinical samples in oncology. LncGSEA is written in R, and is freely accessible at https://github.com/ylab-hi/lncGSEA .


Subject(s)
Neoplasms , RNA, Long Noncoding , Gene Expression Profiling , Humans , Microarray Analysis , Neoplasms/genetics , RNA, Long Noncoding/genetics , Transcriptome
5.
Bioinformatics ; 35(20): 4159-4161, 2019 10 15.
Article in English | MEDLINE | ID: mdl-30887025

ABSTRACT

SUMMARY: Insertion and deletion (indels) have been recognized as an important source generating tumor-specific mutant peptides (neoantigens). The focus of indel-derived neoantigen identification has been on leveraging DNA sequencing such as whole exome sequencing, with the effort of using RNA-seq less well explored. Here we present ScanNeo, a fast-streamlined computational pipeline for analyzing RNA-seq to predict neoepitopes derived from small to large-sized indels. We applied ScanNeo in a prostate cancer cell line and validated our predictions with matched mass spectrometry data. Finally, we demonstrated that indel neoantigens predicted from RNA-seq were associated with checkpoint inhibitor response in a cohort of melanoma patients. AVAILABILITY AND IMPLEMENTATION: ScanNeo is implemented in Python. It is freely accessible at the GitHub repository (https://github.com/ylab-hi/ScanNeo). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
RNA-Seq , Software , Humans , INDEL Mutation , Male , Sequence Analysis, RNA , Exome Sequencing
6.
Bioinformatics ; 35(7): 1244-1246, 2019 04 01.
Article in English | MEDLINE | ID: mdl-30169743

ABSTRACT

SUMMARY: HLA allele imputation from SNP genotypes has become increasingly useful, but its accuracy is heavily dependent on the reference panels used. HLA-IMPUTER implements HIBAG algorithm for HLA imputation with different population specific reference panels, including a new Han Chinese reference panel derived from 10 689 samples. We provide a convenient platform for researchers to impute HLA alleles and perform association analysis. AVAILABILITY AND IMPLEMENTATION: http://wyanglab.org: 3838/RefPanelWebsite/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
Polymorphism, Single Nucleotide , Software , Alleles , Asian People , Genome-Wide Association Study , Genotype , HLA Antigens , Humans
7.
Ann Rheum Dis ; 77(7): 1078-1084, 2018 07.
Article in English | MEDLINE | ID: mdl-29625966

ABSTRACT

OBJECTIVES: Systemic lupus erythematosus (SLE) is a prototype autoimmune disease with a strong genetic component in its pathogenesis. Through genome-wide association studies (GWAS), we recently identified 10 novel loci associated with SLE and uncovered a number of suggestive loci requiring further validation. This study aimed to validate those loci in independent cohorts and evaluate the role of SLE genetics in drug repositioning. METHODS: We conducted GWAS and replication studies involving 12 280 SLE cases and 18 828 controls, and performed fine-mapping analyses to identify likely causal variants within the newly identified loci. We further scanned drug target databases to evaluate the role of SLE genetics in drug repositioning. RESULTS: We identified three novel loci that surpassed genome-wide significance, including ST3AGL4 (rs13238909, pmeta=4.40E-08), MFHAS1 (rs2428, pmeta=1.17E-08) and CSNK2A2 (rs2731783, pmeta=1.08E-09). We also confirmed the association of CD226 locus with SLE (rs763361, pmeta=2.45E-08). Fine-mapping and functional analyses indicated that the putative causal variants in CSNK2A2 locus reside in an enhancer and are associated with expression of CSNK2A2 in B-lymphocytes, suggesting a potential mechanism of association. In addition, we demonstrated that SLE risk genes were more likely to be interacting proteins with targets of approved SLE drugs (OR=2.41, p=1.50E-03) which supports the role of genetic studies to repurpose drugs approved for other diseases for the treatment of SLE. CONCLUSION: This study identified three novel loci associated with SLE and demonstrated the role of SLE GWAS findings in drug repositioning.


Subject(s)
Antigens, Differentiation, T-Lymphocyte/genetics , Cell Cycle Proteins/genetics , DNA-Binding Proteins/genetics , Genetic Predisposition to Disease/epidemiology , Lupus Erythematosus, Systemic/drug therapy , Lupus Erythematosus, Systemic/genetics , Oncogene Proteins/genetics , Case-Control Studies , Casein Kinase II/genetics , Databases, Factual , Drug Repositioning , Female , Genome-Wide Association Study , Humans , Immunosuppressive Agents/therapeutic use , Lupus Erythematosus, Systemic/diagnosis , Male , Molecular Targeted Therapy/methods , Reproducibility of Results , Treatment Outcome
8.
Methods Mol Biol ; 2493: 67-75, 2022.
Article in English | MEDLINE | ID: mdl-35751809

ABSTRACT

Insertions and deletions (indels) are primarily detected from DNA sequencing (DNA-seq) data, but their transcriptional consequences remain unexplored due to challenges in distinguishing medium- and large-sized indels from RNA splicing events in RNA-seq data. We introduce transIndel, a splice-aware algorithm that parses the chimeric alignments predicted by a short read aligner and reconstructs the mid-sized insertions and large deletions based on the linear alignments of split reads from DNA-seq or RNA-seq data. Here, we describe the method and provide a tutorial on the installation and application of transIndel.


Subject(s)
INDEL Mutation , Software , Algorithms , DNA , RNA Splicing
9.
STAR Protoc ; 2(3): 100788, 2021 09 17.
Article in English | MEDLINE | ID: mdl-34522901

ABSTRACT

Exitron splicing (EIS) events in cancers can disrupt functional protein domains to cause cancer driver effects. EIS has been recognized as a new source of tumor neoantigens. Here, we describe an integrated protocol for EIS and EIS-derived neoantigen identification using RNA-seq data. The protocol constitutes a step-by-step guide from data collection to neoantigen prediction. For complete details on the use and execution of this protocol, please refer to Wang et al. (2021).


Subject(s)
Antigens, Neoplasm/genetics , Genomics/methods , RNA Splicing/genetics , RNA-Seq/methods , Databases, Genetic , Humans , Neoplasms/genetics , Software
10.
Nat Commun ; 12(1): 6377, 2021 11 04.
Article in English | MEDLINE | ID: mdl-34737261

ABSTRACT

Endocrine therapies for prostate cancer inhibit the androgen receptor (AR) transcription factor. In most cases, AR activity resumes during therapy and drives progression to castration-resistant prostate cancer (CRPC). However, therapy can also promote lineage plasticity and select for AR-independent phenotypes that are uniformly lethal. Here, we demonstrate the stem cell transcription factor Krüppel-like factor 5 (KLF5) is low or absent in prostate cancers prior to endocrine therapy, but induced in a subset of CRPC, including CRPC displaying lineage plasticity. KLF5 and AR physically interact on chromatin and drive opposing transcriptional programs, with KLF5 promoting cellular migration, anchorage-independent growth, and basal epithelial cell phenotypes. We identify ERBB2 as a point of transcriptional convergence displaying activation by KLF5 and repression by AR. ERBB2 inhibitors preferentially block KLF5-driven oncogenic phenotypes. These findings implicate KLF5 as an oncogene that can be upregulated in CRPC to oppose AR activities and promote lineage plasticity.


Subject(s)
Kruppel-Like Transcription Factors/metabolism , Neuroendocrine Cells/metabolism , Prostatic Neoplasms, Castration-Resistant/metabolism , Receptor, ErbB-2/metabolism , Receptors, Androgen/metabolism , Cell Line, Tumor , Humans , Male , Neoplasm Staging , Neuroendocrine Cells/pathology , Prostatic Neoplasms, Castration-Resistant/genetics , Prostatic Neoplasms, Castration-Resistant/pathology , Signal Transduction , Transcriptional Activation
11.
Nat Commun ; 12(1): 772, 2021 02 03.
Article in English | MEDLINE | ID: mdl-33536424

ABSTRACT

Systemic lupus erythematosus (SLE), a worldwide autoimmune disease with high heritability, shows differences in prevalence, severity and age of onset among different ancestral groups. Previous genetic studies have focused more on European populations, which appear to be the least affected. Consequently, the genetic variations that underlie the commonalities, differences and treatment options in SLE among ancestral groups have not been well elucidated. To address this, we undertake a genome-wide association study, increasing the sample size of Chinese populations to the level of existing European studies. Thirty-eight novel SLE-associated loci and incomplete sharing of genetic architecture are identified. In addition to the human leukocyte antigen (HLA) region, nine disease loci show clear ancestral differences and implicate antibody production as a potential mechanism for differences in disease manifestation. Polygenic risk scores perform significantly better when trained on ancestry-matched data sets. These analyses help to reveal the genetic basis for disparities in SLE among ancestral groups.


Subject(s)
Genetic Heterogeneity , Genetic Predisposition to Disease/genetics , Genome-Wide Association Study/methods , Lupus Erythematosus, Systemic/genetics , Polymorphism, Single Nucleotide , Asian People/genetics , Case-Control Studies , Genetic Predisposition to Disease/ethnology , Genotype , Humans , Linkage Disequilibrium , Lupus Erythematosus, Systemic/ethnology , White People/genetics
12.
Gigascience ; 9(8)2020 08 01.
Article in English | MEDLINE | ID: mdl-32852038

ABSTRACT

BACKGROUND: Internal tandem duplications (ITDs) are tandem duplications within coding exons and are important prognostic markers and drug targets for acute myeloid leukemia (AML). Next-generation sequencing has enabled the discovery of ITD at single-nucleotide resolution. ITD allele frequency is used in the risk stratification of patients with AML; higher ITD allele frequency is associated with poorer clinical outcomes. However, the ITD allele frequency data are often unavailable to treating physicians and the detection of ITDs with accurate variant allele frequency (VAF) estimation remains challenging for short-read sequencing. RESULTS: Here we present the ScanITD approach, which performs a stepwise seed-and-realignment procedure for ITD detection with accurate VAF prediction. The evaluations on simulated and real data demonstrate that ScanITD outperforms 3 state-of-the-art ITD detectors, especially for VAF estimation. Importantly, ScanITD yields better accuracy than general-purpose structural variation callers for predicting ITD size range duplications. CONCLUSIONS: ScanITD enables the accurate identification of ITDs with robust VAF estimation. ScanITD is written in Python and is open-source software that is freely accessible at https://github.com/ylab-hi/ScanITD.


Subject(s)
Leukemia, Myeloid, Acute , Tandem Repeat Sequences , Gene Frequency , High-Throughput Nucleotide Sequencing , Humans , Leukemia, Myeloid, Acute/genetics , Mutation , fms-Like Tyrosine Kinase 3
13.
Mol Ther Nucleic Acids ; 19: 318-329, 2020 Mar 06.
Article in English | MEDLINE | ID: mdl-31877408

ABSTRACT

Although recent advances in genetic studies have shed light on systemic lupus erythematosus (SLE), its detailed mechanisms remain elusive. In this study, using datasets on SLE transcriptomic profiles, we identified 750 differentially expressed genes (DEGs) in T and B lymphocytes and peripheral blood cells. Using transcription factor (TF) binding data derived from chromatin immunoprecipitation sequencing (ChIP-seq) experiments from the Encyclopedia of DNA Elements (ENCODE) project, we inferred networks of co-regulated genes (NcRGs) based on binding profiles of the upregulated DEGs by significantly enriched TFs. Modularization analysis of NcRGs identified co-regulatory modules among the DEGs and master TFs vital for each module. Remarkably, the co-regulatory modules stratified the common SLE interferon (IFN) signature and revealed SLE pathogenesis pathways, including the complement cascade, cell cycle regulation, NETosis, and epigenetic regulation. By integrative analyses of disease-associated genes (DAGs), DEGs, and enriched TFs, as well as proteins interacting with them, we identified a hierarchical regulatory cascade with TFs regulated by DAGs, which in turn regulates gene expression. Integrative analysis of multi-omics data provided valuable molecular insights into the molecular mechanisms of SLE.

14.
Biol Sex Differ ; 8(1): 35, 2017 11 02.
Article in English | MEDLINE | ID: mdl-29096703

ABSTRACT

BACKGROUND: Sex is an important but understudied factor in the genetics of human diseases. Analyses using a combination of gene expression data, ENCODE data, and evolutionary data of sex-biased gene expression in human tissues can give insight into the regulatory and evolutionary forces acting on sex-biased genes. METHODS: In this study, we analyzed the differentially expressed genes between males and females. On the X chromosome, we used a novel method and investigated the status of genes that escape X-chromosome inactivation (escape genes), taking into account the clonality of lymphoblastoid cell lines (LCLs). To investigate the regulation of sex-biased differentially expressed genes (sDEG), we conducted pathway and transcription factor enrichment analyses on the sDEGs, as well as analyses on the genomic distribution of sDEGs. Evolutionary analyses were also conducted on both sDEGs and escape genes. RESULTS: Genome-wide, we characterized differential gene expression between sexes in 462 RNA-seq samples and identified 587 sex-biased genes, or 3.2% of the genes surveyed. On the X chromosome, sDEGs were distributed in evolutionary strata in a similar pattern as escape genes. We found a trend of negative correlation between the gene expression breadth and nonsynonymous over synonymous mutation (dN/dS) ratios, showing a possible pleiotropic constraint on evolution of genes. Genome-wide, nine transcription factors were found enriched in binding to the regions surrounding the transcription start sites of female-biased genes. Many pathways and protein domains were enriched in sex-biased genes, some of which hint at sex-biased physiological processes. CONCLUSIONS: These findings lend insight into the regulatory and evolutionary forces shaping sex-biased gene expression and their involvement in the physiological and pathological processes in human health and diseases.


Subject(s)
Evolution, Molecular , Gene Expression Regulation , Gene Expression , Genes, X-Linked , Sex Characteristics , Female , Genome, Human , Humans , Male
15.
Mol Biosyst ; 7(7): 2278-85, 2011 Jul.
Article in English | MEDLINE | ID: mdl-21584303

ABSTRACT

The filamentous fungus Neurospora crassa is a leading model organism for circadian clock studies. Computational identification of a protein-protein interaction (PPI) network (also known as an interactome) in N. crassa can provide new insights into the cellular functions of proteins. Using two well-established bioinformatics methods (the interolog method and the domain interaction-based method), we predicted 27,588 PPIs among 3006 N. crassa proteins. To the best of our knowledge, this is the first identified interactome for N. crassa, although it remains problematic because of incomplete interactions and false positives. In particular, the established PPI network has provided clues to further decipher the molecular mechanism of circadian rhythmicity. For instance, we found that clock-controlled genes (ccgs) are more likely to act as bottlenecks in the established PPI network. We also identified an important module related to circadian oscillators, and some functional unknown proteins in this module may serve as potential candidates for new oscillators. Finally, all predicted PPIs were compiled into a user-friendly database server (NCPI), which is freely available at .


Subject(s)
Fungal Proteins/metabolism , Neurospora crassa/metabolism , Protein Interaction Mapping/methods , Circadian Clocks/physiology , Circadian Rhythm/physiology , Databases, Protein , Internet , Neurospora crassa/physiology , Protein Binding , Reproducibility of Results , Signal Transduction
SELECTION OF CITATIONS
SEARCH DETAIL